Scrape Documentation into Markdown for RAG
Created by
Daniel Dimitrov
Actor
LLM Markdown Crawler
Turn any documentation site into a clean Markdown corpus for retrieval-augmented generation and knowledge bases.
LLM Markdown Crawlersleek_waveform/llm-markdown-crawler
Source URL
Page Title
Markdown (Snippet)
Author
+1 fieldTextNumberBooleanListObject
Input
Start URLs(required)
url:https://docs.python.org/3/
Max Requests:100
Max Crawl Depth:3
Output fields
Source URL
Page Title
Markdown (Snippet)
Author
Word Count
Sign up on Apify01
Create your Apify account to access the LLM Markdown Crawler.
Start the run02
The Actor will start running based on the input automatically.
Receive the output03
Monitor the progress in real-time. You will be notified as soon as your dataset is complete and ready for review.
Integrate into your workflow04
The final output is delivered in JSON, CSV, or Excel format, ready to be plugged into your workflow.
