Docs Markdown Rag Ready Crawler
Pricing
from $5.00 / 1,000 results
Go to Apify Store
Docs Markdown Rag Ready Crawler
Turn any documentation site or website into clean, structured markdown—ready for RAG, embeddings, and AI agents.
Pricing
from $5.00 / 1,000 results
Turn any documentation site or website into clean, structured markdown—ready for RAG, embeddings, and AI agents.
_datasetType
Optional
Dataset type identifier
url
Optional
Page URL
normalizedUrl
Optional
Normalized page URL
canonicalUrl
Optional
Canonical URL if specified
status
Optional
HTTP status code
contentType
Optional
Content type
title
Optional
Page title
h1
Optional
Main H1 heading
language
Optional
Detected language
text
Optional
Cleaned plaintext content
markdown
Optional
Markdown formatted content
excerpt
Optional
Content excerpt (first 300 chars)
depth
Optional
Crawl depth from start URLs
referrers
Optional
Pages that linked to this page
outgoingInternalLinks
Optional
Internal links found on this page
outgoingExternalLinks
Optional
External links found on this page
contentHash
Optional
SHA256 hash of content for change detection
fetchedAt
Optional
When the page was fetched
chunkId
Optional
Unique chunk identifier
chunkIndex
Optional
Chunk index within page
headingPath
Optional
Heading hierarchy for chunk
charStart
Optional
Character start position in full text
charEnd
Optional
Character end position in full text
chunkHash
Optional
SHA256 hash of chunk content
pageContentHash
Optional
SHA256 hash of full page content
tokenEstimate
Optional
Estimated token count for chunk
from
Optional
Source URL for link edge
to
Optional
Target URL for link edge
anchorText
Optional
Link anchor text
type
Optional
Link or issue type
message
Optional
Error or issue message
severity
Optional
Issue severity level