Crawl Help Center Content for AI Search
Created by
nezha
Extract clean Markdown and text from help center pages so the content can be indexed for AI search, support bots, RAG, and knowledge base workflows.
Website Content Extractor for RAG: Markdown, HTML, Textnezha/website-content-crawler
Url
Title
Description
Content format
+6 fieldsTextNumberBooleanListObject
Input
Website, Docs, or Help Center URLs(required)
url:https://help.openai.com/en/
Max Pages to Extract:5
Page Discovery Method:website
Link Depth:1
Target Scope Only:true
Main Content Format:markdown
Output fields
Url
Title
Description
Content format
Word count
Language
Canonical url
Depth
Http status code
Crawled at
Sign up on Apify01
Create your Apify account to access the Website Content Extractor for RAG: Markdown, HTML, Text.
Start the run02
The Actor will start running based on the input automatically.
Receive the output03
Monitor the progress in real-time. You will be notified as soon as your dataset is complete and ready for review.
Integrate into your workflow04
The final output is delivered in JSON, CSV, or Excel format, ready to be plugged into your workflow.
