
Website Content Crawler
Pricing
Pay per usage

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
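As a rough illustration of the crawl-and-extract workflow described above, the sketch below builds an input object and starts the Actor through the third-party `apify-client` Python package. Nothing here comes from this page: the Actor ID `apify/website-content-crawler`, the input field names (`startUrls`, `crawlerType`, `maxCrawlPages`, `saveMarkdown`), the `url` and `text` output fields, and the `APIFY_TOKEN` environment variable are all assumptions, so check them against the Actor's input schema before relying on them.

```python
import os
from typing import Any

# Assumed Actor ID on Apify Store; verify on the Actor's page.
ACTOR_ID = "apify/website-content-crawler"


def build_run_input(start_urls: list[str], crawler_type: str = "cheerio",
                    max_pages: int = 10) -> dict[str, Any]:
    """Build an input object for the Actor (field names are assumptions)."""
    if not start_urls:
        raise ValueError("at least one start URL is required")
    return {
        "startUrls": [{"url": url} for url in start_urls],
        "crawlerType": crawler_type,   # assumed field, e.g. "cheerio"
        "maxCrawlPages": max_pages,    # assumed field limiting crawl size
        "saveMarkdown": True,          # assumed field enabling Markdown output
    }


def crawl(token: str, run_input: dict[str, Any]) -> list[dict[str, Any]]:
    """Run the Actor and return the items from its default dataset."""
    # Imported lazily so the input builder works without the package
    # installed (pip install apify-client).
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


if __name__ == "__main__":
    token = os.environ.get("APIFY_TOKEN")  # hypothetical variable name
    if token:
        for item in crawl(token, build_run_input(["https://example.com"])):
            print(item.get("url"), len(item.get("text") or ""))
    else:
        print("Set APIFY_TOKEN to run the crawl.")
```

Keeping the input builder separate from the network call makes it possible to inspect or log the input before spending any platform usage on a run.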
4.6 (38)
1.1k
Monthly users: 5.9k
Runs succeeded: >99%
Response time: 2.3 days
Last modified: 5 days ago
get this error
Opened 9 hours ago by esc4dinh4, last comment 9 hours ago by esc4dinh4
Crawling not extracting all text on page
Opened 5 days ago by agungbmtra, last comment 2 days ago by Jakub Kopecký (jakub.kopecky)
cannot download pdfs
Opened 6 days ago by ftballguy45, last comment 2 days ago by Jakub Kopecký (jakub.kopecky)
Add Full File Name to the Key-Value-Stores
Opened 8 days ago by CtrlAltElite, last comment 8 days ago by Jan Buchar (janbuchar)
Ability to Group Crawled Page with Followed Link and Its Content in a Single Row
Opened 11 days ago by randomname1234, last comment 11 days ago by randomname1234
Page Title
Opened 21 days ago by CtrlAltElite, last comment 16 days ago by Jakub Kopecký (jakub.kopecky)
Navigating frame was detached
Opened a month ago by stephen.kim, last comment a month ago by Jiří Spilka (jiri.spilka)
scraper don't scrape all the website content like product description
Opened 2 months ago by maabada.shivok, last comment 2 months ago by maabada.shivok
Crawl hung at finished
Opened 2 months ago by mcantrell, last comment 2 months ago by mykola_scrapes
Decode non-UTF-8 text in crawlerType cheerio
Opened a year ago by consoling_knock, last comment a year ago by Jindřich Bär (jindrich.bar)
Pricing
Pricing model
Pay per usage
The Actor itself is free to use; you pay only for the Apify platform usage it consumes.