
Website Content Crawler
Pricing
Pay per usage

Website Content Crawler
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
3.7 (41)
Pricing
Pay per usage
1501
Total users
58K
Monthly users
8.1K
Runs succeeded
>99%
Issues response
7.6 days
Last modified
3 hours ago
Is it possible to speed up the processing time?
Closed
My average run time for each crawl is about 1 minute. Sometimes it takes less than 30 seconds, or take more than 3 minutes.
Is is possible to make this faster? Ideally around 40 seconds to 1-minute mark

Hey, thanks for using Website Content Crawler!
To speed up crawling, try tweaking the initialConcurrency
Actor input. This should make crawling faster from the start, but overall runtime depends on multiple external factors.
Jakub

Hi,
Another way to speed up crawling is to use the Cheerio Crawler instead of browser-based crawlers—but only if rendering JavaScript is not required. If the page relies heavily on JavaScript, you'll need to continue using browser-based crawling.
I'll go ahead and close this issue for now, but feel free to reach out with any questions or open a new issue if needed.
Best regards, Jiri