Website Content Crawler
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
The normal job should take around 45s, but I had to abort this one after 4.5m. I'm not really sure what the cause was from looking at the logs.
If I hadn't caught this, it seems like it could have easily used our entire quota. As we're evaluating the framework, this is a bit of a red flag.
Hi,
Thank you for trying the Website Content Crawler.
I attempted to replicate the issue but did not encounter the problem. My run completed successfully in 34 seconds.
This should not happen. We'll investigate further to debug it.
Please let me know if there's anything else I can assist with or if anything is unclear. I'm happy to help.
Best regards,
Jiri
Actor Metrics
4k monthly users
839 stars
>99% runs succeeded
1 day response time
Created in Mar 2023
Modified 17 hours ago