Website Content Crawler avatar

Website Content Crawler

Try for free

No credit card required

Go to Store
Website Content Crawler

Website Content Crawler

apify/website-content-crawler
Try for free

No credit card required

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.

Do you want to learn more about this Actor?

Get a demo
MO

Hung crawl

Open

mcantrell-owner opened this issue
14 hours ago

The normal job should take around 45s but this one had to be aborted after 4.5m. I had to abort this and I'm not really sure what the cause was by looking at the logs.

If I hadn't caught this, it seems like it could have easily used our entire quota. As we're evaluating the framework, this is a bit of a red flag.

jiri.spilka avatar

Hi,
Thank you for trying the Website Content Crawler.

I attempted to replicate the issue but did not encounter the problem. My run completed successfully in 34 seconds.

This should not happen. We'll investigate further to debug it.

Please let me know if there's anything else I can assist with or if anything is unclear. I'm happy to help.

Best regards,
Jiri

Developer
Maintained by Apify

Actor Metrics

  • 4k monthly users

  • 839 stars

  • >99% runs succeeded

  • 1 days response time

  • Created in Mar 2023

  • Modified 17 hours ago