Website Content Crawler
No credit card required
Website Content Crawler
No credit card required
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
Do you want to learn more about this Actor?
Get a demoWhen I trigger a run from Zapier, it indicates "Result : 0" because of timeout. However the content is received properly, the timeout is set to 60 seconds, but the log says that it exceeded the 30 seconds timeout. When I launch a run from Apify interface, Result is 1, with the same parameters. Is there something I'm doing wrong, or only a zapier integration bug? Thanks
Hello, and thank you for your interest in the Actor! There are two different timeouts involved - one is requestTimeoutSecs
, which, as you observe, is set to 60s, and it is used by the actor internally for each page. Then there is a 30s timeout handled by the Apify platform. My best guess is that the Zapier integration sets that - can you find any way to change this in Zapier?
Ok yeah it was Zapier, I split my Zap into 2 zaps with a webhook after success crawling
It's great that the issue is resolved, so I will close it. Let us know if you encounter any other problems.
Actor Metrics
4k monthly users
-
839 stars
>99% runs succeeded
1 days response time
Created in Mar 2023
Modified 18 hours ago