Website Content Crawler

apify/website-content-crawler

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗LangChain, LlamaIndex, and the wider LLM ecosystem.

Cloudflare challenge failed

Closed

mihai opened this issue 5 months ago

The crawl failed because of a Cloudflare challenge. What can be done?

janbuchar

Hello, and thank you for your interest in the Actor!

I see that you are already using the stealthy Firefox crawler type and have a residential proxy enabled. The only recommendation I can give is to change the proxy group in the Actor input settings to one located in Europe instead of the US.

Let us know if that helps!
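
For readers who set the Actor input programmatically, the suggestion above might look roughly like the sketch below. It is not taken from the thread: the helper `build_run_input` is hypothetical, and the field names and values (`crawlerType` set to `"playwright:firefox"`, `apifyProxyGroups`, `apifyProxyCountry`) are assumptions based on common Apify input conventions, so check them against the Actor's current input schema before relying on them.

```python
import json


def build_run_input(start_url: str, proxy_country: str = "DE") -> dict:
    """Build an illustrative run input for apify/website-content-crawler.

    Hypothetical helper: the field names below are assumptions and should be
    verified against the Actor's input schema.
    """
    return {
        "startUrls": [{"url": start_url}],
        # Assumed identifier for the Firefox-based browser crawler type.
        "crawlerType": "playwright:firefox",
        "proxyConfiguration": {
            "useApifyProxy": True,
            # Residential proxies, as already enabled by the issue author.
            "apifyProxyGroups": ["RESIDENTIAL"],
            # Assumed field: a two-letter country code that moves the proxy
            # exit location from the US to a European country.
            "apifyProxyCountry": proxy_country,
        },
    }


if __name__ == "__main__":
    # Print the input as JSON so it can be pasted into the Actor's input editor.
    print(json.dumps(build_run_input("https://example.com"), indent=2))
```

The resulting dictionary could be pasted as JSON into the Actor's input editor or passed as the run input through an Apify API client. The country code `"DE"` is only an example of a European location.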

Developer
Maintained by Apify
Actor metrics
  • 2.8k monthly users
  • 317 stars
  • 100.0% runs succeeded
  • 4 days response time
  • Created in Mar 2023
  • Modified 1 day ago