Website Content Crawler

apify/website-content-crawler

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.

Scraper doesn't scrape all the website content, such as product descriptions

Open
maabada.shivok opened this issue
14 days ago

The crawler didn't properly scrape data from this website: https://www.tranquilo.co.il/

It only scraped the headlines. Why is that? Is it because of how the website is built? What is the reason?

This is an example page whose content was not scraped:

https://www.tranquilo.co.il/page/%D7%A6%D7%9C-%D7%91%D7%A8%D7%99%D7%90

I've also attached files

Developer
Maintained by Apify

Actor Metrics

  • 5.4k monthly users

  • 990 bookmarks

  • >99% runs succeeded

  • 1 day response time

  • Created in Mar 2023

  • Modified 13 days ago