![Website Content Crawler avatar](https://images.apifyusercontent.com/1VrdawICnxIwM4X5JzRJHPBmLx0OpmiNxtHGGLmxdu8/rs:fill:92:92/aHR0cHM6Ly9hcGlmeS1pbWFnZS11cGxvYWRzLXByb2QuczMuYW1hem9uYXdzLmNvbS9hWUcwbDlzN2RiQjdqM2diUy9QZlRvRU5rSlp4YWh6UER1My1DbGVhblNob3RfMjAyMy0wMy0yOF9hdF8xMC40MC4yMF8yeC5wbmc.webp)
No credit card required
![Website Content Crawler](https://images.apifyusercontent.com/1VrdawICnxIwM4X5JzRJHPBmLx0OpmiNxtHGGLmxdu8/rs:fill:92:92/aHR0cHM6Ly9hcGlmeS1pbWFnZS11cGxvYWRzLXByb2QuczMuYW1hem9uYXdzLmNvbS9hWUcwbDlzN2RiQjdqM2diUy9QZlRvRU5rSlp4YWh6UER1My1DbGVhblNob3RfMjAyMy0wMy0yOF9hdF8xMC40MC4yMF8yeC5wbmc.webp)
Website Content Crawler
No credit card required
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗LangChain, LlamaIndex, and the wider LLM ecosystem.
Cannot retrieve info in lazy load part
Closed
I tried to scrape the 48th reel's view count info as highlighted from this page (https://www.facebook.com/people/Pang-Piraya/100011525405767/?sk=reels_tab) but it always crawl only first 10 reels only. What should I config in input? please help.
Hello @chutnarin and thank you for your interest in this Actor!
This Actor (Website Content Crawler) is not primarily designed for scraping social media - platforms like Facebook or X (Twitter) often utilize heavy-weight anti-scraping measurements.
If you want to scrape Facebook, look for Facebook-related Actors in the Store. Alternatively, you can try searching GitHub for some open-source Facebook scrapers and actorize them for use on Apify. You can find guides for that in our Documentation.
Thank you for understanding. Cheers!
- 2.8k monthly users
- 317 stars
- 100.0% runs succeeded
- 4 days response time
- Created in Mar 2023
- Modified 1 day ago