Website Content Crawler
No credit card required
Website Content Crawler
No credit card required
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
Do you want to learn more about this Actor?
Get a demoHung crawl
Opened 8 hours ago by mcantrell-owner, last comment 5 hours ago by Jiří Spilka (jiri.spilka)
Poor results
Opened 3 days ago by Digital_Mole, last comment 4 hours ago by Jiří Spilka (jiri.spilka)
Integration with openai’s assistant through their functions calling tool
Opened 5 days ago by Cirula, last comment 5 days ago by Jiří Spilka (jiri.spilka)
Not getting PDF Files
Opened 15 days ago by laius, last comment 15 days ago by Jiří Spilka (jiri.spilka)
I can't "get output"
Opened 19 days ago by nicco.teodori, last comment 19 days ago by Jiří Spilka (jiri.spilka)
TypeError: Cannot read properties of undefined (reading 'content-type')
Opened 2 months ago by sevcik, last comment a month ago by Jiří Spilka (jiri.spilka)
Decode non-UTF-8 text in crawlerType cheerio
Opened a year ago by consoling_knock, last comment a year ago by Jindřich Bär (jindrich.bar)
Actor Metrics
4k monthly users
-
837 stars
>99% runs succeeded
1 days response time
Created in Mar 2023
Modified 11 hours ago