
Website Content Crawler
Pricing
Pay per usage

Website Content Crawler
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
3.9 (41)
Pricing
Pay per usage
1544
Total users
60K
Monthly users
7.8K
Runs succeeded
>99%
Issues response
7.9 days
Last modified
3 days ago
too long time for simple crawl spent too much credit
Opened 3 days ago by liquid_jolt, last comment 2 days ago by vernal_mantle
New rust http client failing on valid SSL config: SelectedUnusableCipherSuiteForVersion
Opened 4 days ago by uglyrobot, last comment 4 days ago by Jindřich Bär (jindrich.bar)
Scraper only returns 6 news items
Opened 4 days ago by kristupas, last comment 3 days ago by Jindřich Bär (jindrich.bar)
crawler got hung up
Opened 11 days ago by Tmoney97, last comment 5 days ago by Jindřich Bär (jindrich.bar)
Falta de Aviso
Opened 13 days ago by impeccable_niche, last comment 5 days ago by Jindřich Bär (jindrich.bar)
Add Time Range to Scraped Data
Opened 16 days ago by kristupas, last comment 8 days ago by Jindřich Bär (jindrich.bar)
Glob Patterns are ignored when using Sitemap
Opened 17 days ago by cirez_d, last comment 2 days ago by cirez_d
Incomplete Web Scraping Results for a Webflow website
Opened 18 days ago by sllintestacc, last comment 17 days ago by Jindřich Bär (jindrich.bar)
High costs?
Opened 19 days ago by nordicloom.marketing, last comment 19 days ago by Jindřich Bär (jindrich.bar)
Memory issue
Opened 20 days ago by acarter, last comment 19 days ago by Jindřich Bär (jindrich.bar)
it kept working without stoping
Opened 20 days ago by amitbend, last comment 19 days ago by Jindřich Bär (jindrich.bar)
HTTP Webhook stucked in loading forever
Opened 23 days ago by zacharykoo, last comment 22 days ago by Jakub Kopecký (jakub.kopecky)
Issue with web crawler
Opened a month ago by AndrewEhab, last comment a month ago by Jindřich Bär (jindrich.bar)
Website Content Crawler stuck - cost keeps increasing
Opened a month ago by digtital_moose, last comment 22 days ago by jfnrj2ui
How can i get all hidden fields in my actor result
Opened a month ago by mohit1.vdoit, last comment a month ago by Jindřich Bär (jindrich.bar)
Http website inaccessible
Opened a month ago by souheil, last comment a month ago by Jindřich Bär (jindrich.bar)
Didn't crawl the entire page and seemed to do it in no particular orer
Opened a month ago by arsia, last comment a month ago by Jindřich Bär (jindrich.bar)
No text parsed from from webpage.
Opened a month ago by formidable_quagmire, last comment a month ago by Jindřich Bär (jindrich.bar)
Avoid query parameters when crawling websites
Opened a month ago by innovum_admin, last comment a month ago by Jindřich Bär (jindrich.bar)
To much time
Opened a month ago by florian-morina, last comment a month ago by Jiří Spilka (jiri.spilka)