Website Content Crawler

Pricing

Pay per usage

Developed by Apify

Maintained by Apify

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.

3.9 (41)

1544

Total users: 60K

Monthly users: 7.8K

Runs succeeded: >99%

Issues response: 7.9 days

Last modified: 3 days ago

too long time for simple crawl spent too much credit

Opened 3 days ago by liquid_jolt, last comment 2 days ago by vernal_mantle

New rust http client failing on valid SSL config: SelectedUnusableCipherSuiteForVersion

Opened 4 days ago by uglyrobot, last comment 4 days ago by Jindřich Bär (jindrich.bar)

Scraper only returns 6 news items

Opened 4 days ago by kristupas, last comment 3 days ago by Jindřich Bär (jindrich.bar)

crawler got hung up

Opened 11 days ago by Tmoney97, last comment 5 days ago by Jindřich Bär (jindrich.bar)

Lack of notice

Opened 13 days ago by impeccable_niche, last comment 5 days ago by Jindřich Bär (jindrich.bar)

Add Time Range to Scraped Data

Opened 16 days ago by kristupas, last comment 8 days ago by Jindřich Bär (jindrich.bar)

Glob Patterns are ignored when using Sitemap

Opened 17 days ago by cirez_d, last comment 2 days ago by cirez_d

Incomplete Web Scraping Results for a Webflow website

Opened 18 days ago by sllintestacc, last comment 17 days ago by Jindřich Bär (jindrich.bar)

High costs?

Opened 19 days ago by nordicloom.marketing, last comment 19 days ago by Jindřich Bär (jindrich.bar)

Memory issue

Opened 20 days ago by acarter, last comment 19 days ago by Jindřich Bär (jindrich.bar)

it kept working without stopping

Opened 20 days ago by amitbend, last comment 19 days ago by Jindřich Bär (jindrich.bar)

HTTP Webhook stuck in loading forever

Opened 23 days ago by zacharykoo, last comment 22 days ago by Jakub Kopecký (jakub.kopecky)

Issue with web crawler

Opened a month ago by AndrewEhab, last comment a month ago by Jindřich Bär (jindrich.bar)

Website Content Crawler stuck - cost keeps increasing

Opened a month ago by digtital_moose, last comment 22 days ago by jfnrj2ui

How can I get all hidden fields in my actor result

Opened a month ago by mohit1.vdoit, last comment a month ago by Jindřich Bär (jindrich.bar)

Http website inaccessible

Opened a month ago by souheil, last comment a month ago by Jindřich Bär (jindrich.bar)

Didn't crawl the entire page and seemed to do it in no particular order

Opened a month ago by arsia, last comment a month ago by Jindřich Bär (jindrich.bar)

No text parsed from webpage.

Opened a month ago by formidable_quagmire, last comment a month ago by Jindřich Bär (jindrich.bar)

Avoid query parameters when crawling websites

Opened a month ago by innovum_admin, last comment a month ago by Jindřich Bär (jindrich.bar)

Too much time

Opened a month ago by florian-morina, last comment a month ago by Jiří Spilka (jiri.spilka)