Website Content Crawler

Pricing

Pay per usage

Developed by

Apify

Maintained by Apify

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.

4.0 (41)

1597

Total users: 62K

Monthly users: 8.1K

Runs succeeded: >99%

Issues response: 8 days

Last modified: a day ago

swimwearrubbish

Opened 10 days ago by widelykorea, last comment 9 days ago by Jindřich Bär (jindrich.bar)

This just kept going for no reason

Opened 12 days ago by richardcv, last comment 9 days ago by Jindřich Bär (jindrich.bar)

The Actor hit an OOM (out of memory) condition. You can resurrect it with more memory to continue where you left off.

Opened 14 days ago by evgeny.kotelevskiy, last comment 9 days ago by Jindřich Bär (jindrich.bar)

Simple crawl took too long and spent too much credit

Opened 16 days ago by liquid_jolt, last comment 9 days ago by Jindřich Bär (jindrich.bar)

Scraper only returns 6 news items

Opened 18 days ago by kristupas, last comment 17 days ago by Jindřich Bär (jindrich.bar)

crawler got hung up

Opened 25 days ago by Tmoney97, last comment 19 days ago by Jindřich Bär (jindrich.bar)

Lack of notice

Opened a month ago by impeccable_niche, last comment 19 days ago by Jindřich Bär (jindrich.bar)

Add Time Range to Scraped Data

Opened a month ago by kristupas, last comment 22 days ago by Jindřich Bär (jindrich.bar)

Incomplete Web Scraping Results for a Webflow website

Opened a month ago by sllintestacc, last comment a month ago by Jindřich Bär (jindrich.bar)

High costs?

Opened a month ago by nordicloom.marketing, last comment a month ago by Jindřich Bär (jindrich.bar)

It kept working without stopping

Opened a month ago by amitbend, last comment a month ago by Jindřich Bär (jindrich.bar)

HTTP webhook stuck loading forever

Opened a month ago by zacharykoo, last comment a month ago by Jakub Kopecký (jakub.kopecky)

Issue with web crawler

Opened a month ago by AndrewEhab, last comment a month ago by Jindřich Bär (jindrich.bar)

Website Content Crawler stuck - cost keeps increasing

Opened 2 months ago by digtital_moose, last comment a day ago by Jan Buchar (janbuchar)

How can I get all hidden fields in my Actor result

Opened 2 months ago by mohit1.vdoit, last comment 2 months ago by Jindřich Bär (jindrich.bar)

HTTP website inaccessible

Opened 2 months ago by souheil, last comment a month ago by Jindřich Bär (jindrich.bar)

Didn't crawl the entire page and seemed to do it in no particular order

Opened 2 months ago by arsia, last comment 2 months ago by Jindřich Bär (jindrich.bar)

No text parsed from webpage.

Opened 2 months ago by formidable_quagmire, last comment 2 months ago by Jindřich Bär (jindrich.bar)

Too much time

Opened 2 months ago by florian-morina, last comment 2 months ago by Jiří Spilka (jiri.spilka)

No text parsed from webpage.

Opened 2 months ago by formidable_quagmire, last comment 2 months ago by Jindřich Bär (jindrich.bar)