Web Scraper

apify/web-scraper

Crawls arbitrary websites using the Chrome browser and extracts structured data from web pages using a provided JavaScript function. The Actor supports both recursive crawling and lists of URLs, and automatically manages concurrency for maximum performance.

received 401 status code

Opened 4 days ago by Competent Path (competent_path), last comment 4 days ago by Competent Path (competent_path)

Scraping data from my chrome extension.

Opened 7 days ago by LeonardChin, last comment 6 days ago by Jindřich Bär (jindrich.bar)

Crawling takes much longer

Opened 8 days ago by SallaR, last comment 6 days ago by Jindřich Bär (jindrich.bar)

If I verify in pageFunction that the target website has been redirected by the anti-crawl mechanism, how do I get the latest page content?

Opened 9 days ago by productvidau, last comment 6 days ago by Jindřich Bär (jindrich.bar)

No longer working

Opened 16 days ago by intense_sunshine, last comment 6 days ago by Jindřich Bär (jindrich.bar)

container error.

Opened 20 days ago by caravel, last comment 19 days ago by Jindřich Bär (jindrich.bar)

Reclaiming failed request back to the list or queue. Navigation timed out after 60 seconds

Opened 20 days ago by bartos.ondrej, last comment 17 days ago by Jindřich Bär (jindrich.bar)

issue unknown

Opened 20 days ago by caravel, last comment 19 days ago by Jindřich Bär (jindrich.bar)

The parameter "Max result records (optional)" is set to N-1

Opened 23 days ago by RedabenhAKO, last comment 18 days ago by RedabenhAKO

Way to bypass cloudflare human verification?

Opened 24 days ago by quarterly_quicklime, last comment 19 days ago by quarterly_quicklime

RayHForUse

Opened 24 days ago by RayH, last comment 20 days ago by Jindřich Bär (jindrich.bar)

Can't get pagination working.

Opened 25 days ago by quarterly_quicklime, last comment 10 days ago by quarterly_quicklime

actor is not starting

Opened a month ago by everlasting_label, last comment a month ago by Jindřich Bär (jindrich.bar)

Is it possible to extract the href attribute inside a link HTML tag <a href="...">?

Opened a month ago by artbaggio, last comment a month ago by Jindřich Bär (jindrich.bar)

PuppeteerCrawler: Reclaiming failed request back to the list or queue. Navigation timed out after 60 seconds.

Opened 2 months ago by shon98, last comment 6 days ago by Jindřich Bär (jindrich.bar)

We passed 40 websites -> it returned 1 description

Opened 3 months ago by Nikita Sviridenko (nikita-sviridenko), last comment 3 months ago by Jindřich Bär (jindrich.bar)

How do I set up pagination with a URL?

Opened 3 months ago by sacdrexelmba, last comment a month ago by Jindřich Bär (jindrich.bar)

Am I doing something wrong?

Opened 4 months ago by BascharSeven, last comment 3 months ago by Jindřich Bär (jindrich.bar)

Crawler goes off domain

Opened 4 months ago by chimaro, last comment 3 months ago by Jindřich Bär (jindrich.bar)

Help me with this actor (web scraper)

Opened 4 months ago by DaisanCeo, last comment 3 months ago by Jindřich Bär (jindrich.bar)

Developer
Maintained by Apify

Actor Metrics

  • 3.3k monthly users

  • 456 bookmarks

  • >99% runs succeeded

  • 4.8 days response time

  • Created in Mar 2019

  • Modified a month ago