Web Scraper avatar

Web Scraper

Try for free

No credit card required

Go to Store
Web Scraper

Web Scraper

apify/web-scraper
Try for free

No credit card required

Crawls arbitrary websites using the Chrome browser and extracts data from pages using JavaScript code. The Actor supports both recursive crawling and lists of URLs and automatically manages concurrency for maximum performance. This is Apify's basic tool for web crawling and scraping.

Do you want to learn more about this Actor?

Get a demo
TO

Scrape ignores all my startUrls except the first one

Closed

toddles opened this issue
6 months ago

I'm passing a list of urls in "startUrls" but the scraper just stops after hitting the first one. Any idea why?

jindrich.bar avatar

Hello, and thanks for reporting this!

The scraper is likely deduplicating the URLs since they only differ in the #fragment part.

To fix this, enable the "URL #fragments identify unique pages" option (or set keepUrlFragments to true if you run the Actor via the API). This should resolve the issue.

I'll close this issue now, but feel free to reach out if you need further help. Cheers!

Developer
Maintained by Apify

Actor Metrics

  • 2.4k monthly users

  • 278 stars

  • >99% runs succeeded

  • 38 days response time

  • Created in Mar 2019

  • Modified 4 months ago