Puppeteer Scraper

apify/puppeteer-scraper

Crawls websites with headless Chrome and the Puppeteer library using provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and lists of URLs. Supports logging in to websites.

Could not get any links/anchor tags for a few domains

Closed
dlc_test opened this issue
10 months ago

I am trying to get all links from a few websites. I notice that for a few domains no links are returned. Here are a few sample domains and a run log. Please help fix the issue.

Webpages - https://www.pinterest.com/ https://uniqlo.com/

jindrich.bar

Hello and thank you for your interest in this Actor!

Apologies for the delay in responding to your issue.

Larger online platforms like Uniqlo and Pinterest invest heavily in anti-scraping protection, and the techniques available in Web Scraper might not always be enough. You can try workarounds such as switching to residential proxies, enabling image downloads, or switching to Chrome (see the Use Chrome option), but these websites may still recognize Web Scraper’s network fingerprints.
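As a rough illustration of the workarounds mentioned above, the sketch below shows what an Actor input and a link-collecting page function might look like. The field names (`useChrome`, `downloadMedia`, `proxyConfiguration`, `apifyProxyGroups`) are assumptions based on the Actor's input schema and should be checked against the current Input tab. The `linkCount` output field is purely illustrative. None of this is guaranteed to get past the protection on these sites.

```javascript
// Hypothetical Actor input. Field names are assumed, not confirmed by this thread.
const input = {
    startUrls: [
        { url: 'https://www.pinterest.com/' },
        { url: 'https://uniqlo.com/' },
    ],
    useChrome: true,       // assumed name of the "Use Chrome" option
    downloadMedia: true,   // assumed name of the option that enables image downloads
    proxyConfiguration: {
        useApifyProxy: true,
        apifyProxyGroups: ['RESIDENTIAL'], // residential proxies
    },
};

// Page function sketch: collects every anchor href on the loaded page.
// In the Actor, this function would be supplied as the page function input.
async function pageFunction(context) {
    const { page, request, log } = context;

    // Puppeteer's page.$$eval runs the callback in the browser on all matches.
    const links = await page.$$eval('a[href]', (anchors) => anchors.map((a) => a.href));

    if (links.length === 0) {
        // An empty result often means the site served a block page
        // or rendered its links after the navigation finished.
        log.warning(`No anchors found on ${request.url}`);
    }

    return { url: request.url, linkCount: links.length, links };
}
```

If `linkCount` comes back as 0 even with these settings, the run log and a screenshot of the page usually show whether the site returned a challenge or block page instead of the real content.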

On the Apify Store, you can find multiple third-party Actors specifically designed for scraping these services:

These might be a better fit for your use case. Alternatively, you can develop your own custom solution and deploy it on Apify for more flexibility.

I'll go ahead and close this issue, but feel free to open a new one if you need further assistance!

Developer
Maintained by Apify

Actor Metrics

  • 446 monthly users

  • 96 bookmarks

  • >99% runs succeeded

  • Created in Apr 2019

  • Modified 8 months ago