Playwright Scraper avatar

Playwright Scraper

Try for free

No credit card required

Go to Store
Playwright Scraper

Playwright Scraper

apify/playwright-scraper
Try for free

No credit card required

Crawls websites with the headless Chromium, Chrome, or Firefox browser and Playwright library using a provided server-side Node.js code. Supports both recursive crawling and a list of URLs. Supports login to a website.

Do you want to learn more about this Actor?

Get a demo

Change Log

1.0.11 (2023-08-22)

  • Updated Crawlee version to v3.5.2.
  • Updated Node.js version to v18.
  • Added new options:
    • Dismiss cookie modals (closeCookieModals): Using the I don't care about cookies browser extension. When on, the crawler will automatically try to dismiss cookie consent modals. This can be useful when crawling European websites that show cookie consent modals.
    • Maximum scrolling distance in pixels (maxScrollHeightPixels): The crawler will scroll down the page until all content is loaded or the maximum scrolling distance is reached. Setting this to 0 disables scrolling altogether.
    • Exclude Glob Patterns (excludes): Glob patterns to match links in the page that you want to exclude from being enqueued.

1.0

  • Initial version built on Crawlee.
  • Proxy usage is now required.
Developer
Maintained by Apify

Actor Metrics

  • 68 monthly users

  • 18 stars

  • >99% runs succeeded

  • 54 days response time

  • Created in Aug 2022

  • Modified 6 months ago

Categories