Crawls websites with headless Chrome and the Puppeteer library using provided server-side Node.js code. This crawler i...
A small, efficient actor that loads a web page, parses its HTML using the Cheerio library and extracts the following meta-dat...
Website Screenshot Generator
Create a screenshot of a website based on a specified URL. The screenshot is stored as the output in a key-value store. ...
Legacy PhantomJS Crawler
Replacement for the legacy Apify Crawler product with a backward-compatible interface. The actor uses PhantomJS headless...
Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages usin...
Anti Captcha Recaptcha
🧰 Actor for solving Google reCAPTCHA using the anti-captcha.com service. You need to have an anti-captcha subscription.
Example showing how to use headless Chromium with Puppeteer to open a web page, determine its dimensions, save a screens...
HTML to PDF Converter
Loads a web page in headless Chrome using Puppeteer and prints it to PDF. The input is a JSON object and the output is a PDF...
Takes a screenshot of one or more web pages using the Chrome browser. The actor enables the setting of custom viewport s...
This actor tests a given array of URLs against selected proxy URLs or Apify proxy groups.
Naked Domains Analyzer
Crawls and downloads web pages running on a list of provided naked domains e.g. "example.com". The actor stores HTML sna...
Get localStorage, sessionStorage and cookies from logins for usage in other actors.
Sitemap sniffer checks the most commonly used sitemap variants so that you can use the result for crawling. This will just save you...
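
The idea of checking common sitemap variants can be sketched as follows. This is a minimal illustration, not the actor's code: the function name `candidate_sitemap_urls` and the list of paths are assumptions, since the description does not say which variants the actor probes. The sketch only builds the candidate URLs and makes no network requests.

```python
from urllib.parse import urlsplit

# Assumed list of commonly used sitemap locations (hypothetical; the actor's
# real list is not given in its description).
COMMON_SITEMAP_PATHS = [
    "/sitemap.xml",
    "/sitemap_index.xml",
    "/sitemap.xml.gz",
    "/sitemap.txt",
]


def candidate_sitemap_urls(site_url: str) -> list[str]:
    """Return candidate sitemap URLs for the origin of site_url."""
    parts = urlsplit(site_url)
    if not parts.scheme or not parts.netloc:
        raise ValueError(f"expected an absolute URL, got {site_url!r}")
    # Sitemaps conventionally live at the site root, so drop any path.
    origin = f"{parts.scheme}://{parts.netloc}"
    return [origin + path for path in COMMON_SITEMAP_PATHS]
```

Each candidate would then be requested in turn, and the ones that respond successfully passed on to a crawler.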
Image Downloader & Uploader
Download image files from image URLs in your datasets or key-value stores and save them to our key-value store or your A...
Simple actor that loads a web page and scrapes metadata using the Metascraper library. Metadata - A library to easily scrape ...
Extract links from an Array of different paths/users parsed with a baseUrl and a pageFunction.
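
One plausible reading of the description above is that each path or user name is joined onto the base URL before the page is loaded. The sketch below shows only that joining step, under that assumption; `build_urls` is a hypothetical helper, not part of the actor, and the page function itself is not modelled.

```python
def build_urls(base_url: str, paths: list[str]) -> list[str]:
    """Join each path (or user name) onto base_url with exactly one slash."""
    base = base_url.rstrip("/")
    return [f"{base}/{path.lstrip('/')}" for path in paths]
```

For example, `build_urls("https://example.com/users/", ["alice", "/bob"])` yields one URL per user under `https://example.com/users/`.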
The monitoring runner is a part of the Apify Monitoring Suite (apify/monitoring). See its readme for more information an...
Residential Proxy Probe
Find residential proxy sessions on Apify Proxy with target IP addresses geo-located in specific postal codes or DMAs.
Actor that takes a list of URLs and returns the list of loaded URLs after redirects.
The monitoring teardown is a part of the Apify Monitoring Suite (apify/monitoring). See its readme for more information ...
This act takes a crawler execution and inserts its results into a remote MySQL database.
PhantomJS is 6 to 10 times faster than Puppeteer per Compute Unit. Sends an email when the task is complete. The input s...
A fork of the url-to-pdf actor with added name input and delay until the network is idle. Opens a web page in headless C...
Anti Captcha Image
Act for solving image captchas using the anti-captcha.com service.