Web Scraper is a ready-made solution for scraping the web using the Chrome browser. It takes away all the work necessary to set up a browser for crawling, controls the browser automatically and produces machine readable results in...
Google Search Scraper
Crawls Google Search result pages (SERPs) and extracts a list of organic and paid results, ads, snap packs and more. Supports selection of custom country or language, and extraction of custom attributes.
Act sends mail.
Example showing how to use headless Chromium with Puppeteer to open a web page, determine its dimensions, save a screenshot and print the page to PDF. For more information about Puppeteer, please see https://github.com/GoogleChro...
Act which takes URL and array of strings to search for and returns a definition of a crawler.
Crawler Results To S3
Act to upload results from Apify crawler to AWS S3. It is designed to run from crawler finish webhook.
Contains a basic boilerplate of an Apify actor with a custom Dockerfile and hosted in a Git repository. It's purpose is to help you get started quickly to create your own actors.
Legacy Phantomjs Crawler
This is Apify actor implementation of the legacy PhantomJS crawler. The actor supports the same input as the legacy crawler, so you can call it the same way as the old one.
Example act using PHP as the main language.
Example of loading a web page in headless Chrome using Selenium Webdriver.
Example Process Crawl Results
Example act that iterates through all results from a crawler run and counts them. The act shall be called from the crawler's finish webhook. To do so, simply add the following URL to the finish webhook of your crawler: https://ap...
Example Golden Gate Webcam
Example act that opens a webpage with Golden Gate webcam stream. It takes a screenshot from the stream and saves it as output to key-value store. You can easily use it as API that returns a screenshot with: https://api.apify.com/v...
Simple example showing how to call another act. It doesn't accept any input and doesn't generate any output.
Hello world act to demonstrate a simple usage of Apify Actor.
Example Live View
This actor serves as an example of a crawling run using the Live View feature. It crawls through Hacker News page by page and you may inspect any of the pages' screenshot or HTML in the Live View panel.
Example Web Server
This example demonstrates how to use web server in actor as communication channel with outer world. Read article about this crawler in Apify knowledge base: https://kb.apify.com/actor/running-a-web-server
Har Files For Url List
Generates a HTTP Archive (HAR) file for web pages specified in a list of URLs. Optionally, the pages can be loaded using proxies from a specific country - to use this feature, you'll need access to Apify Proxy. On input, the act...
This act simply counts from one up. In each run it prints one number. Its state (counter position) is stored in named key-value store. Name of the store is example-counter and you can find in Apify app under the Storages.
This actor simply tests given array of URLs against selected proxy URLs or Apify proxy groups.
Example using GitHub Gist
Example of an Apify actor with source code in a GitHub Gist. This is useful for small projects that have multiple source code files, where creating a full Git repository does not make sense and you don't feel like hosting the sour...
Returns diff of two given images as JPEG or PNG image.
Webpage screenshot downloader
Generates a screenshot of a webpage on a specified URL. The screenshot is stored to the default key-value store as the output.