Web Scraper is a ready-made solution for scraping the web using the Chrome browser. It takes away all the work necessary to set up a browser for crawling, controls the browser automatically and produces machine readable results in...
SEO Checker + SEO Audit Tool
Crawls all web pages on a specific website and analyzes them from the search engine optimization (SEO) perspective. For example, the actor finds broken links, missing images, and provides information about possible page improvemen...
Google Places Scraper
Extract location details from Google Places which are not provided by Google Maps API like review, photos and popular times.
Google Search Scraper
Crawls Google Search result pages (SERPs) and extracts a list of organic and paid results, ads, snap packs and more. Supports selection of custom country or language, and extraction of custom attributes.
HTML to PDF Converter
Open a web page in headless Chrome using Puppeteer and print it to PDF. The input is a JSON object and output is a PDF file.
Extract hotel data, reviews, listings and prices from Booking when there is no official API from Booking.com.
You can use this actor to monitor any page's content and get a notification when content changes. Technically it extracts text by a given selector and compares it with the previous run. If there is any change, it runs another act...
Amazon crawler - this configuration will extract items for keywords that you will specify in the input, and it will automatically extract all pages for the given keyword. You can specify more keywords on the input for one run.
Contact Information Scraper
Scrape and extract contact information (e-mails, phone numbers, social networks) from any website. Collect or pull and build your own customer database.
Twitter Hashtag Scraper
This Twitter Hashtag Scraper will scrape and extracts all tweets for given hashtag and provide output in JSON, XML, CSV or HTML.
Kickstarter Search Scraper
Missing Kickstarter API? Need fresh Kickstarter news or list of best and finished projects? Try this new wrapper for Kickstarter search, which allows you to configure search filters and get the list of items from Kickstarter searc...
Extract data about homes from Zillow website without API and download into Excel, CSV or JSON.
PDF to HTML Converter
Converts a PDF document to HTML using the pdf2htmlEX tool.
Google Sheets Import & Export
Import data from datasets or crawler executions to your Google spreadsheet. Or even just process the data you already have there!
Extract data from Transfermarkt website without API and export data to JSON, XML or CSV.
JS Code 2 Flowchart
Broken Links Checker
Crawl your website and find broken links. Unlike other similar SEO analysis tools, it also reports broken URL #fragments. The results are stored in a JSON and HTML report.
Google Cheerio Batch
Scrape Google search results in batches. Take a list of URLs as input and save to HTML. It requires GOOGLE_SERP proxy so if you don't have it enabled, contact Apify support
Crawls a website using one or more sitemaps and imports the data to Algolia search index. The text content is identified using simple CSS selectors. The actor simply runs the algolia-webcrawler NPM package (https://www.npmjs.com/...
Act sends mail.
Example showing how to use headless Chromium with Puppeteer to open a web page, determine its dimensions, save a screenshot and print the page to PDF. For more information about Puppeteer, please see https://github.com/GoogleChro...
Act which takes URL and array of strings to search for and returns a definition of a crawler.
How to get data from Xossip? With general crawler from APIFY you can scrape web.archive.org and download all photos, threads and much more.
Article Text Extractor
Simply extracts article text and other meta info from given url. Uses https://github.com/ageitgey/node-unfluff which is a NodeJS implementation of https://github.com/grangier/python-goose.
Crawler To Spreadsheet
This crawler takes last crawler run result and stores new items in Google Docs Spreadsheet.
Anti Captcha Recaptcha
Act for solving google recaptcha using the anti-captcha.com service. You need to have an anti-captcha subscription to be able to use it.
Example Hacker News
Example crawler for news.ycombinator.com build using Apify SDK
Url List Download Html
This act accepts a url list and downloads HTML of each page. It has input parameter - "sources" (see soursec parameter of UrlList https://www.apify.com/docs/sdk/apify-runtime-js/beta#RequestList).
Linkedin Sign In Example
This act shows how you can sign-in to LinkedIn with Apify act with puppeteer image. It is only example how to do it. Usage: 1. Copy&paste this code to your act. 2. Set up email forwarding for user, that use for sign-in: FROM: ...
Send Crawler Results
This act downloads results from Apify crawler and send them to email as attachments. It is designed to run from crawler finish webhook.