Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library and extracts data from the pages using a provided Node.js code. Supports both recursive crawling and lists of URLs. This actor is a high-performance...
Crawls websites with the headless Chrome and Puppeteer library using a provided server-side Node.js code. This actor is an alternative to apify/web-scraper that gives you a low-level control of the crawling process. Supports both ...
Google Search Scraper
Crawls Google Search result pages (SERPs) and extracts a list of organic and paid results, ads, snap packs and more. Supports selection of custom country or language, and extraction of custom attributes.
Google Places Scraper
Extract location details from Google Places that are not provided by Google Maps API, such as reviews, photos and popular times.
Legacy PhantomJS Crawler
Enables scraping of publicly available data from Instagram posts on profile, hashtag and place pages. Extracts links to photos, comments, and detailed information about the Instagram pages. The actor supports search queries as wel...
SEO Checker + SEO Audit Tool
Crawls all web pages on a specific website and analyzes them from the search engine optimization (SEO) perspective. For example, the actor finds broken links, missing images, and provides information about possible page improvemen...
Extract hotel data, reviews, listings and prices from Booking when there is no official API from Booking.com.
Extract reviews, email, addresses, awards and many more from TripAdvisor when there is no reasonable open API by TripAdvisor.
Amazon crawler - this configuration will extract items for keywords that you will specify in the input, and it will automatically extract all pages for the given keyword. You can specify more keywords on the input for one run.
You can use this actor to monitor any page's content and get a notification when content changes. Technically it extracts text by a given selector and compares it with the previous run. If there is any change, it runs another act...
HTML to PDF Converter
Open a web page in headless Chrome using Puppeteer and print it to PDF. The input is a JSON object and output is a PDF file.
Contact Information Scraper
Scrape and extract contact information (e-mails, phone numbers, social networks) from any website. Collect or pull and build your own customer database.
Twitter Hashtag Scraper
This Twitter Hashtag Scraper will scrape and extracts all tweets for given hashtag and provide output in JSON, XML, CSV or HTML.
Google Sheets Import & Export
Import data from datasets or crawler executions to your Google spreadsheet. Or even just process the data you already have there!
Extract data from Transfermarkt website without API and export data to JSON, XML or CSV.
Kickstarter Search Scraper
Missing Kickstarter API? Need fresh Kickstarter news or list of best and finished projects? Try this new wrapper for Kickstarter search, which allows you to configure search filters and get the list of items from Kickstarter searc...
PDF to HTML Converter
Converts a PDF document to HTML using the pdf2htmlEX tool.
JS Code 2 Flowchart
Broken Links Checker
Crawls a website and finds broken links. Unlike other similar SEO analysis tools, the actor also reports broken URL #fragments. The results are stored in a JSON and HTML report.
Google Cheerio Batch
Scrape Google search results in batches. Take a list of URLs as input and save to HTML. It requires GOOGLE_SERP proxy so if you don't have it enabled, contact Apify support
Crawls a website using one or more sitemaps and imports the data to Algolia search index. The text content is identified using simple CSS selectors.
Foursquare Reviews Scraper
Scrape a massive number of reviews in a few seconds. Allows up to 30 places per search query. Look at the example run for INPUT/OUTPUT.
Email Notification Webhook
This actor sends you an email notification with a log file when one of your other actors fails, succeeds, times out, you name it.
Check the results of your scrapers with this flexible checker. Just supply a dataset or key-value store ID and a few simple rules to get a detailed report.
Email and Social handlers extractor
Get emails and social handlers (Twitter, LinkedIn, Instagram) from page/domain/web. Just change the Start url and define the scope.
How to get data from Xossip? With general crawler from APIFY you can scrape web.archive.org and download all photos, threads and much more.
Booking - hotel details
Get info about hotels on booking.com based on search query.
SEO audit tool
Inspects a website and performs a basic SEO analysis of every page. For example, the crawler reports broken links, unoptimized images, too long titles etc.