UT Tower Lighting
This crawler goes to https://tower.utexas.edu/ and extracts the status string of the UT Austin Tower.
Extract hotel data, reviews, listings and prices from Booking when there is no official API from Booking.com.
Google Sheets Import & Export
Import data from datasets or crawler executions to your Google spreadsheet. Or even just process the data you already have there!
Extract reviews, email, addresses, awards and many more from TripAdvisor when there is no reasonable open API by TripAdvisor.
Google Places Scraper
Extract location details from Google Places which are not provided by Google Maps API like review, photos and popular times.
Stack Overflow Search Scraper
Simple Captcha Tesseract
Handy solver for simple image captcha with numbers.
SEO Checker + SEO Audit Tool
Crawls all web pages on a specific website and analyzes them from the search engine optimization (SEO) perspective. For example, the actor finds broken links, missing images, and provides information about possible page improvemen...
Cheerio Scraper is a ready-made solution for crawling the web using plain HTTP requests to retrieve HTML pages and then parsing and inspecting the HTML using the Cheerio library. It's blazing fast.
Puppeteer Scraper is the most powerful scraper tool in our arsenal (aside from developing your own actors). It uses the Puppeteer library to programmatically control a headless Chrome browser and it can make it do almost anything....
Web Scraper is a ready-made solution for scraping the web using the Chrome browser. It takes away all the work necessary to set up a browser for crawling, controls the browser automatically and produces machine readable results in...
Broken Links Checker
Crawls a website and finds broken links. Unlike other similar SEO analysis tools, the actor also reports broken URL #fragments. The results are stored in a JSON and HTML report.
Email Notification Webhook
This actor sends you an email notification with a log file when one of your other actors fails, succeeds, times out, you name it.
Kickstarter Search Scraper
Missing Kickstarter API? Need fresh Kickstarter news or list of best and finished projects? Try this new wrapper for Kickstarter search, which allows you to configure search filters and get the list of items from Kickstarter searc...
Amazon crawler - this configuration will extract items for keywords that you will specify in the input, and it will automatically extract all pages for the given keyword. You can specify more keywords on the input for one run.
PDF to HTML Converter
Converts a PDF document to HTML using the pdf2htmlEX tool.
HTML to PDF Converter
Open a web page in headless Chrome using Puppeteer and print it to PDF. The input is a JSON object and output is a PDF file.
Google Cheerio Batch
Scrape Google search results in batches. Take a list of URLs as input and save to HTML. It requires GOOGLE_SERP proxy so if you don't have it enabled, contact Apify support
Crawls a website using one or more sitemaps and imports the data to Algolia search index. The text content is identified using simple CSS selectors. The actor simply runs the algolia-webcrawler NPM package (https://www.npmjs.com/...
Twitter Hashtag Scraper
This Twitter Hashtag Scraper will scrape and extracts all tweets for given hashtag and provide output in JSON, XML, CSV or HTML.
JS Code 2 Flowchart
Contact Information Scraper
Scrape and extract contact information (e-mails, phone numbers, social networks) from any website. Collect or pull and build your own customer database.
Extract data from Transfermarkt website without API and export data to JSON, XML or CSV.
You can use this actor to monitor any page's content and get a notification when content changes. Technically it extracts text by a given selector and compares it with the previous run. If there is any change, it runs another act...
Google Search Scraper
Crawls Google Search result pages (SERPs) and extracts a list of organic and paid results, ads, snap packs and more. Supports selection of custom country or language, and extraction of custom attributes.
Twitter user info
Get Twitter user info
Legacy PhantomJS Crawler