Actor for solving Google reCAPTCHA using the anti-captcha.com service. You need to have an anti-captcha subscription.
apify/web-scraper
Crawls arbitrary websites using the Chrome browser and extracts data from pages using provided JavaScript code. The actor supports both recursive crawling and lists of URLs, and automatically manages concurrency for maximum performance. This is Apify's basic tool for web crawling and scraping.
Apify
52.4k
apify/website-content-crawler
Automatically crawl and extract text content from websites with documentation, knowledge bases, help centers, or blogs. This Actor is designed to provide data to feed, fine-tune, or train large language models such as ChatGPT or LLaMA.
7k
apify/cheerio-scraper
Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using provided Node.js code. Supports both recursive crawling and lists of URLs. This actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.
3.3k
jakubbalada/content-checker
Monitor a website or web page for content changes. Automatically saves before and after screenshots and sends an email notification when content changes are detected.
Jakub Balada
1.9k
apify/send-mail
The actor automatically sends an email to a specific address. This actor is useful for notifications and reporting. With only 3 lines of JavaScript code, you can keep track of your scraping actors and never miss important results or issues.
2.1k
apify/puppeteer-scraper
Crawls websites with headless Chrome and the Puppeteer library using provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and lists of URLs. Supports logging in to websites.
2.6k
lukaskrivka/google-sheets
Import data from datasets or JSON files to Google Sheets. Programmatically process data in Sheets. Easier and faster than the official Google Sheets API and perfect for importing data from scraping.
Lukáš Křivka
752
lukaskrivka/website-checker
Check any website you plan to scrape for expected Compute unit consumption, anti-scraping software, and reliability.
645
apify/beautifulsoup-scraper
Crawls websites using raw HTTP requests. It parses the HTML with the BeautifulSoup library and extracts data from the pages using Python code. Supports both recursive crawling and lists of URLs. This Actor is a Python alternative to Cheerio Scraper.
232
apify/playwright-scraper
Crawls websites with a headless Chromium, Chrome, or Firefox browser and the Playwright library using provided server-side Node.js code. Supports both recursive crawling and lists of URLs. Supports logging in to websites.
455