🤖 Get data to feed your AI models, LLMs or GPTs
Start web scraping with ready-made scrapers
Our reliable open-source web scraping library
Get started with templates for your scraping project
Run serverless cloud programs on the Apify platform
Seamlessly connect with other apps and services
Improve your web scraping performance
Specialized cloud storage for web scraping and crawling
Create, develop, build, and run Apify actors locally
Paid Actor developers
Data for generative AI & LLM
Product matching AI
Universal web scrapers
All use cases
Help and support
Get advice and answers about the Apify platform
Submit your ideas
Upvote or submit actor or integration ideas
Web scraping course
Apify platform course
No credit card required
🧰 Actor for solving Google reCAPTCHA using the anti-captcha.com service. You need to have an anti-captcha subscription.
Automatically crawl and extract text content from websites with documentation, knowledge bases, help centers, or blogs. This Actor is designed to provide data to feed, fine-tune, or train large language models such as ChatGPT or LLaMA.
Monitor a website or web page for content changes. Automatically saves before and after screenshots and sends an email notification when content changes are detected.
Crawls websites with the headless Chrome and Puppeteer library using a provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and list of URLs. Supports login to website.
Import data from datasets or JSON files to Google Sheets. Programmatically process data in Sheets. Easier and faster than the official Google Sheets API and perfect for importing data from scraping.
Check any website you plan to scrape for expected Compute unit consumption, anti-scraping software, and reliability.
Crawls websites using raw HTTP requests. It parses the HTML with the BeautifulSoup library and extracts data from the pages using Python code. Supports both recursive crawling and lists of URLs. This Actor is a Python alternative to Cheerio Scraper.
Crawls websites with the headless Chromium, Chrome, or Firefox browser and Playwright library using a provided server-side Node.js code. Supports both recursive crawling and a list of URLs. Supports login to a website.
Are you a developer? Build your own Actors and run them on Apify.
Get a complete web scraping or automation solution from Apify experts.