🤖 Get data to feed your AI models, LLMs or GPTs
Start web scraping with ready-made scrapers
Our reliable open-source web scraping library
Get started with templates for your scraping project
Run serverless cloud programs on the Apify platform
Seamlessly connect with other apps and services
Improve your web scraping performance
Specialized cloud storage for web scraping and crawling
Create, develop, build, and run Apify actors locally
Paid Actor developers
Data for generative AI & LLM
Product matching AI
Universal web scrapers
All use cases
Help and support
Get advice and answers about the Apify platform
Submit your ideas
Upvote or submit actor or integration ideas
Web scraping course
Apify platform course
No credit card required
Act for comparing 2 JSON arrays of objects.
By default the final result set will contain only new and updated records.
Automatically crawl and extract text content from websites with documentation, knowledge bases, help centers, or blogs. This Actor is designed to provide data to feed, fine-tune, or train large language models such as ChatGPT or LLaMA.
Crawls websites with the headless Chrome and Puppeteer library using a provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and list of URLs. Supports login to website.
Crawls websites using raw HTTP requests. It parses the HTML with the BeautifulSoup library and extracts data from the pages using Python code. Supports both recursive crawling and lists of URLs. This Actor is a Python alternative to Cheerio Scraper.
Crawls websites with the headless Chromium, Chrome, or Firefox browser and Playwright library using a provided server-side Node.js code. Supports both recursive crawling and a list of URLs. Supports login to a website.
Free proxy scraper and checker. Search dozens of free proxy websites. Get list of 100% working public proxies in seconds. Automatically test proxies based on target URL and maximum timeout.
Create a screenshot of a website based on a specified URL. The screenshot is stored as the output in a key-value store. It can be used to monitor web changes regularly after setting up the scheduler.
Automatically triggered on a failed run to analyze if the run should be resurrected and to create an error report for the author.
Are you a developer? Build your own Actors and run them on Apify.
Get a complete web scraping or automation solution from Apify experts.