Get started
Product
Back
Start here!
Get data with ready-made web scrapers for popular websites
Browse 6,000+ Actors
Apify platform
Apify Store
Pre-built web scraping tools
Actors
Build and run serverless programs
Integrations
Connect with apps and services
AI agents
Equip your AI agents with Actors
Anti-blocking
Scrape without getting blocked
Proxy
Rotate scraper IP addresses
Open source
Crawlee
Web scraping and crawling library
Solutions
Build and monetize MCP servers
Learn how to turn MCP servers into revenue with the latest webinar from our AI team. Available now.
Watch now on YouTube
Web data for
Enterprise
Startups
Universities
Nonprofits
Use cases
Data for generative AI
Lead generation
Market research
Sentiment analysis
View more →
Consulting
Apify Professional Services
Apify Partners
Developers
Documentation
Full reference for the Apify platform
Code templates
Python, JavaScript, and TypeScript
Web scraping academy
Courses for beginners and experts
Monetize your code
Publish your scrapers and get paid
Learn
API reference
CLI
SDK
Earn from your code
$400k paid out in July alone. Many developers earn $3k+ every month.
Start earning now
Resources
Help and support
Advice and answers about Apify
Submit your ideas
Tell us the Actors you want
Changelog
See what’s new on Apify
Customer stories
Find out how others use Apify
Company
About Apify
Contact us
Blog
Live events
Partners
Jobs
We're hiring!
Join our Discord
Talk to scraping experts
Pricing
Contact sales
Pay per usage
jupri/html-reader-mode
Developed by
cat
0.0 (0)
1
3
Last modified
2 years ago
Automation
SEO tools
maged120/reader-mode
Maged
2
5.0
apify/website-content-crawler
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
Apify
70K
4.4
123webdata/ldlc-scraper
A scraper for LDLC to collect electronics product data, specifications, and prices from both category and item pages. Great for price monitoring, reporting, and AI model training. Pricing: $0.005/result. 10 results on free trial.
Milán Vásárhelyi
powerful_bachelor/html-to-markdown-converter
📄✨ HTML to Markdown Converter transforms web pages into clean, portable Markdown. Simply input a URL to extract content while preserving structure, formatting, and media elements.🔄 Perfect for content repurposing, documentation, and creating readable, platform-independent text from any webpage! 🚀
Powerful Bachelor
16
apify/cheerio-scraper
Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using a Node.js code. Supports both recursive crawling and lists of URLs. This actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.
9.6K
4.8
epctex/goodreads-scraper
Scrape goodreads.com for data on millions of books. Crawl book details for images, ISBN, author, description, title, buy links, number of reviews, page number, language, and all other details. You can specify search terms, filters, and much more.
epctex
364
apify/super-scraper-api
Generic REST API for scraping websites: send a URL and get back HTML. This Actor is a drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!
719
4.1
autofacts/metadata-scraper
A powerful web scraper that extracts various types of structured metadata from web pages, including JSON-LD, Microdata, Open Graph, Twitter Cards, and more. Perfect for SEO analysis, content aggregation, and research purposes.
Autofactor
30
qbie/kickstarter-scraper
Search Kickstarter projects based on your selected criteria and get structured data on matching projects. Download your data as HTML table, JSON, CSV, Excel, XML, and RSS feed. Forked from misceres/kickstarter-search.
Matt Warwick
294
4.0
apify/beautifulsoup-scraper
Crawls websites using raw HTTP requests. It parses the HTML with the BeautifulSoup library and extracts data from the pages using Python code. Supports both recursive crawling and lists of URLs. This Actor is a Python alternative to Cheerio Scraper.
888
4.2
mstephen190/vanilla-js-scraper
Scrape the web using familiar JavaScript methods! Crawls websites using raw HTTP requests, parses the HTML with the JSDOM package, and extracts data from the pages using Node.js code. Supports both recursive crawling and lists of URLs. This actor is a non jQuery alternative to CheerioScraper.
Matthias Stephens
475