Get started
Product
Back
Start here!
Get data with ready-made web scrapers for popular websites
Browse 40,213 Actors
Apify platform
Apify Store
Pre-built web scraping tools
Actors
Build and run serverless programs
Integrations
Connect with apps and services
MCP
Give your AI access to Actors
Anti-blocking
Scrape without getting blocked
Proxy
Rotate scraper IP addresses
Open source
Crawlee
Web scraping and crawling library
Solutions
MCP server configuration
Configure your Apify MCP server with Actors and tools for seamless integration with MCP clients.
Start building
Web data for
Enterprise
Startups
Universities
Nonprofits
Use cases
Data for generative AI
Data for AI agents
Lead generation
Market research
View more →
Consulting
Apify Professional Services
Apify Partners
Developers
Documentation
Full reference for the Apify platform
Code templates
Python, JavaScript, and TypeScript
Web scraping academy
Courses for beginners and experts
Monetize your code
Publish your scrapers and get paid
Learn
API reference
CLI
SDK
Earn from your code
$1.2M paid out last month. Many developers earn over $3k.
Start earning now
Resources
Help and support
Advice and answers about Apify
Actor ideas
Get inspired to build Actors
Changelog
See what’s new on Apify
Customer stories
Find out how others use Apify
Company
About Apify
Contact us
Blog
Live events
Partners
Jobs
We're hiring!
Join our Discord
Talk to scraping experts
Pricing
Contact sales
HTML Reader Mode
Pay per usage
jupri/html-reader-mode
Rating
0.0
(0)
Developer
cat
Actor stats
1
Bookmarked
3
Total users
Monthly active users
3 years ago
Last modified
Categories
Automation
SEO tools
Share
maged120/reader-mode
Extract clean, readable article content from any web page. Strips ads, navigation, and clutter — returns title, author, full body text, and publish date in structured JSON.
Maged
5
mikolabs/website-content-crawler
Deep-crawl websites to extract clean text, Markdown, or HTML for AI/LLM apps, RAG pipelines, and vector databases. Supports adaptive crawling, HTML cleaning, file downloads, and structured dataset output. Easily integrates with LangChain, LlamaIndex, and other LLM tools.
mikolabs
19
5.0
(1)
ai_solutionist/hyper-reader
High-fidelity web extraction for AI agents. Clean Markdown optimized for Claude, GPT-4 & Gemini. 3-level stealth, Vision screenshots, Deep Read link following. Standby Mode for 1-second responses.
Jason Pellerin
12
automation-lab/webpage-to-markdown-converter
Convert URLs to clean Markdown/JSON for LLM and RAG pipelines. A lightweight Firecrawl/Jina-style option on Apify for pages that work with HTTP + Readability extraction.
Stas Persiianenko
junipr/pdf-to-html
Convert PDFs to clean HTML preserving formatting, headings, tables, and layout. Multi-page support with per-page or combined output. OCR fallback for image PDFs. Inline CSS styling. Download via API.
junipr
9
crawlerbros/mozilla-addons-scraper
Scrape Mozilla Firefox Add-ons (AMO) - search thousands of extensions, themes, and language packs, or fetch specific add-ons by slug. Extracts full metadata: ratings, downloads, version info, authors, screenshots, and more.
Crawler Bros
2
apify/website-content-crawler
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
Apify
132K
4.6
(205)
openfrontier_ai/Web-Accessibility-WCAG-Auditor
A11yBot automates WCAG 2.2 audits by "seeing" your site like a human. While standard tools only scan code, our visual AI catches barriers like poor contrast and missing focus, that HTML scanners miss. Its "Triple-Check" trait cross-references the DOM, ARIA tree, and screenshots for total precision.
OpenFrontier AI
ai_solutionist/compliance-web-intel
The scraper AI agents trust. Extract grounded facts with citations, entities, claims & RAG chunks. Built for LangChain, LlamaIndex, AutoGPT. Quality scoring, auto-citations, 6 task modes.
marielise.dev/pdf-to-mp3
Convert PDF, EPUB, DOCX, Markdown, HTML, TXT, and RTF to MP3 audiobooks. Free Microsoft Edge TTS (no API key) with OCR for scanned PDFs, 70+ languages, and optional OpenAI or ElevenLabs voices. ~$0.04/min.
Marielise
delectable_incubator/goodreads-reviews-scraper-low-cost
Scrape Goodreads book reviews 📚⭐ with a powerful review scraper. Extract reviewer names, ratings, review text, review dates, and profile links from any Goodreads book page. Ideal for book market research, sentiment analysis, literary studies, reader feedback analysis, and AI/NLP datasets 📊🚀
Prime Scrape