Get started
Product
Back
Start here!
Get data with ready-made web scrapers for popular websites
Browse 5,000+ Actors
Apify platform
Apify Store
Pre-built web scraping tools
Actors
Build and run serverless programs
Integrations
Connect with apps and services
AI agents
Equip your AI agents with Actors
Anti-blocking
Scrape without getting blocked
Proxy
Rotate scraper IP addresses
Open source
Crawlee
Web scraping and crawling library
Solutions
Professional AI agents, zero cost
Tell us about your slow or hard-to-scale process, and we'll build you a custom AI agent for free!
Find out more
Web data for
Enterprise
Startups
Universities
Nonprofits
Use cases
Data for generative AI
Lead generation
Market research
Sentiment analysis
View more →
Consulting
Apify Professional Services
Apify Partners
Developers
Documentation
Full reference for the Apify platform
Code templates
Python, JavaScript, and TypeScript
Web scraping academy
Courses for beginners and experts
Deploy to Apify
With CLI or GitHub integration
Monetize your code
Publish your scrapers and get paid
Learn
API reference
CLI
SDK
Apify open source fair share
We will support and reward every open-source project on Apify Store
Join now
Resources
Help and support
Advice and answers about Apify
Submit your ideas
Tell us the Actors you want
Changelog
See what’s new on Apify
Customer stories
Find out how others use Apify
Company
About Apify
Contact us
Blog
Live events
Partners
Jobs
We're hiring!
Join our Discord
Talk to scraping experts
Pricing
Contact sales
4
STATUS
Open to develop
CATEGORIES
Developer tools
SUBMITTED
Sep 7, 2021
Easily scrape through Tor's anonymized content and download data as an HTML table, JSON, CSV, Excel, XML.
apify/website-content-crawler
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
Apify
54K
3.6
zeeb0t/web-scraping-api---scrape-any-website
Web Scraping API that quickly and reliably scrapes any website—no selectors required. Premium proxies, CAPTCHA solving, JavaScript rendering, and automated structured data extraction are all included. It’s just $2 per 1,000 web pages scraped, with no minimum spend.
Anthony Ziebell
707
5.0
apify/cheerio-scraper
Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using a Node.js code. Supports both recursive crawling and lists of URLs. This actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.
8K
4.7
parsera-labs/parsera
Extract data from any website using just a URL and column descriptions
Parsera Labs
253
4.0
apify/web-scraper
Crawls arbitrary websites using a web browser and extracts structured data from web pages using a provided JavaScript function. The Actor supports both recursive crawling and lists of URLs, and automatically manages concurrency for maximum performance.
85K
4.4
apify/puppeteer-scraper
Crawls websites with the headless Chrome and Puppeteer library using a provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and list of URLs. Supports login to website.
7.2K
topaz_sharingan/Youtube-Transcript-Scraper
Are you in search of a robust solution for extracting transcripts from YouTube videos? Look no further 😉, YouTube-Transcript-Scraper will meet your needs. Our software not only efficiently retrieves transcripts but also provides additional valuable information .👍 😀 Scrap away 🕵♂️.
Moses Bilal
2K
3.3
muhammetakkurtt/truth-social-scraper
Extract Truth Social profile posts with this professional Apify actor tool. Collect content from Donald Trump and key profiles. Analyze interactions, media and replies with real-time data. Ideal for political monitoring, market research and trend analysis. API integration for real-time data flow.
Muhammet Akkurt
465
4.8
diarmuidr/blog-content-crawler
Crawl an entire blog / knowledge base or filter to just the new content. Supporting relevant AI queries by filtering pages by date
Diarmuid
41
muhammetakkurtt/dexscreener-scraper
DexScreener Token Scraper collects real-time cryptocurrency data from DexScreener. It extracts token prices, liquidity, volumes and transactions across multiple blockchains (Solana, Ethereum, BSC). Supports custom sorting, time frames and delivers comprehensive token analytics for market analysis.
143
Browse our Store and find the right solution for you