LlamaIndex Agent (Python)
Pay per usage
cultured_xiphosuran/llamalndex-agent-python
Rating: 0.0 (0)
Developer: Perseu Ramos
Actor stats
Bookmarked: 0
Total users: 2
Monthly active users: (value not shown)
Last modified: 4 months ago
Categories: AI
vanagha/smart-business-lead-collector---ai-contact-company-scraper
Collect verified business emails, phones, and company summaries with AI. This smart scraper uses LlamaIndex to find and deduplicate contact info from any website. Fast, tested, and free for a limited time!
Van agha
46
visita/rag-browser
This Actor provides essential web browsing and content extraction functionality for AI Agents, LLM applications, and Retrieval-Augmented Generation (RAG) pipelines. It functions similarly to the web search feature in popular LLM chatbots, providing fresh, contextualized data directly from the web.
Visita Intelligence
13
jiri.spilka/dataset-query-engine
Use natural language queries to retrieve results from an Apify dataset. This Actor provides a query engine that loads a dataset, executes SQL queries, and synthesizes results.
Jiří Spilka
23
4.6
(5)
nikhuge/advanced-linkedin-jobs-scraper-with-ai
An intelligent, high-performance LinkedIn job scraper powered by a LangGraph multi-agent system, LlamaIndex for semantic search, and Crawlee + Playwright for robust web scraping.
charith wijesundara
adinfosys-labs/rag-ready-web-scraper-smart-chunker-for-ai-knowledge-bases
RAG-ready web scraper that collects, cleans, deduplicates, filters, and chunks web content into structured datasets for AI pipelines. Generates high-quality knowledge-base data optimized for LLMs, embeddings, and vector databases.
Artashes Arakelyan
ai_solutionist/compliance-web-intel
The scraper AI agents trust. Extract grounded facts with citations, entities, claims & RAG chunks. Built for LangChain, LlamaIndex, AutoGPT. Quality scoring, auto-citations, 6 task modes.
Jason Pellerin
apify/website-content-crawler
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
Apify
107K
4.3
(174)
datascoutapi/website-content-crawler-pro
Crawl websites and extract clean, structured content in Markdown, JSON, or plain text for AI models, LLMs, vector DBs, or RAG pipelines. Fast, reliable, and stealthy, with bulk processing, advanced metadata extraction, and seamless integration with LangChain, LlamaIndex, and AI workflows.
halam
475
3.7
(3)
cspnair/rag-knowledge-graph-builder
Transform websites into RAG-ready datasets. Crawls pages, chunks content into semantic segments (500-1000 tokens), and generates hypothetical questions for each chunk. No API key needed with native mode. Output: pre-indexed JSON optimized for AI retrieval with 3x better accuracy than raw text.
csp
95
5.0
(8)
maged120/seobility-seo-checker
The most powerful scraper for the Seobility SEO checker tool. The output might feel overwhelming, but it's full of details and possibilities. Check it out.
Maged
(1)
devil_port369-owner/web-crawler
Depth-controlled web crawler that transforms websites into structured, analytics-ready data. Starting from one or more URLs, it crawls internal links up to a configurable depth and outputs detailed JSON records per page.
DataFusionX
33
(7)