Crawl4AI is an Apify Actor that wraps the powerful Crawl4AI library, providing you with a feature-packed web crawler and scraper with additional functionalities like link-following and automatic retries for failed requests.

The Actor can:

Crawl and scrape websites with precision using CSS, XPath, or LLM-based extraction methods.
Generate clean Markdown output, suitable for RAG pipelines or direct ingestion into large language models.
Automatically follow links to explore websites further without manual intervention.
Retry failed requests to ensure maximum data collection with minimal effort.

Usage

Scraping with Crawl4AI is straightforward. Just follow these steps to get your data quickly:

Input your target URLs.
Set your extraction method (optional - CSS, XPath, or LLM-based).
Configure advanced options like proxies or session settings (optional).
Run the Actor to start crawling, link-following, and retrying failed requests automatically.
Retrieve your data in structured Markdown format for further use in your projects.

How much will it cost?

Apify provides $5 free usage credits every month on the Apify Free plan. With Crawl4AI, you can enjoy a certain number of results per month for free.

For larger data needs, consider upgrading to the $49/month Starter plan for increased monthly results volume. Or opt for the Scale plan for even higher result limits.

Results

Here is an example of the data that the Actor produces:

[{
  "url": "https://docs.crawl4ai.com/",
  "markdown": "https://api.apify.com/v2/key-value-stores/m1Sqnke1KWM0AI8co/records/content_4242424242.md",
  "html": "https://api.apify.com/v2/key-value-stores/m1Sqnke1KWM0AI8co/records/content_4242424242.html",
  "metadata": {
    "title": "Home - Crawl4AI Documentation (v0.5.x)",
    "description": "🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper"
  }
},
{
  "url": "https://docs.crawl4ai.com/advanced/ssl-certificate/",
  "markdown": "https://api.apify.com/v2/key-value-stores/m1Sqnke1KWM0AI8co/records/content_4242424242.md",
  "html": "https://api.apify.com/v2/key-value-stores/m1Sqnke1KWM0AI8co/records/content_4242424242.html",
  "metadata": {
    "title": "SSL Certificate - Crawl4AI Documentation (v0.5.x)",
    "description": "🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper"
  }
},
// ...
]

On this page

Share Actor:

AI Web Scraper - Powered by Crawl4AI

raizen/ai-web-scraper

A blazing-fast AI web scraper powered by Crawl4AI. Perfect for LLMs, AI agents, AI automation, model training, sentiment analysis, and content generation. Supports deep crawling, multiple extraction strategies and flexible output (Markdown/JSON). Seamlessly integrates with Make.com, n8n, and Zapier.

Raizen Technology

162

1.0

AI-Powered Web Content & Link Extractor

scrapercoder/ai-powered-web-content-link-extractor

Crawls websites to extract clean, structured content for AI/LLM use, ideal for training datasets, knowledge bases, and RAG systems. Json output includes: * text: Normalized page content * links: Extracted sub-URLs

wallnut.ai

URL Mapper

marcoet/url-mapper

Map every link on any website in seconds. URLMapper instantly crawls a single page, returns a complete JSON of internal URLs, supports keyword filtering, and plugs straight into any Apify workflow or API so you can pre-crawl, audit SEO, or feed clean link lists into larger scrapers.

Marco Elizalde

Website Content Crawler

apify/website-content-crawler

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.

Apify

64K

4.3

Find Sitemap from url

eesti/find-sitemap-from-url

A powerful [Apify Actor] that finds sitemap URLs for any website. This Actor helps you discover XML sitemaps by checking common locations, robots.txt files, and analyzing HTML content for sitemap links.

ando

LinkedIn Post Scraper ✅ No cookies

practicaltools/Apify-linkedin-post-scraper

A robust Actor that extracts structured data from public LinkedIn posts, including author, text, images, videos, date and reactions.

Practical Tools

Slack Message Generator

katerinahronik/slack-message

This actor sends messages to Slack automatically. It can be used instead of email notifications and is ideal to combine with other actors monitoring successful runs, errors, etc.