Pricing

$8.00 / 1,000 results

Go to Store

AI Web Scraper - Powered by Crawl4AI

Try for free

Developed by

Raizen Technology

A blazing-fast AI web scraper powered by Crawl4AI. Perfect for LLMs, AI agents, AI automation, model training, sentiment analysis, and content generation. Supports deep crawling, multiple extraction strategies and flexible output (Markdown/JSON). Seamlessly integrates with Make.com, n8n, and Zapier.

1.0 (1)

Pricing

$8.00 / 1,000 results

Total users

169

Monthly users

Runs succeeded

>99%

Issues response

1.1 hours

Last modified

3 months ago

Agents

Automation

URLs to Scrape

startUrlsarrayRequired

List of webpages to scrape.

Extraction Strategy

extractionStrategyEnumOptional

Select how content is extracted.

Value options:

"SimpleExtractionStrategy": string"LLMExtractionStrategy": string"JsonCssExtractionStrategy": string"JsonXPathExtractionStrategy": string

Default value of this property is "SimpleExtractionStrategy"

Crawl Strategy

crawlStrategyEnumOptional

Select how pages are crawled.

Value options:

"SimpleCrawlStrategy": string"BFSDeepCrawlStrategy": string"DFSDeepCrawlStrategy": string"BestFirstCrawlingStrategy": string

Default value of this property is "SimpleCrawlStrategy"

Browser Configuration

browserConfigobjectOptional

Browser settings as JSON object.

Crawler Configuration

crawlerConfigobjectOptional

Crawler settings as JSON object.

Deep Crawl Configuration

deepCrawlConfigobjectOptional

Settings for deep crawling when using BFS, DFS, or Best-First Strategies.

Markdown Generator Configuration

markdownConfigobjectOptional

Markdown settings as JSON object.

Content Filter Configuration

contentFilterConfigobjectOptional

Content filter settings as JSON object.

User Agent Configuration

userAgentConfigobjectOptional

User agent settings for browser requests.

LLM Configuration

llmConfigobjectOptional

Configure LLM usage for content extraction.

Extraction Schema

extractionSchemaobjectOptional

Define custom extraction rules when using JsonCssExtractionStrategy or JsonXPathExtractionStrategy.

Session ID

session_idstringOptional

Use a session ID to persist browser state across multiple requests.

Default value of this property is ""

Smart Scrape AI

llayaa112/smart-scrape-ai

Smart Scrape AI is an autonomous web automation and scraping actor powered by Playwright and AI. It dynamically interprets prompts, navigates websites, performs tasks, extracts data, and provides intelligent answers. Ideal for zero-code, prompt-driven data extraction and interaction workflows.

laya albshlawy

Universal AI GPT Scraper

louisdeconinck/ai-gpt-scraper

Transform any website into structured data with AI-powered extraction. This versatile tool combines advanced web scraping with intelligent content analysis to deliver clean, customized JSON output - perfect for automating data collection from any web source.

Louis Deconinck

5.0

🔥 FireScrape AI Website Content Markdown Scraper

mohamedgb00714/fireScraper-AI-Website-Content-Markdown-Scraper

Advanced web scraper powered by Crawlee and Puppeteer — extracts website content, converts it to Markdown, and structures it for LLM training datasets.

mohamed el hadi msaid

3.5

RAG Web Browser

apify/rag-web-browser

Web browser for OpenAI Assistants, RAG pipelines, or AI agents, similar to a web browser in ChatGPT. It queries Google Search, scrapes the top N pages, and returns their content as Markdown for further processing by an LLM. It can also scrape individual URLs. Supports Model Context Protocol (MCP).

Apify

4.6K

4.4

🔥fireScraper AI Prompt Website Content Markdown Scraper

mohamedgb00714/fireScraper-AI-prompt-Website-Content-Markdown-Scraper

fireScrape AI is an advanced web scraper built with Crawlee and Puppeteer. It crawls websites, extracts meaningful content, converts it into Markdown, then runs your custom prompt on the extracted text—ideal for generating enriched datasets, summaries or analyses for LLMs and AI pipelines

mohamed el hadi msaid

5.0

AI-Powered Web Content & Link Extractor

scrapercoder/ai-powered-web-content-link-extractor

Crawls websites to extract clean, structured content for AI/LLM use, ideal for training datasets, knowledge bases, and RAG systems. Json output includes: * text: Normalized page content * links: Extracted sub-URLs

wallnut.ai

Website Content to Markdown for LLM Training

easyapi/website-content-to-markdown-for-llm-training

🚀 Transform web content into clean, LLM-ready Markdown! 📘 Scrape multiple pages, extract main content, and convert to Markdown format. Perfect for AI researchers, data scientists, and LLM developers. Fast, efficient, and customizable. Supercharge your AI training data today! 🌐📝🧠

EasyApi

5.0

Web Scraper Task

undrtkr984/web-scraper-task

Matt

119

AI Website Content Markdown Scraper

quaking_pail/ai-website-content-markdown-scraper

This Apify Actor, "Website Content Crawler with Markdown Extraction," is designed to perform a comprehensive crawl of specified websites, extract their text content, convert it into Markdown format, and store it in a structured dataset. The extracted content is suitable for feeding LLMs.

AI_Builder

607

4.3

Web Scraping API

zeeb0t/web-scraping-api---scrape-any-website

Web Scraping API that quickly and reliably scrapes any website—no selectors required. Premium proxies, CAPTCHA solving, JavaScript rendering, and automated structured data extraction are all included. It’s just $2 per 1,000 web pages scraped, with no minimum spend.