AI Web Scraper - Powered by Crawl4AI

Pay $25.00 for 1,000 Results

raizen/ai-web-scraper

A blazing-fast AI web scraper powered by Crawl4AI. Perfect for LLMs, AI agents, AI automation, model training, sentiment analysis, and content generation. Supports deep crawling, multiple extraction strategies and flexible output (Markdown/JSON). Seamlessly integrates with Make.com, n8n, and Zapier.

Developer
Maintained by Community

Actor Metrics

  • 3 monthly users

  • No reviews yet

  • No bookmarks yet

  • Created in Mar 2025

  • Modified 2 hours ago

URLs to Scrape

startUrls (array, required)

List of webpages to scrape.

Extraction Strategy

extractionStrategy (enum, optional)

Select how content is extracted.

Value options:

  • "SimpleExtractionStrategy" (string)

  • "LLMExtractionStrategy" (string)

  • "JsonCssExtractionStrategy" (string)

  • "JsonXPathExtractionStrategy" (string)

Default value of this property is "SimpleExtractionStrategy"

Crawl Strategy

crawlStrategy (enum, optional)

Select how pages are crawled.

Value options:

  • "SimpleCrawlStrategy" (string)

  • "BFSDeepCrawlStrategy" (string)

  • "DFSDeepCrawlStrategy" (string)

  • "BestFirstCrawlingStrategy" (string)

Default value of this property is "SimpleCrawlStrategy"
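The two strategy fields accept only the listed values, so it can help to check them before starting a run. The following is a minimal sketch, not part of the actor itself: the helper name `pick_strategies` is hypothetical, and only the option names and defaults are taken from the listing above.

```python
# Allowed values, copied from the "Value options" lists above.
EXTRACTION_STRATEGIES = (
    "SimpleExtractionStrategy",
    "LLMExtractionStrategy",
    "JsonCssExtractionStrategy",
    "JsonXPathExtractionStrategy",
)
CRAWL_STRATEGIES = (
    "SimpleCrawlStrategy",
    "BFSDeepCrawlStrategy",
    "DFSDeepCrawlStrategy",
    "BestFirstCrawlingStrategy",
)


def pick_strategies(extraction="SimpleExtractionStrategy",
                    crawl="SimpleCrawlStrategy"):
    """Hypothetical helper: validate strategy names and return input fields.

    The default arguments mirror the defaults documented above.
    """
    if extraction not in EXTRACTION_STRATEGIES:
        raise ValueError(f"unknown extractionStrategy: {extraction!r}")
    if crawl not in CRAWL_STRATEGIES:
        raise ValueError(f"unknown crawlStrategy: {crawl!r}")
    return {"extractionStrategy": extraction, "crawlStrategy": crawl}


fields = pick_strategies(crawl="BFSDeepCrawlStrategy")
```

Note the spelling of "BestFirstCrawlingStrategy": unlike the other three crawl options it ends in "CrawlingStrategy", which is easy to mistype.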

Browser Configuration

browserConfig (object, optional)

Browser settings as JSON object.

Crawler Configuration

crawlerConfig (object, optional)

Crawler settings as JSON object.

Deep Crawl Configuration

deepCrawlConfig (object, optional)

Settings for deep crawling when using BFSDeepCrawlStrategy, DFSDeepCrawlStrategy, or BestFirstCrawlingStrategy.

Markdown Generator Configuration

markdownConfig (object, optional)

Markdown settings as JSON object.

Content Filter Configuration

contentFilterConfig (object, optional)

Content filter settings as JSON object.

User Agent Configuration

userAgentConfig (object, optional)

User agent settings for browser requests.

LLM Configuration

llmConfig (object, optional)

Configure LLM usage for content extraction.

Extraction Schema

extractionSchema (object, optional)

Define custom extraction rules when using JsonCssExtractionStrategy or JsonXPathExtractionStrategy.
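This page does not show what an extraction schema looks like. As a sketch, the example below uses the schema layout that Crawl4AI documents for its JsonCssExtractionStrategy (a `name`, a `baseSelector`, and a list of `fields`). Whether this actor forwards `extractionSchema` to Crawl4AI unchanged is an assumption, and the helper names and CSS selectors are hypothetical.

```python
def make_field(name, selector, type_="text", attribute=None):
    """Hypothetical helper: build one field rule for the schema."""
    field = {"name": name, "selector": selector, "type": type_}
    if type_ == "attribute":
        # An attribute field needs to say which attribute to read.
        if not attribute:
            raise ValueError("attribute fields need an attribute name")
        field["attribute"] = attribute
    return field


def make_schema(name, base_selector, fields):
    """Hypothetical helper: assemble a Crawl4AI-style CSS schema."""
    fields = list(fields)
    if not fields:
        raise ValueError("a schema needs at least one field")
    return {"name": name, "baseSelector": base_selector, "fields": fields}


# Illustrative selectors only; adapt them to the target page's markup.
schema = make_schema(
    "Articles",
    "article.post",
    [
        make_field("title", "h2"),
        make_field("link", "a", type_="attribute", attribute="href"),
    ],
)

# The schema only applies with one of the two JSON strategies.
input_fragment = {
    "extractionStrategy": "JsonCssExtractionStrategy",
    "extractionSchema": schema,
}
```

For JsonXPathExtractionStrategy the same layout would presumably carry XPath expressions in place of the CSS selectors.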

Session ID

session_id (string, optional)

Use a session ID to persist browser state across multiple requests.

Default value of this property is ""
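Putting the fields together, a run input is a single JSON object with `startUrls` required and everything else optional. The sketch below assembles one and rejects unknown keys. The helper `build_run_input` is hypothetical, and because the page does not specify the shape of each `startUrls` entry, entries are passed through exactly as the caller supplies them.

```python
import json

# Optional field names, copied from the input reference above.
OPTIONAL_FIELDS = {
    "extractionStrategy", "crawlStrategy", "browserConfig", "crawlerConfig",
    "deepCrawlConfig", "markdownConfig", "contentFilterConfig",
    "userAgentConfig", "llmConfig", "extractionSchema", "session_id",
}


def build_run_input(start_urls, **optional):
    """Hypothetical helper: assemble the actor input as a plain dict."""
    start_urls = list(start_urls)
    if not start_urls:
        raise ValueError("startUrls is required and must not be empty")
    unknown = set(optional) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown input fields: {sorted(unknown)}")
    run_input = {"startUrls": start_urls}
    # Leave out unset options so the actor falls back to its defaults.
    run_input.update({k: v for k, v in optional.items() if v is not None})
    return run_input


run_input = build_run_input(
    ["https://example.com"],  # entry shape is an assumption, see above
    crawlStrategy="BFSDeepCrawlStrategy",
    session_id="demo-session",
    llmConfig=None,  # dropped from the result
)
payload = json.dumps(run_input, indent=2)
```

The resulting dict could then be handed to whatever starts the run, for example the `run_input` argument of the Apify Python client's actor `call` method, or pasted as JSON into the actor's input editor.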