Deprecated

Pricing

Pay per usage

See alternative Actors

Go to Apify Store

Web Search & Scrape by XCrawl Proxy

Deprecated

See alternative Actors

Search the web or scrape any URL using XCrawl residential proxy network. Bypass anti-bot systems with automatic JS rendering fallback, global IP rotation, and configurable concurrency (1-20). Perfect for market research, LLM data collection, and content aggregation.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Charles

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

Web Search & Scrape by XCrawl Proxy - Apify Actor

Search the web or scrape any URL using XCrawl's residential proxy network. This Actor combines Google-quality search with intelligent anti-bot page scraping - all in one call.

Why This Actor?

Feature	Benefit
Search + Scrape in One	No need to chain separate tools. Search results are automatically enriched with full page content.
Residential IP Rotation	Every request goes through true residential proxies - far fewer blocks than datacenter IPs.
Smart Anti-Bot Retry	Automatically detects Cloudflare, DataDome, captcha pages and retries with JS rendering.
Configurable Concurrency	1-20 parallel scrapes - tune for speed vs. stealth.
30+ Block Heuristics	Detects and bypasses the most common anti-scraping systems.
Global Geo-Targeting	Search from US, UK, JP, DE, AU - get localized results.
Markdown Output	Clean markdown content, not messy HTML - ready for LLM consumption.

Quick Start

1. Get an XCrawl API Key

2. Set Environment Variable

XCRAWL_API_KEY=your-api-key-here

3. Run the Actor

Search the web:

{
  "action": "search",
  "query": "latest AI startup funding 2026",
  "location": "US",
  "limit": 5,
  "withContent": true
}

Scrape a single page:

{
  "action": "scrape",
  "url": "https://example.com/article",
  "formats": "markdown,summary"
}

Input Parameters

Parameter	Type	Default	Description
`action`	`"search"` or `"scrape"`	`"search"`	Choose operation mode
`query`	string	-	Search query (required for `search`)
`url`	string	-	Target URL (required for `scrape`)
`location`	string	`"US"`	Geo-location code (US, UK, CN, JP, DE, AU...)
`language`	string	`"en"`	Search language code
`limit`	integer	`10`	Max results (1-50)
`withContent`	boolean	`true`	Auto-fetch full page content for each search result
`formats`	string	`"markdown,summary"`	Comma-separated: `markdown`, `summary`, `html`
`render`	boolean	`false`	Enable JS rendering for heavily protected sites
`screenshot`	boolean	`false`	Capture PNG screenshot (requires `render=true`)
`concurrency`	integer	`5`	Parallel scrapes (1-20)

Output

Each result contains:

{
  "title":      "Page title",
  "url":        "https://...",
  "snippet":    "Search snippet or description",
  "markdown":   "Full page content in markdown (up to 100K chars)",
  "summary":    "AI-generated page summary",
  "status":     "completed" | "failed",
  "credits":    "XCrawl credits used",
  "source":     "search" | "scrape"
}

Use Cases

Market Research - Scrape competitor pages, track pricing
LLM Training Data - Clean markdown ready for fine-tuning
Content Aggregation - Build news/trending feeds
Lead Generation - Search + scrape prospect pages in one workflow
SEO Monitoring - Track SERP positions and page changes

Pricing

This Actor bills via Apify platform usage plus XCrawl API credits (charged by dash.xcrawl.com). The XCrawl free tier is generous enough to get started - no upfront payment needed.

Environment Variables

Variable	Required	Source
`XCRAWL_API_KEY`	?	dash.xcrawl.com

Links

Built on XCrawl - the web scraping proxy platform with built-in anti-bot protection.

Xcrawl Search Scrape Actor

empathetic_chorus/xcrawl-search-scrape-actor

Charles

RAG Web Browser

simpleapi/rag-web-browser

SimpleAPI

RAG Web Browser

api-empire/rag-web-browser

API Empire

RAG Web Browser

scraper-engine/rag-web-browser

Scraper Engine

RAG Web Browser

scrapier/rag-web-browser

🌐 RAG Web Browser (rag-web-browser) is an intelligent tool for retrieving and generating answers from web sources with RAG. ⚡ Speed up research, get accurate citations, and streamline workflows for developers & analysts.

Scrapier

RAG Web Browser

scrapio/rag-web-browser

Scrapio

Clean Web Scraper - Markdown for AI via Firecrawl

clearpath/web-to-markdown

Convert any website to clean, LLM-optimized markdown using Firecrawl. Perfect for RAG pipelines, AI training data, and knowledge bases. No login required, 25% cheaper than Firecrawl direct. Batch process hundreds of URLs. Supports PDF/DOCX. Pay only $0.004 per page - no monthly fees.

ClearPath

Firecrawl MCP

red.cars/firecrawl-mcp

AI agents that need web data without anti-bot headaches. 20 tools for API-based web scraping, crawl, search, and extract — no proxy rotation, no stealth needed.

AutomateLab

Booking.com Scraper — Hotels, Prices, Reviews & Host Data

pro100chok/booking-all-in-one-scraper

Extract Booking.com hotel listings, live room prices & availability, guest reviews and ratings, full property details, and professional host contacts. Scrape by destination or URL with automatic pagination — no code, no login. Export to JSON, CSV, Excel or via API.

Raven

Bing Search Scraper

automation-lab/bing-search-scraper

Scrape Bing web search results for any query. Get titles, URLs, snippets, dates, and rank positions for organic results. HTTP-only, fast, no proxy needed. Supports multiple queries, pagination, and market/language settings.