Web Search & Scrape by XCrawl Proxy

Under maintenance

Pricing

Pay per usage

Search the web or scrape any URL using the XCrawl residential proxy network. Bypass anti-bot systems with automatic JS rendering fallback, global IP rotation, and configurable concurrency (1-20). Suited to market research, LLM data collection, and content aggregation.

Rating

0.0

(0)

Developer

Charles

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 3 days ago

Web Search & Scrape by XCrawl Proxy — Apify Actor

Search the web or scrape any URL using XCrawl's residential proxy network. This Actor combines Google-quality search with intelligent anti-bot page scraping — all in one call.

Why This Actor?

| Feature | Benefit |
| --- | --- |
| Search + Scrape in One | No need to chain separate tools. Search results are automatically enriched with full page content. |
| Residential IP Rotation | Every request goes through true residential proxies, with far fewer blocks than datacenter IPs. |
| Smart Anti-Bot Retry | Automatically detects Cloudflare, DataDome, and captcha pages and retries with JS rendering. |
| Configurable Concurrency | 1–20 parallel scrapes: tune for speed vs. stealth. |
| 30+ Block Heuristics | Detects and bypasses the most common anti-scraping systems. |
| Global Geo-Targeting | Search from US, UK, JP, DE, AU to get localized results. |
| Markdown Output | Clean markdown content, not messy HTML, ready for LLM consumption. |
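
The "Smart Anti-Bot Retry" row describes a detect-then-retry pattern. The following Python sketch illustrates that general idea only; it is not the Actor's actual implementation. The marker strings and the `fetch(url, render)` callable are hypothetical stand-ins, since the Actor's real heuristics are not documented here.

```python
from typing import Callable, Tuple

# Hypothetical block-page markers (assumption, not the Actor's real list).
BLOCK_MARKERS = ("just a moment", "captcha", "datadome", "access denied")


def looks_blocked(html: str) -> bool:
    """Return True if the page body contains any block-page marker."""
    lowered = html.lower()
    return any(marker in lowered for marker in BLOCK_MARKERS)


def fetch_with_fallback(url: str, fetch: Callable[[str, bool], str]) -> Tuple[str, bool]:
    """Fetch without JS rendering first; retry once with rendering if blocked.

    `fetch(url, render)` is supplied by the caller and returns the page body.
    Returns (body, rendered), where `rendered` tells which attempt was used.
    """
    body = fetch(url, False)
    if looks_blocked(body):
        return fetch(url, True), True
    return body, False
```

Trying the cheaper non-rendered request first and escalating only on a detected block keeps rendering costs down for pages that do not need it.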

Quick Start

1. Get an XCrawl API Key

Sign up at dash.xcrawl.com and get your free API key.

2. Set Environment Variable

XCRAWL_API_KEY=your-api-key-here

3. Run the Actor

Search the web:

{
  "action": "search",
  "query": "latest AI startup funding 2026",
  "location": "US",
  "limit": 5,
  "withContent": true
}

Scrape a single page:

{
  "action": "scrape",
  "url": "https://example.com/article",
  "formats": "markdown,summary"
}
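
To send one of these inputs from a script, one option is the Apify API's synchronous run endpoint. The sketch below builds the request with the Python standard library only and does not send it. The Actor ID `username~actor-name` and the `<APIFY_TOKEN>` value are placeholders, because this README does not state the Actor's real ID.

```python
import json
import urllib.request

# Placeholder: the Actor's real ID is not given in this README.
ACTOR_ID = "username~actor-name"


def build_run_request(run_input: dict, apify_token: str,
                      actor_id: str = ACTOR_ID) -> urllib.request.Request:
    """Build (but do not send) a POST request that runs the Actor
    synchronously and returns its dataset items."""
    url = f"https://api.apify.com/v2/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {apify_token}",
        },
        method="POST",
    )


search_input = {
    "action": "search",
    "query": "latest AI startup funding 2026",
    "location": "US",
    "limit": 5,
    "withContent": True,
}
request = build_run_request(search_input, apify_token="<APIFY_TOKEN>")
# To execute for real: items = json.load(urllib.request.urlopen(request))
```

Synchronous runs are convenient for small searches. For long jobs, starting the run and polling its dataset later may be a better fit.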

Input Parameters

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| action | "search" or "scrape" | "search" | Choose operation mode |
| query | string | | Search query (required for search) |
| url | string | | Target URL (required for scrape) |
| location | string | "US" | Geo-location code (US, UK, CN, JP, DE, AU...) |
| language | string | "en" | Search language code |
| limit | integer | 10 | Max results (1–50) |
| withContent | boolean | true | Auto-fetch full page content for each search result |
| formats | string | "markdown,summary" | Comma-separated: markdown, summary, html |
| render | boolean | false | Enable JS rendering for heavily protected sites |
| screenshot | boolean | false | Capture PNG screenshot (requires render=true) |
| concurrency | integer | 5 | Parallel scrapes (1–20) |
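
The constraints in the parameter list can be checked on the client side before starting a run. This is a minimal sketch that encodes only the rules stated above (required fields, ranges, allowed formats, and the screenshot/render dependency). The function name `validate_input` is illustrative, and the Actor's own validation may differ.

```python
from typing import List

ALLOWED_FORMATS = {"markdown", "summary", "html"}


def validate_input(run_input: dict) -> List[str]:
    """Return a list of problems found in an Actor input (empty if none)."""
    errors = []
    action = run_input.get("action", "search")
    if action not in ("search", "scrape"):
        errors.append('action must be "search" or "scrape"')
    if action == "search" and not run_input.get("query"):
        errors.append("query is required for search")
    if action == "scrape" and not run_input.get("url"):
        errors.append("url is required for scrape")

    limit = run_input.get("limit", 10)
    if not isinstance(limit, int) or not 1 <= limit <= 50:
        errors.append("limit must be an integer from 1 to 50")

    concurrency = run_input.get("concurrency", 5)
    if not isinstance(concurrency, int) or not 1 <= concurrency <= 20:
        errors.append("concurrency must be an integer from 1 to 20")

    formats = run_input.get("formats", "markdown,summary")
    unknown = [f.strip() for f in formats.split(",") if f.strip() not in ALLOWED_FORMATS]
    if unknown:
        errors.append("unknown formats: " + ", ".join(unknown))

    if run_input.get("screenshot", False) and not run_input.get("render", False):
        errors.append("screenshot requires render=true")
    return errors
```

Catching a bad input locally avoids spending platform usage on a run that would fail anyway.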

Output

Each result contains:

{
  "title": "Page title",
  "url": "https://...",
  "snippet": "Search snippet or description",
  "markdown": "Full page content in markdown (up to 100K chars)",
  "summary": "AI-generated page summary",
  "status": "completed" | "failed",
  "credits": "XCrawl credits used",
  "source": "search" | "scrape"
}
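
Once a run finishes, the result items can be summarized with a few lines of Python. This sketch assumes `credits` is numeric (or a numeric string), which the schema above does not confirm. The helper name `summarize` is illustrative.

```python
from typing import Dict, List


def summarize(items: List[Dict]) -> Dict:
    """Count completed results, collect failed URLs, and total the credits.

    Assumes each item's "credits" value can be converted with float().
    """
    completed = [item for item in items if item.get("status") == "completed"]
    failed_urls = [item.get("url") for item in items if item.get("status") == "failed"]
    total_credits = sum(float(item.get("credits", 0) or 0) for item in items)
    return {
        "completed": len(completed),
        "failed_urls": failed_urls,
        "credits": total_credits,
    }
```

The failed URLs can be fed back into a follow-up scrape run, for example with render set to true.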

Use Cases

  • Market Research — Scrape competitor pages, track pricing
  • LLM Training Data — Clean markdown ready for fine-tuning
  • Content Aggregation — Build news/trending feeds
  • Lead Generation — Search + scrape prospect pages in one workflow
  • SEO Monitoring — Track SERP positions and page changes

Pricing

This Actor bills via Apify platform usage plus XCrawl API credits (charged by dash.xcrawl.com). The XCrawl free tier is generous enough to get started — no upfront payment needed.

Environment Variables

| Variable | Required | Source |
| --- | --- | --- |
| XCRAWL_API_KEY | Yes | dash.xcrawl.com |

Built on XCrawl — the web scraping proxy platform with built-in anti-bot protection.