Deprecated

Pricing

Pay per usage

See alternative Actors

Go to Apify Store

Xcrawl Search Scrape Actor

Deprecated

See alternative Actors

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Charles

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

XCrawl Web Search & Scrape â€” Apify Actor

Search the web and scrape any URL using XCrawl's residential proxy network. Bypass anti-bot systems with automatic JS rendering fallback and global IP rotation.

Actor: yanxvdong123/xcrawl-search-scrape | Runtime: Node.js 22 | License: MIT

ðŸš€ Quick Start

Open the Actor Console
Set XCRAWL_API_KEY in Environment Variables (get a free key at dash.xcrawl.com)
Choose Search or Scrape mode, fill in the inputs
Hit Run

No credit card needed â€” XCrawl gives free trial credits on signup.

ðŸ“‹ Input Parameters

Search Mode (`action: "search"`)

Parameter	Type	Default	Description
`query`	string	required	Web search query (max 200 chars)
`limit`	integer	`10`	Number of results (1â€“50)
`location`	string	`"US"`	Geo-location code (`US`, `UK`, `CN`, `JP`, `DE`, etc.)
`language`	string	`"en"`	Search language (`en`, `zh`, `ja`, `fr`, etc.)
`withContent`	boolean	`true`	Fetch full page content for each result
`render`	boolean	`false`	JS rendering for anti-bot bypass
`formats`	string	`"markdown,summary"`	Output formats: comma-separated (`markdown`, `summary`, `html`)
`screenshot`	boolean	`false`	Capture page screenshot (requires `render=true`)

Scrape Mode (`action: "scrape"`)

Parameter	Type	Default	Description
`url`	string	required	Single URL to scrape (max 2000 chars)
`render`	boolean	`false`	JS rendering for anti-bot bypass
`formats`	string	`"markdown,summary"`	Output formats
`screenshot`	boolean	`false`	Capture screenshot (requires `render=true`)

ðŸ§ Intelligent Anti-Block System

This actor is built to handle modern anti-bot systems out of the box:

Automatic block detection â€” Heuristically checks for Cloudflare, DataDome, and other challenge pages (looks for captcha forms, browser verification, access denied messages)
Smart retry â€” If a page appears blocked, automatically retries with headless browser rendering (Chromium via XCrawl's jsRender)
Concurrent crawling â€” Uses p-limit to run up to 5 parallel scrapes (balanced for speed + reliability)
Global proxy pool â€” Requests route through XCrawl's residential proxy network with configurable geo-location
Per-URL resilience â€” Each URL gets at least 2 attempts; if both fail, the error is recorded per-entry without stopping the batch

When to enable `render`

âœ… Turn ON for: News sites with paywalls (Reuters, WSJ), sites behind Cloudflare/DataDome, JavaScript-heavy SPAs
âŒ Keep OFF for: Simple HTML pages, blogs, documentation (faster and cheaper without rendering)

ðŸ“¦ Output Format

Each result is pushed to the Apify dataset:

{
  "title": "Page Title",
  "url": "https://example.com",
  "snippet": "Search result description",
  "markdown": "Full page content converted to markdown...",
  "summary": "AI-generated summary from XCrawl...",
  "scrapeStatus": "completed",
  "screenshot": "base64-encoded PNG (if enabled)",
  "credits": "0.5",
  "scrapeError": null
}

Search mode returns an array of enriched results.
Scrape mode returns a single result object.

ðŸ’° Usage & Pricing

Mode	XCrawl Credits Consumed
Search (1 query)	~1 credit
Scrape (no render)	~1â€“3 credits
Scrape (with render)	~3â€“8 credits
Free trial	âœ… Included with XCrawl signup

The actor itself is free to run on Apify â€” you only pay for XCrawl API credits consumed.

ðŸ”§ Environment Variables

Variable	Required	Description
`XCRAWL_API_KEY`	âœ… Yes	Your API key from dash.xcrawl.com. Sign up â†’ Dashboard â†’ API Keys

ðŸŽ¯ Use Cases

Content research â€” Collect articles, blog posts, and documentation on any topic
Market intelligence â€” Scrape competitor pricing, product listings, and reviews
SEO / SERP monitoring â€” Track search rankings across different geo-locations
RAG / LLM pipelines â€” Feed clean markdown content into vector databases or AI agents
E-commerce â€” Monitor product catalogs with location-specific searches
News aggregation â€” Gather articles from multiple sources with automatic paywall bypass

ðŸ— Architecture

Apify Run
  â””â”€ src/main.js (entry point)
      â”œâ”€ XCrawl Search API  â†’  Get top results
      â”œâ”€ XCrawl Scrape API  â†’  Extract page content
      â”‚   â””â”€ p-limit (concurrency = 5)
      â”‚       â”œâ”€ Normal scrape (fast)
      â”‚       â””â”€ Retry with JS render (anti-bot fallback)
      â””â”€ Apify Dataset     â†  Push all results

ðŸ“„ Links

Source code: GitHub
XCrawl Dashboard: dash.xcrawl.com
XCrawl API Docs: docs.xcrawl.com
Report issues: GitHub Issues

Web Search & Scrape by XCrawl Proxy

empathetic_chorus/xcrawl-search-scrape

Search the web or scrape any URL using XCrawl residential proxy network. Bypass anti-bot systems with automatic JS rendering fallback, global IP rotation, and configurable concurrency (1-20). Perfect for market research, LLM data collection, and content aggregation.

Charles

Vimeo & Dailymotion Toolkit

moving_beacon-owner1/vimeo-dailymotion-toolkit

Scrape metadata, extract streams, search videos, crawl channels, and download from Vimeo and Dailymotion. All public functions work without API keys.

Jamshaid Arif

Fragrantica Scraper - Perfume Data Extractor

plastic_warrior/Fragrantica-Scraper-1

5$ per 1000 listings. Scrape fragrance data from Fragrantica.com - including perfume names, brands, notes, ratings, reviews, images and their clones. Fast and structured data extraction from largest perfume community.

Faheem Ahmed

Bazos.sk Scraper

appealing_jingle/bazos-sk-scraper

Scrape all listings from bazos.sk, bazos.cz, bazos.pl, bazos.at with title, URL, price, location, zip code and views in structured data.

09 try

Amazon Product & Search Scraper — Apify Actor

moving_beacon-owner1/my-actor-46

Easily scrape Amazon listings with an Apify actor. Use SEARCH mode for keywords (ASIN, title, price, rating) or PRODUCT mode for full details (brand, features, availability). Supports deep scraping, automatic retries, and handles Cloudflare/TLS for reliable data collection.

Jamshaid Arif

Eniro.se Scraper

rainminer/eniro-se-scraper

Scrape Eniro.se business listings, contact details, addresses, ratings, opening hours, websites, and company profile data from Swedish local search pages.

rainminer

Noon.com Scraper

moving_beacon-owner1/noon-com-scraper

Gather comprehensive information from Noon.com, including product listings, pricing details, customer ratings, available coupons, and delivery options specifically for the regions of the United Arab Emirates, Saudi Arabia, and Egypt.

Jamshaid Arif

Kijiji Auto & Classifieds Scraper Actor

fayoussef/kijiji-ca-scraper

Kijiji.ca listings scraper. Extracts detailed data including phone number, price, location, specs, and images from search results (with pagination) or direct ad URLs. Ideal for market research and data collection.

youssef farhan

Youtube & Video Data Extractor

atif176/youtube-video-data-extractor

This Actor scrapes YouTube search results using a keyword. It extracts video titles and video URLs using Playwright and saves the data in JSON format. Useful for YouTube research, SEO, and automation.

Muhammad Atif

Yelp Advanced Business Scraper: Pay Per Result

delicious_zebu/yelp-advanced-business-scraper-pay-per-result

Effortlessly scrape detailed restaurant data from Yelp, including ratings, reviews, amenities, and operating hours. Perfect for building robust datasets for market analysis, apps, or research projects.