Universal Web Scraper API

Scrape any URL through a global, anti-bot-resistant Scraper API. JS rendering, geo-targeting, residential proxies, CSS-selector extraction, and optional LLM processing.

Pricing: from $1.00 / 1,000 results
Rating: 0.0 (0)
Developer: youssef farhan
Maintained by Community
Bookmarked: 0
Total users: 1
Monthly active users: 0
Last modified: 10 hours ago

Universal Web Scraper API — Extract Any URL With Anti-Bot Proxies + AI

A universal web scraper API that turns any URL into clean HTML, plain text, and structured JSON, using global anti-bot proxies in 46 countries. Built for developers, researchers, growth teams, and AI agents that need reliable web data without managing proxies or browsers. Add CSS selectors or an LLM prompt and get structured output in one run.

What you get

Every URL produces one dataset record with these fields:

  • url — the page that was scraped.
  • statusCode — HTTP status returned by the target.
  • ok — true when the status was 2xx.
  • error — error message for failed URLs (null on success).
  • scrapedAt — ISO 8601 UTC timestamp of the scrape.
  • html — full raw HTML (toggle off with storeHtml).
  • text — readable text with scripts, styles, and markup stripped.
  • textLength — character count of text.
  • extractedFields — your CSS-selector fields as a key/value object.
  • llm — optional AI result: output (text or parsed JSON), model, usage.
  • options — echo of render, geoCode, and superProxy used per request.
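
To make the text and textLength fields concrete, here is a minimal sketch of how readable text could be derived from raw HTML with scripts and styles dropped. It uses only Python's standard html.parser module. The names TextExtractor and html_to_text are hypothetical, and this is not necessarily how the Actor computes the field.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and ignores anything inside <script> or <style>."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self) -> None:
        super().__init__()
        self._skip_depth = 0          # > 0 while inside a skipped tag
        self._chunks: list[str] = []  # visible text fragments, in order

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        stripped = data.strip()
        if self._skip_depth == 0 and stripped:
            self._chunks.append(stripped)

    def text(self) -> str:
        return " ".join(self._chunks)


def html_to_text(html: str) -> str:
    """Hypothetical helper: approximate the `text` field from the `html` field."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


page = (
    "<html><head><style>p{color:red}</style><script>var x = 1;</script></head>"
    "<body><h1>Quotes to Scrape</h1><p>Hello</p></body></html>"
)
text = html_to_text(page)
text_length = len(text)  # analogous to the textLength field
print(text, text_length)
```

A character count taken this way depends on how whitespace is normalized, so it may not match the Actor's textLength exactly.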

Sample output

{
  "url": "https://quotes.toscrape.com",
  "statusCode": 200,
  "ok": true,
  "html": "<!DOCTYPE html><html>...</html>",
  "text": "Quotes to Scrape. The world as we have created it is a process of our thinking.",
  "textLength": 4213,
  "extractedFields": {
    "title": "Quotes to Scrape",
    "firstQuote": "The world as we have created it is a process of our thinking.",
    "firstAuthor": "Albert Einstein"
  },
  "llm": {
    "model": "google/gemini-3.1-flash-lite",
    "output": { "topic": "inspirational quotes", "itemCount": 10 },
    "usage": { "prompt_tokens": 1820, "completion_tokens": 24, "total_tokens": 1844 }
  },
  "options": { "render": true, "geoCode": "us", "superProxy": false },
  "scrapedAt": "2026-06-25T12:00:00.000000+00:00",
  "error": null
}
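
As an illustration of how records shaped like the sample above might be consumed downstream, the sketch below separates successful records from failed ones using the ok and error fields. The helper name split_records is hypothetical, and the two records are hand-written samples modelled on the output format, not real run data.

```python
from typing import Any

Record = dict[str, Any]


def split_records(records: list[Record]) -> tuple[list[Record], list[Record]]:
    """Hypothetical helper: split dataset records by their `ok` flag."""
    succeeded = [r for r in records if r.get("ok")]
    failed = [r for r in records if not r.get("ok")]
    return succeeded, failed


# Hand-written sample records (illustrative only).
sample = [
    {"url": "https://quotes.toscrape.com", "statusCode": 200, "ok": True,
     "extractedFields": {"title": "Quotes to Scrape"}, "error": None},
    {"url": "https://example.com/missing", "statusCode": 404, "ok": False,
     "extractedFields": {}, "error": "HTTP 404"},
]

succeeded, failed = split_records(sample)
for record in succeeded:
    print(record["url"], record["extractedFields"].get("title"))
for record in failed:
    print("failed:", record["url"], record["error"])
```

Failed URLs still produce a record, so a check like this can feed a retry list without re-running the URLs that already succeeded.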

Use cases

  • ✓ Data engineers feeding clean page text and JSON into RAG and LLM pipelines.
  • ✓ Price researchers monitoring competitor product pages across multiple countries.
  • ✓ Growth and lead-gen teams pulling structured data from directories at scale.
  • ✓ AI agent builders calling this web scraper API over MCP for live web data.
  • ✓ SEO and content teams auditing pages behind anti-bot protection.
  • ✓ Analysts extracting named fields from any site with simple CSS selectors.

Pricing

This Actor is pay-per-result — you only pay for records pushed to the dataset.

Event | Price per 1,000 | Real example
Scraped result (one URL) | [PLACEHOLDER: $X.XX per 1,000 results] | Scraping 10,000 URLs ≈ [PLACEHOLDER: $XX]

Free tier: your first [PLACEHOLDER: N] results are free, with no credit card needed to try it. The Actor also requires your own Scraper API key; the scraping provider offers 1,000 free monthly credits to cover the underlying requests.
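
The pay-per-result arithmetic can be sketched as follows. The per-1,000 price and the free-tier size are still placeholders in this section, so the numbers here are assumptions: $1.00 per 1,000 results is taken from the listing's "from $1.00 / 1,000 results" figure and may not be the final price, and the free allowance is set to zero. The function name estimate_cost is hypothetical.

```python
def estimate_cost(results: int, price_per_1000: float, free_results: int = 0) -> float:
    """Estimate the Actor charge in dollars for a number of dataset records.

    Covers only the pay-per-result fee; scraping-provider credits and any
    LLM usage are billed separately and are not included.
    """
    billable = max(0, results - free_results)
    return round(billable * price_per_1000 / 1000, 2)


# Assumed price: $1.00 per 1,000 results (the listing's "from" price).
ASSUMED_PRICE_PER_1000 = 1.00

cost_10k = estimate_cost(10_000, ASSUMED_PRICE_PER_1000)
print(f"10,000 URLs at the assumed price: ${cost_10k:.2f}")
```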

How it works

  • Input: a list of startUrls plus your Scraper API token.
  • Fetch: each URL is routed through rotating anti-bot proxies, with optional JavaScript rendering and geo-targeting.
  • Process: apply CSS extractRules and/or an LLM prompt to each page.
  • Output: structured records land in the Apify dataset (export as JSON, CSV, Excel, or XML).
  • Automate: run on a schedule and trigger webhooks on finish — built into the Apify platform.

Why this web scraper API

  • Use your own key: pay the scraping provider directly, with 1,000 free monthly credits to start.
  • Global anti-bot coverage: residential/premium proxies and geo-targeting across 46 countries.
  • Built-in AI: summarize, classify, or extract JSON via OpenRouter — pay with Apify credits or your own OpenRouter key.
  • Resumable, migration-safe runs: state is persisted automatically, so a migrated run resumes without re-scraping or duplicating data.
  • Auto-scaled concurrency: parallelism is sized to a percentage of your live plan limit, which keeps runs below your provider's rate limits.
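
The auto-scaled concurrency idea can be sketched as a simple calculation. This is an assumption about the general approach, not the Actor's actual code: the function name concurrency_for, the rounding rule, and the example plan limit are all hypothetical.

```python
import math


def concurrency_for(plan_limit: int, percentage: float) -> int:
    """Size parallelism as a percentage of the plan's concurrent-request limit.

    Rounds down so the result never exceeds the limit, and always allows
    at least one request in flight.
    """
    if plan_limit < 1:
        raise ValueError("plan_limit must be at least 1")
    if not 0 < percentage <= 100:
        raise ValueError("percentage must be in the range (0, 100]")
    return max(1, math.floor(plan_limit * percentage / 100))


# Hypothetical plan limit of 32 concurrent requests, using half of it.
workers = concurrency_for(plan_limit=32, percentage=50)
print(workers)
```

Rounding down trades a little throughput for headroom, which matters when other jobs share the same provider key.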

Input example

{
"apiKey": "YOUR_API_TOKEN",
"startUrls": [{ "url": "https://quotes.toscrape.com" }],
"render": true,
"geoCode": "us",
"extractRules": { "title": "h1", "firstQuote": ".quote .text" }
}

Omit extractRules and llm* fields to just return clean HTML and text. Set concurrencyPercentage to control speed.
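
For callers who assemble the input in code, a minimal sketch of a builder is shown below. The field names (apiKey, startUrls, render, geoCode, extractRules, concurrencyPercentage) come from this page; the function name build_input, the defaults, and the example percentage value are assumptions.

```python
import json
from typing import Optional


def build_input(
    api_key: str,
    urls: list[str],
    *,
    render: bool = False,
    geo_code: Optional[str] = None,
    extract_rules: Optional[dict[str, str]] = None,
    concurrency_percentage: Optional[int] = None,
) -> dict:
    """Hypothetical helper: build an Actor input object, omitting unset options."""
    if not urls:
        raise ValueError("at least one URL is required")
    actor_input: dict = {
        "apiKey": api_key,
        "startUrls": [{"url": url} for url in urls],
        "render": render,
    }
    if geo_code:
        actor_input["geoCode"] = geo_code
    if extract_rules:
        # Map of output field name -> CSS selector.
        actor_input["extractRules"] = dict(extract_rules)
    if concurrency_percentage is not None:
        actor_input["concurrencyPercentage"] = concurrency_percentage
    return actor_input


actor_input = build_input(
    "YOUR_API_TOKEN",
    ["https://quotes.toscrape.com"],
    render=True,
    geo_code="us",
    extract_rules={"title": "h1", "firstQuote": ".quote .text"},
)
print(json.dumps(actor_input, indent=2))
```

Leaving extract_rules unset produces an input with no extraction fields, which corresponds to the HTML-and-text-only mode described above.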

FAQ

Does it handle anti-bot protection and proxies? Yes. Every request is routed through a rotating proxy network, with optional residential/premium proxies and per-country geo-targeting.

What output formats are supported? The dataset exports to JSON, CSV, Excel, and XML, or via the Apify API.
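
As a sketch of the API route, the Apify dataset items endpoint accepts a format query parameter. The helper below only builds the download URL; the function name dataset_items_url and the dataset ID are placeholders, and authentication (for example an Authorization header carrying your Apify token) is left to the caller.

```python
from urllib.parse import urlencode

# Formats named on this page; "xlsx" is the API value for Excel.
EXPORT_FORMATS = {"json", "csv", "xlsx", "xml"}


def dataset_items_url(dataset_id: str, fmt: str = "json") -> str:
    """Hypothetical helper: build the URL for downloading dataset items."""
    if fmt not in EXPORT_FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    query = urlencode({"format": fmt})
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"


# "abc123" is a placeholder dataset ID, not a real dataset.
csv_url = dataset_items_url("abc123", "csv")
print(csv_url)
```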

Can I scrape one URL or many? Both. Pass a single URL or thousands in startUrls; they are scraped concurrently.

Does it support scheduling and webhooks? Yes — use Apify schedules to run it automatically and webhooks to push results downstream.

Is the data live or cached? Live. Each run fetches the current page in real time.

How do I extract specific fields? Provide extractRules as a { "fieldName": "css selector" } map, or set an llmPrompt for AI extraction.

Can AI agents call it? Yes — it's available via the Apify REST API and as an MCP server for Claude, ChatGPT, and Cursor (see below).

Can I request custom fields or a different site? Yes — see the custom scraper section at the bottom.

Use via API or MCP

Run it programmatically via the Apify REST API:

POST https://api.apify.com/v2/acts/fayoussef~universal-scraper-api/runs?token=YOUR_TOKEN
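
A minimal sketch of that call using only Python's standard library is shown below. It sends the Apify token in an Authorization header instead of the query string, which keeps the token out of logged URLs; both forms are accepted by the Apify API. The function names build_run_request and start_run are hypothetical, and the response handling assumes the usual Apify shape of a JSON object with a data field.

```python
import json
import urllib.request

ACTOR_ID = "fayoussef~universal-scraper-api"


def build_run_request(apify_token: str, actor_input: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request that starts an Actor run."""
    url = f"https://api.apify.com/v2/acts/{ACTOR_ID}/runs"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {apify_token}",
        },
        method="POST",
    )


def start_run(apify_token: str, actor_input: dict) -> dict:
    """Send the request and return the parsed JSON response (network call)."""
    request = build_run_request(apify_token, actor_input)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the request is side-effect free; start_run() performs the real call.
request = build_run_request(
    "YOUR_TOKEN",
    {
        "apiKey": "YOUR_API_TOKEN",
        "startUrls": [{"url": "https://quotes.toscrape.com"}],
    },
)
print(request.get_method(), request.full_url)
```

Starting a run returns before the scrape finishes, so results are read from the run's dataset once the run has completed.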

Or connect it as an MCP server so AI agents can call it directly:

https://mcp.apify.com/actors/fayoussef~universal-scraper-api

Need a custom scraper?

Need different fields, a specific site, or a fully managed pipeline? Visit automationbyexperts.
