
Stealth Web Scraper

Pricing: from $3.50 / 1,000 successful pages

Get rendered HTML, plain text, and extracted fields from Cloudflare-protected and JavaScript-heavy pages without building your own browser-and-proxy stack.


Developer: kane liu (Maintained by Community)


Scrape websites that block other scrapers. Get the full page content from Cloudflare-protected, anti-bot, and JavaScript-heavy sites: no browser setup, no proxy config, no code required.

Works on sites like Clutch, G2, Capterra, Trustpilot, Cloudflare-protected storefronts, and many more that return "access denied" to standard scrapers.


Who is this for?

  • ๐Ÿ›๏ธ E-commerce monitors โ€” track competitor prices on Cloudflare-protected shops
  • ๐ŸŽฏ Lead generators โ€” pull company listings from directories like Clutch, Yelp, G2
  • ๐Ÿ“Š SEO & market researchers โ€” collect reviews, ratings, and content from protected sources
  • ๐Ÿค– AI agent builders โ€” feed rendered page content into your automation workflows
  • ๐Ÿ’ผ Competitive analysts โ€” watch competitor landing pages and content changes
  • ๐Ÿ“ฐ Content researchers โ€” gather articles and listings from JavaScript-heavy sites

If your scraper keeps getting "403 Forbidden" or just an empty shell of a page, this Actor is built for you.


What you can do with it

1. Monitor competitor pricing on protected shops

In plain English: give the Actor a list of product page URLs → get back a clean table with product name, price, and stock status, ready to drop into Excel or an alert system.

You give:

| Field | What to enter |
| --- | --- |
| URLs | List of competitor product page links (one per row) |
| Fields | `productName`, `price`, `availability` |

You get back (table you can download as Excel / CSV / JSON):

| Product Name | Price | Availability |
| --- | --- | --- |
| Example Product | $49.00 | In stock |
| Another Product | $29.00 | Out of stock |
| ... | ... | ... |
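In input terms, the monitoring job above is just `urls` plus `extractSelectors` (both described in the Input reference at the bottom of this page). A minimal sketch; the URLs and CSS selectors here are hypothetical placeholders to swap for your real targets:

```python
# Sketch of a price-monitoring run input. The URLs and selectors are
# made-up examples -- inspect the target pages to find the real ones.
import json

run_input = {
    "urls": [
        "https://shop.example.com/product/widget-a",
        "https://shop.example.com/product/widget-b",
    ],
    "extractSelectors": {
        "productName": "h1.product-title",
        "price": ".price",
        "availability": ".stock-status",
    },
    "outputFormat": "text",
}

print(json.dumps(run_input, indent=2))
```

Paste the resulting JSON straight into the Actor's input editor, or pass it as the run input via the API.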

2. Pull company listings from directories (Clutch, G2, Capterra)

In plain English: point the Actor at a directory category page → get a list of provider cards with company names, locations, ratings, and review counts.

You give:

| Field | What to enter |
| --- | --- |
| URLs | Category page URLs (e.g. Clutch digital marketing) |
| Wait for | `[data-testid='provider-card']` (optional) |

You get back (one row per provider):

| Company | Location | Rating | Reviews | Category |
| --- | --- | --- | --- | --- |
| Agency One | New York | 4.9 | 42 | Digital Marketing |
| Agency Two | Los Angeles | 4.7 | 31 | Digital Marketing |
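Directory pages typically render their cards client-side, so this use case pairs `waitForSelector` with `extractSelectors` (both from the Input reference below). A sketch; the URL is illustrative and the field selectors are hypothetical, so confirm them in your browser's inspector first:

```python
# Hypothetical directory-scrape input. waitForSelector uses the
# provider-card selector shown above; the extractSelectors values
# are placeholders to verify against the live page.
run_input = {
    "urls": ["https://clutch.co/agencies/digital-marketing"],
    "waitForSelector": "[data-testid='provider-card']",
    "extractSelectors": {
        "company": ".provider-name",
        "location": ".provider-location",
        "rating": ".provider-rating",
    },
    "pageTimeout": 120,  # protected pages can be slow to clear checks
}
```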

3. Extract reviews from Trustpilot / G2 / Capterra

In plain English: provide the review page URL → get a clean list of reviews you can feed into sentiment analysis or a spreadsheet.

You get back:

| Reviewer | Rating | Date | Review |
| --- | --- | --- | --- |
| John D. | 5 | 2026-03-15 | Great product, fast shipping... |
| Sarah K. | 2 | 2026-02-28 | Had issues with the packaging... |

4. Feed AI agents with rendered page content

In plain English: your AI agent needs the actual visible content of a page (not a 403 page) → pass the URL, get back readable plain text and HTML. Works natively with LangChain, Make, n8n, and Zapier.

Plug the Actor output straight into:

  • Your prompt as context
  • A vector database for RAG
  • A custom summarization pipeline
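Stitching the output into prompt context needs no framework at all. A minimal sketch with made-up sample items whose field names follow the "What you get back" section of this page:

```python
# Turn Actor output items into a single prompt-context string.
# These items are illustrative samples, not real Actor output.
items = [
    {"url": "https://example.com/a", "title": "Page A",
     "text": "Rendered text of page A..."},
    {"url": "https://example.com/b", "title": "Page B",
     "text": "Rendered text of page B..."},
]

def to_context(items, max_chars=4000):
    """Concatenate page texts, capped so the prompt stays within budget."""
    parts = [f"## {item['title']} ({item['url']})\n{item['text']}"
             for item in items]
    return "\n\n".join(parts)[:max_chars]

context = to_context(items)
```

The same string can be chunked and embedded for a vector store instead of being passed directly to the prompt.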

5. Watch for content changes

In plain English: run the same URLs on a schedule (daily / weekly) → Actor returns a timestamp and quality signal for each page, so you can diff changes over time.

Useful for tracking:

  • Competitor landing page updates
  • Pricing page changes
  • Legal / Terms of Service updates
  • Product launch announcements
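One simple way to diff scheduled runs is to hash each page's text and compare against the previous run. A sketch with sample items standing in for Actor output (field names follow "What you get back" below):

```python
# Detect changed pages between two scheduled runs by fingerprinting
# each page's text. The items here are illustrative samples.
import hashlib

def fingerprint(items):
    """Map each URL to a SHA-256 hash of its text content."""
    return {i["url"]: hashlib.sha256(i["text"].encode()).hexdigest()
            for i in items}

previous = fingerprint([{"url": "https://example.com/pricing",
                         "text": "Pro plan: $49/mo"}])
current = fingerprint([{"url": "https://example.com/pricing",
                        "text": "Pro plan: $59/mo"}])

changed = [url for url in current if current[url] != previous.get(url)]
print(changed)  # ['https://example.com/pricing']
```

Store the fingerprint dictionary in a key-value store between runs and alert whenever `changed` is non-empty.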

How to use (no code required)

  1. Click "Try for Free" at the top of this page
  2. Paste your list of URLs (one per line)
  3. (Optional) Add CSS selectors if you want specific fields like price or title
  4. (Optional) Set waitForSelector if the page has content that loads dynamically
  5. Click Start; results appear in the Dataset tab within seconds

Download your results as CSV, Excel, or JSON. That's it.

The free $5 monthly Apify credit gets you around 1,400 successful pages.


What you get back

Every successfully scraped page returns:

  • Page title โ€” the <title> of the page
  • Text content โ€” clean plain text, ready for analysis
  • HTML โ€” full rendered HTML (if you need it)
  • Extracted fields โ€” whatever you asked for with CSS selectors
  • Quality signal โ€” full, partial, minimal, or blocked, so you know what you got
  • Timestamp โ€” when the page was scraped

Two separate datasets:

  • ✅ Successful pages go to the main dataset (and count toward billing)
  • ❌ Failed or blocked pages go to a failures dataset (never billed)

You always know exactly what you paid for.
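Downstream code can branch on the quality signal rather than treating every item alike. A minimal sketch with illustrative sample items:

```python
# Filter Actor items by the quality signal described above.
# The items here are made-up samples, not real output.
items = [
    {"url": "https://example.com/a", "quality": "full"},
    {"url": "https://example.com/b", "quality": "partial"},
    {"url": "https://example.com/c", "quality": "minimal"},
]

full_pages = [i["url"] for i in items if i["quality"] == "full"]
acceptable = [i["url"] for i in items
              if i["quality"] in ("full", "partial")]
```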


Pricing

Pay only for successful pages. Blocked, failed, or incomplete pages are free.

| Volume | Price |
| --- | --- |
| 100 successful pages | ~$0.35 |
| 1,000 successful pages | ~$3.50 |
| 10,000 successful pages | ~$35.00 |

How this compares:

  • Building your own stealth scraper: 20+ hours of dev work, ongoing maintenance
  • Bright Data / Zyte scraping API: $500+/month subscription
  • This Actor: pay only when you scrape, no subscription

Example: Scrape 500 competitor product pages once a week = about $7/month. Scrape 50 Clutch pages once = about $0.18.

The $5 free monthly Apify credit covers around 1,400 successful pages โ€” enough to test whether this fits your workflow before you spend anything.

Apify platform compute/memory is billed separately by Apify, typically pennies per run for small jobs.
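The per-page rate makes cost estimates one line of arithmetic. A sketch using the listed $3.50 per 1,000 successful pages (Apify compute, billed separately, is not included):

```python
# Back-of-the-envelope cost check at the listed rate of
# $3.50 per 1,000 successful pages.
RATE_PER_1000 = 3.50

def monthly_cost(pages_per_run, runs_per_month):
    """Estimated Actor cost in USD for a recurring job."""
    return pages_per_run * runs_per_month * RATE_PER_1000 / 1000

# 500 product pages weekly, matching the example above:
print(round(monthly_cost(500, 4), 2))  # 7.0
```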


Connect to your tools

Use this Actor from your existing stack; no coding needed:

| Platform | How to connect |
| --- | --- |
| Make.com | Search "Apify" → "Run Actor" → Actor ID: `lentic_clockss/stealth-web-scraper` |
| n8n | Add Apify node → "Run Actor" action → same Actor ID |
| Zapier | Apify integration → "Run Actor" trigger |
| LangChain | `ApifyActorsTool("lentic_clockss/stealth-web-scraper")` |
| Python / Node.js | Apify SDK or direct HTTPS call |

API call example

```shell
curl "https://api.apify.com/v2/acts/lentic_clockss~stealth-web-scraper/runs" \
  -X POST \
  -H "Authorization: Bearer YOUR_APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls": ["https://www.clutch.co/it-services"], "outputFormat": "text"}'
```

Results come back in JSON via the Apify Dataset API:

```
GET https://api.apify.com/v2/datasets/{datasetId}/items?format=json
```
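For a quick script, the endpoint above can be called with only the standard library. A sketch; `DATASET_ID` and the token are placeholders, and the item fields follow "What you get back" above:

```python
# Fetch and decode dataset items via the Apify Dataset API endpoint
# shown above, with the token passed as a query parameter.
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"

def items_url(dataset_id, token):
    """Build the dataset-items URL shown above."""
    return f"{API_BASE}/datasets/{dataset_id}/items?format=json&token={token}"

def fetch_items(dataset_id, token):
    """Fetch and JSON-decode the dataset items (requires network access)."""
    with urllib.request.urlopen(items_url(dataset_id, token)) as resp:
        return json.load(resp)

# Usage (placeholder IDs):
# for item in fetch_items("DATASET_ID", "YOUR_APIFY_TOKEN"):
#     print(item["url"], item.get("quality"))
```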

When to use something else

This Actor is great for public content on protected sites. It's NOT the right tool for:

| If you need... | Use this instead |
| --- | --- |
| Login-only pages (your account dashboard) | A custom Actor with session handling |
| Long sessions with complex interactions | Apify's Web Scraper or a custom Actor |
| Guaranteed success on every request | No tool can promise this; websites change |
| Simple non-protected websites | Apify's cheaper Web Scraper handles these fine |

FAQ

Q: What counts as a successful page? A: A page is successful when it returns status 200, isn't blocked, and (if you set waitForSelector) the element appeared. Only successful pages are billed.

Q: What happens when a page is blocked? A: It goes to the separate failures dataset with the error details. You're not charged for blocked pages.

Q: Do I need my own proxies? A: No. The Actor has built-in proxy support. RESIDENTIAL is the default and works best on heavily protected sites. You can also bring your own proxy list if you prefer.

Q: Can I extract specific fields like prices or titles? A: Yes. Pass extractSelectors with CSS selectors. If the selector matches one element you get a string; if it matches several, you get a list.
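Because of that string-or-list behavior, a small normalizer keeps downstream code uniform. A sketch, not part of the Actor itself:

```python
# Normalize extractSelectors results: a single match comes back as a
# string, multiple matches as a list -- wrap everything into a list.
def as_list(value):
    """Return selector results as a list regardless of match count."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]

assert as_list("$49.00") == ["$49.00"]
assert as_list(["a", "b"]) == ["a", "b"]
```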

Q: Will this work on LinkedIn / Instagram / Facebook? A: No โ€” those require login sessions. This Actor is for public pages that just happen to be protected by anti-bot systems.

Q: How is this different from Apify's Web Scraper? A: Apify's Web Scraper handles standard sites. This Actor is specifically built for pages blocked by Cloudflare, Akamai, PerimeterX, and similar anti-bot systems. Use the standard Web Scraper for easier targets to save money.

Q: How do I know if my target site needs this Actor? A: Try Apify's standard Web Scraper first. If you get 403 errors or an empty page, switch to this one.


Input reference

For developers who want full control:

| Parameter | Type | Description |
| --- | --- | --- |
| `urls` | array | List of URLs to scrape (required) |
| `extractSelectors` | object | CSS selectors for specific fields, e.g. `{"title": "h1", "price": ".price"}` |
| `outputFormat` | string | `html`, `text`, or `both` (default: `both`) |
| `waitForSelector` | string | CSS selector that must appear before extraction completes |
| `maxConcurrency` | integer | Parallel pages, 1-5 (default: 1) |
| `pageTimeout` | integer | Page load timeout in seconds, 30-300 (default: 90) |
| `proxyGroup` | string | `auto` (datacenter, cheapest), `RESIDENTIAL` (recommended for protected sites), or `BUYPROXIES94952` |

Full output schema is available in the Dataset tab.
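The ranges and defaults in the table above can be checked client-side before starting a run. A sketch that mirrors the table; it is a convenience helper, not part of the Actor:

```python
# Validate a run input against the parameter table above before
# launching a run; returns a list of human-readable problems.
def validate_input(run_input):
    errors = []
    if not run_input.get("urls"):
        errors.append("urls is required and must be non-empty")
    fmt = run_input.get("outputFormat", "both")
    if fmt not in ("html", "text", "both"):
        errors.append(f"outputFormat must be html, text, or both, got {fmt!r}")
    conc = run_input.get("maxConcurrency", 1)
    if not 1 <= conc <= 5:
        errors.append("maxConcurrency must be between 1 and 5")
    timeout = run_input.get("pageTimeout", 90)
    if not 30 <= timeout <= 300:
        errors.append("pageTimeout must be between 30 and 300 seconds")
    return errors

assert validate_input({"urls": ["https://example.com"]}) == []
assert validate_input({"maxConcurrency": 9}) != []
```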


→ Browse all Actors: apify.com/lentic_clockss