Pricing

from $0.65 / actor start

Claude AI Web Automation

Drives a real browser with Anthropic's Claude models to navigate any website and extract structured data, with no CSS selectors or page-specific scraping code required.

Developer

Tin

Last modified

6 days ago

Claude AI Web Automation

Claude AI Web Automation is an Apify Actor that drives a real browser with Anthropic's Claude models to navigate any website and extract structured data — no CSS selectors or page-specific scraping code required. Give it a starting URL and a plain-English instruction (e.g. "Open Sporting Goods > Golf, sort by newly listed, and scrape the first 5 listings") and the agent handles the search, navigation, pagination, and detail-page extraction for you.

Try it directly from the Input tab — fill in a target site and a prompt, hit Start, and the dataset will fill up with clean JSON. Run it on the Apify platform to take advantage of API access, scheduling, integrations (Zapier, Make, Slack), automatic proxy rotation, and run monitoring.

What does Claude AI Web Automation do?

The Actor combines four AI building blocks into a single end-to-end web scraper:

Intent parsing — Claude reads your prompt and decides what action to perform, what fields to extract, and how many items you want.
Browser automation — A Stagehand agent powered by Claude takes screenshots, clicks, types, and scrolls until it reaches the search results.
Selector inference — Claude inspects the DOM of the results page and derives CSS selectors for every listing and the next-page link.
Structured extraction — Each detail page is opened and Claude pulls the fields you asked for into a Zod-validated record.

Everything runs serverless on the Apify platform — no local infrastructure required.

Why use Claude AI Web Automation?

No selectors, no glue code. Describe the task in English and the agent figures out the rest.
Works across sites. The same Actor scrapes eBay, Etsy, news portals, real-estate listings, or any other public site.
Resilient defaults. Residential proxy rotation, infinite-scroll handling, cross-page deduplication, and hard timeouts are all built in.
Cheap models, big results. Uses claude-haiku-4-5 for parsing/selector derivation, keeping per-run costs low.
Structured output. Each record is a typed JSON object you can pipe into a sheet, database, or downstream Actor.

How to use Claude AI Web Automation

1. Open the Actor page on Apify Console and click Try for free.
2. In the Input tab, paste a starting URL (e.g. https://www.ebay.com).
3. Write your prompt in plain English: what to navigate to and what to scrape.
4. (Optional) Set maxItems to control how many detail records you want, and maxSteps to bound the agent's reasoning loop.
5. (Optional) Provide explicit selectors and a uniqueKeyPattern to skip the LLM selector-inference step on sites you already know.
6. Click Start and watch the run log. When it finishes, switch to the Output tab or call the dataset API.

Input

Field	Type	Required	Description
`prompt`	string	✅	What to do on the site and what to extract. Include a count if you want (e.g. "the first 10 results").
`startUrl`	string	✅	The URL the browser opens before the agent runs.
`maxItems`	integer	—	Hard cap on detail records. Overrides any count parsed from the prompt. Default: `5`.
`maxSteps`	integer	—	Max reasoning steps for the pre-crawl agent. Default: `10`.
`selectors`	object	—	Optional `listingLink` and `nextPage` CSS selectors. If omitted, Claude derives them.
`uniqueKeyPattern`	string	—	Optional regex with one capture group for deduping listings (e.g. `/itm/(\d{9,})` for eBay).
`countryCode`	string	—	Two-letter ISO code for residential proxy geolocation. Default: `US`.
`loginCookies`	array	—	Cookies to inject before navigation, for authenticated scraping. Accepts Cookie-Editor / EditThisCookie exports or raw Playwright cookie objects.
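
To make the `uniqueKeyPattern` row concrete, here is a minimal Python sketch of deduplication by a single capture group. It is an illustration only, not the Actor's actual code: the function name `dedupe_by_unique_key`, the fallback to the full URL when the pattern does not match, and the sample URLs (including the query strings and the second item ID) are assumptions made for the example.

```python
import re


def dedupe_by_unique_key(urls, unique_key_pattern):
    """Keep the first URL seen for each key captured by group 1 of the pattern."""
    pattern = re.compile(unique_key_pattern)
    if pattern.groups != 1:
        raise ValueError("uniqueKeyPattern must contain exactly one capture group")

    seen = set()
    unique = []
    for url in urls:
        match = pattern.search(url)
        # Assumption: when the pattern does not match, the whole URL is the key.
        key = match.group(1) if match else url
        if key not in seen:
            seen.add(key)
            unique.append(url)
    return unique


# Hypothetical listing URLs: the first two point at the same eBay item.
urls = [
    "https://www.ebay.com/itm/186372216016?hash=abc",
    "https://www.ebay.com/itm/186372216016?hash=xyz",
    "https://www.ebay.com/itm/186372216017",
]
print(dedupe_by_unique_key(urls, r"/itm/(\d{9,})"))
```

Keying on the item ID rather than the full URL is what lets tracking parameters differ between pages without producing duplicate records.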

Example input

{
    "startUrl": "https://www.ebay.com",
    "prompt": "Open the category page Sporting Goods > Golf, then sort by newly listed and scrape the first 5 listings by going to the detail page to extract title, price, and shipping cost.",
    "maxItems": 5,
    "maxSteps": 10,
    "countryCode": "US",
    "selectors": {
        "listingLink": "a[href*='/itm/']",
        "nextPage": "a[aria-label='Next page'], a[rel='next']"
    },
    "uniqueKeyPattern": "/itm/(\\d{9,})"
}
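
If you build the input programmatically, a small helper can apply the documented defaults and catch missing required fields before a run is started. The Python sketch below is one way to do that under stated assumptions: `build_input` is a hypothetical helper, and the "positive integer" check on `maxItems` and `maxSteps` is an assumption rather than documented Actor behavior.

```python
import json

# Defaults and required fields as listed in the Input table.
DEFAULTS = {"maxItems": 5, "maxSteps": 10, "countryCode": "US"}
REQUIRED = ("prompt", "startUrl")


def build_input(user_input):
    """Merge user input over the documented defaults and run basic checks."""
    missing = [field for field in REQUIRED if not user_input.get(field)]
    if missing:
        raise ValueError("missing required field(s): " + ", ".join(missing))

    merged = {**DEFAULTS, **user_input}
    for field in ("maxItems", "maxSteps"):
        value = merged[field]
        # Assumption: both limits must be positive integers (bool is excluded).
        if isinstance(value, bool) or not isinstance(value, int) or value < 1:
            raise ValueError(f"{field} must be a positive integer")
    return merged


actor_input = build_input({
    "startUrl": "https://www.ebay.com",
    "prompt": "Scrape the first 5 Golf listings and extract title and price.",
})
print(json.dumps(actor_input, indent=4))
```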

Output

Each detail page produces one JSON record in the default dataset. The fields are derived from your prompt — if you ask for "title, price, and shipping cost", you get exactly those plus the source URL.

{
    "url": "https://www.ebay.com/itm/186372216016",
    "title": "Callaway Rogue ST Max Driver — 10.5° Stiff Flex",
    "price": "$249.99",
    "shippingCost": "Free shipping"
}

You can download the dataset in JSON, CSV, Excel, HTML, XML, or RSS formats from the Storage tab or via the Apify API.
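
For the API route, the Python sketch below builds the URL for the Apify API's dataset items endpoint in a chosen export format. It only constructs the URL and does not send a request. The helper name `dataset_items_url` and the dataset ID `abc123` are placeholders; the real dataset ID comes from your run, and a private dataset also needs your API token (for example in an `Authorization: Bearer` header).

```python
from urllib.parse import quote, urlencode

# Export formats named in this section; Excel is requested as "xlsx".
FORMATS = {"json", "csv", "xlsx", "html", "xml", "rss"}


def dataset_items_url(dataset_id, fmt="json", limit=None):
    """Build the dataset items URL for the given export format."""
    if fmt not in FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    params = {"format": fmt}
    if limit is not None:
        params["limit"] = limit
    dataset_path = quote(dataset_id, safe="")
    return f"https://api.apify.com/v2/datasets/{dataset_path}/items?{urlencode(params)}"


# "abc123" is a placeholder dataset ID.
print(dataset_items_url("abc123", "csv", limit=5))
```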

Data fields

Field	Type	Description
`url`	string	Canonical URL of the detail page.
(prompt-defined fields)	string / number / boolean	Whatever you asked the agent to extract. Currency-like fields (price, shipping, fee) are always returned as strings to preserve symbols and free-form text such as "Free shipping".
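
Because currency-like fields arrive as strings, downstream code that needs numbers has to parse them. The Python sketch below shows one possible approach, assuming US-style formatting (comma as thousands separator, dot as decimal point). `parse_amount` is a hypothetical helper, not part of the Actor, and it returns `None` for free-form text such as "Free shipping" instead of guessing a value.

```python
import re
from decimal import Decimal

# Matches "1,299.00" style amounts first, then plain "249.99" style amounts.
_AMOUNT = re.compile(r"\d{1,3}(?:,\d{3})+(?:\.\d+)?|\d+(?:\.\d+)?")


def parse_amount(text):
    """Return the first numeric amount in text as a Decimal, or None if absent."""
    match = _AMOUNT.search(text)
    if match is None:
        return None
    return Decimal(match.group(0).replace(",", ""))


record = {"price": "$249.99", "shippingCost": "Free shipping"}
print(parse_amount(record["price"]))
print(parse_amount(record["shippingCost"]))
```

Using `Decimal` rather than `float` avoids rounding surprises when prices are later summed or compared.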

Cost estimation

The Actor charges only the standard Apify platform costs (compute units + residential proxy traffic) plus a small Anthropic API spend. A typical run that scrapes 5 detail records on a clean site costs roughly:

Compute: ~0.02–0.05 CU (a few cents on a paid plan).
Proxy traffic: a few MB of residential bandwidth.
Anthropic tokens: roughly $0.01–$0.03 per run with claude-haiku-4-5.

Apify's free tier covers many runs per month. Use maxItems and maxSteps to bound costs on slow or anti-bot-protected sites.
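
The three cost components can be added up with simple arithmetic. The Python sketch below does that with every rate passed in explicitly. The unit prices in the example call (`usd_per_cu=0.40`, `usd_per_proxy_gb=8.0`) are placeholders chosen for illustration, not quoted Apify prices, so substitute the rates from your own plan. The helper name `estimate_run_cost` is also hypothetical.

```python
def estimate_run_cost(compute_units, proxy_mb, anthropic_usd,
                      usd_per_cu, usd_per_proxy_gb):
    """Sum compute, proxy, and Anthropic spend for one run, in USD."""
    compute_cost = compute_units * usd_per_cu
    # Treats 1 GB as 1000 MB; adjust if your plan bills differently.
    proxy_cost = (proxy_mb / 1000) * usd_per_proxy_gb
    return round(compute_cost + proxy_cost + anthropic_usd, 4)


# Upper end of the ranges above, with placeholder unit prices.
cost = estimate_run_cost(
    compute_units=0.05,
    proxy_mb=5,
    anthropic_usd=0.03,
    usd_per_cu=0.40,        # placeholder rate, not an Apify quote
    usd_per_proxy_gb=8.0,   # placeholder rate, not an Apify quote
)
print(f"Estimated cost per run: ${cost:.4f}")
```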

Tips and advanced options

Be specific in the prompt. "Extract the title, price in USD, and number of reviews" beats "Extract product info".
Pre-supply selectors for sites you scrape often — it skips an LLM call and makes runs faster and more deterministic.
Bump maxSteps for multi-page flows (search → filter → sort → results often needs 8–12 steps).
Lower maxSteps to fail fast when debugging a new site.
Pin a countryCode that matches the target site's primary market — pricing, currency, and availability often depend on geo.
Anti-bot pages. Some sites (eBay, Amazon, Cloudflare-protected portals) occasionally serve CAPTCHAs to residential proxies. Retry the run or switch country code if you see a challenge page in the logs.
Authenticated scraping. Sign in to the target site in your normal browser, export the cookies with the Cookie-Editor extension, and paste the JSON array into loginCookies. The Actor runs at concurrency 1 so the session is reused across all requests and is less likely to be flagged.
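
For authenticated scraping, it can help to see how a browser-extension cookie export maps onto a Playwright-style cookie object. The Python sketch below is an illustration under assumptions: the export field names (`expirationDate`, `session`, `sameSite` values such as `no_restriction`) reflect how Cookie-Editor exports typically look, the helper `to_playwright_cookie` is hypothetical, and the Actor's own conversion may differ. The sample cookie is made up.

```python
# Maps extension-style sameSite values to the capitalized Playwright form.
SAME_SITE = {"strict": "Strict", "lax": "Lax", "no_restriction": "None", "none": "None"}


def to_playwright_cookie(exported):
    """Convert one exported cookie dict into a Playwright-style cookie dict."""
    cookie = {
        "name": exported["name"],
        "value": exported["value"],
        "domain": exported["domain"],
        "path": exported.get("path", "/"),
        "httpOnly": bool(exported.get("httpOnly", False)),
        "secure": bool(exported.get("secure", False)),
    }
    # Session cookies carry no expiry; otherwise use Unix seconds.
    if not exported.get("session") and "expirationDate" in exported:
        cookie["expires"] = int(exported["expirationDate"])
    raw_same_site = exported.get("sameSite")
    if isinstance(raw_same_site, str) and raw_same_site.lower() in SAME_SITE:
        cookie["sameSite"] = SAME_SITE[raw_same_site.lower()]
    return cookie


# Made-up cookie in the assumed export shape.
exported_cookie = {
    "name": "session_id",
    "value": "example-value",
    "domain": ".example.com",
    "path": "/",
    "secure": True,
    "httpOnly": True,
    "sameSite": "no_restriction",
    "expirationDate": 1893456000.5,
    "session": False,
}
print(to_playwright_cookie(exported_cookie))
```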

FAQ and support

Is web scraping legal? Scraping publicly available data is generally legal, but you must respect each site's Terms of Service and robots.txt and avoid extracting personal data without a lawful basis. You are responsible for how you use the output.

What model does it use? claude-haiku-4-5 for prompt parsing, selector derivation, and the Stagehand browser agent. You can change the model by editing src/main.js.

Can I run it on my laptop? Yes — clone the repo, run npm install, set ANTHROPIC_API_KEY in .env, and run apify run. Local runs use a local browser (no Apify proxy unless you pass credentials).

Where do I report issues? Use the Issues tab on the Actor page. For custom scraping projects, contact dtrungtin@gmail.com.
