Stagehand Crawler avatar

Stagehand Crawler

Pricing

from $20.00 / 1,000 results

Go to Apify Store
Stagehand Crawler

Stagehand Crawler

Stagehand is a browser automation framework used to control web browsers with natural language and code.

Pricing

from $20.00 / 1,000 results

Rating

0.0

(0)

Developer

Tin

Tin

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

2

Monthly active users

2 days ago

Last modified

Categories

Share

An Apify Actor that uses Stagehand — an AI-powered browser automation framework — to execute natural language instructions on web pages. Point it at any URL, describe what you want in plain English, and the AI agent will interact with the page and return results along with a final screenshot.

What it does

  1. Visits each URL in startUrls
  2. Runs an AI agent that follows your instruction using real browser interactions (clicks, typing, scrolling, navigation)
  3. Captures a screenshot of the final page state and saves it to the key-value store
  4. Pushes a result record to the dataset containing the outcome, a summary message, and the screenshot URL

Input

FieldTypeRequiredDescription
startUrlsarrayyesURLs the crawler will visit
instructionstringyesNatural language task for the AI agent (e.g. "Find the price of the first product")
modestringyesAgent mode: dom, hybrid, or cua
modelstringyesAI model in provider/model format (e.g. google/gemini-3-flash-preview)
apiKeystringyesAPI key for the selected model provider
proxyConfigobjectyesProxy settings — Apify Proxy recommended on the platform
maxStepsintegernoMax browser interactions before stopping (default: 20, range: 1–20)

Agent modes

  • dom — Reads the page DOM to determine actions. Fast and cost-efficient.
  • hybrid — Combines DOM and visual understanding. Best for complex layouts. Requires a vision-capable model (Gemini, Claude).
  • cua — Computer-use agent; operates purely from screenshots. Most powerful, highest cost.

Example input

{
"startUrls": [{ "url": "https://www.ebay.com/" }],
"instruction": "Search for 'mechanical keyboard' and return the title and price of the first result.",
"mode": "hybrid",
"model": "google/gemini-3-flash-preview",
"apiKey": "YOUR_GOOGLE_AI_API_KEY",
"maxSteps": 15,
"proxyConfig": { "useApifyProxy": true }
}

Output

Each processed URL produces one dataset record:

{
"url": "https://www.ebay.com/",
"success": true,
"message": "The first result is 'Keychron K2 Pro' priced at $89.99.",
"imageUrl": "https://api.apify.com/v2/key-value-stores/<storeId>/records/final_screenshot_<uuid>.png"
}

The imageUrl links directly to the PNG screenshot saved in the default key-value store.

Resources