MCP Screenshot — AI Agent Full-Page Capture Tool avatar

MCP Screenshot — AI Agent Full-Page Capture Tool

Pricing

Pay per usage

Go to Apify Store
MCP Screenshot — AI Agent Full-Page Capture Tool

MCP Screenshot — AI Agent Full-Page Capture Tool

MCP-compatible screenshot tool for AI agents. Capture full-page website screenshots programmatically. Returns base64 images for LLM vision analysis. Supports custom viewports and wait conditions. Build visual AI workflows. Custom pipeline: $100 pilot, spinov001@gmail.com

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Alex

Alex

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

MCP Screenshot - Capture Any Webpage as PNG for AI Agents & Visual Monitoring

Need your AI agent to "see" a webpage but LLMs only process text? Screenshots bridge the gap between visual web content and AI understanding. Manual screenshot workflows do not scale. What if your AI agent could capture any URL as a high-resolution PNG in seconds — viewport or full page — and get back a direct download link?

MCP Screenshot is an Apify actor that uses Playwright (Chromium) to render any webpage and capture it as a PNG image. Supports custom viewport dimensions (width/height), full-page scrollable captures, and stores the result in Apify Key-Value Store with a direct download URL. Built for AI vision pipelines, visual regression testing, change monitoring, and content archiving.

Features

  • Full Chromium rendering via Playwright — captures JavaScript-rendered pages, SPAs, and dynamic content
  • Viewport or full-page capture modes
  • Custom viewport dimensions (default: 1280x720)
  • Waits for networkidle + 2-second buffer to ensure complete page rendering
  • PNG output stored in Apify Key-Value Store with direct download URL
  • Returns file size, page title, and metadata alongside the screenshot
  • Headless execution — no GUI required
  • MCP-compatible JSON output with screenshot URL for downstream AI agents
  • Graceful error handling — returns error message instead of crashing on unreachable URLs

Output Data Example

{
"url": "https://stripe.com",
"title": "Stripe | Financial Infrastructure for the Internet",
"screenshotKey": "screenshot_1710842400000",
"screenshotUrl": "https://api.apify.com/v2/key-value-stores/abc123/records/screenshot_1710842400000",
"width": 1280,
"height": 720,
"fullPage": false,
"fileSize": 487231,
"scrapedAt": "2026-03-19T10:30:00.000Z"
}

Use Cases

  1. AI Vision Pipelines — Feed webpage screenshots into GPT-4V, Claude, or Gemini for visual analysis: layout review, design feedback, content extraction from images.
  2. Visual Regression Testing — Capture screenshots before and after deployments. Compare them to detect unintended UI changes across your web application.
  3. Website Change Monitoring — Schedule periodic screenshots of competitor landing pages, pricing pages, or product listings. Detect visual changes that text-based monitoring misses.
  4. Content Archiving — Create visual snapshots of web pages for legal compliance, evidence preservation, or historical records.
  5. Thumbnail Generation — Automatically generate preview thumbnails for link aggregators, dashboards, or content management systems.
  6. Social Media Previews — Capture how your webpage looks to create accurate social media preview images instead of relying on OG tags.
  7. Accessibility Audits — Screenshot pages at different viewport sizes to verify responsive design and mobile compatibility.

Input Parameters

ParameterTypeRequiredDefaultDescription
urlstringYesWebpage URL to capture (e.g., https://stripe.com)
fullPagebooleanNofalseSet to true to capture the entire scrollable page, not just the viewport
widthintegerNo1280Viewport width in pixels
heightintegerNo720Viewport height in pixels

How It Works

  1. Browser Launch — Starts a headless Chromium instance via Playwright.
  2. Page Navigation — Navigates to the target URL with networkidle wait strategy (waits until no more than 0 network connections for 500ms).
  3. Render Buffer — Waits an additional 2 seconds after network idle to allow late-loading elements (lazy images, animations, deferred scripts) to render.
  4. Screenshot Capture — Takes a PNG screenshot of the viewport (or full scrollable page if fullPage is true).
  5. Storage — Saves the PNG buffer to Apify Key-Value Store with a timestamped key.
  6. Output — Pushes metadata (URL, title, screenshot download URL, file size, dimensions) to the dataset.

Viewport Presets

PresetWidthHeightUse Case
Desktop HD19201080Standard desktop monitoring
Desktop1280720Default, good balance of speed and detail
Tablet7681024iPad-style responsive testing
Mobile375812iPhone-style mobile screenshots

Integration with AI Agents

// Capture a screenshot and pass to GPT-4V for analysis
const result = await apifyClient.actor("mcp-screenshot").call({
url: "https://competitor.com/pricing",
fullPage: true,
width: 1920,
height: 1080
});
const screenshotUrl = result.items[0].screenshotUrl;
// Pass screenshotUrl to your vision model

More Tools for AI Agents