Under maintenance

Pricing

Pay per usage

Try for free

Go to Apify Store

Playwright Mcp

Under maintenance

Try for free

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Dilip S Chakravarthi

Actor stats

Bookmarked

Total users

Monthly active users

3 months ago

Last modified

Video-to-Playwright Automation Actor

Transform screen recordings into executable Playwright automation scripts using Google Gemini AI vision capabilities.

🎯 What It Does

This actor analyzes video recordings of user interactions with websites and automatically generates Python Playwright scripts that replicate those actions. Perfect for:

QA Testing: Record manual tests once, replay them automatically
Web Scraping: Show the actor how to navigate a site, get a working script
RPA Automation: Convert manual workflows into automated browser tasks
Documentation: Generate script documentation from video demos

🚀 How It Works

Upload Video: Place your screen recording (mp4, avi, mov, webm) in the videos directory
AI Analysis: Google Gemini 2.0 Flash analyzes every frame to identify:
- Mouse clicks and movements
- Keyboard inputs and text entries
- Navigation patterns
- Element interactions (buttons, forms, links)
Script Generation: Creates a complete, production-ready Playwright Python script with:
- Proper selectors (CSS, text-based, IDs)
- Wait conditions and timeouts
- Error handling and screenshots on failure
- Detailed comments explaining each step
Auto-Execution: Optionally runs the generated script to verify it works
Storage: Saves scripts to Apify key-value store for later use

📋 Input Parameters

Parameter	Type	Description	Default
`videoFile`	string	Name of video file in videos directory	`test_1.mp4`
`taskDescription`	string	Optional context about the task shown	-
`includeScreenshots`	boolean	Add screenshot captures to script	`false`
`executeAfter`	boolean	Auto-run script after generation	`true`
`headless`	boolean	Run browser in headless mode	`false`
`slowMo`	integer	Slow motion delay (ms) for debugging	`100`

🔧 Environment Variables

Set these in the Apify Console:

GEMINI_API_KEY (required): Your Google Gemini API key - Get one here
VIDEO_UPLOAD_DIR: Custom directory for videos (default: ./videos)
MAX_VIDEO_SIZE_MB: Max upload size in MB (default: 100)
GEMINI_MODEL: Model to use (default: gemini-2.0-flash-exp)

📦 Outputs

Key-Value Store

generated_script.py: The complete Playwright automation script
Additional scripts if you request modifications

Dataset (if `save_output=true`)

Execution results with stdout/stderr
Success/failure status
Return codes

💡 Example Usage

Basic Usage

{
  "videoFile": "login_workflow.mp4",
  "taskDescription": "User logging into dashboard",
  "executeAfter": true
}

Advanced Usage

{
  "videoFile": "checkout_process.mp4",
  "taskDescription": "Complete e-commerce checkout flow",
  "includeScreenshots": true,
  "executeAfter": true,
  "headless": false,
  "slowMo": 500
}

🎥 Video Recording Tips

For best results when recording videos:

Clear Actions: Perform actions deliberately with visible mouse movements
Wait for Loads: Pause after page loads and before clicking
Stable Elements: Interact with elements that have consistent selectors
Resolution: Record in 1080p or higher for better element detection
Duration: Keep videos under 2-3 minutes (Gemini context limits)
Single Task: One clear workflow per video

🏗️ Architecture

┌─────────────────┐
│   Upload Video  │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│  Gemini Vision  │ ◄── Frame-by-frame analysis
│   AI Analysis   │     Interaction detection
└────────┬────────┘     Element identification
         │
         ▼
┌─────────────────┐
│ Script Generator│ ◄── Playwright template
│  (AI Prompted)  │     Selector optimization
└────────┬────────┘     Error handling
         │
         ▼
┌─────────────────┐
│  Execute Script │ ◄── Browser automation
│   (Optional)    │     Verification
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│  Save to Store  │ ◄── Key-value storage
│   Return Results│     Dataset output
└─────────────────┘

🛠️ Technical Details

Supported Websites

Works best with public websites (no login walls for search engines)
Handles dynamic content (SPAs, AJAX)
Supports consent dialogs and popups
Compatible with YouTube, Google, e-commerce sites, etc.

Script Features

Generated scripts include:

Async/await pattern for modern Playwright
Proper browser context setup
Viewport and user-agent configuration
Timeout management (15s elements, 30s navigation)
Try/catch error handling
Screenshot on failure
Console logging for debugging

Limitations

Video must be clear and not too fast
Complex multi-step workflows may need script refinement
Personalized content (like YouTube homepage) requires search functionality
Very long videos (>5 min) may hit token limits

🔄 MCP Server Mode

The actor can also run as an MCP (Model Context Protocol) server for interactive use:

Set AUTO_ANALYZE_VIDEO=false to enable MCP mode, then use these tools:

analyze_video: Generate script from video
modify_script: Refine script with natural language
execute_script: Run the automation
get_script: Retrieve current script
save_script: Store to key-value store

📚 Example Output

from playwright.async_api import async_playwright
import asyncio

async def main():
    async with async_playwright() as p:
        browser = await p.chromium.launch(headless=False)
        page = await browser.new_page()
        
        # Step 1: Navigate to YouTube
        print("Navigating to YouTube...")
        await page.goto('https://www.youtube.com')
        
        # Step 2: Search for video
        print("Searching...")
        search_input = page.locator('input[name="search_query"]')
        await search_input.fill('Minecraft gameplay')
        await page.keyboard.press('Enter')
        await asyncio.sleep(2)
        
        # Step 3: Click first video
        print("Clicking video...")
        video = page.locator('a#video-title').nth(0)
        await video.click()
        
        print("[SUCCESS] Automation completed!")
        await browser.close()

asyncio.run(main())

🐛 Troubleshooting

Script times out finding elements:

Video content may not be available (e.g., personalized feeds)
Use search functionality instead of expecting content on homepage
Increase timeout values in generated script

Unicode/encoding errors:

Already handled with UTF-8 subprocess environment
If issues persist, remove emojis from print statements

Video upload fails:

Check file size (max 100MB by default)
Ensure video format is supported (mp4, avi, mov, webm)
Verify video path in VIDEO_UPLOAD_DIR

Generated script doesn't work:

Use modify_script tool to refine selectors
Add more wait conditions
Handle dynamic content with explicit waits

📄 License

MIT

🔗 Links

👥 Support

For issues or questions:

Check the execution logs in Apify Console
Review generated script for errors
Try modifying the script with natural language instructions
Ensure your Gemini API key is valid and has quota

Playwright Test Agent Mcp

bronze_quarterback/playwright-test-agent-mcp

Segun Zubair

Playwright MCP Server

jiri.spilka/playwright-mcp-server

A Model Context Protocol (MCP) server that provides browser automation capabilities using Playwright

Jiří Spilka

151

Playwright MCP Actor

aluminum_jam/playwright-mcp-actor

The Playwright MCP Actor integrates the robust browser automation capabilities of Playwright with the Model Context Protocol (MCP), enabling AI agents and language models to perform web scraping, testing, and automation tasks through a standardized interface.

anuj upadhyay

5.0

Web Scraper Mcp

sovereigntaylor/web-scraper-mcp

Ricardo Akiyoshi

Excel Mcp Server

lovely_radiologist/excel-mcp-server

Vivek Gaur

Python Playwright template

ellustar/my-actor-59

A ready-to-use Python Playwright template for building reliable web automation and scraping actors. Includes clean project structure, browser setup, async support, and best practices to help you quickly launch, customize, and scale Playwright-based workflows.”

Ellustar

Seo Analyzer Mcp

sovereigntaylor/seo-analyzer-mcp

Ricardo Akiyoshi

Time MCP Server

agentify/time-mcp-server

An MCP server implementing the Model Context Protocol (MCP) for time-related operations.

agentify

Mcp Server Generator

fiery_dream/mcp-server-generator

Cody Churchwell

Python Playwright Template

ellustar/my-actor-74

Python Playwright Template is a ready to use automation actor for web scraping and testing. It provides a clean project structure, browser setup, page models, and example scripts so you can quickly build, customize, and deploy reliable Playwright workflows at scale. Ideal for cloud or local runs

Ellustar