Hybrid Vision Spider | AI-Powered Universal Web Scraper

Pricing

from $13.00 / 1,000 results

AI-driven hybrid web scraper that merges Playwright and Vision intelligence to extract structured data from any dynamic site. Schema-aware, proxy-ready, budget-safe, and fully compatible with Apify datasets.

Rating: 5.0 (2)

Developer: Țugui Dragoș

Maintained by Community

Actor stats: 2 bookmarked, 7 total users, 3 monthly active users, last modified 2 days ago

Apify Actor, Node.js, TypeScript, OpenAI, Playwright, Crawlee, Python, libphonenumber. License: Proprietary

Point. Click. Extract. Data from any webpage in seconds.


What This Actor Does

This actor visits web pages and extracts the data you need. Just select what you want to extract using simple checkboxes - no coding required.

Works great for:

  • Contact information (emails, phone numbers)
  • Product data (names, prices, descriptions)
  • Social media links
  • Any structured data from websites

Quick Start (3 Steps)

Step 1: Enter URLs

Paste the URLs you want to scrape, one per line:

https://example.com/contact
https://example.com/about
https://example.com/products
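
If you prepare the URL list programmatically, the one-per-line format above can be checked before a run. This is a minimal sketch only; the function name `parse_url_lines` is hypothetical and not part of the actor:

```python
from urllib.parse import urlparse


def parse_url_lines(text: str) -> list[str]:
    """Split pasted text into one URL per line, skipping blanks and non-HTTP entries."""
    urls = []
    for line in text.splitlines():
        candidate = line.strip()
        if not candidate:
            continue  # ignore empty lines
        parts = urlparse(candidate)
        # Keep only absolute http(s) URLs that include a host name.
        if parts.scheme in ("http", "https") and parts.netloc:
            urls.append(candidate)
    return urls


pasted = """
https://example.com/contact
https://example.com/about

not a url
https://example.com/products
"""
print(parse_url_lines(pasted))
```

Dropping invalid lines up front avoids spending a run on entries that cannot be fetched.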

Step 2: Select What to Extract

Check the boxes for the data you want:

Checkbox               What It Extracts
Extract Page Title     Main title/heading
Extract Email          Email addresses found on page
Extract Phone          Phone numbers
Extract Links          Important links
Extract Images         Image URLs
Extract Price          Price values ($, EUR, etc.)
Extract Social Links   Facebook, Twitter, LinkedIn, etc.
Extract Address        Physical addresses
Extract Description    Page summary/meta description

Step 3: Choose Mode and Run

Mode          When to Use                    Cost
HTML Only     Simple pages, contact info     Free
Hybrid        Most websites (recommended)    Low
Vision Only   Complex pages, visual data     Higher

Click Start and wait for results!

Extraction Modes Explained

HTML Only Mode (Free)

  • Uses pattern matching to find data
  • Makes no AI calls, so there are no AI costs
  • Best for: emails, phones, links, prices
  • Limitation: Cannot understand page context

Hybrid Mode (Recommended)

  • Tries HTML first, uses AI if needed
  • Best balance of speed and accuracy
  • Requires: OpenAI API key
  • Cost: ~$0.001-0.005 per page
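
The "HTML first, AI if needed" idea can be illustrated as follows. This is a sketch under stated assumptions, not the actor's actual implementation: the regular expressions are deliberately loose, and `extract_hybrid` and `fake_ai` are hypothetical names, with `fake_ai` standing in for a real model call.

```python
import re
from typing import Callable, Optional

EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Deliberately loose phone pattern; real phone parsing needs a dedicated library.
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")

# Hypothetical fallback signature: (html, missing_fields) -> {field: value}
AiFallback = Callable[[str, list[str]], dict]


def extract_with_patterns(html: str) -> dict:
    """HTML-only pass: return the first regex match per field, or None."""
    email = EMAIL_RE.search(html)
    phone = PHONE_RE.search(html)
    return {
        "email": email.group(0) if email else None,
        "phone": phone.group(0) if phone else None,
    }


def extract_hybrid(html: str, wanted: list[str],
                   ai_fallback: Optional[AiFallback] = None) -> dict:
    """Try pattern matching first; ask the fallback only for fields still missing."""
    found = extract_with_patterns(html)
    result = {field: found.get(field) for field in wanted}
    missing = [field for field, value in result.items() if value is None]
    if missing and ai_fallback is not None:
        ai_data = ai_fallback(html, missing)
        for field in missing:
            result[field] = ai_data.get(field)
    return result


def fake_ai(html: str, missing: list[str]) -> dict:
    # Stand-in for a real model call; returns a canned title only.
    return {"title": "Contact Us"} if "title" in missing else {}


page = "<h1>Contact Us</h1><p>hello@example.com, +1 555-123-4567</p>"
print(extract_hybrid(page, ["title", "email", "phone"], fake_ai))
```

Because the fallback is called only when a field is missing, pages that pattern matching handles fully incur no AI cost in this sketch.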

Vision Only Mode

  • AI "sees" the page like a human
  • Can extract any visible data
  • Requires: OpenAI API key
  • Cost: ~$0.002-0.010 per page

OpenAI API Key

Required for Hybrid and Vision Only modes.

How to get one:

  1. Go to platform.openai.com/api-keys
  2. Create an account or sign in
  3. Click "Create new secret key"
  4. Copy the key (starts with sk-)
  5. Paste it in the "OpenAI API Key" field

Advanced: Custom JSON Schema

For complex extractions, use the "Custom Schema" field. This overrides the checkboxes.

Example - Extract product data:

{
  "type": "object",
  "properties": {
    "productName": {
      "type": "string",
      "description": "Name of the product"
    },
    "price": {
      "type": "number",
      "description": "Price in USD"
    },
    "rating": {
      "type": "number",
      "description": "Rating out of 5 stars"
    },
    "reviews": {
      "type": "array",
      "items": { "type": "string" },
      "description": "Customer reviews"
    }
  },
  "required": ["productName", "price"]
}

Tips:

  • Add a "description" to each property to help the AI understand what to look for
  • List must-have fields under "required"
  • Use "type": "array" for lists of items
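
To see what "required" and "type" imply for a result, here is a small hand-rolled check against a shortened version of the product schema. It is a sketch, not a full JSON Schema validator, and `check_item` is a hypothetical helper that is not part of the actor:

```python
SCHEMA = {
    "type": "object",
    "properties": {
        "productName": {"type": "string"},
        "price": {"type": "number"},
        "reviews": {"type": "array", "items": {"type": "string"}},
    },
    "required": ["productName", "price"],
}

# Only the JSON Schema types used in this example are mapped.
TYPE_MAP = {"string": str, "number": (int, float), "array": list, "object": dict}


def check_item(item: dict, schema: dict) -> list[str]:
    """Return a list of problems; an empty list means the item passed."""
    problems = []
    for name in schema.get("required", []):
        if name not in item:
            problems.append(f"missing required field: {name}")
    for name, spec in schema.get("properties", {}).items():
        if name not in item:
            continue  # optional field that is absent
        expected = TYPE_MAP.get(spec.get("type"))
        value = item[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if expected is not None and (isinstance(value, bool)
                                     or not isinstance(value, expected)):
            problems.append(f"{name}: expected {spec['type']}")
    return problems


print(check_item({"productName": "Mug", "price": 9.5, "reviews": ["ok"]}, SCHEMA))
print(check_item({"productName": "Mug", "price": "9.50"}, SCHEMA))
```

The second call reports a problem because the price arrives as a string, which is the kind of mismatch a "type": "number" declaration is meant to prevent.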

Output Format

Each result contains:

{
  "url": "https://example.com/contact",
  "data": {
    "title": "Contact Us",
    "email": "hello@example.com",
    "phone": "+1 555-123-4567"
  },
  "confidence": {
    "title": 0.95,
    "email": 0.90,
    "phone": 0.85
  },
  "timestamp": "2024-12-14T10:30:00.000Z"
}
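
One way to use the per-field confidence values is to drop fields below a threshold before further processing. The sketch below assumes the scores range from 0 to 1, which the example suggests but does not state; `filter_by_confidence` is a hypothetical helper:

```python
def filter_by_confidence(result: dict, threshold: float = 0.9) -> dict:
    """Keep only the data fields whose confidence meets the threshold."""
    confidence = result.get("confidence", {})
    return {
        field: value
        for field, value in result.get("data", {}).items()
        # Fields without a score are treated as confidence 0.0 and dropped.
        if confidence.get(field, 0.0) >= threshold
    }


result = {
    "url": "https://example.com/contact",
    "data": {
        "title": "Contact Us",
        "email": "hello@example.com",
        "phone": "+1 555-123-4567",
    },
    "confidence": {"title": 0.95, "email": 0.90, "phone": 0.85},
    "timestamp": "2024-12-14T10:30:00.000Z",
}
print(filter_by_confidence(result))
```

With the default threshold of 0.9, the title and email are kept and the phone number (0.85) is dropped.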

Cost Control

Set limits to control your spending:

Setting        What It Does               Recommended
Max Results    Stop after X items         100
Max AI Pages   Limit AI-processed pages   10
Token Budget   Cap total AI tokens        50,000
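
How the actor enforces these limits internally is not documented here. As an illustration of the idea only, the three settings could be tracked like this; the `Budget` class and its method names are hypothetical, and the defaults mirror the recommended values in the table:

```python
from dataclasses import dataclass


@dataclass
class Budget:
    max_results: int = 100       # Max Results
    max_ai_pages: int = 10       # Max AI Pages
    token_budget: int = 50_000   # Token Budget
    results: int = 0
    ai_pages: int = 0
    tokens_used: int = 0

    def can_continue(self) -> bool:
        """True while the result limit has not been reached."""
        return self.results < self.max_results

    def can_use_ai(self, estimated_tokens: int) -> bool:
        """True if one more AI page fits both the page and token limits."""
        return (self.ai_pages < self.max_ai_pages
                and self.tokens_used + estimated_tokens <= self.token_budget)

    def record_ai_page(self, tokens: int) -> None:
        """Account for one AI-processed page."""
        self.ai_pages += 1
        self.tokens_used += tokens


budget = Budget(max_ai_pages=2, token_budget=1000)
print(budget.can_use_ai(600))   # first page fits
budget.record_ai_page(600)
print(budget.can_use_ai(600))   # would exceed the token budget
```

Checking the budget before each AI call, rather than after, keeps spending from overshooting the cap.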

Estimated costs:

  • HTML Only: Free
  • Hybrid (10 pages): ~$0.01-0.05
  • Vision Only (10 pages): ~$0.02-0.10
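
These figures are the per-page ranges quoted for each mode multiplied by the page count. A minimal sketch of that arithmetic, where the mode keys `html`, `hybrid`, and `vision` are labels chosen for this example:

```python
# Approximate AI cost per page in USD (low, high), taken from the mode descriptions.
PER_PAGE_USD = {
    "html": (0.0, 0.0),
    "hybrid": (0.001, 0.005),
    "vision": (0.002, 0.010),
}


def estimate_cost(mode: str, pages: int) -> tuple[float, float]:
    """Return the (low, high) estimated AI cost in USD for a number of pages."""
    low, high = PER_PAGE_USD[mode]
    # Round to four decimals to hide floating-point noise.
    return (round(low * pages, 4), round(high * pages, 4))


for mode in PER_PAGE_USD:
    print(mode, estimate_cost(mode, 10))
```

The estimate covers AI usage only and scales linearly, so 100 Hybrid pages would come to roughly $0.10-0.50 under the same assumptions.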

Limitations

  • Requires OpenAI API key for AI modes
  • Cannot follow links (processes only provided URLs)
  • Some JavaScript-heavy sites may need Vision Only mode
  • Rate limited by OpenAI API

Built with 🩶 for the Apify community 🫡