OmniStruct Cost-Optimized Scraper

Pricing: Pay per usage
Need clean, structured data from a website without burning through your Apify compute credits? OmniStruct Cost-Optimized Scraper is built from the ground up to be the most efficient, budget-friendly universal scraper on the platform.

Developer: velurix (Maintained by Community)
Rating: 0.0 (0 reviews) · 2 total users · 1 monthly active user · last modified 13 days ago
Universal Web Scraper

A high-performance, cost-optimized Apify Actor designed to scrape massive lists of URLs (40,000+) efficiently. It employs a two-phase crawling strategy: a fast, cheap HTTP crawl using BeautifulSoup, followed by an automatic browser fallback (Playwright) only for pages that require JavaScript rendering.

Features

  • Cost-Optimized Two-Phase Crawling: 90% of sites are scraped using cheap HTTP requests. Only strictly necessary sites (e.g., SPAs, aggressive anti-bot <noscript> pages) trigger a headless browser.
  • Fail-Fast Logic: Instantly drops dead domains and 4xx/5xx errors to free up concurrency slots and prevent infinite proxy retry loops.
  • Pre-Flight Validation: The included run_and_monitor.py script validates DNS and basic HTTP connectivity before spending Apify compute units.
  • Intelligent Extraction: Uses Mozilla's Readability.js logic to extract the main article content, alongside robust email and phone number regex extraction, and standard meta tags.
  • Batch Processing: Safely splits large URL lists into manageable chunks to avoid Apify run limits.
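The browser-fallback decision described above can be sketched roughly as follows. This is a minimal stdlib-only illustration, not the Actor's actual code; the text-length threshold and the SPA mount-point IDs are assumptions:

```python
from html.parser import HTMLParser

class _TextAndMarkers(HTMLParser):
    """Collects visible text length and SPA/anti-bot markers from raw HTML."""
    def __init__(self):
        super().__init__()
        self.text_chars = 0
        self.has_noscript = False
        self.root_ids = set()
        self._skip = 0  # depth inside <script>/<style>, whose text is not visible

    def handle_starttag(self, tag, attrs):
        if tag == "noscript":
            self.has_noscript = True
        if tag in ("script", "style"):
            self._skip += 1
        for name, value in attrs:
            if name == "id" and value:
                self.root_ids.add(value)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.text_chars += len(data.strip())

def needs_browser(html: str, min_text_chars: int = 200) -> bool:
    """Heuristic: fall back to a headless browser when the cheap HTTP crawl
    yields too little text or the page looks like a JS-rendered SPA."""
    p = _TextAndMarkers()
    p.feed(html)
    spa_roots = {"root", "app", "__next"}  # common SPA mount points (assumption)
    looks_like_spa = bool(spa_roots & p.root_ids)
    return p.has_noscript or looks_like_spa or p.text_chars < min_text_chars
```

A nearly empty `<div id="root">` shell would trigger the fallback, while a static page with real article text would not.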

How to Run

Option 1: Using the Apify UI

  1. Go to the run console of the Actor.
  2. Provide your list of urls in the input JSON format.
  3. Configure settings:
    • For bulk lists, set Max crawl depth (max_crawl_depth) to 0.
    • Set Max request retries (max_request_retries) to 0 or 1.
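Following those recommendations, a bulk-run input might look like this (field names are taken from the Configuration Reference below):

```json
{
  "urls": ["https://example.com", "https://example.org"],
  "max_crawl_depth": 0,
  "max_request_retries": 0,
  "enable_browser_fallback": true
}
```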

Option 2: Using the run_and_monitor.py script

For lists of more than 1,000 URLs, use the provided run_and_monitor.py script locally to orchestrate the API calls.

  1. Ensure your Apify API Token is set in run_and_monitor.py.
  2. Place your URLs in a CSV file (e.g., Untitled spreadsheet - Sheet1 (2).csv or update the script path).
  3. Run the script:
    # Runs with pre-validation (DNS + HEAD check) to save costs
    python run_and_monitor.py
    # Or, to skip validation and send directly to Apify:
    python run_and_monitor.py --no-validate
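The pre-flight validation step can be approximated with a small stdlib-only sketch (hostname parsing plus a DNS lookup; the HEAD check the script also performs is omitted here for brevity, and the function names are illustrative, not from the actual script):

```python
import socket
from urllib.parse import urlparse

def hostname_of(url: str) -> str:
    """Extract the hostname from a URL ('' if missing)."""
    return urlparse(url).hostname or ""

def dns_resolves(hostname: str) -> bool:
    """Return True if the hostname resolves; dead domains fail fast here,
    before any Apify compute units are spent."""
    if not hostname:
        return False
    try:
        socket.getaddrinfo(hostname, None)
        return True
    except socket.gaierror:
        return False

def prevalidate(urls):
    """Split a URL list into (probably alive, dead) before submission."""
    alive, dead = [], []
    for url in urls:
        (alive if dns_resolves(hostname_of(url)) else dead).append(url)
    return alive, dead
```

Only the `alive` list would then be sent to the Actor, which is where the cost saving comes from.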

Configuration Reference

Field                   | Default | Description
----------------------- | ------- | -----------
urls                    | []      | List of starting URLs.
max_crawl_depth         | 0       | Keep at exactly 0 for flat URL lists to prevent spidering the whole site.
max_request_retries     | 1       | Retries per failed request. 0 recommended for bulk runs.
http_timeout_secs       | 10      | Wait limit for standard (HTTP-crawled) pages.
browser_timeout_secs    | 15      | Wait limit for browser-rendered pages.
enable_browser_fallback | true    | Allows Playwright to step in when BeautifulSoup fails to extract content or detects an SPA.
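The batch-splitting behavior mentioned under Features can be sketched as follows (the chunk size of 500 is an arbitrary placeholder, not the Actor's actual limit):

```python
def split_into_batches(urls, batch_size=500):
    """Split a large URL list into fixed-size chunks so each Apify run
    stays under platform limits. batch_size is a placeholder value."""
    return [urls[i:i + batch_size] for i in range(0, len(urls), batch_size)]
```

Each chunk would then be submitted as a separate Actor run by an orchestration script such as run_and_monitor.py.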

Output Format

The Actor pushes items to the Apify dataset in the following format:

{
  "url": "https://example.com",
  "domain": "example.com",
  "status": "success",
  "title": "Example Domain",
  "description": "This domain is for use in illustrative examples.",
  "content_text": "Example Domain This domain is for use in illustrative examples...",
  "content_length_chars": 62,
  "emails": ["contact@example.com"],
  "phone_numbers": ["1-800-555-1234"],
  "used_browser_fallback": false,
  "timestamp": "2026-02-21T00:00:00.000000"
}
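Downstream, the dataset items are plain JSON objects, so contact details can be aggregated with a few lines of Python (a hypothetical post-processing step, not part of the Actor):

```python
def collect_contacts(items):
    """Aggregate unique emails and phone numbers from successful items,
    keyed by domain. `items` is a list of dataset records in the
    output format shown above."""
    contacts = {}
    for item in items:
        if item.get("status") != "success":
            continue
        entry = contacts.setdefault(item["domain"], {"emails": set(), "phones": set()})
        entry["emails"].update(item.get("emails", []))
        entry["phones"].update(item.get("phone_numbers", []))
    return contacts
```

Running this over a downloaded dataset merges duplicate contacts found on different pages of the same domain.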