Pricing

Pay per usage

Product Data Extractor (price, stock, rating)

Extract clean, normalized product data — name, price, currency, availability, brand, rating, SKU/GTIN, image — from public product pages via JSON-LD, microdata, and OpenGraph. HTML-only, fast, structured output.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Tommy G

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

Product Data Extractor (Apify Actor)

Give it public product page URLs, get back clean, normalized product data — name, price, currency, availability, in-stock, brand, rating, SKU/GTIN/MPN, image — pulled from JSON-LD, microdata, and OpenGraph. HTML-only (no headless browser) so it's fast and cheap. Ideal for price monitoring, competitor tracking, catalog enrichment, and feed building.

Why it's useful (and money-first)

Price/stock monitoring is one of the most-demanded scraping jobs. This actor turns messy product markup (which comes in dozens of shapes — Offer vs AggregateOffer, price as string vs number, 1.299,00 vs $1,299.00, availability URLs vs text) into one stable, tidy record.

Input

{ "startUrls": [{ "url": "https://scrapeme.live/shop/Bulbasaur/" }], "maxConcurrency": 5, "maxPages": 100 }

maxPages capped at 200, maxConcurrency at 20 (cost guard).

{
  "status": "ok",
  "requested_url": "https://shop.example.com/widget",
  "final_url": "https://shop.example.com/widget",
  "http_status": 200,
  "found": true,
  "source": "json-ld",
  "name": "Acme Widget",
  "brand": "Acme",
  "price": 19.99,
  "currency": "USD",
  "availability": "InStock",
  "in_stock": true,
  "rating_value": 4.5,
  "rating_count": 231,
  "sku": "AW-1",
  "gtin": "0123456789012",
  "mpn": null,
  "image": "https://cdn.example.com/w.jpg",
  "description": "...",
  "offers_count": 1,
  "extracted_at": "2026-05-29T..."
}

source is json-ld | microdata | opengraph | none. found:false means no product data was present in the page markup (e.g. a blog or a JS-rendered shop). Failed fetches return the same keys with status:"error" + error.

Run locally / test

npm install
npm test     # unit tests on the pure extractor (node:test)

Publish to Apify (account-holder's step)

npm install -g apify-cli
apify login          # free Apify account
apify push           # from this directory

Keep it free initially; enable pricing later via the adult account-holder once it shows repeat organic usage and clears a margin gate.

Notes / safety

SSRF-guarded (scheme + private/metadata IP block + redirect re-check), robots-respecting, rate-limited, cost-capped — all via the shared src/lib/actor_runner.js.
Stores only derived product fields — no raw page bodies / PII.
HTML-only: client-rendered shops that inject product JSON via JS will return found:false (no server-side markup to read). Core logic in src/extract.js (pure, unit-tested).

Universal Product Price Scraper

flipper_ai/universal-product-price-scraper

Extract product price, title, currency, availability, brand, SKU, and image from any product URL using structured data (JSON-LD / Open Graph). No browser, fast and cheap.

Josh Baker

Ecommerce Price Scraper

flipper_ai/Ecommerce-Price-Scraper

Extract product price, title, currency, availability, brand, SKU, and image from any product URL using structured data (JSON-LD / Open Graph). No browser, fast and cheap.

Josh Baker

Amazon Product Price & Availability Scraper

seeb/amazon-product-price-availability-scraper

Extract Amazon product price, stock, rating, review, seller, brand, image, and availability data from public product URLs or ASINs.

Techionik

JSON-LD Product Parser

mahogany_songbird/json-ld-product-parser

Extract Product schema.org JSON-LD from e-commerce pages: name, brand, price, rating.

Britton Furness

Local Business Data Extractor (NAP, hours, geo)

tom2turnt/localbusiness-extractor

Extract normalized local-business data — name, type, phone, email, full address, lat/long, opening hours, price range, rating — from public pages via JSON-LD (LocalBusiness subtypes, Organization), microdata, and OpenGraph. HTML-only, fast, structured ok/error output.

Tommy G

E-commerce Product Scraper

timely_quarterstaff/ecommerce-scraper

Deterministic SSRF-guarded extraction of structured product data from a SINGLE public product-page URL: title, price, currency, availability, brand, rating, reviews, images, SKU, description via JSON-LD/OpenGraph/meta. Pure code, no proxy/headless/AI/paid API. Single-page, not bulk crawling.

Ahmed Moussa

Structured Data Extractor - JSON-LD, OpenGraph, Meta

piposlab/structured-data-extractor

Extract JSON-LD, OpenGraph, Twitter cards, microdata and meta tags from any URL. For SEO audits, AI dataset building and competitor research. No API key.

Alejandro Bufarini

Product Page Price Extractor

aetheragent/product-page-price-extractor

Extracts pricing and product details from any e-commerce product page. Captures product name, price, currency, availability, SKU, brand, and image URL. Uses intelligent HTML parsing for any site structure. Supports Shopify, WooCommerce, Magento, Amazon, and more.

Grant Mitchell

Event Data Extractor (date, venue, tickets, performers)

tom2turnt/newtype-extractor

Extract clean, normalized event data — name, start/end date, venue & address, geo, online/offline mode, performers, ticket price & availability — from public event pages via JSON-LD (schema.org/Event), microdata, and OpenGraph. HTML-only, fast, structured output.

Tommy G

Structured Data Extractor - JSON-LD, OpenGraph, Microdata

gratifying_graph/structured-data-extractor

Extract every piece of structured data from any URL: JSON-LD blocks by schema.org type, OpenGraph and Twitter Card tags, microdata items, canonical and meta basics. Batch over URL lists or call synchronously from AI agents.