Ecommerce Price Extractor
Monitor competitor prices on any online store. Extracts name, price, currency, stock status, SKU, and description using AI. AJV-validated output. Only charged on successful extraction — $0.05 per URL.

Pricing: from $50.00 / 1,000 successful extractions

Rating: 0.0 (0 reviews)

Developer: Herbert Yeboah (Maintained by Community)

Actor stats

  • Bookmarks: 0
  • Total users: 1
  • Monthly active users: 0
  • Last modified: 4 days ago

E-Commerce Price Extractor

Only pay when it works. $0.05 per verified extraction — nothing charged on failure or retries.

Extract structured JSON from any product page using a Groq-compatible LLM.

AI Agent Compatible

This actor is AEO-native. The input_schema.json and output_schema.json expose exact field types, defaults, and constraints in machine-readable format. Any AI agent connected to the Apify MCP server — Claude Desktop, Cursor, VS Code — can discover, configure, and execute this actor autonomously without human input. No prompt engineering required.

What It Does

  1. Scrapes the page at your URL using CheerioCrawler, a fast HTTP-based crawler
  2. Strips all HTML, navigation, scripts, and boilerplate → clean plain text
  3. Prompts a Groq-compatible LLM to extract data matching your schema
  4. Validates the response with AJV (JSON Schema validator)
  5. Retries up to 3 times if the LLM returns invalid JSON, injecting the error back into the prompt
  6. Returns validated structured data in the Apify dataset

Charge: $0.05 per successful extraction. Nothing charged on failure.
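Step 2 is the key preprocessing move: the LLM sees clean text, not raw HTML. The actor does this with CheerioCrawler; the regex-based function below is a simplified, illustrative stand-in (not the actor's actual code) that shows the idea — drop boilerplate blocks wholesale, strip remaining tags, collapse whitespace:

```typescript
// Simplified stand-in for the actor's HTML-to-text step.
// The real implementation uses CheerioCrawler; this name is illustrative.
export function htmlToPlainText(html: string): string {
  return html
    // Remove scripts, styles, and navigation wholesale, content included.
    .replace(/<(script|style|nav|noscript)\b[\s\S]*?<\/\1>/gi, " ")
    // Strip every remaining tag, keeping its text content.
    .replace(/<[^>]+>/g, " ")
    // Collapse runs of whitespace into single spaces.
    .replace(/\s+/g, " ")
    .trim();
}
```

Feeding the model short plain text instead of markup also keeps token usage (and retries) down.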


Input Schema

Field          Type    Required  Default                  Description
url            string  yes                                Page to scrape
output_schema  object  yes                                JSON Schema defining the data to extract
groq_api_key   string  yes                                API key (Groq, OpenAI, Together AI, etc.)
model          string  no        llama-3.3-70b-versatile  Model name
base_url       string  no        Groq endpoint            For OpenAI-compatible providers
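In TypeScript terms, the input can be modeled like this (field names and defaults are from the table above; the interface name itself is just an illustrative label, not an export of the actor):

```typescript
// Illustrative model of the actor input; `ActorInput` is not a name
// exported by the actor, just a convenient label for the fields above.
interface ActorInput {
  url: string;            // Page to scrape (required)
  output_schema: object;  // JSON Schema defining the data to extract (required)
  groq_api_key: string;   // API key: Groq, OpenAI, Together AI, etc. (required)
  model?: string;         // Defaults to "llama-3.3-70b-versatile"
  base_url?: string;      // Defaults to the Groq endpoint
}

// Minimal valid input: only the three required fields.
const minimalInput: ActorInput = {
  url: "https://example.com/product/widget-pro",
  output_schema: { type: "object" },
  groq_api_key: "gsk_YOUR_GROQ_KEY_HERE",
};
```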

Usage Examples

Example 1: Groq (default, free tier)

Get a free API key at console.groq.com.

{
  "url": "https://example.com/product/widget-pro",
  "groq_api_key": "gsk_YOUR_GROQ_KEY_HERE",
  "output_schema": {
    "type": "object",
    "required": ["name", "price"],
    "properties": {
      "name": { "type": "string" },
      "price": { "type": "number" },
      "description": { "type": "string" },
      "in_stock": { "type": "boolean" }
    }
  }
}

Output:

{
  "url": "https://example.com/product/widget-pro",
  "extracted": {
    "name": "Widget Pro",
    "price": 29.99,
    "description": "The best widget on the market.",
    "in_stock": true
  },
  "model": "llama-3.3-70b-versatile",
  "attempts": 1
}

Example 2: OpenAI-compatible endpoint (Together AI, Fireworks AI)

Use any OpenAI-compatible provider by setting base_url:

{
  "url": "https://jobs.lever.co/anthropic/engineer",
  "groq_api_key": "YOUR_TOGETHER_AI_KEY",
  "base_url": "https://api.together.xyz/v1",
  "model": "meta-llama/Llama-3.3-70B-Instruct-Turbo",
  "output_schema": {
    "type": "object",
    "required": ["title", "company", "location", "salary_range"],
    "properties": {
      "title": { "type": "string" },
      "company": { "type": "string" },
      "location": { "type": "string" },
      "salary_range": { "type": "string" },
      "remote": { "type": "boolean" },
      "requirements": {
        "type": "array",
        "items": { "type": "string" }
      }
    }
  }
}

Other compatible endpoints:

  • Fireworks AI: https://api.fireworks.ai/inference/v1
  • OpenAI: https://api.openai.com/v1

Example 3: Ollama (local, completely free)

Run models locally at zero cost with Ollama:

# Start Ollama and pull a model
ollama serve
ollama pull llama3.3

Actor input:

{
  "url": "https://news.ycombinator.com/item?id=12345",
  "groq_api_key": "ollama",
  "base_url": "http://localhost:11434/v1",
  "model": "llama3.3",
  "output_schema": {
    "type": "object",
    "required": ["title", "score", "comments_count"],
    "properties": {
      "title": { "type": "string" },
      "score": { "type": "integer" },
      "comments_count": { "type": "integer" },
      "author": { "type": "string" },
      "url": { "type": "string" }
    }
  }
}

Note: When running the Actor on Apify cloud, Ollama requires a remote endpoint. For local testing, use apify run with localhost.


Common Use Cases

Use Case              Schema Fields
Product extraction    name, price, description, in_stock, SKU
Job postings          title, company, location, salary, requirements
News articles         headline, author, published_date, summary, tags
Real estate listings  address, price, bedrooms, bathrooms, sqft
Restaurant menus      restaurant_name, items (name, price, description)
Resume parsing        name, email, skills, experience, education
Event listings        name, date, venue, ticket_price, organizer

How Retry Logic Works

The actor uses the same retry-with-feedback pattern as constrained.py from the DagPipe core library:

  1. Attempt 1: Send text + schema → LLM responds → AJV validates
  2. On failure: Inject the exact AJV error message into the next prompt → retry
  3. Attempt 2: LLM receives error and corrects → validate again
  4. After 3 failures: Throw with a descriptive error message

This approach reliably extracts valid structured data even from smaller/cheaper models.
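A minimal sketch of that loop, with a pluggable validator standing in for AJV and a pluggable chat function standing in for the LLM call — all names here are illustrative, not the actor's internals:

```typescript
type Chat = (prompt: string) => Promise<string>;
// Returns null when the data is valid, otherwise an AJV-style error message.
type Validate = (data: unknown) => string | null;

async function extractWithRetry(
  text: string,
  validate: Validate,
  chat: Chat,
  maxAttempts = 3,
): Promise<{ extracted: unknown; attempts: number }> {
  let feedback = "";
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const raw = await chat(`Extract structured JSON from the text below.${feedback}\n\n${text}`);
    let data: unknown;
    try {
      data = JSON.parse(raw);
    } catch (err) {
      // Not even parseable JSON: feed the parse error back and retry.
      feedback = `\nYour previous reply was not valid JSON (${(err as Error).message}). Reply with JSON only.`;
      continue;
    }
    const error = validate(data);
    if (error === null) return { extracted: data, attempts: attempt };
    // Valid JSON but wrong shape: inject the exact validation error.
    feedback = `\nYour previous reply failed schema validation: ${error}. Correct it.`;
  }
  throw new Error(`Extraction failed after ${maxAttempts} attempts`);
}
```

Injecting the concrete error message, rather than just asking again, is what lets smaller models converge within a few attempts.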


Pricing

  • $0.05 per successful extraction (Pay-Per-Event)
  • Free if extraction fails — you're never charged for failed attempts
  • Groq's free tier provides 30 requests/minute at zero cost to you

Scheduling

Example: schedule this actor to run daily against 50 competitor product URLs. Total cost: $2.50/day (50 × $0.05). Zero infrastructure. Zero maintenance.
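The arithmetic behind that example, as a one-liner — the helper name and success-rate knob are illustrative; the $0.05 price is from this page:

```typescript
const PRICE_PER_EXTRACTION = 0.05; // USD, charged only on success

// Illustrative helper: failed extractions cost nothing, so cost scales
// with the number of URLs that actually succeed.
function dailyCost(urls: number, successRate = 1): number {
  return urls * successRate * PRICE_PER_EXTRACTION;
}
```

At 50 URLs with every extraction succeeding, that's 50 × $0.05 = $2.50/day; any failures only lower the bill.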


Technical Details

  • Scraper: CheerioCrawler (zero-JS, fast, reliable)
  • Validation: AJV v8 + ajv-formats (JSON Schema Draft-07/2019/2020 compatible)
  • LLM client: OpenAI SDK (works with any OpenAI-compatible endpoint)
  • Retry strategy: Error-feedback prompting (same pattern as DagPipe constrained.py)
  • Language: TypeScript, Node.js 20+
  • Tests: 9 Vitest tests, all passing

Built With

DagPipe — Zero-cost, crash-proof LLM pipeline orchestrator.

$ pip install dagpipe-core