AI Extraction Agent - Smart Scraper avatar

AI Extraction Agent - Smart Scraper

Pricing

from $0.01 / 1,000 results

Go to Apify Store
AI Extraction Agent - Smart Scraper

AI Extraction Agent - Smart Scraper

AI-powered data extraction using natural language prompts. Describe what you need & let AI extract structured data from any webpage automatically.

Pricing

from $0.01 / 1,000 results

Rating

0.0

(0)

Developer

The Howlers

The Howlers

Maintained by Community

Actor stats

3

Bookmarked

14

Total users

3

Monthly active users

a day ago

Last modified

Share

AI Extraction Agent - Autonomous Web Data Extraction

AI-powered web agent that autonomously extracts data from websites using natural language objectives. Uses Claude AI for intelligent extraction - NO Firecrawl dependency required. Built by John Rippy (https://www.linkedin.com/in/johnrippy/ | https://johnrippy.link/).

BYOK (Bring Your Own Key) -- you provide your own API credentials.


Before You Start

This actor requires your own API credentials to fetch real data.

Where to get your key: Your Anthropic API key for Claude AI extraction. Get one at https://console.anthropic.com

You can test with Demo Mode first (free, no key needed) to see the output format before committing.


Quick Start

Test with Demo Mode (free, no API key needed)

{
"demoMode": true,
"url": "https://example.com",
"objective": "Find the pricing information and list all available plans with their features"
}

Run with real data

{
"demoMode": false,
"url": "https://example.com",
"objective": "Find the pricing information and list all available plans with their features",
"maxPages": 5,
"followLinks": true,
"anthropicApiKey": "YOUR_API_KEY_HERE",
"webhookPlatform": "custom"
}

Input Parameters

ParameterTypeDefaultRequiredDescription
urlstring-NoThe starting URL to begin extraction from
objectivestring-NoNatural language description of what data to extract (e.g., 'Find all pricing plans and their features')
schemaobject-NoOptional JSON schema to structure the extracted data. If provided, the AI will conform output to this schema.
maxPagesinteger5NoMaximum number of pages to crawl and analyze
followLinksbooleantrueNoWhether to follow links on the page to discover more content
anthropicApiKeystring-Yes*Your Anthropic API key for Claude AI extraction. Get one at https://console.anthropic.com
demoModebooleanfalseNoRun with sample data (no API key required). Useful for testing the actor.
webhookUrlstring-NoURL to send extraction results via POST request. Supports Zapier, Make, n8n, and custom webhooks.
webhookPlatformstring"custom"NoSelect your webhook platform for optimized payload formatting
webhookHeadersobject-NoAdditional HTTP headers to include in webhook requests (e.g., authorization tokens)

*Required when Demo Mode is off.


Pricing

This actor uses pay-per-event billing:

EventDescriptionPrice
Extraction CompletedEach AI-powered web data extraction completed$0.05

Demo mode is free -- no charges for sample data.


Troubleshooting

"API key is required"

You have Demo Mode turned off but didn't provide an API key. Either:

  • Turn Demo Mode on to test with sample data
  • Add your API key in the input

"API error 403" or "Unauthorized"

Your API key is invalid, expired, or doesn't have access to this specific API endpoint. Double-check your key and account permissions.

"API error 429" or "Rate limit"

Too many requests. Wait a minute and try again, or reduce the number of items per run.

No results or empty dataset

Check the run log for error messages. Common causes:

  • Invalid input format (check the examples above)
  • API key without proper permissions
  • The target data doesn't exist or is too small to track

How do I test without an API key?

Enable Demo Mode in the input. This returns realistic sample data so you can verify the output format works for your workflow.


Built by John Rippy | Actor Arsenal