Pricing

Pay per event

ARC Prize Leaderboard Scraper

Scrapes ARC Prize leaderboard data (ARC-AGI-1/2/3 benchmarks) for all AI models including scores, costs, providers, and rankings

Developer

Stas Persiianenko

Last modified: 2 months ago

📖 What does it do?

The ARC Prize Leaderboard Scraper extracts structured benchmark data from arcprize.org/leaderboard, the official leaderboard of the ARC Prize competition, whose ARC-AGI benchmarks are widely used to measure progress toward artificial general intelligence (AGI).

Give it a list of benchmark versions (v1, v2, v3) and it returns every model's performance data including scores, costs per task, provider information, model types, and release dates.

What you get for each leaderboard entry:

Model name and display label
Provider/organization name
ARC-AGI benchmark version and dataset
Score (0–1 accuracy)
Cost per task (v1/v2) or total evaluation cost (v3)
Model type (Base LLM, CoT, Custom, etc.)
Model group/family
Release date

👥 Who is it for?

🔬 AI researchers and academics

Tracking the frontier of AI capabilities? Use this actor to collect time-series data on how different model families progress on ARC-AGI benchmarks without manually scraping the leaderboard.

📊 Data scientists and analysts

Building dashboards comparing LLM capabilities, cost-efficiency frontiers, or vendor performance? Get structured, queryable JSON output for all models across all benchmark versions.

🤖 AI product teams and investors

Watching the competition? Monitor competitor model performance on the hardest reasoning benchmarks, track cost-efficiency trends, or build automated capability-tracking pipelines.

📰 AI journalists and content creators

Writing about AGI progress? Pull fresh leaderboard data programmatically to power articles, newsletters, or automated reports.

🏫 Educators and course creators

Teaching AI capabilities and limitations? Use live leaderboard data in lectures, assignments, and demos.

🚀 Why use it?

Direct JSON endpoint — ARC Prize exposes clean public JSON endpoints; no HTML parsing or browser automation needed
All benchmark versions — covers ARC-AGI-1, ARC-AGI-2, and ARC-AGI-3 in a single run
Structured output — fully typed fields, consistent schema across versions
Always fresh — fetches live data from arcprize.org on every run
Low cost — pure HTTP requests, runs in under 30 seconds with minimal compute

📊 Data fields extracted

| Field | Type | Description |
|---|---|---|
| `version` | string | Benchmark version: `v1`, `v2`, or `v3` |
| `datasetId` | string | Internal dataset identifier (e.g., `v1_Semi_Private`) |
| `datasetDisplayName` | string | Human-readable dataset name (ARC-AGI-1, ARC-AGI-2, ARC-AGI-3) |
| `modelId` | string | Unique model identifier |
| `modelDisplayName` | string | Human-readable model name |
| `modelType` | string \| null | Model type: Base LLM, CoT, Custom, CoT + Synthesis, etc. |
| `modelGroup` | string \| null | Model family/group name |
| `providerId` | string | Provider identifier (e.g., `Anthropic`, `OpenAI`) |
| `providerDisplayName` | string | Human-readable provider name |
| `score` | number | Accuracy score (0–1, where 1.0 = 100%) |
| `costPerTask` | number \| null | Cost in USD per task (v1/v2); null for v3 |
| `totalCost` | number \| null | Total evaluation cost in USD (v3); null for v1/v2 |
| `modelReleaseDate` | string \| null | Model release date (ISO 8601) |
| `display` | boolean | Whether this entry is shown on the public leaderboard |
| `resultsUrl` | string | URL to detailed results; may be an empty string when none is available |
| `leaderboardUrl` | string | URL to the ARC Prize leaderboard |
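
As a rough illustration of the schema above, the fields can be modelled as a Python `TypedDict`. The names `LeaderboardEntry` and `score_percent` are hypothetical and introduced only for this sketch; they are not part of the actor.

```python
from typing import Optional, TypedDict


class LeaderboardEntry(TypedDict):
    """One dataset item, mirroring the field table (hypothetical type name)."""
    version: str
    datasetId: str
    datasetDisplayName: str
    modelId: str
    modelDisplayName: str
    modelType: Optional[str]
    modelGroup: Optional[str]
    providerId: str
    providerDisplayName: str
    score: float
    costPerTask: Optional[float]
    totalCost: Optional[float]
    modelReleaseDate: Optional[str]
    display: bool
    resultsUrl: str
    leaderboardUrl: str


def score_percent(entry: LeaderboardEntry) -> float:
    """Convert the 0-1 accuracy score to a percentage, rounded to 2 decimals."""
    return round(entry["score"] * 100, 2)
```

A type like this is mainly useful for editor completion and static checks; at runtime the items remain plain dictionaries.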

💰 How much does it cost?

This scraper uses Pay-Per-Event (PPE) pricing: each run incurs a small start fee plus a charge for every leaderboard entry actually extracted.

| What you pay for | Cost |
|---|---|
| Run started (once per run) | $0.005 |
| Per leaderboard entry extracted | $0.0005 |

Example costs:

All 3 benchmark versions (~300 entries total): ~$0.155
One benchmark version (~100–150 entries): ~$0.055–0.080
Weekly monitoring of all versions (about 4 runs per month): ~$0.62/month
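
The example figures follow directly from the two event prices. A minimal sketch of that arithmetic is below; the function names are hypothetical, and the entry counts are the approximate ones quoted above.

```python
RUN_START_FEE_USD = 0.005    # "Run started" event, charged once per run
PER_ENTRY_FEE_USD = 0.0005   # charged for each leaderboard entry extracted


def estimate_run_cost(entries: int) -> float:
    """Estimated cost in USD of one run that extracts `entries` items."""
    if entries < 0:
        raise ValueError("entries must be non-negative")
    return round(RUN_START_FEE_USD + entries * PER_ENTRY_FEE_USD, 4)


def estimate_monthly_cost(entries_per_run: int, runs_per_month: int) -> float:
    """Estimated monthly cost in USD for a recurring schedule."""
    return round(estimate_run_cost(entries_per_run) * runs_per_month, 4)


print(estimate_run_cost(300))         # all three versions, ~300 entries
print(estimate_monthly_cost(300, 4))  # weekly schedule, about four runs a month
```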

⚙️ Input configuration

| Parameter | Type | Default | Description |
|---|---|---|---|
| `datasets` | array | `["v1","v2","v3"]` | Which benchmark versions to scrape |
| `includeHidden` | boolean | `false` | Include entries not shown on public leaderboard |
| `maxRequestRetries` | integer | `3` | Retry attempts for failed HTTP requests |
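
When the input is assembled in code, a small helper can apply the defaults from the table and catch typos before a run is started. This is only a sketch: `build_run_input` is a hypothetical helper, and its validation rules are assumptions rather than the actor's own input checks.

```python
from typing import Any, Dict, Iterable, Optional

ALLOWED_DATASETS = ("v1", "v2", "v3")


def build_run_input(
    datasets: Optional[Iterable[str]] = None,
    include_hidden: bool = False,
    max_request_retries: int = 3,
) -> Dict[str, Any]:
    """Return an input dict using the parameter names and defaults from the table."""
    chosen = list(datasets) if datasets is not None else list(ALLOWED_DATASETS)
    unknown = [d for d in chosen if d not in ALLOWED_DATASETS]
    if unknown:
        raise ValueError(f"Unknown dataset versions: {unknown}")
    if max_request_retries < 0:
        raise ValueError("max_request_retries must be non-negative")
    return {
        "datasets": chosen,
        "includeHidden": include_hidden,
        "maxRequestRetries": max_request_retries,
    }
```

The resulting dictionary can be passed as `run_input` to the Python client or serialized to JSON for the API.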

📋 Example input

Scrape all three ARC-AGI benchmark leaderboards:

{
    "datasets": ["v1", "v2", "v3"],
    "includeHidden": false
}

Scrape only ARC-AGI-2 including hidden entries:

{
    "datasets": ["v2"],
    "includeHidden": true
}

📤 Example output

{
    "version": "v1",
    "datasetId": "v1_Semi_Private",
    "datasetDisplayName": "ARC-AGI-1",
    "modelId": "Claude 3.7",
    "modelDisplayName": "Claude 3.7",
    "modelType": "Base LLM",
    "modelGroup": null,
    "providerId": "Anthropic",
    "providerDisplayName": "Anthropic",
    "score": 0.136,
    "costPerTask": 0.058,
    "totalCost": null,
    "modelReleaseDate": "2025-02-24T00:00:00.000Z",
    "display": true,
    "resultsUrl": "",
    "leaderboardUrl": "https://arcprize.org/leaderboard"
}
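
Because every item has the same shape, ranking is a short post-processing step. The sketch below picks the highest-scoring entries for one benchmark version; `top_models` is a hypothetical helper and the sample rows are made-up placeholders, not real leaderboard results.

```python
from typing import Any, Dict, List


def top_models(items: List[Dict[str, Any]], version: str, n: int = 10) -> List[Dict[str, Any]]:
    """Return the n highest-scoring entries for one benchmark version."""
    selected = [it for it in items if it.get("version") == version]
    return sorted(selected, key=lambda it: it["score"], reverse=True)[:n]


# Illustrative data only, not real leaderboard results.
sample = [
    {"version": "v2", "modelDisplayName": "model-a", "score": 0.10},
    {"version": "v2", "modelDisplayName": "model-b", "score": 0.25},
    {"version": "v1", "modelDisplayName": "model-c", "score": 0.60},
]
for entry in top_models(sample, "v2", n=2):
    print(f'{entry["modelDisplayName"]}: {entry["score"]:.1%}')
```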

🛠️ How to use

Follow these steps to get leaderboard data from the Apify Store:

1. Open the actor: go to ARC Prize Leaderboard Scraper on the Apify Store and click Try for free.
2. Configure input: in the Input tab, choose which benchmark versions (v1, v2, v3) to scrape and whether to include hidden entries.
3. Run: click Start and wait for the actor to finish (typically under 30 seconds).
4. Download results: go to the Dataset tab to view extracted entries. Export as JSON, CSV, or JSONL using the Export button, or fetch via the Apify API.
5. Schedule recurring runs (optional): click Schedule to run automatically (e.g., weekly) and always have fresh leaderboard data.
6. Connect to downstream tools: use the Apify integrations to send data to Google Sheets, Slack, Webhooks, or any HTTP endpoint after each run.
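
If the items are fetched through the API rather than exported from the Dataset tab, a CSV can also be produced locally. The following is a minimal sketch using only the standard library; `items_to_csv` and the default column selection are hypothetical choices for illustration.

```python
import csv
import io
from typing import Any, Dict, List, Sequence

COLUMNS = ("version", "modelDisplayName", "providerDisplayName",
           "score", "costPerTask", "totalCost")


def items_to_csv(items: List[Dict[str, Any]], columns: Sequence[str] = COLUMNS) -> str:
    """Serialize dataset items to CSV text, keeping only the chosen columns."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(columns), lineterminator="\n")
    writer.writeheader()
    for item in items:
        # Missing keys and None values are written as empty cells.
        writer.writerow({col: "" if item.get(col) is None else item[col] for col in columns})
    return buffer.getvalue()
```

Writing `null` costs as empty cells keeps the v1/v2 and v3 rows in one file without mixing the two cost fields.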

🔗 Integrations

Connect ARC Prize Leaderboard Scraper to your existing tools and workflows:

Google Sheets

Use the Apify → Google Sheets integration to automatically append fresh leaderboard data to a spreadsheet after each run. Ideal for building live-updating dashboards or sharing data with your team.

Slack notifications

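Post a Slack message when a run finishes by wiring up the Apify → Slack integration in the Integrations tab. The notification is tied to the run, not to the leaderboard itself, so spotting a change (e.g., a new model entering the top 10) means comparing the new dataset with the previous run's.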

Webhooks

After each run completes, fire a webhook to any HTTP endpoint — your own backend, a Zapier/Make workflow, or an n8n automation. Configure in the actor's Integrations tab → Webhook.

Apify API + Python / Node.js

Embed leaderboard scraping in your own data pipeline using the Apify Python client or Node.js client. See the API usage examples section below.

Make (Integromat) / Zapier

Use Apify's native Make and Zapier connectors to route leaderboard data into spreadsheets, databases, or notification services without writing code.

AI agents via MCP

Expose live benchmark data to Claude, Cursor, or VS Code AI features — see the MCP integration section below.

🔧 Technical details

Architecture: Pure HTTP, no browser needed
Source: arcprize.org/media/data/leaderboard/{v1,v2,v3}.json
Typical runtime: < 30 seconds
Memory: 256 MB
Rate limits: Public JSON endpoints, no rate limiting observed
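
The pure-HTTP approach can be sketched with the standard library alone. This is not the actor's implementation: the `https://` scheme, the helper names, and the backoff policy are assumptions, and the structure of the returned JSON is left unparsed because it is not described here.

```python
import json
import time
import urllib.request
from typing import Any, Callable

# Assumption: the source pattern above expands to one HTTPS URL per version.
BASE_URL = "https://arcprize.org/media/data/leaderboard"


def leaderboard_url(version: str) -> str:
    """Build the JSON URL for one benchmark version, e.g. 'v2'."""
    if version not in ("v1", "v2", "v3"):
        raise ValueError(f"Unknown version: {version}")
    return f"{BASE_URL}/{version}.json"


def http_get(url: str) -> bytes:
    """Plain HTTP GET using only the standard library."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return response.read()


def fetch_json(url: str, max_retries: int = 3,
               get: Callable[[str], bytes] = http_get,
               backoff_s: float = 1.0) -> Any:
    """Fetch and parse JSON, retrying failed requests up to max_retries times."""
    for attempt in range(max_retries + 1):
        try:
            return json.loads(get(url))
        except OSError:  # urllib.error.URLError is a subclass of OSError
            if attempt == max_retries:
                raise
            time.sleep(backoff_s * (2 ** attempt))  # exponential backoff
```

The `get` parameter exists so the retry logic can be exercised with a stub instead of a live request.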

🤖 MCP integration (Claude, Cursor, VS Code)

Use this actor as a live data source inside AI coding assistants via the Apify MCP server.

Claude Code (terminal)

Install the Apify MCP server into Claude Code with one command:

$ claude mcp add apify -- npx -y @apify/mcp-server@latest

Then set your API token:

$ export APIFY_API_KEY=your_apify_api_token

In any Claude conversation you can then ask:

"Run the arcprize-leaderboard-scraper actor and show me the top 10 models by score on ARC-AGI-2."
"Show me all models from Anthropic on ARC-AGI-1 with their scores, sorted by score descending."
"Which models have scores above 50% on ARC-AGI-2? Run the scraper and filter the results."

Claude Desktop

Add the Apify MCP server to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):

{
  "mcpServers": {
    "apify": {
      "command": "npx",
      "args": ["-y", "@apify/mcp-server@latest"],
      "env": {
        "APIFY_API_KEY": "your_apify_api_token"
      }
    }
  }
}

Cursor / VS Code

Add to your MCP settings (.cursor/mcp.json or VS Code MCP config):

{
  "mcpServers": {
    "apify": {
      "command": "npx",
      "args": ["-y", "@apify/mcp-server@latest"],
      "env": {
        "APIFY_API_KEY": "your_apify_api_token"
      }
    }
  }
}

Once configured, your AI assistant can call run_actor with actor ID automation-lab/arcprize-leaderboard-scraper and input like {"datasets": ["v1","v2","v3"]} to fetch live leaderboard data mid-conversation.

🤔 FAQ — Frequently asked questions

What is ARC-AGI?

The Abstraction and Reasoning Corpus for Artificial General Intelligence (ARC-AGI) is a benchmark created by François Chollet to measure general reasoning ability. Its tasks require pattern recognition and abstract reasoning rather than knowledge retrieval.

What's the difference between ARC-AGI-1, -2, and -3?

ARC-AGI-1: Original benchmark, introduced in 2019; many top models now exceed 85% accuracy
ARC-AGI-2: Harder 2025 version; current best models score under 30%
ARC-AGI-3: Hardest 2025/2026 version; frontier models score under 1%

Why are some entries hidden?

Hidden entries (display: false) include superseded models, internal test runs, or entries the ARC Prize team chose not to highlight. Set includeHidden: true to see them.

How often is the leaderboard updated?

arcprize.org updates its JSON files when new evaluation results are submitted. Run this actor regularly (e.g., weekly via Apify schedules) to track changes over time.
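
Tracking changes over time comes down to comparing the items from two runs. A minimal sketch is below, assuming that `datasetId` plus `modelId` identifies an entry (an assumption, not something the actor guarantees); `diff_snapshots` is a hypothetical helper.

```python
from typing import Any, Dict, List, Tuple

Key = Tuple[str, str]  # (datasetId, modelId)


def index_by_key(items: List[Dict[str, Any]]) -> Dict[Key, Dict[str, Any]]:
    """Index entries by (datasetId, modelId), assumed here to identify an entry."""
    return {(it["datasetId"], it["modelId"]): it for it in items}


def diff_snapshots(old: List[Dict[str, Any]],
                   new: List[Dict[str, Any]]) -> Dict[str, List[Any]]:
    """Report entries added, removed, or re-scored between two runs."""
    old_idx, new_idx = index_by_key(old), index_by_key(new)
    added = sorted(k for k in new_idx if k not in old_idx)
    removed = sorted(k for k in old_idx if k not in new_idx)
    rescored = sorted(
        (k, old_idx[k]["score"], new_idx[k]["score"])
        for k in new_idx
        if k in old_idx and old_idx[k]["score"] != new_idx[k]["score"]
    )
    return {"added": added, "removed": removed, "rescored": rescored}
```

A non-empty `added` or `rescored` list is a natural trigger for a notification after a scheduled run.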

Why does v3 use totalCost instead of costPerTask?

ARC-AGI-3 reports total evaluation cost rather than per-task cost, reflecting the full infrastructure cost of a complete evaluation run.
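
Code that handles all three versions has to pick whichever cost field is populated. One way to do that, as a sketch with a hypothetical helper name:

```python
from typing import Any, Dict, Optional, Tuple


def reported_cost(entry: Dict[str, Any]) -> Optional[Tuple[str, float]]:
    """Return (field_name, usd) for whichever cost field is populated, else None."""
    for field in ("costPerTask", "totalCost"):
        value = entry.get(field)
        if value is not None:
            return field, float(value)
    return None
```

Returning the field name alongside the amount makes it harder to compare a per-task cost with a total evaluation cost by accident, since the two are measured on different scales.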

💻 API usage examples

You can trigger this actor programmatically via the Apify API or SDKs.

Node.js (ApifyClient):

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

const run = await client.actor('automation-lab/arcprize-leaderboard-scraper').call({
    datasets: ['v1', 'v2', 'v3'],
    includeHidden: false,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python (ApifyClient):

from apify_client import ApifyClient

client = ApifyClient('YOUR_API_TOKEN')

run = client.actor('automation-lab/arcprize-leaderboard-scraper').call(run_input={
    'datasets': ['v1', 'v2', 'v3'],
    'includeHidden': False,
})

items = client.dataset(run['defaultDatasetId']).list_items().items
print(items)

cURL:

curl -X POST \
  "https://api.apify.com/v2/acts/automation-lab~arcprize-leaderboard-scraper/runs?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"datasets":["v1","v2","v3"],"includeHidden":false}'

Related scrapers

EvalPlus Leaderboard Scraper: Scrapes the EvalPlus code generation benchmark leaderboard
AlpacaEval Leaderboard Scraper: Scrapes the AlpacaEval instruction-following leaderboard
LiveBench Scraper: Scrapes the LiveBench LLM benchmark leaderboard
EQ-Bench Scraper: Scrapes the EQ-Bench emotional intelligence benchmark leaderboard

⚖️ Legality and terms of use

This actor accesses publicly available JSON endpoints on arcprize.org — the same data that powers the public leaderboard website. No authentication is required, and the data is intentionally made public for research and benchmarking transparency.

No login, credentials, or bypassing of access controls is involved
The data is publicly published by the ARC Prize organization
Usage should comply with arcprize.org's terms of service
Do not use for commercial redistribution of the data without permission from the ARC Prize Foundation
