Pricing

Pay per usage

AI Model Benchmark Scraper

Developer

Donny Nguyen

Features

Multi-benchmark support covering the most popular LLM evaluation frameworks
Chatbot Arena scraping with Elo ratings from the LMSYS leaderboard
Model metadata extraction including provider, parameter count, and release date
Score normalization for cross-benchmark comparison when possible
Puppeteer-based rendering to handle JavaScript-heavy leaderboard pages
Configurable benchmark selection to target specific evaluation metrics
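
The listing does not say how the actor normalizes scores, so the following is only a sketch of one common approach: min-max scaling within each benchmark, applied to items shaped like the actor's output. The `normalizedScore` field name and the `normalize_scores` helper are assumptions made for illustration, not part of the actor.

```python
from collections import defaultdict


def normalize_scores(items):
    """Rescale each item's score to [0, 1] within its own benchmark (min-max)."""
    by_benchmark = defaultdict(list)
    for item in items:
        by_benchmark[item["benchmark"]].append(item["score"])

    normalized = []
    for item in items:
        scores = by_benchmark[item["benchmark"]]
        low, high = min(scores), max(scores)
        # If every score in a benchmark is identical, the range is zero and no
        # meaningful rescaling exists, so report None instead of dividing by zero.
        value = None if high == low else (item["score"] - low) / (high - low)
        # "normalizedScore" is a hypothetical field name, not an actor output field.
        normalized.append({**item, "normalizedScore": value})
    return normalized
```

Min-max scaling only makes scores comparable in a relative sense (position within each leaderboard's range); it does not make an Elo rating and an accuracy percentage measure the same thing.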

Use Cases

Compare LLM performance across multiple benchmarks before selecting a model
Track model performance improvements over time with scheduled runs
Build automated reports on the AI model competitive landscape
Feed benchmark data into model selection pipelines and evaluation frameworks
Monitor when new models appear on leaderboards for competitive intelligence
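
As a rough illustration of the tracking and monitoring use cases, the sketch below compares the dataset items from two runs. It assumes each run's items have already been loaded as lists of dictionaries with the `benchmark`, `modelName`, and `rank` fields described under Output Format; the helper names are hypothetical.

```python
def find_new_models(previous_items, current_items):
    """Return current items whose (benchmark, modelName) pair was absent from the previous run."""
    seen = {(item["benchmark"], item["modelName"]) for item in previous_items}
    return [
        item
        for item in current_items
        if (item["benchmark"], item["modelName"]) not in seen
    ]


def rank_changes(previous_items, current_items):
    """Map (benchmark, modelName) to the rank delta between runs; positive means the model moved up."""
    previous_rank = {
        (item["benchmark"], item["modelName"]): item["rank"]
        for item in previous_items
    }
    changes = {}
    for item in current_items:
        key = (item["benchmark"], item["modelName"])
        if key in previous_rank and previous_rank[key] != item["rank"]:
            # Rank 1 is the top of the leaderboard, so old minus new is positive on improvement.
            changes[key] = previous_rank[key] - item["rank"]
    return changes
```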

Input Configuration

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| `benchmarks` | array | `["chatbot-arena"]` | Benchmarks to scrape |
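
A minimal sketch of preparing the input as JSON, assuming `benchmarks` is an array of string identifiers. Only `"chatbot-arena"` is confirmed by the default above; identifiers for other benchmarks are not listed here, so none are shown. `build_run_input` is a hypothetical helper.

```python
import json


def build_run_input(benchmarks=None):
    """Build the actor input, falling back to the documented default."""
    if benchmarks is None:
        benchmarks = ["chatbot-arena"]
    # Reject a bare string or empty entries early, since the input expects an array.
    if not isinstance(benchmarks, list) or not all(
        isinstance(name, str) and name for name in benchmarks
    ):
        raise ValueError("benchmarks must be a list of non-empty strings")
    return {"benchmarks": list(benchmarks)}


print(json.dumps(build_run_input(), indent=2))
```

The resulting object can be pasted into the actor's JSON input editor or passed as the run input through an API client.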

Output Format

Each model entry produces a dataset item with:

benchmark - Name of the benchmark source
modelName - Full model name or identifier
score - Benchmark score or Elo rating
rank - Position on the leaderboard
provider - Organization or company behind the model
parameters - Parameter count when available
scrapedAt - ISO timestamp of extraction
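
To make the item shape concrete, here is a hedged sketch that parses one dataset item into a typed record. The field names come from the list above; the Python types, the optional handling of `provider` and `parameters`, and the `BenchmarkItem` and `parse_item` names are assumptions, since the listing does not specify types.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional, Union


@dataclass
class BenchmarkItem:
    benchmark: str
    model_name: str
    score: float
    rank: int
    provider: Optional[str]
    # The listing does not say whether this is a number or a label, so keep it as-is.
    parameters: Optional[Union[int, float, str]]
    scraped_at: datetime


def parse_item(raw: dict) -> BenchmarkItem:
    """Convert one raw dataset item (a dict) into a BenchmarkItem."""
    # A trailing "Z" is rewritten because datetime.fromisoformat only accepts it
    # from Python 3.11 onward.
    scraped_at = datetime.fromisoformat(raw["scrapedAt"].replace("Z", "+00:00"))
    return BenchmarkItem(
        benchmark=raw["benchmark"],
        model_name=raw["modelName"],
        score=float(raw["score"]),
        rank=int(raw["rank"]),
        provider=raw.get("provider"),      # may be missing for some models
        parameters=raw.get("parameters"),  # may be missing for some models
        scraped_at=scraped_at,
    )
```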

Supported Benchmarks

This actor scrapes LMSYS Chatbot Arena, the Hugging Face Open LLM Leaderboard, and various benchmark result pages. Additional benchmark sources can be requested.

Limitations

Some leaderboards use complex React or Gradio rendering, so a scrape may need several attempts to succeed
Benchmark scores and rankings change frequently; schedule regular runs for the latest data
Parameter counts and release dates may not be available for all models
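
Because some leaderboards may need more than one attempt, a caller could wrap a run in a retry loop. The sketch below is one generic way to do that with exponential backoff; it is not part of the actor. `run_once` is a hypothetical callable that starts a run and returns its dataset items, and an empty result is treated as a failed attempt.

```python
import time


def run_with_retries(run_once, max_attempts=3, base_delay=2.0, sleep=time.sleep):
    """Call run_once() until it returns a non-empty list or attempts run out."""
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            items = run_once()
            if items:
                return items
            last_error = RuntimeError("run returned no items")
        except Exception as exc:  # any failure counts as a failed attempt
            last_error = exc
        if attempt < max_attempts:
            # Exponential backoff: base_delay, then 2x, 4x, ... between attempts.
            sleep(base_delay * 2 ** (attempt - 1))
    raise RuntimeError(f"all {max_attempts} attempts failed") from last_error
```

The `sleep` parameter is injectable so the loop can be exercised without real waiting.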
