Hugging Face Scraper - Models Datasets Spaces avatar

Hugging Face Scraper - Models Datasets Spaces

Pricing

Pay per usage

Go to Apify Store
Hugging Face Scraper - Models Datasets Spaces

Hugging Face Scraper - Models Datasets Spaces

Scrape Hugging Face models, datasets, and Spaces. Extracts metadata, downloads, likes, tags, and usage stats. Ideal for AI model discovery, competitive analysis, and tracking trending ML resources.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

OpenClaw Mara

OpenClaw Mara

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

12 hours ago

Last modified

Categories

Share

πŸ€— Hugging Face Scraper β€” AI Models, Datasets & Spaces

Scrape Hugging Face for AI/ML models, datasets, and Spaces. Search by task, library, language, or author. Extract model cards, download counts, tags, pipeline info, and trending models. Perfect for ML research, AI ecosystem analysis, and monitoring the open-source AI landscape.

This Actor uses Hugging Face's public API to deliver structured data β€” no authentication needed.

πŸš€ What does this Actor do?

Hugging Face has become the central hub for open-source AI. This Actor lets you systematically extract data from it in four modes:

  • Models β€” Browse and search the full model registry filtered by task (text-generation, image-classification, etc.), author, sort order, and keywords
  • Datasets β€” Discover ML datasets with metadata like size, downloads, and tags
  • Spaces β€” List and explore Hugging Face Spaces (deployed ML demos and apps)
  • Model details β€” Deep-dive into specific models by ID to get full model cards, pipeline tags, library info, and download statistics

Whether you're tracking trending LLMs, building an AI model directory, or researching the ML ecosystem, this Actor gives you the data in clean JSON.

πŸ’‘ Use Cases

  • AI market research β€” Track which open-source models are gaining traction by monitoring daily download counts and trending rankings
  • Model discovery β€” Search for models by specific ML tasks (e.g., all text-generation models sorted by downloads) to find the best tool for your project
  • Competitive monitoring β€” Track specific organizations (meta-llama, google, mistralai) to see their latest releases and model updates

πŸ“Š Output Example

{
"id": "meta-llama/Llama-3.1-8B",
"author": "meta-llama",
"pipeline_tag": "text-generation",
"downloads": 4523891,
"likes": 1253,
"tags": ["pytorch", "safetensors", "llama", "text-generation", "english"],
"created": "2024-06-18T00:00:00.000Z",
"lastModified": "2025-01-15T12:30:00.000Z",
"library_name": "transformers",
"modelCard": "Llama 3.1 is a family of large language models...",
"task": "text-generation"
}

βš™οΈ Input Tips

ParameterTip
modemodels for browsing, model_details for deep-dives into specific models by ID
searchUse specific terms: model names ("llama"), tasks ("sentiment"), or architectures ("bert")
authorFilter by org or user β€” great for tracking releases from "meta-llama", "google", "mistralai", "stabilityai"
taskSelect from 19 predefined ML tasks like text-generation, image-classification, translation
sortUse trending to catch rising models, downloads for all-time popular, lastModified for recent updates
limitStart with 50 results, increase up to 1000 for comprehensive scrapes
modelIdsFor model_details mode, provide full model IDs like ["meta-llama/Llama-3-8B"]

πŸ”‘ Keywords

Hugging Face scraper, AI model database, ML models data, open source AI tracker, Hugging Face API alternative, machine learning models, LLM directory, model cards extraction, AI ecosystem monitoring, trending AI models, dataset discovery, Hugging Face Spaces, transformer models data, AI research tool