Hugging Face Collections Scraper

Deprecated

Scrape Hugging Face curated collections of AI models, datasets & spaces. Browse trending, top-voted or filter by organization. No API key required.

Pricing: Pay per usage
Rating: 0.0 (0)
Developer: Stas Persiianenko
Last modified: 6 days ago

🤔 What does it do?

This actor scrapes Hugging Face Collections — curated groupings of models, datasets, and spaces organized by researchers, companies, and the community. You can:

🔥 Browse trending or most-upvoted public collections
👤 Fetch all collections from a specific user or organization (e.g., Google, Meta, Mistral AI)
🔗 Retrieve specific collections by slug for targeted extraction

For each collection, you get the title, description, owner info, upvotes, theme, all contained items (model/dataset/space IDs, types, authors, likes, downloads), and more.

The actor calls the official Hugging Face public API, so no login, API key, or browser automation is required.

👥 Who is it for?

🧑‍🔬 AI Researchers & Data Scientists

Monitor which model collections are gaining traction in the community. Track when leading labs (Google, Meta, Mistral, Cohere) publish new curated groupings of models. Use collection membership as a signal for quality filtering.

🏢 Enterprise AI Teams

Audit what models competitors have curated. Build automated pipelines to import model metadata from industry-relevant collections into your internal model registry or catalog.

📊 Market Intelligence Analysts

Track the AI ecosystem by monitoring trending collections over time. Identify which organizations are publishing the most influential model groupings. Measure upvote velocity as a proxy for community interest.

🛠️ MLOps & Tooling Developers

Build collection-aware model discovery tools. Automatically fetch and sync collection contents into your model management platform. Power recommendation engines with social signals from HF collections.

💡 Why use this actor?

No authentication required — Hugging Face Collections API is fully public
Fast & cheap — pure HTTP, no browser overhead, near-zero compute cost
Pagination handled — automatically follows cursor-based pagination to fetch any number of collections
Multiple modes — browse globally, filter by owner, or fetch specific slugs
Structured output — each item in a collection includes type, author, likes, downloads, pipeline tag
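
The cursor-based pagination mentioned above can be pictured with a small sketch. This is not the actor's actual code: `fetch_page` is a hypothetical callable standing in for one HTTP request, and its `(items, next_cursor)` return shape is an assumption made purely for illustration.

```python
from typing import Callable, Dict, Iterator, List, Optional, Tuple

# Hypothetical page fetcher: takes a cursor (None for the first page) and
# returns (items, next_cursor). next_cursor is None when no pages remain.
FetchPage = Callable[[Optional[str]], Tuple[List[Dict], Optional[str]]]


def iter_collections(fetch_page: FetchPage, max_collections: int) -> Iterator[Dict]:
    """Yield up to max_collections items, following cursors page by page."""
    cursor: Optional[str] = None
    yielded = 0
    while yielded < max_collections:
        items, cursor = fetch_page(cursor)
        for item in items:
            if yielded >= max_collections:
                return
            yield item
            yielded += 1
        if cursor is None:  # last page reached
            return


# Fake two-page data source, used only to demonstrate the loop offline.
_PAGES = {
    None: ([{"slug": "a"}, {"slug": "b"}], "page-2"),
    "page-2": ([{"slug": "c"}], None),
}


def fake_fetch(cursor: Optional[str]) -> Tuple[List[Dict], Optional[str]]:
    return _PAGES[cursor]


slugs = [c["slug"] for c in iter_collections(fake_fetch, max_collections=10)]
```

Stopping on either the limit or a missing cursor is what lets a caller ask for "any number" of collections without knowing how many pages exist.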

📦 Data extracted

| Field | Description |
| --- | --- |
| `slug` | Unique collection identifier (e.g., `google/gemma-2-...`) |
| `title` | Collection display title |
| `description` | Collection description |
| `ownerName` | Hugging Face username or organization name |
| `ownerType` | `user` or `org` |
| `ownerUrl` | Link to the owner's Hugging Face profile |
| `collectionUrl` | Full URL of the collection |
| `upvotes` | Number of upvotes |
| `theme` | Collection theme color |
| `private` | Whether the collection is private |
| `gating` | Whether access to the collection is gated |
| `lastUpdated` | ISO timestamp of the last update |
| `itemCount` | Number of items in the collection |
| `itemTypes` | Comma-separated item types (model, dataset, space) |
| `items` | Array of collection items with id, type, author, position, likes, downloads, pipelineTag, lastModified |
| `scrapedAt` | Extraction timestamp |

💰 How much does it cost to scrape Hugging Face collections?

This actor uses Pay-Per-Event (PPE) pricing — you pay only for what you extract.

| Tier | Start fee | Per collection |
| --- | --- | --- |
| FREE | $0.001 | $0.00115 |
| BRONZE | $0.001 | $0.001 |
| SILVER | $0.001 | $0.00078 |
| GOLD | $0.001 | $0.0006 |
| PLATINUM | $0.001 | $0.0004 |
| DIAMOND | $0.001 | $0.00028 |

Example costs (BRONZE tier):

20 trending collections → ~$0.021
100 collections from Meta → ~$0.101
500 top collections → ~$0.501

With a free Apify account (up to $5 free compute/month), you can extract roughly 4,300 collections per month at no cost, based on the FREE-tier rate of $0.00115 per collection.
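
The example costs above follow from one formula: the start fee plus the per-collection price times the number of collections. The sketch below reproduces that arithmetic from the pricing table; the function names are illustrative, and the budget helper assumes everything is extracted in a single run (each extra run would add another start fee).

```python
import math

# Prices copied from the pricing table above (USD).
START_FEE = 0.001
PER_COLLECTION = {
    "FREE": 0.00115,
    "BRONZE": 0.001,
    "SILVER": 0.00078,
    "GOLD": 0.0006,
    "PLATINUM": 0.0004,
    "DIAMOND": 0.00028,
}


def estimate_cost(tier: str, collections: int) -> float:
    """Estimated cost in USD of one run extracting `collections` collections."""
    if collections < 0:
        raise ValueError("collections must be non-negative")
    return round(START_FEE + collections * PER_COLLECTION[tier.upper()], 6)


def max_collections_for_budget(tier: str, budget: float) -> int:
    """Largest number of collections one run can extract within `budget` USD."""
    if budget < START_FEE:
        return 0
    return math.floor((budget - START_FEE) / PER_COLLECTION[tier.upper()])


bronze_20 = estimate_cost("BRONZE", 20)           # 20 trending collections
free_5usd = max_collections_for_budget("FREE", 5.0)
```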

🚀 How to use

Step 1 — Choose your mode

Select from three scraping modes:

Browse — gets trending or most-voted public collections
Owner — fetches all collections by a specific user or organization
Slugs — retrieves exact collections you specify

Step 2 — Configure limits

Set maxCollections to control how many collections to extract. Default is 100.

Step 3 — Run and export

Click Start and wait for results. Export to JSON, CSV, or connect to downstream workflows.
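
If you export JSON and want a flat, spreadsheet-friendly view, one option is to write one row per collection item. The sketch below uses only the standard library; the CSV column names (`itemId`, `itemType`, `itemLikes`) are illustrative choices, not part of the actor's output.

```python
import csv
import io
from typing import Dict, List


def collections_to_csv(records: List[Dict]) -> str:
    """Return CSV text with one row per item in each collection record."""
    fields = ["slug", "title", "upvotes", "itemId", "itemType", "itemLikes"]
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    for record in records:
        # Records without items (e.g. includeItems disabled) still get one row.
        items = record.get("items") or [{}]
        for item in items:
            writer.writerow({
                "slug": record["slug"],
                "title": record["title"],
                "upvotes": record["upvotes"],
                "itemId": item.get("id"),
                "itemType": item.get("type"),
                "itemLikes": item.get("likes"),
            })
    return buffer.getvalue()


sample = [{
    "slug": "google/gemma-2",
    "title": "Gemma 2",
    "upvotes": 1245,
    "items": [{"id": "google/gemma-2-2b", "type": "model", "likes": 1892}],
}]
csv_text = collections_to_csv(sample)
```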

⚙️ Input parameters

| Parameter | Type | Description | Default |
| --- | --- | --- | --- |
| `mode` | string | `browse`, `owner`, or `slugs` | `browse` |
| `sort` | string | `trending` or `upvotes` (browse mode only) | `trending` |
| `owner` | string | Username or org name (owner mode only) | (none) |
| `collectionSlugs` | array | List of collection slugs (slugs mode only) | `[]` |
| `maxCollections` | integer | Max collections to extract | `100` |
| `includeItems` | boolean | Include item details in output | `true` |
| `maxRequestRetries` | integer | Retry attempts per failed request | `3` |

Example inputs

Browse trending collections:

{
  "mode": "browse",
  "sort": "trending",
  "maxCollections": 50
}

Collections from a specific organization:

{
  "mode": "owner",
  "owner": "google",
  "maxCollections": 100
}

Fetch specific collections:

{
  "mode": "slugs",
  "collectionSlugs": [
    "google/gemma-2-665d5624d9e0312f5dfb1a1a",
    "meta-llama/llama-3-1-669233f0b30c5aa8b7b40b52"
  ]
}
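
When inputs are generated by a script, it can help to check the mode-specific parameters before starting a run. The helper below is a hypothetical client-side sketch: the parameter names and defaults come from the table above, but the validation rules (for example, requiring an `owner/name` shape for slugs) are assumptions and may be stricter or looser than what the actor itself enforces.

```python
from typing import Any, Dict, Optional, Sequence

VALID_MODES = {"browse", "owner", "slugs"}
VALID_SORTS = {"trending", "upvotes"}


def build_input(
    mode: str = "browse",
    *,
    sort: str = "trending",
    owner: Optional[str] = None,
    collection_slugs: Sequence[str] = (),
    max_collections: int = 100,
    include_items: bool = True,
) -> Dict[str, Any]:
    """Build an actor input dict, keeping only the fields relevant to the mode."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {sorted(VALID_MODES)}")
    if max_collections < 1:
        raise ValueError("max_collections must be at least 1")

    run_input: Dict[str, Any] = {
        "mode": mode,
        "maxCollections": max_collections,
        "includeItems": include_items,
    }
    if mode == "browse":
        if sort not in VALID_SORTS:
            raise ValueError(f"sort must be one of {sorted(VALID_SORTS)}")
        run_input["sort"] = sort
    elif mode == "owner":
        if not owner:
            raise ValueError("owner mode requires an owner name")
        run_input["owner"] = owner
    else:  # slugs mode
        if not collection_slugs:
            raise ValueError("slugs mode requires at least one slug")
        for slug in collection_slugs:
            prefix, _, name = slug.partition("/")
            if not prefix or not name:
                raise ValueError(f"slug lacks an owner prefix: {slug!r}")
        run_input["collectionSlugs"] = list(collection_slugs)
    return run_input


owner_input = build_input("owner", owner="google")
```

The resulting dict can be passed as `run_input` to the Python client call shown in the API usage section.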

📤 Output example

{
  "slug": "google/gemma-2-665d5624d9e0312f5dfb1a1a",
  "title": "Gemma 2",
  "description": "Google's Gemma 2 open models collection.",
  "ownerName": "google",
  "ownerType": "org",
  "ownerUrl": "https://huggingface.co/google",
  "collectionUrl": "https://huggingface.co/collections/google/gemma-2-665d5624d9e0312f5dfb1a1a",
  "upvotes": 1245,
  "theme": "blue",
  "private": false,
  "gating": false,
  "lastUpdated": "2025-12-01T10:00:00.000Z",
  "itemCount": 5,
  "itemTypes": "model",
  "items": [
    {
      "id": "google/gemma-2-2b",
      "type": "model",
      "author": "google",
      "position": 0,
      "likes": 1892,
      "downloads": 554321,
      "pipelineTag": "text-generation",
      "lastModified": "2025-11-20T08:00:00.000Z"
    }
  ],
  "scrapedAt": "2026-05-04T12:00:00.000Z"
}
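
A record shaped like the example above is easy to post-process. The sketch below computes a few per-collection aggregates from the `items` array; the summary keys (`typeCounts`, `totalLikes`, and so on) are illustrative names, not fields produced by the actor.

```python
from collections import Counter
from typing import Any, Dict


def summarize_collection(record: Dict[str, Any]) -> Dict[str, Any]:
    """Aggregate item-level fields of one collection record."""
    items = record.get("items") or []
    type_counts = Counter(item["type"] for item in items)
    top = max(items, key=lambda i: i.get("likes") or 0) if items else None
    return {
        "slug": record["slug"],
        "typeCounts": dict(type_counts),
        # Missing or null counters are treated as zero.
        "totalLikes": sum(item.get("likes") or 0 for item in items),
        "totalDownloads": sum(item.get("downloads") or 0 for item in items),
        "mostLikedItem": top["id"] if top else None,
    }


record = {
    "slug": "google/gemma-2-665d5624d9e0312f5dfb1a1a",
    "items": [
        {"id": "google/gemma-2-2b", "type": "model",
         "likes": 1892, "downloads": 554321},
    ],
}
summary = summarize_collection(record)
```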

💡 Tips & tricks

Use sort: "upvotes" for quality signals — collections with many upvotes tend to contain high-quality, vetted models
Owner mode is ideal for competitive intelligence — fetch all collections from google, meta-llama, mistralai, cohere regularly
Disable includeItems for fast metadata-only runs — useful when you just need collection counts and upvote rankings
Slugs mode for targeted monitoring — watch specific high-value collections (e.g., official Llama 3 collection) for new additions
Combine with HuggingFace Models Scraper — use collection item IDs as seeds to fetch full model details
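
For the last tip, the item IDs have to be pulled out of the collection records first. A minimal sketch, assuming records shaped like the output example; how the IDs are then passed to another scraper depends on that scraper's input schema, which is not covered here.

```python
from typing import Dict, Iterable, List


def seed_ids(records: Iterable[Dict], item_type: str = "model") -> List[str]:
    """Collect unique item IDs of one type, preserving first-seen order."""
    seen = set()
    ordered: List[str] = []
    for record in records:
        for item in record.get("items") or []:
            if item.get("type") != item_type:
                continue
            if item["id"] not in seen:
                seen.add(item["id"])
                ordered.append(item["id"])
    return ordered


records = [
    {"items": [{"id": "google/gemma-2-2b", "type": "model"},
               {"id": "org/some-dataset", "type": "dataset"}]},
    {"items": [{"id": "google/gemma-2-2b", "type": "model"},
               {"id": "google/gemma-2-9b", "type": "model"}]},
]
models = seed_ids(records)
```

Deduplicating matters because the same model often appears in several collections.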

🔌 Integrations

Google Sheets — AI model tracking dashboard

Use the Apify Google Sheets integration to append trending collection data weekly. Build a dashboard tracking which organizations are publishing new model groupings and their upvote velocity.
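
"Upvote velocity" is not a field in the output; one simple way to derive it is to compare two snapshots of the same collection taken at different times. The sketch below assumes velocity means upvotes gained per day and relies on the `slug`, `upvotes`, and `scrapedAt` fields.

```python
from datetime import datetime
from typing import Dict


def _parse_ts(value: str) -> datetime:
    # fromisoformat() on older Python versions rejects a trailing "Z".
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def upvote_velocity(earlier: Dict, later: Dict) -> float:
    """Upvotes gained per day between two snapshots of one collection."""
    if earlier["slug"] != later["slug"]:
        raise ValueError("snapshots belong to different collections")
    elapsed = _parse_ts(later["scrapedAt"]) - _parse_ts(earlier["scrapedAt"])
    days = elapsed.total_seconds() / 86400
    if days <= 0:
        raise ValueError("later snapshot must be newer than the earlier one")
    return (later["upvotes"] - earlier["upvotes"]) / days


week_1 = {"slug": "google/gemma-2", "upvotes": 1000,
          "scrapedAt": "2026-05-04T12:00:00.000Z"}
week_2 = {"slug": "google/gemma-2", "upvotes": 1070,
          "scrapedAt": "2026-05-11T12:00:00.000Z"}
velocity = upvote_velocity(week_1, week_2)  # upvotes per day
```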

Slack alerts on new trending collections

Chain this actor with the Slack integration to send a weekly digest of the top trending collections. Set maxCollections: 10 and sort: "trending" as the alert input.

Model catalog enrichment pipeline

Run this actor nightly for a curated list of organization slugs. Feed the output into your internal model registry to automatically tag models that appear in official company collections.

Make / Zapier automation

Use collection membership changes to trigger downstream workflows — e.g., automatically download or evaluate newly added models when a watched collection is updated.
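
Detecting a membership change means comparing the current run's record with one saved from a previous run. A minimal sketch, assuming both records were scraped with `includeItems` enabled; where the previous record is stored is left to your workflow.

```python
from typing import Dict, List, Set, Tuple


def _item_keys(record: Dict) -> Set[Tuple[str, str]]:
    # A model and a dataset can share an ID, so the type is part of the key.
    return {(item["type"], item["id"]) for item in record.get("items") or []}


def membership_changes(previous: Dict, current: Dict) -> Dict[str, List[str]]:
    """Return IDs added to and removed from a collection between two runs."""
    before, after = _item_keys(previous), _item_keys(current)
    return {
        "added": sorted(item_id for _, item_id in after - before),
        "removed": sorted(item_id for _, item_id in before - after),
    }


previous = {"items": [{"id": "org/model-a", "type": "model"},
                      {"id": "org/model-b", "type": "model"}]}
current = {"items": [{"id": "org/model-b", "type": "model"},
                     {"id": "org/model-c", "type": "model"}]}
changes = membership_changes(previous, current)
```

A non-empty `added` list is the natural trigger for a download or evaluation step.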

🖥️ API usage

Node.js (apify-client)

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

const run = await client.actor('automation-lab/huggingface-collections-scraper').call({
    mode: 'browse',
    sort: 'trending',
    maxCollections: 50,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python

from apify_client import ApifyClient

client = ApifyClient(token="YOUR_API_TOKEN")

run = client.actor("automation-lab/huggingface-collections-scraper").call(run_input={
    "mode": "owner",
    "owner": "google",
    "maxCollections": 100,
})

for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item["title"], item["upvotes"])

cURL

curl -X POST \
  "https://api.apify.com/v2/acts/automation-lab~huggingface-collections-scraper/runs" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "mode": "browse",
    "sort": "upvotes",
    "maxCollections": 100
  }'

🤖 MCP (Claude, Cursor, VS Code)

Use this actor directly from AI assistants via the Apify MCP server.

Claude Code:

claude mcp add --transport http apify "https://mcp.apify.com?tools=automation-lab/huggingface-collections-scraper"

Claude Desktop / Cursor / VS Code — add to your MCP config:

{
  "mcpServers": {
    "apify": {
      "type": "http",
      "url": "https://mcp.apify.com?tools=automation-lab/huggingface-collections-scraper",
      "headers": {
        "Authorization": "Bearer YOUR_API_TOKEN"
      }
    }
  }
}

Example prompts:

"Fetch the top 20 trending Hugging Face collections and list them by upvotes"
"Get all collections published by google on HuggingFace"
"Scrape the Llama 3 collection from meta-llama and show me the models it contains"

⚖️ Legality & terms of service

This actor uses Hugging Face's public API (huggingface.co/api/collections), which is freely accessible without authentication and intended for programmatic access. All data returned is publicly visible on the Hugging Face website.

Only public collections are accessible
Private or gated collections cannot be accessed
No login credentials are used or stored
Respects the public API rate limits
Use responsibly in accordance with Hugging Face Terms of Service

❓ FAQ

Q: Do I need a Hugging Face API key? No. The Collections API is fully public and requires no authentication.

Q: Can I scrape private collections? No. The public API only returns public collections. Private collections require HF authentication which this actor does not support.

Q: How many collections can I extract? In principle there is no fixed limit, because the actor paginates through all available results. In practice, Hugging Face has tens of thousands of public collections.

Q: Why am I getting fewer results than expected? Some owners may have no public collections, or fewer collections may match your mode and filters than your maxCollections limit. Check the actor logs for details.

Q: The actor ran but returned 0 results — what happened? For owner mode, verify the username/org name is correct (case-sensitive, e.g., google not Google). For slugs mode, verify the full slug including the owner prefix (e.g., google/gemma-2-abc123).

Q: Can I get the full model details for items in a collection? This actor returns the core item metadata (ID, type, author, likes, downloads). For full model cards and metadata, use the Hugging Face Models Scraper with the model IDs as input.

🔗 Related scrapers

Hugging Face Scraper — scrape model cards, parameters, and full metadata
Hugging Face Datasets Scraper — extract dataset metadata and download links
Hugging Face Papers Scraper — scrape ML research papers with AI summaries
Hugging Face Spaces Scraper — discover and extract AI demo spaces
