Pricing

Pay per usage

Huggingface Model Scraper

Huggingface Model Scraper. Extract structured data with automatic pagination, proxy rotation, and JSON/CSV export. Pay only for results.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Donny

Actor stats

Bookmarked

Total users

Monthly active users

15 hours ago

Last modified

Hugging Face Model Scraper

What it does

Hugging Face Model Scraper extracts detailed information about machine learning models hosted on the Hugging Face Hub. It queries the Hugging Face API to retrieve model metadata including model IDs, authors, download counts, likes, tags, pipeline types, and modification dates. You can search for models by keyword, task type, or any search term supported by the Hugging Face platform. Results are sorted by download count to surface the most popular and widely-used models first.

Why use it

The Hugging Face Hub hosts hundreds of thousands of machine learning models, making manual discovery and comparison time-consuming. This actor automates the process of searching and cataloging models, which is valuable for ML engineers evaluating model options, researchers tracking model popularity trends, and companies conducting competitive analysis in the AI space. By extracting structured data, you can easily compare models across metrics like downloads, likes, and supported tasks without manually browsing through individual model pages.

How it works

The actor accepts a search query and maximum results count as input parameters.
It constructs a request to the Hugging Face API endpoint with the specified search parameters.
Using CheerioCrawler, it fetches and parses the JSON response from the API.
For each model returned, it extracts key metadata fields and formats them into a consistent structure.
Results are sorted by download count (descending) and pushed to the Apify dataset.
If no models match the search query, a fallback record is created to indicate empty results.

Input parameters

Parameter	Type	Default	Description
`searchQuery`	String	`text-generation`	Search term for finding models (e.g., text-generation, sentiment-analysis, gpt)
`maxResults`	Integer	`50`	Maximum number of models to return (1-200)

Output fields

Field	Type	Description
`modelId`	String	Full model identifier (author/model-name)
`author`	String	Model author or organization
`downloads`	Number	Total download count
`likes`	Number	Number of likes on the model page
`tags`	Array	List of tags associated with the model
`pipeline`	String	Pipeline task type (e.g., text-generation, image-classification)
`lastModified`	String	Date of the last modification
`url`	String	Direct link to the model page on Hugging Face

Cost estimate

This actor uses Cheerio-based scraping with minimal resource consumption. A typical run fetching 50 models costs approximately $0.001 in Apify platform credits. The default 1024 MB memory setting provides ample resources for all standard queries.

Tips

Use specific pipeline task names like "text-generation", "text-classification", or "image-segmentation" for more targeted results.
Increase maxResults to 200 when you need comprehensive coverage of available models for a particular task.
Schedule regular runs to track how model popularity changes over time.
Combine results with the OpenAI Status Monitor to maintain a full picture of the AI tooling landscape.
Check out the ArXiv Paper Search actor to find the research papers behind popular models.

Hugging Face Model Scraper

parseforge/hugging-face-model-scraper

Collect models from Hugging Face Hub via public API endpoints. Get metadata including author, downloads, likes, lastModified, task, library, license, tags and filenames.

ParseForge

5.0

(3)

Hugging Face Model Scraper - Extract AI Model Data

tropical_quince/huggingface-model-scraper

Scrape Hugging Face model hub. Extract model names, downloads, likes, tasks, frameworks, and model card metadata from the AI model repository.

Donny Nguyen

Hugging Face Model Scraper

consummate_mandala/huggingface-model-scraper

Donny Nguyen

Hugging Face Model & Dataset Scraper

cloud9_ai/huggingface-scraper

Search and extract ML models and datasets from Hugging Face Hub. Get model cards, download stats, tasks, and architectures. No API key needed.

cloud9

Auto Video Thumbnail Generator

parseforge/auto-video-thumbnail-generator

Generate eye-catching video thumbnails automatically using Google Gemini AI. Upload a video or provide a URL, and get multiple high-quality thumbnail options in seconds. Perfect for content creators, marketers, and video editors who need professional thumbnails without design skills.

ParseForge

5.0

(2)

StubHub Event Scraper: Tickets & Venues Data

parseforge/stubhub-scraper

Extract event listings from StubHub including concerts, sports, theater shows. Get real-time data on event names, dates, venues, pricing, and locations. Supports search queries, geography filters, categories, and performers. Perfect for market research, competitive analysis, and event discovery.

ParseForge

5.0

(2)

Scrape Github Trending — Repos, Stars & Dependencies

tropical_quince/github-trending-scraper

Scrape github trending data at scale with this powerful Apify actor. Extracts repos, stars & dependencies with automatic pagination and proxy rotation. Perfect for market research, competitive intelligence, and data-driven decision making.

Donny Nguyen

GitHub Release Monitor

tropical_quince/github-release-monitor

Track GitHub releases across repositories. Extract release names, tags, dates, changelogs, and download URLs.

Donny Nguyen

Track Github Stars — Repos, Stars & Dependencies

tropical_quince/github-stars-tracker

Track github stars data at scale with this powerful Apify actor. Extracts repos, stars & dependencies with automatic pagination and proxy rotation. Perfect for market research, competitive intelligence, and data-driven decision making.

Donny Nguyen