Hugging Face Models Scraper - Cheap ๐Ÿค—๐Ÿค–๐Ÿ”Ž avatar

Hugging Face Models Scraper - Cheap ๐Ÿค—๐Ÿค–๐Ÿ”Ž

Pricing

$19.89/month + usage

Go to Apify Store
Hugging Face Models Scraper - Cheap ๐Ÿค—๐Ÿค–๐Ÿ”Ž

Hugging Face Models Scraper - Cheap ๐Ÿค—๐Ÿค–๐Ÿ”Ž

๐ŸŸ  Easily collect Models from Hugging Face Provide one or multiple search keywords and extract structured model data including model name, owner, likes, downloads, tags, last update date, match count & more ๐Ÿค–๐Ÿ“Š Perfect for AI model research, popularity tracking & model ecosystem monitoring ๐Ÿš€

Pricing

$19.89/month + usage

Rating

5.0

(1)

Developer

Storm_Scraper

Storm_Scraper

Maintained by Community

Actor stats

2

Bookmarked

2

Total users

1

Monthly active users

10 hours ago

Last modified

Categories

Share

Hugging Face Models Full-Text Search Scraper ๐Ÿš€๐Ÿ”Ž๐Ÿค–

The Hugging Face Models Full-Text Search Scraper is a powerful automation tool designed to extract model search results only from Hugging Face.

Whether you're researching AI architectures, analyzing open-source implementations, tracking keyword usage in repositories, or building ML intelligence datasets โ€” this scraper helps you collect structured model-level search data quickly and reliably.


๐Ÿ” What It Does

Simply provide one or more keywords, and the scraper will:

โœ… Search Models only (no Datasets or Spaces)

โœ… Process multiple keywords independently

โœ… Respect maxItemsPerKeyword limits

โœ… Extract repository-level and file-level match data

โœ… Provide ready-to-use dataset views


๐Ÿ“Š Data Extracted

๐Ÿ”น Overview Fields

FieldDescription
๐Ÿ‘ค ownerModel repository owner
๐Ÿค– repoNameModel repository name
๐Ÿ” matchCountNumber of keyword matches found
๐Ÿ”‘ keywordSearch keyword used
๐Ÿ”— repoFullUrlFull model page URL
๐Ÿ“„ fileFullUrlURL of matched file

๐Ÿ”น Detailed Fields

FieldDescription
ownerModel owner
repoNameModel name
repoHrefRepository relative path
repoFullUrlFull model URL
fileNameFile containing match
fileHrefFile relative path
fileFullUrlFull file URL
matchCountNumber of matches
tagsParsed model tags
tagsRawRaw tags data
codeSnippetExtracted matching snippet
keywordSearch keyword
sourceUrlSearch results URL

๐Ÿ›  How to Use

1๏ธโƒฃ Deploy the Actor on Apify

2๏ธโƒฃ Provide one or more keywords (e.g., bert, llama, stable-diffusion)

3๏ธโƒฃ Set maxItemsPerKeyword

4๏ธโƒฃ Run the scraper

5๏ธโƒฃ Export your data in:

โœ… JSON

โœ… CSV

โœ… Excel

โœ… XML

โœ… HTML


๐Ÿ’ธ Pricing

This scraper runs on a monthly subscription model.

You only pay for successful runs.

๐Ÿ’ณ Price: $19.89 / month


If you're interested in other Rutube, E-commerce, Events, Real Estate, Jobs, Company Leads, YouTube or Facebook scraping solutions, check out these related tools:


You can even apply sentiment analysis on the data text we've extracted! ๐Ÿ˜ƒ๐Ÿ“Š:

โš™๏ธ Input Configuration

๐Ÿ“ฅ Input Example

{
"keywords": ["bert", "llama"],
"maxItemsPerKeyword": 100
}

Input Fields

FieldTypeDescription
keywordsArrayOne or more keywords to search for models (required)
maxItemsPerKeywordIntegerMaximum number of results per keyword (0 = unlimited until pagination stops automatically)

๐Ÿ“ค Output Example

{
"owner": "google-bert",
"repoName": "bert-base-uncased",
"repoHref": "/google-bert/bert-base-uncased",
"repoFullUrl": "https://huggingface.co/google-bert/bert-base-uncased",
"fileName": "README.md",
"fileHref": "/google-bert/bert-base-uncased/blob/main/README.md?code=true",
"fileFullUrl": "https://huggingface.co/google-bert/bert-base-uncased/blob/main/README.md?code=true",
"matchCount": "40 matches",
"tags": [
"transformers",
"pytorch",
"tf",
"jax",
"rust",
"coreml",
"onnx",
"safetensors",
"bert",
"fill-mask",
"exbert",
"en",
"dataset:bookcorpus",
"dataset:wikipedia",
"arxiv:1810.04805",
"license:apache-2.0",
"endpoints_compatible",
"deploy:azure",
"region:us"
],
"tagsRaw": "tags: transformers, pytorch, tf, jax, rust, coreml, onnx, safetensors, bert, fill-mask, exbert, en, dataset:bookcorpus, dataset:wikipedia, arxiv:1810.04805, license:apache-2.0, endpoints_compatible, deploy:azure, region:us",
"codeSnippet": "# BERT base model (uncased)\nPretrained model on English language using a masked language modeling (MLM) objective. It was introduced in\n[this paper](https://arxiv.org/abs/1810.04805) and first released in\n[this repository](https://github.com/google-research/bert). This model is uncased: it does not make a difference",
"keyword": "bert",
"sourceUrl": "https://huggingface.co/search/full-text?q=bert&type=model"
}

๐Ÿ“Š Preconfigured Dataset Views

๐Ÿ”น Overview

Clean table including:

Owner

Model Name

Match Count

Keyword

Model Page

Matched File

Perfect for fast keyword-based model comparison.

๐Ÿ”น Detailed

Extended dataset including:

Repository paths

File-level matches

Tags (parsed & raw)

Code snippets

Source search URL

Ideal for:

๐Ÿค– AI architecture research ๐Ÿ”Ž Code-level keyword discovery ๐Ÿ“Š Model ecosystem intelligence ๐Ÿ“ˆ Technical trend monitoring

๐Ÿ”น By Keyword

Grouped by keyword:

Keyword

Owner

Model

Match count

Model page

Perfect for comparing model coverage across multiple AI topics.

๐ŸŒ Why Use This Scraper?

๐Ÿ“Š Model Ecosystem Intelligence

๐Ÿ”Ž Full-Text Repository Search Automation

๐Ÿค– AI Architecture Research

๐Ÿ“ˆ Trend & Keyword Monitoring

โšก Scalable โ€” From niche keyword scans to broad AI research

๐Ÿค– Automation Ready โ€” Schedule recurring monitoring


Disclaimer

This scraper is an independent automation tool and is not affiliated with, endorsed by, or sponsored by Hugging Face.


๐Ÿ“ซ Support & Contact

๐Ÿ˜Š Leave a 5-star rating โญโญโญโญโญ if youโ€™re satisfied

๐ŸŒช๏ธ Storm Scraper https://apify.com/scrapestorm

For questions, feature requests, or custom scraping solutions, contact us directly via Apify or email.