HuggingFace Models Datasets Spaces Scraper - Low-costπ²π₯π€π€
Pricing
from $0.00005 / actor start
HuggingFace Models Datasets Spaces Scraper - Low-costπ²π₯π€π€
Scrape Hugging Face Models, Datasets & Spaces π€π with a powerful AI ecosystem scraper. Extract repository names, owners, tags, downloads, likes, update dates, source URLs and more from keyword searches. Ideal for AI research, model discovery, dataset analysis and machine learning intelligence ππ
Pricing
from $0.00005 / actor start
Rating
0.0
(0)
Developer
Prime Scrape
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
3 days ago
Last modified
Categories
Share
π€π HuggingFace All-in-One Full-Text Search Scraper | Models, Datasets & Spaces | Apify Actor
π Extract Hugging Face Search Results in Bulk (No Code)
The HuggingFace All-in-One Full-Text Search Scraper (Apify Actor) is a powerful, scalable and SEO-optimized scraping tool designed to extract Models, Datasets and Spaces directly from Hugging Face full-text search results.
Whether you're researching AI models, tracking dataset adoption, monitoring machine learning trends, discovering open-source projects, or building AI intelligence datasets, this actor helps you collect structured repository-level and file-level search data at scale.
π₯ Why This Hugging Face Scraper?
β All-in-One Models + Datasets + Spaces scraper
β Bulk keyword search support (SEO BOOST π)
β Full-text repository search automation
β Repository-level + file-level match extraction
β Structured JSON / CSV / Excel exports
β Perfect for AI research & trend monitoring
β No coding required
β Fast & scalable cloud execution
π― What This Scraper Does
This Apify Actor performs automated full-text searches across the Hugging Face ecosystem and extracts structured search results.
π Core Features
β Search Hugging Face Models
β Search Hugging Face Datasets
β Search Hugging Face Spaces
β Combine all content types in one run
β Bulk keyword processing
β Independent keyword tracking
β Extract repository metadata
β Extract matched file information
β Extract code snippets
β Extract tags & classifications
β Auto-pagination handling
β Structured export-ready output
β‘ Input Configuration (Simple & Powerful)
π₯ BULK KEYWORD MODE (SEO BOOST π)
{"keywords": ["bert","llama","stable-diffusion","rag","mistral","multimodal"],"searchTypes": ["Models","Datasets","Spaces"],"maxItemsPerKeyword": 60}
π Extracted Data Fields
| Field | Description |
|---|---|
| contentType | Model, Dataset or Space |
| owner | Repository owner |
| repoName | Repository name |
| repoHref | Repository path |
| repoFullUrl | Full repository URL |
| fileName | Matched file name |
| fileHref | File path |
| fileFullUrl | Full file URL |
| matchCount | Number of keyword matches |
| keyword | Search keyword |
| tags | Parsed repository tags |
| tagsRaw | Raw tags string |
| codeSnippet | Extracted matching content |
| searchTypes | Selected content filters |
| sourceUrl | Original Hugging Face search URL |
π‘ Use Cases (High Demand AI SEO Keywords)
This Hugging Face scraper is ideal for:
π€ AI model discovery
π Machine learning research
π§ LLM ecosystem monitoring
π Open-source AI intelligence
π AI trend analysis
π Dataset discovery
β‘ Full-text repository search
π’ Competitive AI research
π‘ AI monitoring pipelines
π Automated AI market intelligence
π RAG project research
π― Generative AI tracking
π Key Features
β‘ Bulk keyword scraping support
π€ Models, Datasets & Spaces extraction
π Full-text search automation
π File-level match extraction
π§ Repository intelligence gathering
π Structured output datasets
πΎ Export-ready results
π Reliable cloud execution
βοΈ Apify-native scalability
π Preconfigured Dataset Views
The actor automatically generates ready-to-use dataset views.
πΉ Overview View
Includes:
β’ Content Type
β’ Repository Owner
β’ Repository Name
β’ Match Count
β’ Keyword
β’ Repository URL
β’ Matched File
Perfect for quick analysis.
πΉ Detailed View
Includes:
β’ Repository URLs
β’ File URLs
β’ Match counts
β’ Tags
β’ Code snippets
β’ Search URLs
Ideal for:
π€ AI research
π Dataset intelligence
π Keyword monitoring
π§ Repository analysis
πΉ By Keyword View
Group results by keyword.
Perfect for topic comparison.
πΉ By Type View
Group results by:
β’ Models
β’ Datasets
β’ Spaces
Perfect for ecosystem distribution analysis.
π€ Output Formats Supported
β JSON
β CSV
β Excel XLSX
β XML
β HTML
π¦ Example Output
{"contentType": "dataset","owner": "Giannis79","repoName": "BERT_Journalism_Sentiment","repoHref": "/datasets/Giannis79/BERT_Journalism_Sentiment","repoFullUrl": "https://huggingface.co/datasets/Giannis79/BERT_Journalism_Sentiment","fileName": "README.md","fileHref": "/datasets/Giannis79/BERT_Journalism_Sentiment/blob/main/README.md?code=true","fileFullUrl": "https://huggingface.co/datasets/Giannis79/BERT_Journalism_Sentiment/blob/main/README.md?code=true","matchCount": "12 matches","tags": ["region:us"],"tagsRaw": "tags: region:us","codeSnippet": "BERT Model Sentiment Analysis Project Overview...","keyword": "bert","searchTypes": ["Datasets","Spaces"],"sourceUrl": "https://huggingface.co/search/full-text?q=bert&type=dataset&type=space"}
π₯ Why This is the BEST Hugging Face Full-Text Search Scraper on Apify?
β All-in-One search solution
β Models + Datasets + Spaces support
β Bulk keyword processing
β File-level result extraction
β AI ecosystem intelligence
β Enterprise-ready scalability
β SEO optimized marketplace listing
β High-performance extraction engine
πΈ Pricing
This scraper runs on a pay-per-result pricing model.
You only pay for successfully extracted records.
π³ Price: $0.98 / 1,000 results
β FAQ (SEO BOOST SECTION)
Can I search multiple keywords at once?
Yes β bulk keyword mode is fully supported.
Can I scrape Models, Datasets and Spaces together?
Yes β all content types can be combined in a single run.
Does the scraper extract file-level matches?
Yes β matched files, URLs and snippets are included.
Is coding required?
No β 100% no-code Apify Actor.
Can I export the results?
Yes β JSON, CSV, Excel, XML and HTML are supported.
Is this useful for AI research?
Absolutely. It is designed specifically for AI ecosystem intelligence and trend monitoring.
β οΈ Disclaimer
This tool is an independent automation solution and is not affiliated with, endorsed by, or sponsored by Hugging Face.
π Related Actors
-
Hugging Face Models Scraper - Cheap π€π€π
-
GitHub Repositories Scraper π¦ππ
And many more in the PrimeScrape ecosystem.
π PrimeScrape Ecosystem
Built for large-scale:
π€ AI intelligence
π Data extraction
π Market research
π Search monitoring
π’ Competitive intelligence
βοΈ Automation pipelines
π§ AI training datasets
π Enterprise scraping
π¬ Support
βββββ Leave a review if you enjoy this scraper.
π© Contact us for custom scraping solutions, enterprise automation projects, and private data extraction services.