HuggingFace Models Datasets Spaces Scraper - Low-costπŸ’²πŸ”₯πŸ€–πŸ€— avatar

HuggingFace Models Datasets Spaces Scraper - Low-costπŸ’²πŸ”₯πŸ€–πŸ€—

Pricing

from $0.00005 / actor start

Go to Apify Store
HuggingFace Models Datasets Spaces Scraper - Low-costπŸ’²πŸ”₯πŸ€–πŸ€—

HuggingFace Models Datasets Spaces Scraper - Low-costπŸ’²πŸ”₯πŸ€–πŸ€—

Scrape Hugging Face Models, Datasets & Spaces πŸ€–πŸ“Š with a powerful AI ecosystem scraper. Extract repository names, owners, tags, downloads, likes, update dates, source URLs and more from keyword searches. Ideal for AI research, model discovery, dataset analysis and machine learning intelligence πŸš€πŸŒ

Pricing

from $0.00005 / actor start

Rating

0.0

(0)

Developer

Prime Scrape

Prime Scrape

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

HuggingFace All-in-One Full-Text Search Scraper


πŸ€—πŸ”Ž HuggingFace All-in-One Full-Text Search Scraper | Models, Datasets & Spaces | Apify Actor

πŸš€ Extract Hugging Face Search Results in Bulk (No Code)

The HuggingFace All-in-One Full-Text Search Scraper (Apify Actor) is a powerful, scalable and SEO-optimized scraping tool designed to extract Models, Datasets and Spaces directly from Hugging Face full-text search results.

Whether you're researching AI models, tracking dataset adoption, monitoring machine learning trends, discovering open-source projects, or building AI intelligence datasets, this actor helps you collect structured repository-level and file-level search data at scale.


πŸ”₯ Why This Hugging Face Scraper?

βœ” All-in-One Models + Datasets + Spaces scraper

βœ” Bulk keyword search support (SEO BOOST πŸš€)

βœ” Full-text repository search automation

βœ” Repository-level + file-level match extraction

βœ” Structured JSON / CSV / Excel exports

βœ” Perfect for AI research & trend monitoring

βœ” No coding required

βœ” Fast & scalable cloud execution


🎯 What This Scraper Does

This Apify Actor performs automated full-text searches across the Hugging Face ecosystem and extracts structured search results.

πŸ“Œ Core Features

βœ… Search Hugging Face Models

βœ… Search Hugging Face Datasets

βœ… Search Hugging Face Spaces

βœ… Combine all content types in one run

βœ… Bulk keyword processing

βœ… Independent keyword tracking

βœ… Extract repository metadata

βœ… Extract matched file information

βœ… Extract code snippets

βœ… Extract tags & classifications

βœ… Auto-pagination handling

βœ… Structured export-ready output


⚑ Input Configuration (Simple & Powerful)

πŸ”₯ BULK KEYWORD MODE (SEO BOOST πŸš€)

{
"keywords": [
"bert",
"llama",
"stable-diffusion",
"rag",
"mistral",
"multimodal"
],
"searchTypes": [
"Models",
"Datasets",
"Spaces"
],
"maxItemsPerKeyword": 60
}

πŸ“Š Extracted Data Fields

FieldDescription
contentTypeModel, Dataset or Space
ownerRepository owner
repoNameRepository name
repoHrefRepository path
repoFullUrlFull repository URL
fileNameMatched file name
fileHrefFile path
fileFullUrlFull file URL
matchCountNumber of keyword matches
keywordSearch keyword
tagsParsed repository tags
tagsRawRaw tags string
codeSnippetExtracted matching content
searchTypesSelected content filters
sourceUrlOriginal Hugging Face search URL

πŸ’‘ Use Cases (High Demand AI SEO Keywords)

This Hugging Face scraper is ideal for:

πŸ€– AI model discovery

πŸ“Š Machine learning research

🧠 LLM ecosystem monitoring

πŸ”Ž Open-source AI intelligence

πŸ“ˆ AI trend analysis

πŸ“š Dataset discovery

⚑ Full-text repository search

🏒 Competitive AI research

πŸ“‘ AI monitoring pipelines

πŸ”„ Automated AI market intelligence

πŸš€ RAG project research

🎯 Generative AI tracking


πŸš€ Key Features

⚑ Bulk keyword scraping support

πŸ€– Models, Datasets & Spaces extraction

πŸ“Œ Full-text search automation

πŸ”Ž File-level match extraction

🧠 Repository intelligence gathering

πŸ“Š Structured output datasets

πŸ’Ύ Export-ready results

πŸ” Reliable cloud execution

βš™οΈ Apify-native scalability


πŸ“Š Preconfigured Dataset Views

The actor automatically generates ready-to-use dataset views.

πŸ”Ή Overview View

Includes:

β€’ Content Type

β€’ Repository Owner

β€’ Repository Name

β€’ Match Count

β€’ Keyword

β€’ Repository URL

β€’ Matched File

Perfect for quick analysis.

πŸ”Ή Detailed View

Includes:

β€’ Repository URLs

β€’ File URLs

β€’ Match counts

β€’ Tags

β€’ Code snippets

β€’ Search URLs

Ideal for:

πŸ€– AI research

πŸ“Š Dataset intelligence

πŸ”Ž Keyword monitoring

🧠 Repository analysis

πŸ”Ή By Keyword View

Group results by keyword.

Perfect for topic comparison.

πŸ”Ή By Type View

Group results by:

β€’ Models

β€’ Datasets

β€’ Spaces

Perfect for ecosystem distribution analysis.


πŸ“€ Output Formats Supported

βœ” JSON

βœ” CSV

βœ” Excel XLSX

βœ” XML

βœ” HTML


πŸ“¦ Example Output

{
"contentType": "dataset",
"owner": "Giannis79",
"repoName": "BERT_Journalism_Sentiment",
"repoHref": "/datasets/Giannis79/BERT_Journalism_Sentiment",
"repoFullUrl": "https://huggingface.co/datasets/Giannis79/BERT_Journalism_Sentiment",
"fileName": "README.md",
"fileHref": "/datasets/Giannis79/BERT_Journalism_Sentiment/blob/main/README.md?code=true",
"fileFullUrl": "https://huggingface.co/datasets/Giannis79/BERT_Journalism_Sentiment/blob/main/README.md?code=true",
"matchCount": "12 matches",
"tags": [
"region:us"
],
"tagsRaw": "tags: region:us",
"codeSnippet": "BERT Model Sentiment Analysis Project Overview...",
"keyword": "bert",
"searchTypes": [
"Datasets",
"Spaces"
],
"sourceUrl": "https://huggingface.co/search/full-text?q=bert&type=dataset&type=space"
}

πŸ”₯ Why This is the BEST Hugging Face Full-Text Search Scraper on Apify?

βœ” All-in-One search solution

βœ” Models + Datasets + Spaces support

βœ” Bulk keyword processing

βœ” File-level result extraction

βœ” AI ecosystem intelligence

βœ” Enterprise-ready scalability

βœ” SEO optimized marketplace listing

βœ” High-performance extraction engine


πŸ’Έ Pricing

This scraper runs on a pay-per-result pricing model.

You only pay for successfully extracted records.

πŸ’³ Price: $0.98 / 1,000 results


❓ FAQ (SEO BOOST SECTION)

Can I search multiple keywords at once?

Yes β€” bulk keyword mode is fully supported.

Can I scrape Models, Datasets and Spaces together?

Yes β€” all content types can be combined in a single run.

Does the scraper extract file-level matches?

Yes β€” matched files, URLs and snippets are included.

Is coding required?

No β€” 100% no-code Apify Actor.

Can I export the results?

Yes β€” JSON, CSV, Excel, XML and HTML are supported.

Is this useful for AI research?

Absolutely. It is designed specifically for AI ecosystem intelligence and trend monitoring.


⚠️ Disclaimer

This tool is an independent automation solution and is not affiliated with, endorsed by, or sponsored by Hugging Face.


  • Hugging Face Models Scraper - Cheap πŸ€—πŸ€–πŸ”Ž

  • GitHub Repositories Scraper πŸ“¦πŸ™πŸ”

And many more in the PrimeScrape ecosystem.


🌍 PrimeScrape Ecosystem

Built for large-scale:

πŸ€– AI intelligence

πŸ“Š Data extraction

πŸ“ˆ Market research

πŸ”Ž Search monitoring

🏒 Competitive intelligence

βš™οΈ Automation pipelines

🧠 AI training datasets

πŸš€ Enterprise scraping


πŸ“¬ Support

⭐⭐⭐⭐⭐ Leave a review if you enjoy this scraper.

πŸ“© Contact us for custom scraping solutions, enterprise automation projects, and private data extraction services.