Pricing

Pay per event

ModelScope Model Catalog Scraper

Scrape the ModelScope (modelscope.cn) AI model catalog — China's Alibaba-backed model hub. Export model IDs, tasks, frameworks, download stats, stars, licenses, and READMEs.

Developer

BowTiedRaccoon

Last modified

2 hours ago

What it does

Sweeps the ModelScope JSON API task-by-task (text-generation, image-generation, multimodal, and 26 other task categories), deduplicates across task overlaps, and optionally enriches each model record with the full README from the per-model detail endpoint.
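
The deduplication step can be pictured with a minimal sketch. It assumes each task sweep yields a list of records keyed by `model_id` and that the first task a model is found under becomes its `task` value. The function name `dedupe_models` and the sample IDs are hypothetical, and the Actor's actual implementation may differ.

```python
from typing import Dict, List


def dedupe_models(sweeps: Dict[str, List[dict]]) -> List[dict]:
    """Keep one record per model_id across all task sweeps.

    `sweeps` maps a task slug to the records discovered under that task.
    Assumption: the first task a model appears under is kept as `task`.
    """
    seen: Dict[str, dict] = {}
    for task, records in sweeps.items():
        for record in records:
            # setdefault keeps the first record seen for this model_id.
            seen.setdefault(record["model_id"], {**record, "task": task})
    return list(seen.values())


# Illustrative data only: one model shows up under two tasks.
sample_sweeps = {
    "text-generation": [{"model_id": "org-a/model-x"}, {"model_id": "org-b/model-y"}],
    "multimodal": [{"model_id": "org-a/model-x"}],
}
models = dedupe_models(sample_sweeps)
```

Keying on `model_id` rather than `name` avoids collapsing two models that share a name under different namespaces.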

Output fields per model:

model_id — full identifier (namespace/name)
namespace, name — publisher slug and model name
chinese_name — display name in Chinese if present
task — primary task tag used for discovery
tasks_all — all task tags, pipe-separated
frameworks — ML frameworks (pytorch, tensorflow, mindspore, etc.), pipe-separated
languages — supported languages (en, zh, multilingual, etc.), pipe-separated
license — SPDX identifier (apache-2.0, mit, etc.)
downloads_30d — downloads in the last 30 days
stars — star count
last_updated, created_at — ISO-8601 timestamps
readme_text — README content, truncated to 8 KB (requires includeDetails: true)
model_size_params — parameter count label when tagged (7B, 72B, MoE-22B-A2B)
quantization_variants — available quantization types from tensor metadata, pipe-separated
base_model — base model ID if this is a fine-tune
publisher_org, publisher_url — organization name and profile URL
has_demo, has_inference_api — boolean flags

Input

Field	Type	Default	Description
`tasks`	array	(all tasks)	Limit to specific task slugs (e.g. `text-generation`, `image-generation`). Leave empty to sweep all 29 canonical tasks.
`maxItems`	integer	100	Maximum number of models to return. Set to `0` for unlimited (full catalog run).
`includeDetails`	boolean	true	Fetch the per-model detail endpoint for full README text and quantization variant metadata. Disabling this speeds up runs but leaves `readme_text` and `quantization_variants` empty.
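
As an illustration of the input fields above, the following sketch builds a run input with the documented defaults and prints it as JSON. The helper `build_input` and its negative-value check are assumptions made for this example, not part of the Actor.

```python
import json
from typing import Iterable, Optional


def build_input(
    tasks: Optional[Iterable[str]] = None,
    max_items: int = 100,
    include_details: bool = True,
) -> dict:
    """Build an input object using the field names and defaults from the table."""
    if max_items < 0:
        # Assumption: negative values are meaningless; 0 means unlimited.
        raise ValueError("max_items must be 0 (unlimited) or a positive integer")
    return {
        "tasks": list(tasks or []),  # empty list = sweep all tasks
        "maxItems": max_items,
        "includeDetails": include_details,
    }


# A quick, targeted run: two tasks, 50 models, no README enrichment.
run_input = build_input(
    tasks=["text-generation", "image-generation"],
    max_items=50,
    include_details=False,
)
print(json.dumps(run_input, indent=2))
```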

Example use cases

West+East parity datasets — pair with the HuggingFace Model Scraper to build a combined index of both Western and Chinese open-weights releases (Qwen, DeepSeek, Yi, GLM, InternLM, ERNIE, MiniMax, etc.).
Model landscape research — filter by task, framework, or license to survey which Chinese labs are publishing in specific domains.
Download trend tracking — schedule regular runs and track downloads_30d growth for specific namespaces or model families.
README content analysis — extract model cards from readme_text for NLP-based capability assessment or feature extraction.
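
For the download trend use case, a minimal sketch of comparing two scheduled runs might look like this. It assumes both runs have been exported as lists of records containing `model_id` and `downloads_30d`. The function name and the sample numbers are hypothetical.

```python
from typing import Dict, List


def downloads_growth(previous: List[dict], current: List[dict]) -> Dict[str, int]:
    """Return the change in downloads_30d for models present in both runs."""
    before = {r["model_id"]: r["downloads_30d"] for r in previous}
    return {
        r["model_id"]: r["downloads_30d"] - before[r["model_id"]]
        for r in current
        if r["model_id"] in before  # models new in this run have no baseline
    }


# Illustrative snapshots from two runs.
last_run = [{"model_id": "org-a/model-x", "downloads_30d": 1000}]
this_run = [
    {"model_id": "org-a/model-x", "downloads_30d": 1250},
    {"model_id": "org-b/model-y", "downloads_30d": 40},
]
growth = downloads_growth(last_run, this_run)
```

Because `downloads_30d` is a rolling 30-day window, the difference between two runs reflects a shift in that window rather than a count of new downloads since the previous run.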

Notes

The API requires no authentication. No proxy is needed — direct access from Apify infrastructure works without restriction.
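Full catalog sweeps (all tasks, includeDetails: true) are long-running because every model triggers an extra request to the detail endpoint. Use maxItems to cap output for targeted queries, or set includeDetails: false when README text and quantization variants are not needed.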
Array output fields (tasks_all, frameworks, languages, quantization_variants) use | as separator for flat dataset compatibility. Split on | in downstream processing.
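
A minimal sketch of that downstream split, assuming records arrive as plain dictionaries. The helper name `expand_pipe_fields` and the sample record are hypothetical.

```python
PIPE_FIELDS = ("tasks_all", "frameworks", "languages", "quantization_variants")


def expand_pipe_fields(record: dict) -> dict:
    """Return a copy of `record` with pipe-separated fields turned into lists."""
    expanded = dict(record)
    for field in PIPE_FIELDS:
        raw = expanded.get(field) or ""
        # Empty or missing values become an empty list rather than [""].
        expanded[field] = [part for part in raw.split("|") if part]
    return expanded


# Illustrative record with one populated, one empty, and two missing fields.
row = {
    "model_id": "org-a/model-x",
    "frameworks": "pytorch|tensorflow",
    "quantization_variants": "",
}
parsed = expand_pipe_fields(row)
```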
