Civitai Models Scraper
Deprecated
Pricing: Pay per usage
Rating: 0.0 (0)
Developer: Stas Persiianenko (maintained by Community)
Actor stats: 0 bookmarked, 2 total users, 1 monthly active user
Last modified: 6 days ago
Scrape AI models from Civitai — the largest AI model sharing platform. Extract model names, types, download counts, ratings, creator info, version details, file metadata, tags, trained words, and licensing information.
What does Civitai Models Scraper do?
Civitai Models Scraper uses Civitai's public REST API to extract structured data about AI models — Stable Diffusion checkpoints, LoRAs, ControlNets, VAEs, and more. It collects complete model metadata including download statistics, community ratings, version history, file sizes, trained trigger words, base model compatibility, and commercial licensing terms.
The scraper supports keyword search, model type filtering, configurable sorting, and cursor-based pagination to handle large result sets efficiently. No browser automation required — it communicates directly with the Civitai API for fast, reliable extraction.
Who is it for?
- 🤖 AI researchers — cataloging model architectures, tracking popularity trends, and analyzing the AI art model ecosystem
- 📊 Data analysts — building datasets of model statistics for market research and trend analysis
- 🎨 AI artists — finding the best-rated models for specific styles, subjects, or base model architectures
- 🏢 AI companies — monitoring the competitive landscape of open-source AI models
- 🔧 Tool developers — building model recommendation engines, comparison tools, or aggregation platforms
- 📈 Market researchers — understanding adoption patterns across model types and base architectures
Why scrape Civitai?
Civitai is the #1 hub for sharing AI image generation models, with millions of monthly visitors and tens of thousands of models. It hosts checkpoints, LoRAs, textual inversions, ControlNets, and other model types for Stable Diffusion, SDXL, Flux, and more.
Key reasons to scrape Civitai:
- Model discovery — Find the most downloaded or highest-rated models for any style or subject
- Trend analysis — Track which base models (SD 1.5, SDXL, Flux) are gaining or losing popularity
- Competitive intelligence — Monitor new model releases and creator activity
- Dataset building — Collect model metadata for training recommendation systems or search engines
- License auditing — Check commercial use permissions across models for compliance
- Community insights — Analyze download counts, ratings, and tipping patterns
What data can you extract from Civitai?
| Field | Type | Description |
|---|---|---|
| `modelId` | number | Unique model identifier |
| `name` | string | Model name |
| `type` | string | Model type (Checkpoint, LORA, ControlNet, etc.) |
| `description` | string | Model description (HTML stripped, max 1000 chars) |
| `tags` | array | User-assigned tags |
| `nsfw` | boolean | NSFW flag |
| `nsfwLevel` | number | NSFW severity level |
| `creatorUsername` | string | Creator's username |
| `creatorImageUrl` | string | Creator's avatar URL |
| `downloadCount` | number | Total downloads |
| `thumbsUpCount` | number | Total likes |
| `thumbsDownCount` | number | Total dislikes |
| `commentCount` | number | Total comments |
| `tippedAmountCount` | number | Total tip amount |
| `rating` | number | Approval ratio (likes / total votes, 0–1) |
| `allowNoCredit` | boolean | Can use without credit |
| `allowCommercialUse` | string | Commercial use permissions |
| `allowDerivatives` | boolean | Derivative works allowed |
| `allowDifferentLicense` | boolean | Different license allowed |
| `latestVersionName` | string | Latest version name |
| `latestVersionId` | number | Latest version ID |
| `baseModel` | string | Base model (SD 1.5, SDXL, Flux, etc.) |
| `publishedAt` | string | Version publish date (ISO 8601) |
| `downloadUrl` | string | Direct download URL for latest version |
| `trainedWords` | array | Trigger words for the model |
| `files` | array | File details (name, size, type, format, download URL) |
| `modelUrl` | string | Civitai model page URL |
| `scrapedAt` | string | Extraction timestamp (ISO 8601) |
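The `rating` field is described above as likes divided by total votes. A minimal sketch of that calculation follows. The rounding to two decimals and the value returned for a model with no votes are assumptions made for illustration; the scraper's actual handling is not documented here.

```python
def approval_ratio(thumbs_up: int, thumbs_down: int, digits: int = 2) -> float:
    """Return likes / (likes + dislikes), rounded to `digits` decimals."""
    total = thumbs_up + thumbs_down
    if total == 0:
        # Assumption: the source does not say how vote-less models are scored.
        return 0.0
    return round(thumbs_up / total, digits)


# Vote counts taken from the output example in this README.
print(approval_ratio(79400, 204))
```

With those counts the ratio rounds to 1.0, which is consistent with the `rating` shown in the output example.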
How much does it cost to scrape Civitai models?
Civitai Models Scraper uses Apify's pay-per-event pricing. You only pay for what you extract:
- Run start: $0.005 per run
- Per model: $0.003 per model extracted
Example costs:
- 20 models (quick search): $0.005 + 20 × $0.003 = $0.065
- 100 models (standard run): $0.005 + 100 × $0.003 = $0.305
- 1,000 models (bulk extraction): $0.005 + 1,000 × $0.003 = $3.005
💡 Apify's free plan includes $5/month in platform credits — enough to extract ~1,600 models per month at no cost.
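The pricing arithmetic above can be reproduced with a short helper. This is a sketch that only encodes the two fees listed in this section; the function names are hypothetical, and any other platform charges are out of scope.

```python
from decimal import Decimal

RUN_START_FEE = Decimal("0.005")  # per run, from the pricing list above
PER_MODEL_FEE = Decimal("0.003")  # per extracted model, from the pricing list above


def estimate_cost(models: int, runs: int = 1) -> Decimal:
    """Estimated charge in USD for extracting `models` models across `runs` runs."""
    return runs * RUN_START_FEE + models * PER_MODEL_FEE


def max_models_for_budget(budget_usd: str, runs: int = 1) -> int:
    """Largest model count whose estimated cost fits within the budget."""
    remaining = Decimal(budget_usd) - runs * RUN_START_FEE
    if remaining < 0:
        return 0
    return int(remaining // PER_MODEL_FEE)


print(estimate_cost(100))          # 0.305
print(max_models_for_budget("5"))  # 1665
```

`Decimal` is used instead of floats so that sums such as 0.005 + 20 × 0.003 come out exactly. A single run on a $5 budget fits 1,665 models, which matches the rough figure of 1,600 quoted above; splitting the same budget across many runs lowers that number slightly.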
How to scrape Civitai models
1. Go to Civitai Models Scraper on Apify Store
2. Enter a search keyword (e.g., "flux lora") or leave empty to browse all models
3. Optionally filter by model type (Checkpoint, LORA, ControlNet, etc.)
4. Choose how to sort results (Most Downloaded, Highest Rated, Newest)
5. Set the maximum number of models to extract
6. Click Start and wait for results
7. Download your data as JSON, CSV, or Excel
Input parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| `searchQuery` | string | `""` | Keyword to search for models. Leave empty to browse all. |
| `modelTypes` | string[] | `[]` | Filter by type: Checkpoint, LORA, ControlNet, VAE, etc. |
| `sort` | string | `"Most Downloaded"` | Sort by: Most Downloaded, Highest Rated, or Newest |
| `period` | string | `"AllTime"` | Time period for stats: AllTime, Year, Month, Week, Day |
| `includeNsfw` | boolean | `false` | Include NSFW models in results |
| `maxModels` | integer | `100` | Maximum models to extract (1–10,000) |
| `includeVersionDetails` | boolean | `true` | Include file metadata and trained words |
| `maxRequestRetries` | integer | `3` | Retry attempts for failed API requests |
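One way to avoid typos in the input is to build it from the documented defaults and check the values before starting a run. The sketch below assumes the defaults and allowed values exactly as listed in the table; `build_run_input` is a hypothetical helper, not part of the Actor or the Apify client, and the Actor may accept or reject values differently.

```python
import copy

ALLOWED_SORTS = {"Most Downloaded", "Highest Rated", "Newest"}
ALLOWED_PERIODS = {"AllTime", "Year", "Month", "Week", "Day"}

# Defaults copied from the input parameters table.
DEFAULT_INPUT = {
    "searchQuery": "",
    "modelTypes": [],
    "sort": "Most Downloaded",
    "period": "AllTime",
    "includeNsfw": False,
    "maxModels": 100,
    "includeVersionDetails": True,
    "maxRequestRetries": 3,
}


def build_run_input(**overrides) -> dict:
    """Merge overrides into the documented defaults and sanity-check them."""
    unknown = set(overrides) - set(DEFAULT_INPUT)
    if unknown:
        raise ValueError(f"Unknown parameters: {sorted(unknown)}")
    run_input = {**copy.deepcopy(DEFAULT_INPUT), **overrides}
    if run_input["sort"] not in ALLOWED_SORTS:
        raise ValueError(f"Unsupported sort: {run_input['sort']!r}")
    if run_input["period"] not in ALLOWED_PERIODS:
        raise ValueError(f"Unsupported period: {run_input['period']!r}")
    if not 1 <= run_input["maxModels"] <= 10_000:
        raise ValueError("maxModels must be between 1 and 10,000")
    return run_input


print(build_run_input(searchQuery="flux lora", modelTypes=["LORA"], maxModels=50))
```

The resulting dictionary can be passed as `run_input` in the Python example under "API usage".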
Output example
{
  "modelId": 827184,
  "name": "WAI-illustrious-SDXL",
  "type": "Checkpoint",
  "description": "High-quality illustrious checkpoint for anime-style generation...",
  "tags": ["base model", "anime"],
  "nsfw": false,
  "nsfwLevel": 1,
  "creatorUsername": "WAI0731",
  "creatorImageUrl": "https://image.civitai.com/...",
  "downloadCount": 1277231,
  "thumbsUpCount": 79400,
  "thumbsDownCount": 204,
  "commentCount": 0,
  "tippedAmountCount": 157252,
  "rating": 1.0,
  "allowNoCredit": true,
  "allowCommercialUse": "{Image,RentCivit}",
  "allowDerivatives": true,
  "allowDifferentLicense": true,
  "latestVersionName": "v16.0",
  "latestVersionId": 2514310,
  "baseModel": "Illustrious",
  "publishedAt": "2025-12-18T09:16:12.062Z",
  "downloadUrl": "https://civitai.com/api/download/models/2514310",
  "trainedWords": [],
  "files": [
    {
      "fileName": "waiIllustriousSDXL_v160.safetensors",
      "fileSizeKB": 6775430,
      "fileType": "Model",
      "format": "SafeTensor",
      "downloadUrl": "https://civitai.com/api/download/models/2514310"
    }
  ],
  "modelUrl": "https://civitai.com/models/827184",
  "scrapedAt": "2026-04-19T08:24:22.468Z"
}
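In the output example, `allowCommercialUse` arrives as a single string (`"{Image,RentCivit}"`) and not as a JSON array, so it needs a small parsing step before you can filter on it. The sketch below assumes the value is always a brace-wrapped, comma-separated list as in that example. The permission name `"Sell"` is used purely for illustration, and both helper names are hypothetical.

```python
def parse_commercial_use(raw: str) -> list[str]:
    """Split a value such as "{Image,RentCivit}" into ["Image", "RentCivit"]."""
    inner = raw.strip().strip("{}")
    return [part.strip() for part in inner.split(",") if part.strip()]


def allows(item: dict, permission: str) -> bool:
    """True when the dataset item lists the given commercial-use permission."""
    return permission in parse_commercial_use(item.get("allowCommercialUse", ""))


# Trimmed-down item based on the output example above.
item = {"name": "WAI-illustrious-SDXL", "allowCommercialUse": "{Image,RentCivit}"}
print(parse_commercial_use(item["allowCommercialUse"]))  # ['Image', 'RentCivit']
print(allows(item, "Sell"))                              # False
```

For a license audit, the same check can be run over every item in the dataset to shortlist models whose permissions match your use case.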
Tips for scraping Civitai
- 🔍 Use specific search terms — "flux lora portrait" gives more targeted results than just "flux"
- 📦 Filter by model type — narrow results to Checkpoints, LoRAs, or other specific types
- 📊 Sort by Most Downloaded for popular models, Highest Rated for quality, Newest for recent releases
- ⏱️ Use time periods — filter by Week or Month to find trending models
- 📉 Disable version details (`includeVersionDetails: false`) for lighter output when you only need model metadata
- 🔞 NSFW models are excluded by default (`includeNsfw: false`); set `includeNsfw: true` if you need them for research
- 🔄 The API returns up to 100 models per page — large extractions paginate automatically via cursor
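The cursor-based pagination mentioned in the last tip can be sketched as a generator that keeps requesting pages until it reaches the requested count or runs out of results. This is an illustration, not the Actor's actual code: the `limit` and `cursor` parameter names and the `items` / `metadata.nextCursor` response fields are assumptions about the Civitai API response shape, and `fake_fetch` is a hypothetical stand-in for the HTTP call so the example runs offline.

```python
from typing import Callable, Iterator, Optional

PAGE_SIZE = 100  # per-page cap mentioned in the tip above


def iter_models(
    fetch_page: Callable[[dict], dict],
    max_models: int,
    base_params: Optional[dict] = None,
) -> Iterator[dict]:
    """Yield up to max_models items, following the cursor from page to page."""
    params = dict(base_params or {})
    yielded = 0
    while yielded < max_models:
        params["limit"] = min(PAGE_SIZE, max_models - yielded)
        page = fetch_page(params)
        items = page.get("items", [])
        for item in items:
            yield item
            yielded += 1
            if yielded >= max_models:
                return
        cursor = page.get("metadata", {}).get("nextCursor")
        if not items or not cursor:
            return  # no further pages
        params["cursor"] = cursor


# Hypothetical stand-in for the HTTP request: serves 250 fake models.
def fake_fetch(params: dict) -> dict:
    start = int(params.get("cursor", 0))
    end = min(start + params["limit"], 250)
    return {
        "items": [{"id": i} for i in range(start, end)],
        "metadata": {"nextCursor": str(end) if end < 250 else None},
    }


print(len(list(iter_models(fake_fetch, max_models=230))))  # 230
```

Passing the fetch function in as an argument keeps the paging logic testable without network access. In real use it would wrap a GET request to the models endpoint named in the legal section of this README.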
Integrations
Connect Civitai Models Scraper to your workflows:
- Google Sheets — automatically export model data to spreadsheets for tracking and analysis
- Slack/Discord — get notifications when new high-rated models appear in your category
- Airtable — build a searchable model database with ratings, downloads, and compatibility info
- Webhooks — trigger downstream processing when new data is available
- Zapier/Make — connect to 5,000+ apps for automated model monitoring pipelines
API usage
Node.js
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

const run = await client.actor('automation-lab/civitai-scraper').call({
    searchQuery: 'flux lora',
    modelTypes: ['LORA'],
    sort: 'Most Downloaded',
    maxModels: 50,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);
Python
from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")

run = client.actor("automation-lab/civitai-scraper").call(run_input={
    "searchQuery": "flux lora",
    "modelTypes": ["LORA"],
    "sort": "Most Downloaded",
    "maxModels": 50,
})

items = client.dataset(run["defaultDatasetId"]).list_items().items
print(items)
cURL
curl -X POST "https://api.apify.com/v2/acts/automation-lab~civitai-scraper/runs?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "searchQuery": "flux lora",
    "modelTypes": ["LORA"],
    "sort": "Most Downloaded",
    "maxModels": 50
  }'
Use with MCP (Model Context Protocol)
Claude Code
claude mcp add --transport http apify "https://mcp.apify.com?tools=automation-lab/civitai-scraper"
Claude Desktop / Cursor / VS Code
Add to your MCP config file:
{
  "mcpServers": {
    "apify": {
      "url": "https://mcp.apify.com?tools=automation-lab/civitai-scraper",
      "headers": {
        "Authorization": "Bearer YOUR_API_TOKEN"
      }
    }
  }
}
Example prompts:
- "Find the top 20 most downloaded Flux LoRAs on Civitai"
- "Get all Checkpoint models tagged 'anime' sorted by rating"
- "Extract model details for SDXL ControlNets published this month"
Is it legal to scrape Civitai?
This scraper uses Civitai's official public REST API (https://civitai.com/api/v1/models), which is freely available without authentication. The API is documented and intended for public use.
The scraper only extracts publicly available model metadata — it does not download model files, bypass authentication, or access private data. Always review Civitai's Terms of Service before using extracted data commercially.
FAQ
Q: Do I need a Civitai account to use this scraper?
A: No. The Civitai API is completely public and requires no authentication. The scraper works without any Civitai credentials.
Q: Can I download model files with this scraper?
A: The scraper extracts download URLs for model files, but does not download the files themselves. You can use the extracted URLs to download models separately.
Q: Why am I getting fewer results than expected?
A: Some model types may have limited listings. Try broadening your search query, removing type filters, or enabling NSFW models to increase results. The Civitai API may also return fewer results for very specific searches.
Q: What model types are available?
A: Checkpoint, LORA, TextualInversion, Hypernetwork, AestheticGradient, ControlNet, Poses, Wildcards, Workflows, VAE, Upscaler, MotionModule, LoCon, and DoRA.
Q: How often is Civitai data updated?
A: Civitai's API reflects real-time data. Run the scraper on a schedule (daily, weekly) to track changes in download counts, ratings, and new model releases.
Q: The scraper returned 0 results. What happened?
A: This is rare since the Civitai API returns results even for broad queries. Check your model type filter, since some types may have very few listings. Try removing filters or using a different search query.
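For the scheduled-run answer above, tracking changes comes down to comparing two dataset snapshots by `modelId`. The following is a minimal sketch, assuming each snapshot is a list of output items with the `modelId` and `downloadCount` fields from the output table. The snapshot data is made up and `download_deltas` is a hypothetical helper.

```python
def download_deltas(previous: list[dict], current: list[dict]) -> dict[int, int]:
    """Map modelId -> change in downloadCount between two runs.

    Models that appear only in the current run are reported with their full
    count, since there is no earlier value to compare against.
    """
    before = {item["modelId"]: item["downloadCount"] for item in previous}
    return {
        item["modelId"]: item["downloadCount"] - before.get(item["modelId"], 0)
        for item in current
    }


# Hypothetical snapshots from two scheduled runs.
monday = [{"modelId": 1, "downloadCount": 100}, {"modelId": 2, "downloadCount": 40}]
tuesday = [{"modelId": 1, "downloadCount": 130}, {"modelId": 3, "downloadCount": 5}]
print(download_deltas(monday, tuesday))  # {1: 30, 3: 5}
```

Models missing from the newer snapshot (model 2 here) are simply left out; whether to flag them as removed depends on how stable your search filters are between runs.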
Related scrapers
- ArXiv Scraper — scrape research papers from ArXiv
- GitHub Repo Scraper — extract repository data from GitHub
- HuggingFace Scraper — scrape models and datasets from HuggingFace