Civitai Models Scraper

Deprecated

Pricing

Pay per usage

Scrape AI models from Civitai. Extract names, types, download stats, ratings, creators, versions, tags, trained words, and licensing info. No auth needed.

Rating: 0.0 (0)

Developer

Stas Persiianenko

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 6 days ago

Scrape AI models from Civitai, one of the largest platforms for sharing AI image generation models. Extract model names, types, download counts, ratings, creator info, version details, file metadata, tags, trained words, and licensing information.

What does Civitai Models Scraper do?

Civitai Models Scraper uses Civitai's public REST API to extract structured data about AI models — Stable Diffusion checkpoints, LoRAs, ControlNets, VAEs, and more. It collects complete model metadata including download statistics, community ratings, version history, file sizes, trained trigger words, base model compatibility, and commercial licensing terms.

The scraper supports keyword search, model type filtering, configurable sorting, and cursor-based pagination to handle large result sets efficiently. No browser automation required — it communicates directly with the Civitai API for fast, reliable extraction.
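The cursor-based pagination mentioned above can be sketched roughly as follows. This is an illustration, not the actor's actual code: the `iter_models` name, the `fetch_page` callable, and the assumed response shape (an `items` list plus a `metadata.nextCursor` field) are assumptions that should be checked against Civitai's API documentation. A stand-in fetcher replaces the real HTTP call so the sketch runs offline.

```python
from typing import Callable, Iterator, Optional


def iter_models(
    fetch_page: Callable[[Optional[str]], dict],
    max_models: int = 100,
) -> Iterator[dict]:
    """Yield up to max_models items, following cursors between pages.

    fetch_page(cursor) is a hypothetical callable returning one page shaped
    like {"items": [...], "metadata": {"nextCursor": "..."}}. That shape is
    an assumption, not a confirmed contract.
    """
    cursor: Optional[str] = None
    yielded = 0
    while yielded < max_models:
        page = fetch_page(cursor)
        items = page.get("items", [])
        if not items:
            break  # empty page: nothing more to read
        for item in items:
            yield item
            yielded += 1
            if yielded >= max_models:
                return  # requested limit reached
        cursor = page.get("metadata", {}).get("nextCursor")
        if not cursor:
            break  # no cursor: this was the last page


# Stand-in for the real HTTP request (illustrative data only).
_FAKE_PAGES = {
    None: {"items": [{"id": 1}, {"id": 2}], "metadata": {"nextCursor": "c1"}},
    "c1": {"items": [{"id": 3}], "metadata": {}},
}


def fake_fetch(cursor: Optional[str]) -> dict:
    return _FAKE_PAGES[cursor]


print([m["id"] for m in iter_models(fake_fetch, max_models=10)])
```

Passing the fetcher in as a callable keeps the paging logic separate from the HTTP layer, so the same loop can be exercised with canned pages.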

Who is it for?

  • 🤖 AI researchers — cataloging model architectures, tracking popularity trends, and analyzing the AI art model ecosystem
  • 📊 Data analysts — building datasets of model statistics for market research and trend analysis
  • 🎨 AI artists — finding the best-rated models for specific styles, subjects, or base model architectures
  • 🏢 AI companies — monitoring the competitive landscape of open-source AI models
  • 🔧 Tool developers — building model recommendation engines, comparison tools, or aggregation platforms
  • 📈 Market researchers — understanding adoption patterns across model types and base architectures

Why scrape Civitai?

Civitai is the #1 hub for sharing AI image generation models, with millions of monthly visitors and tens of thousands of models. It hosts checkpoints, LoRAs, textual inversions, ControlNets, and other model types for Stable Diffusion, SDXL, Flux, and more.

Key reasons to scrape Civitai:

  • Model discovery — Find the most downloaded or highest-rated models for any style or subject
  • Trend analysis — Track which base models (SD 1.5, SDXL, Flux) are gaining or losing popularity
  • Competitive intelligence — Monitor new model releases and creator activity
  • Dataset building — Collect model metadata for training recommendation systems or search engines
  • License auditing — Check commercial use permissions across models for compliance
  • Community insights — Analyze download counts, ratings, and tipping patterns

What data can you extract from Civitai?

| Field | Type | Description |
| --- | --- | --- |
| modelId | number | Unique model identifier |
| name | string | Model name |
| type | string | Model type (Checkpoint, LORA, ControlNet, etc.) |
| description | string | Model description (HTML stripped, max 1000 chars) |
| tags | array | User-assigned tags |
| nsfw | boolean | NSFW flag |
| nsfwLevel | number | NSFW severity level |
| creatorUsername | string | Creator's username |
| creatorImageUrl | string | Creator's avatar URL |
| downloadCount | number | Total downloads |
| thumbsUpCount | number | Total likes |
| thumbsDownCount | number | Total dislikes |
| commentCount | number | Total comments |
| tippedAmountCount | number | Total tip amount |
| rating | number | Approval ratio (likes / total votes, 0–1) |
| allowNoCredit | boolean | Can use without credit |
| allowCommercialUse | string | Commercial use permissions |
| allowDerivatives | boolean | Derivative works allowed |
| allowDifferentLicense | boolean | Different license allowed |
| latestVersionName | string | Latest version name |
| latestVersionId | number | Latest version ID |
| baseModel | string | Base model (SD 1.5, SDXL, Flux, etc.) |
| publishedAt | string | Version publish date (ISO 8601) |
| downloadUrl | string | Direct download URL for latest version |
| trainedWords | array | Trigger words for the model |
| files | array | File details (name, size, format, download URL) |
| modelUrl | string | Civitai model page URL |
| scrapedAt | string | Extraction timestamp |

How much does it cost to scrape Civitai models?

Civitai Models Scraper uses Apify's pay-per-event pricing. You only pay for what you extract:

  • Run start: $0.005 per run
  • Per model: $0.003 per model extracted

Example costs:

  • 20 models (quick search): $0.005 + 20 × $0.003 = $0.065
  • 100 models (standard run): $0.005 + 100 × $0.003 = $0.305
  • 1,000 models (bulk extraction): $0.005 + 1,000 × $0.003 = $3.005

💡 Apify's free plan includes $5/month in platform credits — enough to extract ~1,600 models per month at no cost.
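The pricing arithmetic above can be reproduced with a small helper. The function names `estimate_cost` and `models_within_budget` are illustrative and not part of any Apify API; the two fees are taken from the pricing list, and `Decimal` is used to avoid floating-point rounding in the totals.

```python
from decimal import Decimal

RUN_START_FEE = Decimal("0.005")  # per run, from the pricing list above
PER_MODEL_FEE = Decimal("0.003")  # per model extracted


def estimate_cost(models: int, runs: int = 1) -> Decimal:
    """Estimated charge in USD for extracting `models` models over `runs` runs."""
    return RUN_START_FEE * runs + PER_MODEL_FEE * models


def models_within_budget(budget_usd: str, runs: int = 1) -> int:
    """Largest number of models whose estimated cost fits within the budget."""
    remaining = Decimal(budget_usd) - RUN_START_FEE * runs
    if remaining < 0:
        return 0
    return int(remaining // PER_MODEL_FEE)


for count in (20, 100, 1000):
    print(count, estimate_cost(count))

# How many models a $5 credit covers if everything is done in a single run.
print(models_within_budget("5"))
```

For a single run the $5 figure works out slightly above the rounded ~1,600 quoted above; splitting the same volume across many runs lowers it, since each run adds its own start fee.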

How to scrape Civitai models

  1. Go to Civitai Models Scraper on Apify Store
  2. Enter a search keyword (e.g., "flux lora") or leave empty to browse all models
  3. Optionally filter by model type (Checkpoint, LORA, ControlNet, etc.)
  4. Choose how to sort results (Most Downloaded, Highest Rated, Newest)
  5. Set the maximum number of models to extract
  6. Click Start and wait for results
  7. Download your data as JSON, CSV, or Excel
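If you download the results as JSON and want a flat CSV of your own, nested fields such as `tags` and `files` need flattening first. The following is a minimal sketch using field names from the output table; the column selection, the `to_row` and `to_csv` helpers, and the derived `fileCount` column are illustrative choices, not something the actor produces.

```python
import csv
import io

# Sample record using field names from the output table (values illustrative).
items = [
    {
        "modelId": 827184,
        "name": "WAI-illustrious-SDXL",
        "type": "Checkpoint",
        "tags": ["base model", "anime"],
        "downloadCount": 1277231,
        "files": [
            {"fileName": "waiIllustriousSDXL_v160.safetensors", "fileSizeKB": 6775430}
        ],
    }
]

COLUMNS = ["modelId", "name", "type", "tags", "downloadCount", "fileCount"]


def to_row(item: dict) -> dict:
    """Flatten one record: join tags into one string and count the files."""
    return {
        "modelId": item.get("modelId"),
        "name": item.get("name"),
        "type": item.get("type"),
        "tags": "; ".join(item.get("tags") or []),
        "downloadCount": item.get("downloadCount"),
        "fileCount": len(item.get("files") or []),
    }


def to_csv(records: list) -> str:
    """Serialize the flattened records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(to_row(record) for record in records)
    return buffer.getvalue()


print(to_csv(items))
```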

Input parameters

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| searchQuery | string | "" | Keyword to search for models. Leave empty to browse all. |
| modelTypes | string[] | [] | Filter by type: Checkpoint, LORA, ControlNet, VAE, etc. |
| sort | string | "Most Downloaded" | Sort by: Most Downloaded, Highest Rated, or Newest |
| period | string | "AllTime" | Time period for stats: AllTime, Year, Month, Week, Day |
| includeNsfw | boolean | false | Include NSFW models in results |
| maxModels | integer | 100 | Maximum models to extract (1–10,000) |
| includeVersionDetails | boolean | true | Include file metadata and trained words |
| maxRequestRetries | integer | 3 | Retry attempts for failed API requests |
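As a rough illustration of how these parameters fit together, the sketch below builds a run input with the documented defaults and rejects values outside the documented options. The `build_input` helper is hypothetical; the actor performs its own validation, which may differ from these checks.

```python
from typing import List, Optional

# Allowed values as listed in the parameter table above.
ALLOWED_SORTS = {"Most Downloaded", "Highest Rated", "Newest"}
ALLOWED_PERIODS = {"AllTime", "Year", "Month", "Week", "Day"}


def build_input(
    search_query: str = "",
    model_types: Optional[List[str]] = None,
    sort: str = "Most Downloaded",
    period: str = "AllTime",
    include_nsfw: bool = False,
    max_models: int = 100,
    include_version_details: bool = True,
    max_request_retries: int = 3,
) -> dict:
    """Return a run input dict, rejecting values outside the documented options."""
    if sort not in ALLOWED_SORTS:
        raise ValueError(f"unsupported sort: {sort!r}")
    if period not in ALLOWED_PERIODS:
        raise ValueError(f"unsupported period: {period!r}")
    if not 1 <= max_models <= 10_000:
        raise ValueError("maxModels must be between 1 and 10,000")
    return {
        "searchQuery": search_query,
        "modelTypes": list(model_types or []),
        "sort": sort,
        "period": period,
        "includeNsfw": include_nsfw,
        "maxModels": max_models,
        "includeVersionDetails": include_version_details,
        "maxRequestRetries": max_request_retries,
    }


print(build_input(search_query="flux lora", model_types=["LORA"], max_models=50))
```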

Output example

{
    "modelId": 827184,
    "name": "WAI-illustrious-SDXL",
    "type": "Checkpoint",
    "description": "High-quality illustrious checkpoint for anime-style generation...",
    "tags": ["base model", "anime"],
    "nsfw": false,
    "nsfwLevel": 1,
    "creatorUsername": "WAI0731",
    "creatorImageUrl": "https://image.civitai.com/...",
    "downloadCount": 1277231,
    "thumbsUpCount": 79400,
    "thumbsDownCount": 204,
    "commentCount": 0,
    "tippedAmountCount": 157252,
    "rating": 1.0,
    "allowNoCredit": true,
    "allowCommercialUse": "{Image,RentCivit}",
    "allowDerivatives": true,
    "allowDifferentLicense": true,
    "latestVersionName": "v16.0",
    "latestVersionId": 2514310,
    "baseModel": "Illustrious",
    "publishedAt": "2025-12-18T09:16:12.062Z",
    "downloadUrl": "https://civitai.com/api/download/models/2514310",
    "trainedWords": [],
    "files": [
        {
            "fileName": "waiIllustriousSDXL_v160.safetensors",
            "fileSizeKB": 6775430,
            "fileType": "Model",
            "format": "SafeTensor",
            "downloadUrl": "https://civitai.com/api/download/models/2514310"
        }
    ],
    "modelUrl": "https://civitai.com/models/827184",
    "scrapedAt": "2026-04-19T08:24:22.468Z"
}
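Two fields in a record like the one above often need light post-processing: the approval ratio, which the field table defines as likes divided by total votes, and `allowCommercialUse`, which in the example is a brace-wrapped, comma-separated string. The helpers below are a sketch under those assumptions; `approval_ratio` and `parse_commercial_use` are illustrative names, and the string format is inferred from this single example.

```python
from typing import List, Optional


def approval_ratio(item: dict) -> Optional[float]:
    """Likes divided by total votes, or None when there are no votes."""
    up = item.get("thumbsUpCount", 0)
    down = item.get("thumbsDownCount", 0)
    total = up + down
    return up / total if total else None


def parse_commercial_use(value: str) -> List[str]:
    """Split a value such as "{Image,RentCivit}" into its individual entries.

    Assumes the brace-wrapped, comma-separated format seen in the example.
    """
    inner = value.strip().strip("{}")
    return [part.strip() for part in inner.split(",") if part.strip()]


record = {
    "thumbsUpCount": 79400,
    "thumbsDownCount": 204,
    "allowCommercialUse": "{Image,RentCivit}",
}

print(approval_ratio(record))
print(parse_commercial_use(record["allowCommercialUse"]))
```

Recomputing the ratio from the raw counts gives a value just under 1, so the `rating` of 1.0 in the example is presumably rounded.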

Tips for scraping Civitai

  • 🔍 Use specific search terms — "flux lora portrait" gives more targeted results than just "flux"
  • 📦 Filter by model type — narrow results to Checkpoints, LoRAs, or other specific types
  • 📊 Sort by Most Downloaded for popular models, Highest Rated for quality, Newest for recent releases
  • ⏱️ Use time periods — filter by Week or Month to find trending models
  • 📉 Disable version details (includeVersionDetails: false) for lighter output when you only need model metadata
  • 🔞 NSFW models are excluded by default (includeNsfw: false); set includeNsfw: true if your research needs them
  • 🔄 The API returns up to 100 models per page — large extractions paginate automatically via cursor

Integrations

Connect Civitai Models Scraper to your workflows:

  • Google Sheets — automatically export model data to spreadsheets for tracking and analysis
  • Slack/Discord — get notifications when new high-rated models appear in your category
  • Airtable — build a searchable model database with ratings, downloads, and compatibility info
  • Webhooks — trigger downstream processing when new data is available
  • Zapier/Make — connect to 5,000+ apps for automated model monitoring pipelines

API usage

Node.js

import { ApifyClient } from 'apify-client';

// Authenticate with your Apify API token.
const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

// Start the actor and wait for the run to finish.
const run = await client.actor('automation-lab/civitai-scraper').call({
    searchQuery: 'flux lora',
    modelTypes: ['LORA'],
    sort: 'Most Downloaded',
    maxModels: 50,
});

// Read the scraped models from the run's default dataset.
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python

from apify_client import ApifyClient

# Authenticate with your Apify API token.
client = ApifyClient("YOUR_API_TOKEN")

# Start the actor and wait for the run to finish.
run = client.actor("automation-lab/civitai-scraper").call(run_input={
    "searchQuery": "flux lora",
    "modelTypes": ["LORA"],
    "sort": "Most Downloaded",
    "maxModels": 50,
})

# Read the scraped models from the run's default dataset.
items = client.dataset(run["defaultDatasetId"]).list_items().items
print(items)

cURL

curl -X POST "https://api.apify.com/v2/acts/automation-lab~civitai-scraper/runs?token=YOUR_API_TOKEN" \
    -H "Content-Type: application/json" \
    -d '{
        "searchQuery": "flux lora",
        "modelTypes": ["LORA"],
        "sort": "Most Downloaded",
        "maxModels": 50
    }'

Use with MCP (Model Context Protocol)

Claude Code

$ claude mcp add --transport http apify "https://mcp.apify.com?tools=automation-lab/civitai-scraper"

Claude Desktop / Cursor / VS Code

Add to your MCP config file:

{
    "mcpServers": {
        "apify": {
            "url": "https://mcp.apify.com?tools=automation-lab/civitai-scraper",
            "headers": {
                "Authorization": "Bearer YOUR_API_TOKEN"
            }
        }
    }
}

Example prompts:

  • "Find the top 20 most downloaded Flux LoRAs on Civitai"
  • "Get all Checkpoint models tagged 'anime' sorted by rating"
  • "Extract model details for SDXL ControlNets published this month"

This scraper uses Civitai's official public REST API (https://civitai.com/api/v1/models), which is freely available without authentication. The API is documented and intended for public use.

The scraper only extracts publicly available model metadata — it does not download model files, bypass authentication, or access private data. Always review Civitai's Terms of Service before using extracted data commercially.

FAQ

Q: Do I need a Civitai account to use this scraper? A: No. The Civitai API is completely public and requires no authentication. The scraper works without any Civitai credentials.

Q: Can I download model files with this scraper? A: The scraper extracts download URLs for model files, but does not download the files themselves. You can use the extracted URLs to download models separately.

Q: Why am I getting fewer results than expected? A: Some model types may have limited listings. Try broadening your search query, removing type filters, or enabling NSFW models to increase results. The Civitai API may also return fewer results for very specific searches.

Q: What model types are available? A: Checkpoint, LORA, TextualInversion, Hypernetwork, AestheticGradient, ControlNet, Poses, Wildcards, Workflows, VAE, Upscaler, MotionModule, LoCon, and DoRA.

Q: How often is Civitai data updated? A: Civitai's API reflects real-time data. Run the scraper on a schedule (daily, weekly) to track changes in download counts, ratings, and new model releases.

Q: The scraper returned 0 results — what happened? A: This is rare, since the Civitai API returns results for most queries. Check your model type filter first, because some types have very few listings. Then try removing filters or using a different search query.
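For the scheduled runs suggested in the FAQ, tracking changes comes down to comparing two snapshots of the dataset. The sketch below assumes each snapshot is a list of records with the `modelId` and `downloadCount` fields from the output table; the `compare_snapshots` helper and the sample numbers are illustrative.

```python
from typing import Dict, List, Tuple


def compare_snapshots(
    previous: List[dict], current: List[dict]
) -> Tuple[Dict[int, int], List[int]]:
    """Return (download deltas for models in both snapshots, IDs of new models)."""
    before = {m["modelId"]: m["downloadCount"] for m in previous}
    deltas: Dict[int, int] = {}
    new_ids: List[int] = []
    for model in current:
        model_id = model["modelId"]
        if model_id in before:
            deltas[model_id] = model["downloadCount"] - before[model_id]
        else:
            new_ids.append(model_id)  # not present in the earlier run
    return deltas, new_ids


# Illustrative snapshots from two hypothetical runs.
yesterday = [{"modelId": 1, "downloadCount": 100}]
today = [
    {"modelId": 1, "downloadCount": 150},
    {"modelId": 2, "downloadCount": 10},
]

deltas, new_ids = compare_snapshots(yesterday, today)
print(deltas, new_ids)
```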