Pricing

from $20.00 / 1,000 results

🐳 Docker Hub Scraper — Images & Pull Counts

Extract Docker Hub image data — pull counts, tags, descriptions, maintainers, version history. Snyk, Anchore & Sysdig alternative for container intelligence, SBOMs, supply-chain audits and DevOps dashboards. Pay per image.

Pricing

from $20.00 / 1,000 results

Rating

0.0

(0)

Developer

Stephan Corbeil

Actor stats

Bookmarked

Total users

Monthly active users

3 days ago

Last modified

🐳 Docker Hub Scraper — Images, Tags, Pull Counts & Vulnerability Signals

Pay-per-result Docker Hub scraper — extracts image metadata, tag list, pull counts, last-push timestamps, vulnerabilities reported by Docker Scout, README, and dependency layer info. Built for container security teams, devtool marketers, and OSS-funded analytics as a no-rate-limit alternative to Docker Hub's official API (anonymous 100 pulls/6h; authenticated 200; subscribers 5000+), Docker Scout enterprise tier ($21+/user/mo), Snyk Container ($25-58/user/mo), Aqua Trivy Cloud, and Anchore Enterprise.

Why Docker Hub Scraper Beats the Docker Hub API, Snyk & Docker Scout

Feature	NexGenData Docker Hub Scraper	Docker Hub official API	Snyk Container	Docker Scout
Cost	$0.002 / image, pay-per-result	Free + pull-rate-limited	$25-58 / user / month	$21+ / user / month
Pull-rate cap	None for end user	100-5000 pulls / 6h	Plan-dependent	Plan-dependent
Auth	Apify token	Docker ID + plan	Account + plan	Docker subscription
Bulk image scan	Yes	Per-call REST + pagination	Yes	Yes
Vulnerability signals	Yes	Limited	Yes — deep CVE detail	Yes — Scout findings
Pull-count history	Yes	Per-image only	Limited	Yes
Free trial	Free Apify credits on signup	Free for low volume	14-day trial	Free tier on Docker Personal

Container-security teams and devtool competitive analysts pick this actor instead of wrestling with Docker Hub's API pull-rate limits (especially the anonymous 100-per-6-hours that kills any bulk scan). It is a drop-in alternative to Docker Scout's enterprise tier when you need image metadata + Scout-equivalent vulnerability counts but not the full Scout dashboard.

What You Get Per Image

Each dataset item is a flat JSON record:

namespace, repository, full_name, description, short_description
is_official, is_verified_publisher, is_automated
star_count, pull_count, last_updated
categories, architectures — amd64, arm64, ppc64le, s390x, riscv64, etc.
tags — array of {name, digest, size_bytes, last_pushed, architectures}
latest_tag_size_mb, tag_count, unique_digests
base_image, os_versions
vulnerability_summary — {critical, high, medium, low}
top_vulnerabilities — array of {cve_id, severity, package, fixed_in} (when Docker Scout data is public)
dependency_layers_count, total_layers
readme_text, dockerfile_url, source_repo_url, license

Use Cases

Container-security teams — bulk-scan your organization's full registry catalog and rank by CVE count
Devtool marketers — find images using your competitor's base image and pitch a migration
Open-source intelligence — track adoption velocity of new base images (Alpine vs Distroless vs Wolfi)
Engineering procurement — benchmark target-company image hygiene during diligence
Internal platform teams — audit your private registry's pull-count distribution
Newsletter / Substack content — automate "top 10 fastest-growing Docker images this month"
SBOM workflows — feed the dependency-layer data into your bill-of-materials pipeline

Quick Start

from apify_client import ApifyClient
client = ApifyClient("YOUR_APIFY_TOKEN")
run = client.actor("nexgendata/dockerhub-scraper").call(run_input={
    "queries": ["nginx", "redis", "postgres"],
    "namespaces": ["library", "bitnami"],
    "includeVulnerabilities": True,
    "maxResults": 500
})
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item["full_name"], item["pull_count"], item["vulnerability_summary"])

Pricing

Pay-per-event — no Docker Hub subscription required, no monthly minimum.

Actor Start: $0.0001
Per image: $0.002
Per tag enrichment: $0.0005

A 500-image namespace audit with vulnerability data costs about $2-3. Same data via Docker Hub API + Docker Scout subscription is gated behind a $21+/user/month tier.

Use case	Actor
GitHub repos + stars + contributors	GitHub Scraper
GitLab projects + MRs	GitLab Scraper
GitHub trending feed	GitHub Trending Scraper
npm package download stats	npm Package Stats
PyPI package download stats	PyPI Package Stats
Dev.to articles + dev audience	Dev.to Scraper
Developer Tools MCP Server	Developer Tools MCP Server
Tech-stack / Wappalyzer replacement	Wappalyzer Replacement

FAQ

Q: How do you bypass Docker Hub's pull-rate cap? We don't pull image binaries — we extract metadata, tag lists, and pull counts via Docker Hub's public web pages and the metadata API. Image binaries are not downloaded.

Q: Do you scan vulnerabilities yourself? We surface vulnerability counts that Docker Hub exposes publicly via Docker Scout. We do not run our own scanner — that would require pulling the binary.

Q: Can I do this for private registries (ECR, GCR, ACR)? This actor is Docker Hub only. For private-registry scanning, layer on your cloud provider's native scanner.

Q: How fresh is pull-count data? Live per run. Docker Hub aggregates pull-count over time — we capture the cumulative value at scan time.

Q: Schema stability? Field names are versioned. We track Docker Hub's web-page DOM and ship parser updates within 24 hours of breaking changes.

Q: Multi-arch image support? Yes — architectures lists every CPU/OS combo for each tag. Multi-arch manifests are flattened across one row per logical tag.

About NexGenData

NexGenData publishes 260+ buyer-intent actors covering SEC filings, YC alumni, lead generation, competitive intelligence, stock fundamentals across 30+ exchanges, and more. All pay-per-result. Browse the full catalog at https://apify.com/nexgendata?fpr=2ayu9b

How NexGenData Pricing Works

Every NexGenData actor uses pay-per-event pricing — you only pay for results that actually land in your dataset. No monthly minimum, no seat fees, no surprise overage bills.

Actor Start: a single-event charge each time you spin the actor up (scaled to memory size)
Result / item: charged per item written to the default dataset
No charge for retries, internal proxy rotation, or failed sub-requests — those are absorbed by the platform

Apify Platform Bonus

New to Apify? Sign up with the NexGenData referral link — you get free platform credits on signup (enough for several thousand free results) and you help fund the maintenance of this actor fleet.

Integration Surface

Every actor in the NexGenData catalog can be triggered from:

Apify console — point-and-click run
Apify API — REST + webhooks
Apify Python / JS SDKs — programmatic batch
Zapier, Make.com, n8n — official integrations
MCP — many actors are exposed as MCP tools for Claude / ChatGPT / Cursor agents
Schedules — built-in cron for daily / weekly / monthly runs
Webhooks — POST results to any HTTPS endpoint on dataset write

Support

NexGenData maintains 260+ Apify actors and ships updates regularly. Bug reports via the Apify console issues tab get a response within 24 hours. Roadmap requests are welcome — high-demand features ship in the next version.

Home: thenextgennexus.com Full catalog: apify.com/nexgendata

Docker Hub Scraper

automation-lab/dockerhub-scraper

Search Docker Hub and extract repository data. Get image names, descriptions, star counts, pull counts, and official status. Search multiple keywords in one run.

Stas Persiianenko

Docker Hub Scraper

klondikeking/docker-hub-scraper

Pierrick McD0nald

Docker Hub Scraper

shahidirfan/docker-hub-scraper

Scrape Docker Hub repositories, container images & metadata efficiently. Essential for market research, competitive analysis, developer tool insights, registry monitoring & API integrations.

Shahid Irfan

Docker Hub Container Images Scraper

parseforge/docker-hub-images-scraper

Search Docker Hub for container images. Returns repository name, owner, full and short description, official/automated/verified flags, star count, total pull count, last updated, available tags. Search by keyword or look up specific images by name with full tag listings.

ParseForge

Docker Mcp

salesmart-srl/docker-mcp

MCP server for Docker Engine. List, start, stop and restart containers. View logs, pull images, manage volumes and networks. Works with Claude, Cursor and any MCP-compatible AI assistant. Connect to local or remote Docker daemon.

Salesmart Srl

🐍 PyPI Scraper — Python Package Data

nexgendata/pypi-scraper

Extract Python package data from PyPI — download stats, dependencies, version history, maintainers. libraries.io, Snyk Advisor & deps.dev alternative for SBOMs, supply-chain audits, ecosystem analytics and dependency monitoring. Pay per package.

Stephan Corbeil

Football Intelligence Hub

omarchydev/football-intelligence-hub

Football Intelligence Hub - Injuries, Transfers & ML Predictions.

Omarchy Dev

🛡️ Docker Image Update Monitor

taroyamada/dockerhub-image-intelligence

Track public container repositories on a strict schedule to instantly detect tag drift, newly published versions, and storage size changes.

太郎山田

Crunchbase Hub Scraper - Cheap 💼🌐

contactminerlabs/my-actor-4

🔍 Scrape Mass/Bulk Crunchbase Hub Enter a keyword & extract relevant Crunchbase Hub, including full name, headline, company, bio & URL 📊 Perfect for lead generation, recruitment, B2B outreach, talent sourcing & enriching your data pipelines across Google Sheets & automation tools

ContactMinerLabs

Letting Ahead Content-hub Scraper

yourapiservice/lettingahead-content-hub-scraper

Letting Ahead Content-hub Scraper (lettingahead.com) lets you extract content-hub content in HTML, JSON, and plaintext. Get authors, create/update date, images, read time, RSS, titles, SEO titles, featured images & videos, and keywords easily for content analysis and aggregation.