Docker Hub Scraper
Pricing
$1.00 / 1,000 image extracteds
Docker Hub Scraper
Pricing
$1.00 / 1,000 image extracteds
Rating
0.0
(0)
Developer
Pierrick McD0nald
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Extract comprehensive Docker Hub image metadata, tags, pull counts, and security information. This Actor provides detailed intelligence on Docker container images including official and community repositories.
Overview
Docker Hub hosts millions of container images used by developers and DevOps teams worldwide. This scraper extracts rich metadata including download statistics, star ratings, tag information with architecture support, and full repository descriptions. Perfect for competitive intelligence, security auditing, container research, and DevOps automation.
Use Cases
Container Security Auditing Analyze official vs community images, verify publisher information, and track image update frequency for security compliance. Extract architecture support data to ensure multi-platform compatibility.
DevOps Intelligence Monitor popular base images, track trending container technologies, and compare pull counts across different solutions. Identify official images vs unofficial forks for infrastructure decisions.
Market Research Analyze the Docker ecosystem to understand container adoption patterns. Track which technologies and base images dominate the container landscape. Research competitor containerization strategies.
CI/CD Pipeline Optimization Extract image tags and digests to pin exact versions in your pipelines. Compare image sizes across different tags to optimize build and deployment times.
Features
- Complete Image Metadata: Name, namespace, description, star count, pull count, registration date
- Official Image Detection: Identify Docker Official Images (library namespace)
- Tag Intelligence: Extract all tags with digests, sizes, and supported architectures
- Architecture Support: Detect multi-platform support (AMD64, ARM64, ARM, etc.)
- Category Classification: Extract Docker Hub categories for organization
- Full Descriptions: Retrieve complete README content from repositories
- Automated Repository Detection: Identify automated build configurations
- Repository Links: Direct URLs to Docker Hub pages
- Extracted Timestamps: Know exactly when data was collected
- Search Functionality: Find images by keyword with sorting options
- Rate Limiting: Built-in delays to respect Docker Hub API limits
- PPE Charging: Pay-per-event pricing with spending limit checks
Input Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| imageNames | array | No | [] | List of Docker image names (e.g., "library/ubuntu", "nginx") |
| searchQuery | string | No | "" | Search Docker Hub for images matching this query |
| maxResults | integer | No | 10 | Maximum images to extract in search mode (1-1000) |
| includeTags | boolean | No | true | Extract tag information for each image |
| maxTagsPerImage | integer | No | 10 | Limit tags per image (1-100) |
| sortBy | string | No | "pulls" | Sort results by: pulls, stars, name, last_updated |
| proxyConfiguration | object | No | {} | Optional proxy settings |
Output Example
{"imageName": "library/ubuntu","namespace": "library","repositoryName": "ubuntu","description": "Ubuntu is a Debian-based Linux operating system...","isOfficial": true,"isAutomated": false,"isPrivate": false,"starCount": 17801,"pullCount": 9839039412,"lastUpdated": "2026-03-19T04:04:47.179605Z","dateRegistered": "2013-04-30T20:54:50Z","status": "active","categories": ["Operating systems"],"fullDescription": "# Quick reference\n\n**Maintained by**: Canonical...","repositoryUrl": "https://hub.docker.com/r/library/ubuntu","tags": [{"name": "24.04","fullSize": 29731993,"lastUpdated": "2026-03-19T04:04:38.398097Z","architectures": ["amd64", "arm64", "arm"],"digest": "sha256:186072bba1b2f436cbb91ef2567abca677337cfc786c86e107d25b7072feef0c"}],"architectures": ["linux/amd64", "linux/arm64"],"extractedAt": "2026-03-28T13:30:00.000Z"}
Pricing
This Actor uses Pay-Per-Event (PPE) pricing:
| Event | Price |
|---|---|
| Image extracted | $0.001 per image |
Example costs:
- Extract 100 images with tags: $0.10
- Extract 1,000 images with tags: $1.00
- Search and extract top 50 images: $0.05
No subscription required. You only pay for what you extract.
FAQ
Q: Do I need authentication to scrape Docker Hub? A: No. This Actor uses Docker Hub's public API which requires no authentication for read operations.
Q: Can I extract all tags from an image?
A: Yes, but we recommend limiting tags with maxTagsPerImage as some images have hundreds of tags. The default is 10 tags per image.
Q: Does this work with private repositories? A: No. This Actor only accesses publicly available Docker Hub data. Private repositories require authentication and explicit access.
Q: How fresh is the data?
A: Data is extracted in real-time from Docker Hub's API. The extractedAt field shows exactly when each record was captured.
Limitations
- Only public repositories can be scraped
- Rate limiting is applied (500ms between requests)
- Maximum 1000 images per search operation
- Some very old images may have incomplete metadata
- Tag extraction limited to 100 tags per image maximum
- Architecture detection depends on image manifest availability
Support
Open an issue on this Actor's Apify page for support, feature requests, or bug reports.
Legal
This Actor respects Docker Hub's Terms of Service and API rate limits. Users are responsible for complying with Docker Hub's terms and applicable data protection regulations.