Kaggle Datasets Scraper
Pricing
$1.50 / 1,000 datasets extracted
Developer
Pierrick McD0nald
Extract datasets, competitions, and user profiles from Kaggle with comprehensive metadata. Perfect for data science research, competitive analysis, and building ML training datasets.
Features
- Dataset Search & Extraction — Find datasets by keywords with full metadata
- User Profile Scraping — Extract public datasets from any Kaggle user
- Competition Data — Get active and past competition information
- Comprehensive Metadata — Downloads, votes, usability ratings, tags, licenses
- Smart Rate Limiting — Configurable delays to respect Kaggle's servers
- Pay-Per-Event (PPE) Pricing — Pay only for what you extract
Use Cases
Market Research for Data Scientists
Analyze dataset trends, popular topics, and download statistics to identify gaps in available data. Track which machine learning domains are gaining traction based on dataset popularity metrics.
Competitive Intelligence
Monitor what datasets competitors and industry leaders are publishing. Track user activity, publication frequency, and engagement metrics to benchmark your own data science presence.
ML Training Data Discovery
Find high-quality datasets with usability ratings and community engagement. Filter by tags, categories, and popularity to identify the best data for your machine learning projects.
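As a rough sketch of that kind of filtering, assuming the results have been exported as a list of dicts shaped like the record in the Output Example section: the function name `pick_candidates`, the thresholds, and the made-up sample rows are illustrative only and are not part of the Actor.

```python
from typing import Any

Record = dict[str, Any]


def pick_candidates(
    records: list[Record],
    required_tags: set[str],
    min_usability: float = 8.0,
    top_n: int = 5,
) -> list[Record]:
    """Keep records that carry every required tag and meet the usability floor,
    then rank the survivors by download count (highest first)."""
    matches = [
        r for r in records
        if required_tags <= set(r.get("tags", []))
        and r.get("usabilityRating", 0.0) >= min_usability
    ]
    matches.sort(key=lambda r: r.get("downloadCount", 0), reverse=True)
    return matches[:top_n]


# Hypothetical sample rows; only the first mirrors the documented example.
sample = [
    {"datasetSlug": "iris", "tags": ["classification", "beginner", "multiclass"],
     "usabilityRating": 9.8, "downloadCount": 850000},
    {"datasetSlug": "made-up-a", "tags": ["classification"],
     "usabilityRating": 6.5, "downloadCount": 900000},
    {"datasetSlug": "made-up-b", "tags": ["classification", "beginner"],
     "usabilityRating": 8.2, "downloadCount": 1200},
]

for row in pick_candidates(sample, {"classification", "beginner"}):
    print(row["datasetSlug"], row["downloadCount"])
```

Ranking by `downloadCount` is one possible popularity signal; `voteCount` or `kernelCount` could be swapped in the sort key.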
Academic Research
Track the evolution of open data in specific domains. Analyze licensing patterns, dataset sizes, and collaboration trends in the data science community.
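One way to look at licensing patterns over time is to tally the `license` field by creation year. The sketch below assumes records with the `license` and `createdAt` fields shown in the Output Example section; the helper name and the sample rows (including `hypothetical-license`) are invented for illustration.

```python
from collections import Counter
from datetime import datetime
from typing import Any


def license_counts_by_year(records: list[dict[str, Any]]) -> dict[int, Counter]:
    """Group records by the year in `createdAt` and count each license per year."""
    counts: dict[int, Counter] = {}
    for record in records:
        created = record.get("createdAt")
        if not created:
            continue  # skip rows without a creation timestamp
        # Replace the trailing "Z" so older Python versions can parse the value.
        year = datetime.fromisoformat(created.replace("Z", "+00:00")).year
        counts.setdefault(year, Counter())[record.get("license", "unknown")] += 1
    return counts


# Hypothetical rows for demonstration.
sample = [
    {"license": "CC0: Public Domain", "createdAt": "2016-01-15T00:00:00Z"},
    {"license": "CC0: Public Domain", "createdAt": "2021-06-01T00:00:00Z"},
    {"license": "hypothetical-license", "createdAt": "2021-09-12T00:00:00Z"},
]

for year, tally in sorted(license_counts_by_year(sample).items()):
    print(year, dict(tally))
```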
Input Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| searchQueries | array | No | - | Keywords to search for datasets |
| datasetUrls | array | No | - | Direct URLs to specific datasets |
| userProfiles | array | No | - | Kaggle user profile URLs to scrape |
| searchType | string | No | datasets | Type: datasets or competitions |
| sortBy | string | No | hottest | Sort: hottest, votes, updated, active |
| maxResults | integer | No | 50 | Max results per query (1-1000) |
| maxDatasetsPerUser | integer | No | 10 | Datasets to extract per user |
| includeDatasetDetails | boolean | No | true | Fetch full dataset details |
| includeUserStats | boolean | No | true | Extract user statistics |
| rateLimitDelayMs | integer | No | 1000 | Delay between requests (ms) |
| proxyConfiguration | object | No | Apify proxy | Proxy settings |
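The parameters in the table can be assembled into an input object before a run. The following is a minimal sketch that applies the documented defaults and the documented 1-1000 range for `maxResults`; the helper `build_run_input` is hypothetical, and the non-negative check on `rateLimitDelayMs` is an assumption rather than a documented rule. Submitting the input to the Actor is left out.

```python
import json
from typing import Any

SEARCH_TYPES = {"datasets", "competitions"}
SORT_OPTIONS = {"hottest", "votes", "updated", "active"}


def build_run_input(
    search_queries: list[str],
    search_type: str = "datasets",
    sort_by: str = "hottest",
    max_results: int = 50,
    rate_limit_delay_ms: int = 1000,
) -> dict[str, Any]:
    """Return an input dict using the parameter names from the table above."""
    if search_type not in SEARCH_TYPES:
        raise ValueError(f"searchType must be one of {sorted(SEARCH_TYPES)}")
    if sort_by not in SORT_OPTIONS:
        raise ValueError(f"sortBy must be one of {sorted(SORT_OPTIONS)}")
    if not 1 <= max_results <= 1000:
        raise ValueError("maxResults must be between 1 and 1000")
    if rate_limit_delay_ms < 0:  # assumption: negative delays make no sense
        raise ValueError("rateLimitDelayMs must be non-negative")
    return {
        "searchQueries": search_queries,
        "searchType": search_type,
        "sortBy": sort_by,
        "maxResults": max_results,
        "rateLimitDelayMs": rate_limit_delay_ms,
    }


run_input = build_run_input(["iris", "time series"], sort_by="votes", max_results=100)
print(json.dumps(run_input, indent=2))
```

Validating locally catches out-of-range values before a run is started.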
Output Example
{
  "datasetId": 12345,
  "datasetSlug": "iris",
  "ownerSlug": "uciml",
  "title": "Iris Species",
  "subtitle": "Classify iris plants into three species",
  "description": "This famous iris dataset...",
  "url": "https://www.kaggle.com/datasets/uciml/iris",
  "thumbnailUrl": "https://storage.googleapis.com/...",
  "viewCount": 1250000,
  "downloadCount": 850000,
  "voteCount": 2340,
  "kernelCount": 456,
  "usabilityRating": 9.8,
  "size": "3.4 MB",
  "fileCount": 1,
  "lastUpdated": "2024-01-15T10:30:00Z",
  "createdAt": "2016-01-15T00:00:00Z",
  "tags": ["classification", "beginner", "multiclass"],
  "categories": ["biology", "environment"],
  "license": "CC0: Public Domain",
  "ownerName": "UCI Machine Learning",
  "ownerAvatarUrl": "https://storage.googleapis.com/...",
  "extractedAt": "2026-03-31T08:30:00Z"
}
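A record like this can be reduced to a few derived values for analysis. The sketch below parses a trimmed copy of the example (only the fields it uses) and computes a download-to-view ratio and the number of days between `lastUpdated` and `extractedAt`. The `DatasetSummary` class and both derived metrics are illustrative choices, not fields the Actor produces.

```python
import json
from dataclasses import dataclass
from datetime import datetime


def parse_utc(ts: str) -> datetime:
    """Parse an ISO 8601 timestamp with a trailing 'Z' into an aware datetime."""
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


@dataclass(frozen=True)
class DatasetSummary:
    ref: str                  # "<ownerSlug>/<datasetSlug>"
    title: str
    downloads_per_view: float
    days_since_update: int    # whole days between lastUpdated and extractedAt


def summarize(raw: str) -> DatasetSummary:
    item = json.loads(raw)
    views = item["viewCount"]
    age = parse_utc(item["extractedAt"]) - parse_utc(item["lastUpdated"])
    return DatasetSummary(
        ref=f'{item["ownerSlug"]}/{item["datasetSlug"]}',
        title=item["title"],
        downloads_per_view=item["downloadCount"] / views if views else 0.0,
        days_since_update=age.days,
    )


# Trimmed copy of the documented example record.
raw_record = """
{"datasetSlug": "iris", "ownerSlug": "uciml", "title": "Iris Species",
 "viewCount": 1250000, "downloadCount": 850000,
 "lastUpdated": "2024-01-15T10:30:00Z", "extractedAt": "2026-03-31T08:30:00Z"}
"""

print(summarize(raw_record))
```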
Pricing
Pay-per-event pricing:
- $0.0015 per item extracted (dataset, competition, or user profile)
Example costs:
- 100 datasets: $0.15
- 1,000 datasets: $1.50
- 10,000 datasets: $15.00
No monthly fees. No minimums. Pay only for what you use.
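The example costs follow directly from the per-item price, so a budget check is simple arithmetic. This is a small sketch using `Decimal` to avoid floating-point rounding; the function names are made up for illustration, and the price constant is taken from the list above.

```python
from decimal import Decimal

PRICE_PER_ITEM_USD = Decimal("0.0015")  # per-item price stated in this section


def estimate_cost(items: int) -> Decimal:
    """Cost in USD for extracting `items` datasets, competitions, or profiles."""
    if items < 0:
        raise ValueError("items must be non-negative")
    return PRICE_PER_ITEM_USD * items


def max_items_for_budget(budget_usd: Decimal) -> int:
    """Largest whole number of items that fits within the budget."""
    if budget_usd < 0:
        raise ValueError("budget must be non-negative")
    return int(budget_usd // PRICE_PER_ITEM_USD)


for n in (100, 1_000, 10_000):
    print(f"{n} items: ${estimate_cost(n):.2f}")

print("Items within a $5.00 budget:", max_items_for_budget(Decimal("5.00")))
```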
FAQ
Q: Do I need a Kaggle account? A: No. This scraper extracts publicly available data from Kaggle without requiring authentication.
Q: What data can I extract? A: Public dataset metadata, competition information, and user profile statistics. Private datasets and non-public user data are not accessible.
Q: Is there a rate limit? A: The scraper includes configurable rate limiting (default 1 second between requests) to respect Kaggle's servers. You can adjust this in the input settings.
Q: Can I extract actual dataset files? A: No, this scraper extracts metadata only. To download dataset files, visit the Kaggle dataset page directly.
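To illustrate what a fixed delay between requests means in practice (see the rate-limit question above), here is a generic sketch of a limiter that enforces a minimum gap between calls. It is not the Actor's implementation, which is not documented here; the class name `FixedDelayLimiter` and its interface are assumptions made for the example.

```python
import time
from typing import Callable, Optional


class FixedDelayLimiter:
    """Ensure at least `delay_ms` elapses between consecutive calls to wait()."""

    def __init__(
        self,
        delay_ms: int = 1000,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        if delay_ms < 0:
            raise ValueError("delay_ms must be non-negative")
        self._delay_s = delay_ms / 1000
        self._clock = clock
        self._sleep = sleep
        self._last: Optional[float] = None  # time of the previous call, if any

    def wait(self) -> float:
        """Sleep for whatever remains of the delay; return the seconds slept."""
        pause = 0.0
        if self._last is not None:
            elapsed = self._clock() - self._last
            pause = max(0.0, self._delay_s - elapsed)
            if pause > 0:
                self._sleep(pause)
        self._last = self._clock()
        return pause


# Short delay so the demo finishes quickly; the documented default is 1000 ms.
limiter = FixedDelayLimiter(delay_ms=50)
for i in range(3):
    slept = limiter.wait()
    print(f"request {i}: slept {slept:.3f}s before sending")
```

With a fixed delay of this kind, a run of N sequential requests spends at least (N - 1) × delay waiting, so raising `rateLimitDelayMs` lengthens runs roughly in proportion.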
Limitations
- Extracts metadata only, not actual data files
- Public content only (private datasets not accessible)
- Subject to Kaggle's terms of service
- Rate limits apply to prevent server overload
- Some user statistics may be limited for privacy
Support
Open an issue on this Actor's Apify page for questions or feature requests.
Built for data scientists who need reliable access to Kaggle metadata for research and analysis.