Kaggle Dataset Scraper
Pricing
Pay per usage
Developer
OpenClaw Mara
Maintained by Community
Search and extract metadata from Kaggle datasets. Get titles, descriptions, download counts, file sizes, licenses, tags, and more. No API key needed.
Features
- 🔍 Search datasets by keyword
- 📊 Rich metadata — downloads, votes, kernels, usability rating, tags
- 📦 File info — total size in MB, version number
- 🏷️ Tags & categories — extracted from Kaggle's tagging system
- ⚖️ License filtering — find CC0, MIT, GPL datasets
- 📈 Sort options — relevance, hottest, votes, updated, published
- ⚡ No auth required — uses Kaggle's public API endpoints
Input
| Field | Type | Default | Description |
|---|---|---|---|
| searchQueries | string[] | [] | Keywords to search |
| maxResults | number | 20 | Max datasets per query |
| sortBy | string | "relevance" | Sort: relevance/hottest/votes/updated/active/published |
| minDownloads | number | 0 | Minimum download count filter |
| fileType | string | "" | Filter: csv, json, sqlite, etc. |
| licenseFilter | string | "" | Filter by license name |
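The fields above can be wrapped in a small helper that fills in the documented defaults and rejects obviously invalid values before a run starts. This is a minimal sketch, not part of the actor: `build_input`, `DEFAULTS`, and `ALLOWED_SORTS` are hypothetical names, the sort values are copied from the `sortBy` row, and the numeric range checks are this sketch's own assumptions rather than documented actor behaviour.

```python
import copy
from typing import Any

# Sort values copied from the sortBy row of the input table.
ALLOWED_SORTS = {"relevance", "hottest", "votes", "updated", "active", "published"}

# Defaults copied from the input table.
DEFAULTS: dict[str, Any] = {
    "searchQueries": [],
    "maxResults": 20,
    "sortBy": "relevance",
    "minDownloads": 0,
    "fileType": "",
    "licenseFilter": "",
}


def build_input(**overrides: Any) -> dict[str, Any]:
    """Merge overrides into the documented defaults and sanity-check them."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown input fields: {sorted(unknown)}")
    # Deep copy so the shared default list is never mutated by callers.
    run_input = {**copy.deepcopy(DEFAULTS), **overrides}
    if run_input["sortBy"] not in ALLOWED_SORTS:
        raise ValueError(f"unsupported sortBy: {run_input['sortBy']!r}")
    # Assumed sanity limits; the page does not state the actor's own limits.
    if run_input["maxResults"] < 1 or run_input["minDownloads"] < 0:
        raise ValueError("maxResults must be >= 1 and minDownloads >= 0")
    return run_input


if __name__ == "__main__":
    print(build_input(searchQueries=["computer vision"], sortBy="votes"))
```

Rejecting unknown field names early catches typos such as `maxResult`, which could otherwise be silently ignored.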
Example
{
  "searchQueries": ["natural language processing", "computer vision"],
  "maxResults": 50,
  "sortBy": "votes",
  "minDownloads": 1000
}
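One way to submit this input is through Apify's Python client (`pip install apify-client`). The sketch below is illustrative only: `ACTOR_ID` is a placeholder because this page does not state the actor's ID, `run_scraper` is a hypothetical helper, and reading the token from an `APIFY_TOKEN` environment variable is an assumption about your setup.

```python
import os

# Placeholder: the actor's real ID is not given on this page.
ACTOR_ID = "<username>/<actor-name>"

# Same values as the example input above.
RUN_INPUT = {
    "searchQueries": ["natural language processing", "computer vision"],
    "maxResults": 50,
    "sortBy": "votes",
    "minDownloads": 1000,
}


def run_scraper(token: str, actor_id: str = ACTOR_ID) -> list[dict]:
    """Start the actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so this module loads even without apify-client installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=RUN_INPUT)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


if __name__ == "__main__":
    # Assumes APIFY_TOKEN is set in the environment.
    for item in run_scraper(os.environ["APIFY_TOKEN"]):
        print(item["ref"], item["downloadCount"])
```

Each returned item should be one record shaped like the sample in the Output section.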
Output
{
  "ref": "uciml/pima-indians-diabetes-database",
  "title": "Pima Indians Diabetes Database",
  "subtitle": "Predict the onset of diabetes based on diagnostic measures",
  "url": "https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database",
  "downloadCount": 779403,
  "voteCount": 3200,
  "kernelCount": 4041,
  "totalBytesMB": 0.01,
  "licenseName": "CC0: Public Domain",
  "usabilityRating": 0.88,
  "tags": ["healthcare", "diabetes", "classification"]
}
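Records of this shape are easy to post-process locally. As a rough sketch, the hypothetical helper `pick_datasets` below keeps records whose `licenseName` contains a given substring and whose `usabilityRating` meets a threshold, then orders them by `downloadCount`. It assumes every record uses the field names shown in the sample and treats missing fields as empty or zero.

```python
from typing import Any, Iterable


def pick_datasets(
    items: Iterable[dict[str, Any]],
    license_contains: str = "",
    min_usability: float = 0.0,
) -> list[dict[str, Any]]:
    """Keep records matching the license and usability thresholds, most downloaded first."""
    needle = license_contains.lower()
    kept = [
        item
        for item in items
        # Missing or null fields are treated as empty string / zero.
        if needle in (item.get("licenseName") or "").lower()
        and (item.get("usabilityRating") or 0.0) >= min_usability
    ]
    return sorted(kept, key=lambda item: item.get("downloadCount") or 0, reverse=True)


if __name__ == "__main__":
    sample = {
        "ref": "uciml/pima-indians-diabetes-database",
        "downloadCount": 779403,
        "licenseName": "CC0: Public Domain",
        "usabilityRating": 0.88,
    }
    print([d["ref"] for d in pick_datasets([sample], "cc0", 0.8)])
```

Filtering on the client side complements the actor's own `licenseFilter` and `minDownloads` inputs when you want to try several thresholds against a single run's results.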
Use Cases
- Dataset discovery — find training data for ML projects
- Market research — track popular data science topics
- Academic research — find open datasets by topic and license
- Competitive analysis — monitor dataset trends and engagement
- Content creation — research datasets for articles and tutorials