Kaggle Dataset Scraper

Pricing

Pay per usage

Rating

0.0

(0)

Developer

OpenClaw Mara

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 20 hours ago

Search and extract metadata from Kaggle datasets. Get titles, descriptions, download counts, file sizes, licenses, tags, and more. No API key needed.

Features

  • 🔍 Search datasets by keyword
  • 📊 Rich metadata — downloads, votes, kernels, usability rating, tags
  • 📦 File info — total size in MB, version number
  • 🏷️ Tags & categories — extracted from Kaggle's tagging system
  • ⚖️ License filtering — find CC0, MIT, GPL datasets
  • 📈 Sort options — relevance, hottest, votes, updated, active, published
  • No auth required — uses Kaggle's public API endpoints

Input

Field          Type      Default      Description
searchQueries  string[]  []           Keywords to search
maxResults     number    20           Max datasets per query
sortBy         string    "relevance"  Sort: relevance/hottest/votes/updated/active/published
minDownloads   number    0            Minimum download count filter
fileType       string    ""           Filter: csv, json, sqlite, etc.
licenseFilter  string    ""           Filter by license name
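
The defaults in the table can be captured in a small helper that fills in missing fields before a run. This is a minimal Python sketch, not part of the Actor: `build_input`, `DEFAULTS`, and `SORT_OPTIONS` are hypothetical names, and the range checks are client-side assumptions rather than documented Actor behavior.

```python
import copy
from typing import Any

# Defaults copied from the input table above.
DEFAULTS: dict[str, Any] = {
    "searchQueries": [],
    "maxResults": 20,
    "sortBy": "relevance",
    "minDownloads": 0,
    "fileType": "",
    "licenseFilter": "",
}

# Sort values listed in the sortBy description.
SORT_OPTIONS = {"relevance", "hottest", "votes", "updated", "active", "published"}


def build_input(**overrides: Any) -> dict[str, Any]:
    """Merge overrides into the documented defaults and run basic sanity checks."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown input fields: {sorted(unknown)}")
    # Deep copy so the shared default list is never mutated by callers.
    run_input = {**copy.deepcopy(DEFAULTS), **overrides}
    if run_input["sortBy"] not in SORT_OPTIONS:
        raise ValueError(f"unsupported sortBy: {run_input['sortBy']!r}")
    # Assumed limits: the source does not state what the Actor itself enforces.
    if run_input["maxResults"] < 1 or run_input["minDownloads"] < 0:
        raise ValueError("maxResults must be >= 1 and minDownloads >= 0")
    return run_input
```

Calling `build_input(searchQueries=["computer vision"], sortBy="votes")` returns a complete input object with the remaining fields set to their table defaults.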

Example

{
  "searchQueries": ["natural language processing", "computer vision"],
  "maxResults": 50,
  "sortBy": "votes",
  "minDownloads": 1000
}

Output

{
  "ref": "uciml/pima-indians-diabetes-database",
  "title": "Pima Indians Diabetes Database",
  "subtitle": "Predict the onset of diabetes based on diagnostic measures",
  "url": "https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database",
  "downloadCount": 779403,
  "voteCount": 3200,
  "kernelCount": 4041,
  "totalBytesMB": 0.01,
  "licenseName": "CC0: Public Domain",
  "usabilityRating": 0.88,
  "tags": ["healthcare", "diabetes", "classification"]
}
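
Records in this shape are easy to post-process after a run. The following Python sketch filters a list of output records by license, download count, and usability rating, then ranks them by downloads. `filter_datasets` is a hypothetical helper, and the case-insensitive substring match on `licenseName` is this sketch's own choice, not necessarily how the Actor's `licenseFilter` works.

```python
from typing import Any


def filter_datasets(
    records: list[dict[str, Any]],
    license_contains: str = "",
    min_downloads: int = 0,
    min_usability: float = 0.0,
) -> list[dict[str, Any]]:
    """Keep records that pass every filter, most-downloaded first."""
    needle = license_contains.lower()
    kept = [
        r for r in records
        # Treat a missing or null field as empty/zero instead of failing.
        if needle in (r.get("licenseName") or "").lower()
        and (r.get("downloadCount") or 0) >= min_downloads
        and (r.get("usabilityRating") or 0.0) >= min_usability
    ]
    return sorted(kept, key=lambda r: r.get("downloadCount") or 0, reverse=True)


if __name__ == "__main__":
    # First record uses values from the sample output above; the second is made up.
    records = [
        {"ref": "example/hypothetical-dataset", "downloadCount": 10,
         "licenseName": "MIT", "usabilityRating": 0.5},
        {"ref": "uciml/pima-indians-diabetes-database", "downloadCount": 779403,
         "licenseName": "CC0: Public Domain", "usabilityRating": 0.88},
    ]
    for r in filter_datasets(records, license_contains="cc0", min_downloads=1000):
        print(r["ref"], r["downloadCount"])
```

Filtering client-side like this keeps the raw run output intact, so the same results can be re-sliced with different thresholds without re-running the Actor.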

Use Cases

  • Dataset discovery — find training data for ML projects
  • Market research — track popular data science topics
  • Academic research — find open datasets by topic and license
  • Competitive analysis — monitor dataset trends and engagement
  • Content creation — research datasets for articles and tutorials