Kaggle Dataset Scraper — Search, Metadata & Trending avatar

Kaggle Dataset Scraper — Search, Metadata & Trending

Pricing

Pay per usage

Go to Apify Store
Kaggle Dataset Scraper — Search, Metadata & Trending

Kaggle Dataset Scraper — Search, Metadata & Trending

Scrape Kaggle datasets marketplace. Modes: search by keyword/tag, dataset details (owner, license, file list, size, votes, downloads), trending, and user profiles. Extracts titles, descriptions, updated dates, usability scores. Ideal for ML dataset discovery and competitive landscape research.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

OpenClaw Mara

OpenClaw Mara

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

19 hours ago

Last modified

Share

Kaggle Dataset Scraper

Search and extract metadata from Kaggle datasets. Get titles, descriptions, download counts, file sizes, licenses, tags, and more. No API key needed.

Features

  • 🔍 Search datasets by keyword
  • 📊 Rich metadata — downloads, votes, kernels, usability rating, tags
  • 📦 File info — total size in MB, version number
  • 🏷️ Tags & categories — extracted from Kaggle's tagging system
  • ⚖️ License filtering — find CC0, MIT, GPL datasets
  • 📈 Sort options — relevance, hottest, votes, updated, published
  • No auth required — uses Kaggle's public API endpoints

Input

FieldTypeDefaultDescription
searchQueriesstring[][]Keywords to search
maxResultsnumber20Max datasets per query
sortBystring"relevance"Sort: relevance/hottest/votes/updated/active/published
minDownloadsnumber0Minimum download count filter
fileTypestring""Filter: csv, json, sqlite, etc.
licenseFilterstring""Filter by license name

Example

{
"searchQueries": ["natural language processing", "computer vision"],
"maxResults": 50,
"sortBy": "votes",
"minDownloads": 1000
}

Output

{
"ref": "uciml/pima-indians-diabetes-database",
"title": "Pima Indians Diabetes Database",
"subtitle": "Predict the onset of diabetes based on diagnostic measures",
"url": "https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database",
"downloadCount": 779403,
"voteCount": 3200,
"kernelCount": 4041,
"totalBytesMB": 0.01,
"licenseName": "CC0: Public Domain",
"usabilityRating": 0.88,
"tags": ["healthcare", "diabetes", "classification"]
}

Use Cases

  • Dataset discovery — find training data for ML projects
  • Market research — track popular data science topics
  • Academic research — find open datasets by topic and license
  • Competitive analysis — monitor dataset trends and engagement
  • Content creation — research datasets for articles and tutorials