Kaggle Datasets Scraper avatar

Kaggle Datasets Scraper

Pricing

$1.50 / 1,000 dataset extracteds

Go to Apify Store
Kaggle Datasets Scraper

Kaggle Datasets Scraper

Pricing

$1.50 / 1,000 dataset extracteds

Rating

0.0

(0)

Developer

Pierrick McD0nald

Pierrick McD0nald

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Extract datasets, competitions, and user profiles from Kaggle with comprehensive metadata. Perfect for data science research, competitive analysis, and building ML training datasets.

Features

  • Dataset Search & Extraction — Find datasets by keywords with full metadata
  • User Profile Scraping — Extract public datasets from any Kaggle user
  • Competition Data — Get active and past competition information
  • Comprehensive Metadata — Downloads, votes, usability ratings, tags, licenses
  • Smart Rate Limiting — Configurable delays to respect Kaggle's servers
  • PPE Pricing — Pay only for what you extract

Use Cases

Market Research for Data Scientists

Analyze dataset trends, popular topics, and download statistics to identify gaps in available data. Track which machine learning domains are gaining traction based on dataset popularity metrics.

Competitive Intelligence

Monitor what datasets competitors and industry leaders are publishing. Track user activity, publication frequency, and engagement metrics to benchmark your own data science presence.

ML Training Data Discovery

Find high-quality datasets with usability ratings and community engagement. Filter by tags, categories, and popularity to identify the best data for your machine learning projects.

Academic Research

Track the evolution of open data in specific domains. Analyze licensing patterns, dataset sizes, and collaboration trends in the data science community.

Input Parameters

ParameterTypeRequiredDefaultDescription
searchQueriesarrayNo-Keywords to search for datasets
datasetUrlsarrayNo-Direct URLs to specific datasets
userProfilesarrayNo-Kaggle user profile URLs to scrape
searchTypestringNodatasetsType: datasets or competitions
sortBystringNohottestSort: hottest, votes, updated, active
maxResultsintegerNo50Max results per query (1-1000)
maxDatasetsPerUserintegerNo10Datasets to extract per user
includeDatasetDetailsbooleanNotrueFetch full dataset details
includeUserStatsbooleanNotrueExtract user statistics
rateLimitDelayMsintegerNo1000Delay between requests (ms)
proxyConfigurationobjectNoApify proxyProxy settings

Output Example

{
"datasetId": 12345,
"datasetSlug": "iris",
"ownerSlug": "uciml",
"title": "Iris Species",
"subtitle": "Classify iris plants into three species",
"description": "This famous iris dataset...",
"url": "https://www.kaggle.com/datasets/uciml/iris",
"thumbnailUrl": "https://storage.googleapis.com/...",
"viewCount": 1250000,
"downloadCount": 850000,
"voteCount": 2340,
"kernelCount": 456,
"usabilityRating": 9.8,
"size": "3.4 MB",
"fileCount": 1,
"lastUpdated": "2024-01-15T10:30:00Z",
"createdAt": "2016-01-15T00:00:00Z",
"tags": ["classification", "beginner", "multiclass"],
"categories": ["biology", "environment"],
"license": "CC0: Public Domain",
"ownerName": "UCI Machine Learning",
"ownerAvatarUrl": "https://storage.googleapis.com/...",
"extractedAt": "2026-03-31T08:30:00Z"
}

Pricing

Pay-per-event pricing:

  • $0.0015 per item extracted (dataset, competition, or user profile)

Example costs:

  • 100 datasets: $0.15
  • 1,000 datasets: $1.50
  • 10,000 datasets: $15.00

No monthly fees. No minimums. Pay only for what you use.

FAQ

Q: Do I need a Kaggle account? A: No. This scraper extracts publicly available data from Kaggle without requiring authentication.

Q: What data can I extract? A: Public dataset metadata, competition information, and user profile statistics. Private datasets and user data are not accessible.

Q: Is there a rate limit? A: The scraper includes configurable rate limiting (default 1 second between requests) to respect Kaggle's servers. You can adjust this in the input settings.

Q: Can I extract actual dataset files? A: No, this scraper extracts metadata only. To download dataset files, visit the Kaggle dataset page directly.

Limitations

  • Extracts metadata only, not actual data files
  • Public content only (private datasets not accessible)
  • Subject to Kaggle's terms of service
  • Rate limits apply to prevent server overload
  • Some user statistics may be limited for privacy

Support

Open an issue on this Actor's Apify page for questions or feature requests.


Built for data scientists who need reliable access to Kaggle metadata for research and analysis.