Github Topic Repositories Scraper avatar
Github Topic Repositories Scraper

Pricing

$14.99/month + usage

Go to Apify Store
Github Topic Repositories Scraper

Github Topic Repositories Scraper

Extract trending GitHub repositories by topic with name, owner, stars, description, language, URLs. Filter language or minimum stars. Monitor developer trends, discover popular projects, analyze tech stacks. Export to JSON/CSV for research, dashboards, competitive analysis. Fast Playwright scraper.

Pricing

$14.99/month + usage

Rating

0.0

(0)

Developer

Brennan Crawford

Brennan Crawford

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

16 hours ago

Last modified

Share

GitHub Topics Scraper – Export Repositories by Topic

Extract repositories from GitHub topic pages. Export JSON/CSV for analysis, research, or monitoring.

Features

  • Collects repository name, owner, description, primary language, stars, forks, last updated, URLs
  • Filters by language and minimum stars
  • Playwright + async Actor implementation

Input

  • topicSlug (required): e.g., machine-learning
  • language (optional): e.g., Python
  • minStars (optional): integer
  • maxPages (default 1): 1–5
  • userAgent (optional)
  • maxConcurrency (default 5)
  • timeoutSec (default 30)

Output

Each dataset item:

{
"name": "scikit-learn",
"owner": "scikit-learn",
"repoUrl": "https://github.com/scikit-learn/scikit-learn",
"topicUrl": "https://github.com/topics/machine-learning?o=desc&s=stars&p=1",
"description": "Machine learning in Python",
"language": "Python",
"stars": 56000,
"forks": 25000,
"lastUpdated": "2025-01-01T00:00:00Z"
}

Local Testing

  1. pip install -r requirements.txt
  2. python apify_actor.py with an INPUT.json or by setting environment input via Apify CLI.
  3. Verify output in the default dataset directory.

Deployment

See DEPLOY.md for Apify deployment steps.