GitHub Scraper

Extract data from GitHub — repository details, developer profiles, trending repos, and search results. Stars, forks, languages, topics, and more. No API key needed.


Developer

Stas Persiianenko

Maintained by Community


Scrape data from GitHub — repositories, developer profiles, trending repos, and search results. Get stars, forks, languages, topics, licenses, followers, and more.

What does GitHub Scraper do?

GitHub Scraper extracts structured data from GitHub using its public API and web pages. It supports four modes:

  • Repository details — Full metadata for specific repos (stars, forks, topics, license, dates)
  • Developer profiles — Bio, followers, location, company, repos count for any user
  • Trending repositories — Today's/week's/month's hottest repos with star velocity
  • Search repositories — Find repos by keyword, sorted by stars

No GitHub API token required. Works with GitHub's public endpoints.
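As a sketch, each of the four modes maps to a different input payload. Field names follow the input parameters documented in this README; the specific URLs and query values below are illustrative, not required:

```json
{ "mode": "repos", "urls": ["https://github.com/facebook/react"] }

{ "mode": "profiles", "urls": ["https://github.com/torvalds"] }

{ "mode": "search", "searchQuery": "web scraping", "maxResults": 50 }

{ "mode": "trending", "trendingSince": "weekly", "trendingLanguage": "python" }
```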

Why scrape GitHub?

GitHub hosts 400M+ repositories and 100M+ developers. It's the primary source for:

  • 📊 Tech trend analysis — Track which languages and frameworks are gaining traction
  • 🔍 Competitive intelligence — Monitor competitor repos, star growth, and release cadence
  • 📈 Developer recruiting — Find active developers by language, location, and contribution history
  • 🏗️ Open source research — Analyze licensing, dependency patterns, and community health
  • 📰 Content creation — Curate trending repos for newsletters and social media

Use cases

  • Newsletter creators sharing weekly trending repos
  • VCs and investors tracking open-source momentum
  • Hiring managers building candidate lists from active contributors
  • Researchers studying open-source ecosystem dynamics
  • DevRel teams monitoring mentions and competitive landscape
  • Developers discovering new tools and libraries

How to scrape GitHub

  1. Go to GitHub Scraper on Apify Store
  2. Choose a mode: repos, profiles, trending, or search
  3. Enter GitHub URLs or a search query, depending on the mode
  4. Set the max results limit
  5. Click Start and wait for results
  6. Download data as JSON, CSV, or Excel

Data you can extract

Repository data

| Field | Type | Description |
|---|---|---|
| fullName | string | Owner/repo (e.g., facebook/react) |
| description | string | Repo description |
| stars | number | Star count |
| forks | number | Fork count |
| watchers | number | Watcher count |
| openIssues | number | Open issue count |
| language | string | Primary language |
| topics | array | Topic tags |
| license | string | License (MIT, Apache-2.0, etc.) |
| isArchived | boolean | Whether the repo is archived |
| createdAt | string | Creation date |
| updatedAt | string | Last update date |
| size | number | Repo size in KB |
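To make the schema concrete, here is a minimal sketch of how a raw GitHub REST API repository payload could map onto the repository fields above. The left-hand names come from api.github.com; the mapping function is an illustration, not the actor's actual source:

```python
# Sketch (assumption, not the actor's code): map a trimmed GitHub REST API
# repository payload onto the actor's output field names.
raw = {
    "full_name": "facebook/react",
    "description": "The library for web and native user interfaces.",
    "stargazers_count": 243711,
    "forks_count": 50668,
    "open_issues_count": 1150,
    "language": "JavaScript",
    "topics": ["react", "ui"],
    "license": {"spdx_id": "MIT"},
    "archived": False,
    "size": 942058,  # KB, as reported by the API
}

def to_actor_record(raw: dict) -> dict:
    """Rename GitHub API fields to the actor's output schema."""
    return {
        "fullName": raw["full_name"],
        "description": raw["description"],
        "stars": raw["stargazers_count"],
        "forks": raw["forks_count"],
        "openIssues": raw["open_issues_count"],
        "language": raw["language"],
        "topics": raw["topics"],
        "license": (raw.get("license") or {}).get("spdx_id"),
        "isArchived": raw["archived"],
        "size": raw["size"],
    }

record = to_actor_record(raw)
print(record["fullName"], record["stars"], record["license"])
```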

Profile data

| Field | Type | Description |
|---|---|---|
| username | string | GitHub username |
| name | string | Display name |
| bio | string | Profile bio |
| company | string | Company |
| location | string | Location |
| followers | number | Follower count |
| following | number | Following count |
| publicRepos | number | Public repo count |
| blog | string | Website URL |
| twitterUsername | string | X/Twitter handle |
Trending repository data

| Field | Type | Description |
|---|---|---|
| fullName | string | Owner/repo |
| description | string | Repo description |
| language | string | Primary language |
| stars | number | Total stars |
| starsToday | number | Stars gained in the period |
| forks | number | Fork count |
| builtBy | array | Top contributors with avatars |

Input parameters

| Parameter | Type | Default | Description |
|---|---|---|---|
| mode | string | "trending" | Mode: repos, profiles, trending, search |
| urls | array | [] | GitHub URLs (for repos/profiles mode) |
| searchQuery | string | "" | Search query (for search mode) |
| trendingSince | string | "daily" | Trending period: daily, weekly, monthly |
| trendingLanguage | string | "" | Filter by language (e.g., python) |
| maxResults | integer | 25 | Max results to return |

Input example

{
  "mode": "trending",
  "trendingSince": "daily",
  "maxResults": 25
}

Output example

Repository

{
  "name": "react",
  "fullName": "facebook/react",
  "owner": "facebook",
  "description": "The library for web and native user interfaces.",
  "url": "https://github.com/facebook/react",
  "homepageUrl": "https://react.dev",
  "language": "JavaScript",
  "stars": 243711,
  "forks": 50668,
  "watchers": 243711,
  "openIssues": 1150,
  "topics": ["declarative", "frontend", "javascript", "library", "react", "ui"],
  "license": "MIT",
  "isArchived": false,
  "isFork": false,
  "defaultBranch": "main",
  "createdAt": "2013-05-24T16:15:54Z",
  "updatedAt": "2026-03-08T09:17:07Z",
  "pushedAt": "2026-03-05T15:52:24Z",
  "size": 942058,
  "scrapedAt": "2026-03-08T09:45:54.218Z"
}

Profile

{
  "username": "torvalds",
  "name": "Linus Torvalds",
  "bio": null,
  "company": "Linux Foundation",
  "location": "Portland, OR",
  "followers": 289246,
  "following": 0,
  "publicRepos": 11,
  "avatarUrl": "https://avatars.githubusercontent.com/u/1024025?v=4",
  "url": "https://github.com/torvalds",
  "scrapedAt": "2026-03-08T09:45:50.123Z"
}

Pricing

GitHub Scraper uses pay-per-event pricing:

| Event | Price |
|---|---|
| Run started | $0.005 |
| Repository or profile extracted | $0.003 per item |

Cost examples

| Scenario | Items | Cost |
|---|---|---|
| Daily trending (25 repos) | 25 | ~$0.08 |
| 10 repo details | 10 | ~$0.04 |
| Search (50 results) | 50 | ~$0.16 |

Apify's free plan includes $5/month in platform credits — enough for ~60 trending scrapes.
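A quick sanity check of these numbers, using the per-event prices above:

```python
# Estimate run cost: $0.005 per run start plus $0.003 per extracted item.
RUN_START_USD = 0.005
PER_ITEM_USD = 0.003

def run_cost(items: int) -> float:
    """Estimated cost in USD for a single run that extracts `items` results."""
    return RUN_START_USD + items * PER_ITEM_USD

print(round(run_cost(25), 3))  # daily trending (25 repos): 0.08
print(round(run_cost(50), 3))  # search (50 results): 0.155, i.e. ~$0.16
print(int(5 / run_cost(25)))   # daily trending runs per $5 free credit: 62
```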

Using GitHub Scraper with the Apify API

Node.js

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

const run = await client.actor('automation-lab/github-scraper').call({
  mode: 'trending',
  trendingSince: 'daily',
  maxResults: 25,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((repo) => {
  console.log(`${repo.fullName}: ${repo.stars} stars (+${repo.starsToday} today)`);
});

Python

from apify_client import ApifyClient

client = ApifyClient('YOUR_API_TOKEN')

run = client.actor('automation-lab/github-scraper').call(run_input={
    'mode': 'trending',
    'trendingSince': 'daily',
    'maxResults': 25,
})

items = client.dataset(run['defaultDatasetId']).list_items().items
for repo in items:
    print(f"{repo['fullName']}: {repo['stars']} stars (+{repo['starsToday']} today)")

cURL

curl "https://api.apify.com/v2/acts/automation-lab~github-scraper/run-sync-get-dataset-items?token=YOUR_API_TOKEN" \
  -X POST \
  -H "Content-Type: application/json" \
  -d '{"mode": "trending", "trendingSince": "daily", "maxResults": 10}'

Integrations

GitHub Scraper works with all Apify integrations:

  • Scheduled runs — Track trending repos daily or weekly
  • Webhooks — Get notified when a scrape completes
  • Google Sheets — Export repos and profiles to spreadsheets
  • Slack — Post trending repos to your team's channel
  • Zapier / Make — Automate workflows with GitHub data

Tips

  • 📈 Track trending daily — Schedule runs to build a history of trending repos
  • 🔍 Use search for competitive analysis — Search for keywords in your domain
  • 👥 Profile scraping — Great for building lists of developers by location or company
  • 🏷️ Filter by language — Use trendingLanguage to focus on specific tech stacks
  • Rate limits — The actor handles GitHub API rate limits automatically with retries
  • 📊 Combine modes — Run trending to discover repos, then repos mode for full details
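The "combine modes" tip can be sketched like this. The two sample items below stand in for a real trending run's dataset; in practice you would read them via `client.dataset(...).list_items()` as in the API examples above, then start a second run with the generated input:

```python
# Build a "repos" mode input from trending results to fetch full details.
# Sample data below is illustrative, not a real trending run's output.
trending_items = [
    {"fullName": "facebook/react", "stars": 243711},
    {"fullName": "vercel/next.js", "stars": 120000},
]

repos_input = {
    "mode": "repos",
    "urls": [f"https://github.com/{item['fullName']}" for item in trending_items],
    "maxResults": len(trending_items),
}
print(repos_input["urls"])
```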

GitHub provides a public REST API specifically designed for programmatic access. This scraper uses that API and publicly accessible web pages. It does not bypass authentication, rate limits, or access private data. Always review GitHub's Terms of Service and API usage policies.

FAQ

Do I need a GitHub API token? No. The scraper works without authentication using GitHub's public API (60 requests/hour per IP). For higher rate limits, a future version may support optional token input.

How many trending repos are shown? GitHub's trending page shows up to 25 repos per language/period combination.

Can I search for users or organizations? Currently, search mode finds repositories. Profile mode accepts direct profile URLs. User search may be added in a future version.

What about private repos? The scraper only accesses public data. Private repos are not visible without authentication.