Pricing

from $0.01 / 1,000 results

Github Repositry Scraper

Scrape GitHub repos by URL, search, or trending. Extract stars, forks, topics, languages, contributors & more. No login needed.

Pricing

from $0.01 / 1,000 results

Rating

0.0

(0)

Developer

Amna Iftikhar

Actor stats

Bookmarked

Total users

Monthly active users

3 months ago

Last modified

GitHub Repository Scraper

Extract comprehensive data from GitHub repositories — by direct URL, keyword search, or trending. No login, no API keys required.

Perfect for competitive analysis, lead generation, market research, AI training data, and developer tooling pipelines.

🚀 3 Modes — Pick One

⚠️ Each mode uses different input fields. Only fill in fields for the mode you choose.

Mode	When to use it
`repos`	You already have specific GitHub URLs you want to scrape
`search`	You want to discover repos by keyword or language
`trending`	You want GitHub's trending repos right now

⚙️ Input by Mode

Mode: `repos` — Scrape specific repositories

{
  "mode": "repos",
  "repoUrls": [
    "https://github.com/facebook/react",
    "https://github.com/vercel/next.js"
  ],
  "maxResults": 10,
  "includeReadme": false
}

Field	Required	Description
`mode`	✅	Set to `"repos"`
`repoUrls`	✅	List of GitHub repo URLs to scrape
`maxResults`	optional	Max repos to scrape (default: 10)
`includeReadme`	optional	Also fetch README content (default: false)

Mode: `search` — Find repos by keyword

{
  "mode": "search",
  "searchQuery": "machine learning",
  "searchLanguage": "Python",
  "searchSort": "stars",
  "maxResults": 50,
  "includeReadme": false
}

Field	Required	Description
`mode`	✅	Set to `"search"`
`searchQuery`	✅	Keywords to search (e.g. `"web scraper"`)
`searchLanguage`	optional	Filter by language e.g. `"Python"`, `"JavaScript"`
`searchSort`	optional	Sort by `"stars"`, `"forks"`, or `"updated"` (default: `"stars"`)
`maxResults`	optional	Max repos to return, up to 300 (default: 10)
`includeReadme`	optional	Also fetch README content (default: false)

Mode: `trending` — Get GitHub's trending repos

{
  "mode": "trending",
  "trendingLanguage": "python",
  "trendingPeriod": "weekly",
  "maxResults": 25,
  "includeReadme": false
}

Field	Required	Description
`mode`	✅	Set to `"trending"`
`trendingLanguage`	optional	Filter by language e.g. `"python"`, `"rust"` — leave empty for all
`trendingPeriod`	optional	`"daily"`, `"weekly"`, or `"monthly"` (default: `"daily"`)
`maxResults`	optional	Max repos to return (default: 10)
`includeReadme`	optional	Also fetch README content (default: false)

📦 Output Fields

Each scraped repository returns:

{
  "url": "https://github.com/facebook/react",
  "fullName": "facebook/react",
  "owner": "facebook",
  "name": "react",
  "repoId": "10270250",
  "description": "The library for web and native user interfaces.",
  "website": "https://react.dev",
  "topics": ["react", "javascript", "library", "ui", "frontend"],
  "primaryLanguage": "JavaScript",
  "languages": { "JavaScript": "68.1%", "TypeScript": "29.0%" },
  "license": "MIT",
  "stars": 243937,
  "starsDisplay": "244k",
  "forks": 50761,
  "watchers": 6700,
  "openIssues": 809,
  "openPullRequests": 355,
  "commits": 21425,
  "contributors": 1734,
  "totalReleases": 118,
  "latestRelease": "19.2.4",
  "defaultBranch": "main",
  "lastCommitAt": "2026-01-26T18:29:43Z",
  "scrapedAt": "2026-03-13T10:00:00.000Z"
}

Enable includeReadme: true to also get readmeText and readmeHtml fields — useful for AI/LLM pipelines.

🎯 Use Cases

Market research — Track star growth and activity across competing repos
Lead generation — Find active contributors in a technology stack
AI training data — Bulk-collect repo descriptions, READMEs, and topics
Investment research — Monitor open-source adoption signals
Competitive intelligence — Benchmark your repo vs competitors

💰 Pricing

Pay Per Result — you only pay for repos successfully scraped.

Volume	Cost
10 repos	~$0.02
100 repos	~$0.20
1,000 repos	~$2.00

⚡ Performance

Uses Cheerio — no heavy browser, very low compute cost
Up to 3 concurrent requests
~50–100 repos/minute
No proxies needed for normal volumes

❓ FAQ

Can I use all input fields at once? No. Each mode uses its own fields. Set mode first, then only fill in fields for that mode — other fields are ignored.

Does this require a GitHub account or API key? No. Scrapes only public GitHub data, no login needed.

Can I scrape private repos? No — public repos only.

Can I schedule this to run daily? Yes. Use Apify's built-in scheduler with a cron expression.

Will I get blocked? Unlikely for normal volumes. The Actor uses proper headers and rate limiting. For 1000+ repos, enable Apify proxy.

Built with Apify SDK + Crawlee. Issues or feature requests? Leave a comment on the Actor page.

GitHub Scraper

automation-lab/github-scraper

Extract data from GitHub — repository details, developer profiles, trending repos, and search results. Stars, forks, languages, topics, and more. No API key needed.

Stas Persiianenko

GitHub Repository Scraper - Stars, Topics, Trending

logiover/github-repository-scraper

Scrape GitHub repos by search query and export stars, topics, forks & license to CSV/JSON. GitHub data export without an API key - trending repos scraper.

Logiover

GitHub Repos Scraper

gio21/github-repos-scraper

Search and scrape GitHub repositories. Extract stars, forks, language, license, topics, and more from the GitHub public API.

Gio

GitHub Repository Scraper — Stars, Forks, Languages & More

joyouscam35875/github-repo-scraper

Scrape GitHub repository data using the REST API v3. Get stars, forks, languages, topics, contributors, releases. Search repos by keyword. Perfect for tech stack analysis and competitive intelligence. $0.002/repo.

Ken Digital

GitHub Trending Repos Scraper

lulzasaur/github-trending-scraper

Scrape trending repositories from GitHub. Get repo names, descriptions, stars, forks, language, stars today, and contributors. Filter by language and time range.

lulz bot

GitHub Trending Scraper

pink_fence/GitHub-Trending-Scraper

Scrape GitHub Trending repos — name, description, stars, forks, language and contributors. Filter by programming language and time range: daily, weekly or monthly. No API key needed. Clean JSON output.

Moritz Knopp

Github Repos Scraper

velvety_bedbug/github-repos-scraper

Search and scrape GitHub repositories. Filter by language, topic, stars, and creation date. Returns stars, forks, issues, topics, and more.

Peters Bugs

GitHub Scraper - Repos, Stars, Issues & Profiles

cryptosignals/github-scraper

Scrape GitHub repositories, profiles, and issues — extract stars, forks, contributors, README, commit history, and topics. CSV/JSON output. No login.

Web Data Labs

GitHub Scraper

muscular_quadruplet/github-scraper

Scrape GitHub repositories, users, stars, forks. Extract trending repos, developer profiles, code stats. Monitor open source projects, find contributors, analyze tech stacks. No API limits.

Do It

5.0

GitHub Stars Scraper

lulzasaur/github-stars-scraper

Scrape GitHub repository data. Search by keyword or language, fetch specific repos. Extract star counts, forks, topics, licenses, and full repo metadata.

lulz bot