Github Repo Scraper

Pricing: Pay per usage
Rating: 0.0 (0)
Developer: Donny Nguyen (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 2 days ago

GitHub Repository Scraper

What does it do?

GitHub Repository Scraper is an Apify actor that searches GitHub for repositories by keyword or topic and extracts comprehensive repository data. It collects repository names, owners, star counts, fork counts, programming languages, descriptions, last update dates, and topic tags. The scraper navigates through GitHub's search results and individual repository pages to gather detailed information.

This actor is built for developers, tech recruiters, open-source researchers, and anyone analyzing the GitHub ecosystem. It automates the process of discovering and cataloging repositories across any technology domain.

Why use this scraper?

GitHub hosts over 300 million repositories, making it the largest source code hosting platform in the world. Finding and comparing repositories manually across multiple topics is extremely time-consuming. This scraper provides automated discovery and data collection, enabling trend analysis, technology landscape mapping, competitive intelligence, and developer ecosystem research.

Whether you are tracking emerging frameworks, identifying popular libraries in a specific domain, or building a curated list of tools for your organization, this scraper delivers structured data ready for immediate analysis.

How to use it

  1. Navigate to the GitHub Repository Scraper page on Apify Store.
  2. Click Try for free to open the actor configuration.
  3. Enter your search queries (keywords or topics) and set the maximum results.
  4. Click Start to begin scraping.
  5. Download the extracted data in JSON, CSV, Excel, or other formats from the Dataset tab.

You can automate runs using the Apify API or integrate with external tools via webhooks.
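As a minimal sketch of API automation, a run can be started through Apify's generic actor-run endpoint (`POST /v2/acts/{actorId}/runs`). The token and actor ID below are placeholders, and the input fields mirror the configuration table in the next section:

```python
import json
from urllib import request

APIFY_TOKEN = "YOUR_APIFY_TOKEN"           # placeholder: your Apify API token
ACTOR_ID = "username~github-repo-scraper"  # placeholder: the actor's ID on Apify

def build_run_request(actor_id: str, token: str, run_input: dict):
    """Build the URL and JSON payload for starting an actor run
    via Apify's run endpoint (POST /v2/acts/{actorId}/runs)."""
    url = f"https://api.apify.com/v2/acts/{actor_id}/runs?token={token}"
    payload = json.dumps(run_input).encode("utf-8")
    return url, payload

url, payload = build_run_request(ACTOR_ID, APIFY_TOKEN, {
    "queries": ["web scraping"],
    "maxResults": 100,
})

# To actually start the run, send the payload (requires a valid token):
# req = request.Request(url, data=payload,
#                       headers={"Content-Type": "application/json"})
# with request.urlopen(req) as resp:
#     run = json.load(resp)
print(url)
```

The same endpoint is what Apify Schedules and webhook integrations drive under the hood, so this request shape carries over to automated pipelines.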

Input configuration

Field              | Type    | Description                                | Default
queries            | Array   | List of search keywords or topics          | ["web scraping"]
maxResults         | Integer | Maximum repositories to collect per query  | 500
proxyConfiguration | Object  | Proxy settings for the scraper             | Apify Proxy
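A complete input object combining the fields above might look like this (values are illustrative; `useApifyProxy` is the standard Apify proxy-configuration flag):

```json
{
  "queries": ["web scraping", "data extraction"],
  "maxResults": 200,
  "proxyConfiguration": { "useApifyProxy": true }
}
```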

Output data

Each repository entry in the dataset contains:

{
  "name": "scrapy",
  "owner": "scrapy",
  "fullName": "scrapy/scrapy",
  "description": "Scrapy, a fast high-level web crawling & scraping framework for Python.",
  "stars": 52000,
  "forks": 10500,
  "language": "Python",
  "lastUpdated": "2026-02-15T10:30:00Z",
  "topics": ["web-scraping", "python", "crawler"],
  "url": "https://github.com/scrapy/scrapy",
  "query": "web scraping",
  "scrapedAt": "2026-02-18T12:00:00.000Z"
}

Cost of usage

This actor uses Pay-Per-Event pricing at $0.0003 per result delivered. A small fee is also charged per actor start. Scraping 500 repositories for a single query costs approximately $0.15. The lightweight Cheerio-based approach keeps resource usage minimal.
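The per-result arithmetic can be sketched as follows. The $0.0003 rate comes from the pricing above; the exact actor-start fee is not stated here, so it is a placeholder defaulting to zero:

```python
PER_RESULT_USD = 0.0003  # Pay-Per-Event price per delivered result (from the pricing above)

def estimate_cost(n_results: int, start_fee_usd: float = 0.0) -> float:
    """Estimate a run's cost: per-result events plus the actor-start fee.
    The start fee amount is unspecified, so it defaults to 0."""
    return n_results * PER_RESULT_USD + start_fee_usd

print(estimate_cost(500))  # 500 results comes to about $0.15, matching the example above
```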

Monitor your spending on the Apify billing page. Apify Proxy usage is included in the per-event pricing.

Tips and tricks

  • Broad vs. specific queries: Use broad terms like "machine learning" for ecosystem overviews, or specific terms like "transformer NLP pytorch" for targeted results.
  • Track trends: Schedule regular runs with Apify Schedules to monitor star growth and new repositories over time.
  • Sort by popularity: The scraper returns results in GitHub's default relevance order. Post-process the dataset to sort by stars or forks.
  • Multiple topics: Add several queries in one run to efficiently compare different technology ecosystems.
  • Data pipelines: Export data via the Apify API or use integrations with Google Sheets, Airtable, or databases for automated reporting.
  • Combine with other data: Pair repository data with contributor information or issue tracking for deeper analysis.
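As a sketch of the post-processing mentioned above, downloaded dataset items (in the output format shown earlier) can be re-sorted by star count after export. The smaller repositories here are hypothetical sample records:

```python
# Sort dataset items by star count, descending.
# Records mirror the output format shown earlier; values are illustrative.
items = [
    {"fullName": "scrapy/scrapy", "stars": 52000, "forks": 10500},
    {"fullName": "example/tiny-crawler", "stars": 120, "forks": 8},    # hypothetical repo
    {"fullName": "example/mid-scraper", "stars": 4300, "forks": 900},  # hypothetical repo
]

by_stars = sorted(items, key=lambda r: r["stars"], reverse=True)
print([r["fullName"] for r in by_stars])
# → ['scrapy/scrapy', 'example/mid-scraper', 'example/tiny-crawler']
```

The same pattern works for sorting by forks or filtering by language before loading the data into a spreadsheet or database.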

Built by consummate_mandala with Crawlee and Apify SDK.