Github Scraper | $2 / 1k | All In One
Pricing
$1.99 / 1,000 results
Github Scraper | $2 / 1k | All In One
Scrape GitHub at real scale with no cap. Get the richest data on repos, issues, PRs, users and orgs including stars, forks, topics, tech stack, users, owners and more. Great for market intel, dev products, lead lists, talent scouting and big, clean datasets.
Pricing
$1.99 / 1,000 results
Rating
5.0
(1)
Developer

Fatih Tahta
Actor stats
1
Bookmarked
3
Total users
2
Monthly active users
3 days ago
Last modified
Categories
Share
Overview
Github.com hosts millions of repositories, issues, pull requests, discussions, users, and packages that reflect real development activity and technology adoption. Github Scraper | $2 / 1k | All In One captures this public data at scale so you can monitor projects, communities, and marketplaces without manual browsing. Configure searches or paste direct URLs, and the actor automates collection with consistent, reliable results to save time on every run.
Why Use This Actor
- Market and product research: Track trending repositories, forks, stars, and topics to understand technology momentum and competitor movement.
- Lead and talent discovery: Surface active maintainers, organizations, and contributors for outreach or partnership research.
- Directory and dataset building: Export structured lists of repos, issues, pull requests, discussions, users, commits, and marketplace listings for enrichment or analytics.
- Operational efficiency: Schedule repeat runs, keep data fresh, and avoid repetitive manual checks on Github.
Input Parameters
| Parameter | Type | Description | Default |
|---|---|---|---|
startUrls | array of strings | Fully formed Github URLs for search results, repositories, issues, pull requests, discussions, users/organizations, or marketplace listings. Each URL is crawled exactly as provided. | Prefilled example provided |
queries | array of strings | Keyword queries turned into Github search URLs using your selected result type and filters. | — |
searchType | string | Github search vertical for query-based runs: repositories, issues, pull requests, discussions, users, commits, registry packages, wikis, topics, or marketplace listings. | repositories |
language | string | Limit results to a specific programming language. | — |
owner | string | Restrict matches to a single user or organization namespace. | — |
followers | string | Filter users or organizations by follower counts using numbers, ranges, or comparisons. | — |
forks | string | Filter repositories by fork counts using numbers, ranges, or comparisons. | — |
stars | string | Target repositories by star counts using numbers, ranges, or comparisons. | — |
topic | string | Require repositories to include a specific topic tag. | — |
license | string | Filter repositories by license keyword or SPDX identifier. | — |
created | string | Filter repositories by creation date using exact dates, ranges, or comparisons. | — |
pushed | string | Filter repositories by most recent commit date using exact dates, ranges, or comparisons. | — |
size | string | Filter repositories by repository size in kilobytes using numbers, ranges, or comparisons. | — |
limit | integer | Maximum number of listings to save across all inputs. | 50000 |
proxyConfiguration | object | Configure Apify proxy settings to distribute requests and keep networking stable. Residential proxies are preselected. | {"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]} |
Example Input
{"queries": ["apify scraping"],"searchType": "repositories","stars": ">100","language": "JavaScript","limit": 200}
Example Output
Each dataset item represents one Github result with key descriptive fields.
{"archived": false,"brand": "vlang","color": "#4f87c4","description": "Simple, fast, safe, compiled language for developing maintainable software. Compiles itself in <1s with zero library dependencies. Suppor…","followers": 37115,"good_first_issue_issues_count": 1,"has_funding_file": true,"has_issues": true,"help_wanted_issues_count": 0,"id": "169677297","language": "V","mirror": false,"owner_id": 46413578,"owner_login": "vlang","owned_by_organization": true,"public": true,"repo_id": 169677297,"repo_name": "v","sponsorable": false,"starred_by_current_user": false,"title": "vlang/ v","topics": ["language","programming-language","compiler","v"],"type": "Public","updated_at": "2025-12-10T14:41:12.039Z","url": "https://github.com/vlang/v"}
archived— Whether the repository is archived.brand,color— Project branding details when available.description— Repository summary from Github.followers,stars,forks(when present) — Popularity and engagement indicators.owner_*andrepo_*fields — Unique identifiers and ownership metadata.topics— Tagged topics for the repository.updated_at— ISO timestamp of the latest observed update.url— Direct link to the Github page captured.
Notes & Limitations
- Use this actor responsibly and only for lawful purposes. Review and respect Github’s terms of service and any applicable policies before collecting or using data.
- Public data may include personal information; ensure you have a legal basis to process it in your jurisdiction.
- Start with moderate limits when testing new queries or URLs to keep runs efficient.
Support
Questions or custom needs? Open an issue on the Issues tab of the actor page in Apify Console and it will be resolved around the clock.
Happy Scraping,
- Fatih
