Github Scraper | $2 / 1k | All In One

Pricing

$1.99 / 1,000 results

Scrape GitHub at real scale with no cap. Get the richest data on repos, issues, PRs, users and orgs including stars, forks, topics, tech stack, users, owners and more. Great for market intel, dev products, lead lists, talent scouting and big, clean datasets.

Rating

5.0 (1)

Developer

Fatih Tahta (Maintained by Community)

Actor stats

  • Bookmarked: 1
  • Total users: 3
  • Monthly active users: 2
  • Last modified: 3 days ago

Overview

GitHub.com hosts millions of repositories, issues, pull requests, discussions, users, and packages that reflect real development activity and technology adoption. Github Scraper | $2 / 1k | All In One captures this public data at scale so you can monitor projects, communities, and marketplaces without manual browsing. Configure searches or paste direct URLs, and the actor automates the collection and returns structured results on every run.

Why Use This Actor

  • Market and product research: Track trending repositories, forks, stars, and topics to understand technology momentum and competitor movement.
  • Lead and talent discovery: Surface active maintainers, organizations, and contributors for outreach or partnership research.
  • Directory and dataset building: Export structured lists of repos, issues, pull requests, discussions, users, commits, and marketplace listings for enrichment or analytics.
  • Operational efficiency: Schedule repeat runs, keep data fresh, and avoid repetitive manual checks on GitHub.

Input Parameters

| Parameter | Type | Description | Default |
| --- | --- | --- | --- |
| startUrls | array of strings | Fully formed GitHub URLs for search results, repositories, issues, pull requests, discussions, users/organizations, or marketplace listings. Each URL is crawled exactly as provided. | Prefilled example provided |
| queries | array of strings | Keyword queries turned into GitHub search URLs using your selected result type and filters. | |
| searchType | string | GitHub search vertical for query-based runs: repositories, issues, pull requests, discussions, users, commits, registry packages, wikis, topics, or marketplace listings. | repositories |
| language | string | Limit results to a specific programming language. | |
| owner | string | Restrict matches to a single user or organization namespace. | |
| followers | string | Filter users or organizations by follower counts using numbers, ranges, or comparisons. | |
| forks | string | Filter repositories by fork counts using numbers, ranges, or comparisons. | |
| stars | string | Target repositories by star counts using numbers, ranges, or comparisons. | |
| topic | string | Require repositories to include a specific topic tag. | |
| license | string | Filter repositories by license keyword or SPDX identifier. | |
| created | string | Filter repositories by creation date using exact dates, ranges, or comparisons. | |
| pushed | string | Filter repositories by most recent commit date using exact dates, ranges, or comparisons. | |
| size | string | Filter repositories by repository size in kilobytes using numbers, ranges, or comparisons. | |
| limit | integer | Maximum number of listings to save across all inputs. | 50000 |
| proxyConfiguration | object | Configure Apify proxy settings to distribute requests and keep networking stable. Residential proxies are preselected. | {"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]} |
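To illustrate how a query and its filters could become a GitHub search URL, here is a minimal sketch. It is not the actor's actual implementation: the function name `build_search_url` and the mapping from input names to GitHub search qualifiers (in particular `owner` to `user:`) are assumptions made for this example, and `search_type` is passed through to the `type` URL parameter unchanged.

```python
from urllib.parse import urlencode

# Assumed mapping from this actor's input names to GitHub search qualifiers.
# The actor's real mapping is not documented here; treat this as illustrative.
QUALIFIERS = {
    "language": "language",
    "owner": "user",
    "followers": "followers",
    "forks": "forks",
    "stars": "stars",
    "topic": "topic",
    "license": "license",
    "created": "created",
    "pushed": "pushed",
    "size": "size",
}


def build_search_url(query, search_type="repositories", **filters):
    """Combine a keyword query and filters into one GitHub search URL."""
    terms = [query]
    for name, value in filters.items():
        if name not in QUALIFIERS:
            raise ValueError(f"unknown filter: {name}")
        if value:  # skip empty filters
            terms.append(f"{QUALIFIERS[name]}:{value}")
    params = {"q": " ".join(terms), "type": search_type}
    return "https://github.com/search?" + urlencode(params)


url = build_search_url("apify scraping", stars=">100", language="JavaScript")
print(url)
```

A URL built this way could also be pasted into `startUrls`, since those are crawled exactly as provided.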

Example Input

{
  "queries": ["apify scraping"],
  "searchType": "repositories",
  "stars": ">100",
  "language": "JavaScript",
  "limit": 200
}
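The input above can also be assembled and submitted from code. The sketch below assumes the `apify-client` Python package (`pip install apify-client`) and its `ApifyClient.actor(...).call(...)` and `dataset(...).iterate_items()` methods. The helper names `build_run_input` and `run_actor` are hypothetical, and the actor ID and API token are placeholders you would take from the actor page and your Apify account.

```python
def build_run_input(queries, search_type="repositories", limit=200, **filters):
    """Assemble an input object using the parameter names from the table above."""
    if not queries:
        raise ValueError("provide at least one query")
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    run_input = {
        "queries": list(queries),
        "searchType": search_type,
        "limit": limit,
    }
    # Only include filters that were actually set.
    run_input.update({k: v for k, v in filters.items() if v is not None})
    return run_input


def run_actor(token, actor_id, run_input):
    """Start a run, wait for it to finish, and return the dataset items."""
    # Imported lazily so build_run_input works without the package installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


run_input = build_run_input(
    ["apify scraping"], stars=">100", language="JavaScript", limit=200
)
# items = run_actor("<YOUR_APIFY_TOKEN>", "<ACTOR_ID>", run_input)  # placeholders
```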

Example Output

Each dataset item represents one GitHub result with key descriptive fields.

{
  "archived": false,
  "brand": "vlang",
  "color": "#4f87c4",
  "description": "Simple, fast, safe, compiled language for developing maintainable software. Compiles itself in <1s with zero library dependencies. Suppor…",
  "followers": 37115,
  "good_first_issue_issues_count": 1,
  "has_funding_file": true,
  "has_issues": true,
  "help_wanted_issues_count": 0,
  "id": "169677297",
  "language": "V",
  "mirror": false,
  "owner_id": 46413578,
  "owner_login": "vlang",
  "owned_by_organization": true,
  "public": true,
  "repo_id": 169677297,
  "repo_name": "v",
  "sponsorable": false,
  "starred_by_current_user": false,
  "title": "vlang/ v",
  "topics": [
    "language",
    "programming-language",
    "compiler",
    "v"
  ],
  "type": "Public",
  "updated_at": "2025-12-10T14:41:12.039Z",
  "url": "https://github.com/vlang/v"
}
  • archived: Whether the repository is archived.
  • brand, color: Project branding details when available.
  • description: Repository summary from GitHub.
  • followers, stars, forks (when present): Popularity and engagement indicators.
  • owner_* and repo_* fields: Unique identifiers and ownership metadata.
  • topics: Tagged topics for the repository.
  • updated_at: ISO timestamp of the latest observed update.
  • url: Direct link to the GitHub page captured.
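As a rough illustration of working with these fields, the sketch below filters and ranks repository items shaped like the example output. The helper names `parse_updated_at` and `summarize` are hypothetical, and the code assumes each item carries `owner_login`, `repo_name`, and `updated_at`; fields described as "when present" are read with defaults.

```python
from datetime import datetime


def parse_updated_at(item):
    """Parse the ISO timestamp; the trailing "Z" is rewritten for older Pythons."""
    return datetime.fromisoformat(item["updated_at"].replace("Z", "+00:00"))


def summarize(items, min_followers=0):
    """Drop archived repositories and rank the rest by follower count."""
    rows = []
    for item in items:
        if item.get("archived"):
            continue
        followers = item.get("followers", 0)
        if followers < min_followers:
            continue
        rows.append({
            "full_name": f'{item["owner_login"]}/{item["repo_name"]}',
            "followers": followers,
            "topics": item.get("topics", []),
            "updated_at": parse_updated_at(item),
            "url": item.get("url"),
        })
    rows.sort(key=lambda row: row["followers"], reverse=True)
    return rows


# A trimmed copy of the example item shown above.
sample = {
    "archived": False,
    "followers": 37115,
    "owner_login": "vlang",
    "repo_name": "v",
    "topics": ["language", "programming-language", "compiler", "v"],
    "updated_at": "2025-12-10T14:41:12.039Z",
    "url": "https://github.com/vlang/v",
}
for row in summarize([sample], min_followers=1000):
    print(row["full_name"], row["followers"], row["updated_at"].date())
```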

Notes & Limitations

  • Use this actor responsibly and only for lawful purposes. Review and respect GitHub’s terms of service and any applicable policies before collecting or using data.
  • Public data may include personal information; ensure you have a legal basis to process it in your jurisdiction.
  • Start with moderate limits when testing new queries or URLs to keep runs efficient.
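When choosing a limit, a quick cost estimate can help. The sketch below uses the listed price of $1.99 per 1,000 results and assumes billing scales linearly with the number of saved results, with no minimum charge or extra platform fees; that billing model and the `estimate_cost` helper are assumptions, not something this page confirms.

```python
from decimal import Decimal

PRICE_PER_1000 = Decimal("1.99")  # listed price in USD per 1,000 results


def estimate_cost(results):
    """Estimate the result charge in USD, rounded to cents (linear pricing assumed)."""
    if results < 0:
        raise ValueError("results must be non-negative")
    return (PRICE_PER_1000 * results / 1000).quantize(Decimal("0.01"))


for limit in (200, 5000, 50000):
    print(limit, estimate_cost(limit))
```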

Support

Questions or custom needs? Open an issue on the Issues tab of the actor page in Apify Console. Issues are handled around the clock.

Happy scraping,

Fatih