GitHub Repository Search & Scraper avatar

GitHub Repository Search & Scraper

Pricing

from $1.00 / 1,000 results

Go to Apify Store
GitHub Repository Search & Scraper

GitHub Repository Search & Scraper

Search GitHub repositories by keyword, language, topic, stars, and date. Clean structured JSON with stars, forks, license, topics, owner, and activity dates. Optional token for high rate limits.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

Nicolas van Arkens

Nicolas van Arkens

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

GitHub Repository Search & Scraper ๐Ÿ™

Search GitHub repositories at scale and get clean, structured JSON โ€” stars, forks, language, topics, license, owner, and activity dates. Powered by the official GitHub REST API, so the data is accurate and the actor is fast and reliable.

Filter by keyword, language, topic, minimum stars, owner, and date, then sort by stars, forks, or recent activity. Perfect for developer-tool market research, competitor and dependency tracking, finding trending or actively-maintained projects, lead generation, and building datasets for analysis or AI.

What you can do

  • ๐Ÿ”Ž Search repositories by free-text keywords across names, descriptions, and READMEs
  • ๐Ÿงฐ Filter by language, topic(s), minimum stars, owner/org, creation date, and last-push date
  • ๐Ÿ“ˆ Find actively-maintained projects with the "pushed after" filter, or new/trending ones with "created after"
  • ๐Ÿšซ Exclude forks and archived repos for a clean signal
  • โ†•๏ธ Sort by stars, forks, recently updated, or best-match relevance
  • ๐Ÿงน Clean output โ€” one tidy record per repo, ready for CSV/Excel/JSON or the API

Example use cases

  • Competitive & market research: list every Python web-scraping library over 50 stars, sorted by activity.
  • Dependency / ecosystem tracking: monitor all repos under an org and their last-push dates.
  • Find trending projects: repos created in the last 90 days, sorted by stars.
  • Lead generation: discover maintainers and organizations active in a topic.
  • Datasets for AI/analysis: pull thousands of repos with structured metadata.

Input

FieldDescription
Search queryFree-text keywords.
LanguageRestrict to a language (e.g. python).
Topic(s)One or more GitHub topics, comma-separated.
User / organizationRestrict to one owner.
Minimum starsStar threshold.
Pushed after / Created afterDate filters (YYYY-MM-DD).
Exclude forks / archivedClean up results.
Sort / OrderStars, forks, updated, or best-match.
Maximum repositoriesUp to 1000 per search (GitHub's cap).
GitHub token (optional)Strongly recommended โ€” raises rate limits massively. A token with no scopes (public data only) is enough.

Output

{
"fullName": "scrapy/scrapy",
"name": "scrapy",
"description": "Scrapy, a fast high-level web crawling & scraping framework for Python.",
"url": "https://github.com/scrapy/scrapy",
"homepage": "https://scrapy.org",
"owner": "scrapy",
"ownerType": "Organization",
"stars": 61942,
"forks": 11585,
"watchers": 61942,
"openIssues": 625,
"language": "Python",
"topics": ["crawler", "scraping", "web-scraping"],
"license": "BSD-3-Clause",
"isFork": false,
"isArchived": false,
"createdAt": "2010-02-22T02:01:14Z",
"updatedAt": "2026-05-28T13:32:56Z",
"pushedAt": "2026-05-20T08:27:24Z"
}

Export to JSON, CSV, or Excel, or pull via the Apify API. Connect to Google Sheets, Slack, Zapier, or Make.

About the GitHub token

The actor works without a token, but GitHub limits unauthenticated search to roughly 10 requests/minute. Adding a free personal access token (classic, no scopes needed for public data) raises this to ~30 search requests/minute and 5000 requests/hour โ€” recommended for anything beyond a quick test. Your token is sent only to GitHub and is never stored.

Notes

  • Accesses only publicly available GitHub data via the official REST API. Independent tool, not affiliated with GitHub.
  • GitHub's Search API caps any single query at 1000 returned results; narrow with filters or date windows for larger coverage.