GitHub Repository Search & Scraper
Pricing
from $1.00 / 1,000 results
Search GitHub repositories by keyword, language, topic, stars, and date. Returns clean, structured JSON with stars, forks, license, topics, owner, and activity dates. An optional token raises the rate limits.
Developer
Nicolas van Arkens
Maintained by Community
GitHub Repository Search & Scraper
Search GitHub repositories at scale and get clean, structured JSON: stars, forks, language, topics, license, owner, and activity dates. Powered by the official GitHub REST API, so the data is accurate and the actor is fast and reliable.
Filter by keyword, language, topic, minimum stars, owner, and date, then sort by stars, forks, or recent activity. Perfect for developer-tool market research, competitor and dependency tracking, finding trending or actively-maintained projects, lead generation, and building datasets for analysis or AI.
What you can do
- Search repositories by free-text keywords across names, descriptions, and READMEs
- Filter by language, topic(s), minimum stars, owner/org, creation date, and last-push date
- Find actively-maintained projects with the "pushed after" filter, or new/trending ones with "created after"
- Exclude forks and archived repos for a clean signal
- Sort by stars, forks, recently updated, or best-match relevance
- Clean output: one tidy record per repo, ready for CSV/Excel/JSON or the API
Example use cases
- Competitive & market research: list every Python web-scraping library over 50 stars, sorted by activity.
- Dependency / ecosystem tracking: monitor all repos under an org and their last-push dates.
- Find trending projects: repos created in the last 90 days, sorted by stars.
- Lead generation: discover maintainers and organizations active in a topic.
- Datasets for AI/analysis: pull thousands of repos with structured metadata.
Input
| Field | Description |
|---|---|
| Search query | Free-text keywords. |
| Language | Restrict to a language (e.g. python). |
| Topic(s) | One or more GitHub topics, comma-separated. |
| User / organization | Restrict to one owner. |
| Minimum stars | Star threshold. |
| Pushed after / Created after | Date filters (YYYY-MM-DD). |
| Exclude forks / archived | Clean up results. |
| Sort / Order | Stars, forks, updated, or best-match. |
| Maximum repositories | Up to 1000 per search (GitHub's cap). |
| GitHub token (optional) | Strongly recommended; raises the search rate limit from roughly 10 to about 30 requests/minute. A token with no scopes (public data only) is enough. |
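
As a rough illustration of how filters like these could translate into a GitHub search query string, here is a minimal sketch. The function name `build_query` and its parameter names are hypothetical and are not the actor's actual input field names. The qualifiers (`language:`, `topic:`, `user:`, `stars:`, `pushed:`, `created:`, `archived:`) follow GitHub's public search syntax, but how the actor builds its query internally is an assumption.

```python
from typing import List, Optional, Sequence


def build_query(
    keywords: str = "",
    language: Optional[str] = None,
    topics: Sequence[str] = (),
    owner: Optional[str] = None,
    min_stars: Optional[int] = None,
    pushed_after: Optional[str] = None,   # YYYY-MM-DD
    created_after: Optional[str] = None,  # YYYY-MM-DD
    exclude_archived: bool = False,
) -> str:
    """Assemble a GitHub repository search string from optional filters.

    Hypothetical helper: parameter names are illustrative only.
    """
    parts: List[str] = []
    text = keywords.strip()
    if text:
        parts.append(text)
    if language:
        parts.append(f"language:{language}")
    for topic in topics:
        parts.append(f"topic:{topic}")
    if owner:
        parts.append(f"user:{owner}")
    if min_stars is not None:
        parts.append(f"stars:>={min_stars}")
    if pushed_after:
        parts.append(f"pushed:>{pushed_after}")
    if created_after:
        parts.append(f"created:>{created_after}")
    if exclude_archived:
        parts.append("archived:false")
    # Forks need no qualifier here: to my understanding, GitHub repository
    # search leaves forks out unless the query asks for them.
    return " ".join(parts)


query = build_query(
    "web scraping",
    language="python",
    min_stars=50,
    pushed_after="2025-01-01",
    exclude_archived=True,
)
print(query)
```

The example mirrors the "Python web-scraping library over 50 stars" use case; the date is an arbitrary placeholder.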
Output
    {
      "fullName": "scrapy/scrapy",
      "name": "scrapy",
      "description": "Scrapy, a fast high-level web crawling & scraping framework for Python.",
      "url": "https://github.com/scrapy/scrapy",
      "homepage": "https://scrapy.org",
      "owner": "scrapy",
      "ownerType": "Organization",
      "stars": 61942,
      "forks": 11585,
      "watchers": 61942,
      "openIssues": 625,
      "language": "Python",
      "topics": ["crawler", "scraping", "web-scraping"],
      "license": "BSD-3-Clause",
      "isFork": false,
      "isArchived": false,
      "createdAt": "2010-02-22T02:01:14Z",
      "updatedAt": "2026-05-28T13:32:56Z",
      "pushedAt": "2026-05-20T08:27:24Z"
    }
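
Once records like the one above are downloaded, the activity dates can be turned into a simple "how stale is this repo" signal. The sketch below is illustrative only: the helper names `parse_github_time`, `days_since_push`, and `is_active` are hypothetical, and the 30-day threshold is an arbitrary assumption. Only the `pushedAt` field name and timestamp format come from the sample record.

```python
from datetime import datetime


def parse_github_time(value: str) -> datetime:
    """Parse an ISO 8601 timestamp such as 2026-05-20T08:27:24Z."""
    # Replace the trailing "Z" so this also works on Python versions
    # whose fromisoformat() does not accept it.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def days_since_push(record: dict, now: datetime) -> int:
    """Whole days between the record's last push and `now`."""
    return (now - parse_github_time(record["pushedAt"])).days


def is_active(record: dict, now: datetime, max_days: int = 30) -> bool:
    """Hypothetical activity check: pushed within `max_days` days."""
    return days_since_push(record, now) <= max_days


# Subset of the sample record shown above.
record = {
    "fullName": "scrapy/scrapy",
    "pushedAt": "2026-05-20T08:27:24Z",
    "updatedAt": "2026-05-28T13:32:56Z",
}

# Use the record's own updatedAt as the reference time so the
# example is deterministic.
reference = parse_github_time(record["updatedAt"])
print(days_since_push(record, reference), is_active(record, reference))
```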
Export to JSON, CSV, or Excel, or pull via the Apify API. Connect to Google Sheets, Slack, Zapier, or Make.
About the GitHub token
The actor works without a token, but GitHub limits unauthenticated search to roughly 10 requests/minute. Adding a free personal access token (classic, no scopes needed for public data) raises this to ~30 search requests/minute, alongside 5,000 requests/hour for the rest of the REST API. A token is recommended for anything beyond a quick test. Your token is sent only to GitHub and is never stored.
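
To make the rate-limit numbers concrete, here is a small back-of-the-envelope sketch. It assumes the limits quoted above (10 and 30 search requests/minute) and GitHub's maximum page size of 100 results per request. The function names are hypothetical, and the actor's real pacing and paging strategy may differ.

```python
import math
from typing import Dict, Optional

# Search limits quoted in this README (requests per minute).
SEARCH_LIMIT_PER_MINUTE = {"anonymous": 10, "token": 30}
GITHUB_SEARCH_CAP = 1000   # max results GitHub returns per query
MAX_PER_PAGE = 100         # max page size on the GitHub REST API


def min_interval_seconds(has_token: bool) -> float:
    """Smallest average delay between search requests that stays in limit."""
    key = "token" if has_token else "anonymous"
    return 60.0 / SEARCH_LIMIT_PER_MINUTE[key]


def requests_needed(max_repos: int, per_page: int = MAX_PER_PAGE) -> int:
    """Number of paged requests needed for one query."""
    capped = min(max_repos, GITHUB_SEARCH_CAP)
    return math.ceil(capped / per_page)


def auth_headers(token: Optional[str]) -> Dict[str, str]:
    """Request headers, with the token attached only when provided."""
    headers = {"Accept": "application/vnd.github+json"}
    if token:
        headers["Authorization"] = f"Bearer {token}"
    return headers


pages = requests_needed(1000)
print(pages, pages * min_interval_seconds(False), pages * min_interval_seconds(True))
```

Under these assumptions a full 1000-result query takes 10 requests, so a single query fits inside one minute of quota either way; the token matters most when running many queries back to back.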
Notes
- Accesses only publicly available GitHub data via the official REST API. Independent tool, not affiliated with GitHub.
- GitHub's Search API caps any single query at 1000 returned results; narrow with filters or date windows for larger coverage.
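
One way to apply the date-window suggestion is to split a long period into consecutive, non-overlapping ranges and run one search per range. The sketch below only generates the ranges; `date_windows` and `created_qualifiers` are hypothetical helper names, the 30-day window size is arbitrary, and the `created:START..END` range syntax is GitHub's search qualifier rather than an actor input. With the actor itself, which exposes only "created after" style filters according to the input table, windows would have to be approximated by separate runs and de-duplicated afterwards.

```python
from datetime import date, timedelta
from typing import Iterator, List, Tuple


def date_windows(start: date, end: date, days: int) -> Iterator[Tuple[date, date]]:
    """Yield inclusive, non-overlapping (start, end) windows covering the range."""
    if days < 1:
        raise ValueError("days must be at least 1")
    cursor = start
    while cursor <= end:
        # Each window spans `days` calendar days, clipped to the overall end.
        window_end = min(cursor + timedelta(days=days - 1), end)
        yield cursor, window_end
        cursor = window_end + timedelta(days=1)


def created_qualifiers(start: date, end: date, days: int) -> List[str]:
    """Format each window as a GitHub `created:` range qualifier."""
    return [
        f"created:{a.isoformat()}..{b.isoformat()}"
        for a, b in date_windows(start, end, days)
    ]


for qualifier in created_qualifiers(date(2025, 1, 1), date(2025, 3, 31), 30):
    print(qualifier)
```

If any single window still hits the 1000-result cap, the window can be shortened or combined with further filters such as language or minimum stars.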