OSS Insight Scraper
Pricing
from $3.00 / 1,000 results
OSS Insight Scraper
Scrape OSS Insight - open-source intelligence on 5M+ GitHub repos. Get trending repos by language, top projects by collection (databases, AI, web frameworks, CI/CD, and more), stars, forks, contributors, and growth metrics.
Pricing
from $3.00 / 1,000 results
Rating
0.0
(0)
Developer
Crawler Bros
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
3 days ago
Last modified
Categories
Share
Scrape OSS Insight — open-source intelligence on 5+ million GitHub repositories. Discover trending repos, top-ranked open-source projects by category (databases, AI, web frameworks, CI/CD, programming languages, and more), with stars, forks, contributor lists, and growth metrics — directly from the public OSS Insight data API.
Built for product managers, founders, investors, developer-tooling teams, and OSS researchers who need fresh, structured, queryable intelligence on the open-source landscape.
What you can do
- Find what is trending right now on GitHub by language and time period
- Discover top projects in a category — open-source databases, AI tools, headless CMSs, web frameworks, etc.
- Track week-over-week growth of repos within any collection
- Enrich your dataset of dev tools with descriptions, contributor lists, and language tags
Modes
| Mode | Description |
|---|---|
trending | Top trending repos across all languages for the chosen time period |
byLanguage | Top trending repos for a specific programming language |
byCollection | Top-ranked repos inside a curated collection (databases, AI, web frameworks, ...) |
listCollections | List every available collection with its ID and name |
Example inputs
Trending repos (past 24 hours)
{"mode": "trending","period": "past_24_hours","maxItems": 20}
Python trending repos (past week)
{"mode": "byLanguage","language": "Python","period": "past_week","maxItems": 30}
Top AI repos by stars gained (past month)
{"mode": "byCollection","collectionId": 10010,"rankBy": "stars","period": "past_month","maxItems": 50}
Discover every collection
{"mode": "listCollections","maxItems": 200}
Output fields
Repo records (trending, byLanguage)
| Field | Description |
|---|---|
repoId | GitHub numeric repo ID |
repoName | owner/name |
owner | Repo owner |
repoShortName | Repo short name |
repoUrl | Direct GitHub URL |
ossInsightUrl | OSS Insight analyze URL |
primaryLanguage | Main programming language |
description | Repository description |
stars | Stars gained in the period |
forks | Forks gained in the period |
pullRequests | Pull requests in the period |
pushes | Push events in the period |
trendingScore | Composite trending score |
topContributors | Top contributor logins |
collections | Collection names this repo belongs to |
period | Time period queried |
languageFilter | Language filter applied (if any) |
scrapedAt | UTC ISO timestamp |
Collection-ranking records (byCollection)
| Field | Description |
|---|---|
repoId, repoName, owner, repoUrl, ossInsightUrl | Repo identifiers |
totalStars | Total all-time stars |
currentPeriodGrowth | Stars/forks/PRs gained in the current period |
currentPeriodRank | Rank within the collection this period |
pastPeriodGrowth, pastPeriodRank | Same metrics for the previous period |
growthPercentChange | Growth change vs. previous period (%) |
rankChange | Rank change vs. previous period |
collectionId, collectionName | Collection identifiers |
rankBy | Ranking metric used (stars, forks, pull_requests) |
Collection records (listCollections)
| Field | Description |
|---|---|
collectionId | Numeric collection ID |
collectionName | Human-readable name |
ossInsightUrl | Direct OSS Insight collection URL |
Empty fields are omitted from output records.
FAQ
Does this scraper use cookies or API keys? No. OSS Insight serves an unauthenticated public REST API.
Will the actor work on the Apify free plan? Yes. No proxy is required — the API is reachable from any datacenter IP.
How fresh is the data? OSS Insight refreshes its underlying GitHub event data continuously; trending endpoints typically reflect activity within the last hour.
Can I filter by category?
Yes. Run mode=listCollections once to enumerate every collection ID, then pass that ID with mode=byCollection. Common IDs include 2 (Open Source Database), 10010 (Artificial Intelligence), 10004 (Web Framework), 10024 (Programming Language).
What about deleted or private repos? OSS Insight only indexes public GitHub repositories; deleted repos drop out automatically.
Can I use this for competitive intelligence on dev-tools companies?
Yes — pairing byCollection with a period window lets you see week-over-week or month-over-month growth, which is a strong leading indicator of category momentum.
Source
- OSS Insight: https://ossinsight.io
- Public data API: https://api.ossinsight.io/v1/