OSS Insight Scraper avatar

OSS Insight Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
OSS Insight Scraper

OSS Insight Scraper

Scrape OSS Insight - open-source intelligence on 5M+ GitHub repos. Get trending repos by language, top projects by collection (databases, AI, web frameworks, CI/CD, and more), stars, forks, contributors, and growth metrics.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

Crawler Bros

Crawler Bros

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

Scrape OSS Insight — open-source intelligence on 5+ million GitHub repositories. Discover trending repos, top-ranked open-source projects by category (databases, AI, web frameworks, CI/CD, programming languages, and more), with stars, forks, contributor lists, and growth metrics — directly from the public OSS Insight data API.

Built for product managers, founders, investors, developer-tooling teams, and OSS researchers who need fresh, structured, queryable intelligence on the open-source landscape.

What you can do

  • Find what is trending right now on GitHub by language and time period
  • Discover top projects in a category — open-source databases, AI tools, headless CMSs, web frameworks, etc.
  • Track week-over-week growth of repos within any collection
  • Enrich your dataset of dev tools with descriptions, contributor lists, and language tags

Modes

ModeDescription
trendingTop trending repos across all languages for the chosen time period
byLanguageTop trending repos for a specific programming language
byCollectionTop-ranked repos inside a curated collection (databases, AI, web frameworks, ...)
listCollectionsList every available collection with its ID and name

Example inputs

{
"mode": "trending",
"period": "past_24_hours",
"maxItems": 20
}
{
"mode": "byLanguage",
"language": "Python",
"period": "past_week",
"maxItems": 30
}

Top AI repos by stars gained (past month)

{
"mode": "byCollection",
"collectionId": 10010,
"rankBy": "stars",
"period": "past_month",
"maxItems": 50
}

Discover every collection

{
"mode": "listCollections",
"maxItems": 200
}

Output fields

FieldDescription
repoIdGitHub numeric repo ID
repoNameowner/name
ownerRepo owner
repoShortNameRepo short name
repoUrlDirect GitHub URL
ossInsightUrlOSS Insight analyze URL
primaryLanguageMain programming language
descriptionRepository description
starsStars gained in the period
forksForks gained in the period
pullRequestsPull requests in the period
pushesPush events in the period
trendingScoreComposite trending score
topContributorsTop contributor logins
collectionsCollection names this repo belongs to
periodTime period queried
languageFilterLanguage filter applied (if any)
scrapedAtUTC ISO timestamp

Collection-ranking records (byCollection)

FieldDescription
repoId, repoName, owner, repoUrl, ossInsightUrlRepo identifiers
totalStarsTotal all-time stars
currentPeriodGrowthStars/forks/PRs gained in the current period
currentPeriodRankRank within the collection this period
pastPeriodGrowth, pastPeriodRankSame metrics for the previous period
growthPercentChangeGrowth change vs. previous period (%)
rankChangeRank change vs. previous period
collectionId, collectionNameCollection identifiers
rankByRanking metric used (stars, forks, pull_requests)

Collection records (listCollections)

FieldDescription
collectionIdNumeric collection ID
collectionNameHuman-readable name
ossInsightUrlDirect OSS Insight collection URL

Empty fields are omitted from output records.

FAQ

Does this scraper use cookies or API keys? No. OSS Insight serves an unauthenticated public REST API.

Will the actor work on the Apify free plan? Yes. No proxy is required — the API is reachable from any datacenter IP.

How fresh is the data? OSS Insight refreshes its underlying GitHub event data continuously; trending endpoints typically reflect activity within the last hour.

Can I filter by category? Yes. Run mode=listCollections once to enumerate every collection ID, then pass that ID with mode=byCollection. Common IDs include 2 (Open Source Database), 10010 (Artificial Intelligence), 10004 (Web Framework), 10024 (Programming Language).

What about deleted or private repos? OSS Insight only indexes public GitHub repositories; deleted repos drop out automatically.

Can I use this for competitive intelligence on dev-tools companies? Yes — pairing byCollection with a period window lets you see week-over-week or month-over-month growth, which is a strong leading indicator of category momentum.

Source