GitHub Marketplace Scraper
Pricing
$5.00/month + usage
Scrape 1,000+ app listings from GitHub Marketplace. Get developer contacts, pricing plans, install counts, categories, and full descriptions.
Developer
BowTiedRaccoon
Scrape app listings from GitHub Marketplace — the central directory for GitHub Apps and integrations. Extracts developer names, support emails, pricing plans, install counts, categories, and full descriptions for 1,000+ apps.
GitHub Marketplace Scraper Features
- Two modes: discover apps via search pagination or supply direct app URLs
- Extracts developer contact info including support emails and privacy policy links
- Collects pricing plans with plan names and prices for each app
- Returns install counts so you can gauge adoption at a glance
- Handles pagination automatically across the full marketplace catalog
- Runs on 256 MB of memory with no proxies required; GitHub Marketplace does not aggressively block or rate-limit this traffic
Who Uses GitHub Marketplace Data?
- Sales teams — Build prospecting lists of GitHub App developers with direct support emails and company names
- Market researchers — Analyze the GitHub integrations ecosystem: pricing models, category distribution, install trends
- Competitive intelligence — Monitor what tools developers actually install and how competitors price their GitHub Apps
- Developer advocates — Identify popular integrations in specific categories to prioritize partnership outreach
- Dataset builders — Compile structured catalogs of developer tools for directories, comparison sites, or internal tooling indexes
How the GitHub Marketplace Scraper Works
- Search discovery — Paginates through the GitHub Marketplace listing pages, extracting app slugs from the embedded JSON data on each page
- Detail extraction — Visits each app's detail page and parses the embedded JSON payload for developer info, pricing, install counts, and descriptions
- Direct URL mode — If you supply specific app URLs, the scraper skips search entirely and goes straight to detail extraction
The marketplace serves app data in embedded <script> tags as structured JSON, so the scraper uses a lightweight Cheerio-based crawler (no browser rendering needed). Concurrency is capped at 5 requests to stay well within GitHub's rate limits.
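The embedded-JSON approach described above can be sketched in a few lines. This is an illustration only, written in Python with the standard library rather than Cheerio. It assumes the payload sits in a `<script type="application/json">` tag; the `extract_embedded_json` helper, the sample HTML, and the `listing` key are hypothetical and do not reflect GitHub's actual markup.

```python
import json
from html.parser import HTMLParser


class EmbeddedJsonParser(HTMLParser):
    """Collects the parsed contents of <script type="application/json"> tags."""

    def __init__(self):
        super().__init__()
        self._in_json_script = False
        self._buffer = []
        self.payloads = []

    def handle_starttag(self, tag, attrs):
        # Assumption: the payload lives in a script tag typed application/json.
        if tag == "script" and dict(attrs).get("type") == "application/json":
            self._in_json_script = True
            self._buffer = []

    def handle_data(self, data):
        # Script bodies may arrive in several chunks, so buffer them.
        if self._in_json_script:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_script:
            self._in_json_script = False
            try:
                self.payloads.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip script tags that do not hold valid JSON


def extract_embedded_json(html):
    """Return every JSON payload found in matching script tags."""
    parser = EmbeddedJsonParser()
    parser.feed(html)
    parser.close()
    return parser.payloads


# Made-up sample page; the real payload keys are not documented here.
SAMPLE_HTML = """
<html><body>
<script type="application/json">{"listing": {"name": "Codecov", "installs": 45200}}</script>
<script>console.log("not json");</script>
</body></html>
"""

payloads = extract_embedded_json(SAMPLE_HTML)
print(payloads[0]["listing"]["name"])
```

Because the data is already structured, no DOM rendering or JavaScript execution is needed, which is why a plain HTML parser is enough.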
Input
{
  "searchQuery": "",
  "appsUrls": [],
  "maxItems": 20
}
| Field | Type | Default | Description |
|---|---|---|---|
| searchQuery | String | "" | Search query to filter apps. Leave empty to crawl the full marketplace. |
| appsUrls | Array | [] | Direct URLs to specific app pages. When provided, search discovery is skipped entirely. |
| maxItems | Integer | 20 | Maximum number of apps to scrape. Set to 0 to scrape all available apps. |
| proxyConfiguration | Object | { useApifyProxy: false } | Proxy settings. Not required; GitHub Marketplace works without proxies. |
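To make the defaults in the table concrete, here is a minimal sketch of how an input object could be normalized before a run. It is not the actor's actual code: the `normalize_input` function and the derived `limit` key are hypothetical, and the only facts taken from the table are the field names, their defaults, and the rule that `maxItems` of 0 means no cap.

```python
import copy

# Defaults copied from the input table above.
DEFAULTS = {
    "searchQuery": "",
    "appsUrls": [],
    "maxItems": 20,
    "proxyConfiguration": {"useApifyProxy": False},
}


def normalize_input(raw):
    """Fill in documented defaults and turn maxItems=0 into 'no limit' (None)."""
    merged = copy.deepcopy(DEFAULTS)
    merged.update(raw)

    max_items = merged["maxItems"]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_items, bool) or not isinstance(max_items, int) or max_items < 0:
        raise ValueError("maxItems must be a non-negative integer")

    # Hypothetical derived field: None stands for "scrape everything".
    merged["limit"] = None if max_items == 0 else max_items
    return merged


print(normalize_input({"maxItems": 0})["limit"])
```

Representing "no limit" as `None` instead of 0 keeps later checks such as `limit is not None and count >= limit` unambiguous.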
Scrape All CI/CD Tools
{
  "searchQuery": "query=ci+cd",
  "maxItems": 50
}
Scrape Specific Apps by URL
{
  "appsUrls": [
    { "url": "https://github.com/marketplace/codecov" },
    { "url": "https://github.com/marketplace/snyk" }
  ],
  "maxItems": 10
}
Full Marketplace Crawl
{
  "searchQuery": "",
  "maxItems": 0
}
GitHub Marketplace Scraper Output Fields
{
  "app_name": "Codecov",
  "developer_name": "Codecov",
  "description": "Code coverage done right.",
  "description_long": "Codecov provides highly integrated tools to group...",
  "logo_url": "https://avatars.githubusercontent.com/ml/7?s=400",
  "number_of_installs": 45200,
  "privacy_policy": "https://about.codecov.io/privacy",
  "support_email": "support@codecov.io",
  "categories": [{ "name": "Code quality" }, { "name": "Continuous integration" }],
  "plans": [{ "name": "Free", "price": "$0" }, { "name": "Team", "price": "$10/month" }],
  "app_url": "https://github.com/marketplace/codecov"
}
| Field | Type | Description |
|---|---|---|
| app_name | String | Application name as displayed on the marketplace |
| developer_name | String | Developer or organization name |
| description | String | Short description of the app |
| description_long | String | Full extended description from the app's detail page |
| logo_url | String | URL to the app's logo image |
| number_of_installs | Number | Total number of installations |
| privacy_policy | String | URL to the app's privacy policy |
| support_email | String | Developer support email address |
| categories | Array | Marketplace categories the app belongs to (each with a name field) |
| plans | Array | Pricing plans with name and price fields |
| app_url | String | URL to the app's marketplace listing |
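Since `categories` and `plans` are nested arrays, they usually need flattening before they fit a spreadsheet. The sketch below shows one possible way to turn output records into flat contact rows and CSV text. The `to_contact_rows` and `to_csv` helpers are hypothetical, and the sample record is a trimmed copy of the example output above.

```python
import csv
import io

FIELDNAMES = [
    "app_name", "developer_name", "support_email",
    "number_of_installs", "categories", "plans",
]


def to_contact_rows(records):
    """Flatten nested categories and plans into semicolon-separated strings."""
    rows = []
    for rec in records:
        rows.append({
            "app_name": rec.get("app_name", ""),
            "developer_name": rec.get("developer_name", ""),
            "support_email": rec.get("support_email", ""),
            "number_of_installs": rec.get("number_of_installs", 0),
            "categories": "; ".join(c["name"] for c in rec.get("categories", [])),
            "plans": "; ".join(
                f'{p["name"]} ({p["price"]})' for p in rec.get("plans", [])
            ),
        })
    return rows


def to_csv(rows):
    """Serialize flattened rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDNAMES)
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Trimmed copy of the example record from this section.
SAMPLE_RECORDS = [{
    "app_name": "Codecov",
    "developer_name": "Codecov",
    "number_of_installs": 45200,
    "support_email": "support@codecov.io",
    "categories": [{"name": "Code quality"}, {"name": "Continuous integration"}],
    "plans": [{"name": "Free", "price": "$0"}, {"name": "Team", "price": "$10/month"}],
}]

rows = to_contact_rows(SAMPLE_RECORDS)
print(to_csv(rows))
```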
FAQ
How many apps can the GitHub Marketplace Scraper collect?
GitHub Marketplace has over 1,000 apps listed. A full crawl with maxItems set to 0 will paginate through the entire catalog. Set a lower value if you only need a sample.
Does the scraper need proxies?
No. GitHub Marketplace does not block or rate-limit scraping traffic aggressively. The scraper works with proxies disabled by default. You can enable Apify Proxy if you want, but it is not necessary.
How long does a full marketplace scrape take?
A full crawl of all 1,000+ apps typically completes in under 15 minutes on 256 MB of memory. The scraper runs at 5 concurrent requests to stay within rate limits.
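A concurrency cap like the one mentioned here is commonly implemented with a semaphore. The following sketch illustrates that general technique with Python's `asyncio`. It is not the scraper's own code: the requests are simulated with short sleeps, and `peak_concurrency` is a hypothetical helper that reports how many simulated requests ran at once.

```python
import asyncio


async def _measure_peak(n_tasks, limit):
    sem = asyncio.Semaphore(limit)  # at most `limit` holders at a time
    active = 0
    peak = 0

    async def fake_request(_index):
        nonlocal active, peak
        async with sem:
            active += 1
            peak = max(peak, active)
            await asyncio.sleep(0.01)  # stand-in for an HTTP request
            active -= 1

    await asyncio.gather(*(fake_request(i) for i in range(n_tasks)))
    return peak


def peak_concurrency(n_tasks, limit=5):
    """Run n_tasks simulated requests and return the highest overlap seen."""
    return asyncio.run(_measure_peak(n_tasks, limit))


print(peak_concurrency(20))
```

All 20 tasks are scheduled immediately, but only five hold the semaphore at any moment; the rest wait until a slot is released.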
What does it cost to run?
Each run has a $0.10 start fee plus $0.002 per app record. A full 1,000-app crawl costs roughly $2.10. A 20-app sample run costs about $0.14.
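The arithmetic behind those figures is a start fee plus a per-record charge. A small sketch, using the two rates quoted in this answer and a hypothetical `estimate_run_cost` helper, reproduces both examples. It covers usage fees only and leaves out the monthly subscription price.

```python
from decimal import Decimal

START_FEE = Decimal("0.10")    # per run, as quoted above
PER_RECORD = Decimal("0.002")  # per app record, as quoted above


def estimate_run_cost(records):
    """Usage cost in USD for one run that returns `records` app records."""
    if records < 0:
        raise ValueError("records must be non-negative")
    return START_FEE + PER_RECORD * records


print(estimate_run_cost(1000))
print(estimate_run_cost(20))
```

`Decimal` is used instead of floats so that small per-record amounts add up without rounding drift.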
Can I search for apps in a specific category?
Use the searchQuery field with a category keyword, e.g. "query=security" or "query=code+review". The scraper will filter results to apps matching that search term.
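If you generate `searchQuery` values programmatically, URL-encoding the keywords produces the same `query=...` form shown in the examples. This sketch uses Python's standard `urllib.parse.urlencode`; the `build_search_query` wrapper is a hypothetical convenience, and whether the actor accepts other encodings is not stated in this document.

```python
from urllib.parse import urlencode


def build_search_query(keywords):
    """Encode free-text keywords as a query=... string (spaces become '+')."""
    return urlencode({"query": keywords})


print(build_search_query("code review"))
print(build_search_query("security"))
```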
What is the difference between searchQuery and appsUrls?
searchQuery discovers apps through the marketplace search pages. appsUrls takes direct links to specific app pages and skips search entirely. If both are provided, appsUrls takes priority.
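The priority rule can be expressed as a small selection function. This is a sketch of the documented behavior, not the actor's source: `select_start_urls` is a hypothetical name, and the way the search URL is assembled from `searchQuery` is an assumption.

```python
MARKETPLACE_URL = "https://github.com/marketplace"


def select_start_urls(run_input):
    """Return (mode, urls): direct app URLs win over search discovery."""
    apps = run_input.get("appsUrls") or []
    if apps:
        # Direct URL mode: skip search entirely.
        return "direct", [entry["url"] for entry in apps]

    query = run_input.get("searchQuery") or ""
    # Assumption: the query string is appended to the marketplace URL as-is.
    search_url = f"{MARKETPLACE_URL}?{query}" if query else MARKETPLACE_URL
    return "search", [search_url]


print(select_start_urls({
    "searchQuery": "query=security",
    "appsUrls": [{"url": "https://github.com/marketplace/codecov"}],
}))
```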
Need More Features?
Need custom fields, filters, or a scraper for a different developer tool marketplace? File an issue or get in touch.
Why Use the GitHub Marketplace Scraper?
- Contact data included — Support emails and developer names in every record, ready for outreach without manual lookup
- Full pricing visibility — Plan names and prices extracted per app, so you can analyze the marketplace's pricing landscape in a spreadsheet
- Lightweight — No browser rendering, no proxies needed, 256 MB of memory, completes in minutes