GitHub Issues Scraper

Scrape GitHub issues from repos, orgs, or search queries. Extract titles, labels, assignees, comments, reactions. Export to JSON, CSV, Excel.

Pricing: Pay per usage
Developer: Glass Ventures (Maintained by Community)
Actor stats: 0 bookmarks · 2 total users · 1 monthly active user · last modified 2 days ago

Scrape GitHub issues from any repository, organization, or search query using the GitHub REST API. Extract titles, bodies, labels, assignees, comments, reactions, milestones, and more.

What does GitHub Issues Scraper do?

GitHub Issues Scraper lets you extract structured issue data from GitHub repositories at scale. Instead of manually browsing through hundreds of issues or writing custom API scripts, this actor handles pagination, rate limiting, and data normalization automatically.

It works with individual repositories, entire organizations (scrapes issues from all public repos), and GitHub's powerful issue search. The actor hits the official GitHub REST API directly, so the data is always accurate and complete.
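For reference, the endpoint behind repository scraping is GitHub's `GET /repos/{owner}/{repo}/issues`. A minimal sketch of how a paginated request URL for it can be built (the helper name is illustrative, not part of the actor):

```python
def issues_url(repo: str, state: str = "all", page: int = 1, per_page: int = 100) -> str:
    """Build a GitHub REST API URL that lists a repository's issues."""
    # per_page maxes out at 100; pull requests are included in the results
    # and must be filtered out downstream if you only want issues.
    return (
        f"https://api.github.com/repos/{repo}/issues"
        f"?state={state}&per_page={per_page}&page={page}"
    )

# Fetch page after page until a page comes back with fewer than per_page items.
second_page = issues_url("apify/crawlee", state="open", page=2)
```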

Whether you need to track open bugs across a competitor's repos, analyze issue trends for market research, or export issues for project management dashboards, this actor delivers clean, structured data ready for analysis.

Use Cases

  • Open source maintainers -- Export all issues from your repositories for analysis and triage in spreadsheets or dashboards
  • Market researchers -- Track competitor product issues and feature requests to identify market gaps
  • Data analysts -- Analyze issue trends, response times, and community engagement across repositories
  • Developers -- Monitor libraries and frameworks for bugs and breaking changes
  • Project managers -- Bulk export issues with labels, milestones, and assignees for reporting

Features

  • Scrape issues from any public GitHub repository
  • Scrape all repos in an organization automatically
  • Search issues across all of GitHub with search queries
  • Filter by issue state (open, closed, all)
  • Optionally fetch all comments for each issue
  • Extract reactions, labels, assignees, milestones
  • Support for GitHub personal access tokens (5,000 requests/hour vs 60)
  • Automatic pagination and rate limit handling
  • Exports to JSON, CSV, Excel, or connect via API

How much will it cost?

GitHub Issues Scraper uses the GitHub REST API, which is very efficient. Most runs complete quickly with minimal compute.

Results    Estimated Cost
100        ~$0.01
1,000      ~$0.05
10,000     ~$0.25

Cost Component      Per 1,000 Results
Platform compute    ~$0.03
Proxy (optional)    ~$0.00
Total               ~$0.03

Note: Fetching comments increases API calls and run time. Without a GitHub token, the rate limit is 60 requests/hour, which limits throughput.
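To see why comments raise cost: listing issues takes roughly one API request per 100 issues, while fetching comments adds at least one extra request per issue. A back-of-the-envelope sketch (assuming 100 items per listing page):

```python
import math

def estimated_requests(num_issues: int, include_comments: bool = False) -> int:
    """Rough API request count for a run of the given size."""
    calls = math.ceil(num_issues / 100)  # one listing call per page of 100 issues
    if include_comments:
        calls += num_issues  # at least one extra call per issue with comments
    return calls
```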

How to use

  1. Go to the GitHub Issues Scraper page on Apify Store
  2. Click "Start" or "Try for free"
  3. Enter GitHub repository URLs (e.g., https://github.com/apify/crawlee) or search terms
  4. Optionally set the issue state filter and whether to include comments
  5. Set the maximum number of issues to scrape
  6. Click "Start" and wait for the results

Input parameters

Parameter        Type     Description                                    Default
startUrls        array    GitHub repo or org URLs                        -
searchTerms      array    Search queries for GitHub issues               -
issueState       string   Filter: all, open, or closed                   all
includeComments  boolean  Fetch comments for each issue                  false
githubToken      string   Personal access token for higher rate limits   -
maxItems         number   Max issues to return                           100
maxConcurrency   number   Parallel API requests                          5
proxyConfig      object   Proxy settings                                 Apify Proxy
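Put together, a typical input object for the parameters above might look like this (all values are illustrative):

```json
{
  "startUrls": [{ "url": "https://github.com/apify/crawlee" }],
  "issueState": "open",
  "includeComments": false,
  "githubToken": "ghp_yourTokenHere",
  "maxItems": 500,
  "maxConcurrency": 5
}
```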

Output

The actor produces a dataset with the following fields:

{
  "url": "https://github.com/apify/crawlee/issues/1234",
  "issueNumber": 1234,
  "title": "Bug: CheerioCrawler timeout on large pages",
  "body": "## Description\nWhen crawling pages larger than 5MB...",
  "state": "open",
  "author": "username",
  "authorUrl": "https://github.com/username",
  "labels": ["bug", "priority-high"],
  "assignees": ["maintainer1"],
  "commentsCount": 5,
  "reactionsCount": 3,
  "reactions": { "+1": 2, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 1, "rocket": 0, "eyes": 0 },
  "milestone": "v3.5.0",
  "isPullRequest": false,
  "repository": "apify/crawlee",
  "createdAt": "2024-01-15T10:30:00Z",
  "updatedAt": "2024-01-20T14:00:00Z",
  "closedAt": null,
  "comments": null,
  "scrapedAt": "2024-01-25T12:00:00.000Z"
}
Field           Type     Description
url             string   Issue URL on GitHub
issueNumber     number   Issue number in the repository
title           string   Issue title
body            string   Issue body content (Markdown)
state           string   open or closed
author          string   Username of issue creator
authorUrl       string   Profile URL of issue creator
labels          array    List of label names
assignees       array    List of assigned usernames
commentsCount   number   Number of comments
reactionsCount  number   Total reaction count
reactions       object   Reaction breakdown by type
milestone       string   Milestone name
isPullRequest   boolean  Whether entry is a pull request
repository      string   Repository full name (owner/repo)
createdAt       string   ISO 8601 creation date
updatedAt       string   ISO 8601 last update date
closedAt        string   ISO 8601 close date
comments        array    Full comment data (when includeComments is true)
scrapedAt       string   ISO 8601 scrape timestamp
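Because the dataset mixes issues and pull requests (flagged via isPullRequest), a small post-processing step is often useful. A sketch with a hypothetical helper, not part of the actor:

```python
def split_items(items: list[dict]) -> tuple[list[dict], list[dict]]:
    """Split dataset items into true issues and pull requests via isPullRequest."""
    issues = [it for it in items if not it.get("isPullRequest")]
    pulls = [it for it in items if it.get("isPullRequest")]
    return issues, pulls
```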

Integrations

Connect GitHub Issues Scraper with other tools:

  • Apify API -- REST API for programmatic access
  • Webhooks -- get notified when a run finishes
  • Zapier / Make -- connect to 5,000+ apps
  • Google Sheets -- export directly to spreadsheets

API Example (Node.js)

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_TOKEN' });

const run = await client.actor('YOUR_USERNAME/github-issues-scraper').call({
    startUrls: [{ url: 'https://github.com/apify/crawlee' }],
    maxItems: 100,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();

API Example (Python)

from apify_client import ApifyClient

client = ApifyClient('YOUR_TOKEN')

run = client.actor('YOUR_USERNAME/github-issues-scraper').call(run_input={
    'startUrls': [{'url': 'https://github.com/apify/crawlee'}],
    'maxItems': 100,
})

items = client.dataset(run['defaultDatasetId']).list_items().items

API Example (cURL)

curl "https://api.apify.com/v2/acts/YOUR_USERNAME~github-issues-scraper/runs" \
  -X POST \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -d '{"startUrls": [{"url": "https://github.com/apify/crawlee"}], "maxItems": 100}'

Tips and tricks

  • Start with a small maxItems (10-20) to test before running large scrapes
  • Add a GitHub personal access token to increase rate limits from 60 to 5,000 requests/hour
  • Use issueState: "open" to only get active issues and reduce data volume
  • Enable includeComments only when you need comment data, as it significantly increases API calls
  • For organization URLs, all public repos will be scraped -- combine with maxItems to control volume
  • GitHub search is limited to 1,000 results per query -- use specific search terms for best results
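One common way to work around the 1,000-result search cap mentioned above is to split a broad query into date windows with GitHub's created: search qualifier. An illustrative sketch:

```python
def yearly_queries(base_query: str, years: range) -> list[str]:
    """Split one broad GitHub issue search into per-year queries,
    so each stays under the 1,000-result search cap."""
    return [f"{base_query} created:{y}-01-01..{y}-12-31" for y in years]

queries = yearly_queries("repo:apify/crawlee label:bug", range(2023, 2025))
```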

FAQ

Q: Does this actor require login credentials? A: No. GitHub's REST API is publicly accessible. However, adding a personal access token increases the rate limit from 60 to 5,000 requests per hour.

Q: How fast is the scraping? A: Without a token: ~50-60 issues per hour (rate limited). With a token: ~5,000-10,000 issues per hour depending on whether comments are included.

Q: What should I do if I get rate limited? A: Add a GitHub personal access token in the Authentication section. You can create one at github.com/settings/tokens with no special permissions needed for public repos.

Q: Does it scrape pull requests too? A: GitHub's issues API includes pull requests. Each item has an isPullRequest field so you can filter them out if needed.

Q: Can I scrape private repositories? A: Yes, if you provide a GitHub personal access token that has access to the private repository.

This actor uses the official GitHub REST API, which is a public API designed for programmatic access. It respects rate limits and follows GitHub's API usage guidelines. GitHub's API Terms of Service permit accessing public data. Always review GitHub's Terms of Service for your specific use case. For more information, see Apify's blog on web scraping legality.

Limitations

  • Without a GitHub token, rate limit is 60 API requests per hour
  • GitHub search API returns a maximum of 1,000 results per query
  • Only public repositories are accessible without authentication
  • Pull requests are included in the issues API (filterable via isPullRequest field)

Changelog

  • v0.1 (2026-04-23) -- Initial release