GitHub BR Tech Jobs Scraper avatar

GitHub BR Tech Jobs Scraper

Pricing

Pay per event

Go to Apify Store
GitHub BR Tech Jobs Scraper

GitHub BR Tech Jobs Scraper

Scrapes Brazilian developer job boards from GitHub Issues (frontendbr/vagas, backend-br/vagas and more). Parses free-text bodies into structured records with location, stack, seniority and employment type.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

5 days ago

Last modified

Share

Scrapes Brazilian developer job boards from GitHub Issues across multiple community-curated repos (frontendbr/vagas, backend-br/vagas, react-brasil/vagas, python-brasil/vagas, and others). Parses free-text issue bodies into structured job records with location, tech stack, seniority, and employment type.

What it collects

Each record contains:

FieldDescription
issue_numberGitHub issue number
titleRaw issue title
stateopen or closed
poster_loginGitHub username of the poster
poster_avatar_urlAvatar URL
body_markdownFull job post body (markdown)
parsed_role_titleRole title extracted from the title
parsed_companyCompany name if detectable
parsed_locationLocation string (e.g. remoto, sao-paulo)
parsed_is_remotetrue if the role is remote
parsed_employment_typeclt, pj, freelance, or estagio
parsed_seniorityjunior, pleno, senior, or especialista
parsed_tech_stackComma-separated tech keywords from title + body
labelsComma-separated GitHub labels
comment_countNumber of comments
reactions_totalTotal reactions
created_atISO 8601 creation timestamp
updated_atISO 8601 last update timestamp
closed_atISO 8601 close timestamp (if applicable)
source_urlGitHub issue URL
source_repoSource repository (e.g. frontendbr/vagas)

Input

ParameterTypeDefaultDescription
reposarray["frontendbr/vagas","backend-br/vagas","react-brasil/vagas","python-brasil/vagas"]GitHub owner/repo repos to scrape
statestringopenIssue state: open, closed, or all
githubTokenstringOptional GitHub Personal Access Token for 5,000 req/hr rate (vs 60/hr unauthenticated)
maxItemsintegerMaximum total records across all repos (0 = unlimited)

Getting a GitHub token

No special scopes are needed for public repos. Create a classic token at github.com/settings/tokens with no permissions selected.

Supported repos

Any public GitHub repo that uses GitHub Issues as a job board works. Community repos known to follow this pattern:

  • frontendbr/vagas — Brazilian front-end jobs (~8k open issues)
  • backend-br/vagas — Brazilian back-end jobs
  • react-brasil/vagas — React/React Native jobs
  • python-brasil/vagas — Python jobs
  • cybersecurity-br/vagas — Cybersecurity jobs

Rate limits

ModeLimit
Unauthenticated60 requests/hr per IP
Authenticated (GitHub PAT)5,000 requests/hr

For a default run of a few hundred records, unauthenticated access is sufficient. For full historical scrapes across many repos, provide a GitHub PAT.

Notes

  • Pull requests are automatically filtered out (they appear in the GitHub Issues API response).
  • The body parser follows community title conventions: [CLT][Remoto] Role na Company.
  • Tech stack keywords are matched against a curated list of ~60 common BR dev technologies.