Gupy.io Jobs Scraper

Pricing

Pay per usage

Extract job listings from Gupy, Brazil's leading recruitment platform. Scrape job titles, company details, locations, work models, and application links in seconds. Perfect for job boards, data analysis, and recruitment automation. Get structured datasets with zero coding required.

Rating

5.0

(1)

Developer

Shahid Irfan

Maintained by Community

Extract comprehensive job listings from the Gupy jobs portal with ease. Scrape vacancy details including job titles, companies, locations, descriptions, work models, and direct application links at scale. Perfect for job market monitoring, recruitment analysis, and lead generation.


Features

  • Portal URL support — Extract jobs directly using a portal.gupy.io/job-search URL to preserve all filters.
  • Keyword and location search — Find jobs by keyword and location without needing a full URL.
  • Pagination controls — Limit the number of results and pages to manage usage and scraping speed.
  • Normalized output — Get clean, structured datasets with consistent field names.
  • Ready-to-use datasets — Export job records for market analysis, lead generation, or custom databases.

Use Cases

Job Market Research

Track hiring volume, job titles, and location patterns across the Gupy ecosystem. Build datasets for recruiting research, salary benchmarking support, or trend analysis.

Lead Generation

Identify companies hiring for specific roles, regions, or work models. Use the dataset to monitor employers and career pages relevant to your niche.

Competitive Intelligence

Compare open roles, publication cadence, and workplace models across employers. Spot which companies are growing, hiring remotely, or expanding into new locations.

Recruitment Automation

Feed job results into spreadsheets, internal dashboards, or workflow tools. Use recurring runs to keep job pipelines fresh without manual searching.


Input Parameters

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| url | String | No | https://portal.gupy.io/job-search/sortBy=publishedDate | Gupy search URL from portal.gupy.io/job-search. If provided, its filters are used first. |
| keyword | String | No | | Optional job keyword when you do not want to use a full URL. |
| location | String | No | | Optional city or state such as Sao Paulo, Sao Paulo - SP, or Pernambuco. |
| sortBy | String | No | "publishedDate" | Sort order used when searching with keyword and location. |
| results_wanted | Integer | No | 20 | Maximum number of jobs to collect. |
| max_pages | Integer | No | 1 | Safety cap for pagination. |
| proxyConfiguration | Object | No | {"useApifyProxy": false} | Optional Apify proxy settings. |
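
The parameters above can also be assembled programmatically. The following is a minimal Python sketch, not part of the actor itself: the helper name `build_run_input` and its validation rule are assumptions made for illustration, while the field names and defaults are taken from the table.

```python
from typing import Any, Optional


def build_run_input(
    url: Optional[str] = None,
    keyword: Optional[str] = None,
    location: Optional[str] = None,
    sort_by: str = "publishedDate",
    results_wanted: int = 20,
    max_pages: int = 1,
    use_apify_proxy: bool = False,
) -> dict[str, Any]:
    """Hypothetical helper: build an input object using the documented field names."""
    # Assumption: non-positive limits are treated as a caller error.
    if results_wanted < 1 or max_pages < 1:
        raise ValueError("results_wanted and max_pages must be positive integers")

    run_input: dict[str, Any] = {
        "sortBy": sort_by,
        "results_wanted": results_wanted,
        "max_pages": max_pages,
        "proxyConfiguration": {"useApifyProxy": use_apify_proxy},
    }
    # Optional search fields are only included when the caller sets them.
    for key, value in (("url", url), ("keyword", keyword), ("location", location)):
        if value:
            run_input[key] = value
    return run_input
```

For example, `build_run_input(keyword="Social Media", results_wanted=30, max_pages=2)` yields an object with the keyword and both limits set, and no `url` or `location` key.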

Output Data

Each item in the dataset contains:

| Field | Type | Description |
| --- | --- | --- |
| jobId | Integer | Unique Gupy job identifier. |
| title | String | Job title. |
| company | String | Company or career page name. |
| companyId | Integer | Company identifier. |
| careerPageId | Integer | Career page identifier. |
| careerPageName | String | Career page name from Gupy. |
| careerPageLogoUrl | String | Company logo URL. |
| careerPageUrl | String | Career page URL. |
| descriptionHtml | String | Rich job description content. |
| descriptionText | String | Plain-text job description. |
| jobType | String | Raw job type code. |
| jobTypeLabel | String | Readable job type label. |
| publishedDate | String | Publish timestamp. |
| applicationDeadline | String | Application deadline when available. |
| isRemoteWork | Boolean | Whether the role is marked as remote. |
| workplaceType | String | Raw workplace type code. |
| workplaceTypeLabel | String | Readable work model label. |
| city | String | Job city. |
| state | String | Job state. |
| country | String | Job country. |
| location | String | Combined location string. |
| jobUrl | String | Direct link to the vacancy. |
| acceptsDisabilities | Boolean | Whether the vacancy is flagged for PWD applicants. |
| skills | Array | Skills when present. |
| sourceUrl | String | Source search URL used for the run. |

Usage Examples

Latest Jobs

{
  "url": "https://portal.gupy.io/job-search/sortBy=publishedDate",
  "results_wanted": 20,
  "max_pages": 1
}

Keyword Search

{
  "keyword": "Social Media",
  "results_wanted": 30,
  "max_pages": 2
}

Location Search

{
  "location": "Sao Paulo - SP",
  "results_wanted": 25,
  "max_pages": 2
}
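
Inputs like the ones above can also be submitted over the Apify REST API instead of the Console. The sketch below uses only the Python standard library and the platform's `run-sync-get-dataset-items` endpoint. The actor ID shown in the usage note is a hypothetical placeholder (this page does not state the real one), and the function names are illustrative.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Prepare a POST request that runs an actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_and_fetch(request: urllib.request.Request, timeout: float = 300) -> list:
    """Send the prepared request and decode the JSON array of job records."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A call might look like `run_and_fetch(build_run_request("someuser~gupy-jobs-scraper", token, {"keyword": "Social Media", "results_wanted": 30, "max_pages": 2}))`, with the placeholder ID replaced by the one shown on the actor's API tab. The synchronous endpoint waits for the run to finish, so it suits small runs best.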

Sample Output

{
  "jobId": 11331333,
  "title": "ATENDENTE RESTAURANTE 12X36 ( CENTRO - BELFORD ROXO/RJ)",
  "company": "McDonald's Restaurante - Arcos Dorados",
  "companyId": 68123,
  "careerPageId": 164080,
  "careerPageName": "McDonald's Restaurante - Arcos Dorados",
  "careerPageLogoUrl": "https://attachments.gupy.io/production/companies/68123/career/164080/images/2023-07-20_22-47_companyLogoUrl.png",
  "careerPageUrl": "https://restaurantemc.gupy.io/eyJzb3VyY2UiOiJndXB5X3BvcnRhbCJ9",
  "descriptionHtml": "#A gente vai amar muito se voce...",
  "descriptionText": "#A gente vai amar muito se voce... Responsabilidades e atribuicoes...",
  "jobType": "vacancy_type_effective",
  "jobTypeLabel": "Effective",
  "publishedDate": "2026-05-21T03:00:25.306Z",
  "applicationDeadline": "2026-07-20",
  "isRemoteWork": false,
  "workplaceType": "on-site",
  "workplaceTypeLabel": "On-site",
  "city": "Belford Roxo",
  "state": "Rio de Janeiro",
  "country": "Brasil",
  "location": "Belford Roxo, Rio de Janeiro, Brasil",
  "jobUrl": "https://restaurantemc.gupy.io/job/eyJqb2JJZCI6MTEzMzEzMzMsInNvdXJjZSI6Imd1cHlfcG9ydGFsIn0=?jobBoardSource=gupy_portal",
  "acceptsDisabilities": true,
  "sourceUrl": "https://portal.gupy.io/job-search/sortBy=publishedDate"
}
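
Records shaped like the sample above lend themselves to simple aggregation, for instance counting vacancies per work model or per state. The following Python sketch assumes the dataset has already been loaded as a list of dictionaries. The helper name `count_by` and the "unknown" bucket for missing values are illustrative choices, not part of the actor.

```python
from collections import Counter
from typing import Any, Iterable


def count_by(items: Iterable[dict[str, Any]], field: str) -> Counter:
    """Count records per value of `field`; records lacking the field go to 'unknown'."""
    return Counter(item.get(field, "unknown") for item in items)


if __name__ == "__main__":
    # Illustrative records only; real items carry the full field set.
    jobs = [
        {"jobId": 1, "state": "Rio de Janeiro", "workplaceTypeLabel": "On-site"},
        {"jobId": 2, "state": "Rio de Janeiro"},
    ]
    print(count_by(jobs, "state").most_common())
    print(count_by(jobs, "workplaceTypeLabel").most_common())
```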

Tips for Best Results

Use Real Portal URLs

  • Copy the full Gupy search URL from your browser when you need exact work model, job type, or date filters.
  • This ensures the scraper uses the precise parameters you configured visually on the website.

Start with Small Batches

  • Use a low results_wanted (e.g., 20) and limit max_pages to 1 or 2 during testing.
  • This allows you to verify your query configuration before initiating large scraping runs.

Optimize Location Queries

  • Enter locations in a clear, conventional form such as Sao Paulo, Rio de Janeiro - RJ, or a full state name like Parana.
  • These forms follow the way the portal names cities and states, which gives cleaner and more relevant filtering.

Control Large Crawls

  • Large queries can return thousands of results, which consumes more resources.
  • Set a sensible limit using results_wanted and max_pages to keep runs fast and cost-efficient.
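
To see how the two limits interact, it can help to estimate the most records a run could return. The sketch below assumes a run stops at whichever limit is reached first. The portal's page size is not stated in this document, so `page_size` is a hypothetical parameter you would need to observe yourself.

```python
def max_collectable(results_wanted: int, max_pages: int, page_size: int) -> int:
    """Upper bound on collected jobs, assuming both limits act as independent caps.

    `page_size` (jobs per result page) is an assumption supplied by the caller.
    """
    return min(results_wanted, max_pages * page_size)
```

Under that assumption, with a page size of 10, asking for 30 results across 2 pages could return at most 20 jobs, so `max_pages` would need raising to 3 to reach 30.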

Proxy Configuration

For reliable scraping and to avoid rate limits, residential proxies are recommended:

{
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
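
If you build inputs in code, the proxy block above can be merged into an existing input object. This is a small Python sketch. The helper name `with_residential_proxy` is hypothetical, and it returns a copy so the original input is left untouched.

```python
import copy
from typing import Any


def with_residential_proxy(run_input: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of `run_input` with the residential proxy settings applied."""
    updated = copy.deepcopy(run_input)
    updated["proxyConfiguration"] = {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    }
    return updated
```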

Integrations

Connect your data with:

  • Google Sheets — Export job listings directly to spreadsheets for easy analysis
  • Airtable — Build searchable job boards and hiring pipelines
  • Zapier — Automate alerts and downstream workflow triggers
  • Make — Connect your runs to hundreds of third-party apps
  • Webhooks — Send real-time data to your custom API endpoints

Export Formats

  • JSON — For developers and seamless API integrations
  • CSV — Ideal for spreadsheet applications and reporting
  • Excel — For sharing with business teams
  • XML — For legacy system integrations
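
The platform produces these formats for you. If you instead start from JSON items and want a custom column selection, a local conversion is straightforward. The Python sketch below is one possible approach, not the platform's exporter: it keeps only the columns you name and joins the `skills` array into one cell, where the "; " separator is an arbitrary choice.

```python
import csv
import io
from typing import Any, Iterable, Sequence


def items_to_csv(items: Iterable[dict[str, Any]], fields: Sequence[str]) -> str:
    """Render job records as CSV text, keeping only the requested columns."""
    buffer = io.StringIO(newline="")
    # Missing fields become empty cells; fields not listed are ignored.
    writer = csv.DictWriter(buffer, fieldnames=fields, extrasaction="ignore", restval="")
    writer.writeheader()
    for item in items:
        row = dict(item)
        if isinstance(row.get("skills"), list):
            row["skills"] = "; ".join(str(skill) for skill in row["skills"])
        writer.writerow(row)
    return buffer.getvalue()
```

Calling `items_to_csv(items, ["jobId", "title", "company", "location", "jobUrl"])` gives a compact sheet for review. Note that the `csv` module ends rows with `\r\n` by default.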

Frequently Asked Questions

How many jobs can I scrape?

The scraper can collect every public job listing your search returns, up to the results_wanted and max_pages limits you set. Beyond those settings, the ceiling depends on how many results Gupy's portal exposes through pagination.

Can I scrape multiple pages?

Yes, the scraper automatically navigates through pagination up to your defined max_pages limit.

Do I need a Gupy account to use this scraper?

No, the scraper works with publicly available job search endpoints and does not require credentials or logging in.

Why are some fields empty in the output?

Some job listings may not contain optional fields like skills, application deadlines, or specific work models. Empty values are omitted to keep your dataset clean.
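
Because empty values are omitted, downstream code should not assume every key exists. One way to handle this, sketched in Python, is to fill the optional fields with defaults before processing. The helper name and the choice of `None` and an empty list as defaults are assumptions, and the list of optional fields follows the examples named in the answer above.

```python
from typing import Any

# Fields the answer above names as possibly absent (besides skills).
OPTIONAL_FIELDS = ("applicationDeadline", "workplaceType", "workplaceTypeLabel")


def fill_missing(item: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of `item` where omitted optional fields get explicit defaults."""
    filled: dict[str, Any] = {field: None for field in OPTIONAL_FIELDS}
    filled["skills"] = []  # fresh list per record, never shared between records
    filled.update(item)    # values present in the record always win
    return filled
```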

What is the difference between keyword search and using a portal URL?

A portal URL carries every filter you applied on the Gupy website, including complex ones such as specific work formats or companies. Keyword search is simpler: it looks up jobs by title or term without requiring a prepared URL.

Can I run the scraper on a schedule?

Yes, you can schedule runs hourly, daily, or weekly using Apify's scheduling tool in the Console.
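
With recurring runs, the same vacancy can appear in several datasets. Since `jobId` is described as a unique identifier, it can serve as a deduplication key. The Python sketch below keeps the set of seen IDs in memory. How you persist that set between runs (a file, a database) is left open, and the helper name is illustrative.

```python
from typing import Any, Iterable


def new_jobs(items: Iterable[dict[str, Any]], seen_ids: set) -> list[dict[str, Any]]:
    """Return only records whose jobId is not in `seen_ids`, updating the set in place."""
    fresh = []
    for item in items:
        job_id = item.get("jobId")
        # Records without an ID cannot be tracked reliably, so they are skipped.
        if job_id is None or job_id in seen_ids:
            continue
        seen_ids.add(job_id)
        fresh.append(item)
    return fresh
```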


Support

For issues, feature requests, or custom scraping needs, contact support through the Apify Console.

This actor is designed for legitimate data collection purposes. Users are responsible for ensuring compliance with website terms of service and applicable laws. Use data responsibly and respect rate limits.