Indeed Job Scraper avatar

Indeed Job Scraper

Pricing

$19.99/month + usage

Go to Apify Store
Indeed Job Scraper

Indeed Job Scraper

Indeed Job Scraper extracts job listings from Indeed search results. Collect job titles, company names, locations, salaries, descriptions, posting dates, and job URLs. Ideal for job market research, recruitment analysis, salary insights, and building job datasets.

Pricing

$19.99/month + usage

Rating

0.0

(0)

Developer

Scrapio

Scrapio

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Indeed Jobs Scraper

Apify Actor that scrapes Indeed job listings from search or list URLs. Uses no proxy by default; if Indeed blocks the request, the actor falls back to datacenter proxy, then to residential proxy (with up to 3 retries), and sticks with residential for all remaining requests.

Why Choose Us?

  • Same output as official-style scrapers: Structured job, company, location, apply, sourceMeta, and _raw per result.
  • Proxy fallback: Starts without proxy; automatically escalates to datacenter then residential on block, with clear logs.
  • Bulk URLs: Supports multiple Indeed search/list URLs in one run.
  • Filters: Optional remote/level/date filters and deduplication.

Key Features

FeatureDescription
Bulk URLsInput many Indeed search or list URLs.
ProxyNo proxy by default; auto fallback to datacenter → residential on block.
Rich outputJob title, company, location, salary, benefits, apply URL, full description, etc.
PaginationFollows "next" and start parameter to collect more jobs per URL.
Live saveResults are pushed to the dataset as they are scraped.

Input

FieldTypeRequiredDescription
urlsarray (requestListSources)YesIndeed search or job list URLs (e.g. https://www.indeed.com/jobs?q=sales&l=New+York).
proxyConfigurationobject (proxy)YesProxy settings. Default: no proxy. Fallback is applied automatically on block.
maxRowsintegerNoMax total jobs (default: 50).
maxRowsPerUrlintegerNoMax jobs per URL (default: 50).
includeSimilarJobsbooleanNoInclude "View similar jobs" cards (default: true).
enableUniqueJobsbooleanNoDeduplicate by job ID (default: true).
remotestringNoFilter: remote or hybrid.
levelstringNoUS experience level: entry_level, mid_level, senior_level.
fromDaysintegerNoOnly jobs posted in last N days.
sortstringNodate or relevance (default: relevance).
countryCodestringNoIndeed country (default: us).

Example input

{
"urls": [{ "url": "https://www.indeed.com/jobs?q=sales&l=New+York%2C+NY" }],
"proxyConfiguration": { "useApifyProxy": false },
"maxRows": 50,
"maxRowsPerUrl": 50
}

Output

Each dataset item is one job with this structure (matches the reference output):

  • job: id, jobKey, title, displayTitle, normalizedTitle, descriptionText, descriptionHtml, jobDescription, jobDescriptionHTML, language, locale, isRemote, remoteLocation, jobType, shiftAndSchedule, workingSystem, occupation, requirements, attributes, benefits, salary (text, currency, type, min, max, weeklyMin, weeklyMax, source), hiring, dates, status.
  • company: name, displayName, industry, revenue, numEmployees, foundedYear, description, briefDescription, rating (score, count), ceo, branding, links, emails.
  • location: countryCode, country, state, stateName, city, postalCode, streetAddress, formattedShort, formattedLong, latitude, longitude.
  • apply: jobUrl, applyUrl, originalApplyUrl, thirdPartyApplyUrl, indeedApplyEnabled.
  • sourceMeta: source "Indeed", feedId, fccId, scrapingInfo, rankingScores.
  • _raw: Raw scraped job object.

How to Use (Apify Console)

  1. Open Apify Console and go to Actors.
  2. Find indeed-jobs-scraper and open it.
  3. Set urls (add one or more Indeed search URLs) and proxyConfiguration (default no proxy).
  4. Optionally set maxRows, maxRowsPerUrl, filters.
  5. Click Start.
  6. Watch Log for progress and proxy messages (e.g. [PROXY] Using no proxy, [PROXY] Fallback: switching to datacenter proxy).
  7. Open the Output tab to view or export the dataset (JSON/CSV).

Best Use Cases

  • Aggregating job listings from multiple Indeed searches or locations.
  • Building job feeds, research datasets, or recruitment pipelines.
  • Testing or backup when using no proxy first and falling back only when blocked.

Frequently Asked Questions

Do I need to enable proxy?
No. By default the actor uses no proxy. If Indeed returns 401/403, it will switch to datacenter then residential automatically.

What if residential proxy fails?
The actor retries up to 3 times with residential proxy. If it still fails, the run ends; you can try again or add proxy in input.

Can I use a single search URL?
Yes. Add one object to urls with your Indeed search URL.

Cautions

  • Data is collected from publicly available Indeed pages only.
  • You are responsible for compliance with applicable laws (e.g. privacy, data protection, terms of use).
  • Use reasonable limits (e.g. maxRows) and respect Indeed’s load.

Support and Feedback

For bugs or feature requests, use the Apify actor’s support channel or repository issues.