Indeed Job Scraper
Pricing
$19.99/month + usage
Indeed Job Scraper
Indeed Job Scraper extracts job listings from Indeed search results. Collect job titles, company names, locations, salaries, descriptions, posting dates, and job URLs. Ideal for job market research, recruitment analysis, salary insights, and building job datasets.
Pricing
$19.99/month + usage
Rating
0.0
(0)
Developer

Scrapio
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Indeed Jobs Scraper
Apify Actor that scrapes Indeed job listings from search or list URLs. Uses no proxy by default; if Indeed blocks the request, the actor falls back to datacenter proxy, then to residential proxy (with up to 3 retries), and sticks with residential for all remaining requests.
Why Choose Us?
- Same output as official-style scrapers: Structured
job,company,location,apply,sourceMeta, and_rawper result. - Proxy fallback: Starts without proxy; automatically escalates to datacenter then residential on block, with clear logs.
- Bulk URLs: Supports multiple Indeed search/list URLs in one run.
- Filters: Optional remote/level/date filters and deduplication.
Key Features
| Feature | Description |
|---|---|
| Bulk URLs | Input many Indeed search or list URLs. |
| Proxy | No proxy by default; auto fallback to datacenter → residential on block. |
| Rich output | Job title, company, location, salary, benefits, apply URL, full description, etc. |
| Pagination | Follows "next" and start parameter to collect more jobs per URL. |
| Live save | Results are pushed to the dataset as they are scraped. |
Input
| Field | Type | Required | Description |
|---|---|---|---|
| urls | array (requestListSources) | Yes | Indeed search or job list URLs (e.g. https://www.indeed.com/jobs?q=sales&l=New+York). |
| proxyConfiguration | object (proxy) | Yes | Proxy settings. Default: no proxy. Fallback is applied automatically on block. |
| maxRows | integer | No | Max total jobs (default: 50). |
| maxRowsPerUrl | integer | No | Max jobs per URL (default: 50). |
| includeSimilarJobs | boolean | No | Include "View similar jobs" cards (default: true). |
| enableUniqueJobs | boolean | No | Deduplicate by job ID (default: true). |
| remote | string | No | Filter: remote or hybrid. |
| level | string | No | US experience level: entry_level, mid_level, senior_level. |
| fromDays | integer | No | Only jobs posted in last N days. |
| sort | string | No | date or relevance (default: relevance). |
| countryCode | string | No | Indeed country (default: us). |
Example input
{"urls": [{ "url": "https://www.indeed.com/jobs?q=sales&l=New+York%2C+NY" }],"proxyConfiguration": { "useApifyProxy": false },"maxRows": 50,"maxRowsPerUrl": 50}
Output
Each dataset item is one job with this structure (matches the reference output):
- job: id, jobKey, title, displayTitle, normalizedTitle, descriptionText, descriptionHtml, jobDescription, jobDescriptionHTML, language, locale, isRemote, remoteLocation, jobType, shiftAndSchedule, workingSystem, occupation, requirements, attributes, benefits, salary (text, currency, type, min, max, weeklyMin, weeklyMax, source), hiring, dates, status.
- company: name, displayName, industry, revenue, numEmployees, foundedYear, description, briefDescription, rating (score, count), ceo, branding, links, emails.
- location: countryCode, country, state, stateName, city, postalCode, streetAddress, formattedShort, formattedLong, latitude, longitude.
- apply: jobUrl, applyUrl, originalApplyUrl, thirdPartyApplyUrl, indeedApplyEnabled.
- sourceMeta: source "Indeed", feedId, fccId, scrapingInfo, rankingScores.
- _raw: Raw scraped job object.
How to Use (Apify Console)
- Open Apify Console and go to Actors.
- Find indeed-jobs-scraper and open it.
- Set urls (add one or more Indeed search URLs) and proxyConfiguration (default no proxy).
- Optionally set maxRows, maxRowsPerUrl, filters.
- Click Start.
- Watch Log for progress and proxy messages (e.g.
[PROXY] Using no proxy,[PROXY] Fallback: switching to datacenter proxy). - Open the Output tab to view or export the dataset (JSON/CSV).
Best Use Cases
- Aggregating job listings from multiple Indeed searches or locations.
- Building job feeds, research datasets, or recruitment pipelines.
- Testing or backup when using no proxy first and falling back only when blocked.
Frequently Asked Questions
Do I need to enable proxy?
No. By default the actor uses no proxy. If Indeed returns 401/403, it will switch to datacenter then residential automatically.
What if residential proxy fails?
The actor retries up to 3 times with residential proxy. If it still fails, the run ends; you can try again or add proxy in input.
Can I use a single search URL?
Yes. Add one object to urls with your Indeed search URL.
Cautions
- Data is collected from publicly available Indeed pages only.
- You are responsible for compliance with applicable laws (e.g. privacy, data protection, terms of use).
- Use reasonable limits (e.g. maxRows) and respect Indeed’s load.
Support and Feedback
For bugs or feature requests, use the Apify actor’s support channel or repository issues.