Indeed Job Scraper
Pricing
$19.99/month + usage
Indeed Job Scraper
Pricing
$19.99/month + usage
Rating
0.0
(0)
Developer
ScrapePilot
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
5 hours ago
Last modified
Categories
Share
Indeed Jobs Scraper
Apify Actor that scrapes Indeed job listings from search or list URLs. Uses no proxy by default; if Indeed blocks the request, the actor falls back to datacenter proxy, then to residential proxy (with up to 3 retries), and sticks with residential for all remaining requests.
Why Choose Us?
- Same output as official-style scrapers: Structured
job,company,location,apply,sourceMeta, and_rawper result. - Proxy fallback: Starts without proxy; automatically escalates to datacenter then residential on block, with clear logs.
- Bulk URLs: Supports multiple Indeed search/list URLs in one run.
- Filters: Optional remote/level/date filters and deduplication.
Key Features
| Feature | Description |
|---|---|
| Bulk URLs | Input many Indeed search or list URLs. |
| Proxy | No proxy by default; auto fallback to datacenter → residential on block. |
| Rich output | Job title, company, location, salary, benefits, apply URL, full description, etc. |
| Pagination | Follows "next" and start parameter to collect more jobs per URL. |
| Live save | Results are pushed to the dataset as they are scraped. |
Input
| Field | Type | Required | Description |
|---|---|---|---|
| urls | array (requestListSources) | Yes | Indeed search or job list URLs (e.g. https://www.indeed.com/jobs?q=sales&l=New+York). |
| proxyConfiguration | object (proxy) | Yes | Proxy settings. Default: no proxy. Fallback is applied automatically on block. |
| maxRows | integer | No | Max total jobs (default: 50). |
| maxRowsPerUrl | integer | No | Max jobs per URL (default: 50). |
| includeSimilarJobs | boolean | No | Include "View similar jobs" cards (default: true). |
| enableUniqueJobs | boolean | No | Deduplicate by job ID (default: true). |
| remote | string | No | Filter: remote or hybrid. |
| level | string | No | US experience level: entry_level, mid_level, senior_level. |
| fromDays | integer | No | Only jobs posted in last N days. |
| sort | string | No | date or relevance (default: relevance). |
| countryCode | string | No | Indeed country (default: us). |
Example input
{"urls": [{ "url": "https://www.indeed.com/jobs?q=sales&l=New+York%2C+NY" }],"proxyConfiguration": { "useApifyProxy": false },"maxRows": 50,"maxRowsPerUrl": 50}
Output
Each dataset item is one job with this structure (matches the reference output):
- job: id, jobKey, title, displayTitle, normalizedTitle, descriptionText, descriptionHtml, jobDescription, jobDescriptionHTML, language, locale, isRemote, remoteLocation, jobType, shiftAndSchedule, workingSystem, occupation, requirements, attributes, benefits, salary (text, currency, type, min, max, weeklyMin, weeklyMax, source), hiring, dates, status.
- company: name, displayName, industry, revenue, numEmployees, foundedYear, description, briefDescription, rating (score, count), ceo, branding, links, emails.
- location: countryCode, country, state, stateName, city, postalCode, streetAddress, formattedShort, formattedLong, latitude, longitude.
- apply: jobUrl, applyUrl, originalApplyUrl, thirdPartyApplyUrl, indeedApplyEnabled.
- sourceMeta: source "Indeed", feedId, fccId, scrapingInfo, rankingScores.
- _raw: Raw scraped job object.
How to Use (Apify Console)
- Open Apify Console and go to Actors.
- Find indeed-jobs-scraper and open it.
- Set urls (add one or more Indeed search URLs) and proxyConfiguration (default no proxy).
- Optionally set maxRows, maxRowsPerUrl, filters.
- Click Start.
- Watch Log for progress and proxy messages (e.g.
[PROXY] Using no proxy,[PROXY] Fallback: switching to datacenter proxy). - Open the Output tab to view or export the dataset (JSON/CSV).
Best Use Cases
- Aggregating job listings from multiple Indeed searches or locations.
- Building job feeds, research datasets, or recruitment pipelines.
- Testing or backup when using no proxy first and falling back only when blocked.
Frequently Asked Questions
Do I need to enable proxy?
No. By default the actor uses no proxy. If Indeed returns 401/403, it will switch to datacenter then residential automatically.
What if residential proxy fails?
The actor retries up to 3 times with residential proxy. If it still fails, the run ends; you can try again or add proxy in input.
Can I use a single search URL?
Yes. Add one object to urls with your Indeed search URL.
Cautions
- Data is collected from publicly available Indeed pages only.
- You are responsible for compliance with applicable laws (e.g. privacy, data protection, terms of use).
- Use reasonable limits (e.g. maxRows) and respect Indeed’s load.
Support and Feedback
For bugs or feature requests, use the Apify actor’s support channel or repository issues.