
Indeed Job Scraper
Pricing
Pay per usage
A simple Indeed job scraper that returns only the essential fields for each listing. It uses residential proxies and cookies to reduce the chance of being blocked, which makes it suited to collecting targeted job data without extra clutter.
Indeed Jobs Scraper (HTTP + Cheerio)
This Apify actor scrapes job listings from Indeed using fast HTTP requests and Cheerio for parsing, avoiding slow and resource-intensive headless browsers.
Features
- Scrapes Indeed job search results.
- Extracts job title, company, location, salary, post date, and full description.
- Supports providing a full search URL (recommended) or building one from keywords and location.
- Can scrape multiple search URLs in a single run.
- Handles pagination to collect multiple pages of results.
- Configurable concurrency and proxy settings to avoid blocks.
- Supports using browser cookies for authenticated sessions or to bypass CAPTCHAs.
- Saves results to the Apify dataset.
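The pagination feature can be pictured as generating one URL per results page. The following is a minimal sketch, not the actor's actual code: it assumes Indeed paginates through a `start` query parameter in steps of 10, and the names `page_urls` and `PAGE_SIZE` are hypothetical.

```python
from urllib.parse import parse_qsl, urlencode, urlparse, urlunparse

PAGE_SIZE = 10  # assumption: offset step between result pages


def page_urls(search_url: str, max_items: int, page_size: int = PAGE_SIZE) -> list[str]:
    """Return one URL per results page needed to cover max_items jobs."""
    parts = urlparse(search_url)
    # Drop any existing offset so that each page sets its own.
    query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True)
             if k != "start"]
    pages = -(-max_items // page_size)  # ceiling division
    urls = []
    for page in range(pages):
        # The first page carries no offset; later pages add start=10, 20, ...
        page_query = query + ([("start", str(page * page_size))] if page else [])
        urls.append(urlunparse(parts._replace(query=urlencode(page_query))))
    return urls


for url in page_urls("https://www.indeed.com/jobs?q=software+engineer&l=Remote&fromage=7", 25):
    print(url)
```

Under these assumptions, asking for 25 jobs yields three page URLs, and the first one is the original search URL unchanged.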
Input
The actor accepts the following input fields:
Search Parameters
- Search URL (recommended) (`searchUrl`): A full Indeed search URL. If provided, the keyword and location fields are ignored.
  - Example: `https://www.indeed.com/jobs?q=software+engineer&l=Remote&fromage=7`
- Start URLs (`startUrls`): An optional list of multiple search URLs to crawl.
- Keyword (`keyword`): The job title or keywords to search for (e.g., 'software engineer').
- Location (`location`): The geographic location to filter jobs by (e.g., 'New York', 'Remote').
- Date Posted (`posted_date`): Filter jobs by when they were posted (`Last 24 hours`, `Last 7 days`, `Last 30 days`).
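As an illustration of how a search URL might be built from the keyword, location, and date fields, here is a minimal sketch. The `q`, `l`, and `fromage` parameter names come from the example URL above; the mapping from the Date Posted options to `fromage` values and the function name `build_search_url` are assumptions, not the actor's confirmed behavior.

```python
from urllib.parse import urlencode

# Assumed mapping from the Date Posted options to Indeed's `fromage` (age in days).
FROMAGE_DAYS = {"Last 24 hours": 1, "Last 7 days": 7, "Last 30 days": 30}


def build_search_url(keyword, location=None, posted_date=None,
                     base="https://www.indeed.com/jobs"):
    """Build an Indeed search URL from the keyword-style input fields."""
    params = {"q": keyword}
    if location:
        params["l"] = location
    if posted_date:
        params["fromage"] = FROMAGE_DAYS[posted_date]
    # urlencode encodes spaces as '+', matching the example URL's style.
    return f"{base}?{urlencode(params)}"


print(build_search_url("software engineer", "Remote", "Last 7 days"))
```

With these inputs the sketch reproduces the example URL shown for `searchUrl`.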
Scraping Options
- Max Jobs (`maxItems`): The maximum number of jobs to scrape.
- Collect job detail pages (`collectDetails`): If enabled, the actor will visit each job's detail page to extract the full description.
Advanced Options
- Proxy configuration (`proxyConfiguration`): Configure Apify Proxy settings. Residential proxies are recommended if you encounter blocking.
- Max concurrency (`maxConcurrency`): The number of parallel requests.
- Cookies (`cookies`, `cookiesJson`): Provide cookies to handle sessions or bypass bot detection.
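To show how the input fields fit together, the sketch below assembles an input object for a single search. The field names are the ones documented above; the default values, the helper name `make_input`, and the `useApifyProxy` / `apifyProxyGroups` shape of the proxy object (the usual Apify convention, assumed to apply here) are illustrative. `startUrls` and the cookie fields are left out because their exact formats are not specified in this README.

```python
import json


def make_input(search_url=None, keyword=None, location=None,
               max_items=50, collect_details=False,
               max_concurrency=5, residential=False):
    """Assemble an input object using the documented field names."""
    if not search_url and not keyword:
        raise ValueError("provide either searchUrl or keyword")
    actor_input = {
        "maxItems": max_items,
        "collectDetails": collect_details,
        "maxConcurrency": max_concurrency,
        "proxyConfiguration": {"useApifyProxy": True},  # assumed shape
    }
    if residential:
        actor_input["proxyConfiguration"]["apifyProxyGroups"] = ["RESIDENTIAL"]
    if search_url:
        # searchUrl takes precedence; keyword and location would be ignored.
        actor_input["searchUrl"] = search_url
    else:
        actor_input["keyword"] = keyword
        if location:
            actor_input["location"] = location
    return actor_input


print(json.dumps(make_input(keyword="software engineer", location="Remote",
                            residential=True), indent=2))
```

The resulting JSON could be pasted into the actor's input editor or passed through an API client; which of the two is more convenient depends on how the runs are scheduled.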
Output
The actor outputs a dataset of job listings with the following fields:
- `title`: The job title.
- `company`: The company name.
- `location`: The job location.
- `postedAt`: A string describing when the job was posted (e.g., "Posted 3 days ago").
- `salary`: The estimated salary range, if available.
- `description_html`: The job description in HTML format.
- `description_text`: The job description in plain text.
- `url`: The URL of the job posting.
- `source`: The source of the job listing (always 'indeed').
- `search_url`: The search result page URL where the job was found.
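For downstream processing, the output fields can be modeled as a small record type. This is a sketch only: the field names come from the list above, while treating `salary` and the two description fields as optional is an assumption (salary is listed "if available", and descriptions presumably depend on `collectDetails`). The names `JobListing` and `to_listing`, and the sample item, are hypothetical.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class JobListing:
    title: str
    company: str
    location: str
    postedAt: str
    url: str
    source: str
    search_url: str
    salary: Optional[str] = None            # assumed optional
    description_html: Optional[str] = None  # assumed present only with details
    description_text: Optional[str] = None


def to_listing(item: dict) -> JobListing:
    """Convert one dataset item to a JobListing, ignoring unknown keys."""
    known = {f.name for f in fields(JobListing)}
    return JobListing(**{k: v for k, v in item.items() if k in known})


# Made-up sample item for illustration; not real scraped data.
sample = {
    "title": "Software Engineer",
    "company": "Example Corp",
    "location": "Remote",
    "postedAt": "Posted 3 days ago",
    "url": "https://www.indeed.com/viewjob?jk=example",
    "source": "indeed",
    "search_url": "https://www.indeed.com/jobs?q=software+engineer&l=Remote&fromage=7",
}
print(to_listing(sample))
```

A typed record like this makes missing required fields fail early (as a `TypeError`) instead of surfacing later in analysis code.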