šŸ”„Fast Indeed Job Scraper avatar
šŸ”„Fast Indeed Job Scraper

Pricing

Pay per usage

Go to Apify Store
šŸ”„Fast Indeed Job Scraper

šŸ”„Fast Indeed Job Scraper

Developed by

Shahid Irfan

Shahid Irfan

Maintained by Community

Meet the šŸ”„Fast Indeed Job Scraper, your high-speed solution for Indeed data. Target jobs with precision by selecting a country, using a keyword, or providing a direct URL. This actor is optimized for residential proxies; using them is essential for successful and reliable scrapes.

0.0 (0)

Pricing

Pay per usage

0

2

1

Last modified

4 hours ago

Apify Actor

Fast Indeed Job Scraper

A powerful and configurable web scraper for extracting job listings from Indeed.com. This actor efficiently collects job metadata and full descriptions, supporting various search parameters and customization options for production-scale data collection.

Features

  • Comprehensive Job Data: Extracts job titles, companies, locations, salaries, posting dates, and detailed descriptions.
  • Flexible Search Options: Accepts full search URLs or builds queries from keywords and locations.
  • Pagination Support: Automatically handles multiple pages of search results.
  • Customizable Scraping: Control concurrency, proxy usage, and session management.
  • Output to Dataset: Results are stored in Apify's dataset for easy export and integration.

Input Configuration

The actor accepts a JSON input object with the following parameters. All fields are optional, with sensible defaults applied.

Search Parameters

ParameterTypeDescriptionDefault
searchUrlstringFull Indeed search URL. If provided, keyword, location, and country are ignored.-
keywordstringSearch keywords for job queries.-
locationstringGeographic location filter.-
countrystringCountry-specific Indeed domain (e.g., "United States", "United Kingdom")."United States"
posted_datestringFilter by posting date (e.g., "1d", "7d", "30d" for days).-

Scraping Options

ParameterTypeDescriptionDefault
maxItemsnumberMaximum number of job listings to collect.100
collectDetailsbooleanWhether to fetch full job descriptions from detail pages.false
maxConcurrencynumberNumber of parallel requests to make.5
cookies / cookiesJsonobject/stringRequired for high-volume runs: session cookies that keep you logged in and reduce blocking.-
proxyConfigurationobjectProxy settings for requests.-

Output Schema

The actor outputs job listings to the default dataset. Each item is a JSON object with the following structure:

FieldTypeDescription
titlestringJob title
companystringCompany name
locationstringJob location
postedAtstringHuman-readable posting date
salarystringSalary information (if available)
description_htmlstringFull job description in HTML format
description_textstringPlain text version of the job description
urlstringDirect link to the job posting
sourcestringData source identifier ("indeed")
search_urlstringOriginal search URL where the job was found

Usage

Basic Example

To scrape software engineering jobs in remote locations:

{
"keyword": "software engineer",
"location": "Remote",
"maxItems": 100,
"collectDetails": true
}

Advanced Configuration

For large-scale scraping with proxies:

{
"searchUrl": "https://www.indeed.com/jobs?q=developer&l=New+York&fromage=7",
"maxItems": 500,
"collectDetails": true,
"maxConcurrency": 3,
"proxyConfiguration": {
"useApifyProxy": true,
"groups": ["RESIDENTIAL"]
}
}

Running the Actor

  1. Via Apify Console:

    • Navigate to the actor page
    • Configure input parameters
    • Click "Run" and monitor progress
  2. Via API:

    curl -X POST "https://api.apify.com/v2/acts/shahidirfan100~fast-indeed-job-scraper/runs" \
    -H "Authorization: Bearer YOUR_API_TOKEN" \
    -H "Content-Type: application/json" \
    -d '{"keyword": "data scientist", "location": "San Francisco"}'
  3. Via Apify CLI:

    $apify run shahidirfan100/fast-indeed-job-scraper --input '{"keyword": "marketing", "maxItems": 50}'

Configuration Tips

  • Optimizing Performance: Adjust maxConcurrency based on your proxy setup. Higher values speed up scraping but may trigger rate limits.
  • Handling Large Datasets: Provide valid cookies/cookiesJson for any run above ~50 jobs; Indeed blocks anonymous traffic quickly at scale.
  • Proxy Usage: Enable residential proxies for better success rates on large crawls.
  • Authentication: Provide cookies from a logged-in session to access premium features or reduce blocking.
  • Strict Locales: Markets such as South Africa, China, and UAE enforce heavier bot controls—keep maxConcurrency ≤ 4 and expect slower warm-up even with cookies.
  1. Install the free Cookie-Editor extension from the Chrome Web Store.
  2. Open https://www.indeed.com (or your target country domain) and sign in manually.
  3. Click the Cookie-Editor icon → Export → copy the JSON string.
  4. Paste that JSON into the actor input under cookiesJson.
  5. For large, long-running jobs, refresh cookies regularly to keep sessions valid.

Troubleshooting

Common Issues

  • Incomplete Results: Verify search parameters and increase maxItems if needed.
  • Rate Limiting: Reduce maxConcurrency or add delays between requests.
  • Blocking/CAPTCHA: Use high-quality proxies and valid session cookies.
  • No Results: Check if the search URL is valid and accessible.

Best Practices

  • Start with small test runs to validate configuration.
  • Monitor the actor's log for errors and adjust settings accordingly.
  • Use the dataset export features for data analysis and integration.

Support

For issues or feature requests, please contact support through the Apify platform.


This actor is designed for ethical web scraping. Always respect website terms of service and robots.txt files.