Indeed Comprehensive Crawler avatar

Indeed Comprehensive Crawler

Try for free

2 hours trial then $10.00/month - No credit card required now

Go to Store
Indeed Comprehensive Crawler

Indeed Comprehensive Crawler

memo23/apify-indeed
Try for free

2 hours trial then $10.00/month - No credit card required now

Unearth every job gem on Indeed with laser precision. Our advanced scraper dives deep, extracting rich data from listings, reviews, and company profiles. Empower your recruitment strategy or market analysis with unparalleled insights, all at the speed of automation.

Overview

The Ultimate Indeed Scraper is a powerful tool designed for the Apify platform, tailored to bypass protections and efficiently extract job listing information directly from Indeed. It's engineered to navigate and parse data from Indeed's job listings and company profiles, offering users a seamless experience in collecting valuable job market insights.

Features

  • Customizable Searches: Input URLs directly from Indeed to target specific job searches or categories.
  • Comprehensive Data Extraction: Gathers detailed information from job listings, including job titles, company names, locations, salary estimates, job descriptions, and more.
  • Company Intelligence:
    • CEO information with photos
    • Company founding year and revenue
    • Employee size range and industry classification
    • Active job categories and locations

How to Use

  1. Set Up: Ensure you have an Apify account and access to the Indeed Scraper Browser Automation actor.
  2. Input Configuration:
    • Start URLs: Enter your target Indeed job listing URLs. For example: https://de.indeed.com/jobs?q=medizinische+Fachangestellte&sc=0bf%3Aexrec()%2Ck
    • Max jobs to scrape (optional): Set the maximum number of job listings to scrape. Default is 20.
    • Enable company details scraping (new option)
  3. Proxy Configuration: Configure proxy settings for enhanced reliability and to avoid IP blocks.
  4. Run the Scraper: Launch the scraper on the Apify platform.
  5. Data Collection: Extracted data will be available in your preferred format (JSON, CSV, etc.).

Input Example

1{
2    "startUrls": [
3        { "url": "https://de.indeed.com/jobs?q=medizinische+Fachangestellte&sc=0bf%3Aexrec()%2Ck" }
4    ],
5    "includeCompanyDetails": true,
6    "maxJobsToScrape": 20,
7    "proxyConfiguration": {
8        "useApifyProxy": true
9    }
10}

Output Structure

The scraper's output includes detailed information for each job listing. Here's a sample output with explanations for each field:

1{
2  "source_link": "",  // Original source link if the job is aggregated from another site
3  "adBlob": "...",  // Encoded string containing additional job information
4  "adId": "433403349",  // Unique identifier for the advertisement
5  "additionalLocationLink": {
6    "label": "+3 Orte",  // Label for additional locations
7    "url": "..."  // URL to view jobs in additional locations
8  },
9  "advn": "625754421564269",  // Advertiser identification number
10  "company": "Ober Scharrer Gruppe GmbH'",  // Name of the company offering the job
11  "companyRating": 0,  // Company rating on Indeed (if available)
12  "companyReviewCount": 0,  // Number of company reviews on Indeed
13  "createDate": 1721054703000,  // Timestamp of when the job was created
14  "displayTitle": "MFA / Medizinische Fachangestellte (w/m/d) für den OP",  // Job title as displayed
15  "formattedLocation": "76532 Baden-Baden",  // Formatted job location
16  "formattedRelativeTime": "vor 30+ Tagen",  // Relative time since job was posted
17  "jobkey": "ea2f2dbc75f8a438",  // Unique identifier for the job listing
18  "jobTypes": [  // Types of employment offered
19    "Vollzeit",
20    "Teilzeit"
21  ],
22  "locationCount": 4,  // Number of locations this job is available in
23  "pubDate": 1721019600000,  // Timestamp of when the job was published
24  "salarySnippet": {  // Salary information (if available)
25    "currency": "EUR",
26    "salaryTextFormatted": false
27  },
28  "snippet": "...",  // Brief excerpt or summary of the job description
29  "title": "MFA / Medizinische Fachangestellte (w/m/d) für den OP"  // Job title
30}

Field Explanations

  • source_link: If the job is aggregated from another site, this field contains the original link.
  • adBlob: An encoded string that may contain additional job information.
  • adId: A unique identifier for the job advertisement.
  • additionalLocationLink: Information about other locations where this job is available.
  • advn: The advertiser's identification number.
  • company: The name of the company offering the job.
  • companyRating: The company's rating on Indeed, if available.
  • companyReviewCount: The number of reviews the company has on Indeed.
  • createDate: The timestamp when the job listing was created.
  • displayTitle: The job title as it's displayed on the listing.
  • formattedLocation: The job's location in a formatted string.
  • formattedRelativeTime: How long ago the job was posted, in a human-readable format.
  • jobkey: A unique identifier for the specific job listing.
  • jobTypes: An array of job types (e.g., full-time, part-time) offered for this position.
  • locationCount: The number of locations where this job is available.
  • pubDate: The timestamp when the job was published.
  • salarySnippet: Information about the salary, if provided.
  • snippet: A brief excerpt or summary of the job description.
  • title: The title of the job position.

📋 Enhanced Output Structure

Company Details Output (New Section)

1"companyDetails": {
2    "aboutSectionViewModel": {
3        "aboutCeo": {
4            "name": "Sophie Bellon",
5            "photoUrls": {
6                "48px": "https://...photo48.jpg",
7                "96px": "https://...photo96.jpg",
8                "512px": "https://...photo512.jpg"
9            }
10        },
11        "aboutCompany": {
12            "founded": 1966,
13            "revenue": "RRv1_OVER_10B",
14            "employeeRange": "ERv1_10000_PLUS",
15            "industry": "Catering & Verpflegungsdienstleistungen",
16            "description": "Die Sodexo Group...",
17            "headquarters": "Issy-les-Moulineaux"
18        }
19    },
20    "jobsSectionViewModel": {
21        "totalJobCount": 65,
22        "jobCategories": [
23            {
24                "displayName": "Reinigungsdienste",
25                "jobCount": 47,
26                "key": "sanitation"
27            }
28        ],
29        "activeListings": [
30            {
31                "title": "Reinigungskraft",
32                "location": "Wien, W",
33                "jobTypes": ["Teilzeit"],
34                "postDate": "vor 30+ Tagen"
35            }
36        ]
37    }
38}

New Output Fields Explanation

  • CEO Details:
    • Name and multiple resolution photos
    • Executive leadership insights
  • Company Metadata:
    • Founding year and revenue bracket
    • Employee count range and industry classification
    • Physical headquarters location
  • Organizational Insights:
    • Job distribution by category
    • Geographic presence analysis
    • Active job postings catalog

Benefits (Enhanced)

  • Competitive Analysis: Compare company hiring patterns and organizational structures
  • Talent Acquisition: Identify companies with high hiring activity in specific sectors
  • Market Research: Analyze industry distributions and company growth indicators
  • Leadership Networking: Access CEO and executive team information

Support

Additional Services

Developer
Maintained by Community

Actor Metrics

  • 11 monthly users

  • 4 bookmarks

  • >99% runs succeeded

  • 0.66 hours response time

  • Created in Jan 2024

  • Modified 9 hours ago