Stepstone Job Scraper ๐Ÿ”ฅ avatar

Stepstone Job Scraper ๐Ÿ”ฅ

Pricing

from $2.49 / 1,000 results

Go to Apify Store
Stepstone Job Scraper ๐Ÿ”ฅ

Stepstone Job Scraper ๐Ÿ”ฅ

Introducing the Stepstone Job Scraper, a lightweight actor for efficiently scraping job listings from Stepstone. Fast and simple. For best results and reliable data extraction, the use of residential proxies is strongly advised. Get the job data you need!

Pricing

from $2.49 / 1,000 results

Rating

4.6

(5)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

4

Bookmarked

78

Total users

21

Monthly active users

3 days ago

Last modified

Share

Stepstone Job Scraper

Extract comprehensive job data from Stepstone.de with ease. Collect job listings including titles, companies, locations, and descriptions at scale. Perfect for recruitment analytics, market research, and job market monitoring.

Features

  • Comprehensive Data Extraction โ€” Captures key job details such as title, company, location, posting date, job type, category, and salary information
  • Flexible URL Handling โ€” Supports starting from specific URLs or constructing searches based on keywords and categories
  • Advanced Filtering โ€” Includes date-based filtering to focus on recent postings (last 24 hours, 7 days, 30 days, or month)
  • Detail Page Scraping โ€” Optionally collects full job descriptions in both HTML and plain text formats
  • Deduplication โ€” Automatically removes duplicate job listings to ensure clean datasets
  • Proxy Support โ€” Integrates with proxy configurations for enhanced reliability and compliance
  • Scalable Collection โ€” Configurable limits on items and pages to manage data volume
  • Error Handling โ€” Includes retry mechanisms and human-like delays to mimic natural browsing behavior

Use Cases

Recruitment Analytics

Track job market trends and hiring patterns across industries. Analyze salary ranges, job types, and company preferences to inform recruitment strategies and talent acquisition decisions.

Market Research

Monitor job postings to understand industry demands and skill requirements. Identify emerging roles, popular locations, and compensation trends for comprehensive market intelligence.

Job Market Monitoring

Stay updated on new opportunities in specific sectors or regions. Set up automated monitoring for targeted job searches and receive notifications on relevant positions.

Competitive Intelligence

Analyze competitor hiring activities and job descriptions. Gain insights into company growth, expansion plans, and talent needs through their job postings.

Career Planning

Research job opportunities and salary expectations in your field. Compare positions across companies and locations to make informed career decisions.

Input Parameters

ParameterTypeRequiredDefaultDescription
startUrlStringNoโ€”Start from a specific Stepstone listing URL (e.g., https://www.stepstone.de/jobs/verwaltung). If provided, keyword/category filters are ignored.
startUrlsArrayNoโ€”An array of URLs to begin scraping. Useful for multiple starting points.
keywordStringNoโ€”Job search keyword (e.g., 'software engineer', 'marketing'). Used to construct a search URL when startUrl is not provided.
categoryStringNo"verwaltung"Category for Stepstone URLs (e.g., 'verwaltung', 'engineering').
locationStringNoโ€”Location filter (currently not supported in Stepstone URLs, reserved for future use).
datePostedStringNoโ€”Filter jobs by posting date: "last_24h", "last_7d", "last_30d", "last_month".
collectDetailsBooleanNotrueWhether to scrape detailed job descriptions from individual pages.
maxItemsNumberNo100Maximum number of job listings to collect (0 for unlimited).
maxPagesNumberNo20Maximum number of search result pages to process.
dedupeBooleanNotrueEnable deduplication to avoid collecting duplicate jobs.
proxyConfigurationObjectNoโ€”Proxy settings for routing requests through Apify's proxy network.

Output Data

Each item in the dataset contains:

FieldTypeDescription
titleStringThe job position title
companyStringName of the hiring company
locationStringJob location or region
date_postedStringDate when the job was posted (YYYY-MM-DD format)
job_typeStringEmployment type (e.g., Full-time, Part-time)
job_categoryStringJob category or industry
salaryStringSalary information if available
urlStringDirect link to the job posting
description_htmlStringFull job description in HTML format
description_textStringPlain text version of the job description

Usage Examples

Extract jobs from a specific category:

{
"category": "it",
"maxItems": 50
}

Search for specific job types with date filtering:

{
"keyword": "data scientist",
"datePosted": "last_7d",
"maxItems": 100
}

Large-Scale Collection

Comprehensive data collection across multiple categories:

{
"startUrls": [
"https://www.stepstone.de/jobs/it-jobs",
"https://www.stepstone.de/jobs/engineering"
],
"collectDetails": true,
"maxPages": 50,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

Sample Output

{
"title": "Software Engineer",
"company": "Stepstone GmbH",
"location": "Berlin, Germany",
"date_posted": "2025-10-18",
"job_type": "Full-time",
"job_category": "Engineering",
"salary": "65,000 EUR/year",
"url": "https://www.stepstone.de/job/software-engineer",
"description_html": "<div>Detailed HTML job description...</div>",
"description_text": "Clean, readable job description text without HTML tags."
}

Tips for Best Results

Choose Working URLs

  • Verify URLs are accessible and point to valid Stepstone job listings
  • Start with popular categories like IT, engineering, or administration
  • Test with different search parameters to find optimal starting points

Optimize Collection Size

  • Start small for testing (20-50 results) to verify data quality
  • Increase limits for production runs based on your needs
  • Balance between comprehensive data and processing time

Use Date Filtering

  • Apply date filters to focus on recent, relevant postings
  • Combine with keywords for targeted job monitoring
  • Reduces dataset size and improves data freshness

Enable Proxy Support

  • Use residential proxies for better success rates
  • Configure proxy rotation for large-scale scraping
  • Monitor proxy usage to manage costs effectively

Handle Deduplication

  • Keep deduplication enabled to maintain clean datasets
  • Review results for any remaining duplicates
  • Adjust search parameters if duplicates persist

Proxy Configuration

For reliable results, residential proxies are recommended:

{
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

Integrations

Connect your data with:

  • Google Sheets โ€” Export for analysis and reporting
  • Airtable โ€” Build searchable job databases
  • Slack โ€” Get notifications on new job postings
  • Webhooks โ€” Send data to custom endpoints
  • Make โ€” Create automated job monitoring workflows
  • Zapier โ€” Trigger actions based on job data

Export Formats

Download data in multiple formats:

  • JSON โ€” For developers and APIs
  • CSV โ€” For spreadsheet analysis
  • Excel โ€” For business reporting
  • XML โ€” For system integrations

Frequently Asked Questions

How many jobs can I collect?

You can collect all available jobs. The practical limit depends on your proxy configuration and run duration.

Can I scrape multiple categories?

Yes, use the startUrls parameter with an array of category URLs to scrape multiple sections simultaneously.

What if job descriptions are missing?

Some jobs may not have detailed descriptions. Enable collectDetails to ensure you get full information where available.

How do I filter by location?

Location filtering is currently not supported by Stepstone's URL structure. Use keywords and categories for targeted searches.

Can I schedule regular scrapes?

Yes, use Apify's scheduling feature to run the actor automatically at set intervals.

What happens if the actor gets blocked?

Use residential proxies and adjust delays. The actor includes built-in retry mechanisms for reliability.

Support

For issues or feature requests, contact support through the Apify Console.

Resources

This actor is designed for legitimate data collection purposes. Users are responsible for ensuring compliance with website terms of service and applicable laws. Use data responsibly and respect rate limits.