Stepstone Job Scraper
Pricing
from $2.49 / 1,000 results
The Stepstone Job Scraper is a lightweight actor for scraping job listings from Stepstone quickly and simply. For reliable data extraction, residential proxies are strongly advised.
Developer

Shahid Irfan
Stepstone Job Scraper
Extract comprehensive job data from Stepstone.de with ease. Collect job listings including titles, companies, locations, and descriptions at scale. Perfect for recruitment analytics, market research, and job market monitoring.
Features
- Comprehensive Data Extraction: Captures key job details such as title, company, location, posting date, job type, category, and salary information
- Flexible URL Handling: Supports starting from specific URLs or constructing searches based on keywords and categories
- Advanced Filtering: Includes date-based filtering to focus on recent postings (last 24 hours, 7 days, 30 days, or month)
- Detail Page Scraping: Optionally collects full job descriptions in both HTML and plain text formats
- Deduplication: Automatically removes duplicate job listings to ensure clean datasets
- Proxy Support: Integrates with proxy configurations for enhanced reliability and compliance
- Scalable Collection: Configurable limits on items and pages to manage data volume
- Error Handling: Includes retry mechanisms and human-like delays to mimic natural browsing behavior
Use Cases
Recruitment Analytics
Track job market trends and hiring patterns across industries. Analyze salary ranges, job types, and company preferences to inform recruitment strategies and talent acquisition decisions.
Market Research
Monitor job postings to understand industry demands and skill requirements. Identify emerging roles, popular locations, and compensation trends for comprehensive market intelligence.
Job Market Monitoring
Stay updated on new opportunities in specific sectors or regions. Set up automated monitoring for targeted job searches and receive notifications on relevant positions.
Competitive Intelligence
Analyze competitor hiring activities and job descriptions. Gain insights into company growth, expansion plans, and talent needs through their job postings.
Career Planning
Research job opportunities and salary expectations in your field. Compare positions across companies and locations to make informed career decisions.
Input Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| startUrl | String | No | - | Start from a specific Stepstone listing URL (e.g., https://www.stepstone.de/jobs/verwaltung). If provided, keyword/category filters are ignored. |
| startUrls | Array | No | - | An array of URLs to begin scraping. Useful for multiple starting points. |
| keyword | String | No | - | Job search keyword (e.g., 'software engineer', 'marketing'). Used to construct a search URL when startUrl is not provided. |
| category | String | No | "verwaltung" | Category for Stepstone URLs (e.g., 'verwaltung', 'engineering'). |
| location | String | No | - | Location filter (currently not supported in Stepstone URLs, reserved for future use). |
| datePosted | String | No | - | Filter jobs by posting date: "last_24h", "last_7d", "last_30d", "last_month". |
| collectDetails | Boolean | No | true | Whether to scrape detailed job descriptions from individual pages. |
| maxItems | Number | No | 100 | Maximum number of job listings to collect (0 for unlimited). |
| maxPages | Number | No | 20 | Maximum number of search result pages to process. |
| dedupe | Boolean | No | true | Enable deduplication to avoid collecting duplicate jobs. |
| proxyConfiguration | Object | No | - | Proxy settings for routing requests through Apify's proxy network. |
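The parameter rules in the table can be checked before a run is started. The following is a minimal sketch, not part of the actor: the helper name `build_run_input` and its keyword arguments are hypothetical, and only the input keys, defaults, and allowed `datePosted` values come from the table above.

```python
# Hypothetical helper: builds an input dictionary that follows the
# documented parameter table. It does not call Apify or Stepstone.
ALLOWED_DATE_POSTED = {"last_24h", "last_7d", "last_30d", "last_month"}


def build_run_input(keyword=None, category=None, start_url=None,
                    date_posted=None, max_items=100, max_pages=20,
                    collect_details=True, dedupe=True,
                    residential_proxy=False):
    if date_posted is not None and date_posted not in ALLOWED_DATE_POSTED:
        raise ValueError(f"datePosted must be one of {sorted(ALLOWED_DATE_POSTED)}")
    if max_items < 0:
        raise ValueError("maxItems must be 0 (unlimited) or a positive number")
    if max_pages < 1:
        raise ValueError("maxPages must be at least 1")

    run_input = {
        "maxItems": max_items,
        "maxPages": max_pages,
        "collectDetails": collect_details,
        "dedupe": dedupe,
    }
    if start_url:
        # Per the table, keyword/category are ignored when startUrl is set,
        # so they are left out to keep the input unambiguous.
        run_input["startUrl"] = start_url
    else:
        if keyword:
            run_input["keyword"] = keyword
        if category:
            run_input["category"] = category
    if date_posted:
        run_input["datePosted"] = date_posted
    if residential_proxy:
        run_input["proxyConfiguration"] = {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        }
    return run_input
```

The returned dictionary can be serialized with `json.dumps` and pasted into the actor's JSON input, or supplied as the run input when starting the actor programmatically.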
Output Data
Each item in the dataset contains:
| Field | Type | Description |
|---|---|---|
| title | String | The job position title |
| company | String | Name of the hiring company |
| location | String | Job location or region |
| date_posted | String | Date when the job was posted (YYYY-MM-DD format) |
| job_type | String | Employment type (e.g., Full-time, Part-time) |
| job_category | String | Job category or industry |
| salary | String | Salary information if available |
| url | String | Direct link to the job posting |
| description_html | String | Full job description in HTML format |
| description_text | String | Plain text version of the job description |
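For downstream processing, dataset items can be mapped onto a typed record. This is a sketch under assumptions: the `JobItem` class and `parse_item` function are illustrative names, and treating every field except `title`, `company`, `location`, and `url` as optional is a guess, since the table only marks `salary` as "if available".

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class JobItem:
    title: str
    company: str
    location: str
    url: str
    date_posted: Optional[date] = None
    job_type: Optional[str] = None
    job_category: Optional[str] = None
    salary: Optional[str] = None
    description_html: Optional[str] = None
    description_text: Optional[str] = None


def parse_item(raw: dict) -> JobItem:
    """Convert one raw dataset item (a dict) into a JobItem."""
    posted = raw.get("date_posted")
    return JobItem(
        title=raw.get("title", ""),
        company=raw.get("company", ""),
        location=raw.get("location", ""),
        url=raw.get("url", ""),
        # date_posted is documented as YYYY-MM-DD, which fromisoformat parses.
        date_posted=date.fromisoformat(posted) if posted else None,
        job_type=raw.get("job_type"),
        job_category=raw.get("job_category"),
        salary=raw.get("salary"),
        description_html=raw.get("description_html"),
        description_text=raw.get("description_text"),
    )
```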
Usage Examples
Basic Job Search
Extract jobs from a specific category:
{
  "category": "it",
  "maxItems": 50
}
Keyword-Based Search
Search for specific job types with date filtering:
{
  "keyword": "data scientist",
  "datePosted": "last_7d",
  "maxItems": 100
}
Large-Scale Collection
Comprehensive data collection across multiple categories:
{
  "startUrls": [
    "https://www.stepstone.de/jobs/it-jobs",
    "https://www.stepstone.de/jobs/engineering"
  ],
  "collectDetails": true,
  "maxPages": 50,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
Sample Output
{
  "title": "Software Engineer",
  "company": "Stepstone GmbH",
  "location": "Berlin, Germany",
  "date_posted": "2025-10-18",
  "job_type": "Full-time",
  "job_category": "Engineering",
  "salary": "65,000 EUR/year",
  "url": "https://www.stepstone.de/job/software-engineer",
  "description_html": "<div>Detailed HTML job description...</div>",
  "description_text": "Clean, readable job description text without HTML tags."
}
Tips for Best Results
Choose Working URLs
- Verify URLs are accessible and point to valid Stepstone job listings
- Start with popular categories like IT, engineering, or administration
- Test with different search parameters to find optimal starting points
Optimize Collection Size
- Start small for testing (20-50 results) to verify data quality
- Increase limits for production runs based on your needs
- Balance between comprehensive data and processing time
Use Date Filtering
- Apply date filters to focus on recent, relevant postings
- Combine with keywords for targeted job monitoring
- Filter by date to reduce dataset size and improve data freshness
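To double-check that a date-filtered run returned only recent postings, the `date_posted` field can be compared against a cutoff on the client side. This is a rough sketch: the function name `posted_within` and the day window are illustrative, and it assumes `date_posted` follows the documented YYYY-MM-DD format. It does not reproduce how the actor itself applies `datePosted`.

```python
from datetime import date, timedelta


def posted_within(items, days, today=None):
    """Return items whose date_posted falls within the last `days` days."""
    today = today or date.today()
    cutoff = today - timedelta(days=days)
    recent = []
    for item in items:
        raw = item.get("date_posted")
        if not raw:
            continue  # no date available, cannot confirm recency
        try:
            posted = date.fromisoformat(raw)
        except ValueError:
            continue  # skip values that are not YYYY-MM-DD
        if posted >= cutoff:
            recent.append(item)
    return recent
```

Passing `today` explicitly keeps the check reproducible, for example when re-validating an older dataset.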
Enable Proxy Support
- Use residential proxies for better success rates
- Configure proxy rotation for large-scale scraping
- Monitor proxy usage to manage costs effectively
Handle Deduplication
- Keep deduplication enabled to maintain clean datasets
- Review results for any remaining duplicates
- Adjust search parameters if duplicates persist
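Remaining duplicates can be reviewed or removed after a run with a small post-processing step. The sketch below keys on the `url` field, which is an assumption: the actor's own deduplication key is not documented here, and the name `dedupe_by_url` is illustrative.

```python
def dedupe_by_url(items):
    """Keep the first item seen for each URL; items without a URL are kept."""
    seen = set()
    unique = []
    for item in items:
        # Normalize lightly so a trailing slash does not hide a duplicate.
        key = (item.get("url") or "").strip().rstrip("/")
        if not key:
            unique.append(item)
            continue
        if key in seen:
            continue
        seen.add(key)
        unique.append(item)
    return unique
```

Comparing `len(items)` with `len(dedupe_by_url(items))` gives a quick count of how many duplicates slipped through.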
Proxy Configuration
For reliable results, residential proxies are recommended:
{
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
Integrations
Connect your data with:
- Google Sheets: Export for analysis and reporting
- Airtable: Build searchable job databases
- Slack: Get notifications on new job postings
- Webhooks: Send data to custom endpoints
- Make: Create automated job monitoring workflows
- Zapier: Trigger actions based on job data
Export Formats
Download data in multiple formats:
- JSON: For developers and APIs
- CSV: For spreadsheet analysis
- Excel: For business reporting
- XML: For system integrations
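If a JSON export needs to be reshaped locally, for example to keep only the short fields for a spreadsheet, a conversion along these lines may help. It is a sketch using only the Python standard library: the column selection in `FIELDS` and the function name `items_to_csv` are illustrative choices, not something the actor provides.

```python
import csv
import io

# Illustrative column selection: the long description fields are left out.
FIELDS = ["title", "company", "location", "date_posted",
          "job_type", "job_category", "salary", "url"]


def items_to_csv(items):
    """Serialize dataset items (dicts) to a CSV string with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for item in items:
        # Missing fields become empty cells; extra fields are dropped.
        writer.writerow({field: item.get(field, "") for field in FIELDS})
    return buf.getvalue()
```

Values that contain commas, such as a salary of "65,000 EUR/year", are quoted by the `csv` module so they stay in a single cell.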
Frequently Asked Questions
How many jobs can I collect?
You can collect all available jobs: set maxItems to 0 for unlimited collection and raise maxPages as needed. The practical limit depends on your proxy configuration and run duration.
Can I scrape multiple categories?
Yes, use the startUrls parameter with an array of category URLs to scrape multiple sections simultaneously.
What if job descriptions are missing?
Some jobs may not have detailed descriptions. Keep collectDetails enabled (the default) to get full descriptions where they are available.
How do I filter by location?
Location filtering is currently not supported by Stepstone's URL structure. Use keywords and categories for targeted searches.
Can I schedule regular scrapes?
Yes, use Apify's scheduling feature to run the actor automatically at set intervals.
What happens if the actor gets blocked?
Use residential proxies. The actor includes built-in retry mechanisms and human-like delays for reliability.
Support
For issues or feature requests, contact support through the Apify Console.
Legal Notice
This actor is designed for legitimate data collection purposes. Users are responsible for ensuring compliance with website terms of service and applicable laws. Use data responsibly and respect rate limits.