Stepstone Jobs Scraper avatar

Stepstone Jobs Scraper

Pricing

from $3.99 / 1,000 results

Go to Apify Store
Stepstone Jobs Scraper

Stepstone Jobs Scraper

Pricing

from $3.99 / 1,000 results

Rating

0.0

(0)

Developer

ScrapePilot

ScrapePilot

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

πŸ‡©πŸ‡ͺ✨ StepStone Jobs Scraper

Pull fresh job listings from StepStone.de in a clean, structured format β€” perfect for recruiters, job seekers, market researchers, and data teams.

Just paste a StepStone search URL (or a bunch of them), pick how many jobs you want, and let the Actor do the heavy lifting while you focus on the good stuff. πŸ’Ό


🌟 Why Choose Us?

  • 🎯 Bulk-friendly β€” drop in as many StepStone search URLs as you like; the Actor paginates through each.
  • πŸͺ„ Smart proxy fallback β€” starts direct, falls back to datacenter, then residential (Germany) if Akamai pushes back, and sticks with residential for the rest of the run.
  • πŸ“‘ Live results β€” every job is pushed to your dataset as soon as it's scraped, so you never lose work to a mid-run hiccup.
  • 🧠 Akamai-aware β€” uses browser-impersonating TLS fingerprints (curl_cffi) so StepStone treats requests like a real Chrome browser.
  • πŸ‡©πŸ‡ͺ EU geo-aware β€” StepStone geo-blocks non-EU traffic, so the Actor auto-routes through a German residential exit when needed.

✨ Key Features

  • πŸ”— Bulk URL input (searchUrls)
  • πŸ”’ Configurable maxItems (1 – 1000)
  • 🌐 Apify Proxy integration with automatic fallback ladder (direct β†’ datacenter β†’ residential πŸ‡©πŸ‡ͺ)
  • πŸ” Per-page retries with exit-IP rotation on retryable HTTP errors (403/408/425/429/5xx)
  • πŸ“¦ Rich, normalized output β€” 35+ fields per job (matches the StepStone unified result schema)
  • πŸͺͺ Anonymous identity bootstrapping (extracts userHashId from the listing HTML automatically)

πŸ“₯ Input

{
"searchUrls": [
{ "url": "https://www.stepstone.de/jobs/ai?searchOrigin=Homepage_top-search" },
{ "url": "https://www.stepstone.de/jobs/data-engineer/in-berlin" }
],
"maxItems": 200,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"apifyProxyCountry": "DE"
}
}
FieldTypeRequiredDescription
searchUrlsarray of { url }βœ…One or more StepStone listing URLs.
maxItemsinteger❌Total jobs to collect across all URLs. Default 200, max 1000.
proxyConfigurationobject❌Apify Proxy config. Recommended: RESIDENTIAL + country DE.

πŸ“€ Output

Each row in the dataset matches the StepStone unified result schema. Example:

{
"searchUrl": "https://www.stepstone.de/jobs/ai?searchOrigin=Homepage_top-search",
"id": 14030765,
"title": "Junior Software Engineer AI (w|m|d)",
"companyName": "ADAC",
"companyUrl": "https://www.stepstone.de/cmp/de/adac-173906/jobs",
"location": "MΓΌnchen",
"datePosted": "2026-05-18T09:21:06+02:00",
"salary": "",
"unifiedSalary": {
"min": null, "max": null, "currency": null, "period": null,
"salaryAvailable": true, "salaryType": null
},
"workFromHome": "2",
"url": "/stellenangebote--Junior-Software-Engineer-AI-wmd-Muenchen-ADAC--14030765-inline.html",
"jobUrlAbsolute": "https://www.stepstone.de/stellenangebote--Junior-Software-Engineer-AI-wmd-Muenchen-ADAC--14030765-inline.html",
"skills": [],
"textSnippet": "Du setzt AI- und Agent-Use-Cases gemÀß fachlichen und technischen Vorgaben um...",
"isHighlighted": false,
"isSponsored": false,
"isTopJob": false
}

Every field from the StepStone unifiedResultlist response is preserved (labels, partnership, metaData, harmonisedId, etc.), plus a derived jobUrlAbsolute for convenience.


πŸš€ How to Use (Apify Console)

  1. Log in at console.apify.com β†’ Actors.
  2. Open StepStone Jobs Scraper.
  3. Paste one or more StepStone search URLs into Search URLs.
  4. Pick a max items budget (e.g. 200).
  5. Leave the proxy on Apify Proxy β†’ Residential β†’ DE (recommended) or pick your own.
  6. Click Start and watch the live log roll. 🍿
  7. When the run finishes, open the Output tab β€” export to JSON / CSV / XLSX.

πŸ€– Use via API

curl -X POST "https://api.apify.com/v2/acts/<ACTOR_ID>/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"searchUrls": [{ "url": "https://www.stepstone.de/jobs/ai" }],
"maxItems": 100,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"apifyProxyCountry": "DE"
}
}'

🎯 Best Use Cases

  • πŸ“Š Building a German job-market dashboard
  • πŸ§‘β€πŸ’Ό Sourcing for recruiters & talent agencies
  • πŸ› οΈ Feeding ATS / CRM tooling with fresh listings
  • πŸ”¬ Academic / market research on hiring trends in DACH

πŸ’Έ Pricing

This Actor uses the Pay-Per-Result model (Apify PPE):

EventWhat it bills for
apify-actor-startRun startup (synthetic β€” billed automatically)
apify-default-dataset-itemEach scraped job pushed to the default dataset

Configure exact USD amounts in the Console Monetization tab. The Actor stops gracefully when the user's spend limit is reached β€” no crashes, no half-runs.


❓ FAQ

Q: Why does StepStone block my requests? A: StepStone is fronted by Akamai Bot Manager which geo-blocks non-EU traffic. The Actor handles this automatically by falling back to a German residential proxy when it sees a 403.

Q: Can I use my own proxy? A: Yes β€” set proxyConfiguration to your custom URL list in the input. The Actor still applies its escalation logic if your proxy gets blocked.

Q: How fresh is the data? A: It's whatever StepStone shows on their search results page at the time of the run β€” i.e. real-time as of the request.

Q: What's userHashId? A: An anonymous client token that StepStone embeds in its listing HTML. The Actor extracts it on cold-start so its API calls look identical to a real browser session.


πŸ›Ÿ Support & Feedback

Found a bug? Have a feature idea? Drop a line:

πŸ“§ dev.scraperengine@gmail.com


  • Data is collected only from publicly available StepStone listing pages.
  • You β€” the user β€” are responsible for compliance with GDPR, StepStone's Terms of Service, and your local laws.
  • Be a good citizen: respect rate limits and don't republish scraped data in ways that violate the source's terms.