Bundesagentur fΓΌr Arbeit Scraper avatar

Bundesagentur fΓΌr Arbeit Scraper

Pricing

from $4.99 / 1,000 results

Go to Apify Store
Bundesagentur fΓΌr Arbeit Scraper

Bundesagentur fΓΌr Arbeit Scraper

πŸ‡©πŸ‡ͺ Bundesagentur fΓΌr Arbeit Scraper extracts job listings & key details from Arbeitsagentur pages automatically. ⚑ Save time, streamline recruiting and market research. βœ… Perfect for HR, staffing, and B2B analytics.

Pricing

from $4.99 / 1,000 results

Rating

0.0

(0)

Developer

API Empire

API Empire

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

6 days ago

Last modified

Share

πŸ‡©πŸ‡ͺ Bundesagentur fΓΌr Arbeit Job Scraper

πŸš€ The fastest, most complete Arbeitsagentur.de job scraper on Apify. Pull thousands of fresh German-language job postings β€” title, employer, salary notes, contract type, contact emails, full description, application URL and 30+ other fields β€” straight into a clean dataset.


πŸ’‘ Why Choose This Scraper?

  • ⚑ Blazing fast β€” async, concurrent fetching of search pages and job details simultaneously.
  • 🎯 Bulk input β€” feed any number of Arbeitsagentur.de search URLs and get a single deduped dataset back.
  • πŸ›°οΈ Smart proxy fallback β€” starts direct, auto-escalates to Apify datacenter, then residential proxies on rate limits, then sticks with the upgraded tier.
  • πŸ’Ύ Live saving β€” every job is pushed to the dataset the moment it's ready. A crash mid-run never costs you data.
  • πŸ›‘οΈ Production-grade error handling β€” retries with exponential backoff on every external call.
  • πŸ“§ Contact email extraction β€” automatically pulls application emails out of job descriptions.
  • πŸ‡ͺπŸ‡Ί DACH coverage β€” postings from Germany, Austria, and Switzerland whenever Arbeitsagentur.de surfaces them.

🎯 Key Features

FeatureDescription
πŸ”— Bulk URL inputDrop any number of Arbeitsagentur.de search URLs
πŸ”Ž Keyword searchOptional position + location fallback inputs
πŸ›°οΈ Adaptive proxynone β†’ datacenter β†’ residential (3 retries) escalation
πŸ“Š Rich output38 normalized fields per job
πŸ’Ύ Live persistencepush_data after every record β€” no data loss on crash
πŸ” Retry policy3 attempts with exponential backoff + jitter
πŸ“§ Email extractionRegex-pulled application emails from descriptions
πŸ†” DeduplicationBy Arbeitsagentur reference number across queries

πŸ“₯ Input

{
"startUrls": [
{ "url": "https://www.arbeitsagentur.de/jobsuche/suche?angebotsart=1&was=Softwareentwickler%2Fin" }
],
"position": "Softwareentwickler/in",
"location": "Berlin",
"maxItems": 100,
"proxyConfiguration": { "useApifyProxy": false }
}
FieldTypeRequiredDescription
πŸ”— startUrlsarrayβœ…List of Arbeitsagentur.de job search URLs
πŸ’Ό positionstringβ›”Keyword fallback when no URL is supplied
πŸ“ locationstringβ›”German city, region, or postal code
🎯 maxItemsintegerβ›”Total result cap across all queries (default 100)
πŸ›°οΈ proxyConfigurationobjectβ›”Defaults to no proxy; auto-falls back on blocks

πŸ“€ Output

Each dataset row looks like:

{
"title": "Softwareentwickler/in",
"mainJob": "Softwareentwickler/in",
"descriptionText": "LOCATION: Villach, AT, 9500 REQ ID: 198751 ...",
"jobType": "ARBEIT",
"referenceNumber": "11949-17224369-S",
"publishingStartDate": "2026-05-15",
"firstPublicationDate": "2026-05-15",
"lastUpdatedAt": "2026-05-15T17:04:43.169",
"contractDuration": "UNBEFRISTET",
"startDate": "2026-05-16",
"salaryNote": "KEINE_ANGABEN",
"employerName": "Lam Research AG",
"employerWebsite": null,
"employerIndustry": null,
"location": "9500 Villach-Lind, KAERNTEN, OESTERREICH",
"isFullTime": true,
"isWorkFromHome": false,
"externalURL": "https://jobs.ams.at/public/emps/jobs/...",
"isExternal": true,
"applicationEmail": null,
"url": "https://www.arbeitsagentur.de/jobsuche/jobdetail/11949-17224369-S"
}

Full list of returned fields:

title, mainJob, descriptionText, jobType, referenceNumber, publishingStartDate, firstPublicationDate, lastUpdatedAt, contractDuration, contractDurationLimitedTo, contractDurationMonths, startDate, entryPeriod, salaryNote, employerName, employerCustomerNumberHash, employerWebsite, employerIndustry, employerFirmSize, location, employerAddress, isFullTime, isMarginalEmployment, isPartTimeMorning, isPartTimeAfternoon, isPartTimeEvening, isPartTimeFlexible, isWorkFromHome, nightOrWeekendShift, isDisabilityRequired, isPrivateAgency, isTemporaryEmployment, externalURL, isExternal, applicationEmail, allianzpartnerName, allianzpartnerUrl, url.


πŸš€ How to Use (Apify Console)

  1. πŸ” Log in at https://console.apify.com β†’ Actors.
  2. πŸ”Ž Open this actor (Bundesagentur fΓΌr Arbeit Job Scraper).
  3. βš™οΈ Configure inputs:
    • Paste one or more Arbeitsagentur.de search URLs, or
    • Provide a position keyword + optional location.
    • Set maxItems (default: 100).
  4. ▢️ Click Start.
  5. πŸ“œ Watch the live log β€” every saved job appears in real time.
  6. πŸ“Š When the run finishes, open the Output tab.
  7. πŸ“₯ Export as JSON, CSV, XLSX, HTML, RSS or XML.

πŸ€– Use via API / curl

Run synchronously and stream the dataset back:

curl -X POST "https://api.apify.com/v2/acts/<USER>~bundesagentur-fur-arbeit-scraper/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"startUrls": [
{ "url": "https://www.arbeitsagentur.de/jobsuche/suche?angebotsart=1&was=Softwareentwickler%2Fin" }
],
"maxItems": 50
}'

Run asynchronously and poll:

curl -X POST "https://api.apify.com/v2/acts/<USER>~bundesagentur-fur-arbeit-scraper/runs?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{"startUrls":[{"url":"https://www.arbeitsagentur.de/jobsuche/suche?angebotsart=1&was=Pflegekraft"}],"maxItems":200}'

πŸ’Ό Best Use Cases

  • πŸ§‘β€πŸ’Ό Recruiters β€” track open vacancies in your sector and city
  • 🏒 HR Tech / ATS vendors β€” enrich your job feed with German listings
  • πŸ“Š Market research β€” analyse the German labor market by region, industry, contract type
  • πŸ€– Job aggregators β€” build vertical job boards on top of the official German source
  • πŸ“° Journalists / analysts β€” quote real numbers on supply and demand

πŸ’° Pricing

This actor is on pay-per-event (PPE).

EventDescriptionWhy
πŸš€ apify-actor-startOne charge per run startCovers warm-up compute
πŸ“„ apify-default-dataset-itemOne charge per saved job postingPay only for results you actually receive

Synthetic events are configured in the Apify Console. You only pay for successful, saved results β€” no data, no charge.


❓ Frequently Asked Questions

Q: Where does the data come from? A: The official Arbeitsagentur.de public job search API (the same one powering the website's search and detail pages).

Q: Do I need a proxy? A: No. The actor defaults to a direct connection. If the upstream rate-limits us, the proxy strategy auto-escalates to Apify datacenter, then residential proxies β€” no input needed.

Q: How fresh is the data? A: It's a live API call on every run β€” what you get is what Arbeitsagentur.de is serving right now.

Q: How many results can I scrape? A: As many as Arbeitsagentur.de exposes β€” set maxItems to the upper bound you want.

Q: Are application emails always available? A: No β€” only when the employer included a contact email inside the public description text. We regex-extract whenever present.

Q: Why are some jobs from Austria? A: Arbeitsagentur.de partners with AMS (Austria) and shows DACH-region postings. We expose the allianzpartnerName field so you can filter.


  • 🌐 All data comes from publicly available endpoints.
  • βš–οΈ End users are responsible for compliance with GDPR, the target site's ToS, and any local regulations.
  • 🀝 The actor honors reasonable rate limits and uses exponential backoff to be a good API citizen.

πŸ“¬ Support & Feedback

Spotted a bug? Need a new field? Open an issue from the Apify Console β€” we read every report. πŸ™Œ


Built with ❀️ on Apify · 2026