Pricing

from $2.00 / 1,000 results

Willhaben.at — Austria's Largest Classifieds

Scrape willhaben.at — Austria's largest classifieds platform across every section · listings from any pasted search URL · price & attribute fields per item. Incremental mode tracks new and changed listings across scheduled runs.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

Black Falcon Data

Actor stats

Bookmarked

Total users

Monthly active users

9 hours ago

Last modified

0.3.3 — 2026-05-03

Added — Structured salary fields for jobs section

jobSalaryMin — numeric min salary (mirrors the API value; falls back to description-parsed when the API ships nothing).
jobSalaryMax — numeric max salary, parsed from description ranges like "50.908 bis 55.772" when the API has no range.
jobSalaryCurrency — currency code (typically "EUR").
jobSalaryPeriod — normalized period ("hour" / "day" / "week" / "month" / "year"), distinct from the raw jobSalaryTimeFrame Willhaben ships in German.
The pre-existing jobSalary / jobSalaryTimeFrame / jobSalaryText fields are preserved unchanged for backward compatibility.
Salary parser ships with bounds (defaults: min €100, max €10M) and a 5× ratio cap on description-parsed maxes, so phone numbers and VAT ids in descriptions can't leak into salary values. (Same defense as willhaben-scraper v0.2.6.)

Fixed

contactName preserved when detail's contact is empty (parity with willhaben-scraper v0.2.5). When Willhaben ships { email: ... } without firstname/lastname, prior jobContactName is no longer null-blanked.

0.3.2 — 2026-05-03

Fixed — Critical

SERP page-1 failure now propagates in both jobs and attribute (immo / autos / marktplatz) tasks instead of silently returning an empty universe. A single 5xx on page 1 in incremental mode would have marked every prior-state listing as EXPIRED (universe corruption); runs now fail fast so state is preserved.
Retry loop no longer self-cancels. fetchWithRetry now builds a fresh AbortSignal per attempt — the prior single-signal pattern caused the first timeout to abort every subsequent retry instantly. Network-level errors (ECONNRESET, ETIMEDOUT, EAI_AGAIN, fetch wrappers) are now retryable, not just retryable HTTP statuses.
Hostname validation on searchUrl / startUrls. Pasted non-Willhaben URLs are now rejected with a clear error (searchUrl must be a willhaben.at URL) instead of silently parsing into a request that goes nowhere meaningful.

Fixed — Important

EXPIRED stubs no longer charged. Billing event now sizes by live items only (NEW/UPDATED/UNCHANGED/REAPPEARED); tombstones are operational data with no business value.
Run-footer headline split. Footer now reports X new/updated listings exported (live only) plus a separate 🪦 Y expired listings emitted as stubs line — no more "exported X new/updated" when half the dataset is tombstones.
buildExpiredStub is now typed. Adding a new OutputItem field would (correctly) cause a TS error rather than silently leak undefined into the dataset.
State-lock failures throw a plain Error instead of throw await Actor.fail(...) — the prior pattern interfered with the outer try/catch's releaseLock cleanup path.
immoAvailableDate now ISO-normalized, matching all other date fields.
validThrough and jobExpiryDate consistent. Hash input and output field now use the same normalized expiryDate so a presentation-only timestamp drift can't trigger a spurious UPDATED.
State-key auto-scope is section-aware. For non-jobs runs, jobs-* filters no longer enter the state partition; for non-immo runs, immoSubTypes doesn't enter; etc. Prevents accidental state-sharing across unrelated result sets.

Fixed — Nice-to-have

Default memory raised from 256 MB to 512 MB; max raised to 2048 MB. The 256 MB default OOMed on maxResults: 0 runs over large universes.
Dead applyExtraction removed.

Tests

7 new tests covering emission-policy toggles and full classification round-trip (prior-active missing → EXPIRED, firstSeenAt preservation, REAPPEARED detection). Total: 157 unit/integration tests, all green.

0.3.1 — 2026-05-03

Added — Emission policy controls (parity with willhaben-scraper)

emitUnchanged — when enabled, incremental runs also emit listings classified as UNCHANGED (no tracked content drift since last run). Default false — most pipelines only want NEW / UPDATED / REAPPEARED.
emitExpired — when enabled, listings tracked in the prior state but missing from the current run are emitted as EXPIRED stubs (timestamps + listingId only, no live data). Notifications skip these stubs (no title / apply link to render). Default false.
Closes the schema drift between this actor and willhaben-scraper, which has had these toggles since v0.2.0.

0.3.0 — 2026-05-03

Added — Multi-URL `startUrls` input

New startUrls input accepts an array of willhaben.at search URLs. Each URL becomes its own search task; results are merged round-robin across tasks before the global maxResults cap so every URL contributes to the output. Without this, a small maxResults would be filled entirely from URL #1's items even when the other URLs were also fetched (the bug user emmat reported against the jobs scraper).
All URLs in a single run must target the same section (jobs / immobilien / autos / marktplatz). Mixed-section runs are rejected with a clear error asking the user to split into separate runs.
Identical URLs are deduped by canonical form (sorted query params, trimmed trailing slash) so an accidentally-pasted duplicate doesn't fire two identical SERP pipelines.
The single-string searchUrl field still works (treated as a one-entry startUrls array) for backward compatibility with existing schedules.

Fixed — Structural (parity with willhaben-scraper v0.2.4)

Jobs URL filters now extracted from URL. A pasted /jobs/suche URL with ?location=Wien&region=900&employment_type=vollzeit previously dropped these params silently into extraParams (which the jobs API client doesn't read). They're now mapped onto the typed jobLocation / jobRegion / jobEmploymentMode fields so per-URL job filters actually take effect.
Vienna-local timestamps now carry the correct timezone offset. Naked firstPublishDate / PUBLISHED_String / expiryDate from the API used to pass through without normalization, mis-representing Vienna wall-clock values. Now appends +01:00 or +02:00 based on Europe/Vienna DST for the date in question.
EXPIRED stub items no longer sent to notifications. Per-job stubs only carry timestamps + listingId — Telegram/Discord/Slack would have rendered them as (untitled) with no link. Still appear in the dataset; only notification rendering is skipped.
Telegram messages over 4096 chars now split on blank-line (entry-block) boundaries, falling back to hard-cut only when a single entry exceeds the cap.
HTTP 429 is now retryable (was previously bubbled directly to the caller).
Retry delays now include ±20% jitter to break up thundering-herd retries when many parallel SERP pages hit the same upstream 5xx.
Actor.charge failures log at debug level instead of being silently swallowed — systematic billing breakage is now observable in dev/CI logs.
Empty/whitespace stateKey is now coerced to null so the automatic state partition fallback fires when the field is left blank in the UI.
Repost detection is now O(1) per current item via a pre-built content-hash → expired-prior-entry index. Previously linear scan over the full prior state for every item.

Tests

30 new tests (mergeTaskResults, computePerTaskBudget, searchTasks).
Total: 150 tests, all green.

0.2.1 — 2026-05-01

Changed — `stateKey` is now optional

When incremental mode is enabled, stateKey is no longer required. If omitted, a stable identifier is auto-generated from your search inputs (section, query, location, sub-type, price and job filters, plus any params from a pasted searchUrl) so different searches never share state.
Migration note: existing schedules with an explicit stateKey keep their prior state intact. Schedules that previously errored ("stateKey is required") will now succeed and start fresh state.

0.1.x — 2026-04-14

Added: descriptionHtml, descriptionMarkdown output fields (triple-format descriptions for RAG/LLM pipelines)
Added: contentHash output field (stable hash of content-identifying fields, used for change detection)

0.1.x — 2026-04-14

Added: cross-run repost detection (isRepost, repostOfId, repostDetectedAt)
Added: skipReposts input to exclude detected reposts from output

0.2.0 — 2026-03-30

Added full Jobs section support.

Jobs section: all willhaben.at job listings via section: "jobs"
Job filters: keyword, region, location, employment mode, position level, company type, time limit
23 job-specific output fields: company, salary, employment modes, remote work, contact info, apply URL, language skills
Detail enrichment for jobs: contact name/email, remote flag, language skills, expiry date
Jobs incremental mode: change tracking using salary, description and company as tracked fields
URL parsing: willhaben.at/jobs/suche?... URLs now supported in searchUrl input
Compact output includes job fields: jobCompany, jobSalaryText, jobEmploymentModes, jobRemote

0.1.0 — 2026-03-30

Initial release. Covers marktplatz, immobilien (all sub-types) and autos sections.

Keyword search with price filters (priceMin/priceMax)
Paste any willhaben.at search URL — filters extracted automatically
Immobilien sub-types: mietwohnungen, eigentumswohnungen, haus-kaufen, haus-mieten, büro-gewerbe
Detail enrichment: full description, contact info, richer section-specific fields
Incremental mode: free change tracking (NEW/UPDATED/UNCHANGED/EXPIRED)
Compact output mode for AI-agent and MCP workflows
Parallel SERP fetching (pages 2..N concurrent)

Willhaben.at Scraper - Austrian Classifieds

santamaria-automations/willhaben-at-scraper

Scrape classified listings from willhaben.at, Austria's #1 classifieds platform. Cars, real estate, jobs, electronics and more. Paste any search URL with all filters. HTTP-only, fast, pay-per-result.

NanoScrape

Willhaben.at Classifieds Scraper (Austria)

engaging_pyrite/willhaben-at-classifieds

Scrape Austrian classified ads from willhaben.at by search URL: title, price, location, exact publish date, seller, images. Export JSON/CSV/Excel.

lysum

Willhaben Scraper - Austrian Classifieds & Marketplace

studio-amba/willhaben-scraper

Scrape classified listings from Willhaben.at, Austria's largest marketplace. Search by keyword, category, and price range. Extract titles, prices, descriptions, images, location, and seller info. Covers classifieds, real estate, cars, and jobs. No login or cookies required.

Studio Amba

🇦🇹 willhaben Scraper - Austrian Marketplace Listings

benthepythondev/willhaben-scraper

willhaben Scraper to extract classified listings from willhaben.at, Austria's largest marketplace. Get title, price, location, postcode, district, description, images, seller type (private or dealer) and URL by keyword. For price research, market analysis, reselling and lead generation in Austria.

Ben

Willhaben Scraper - Preiswert Low-cost💲🔥🇦🇹

delectable_incubator/willhaben-scraper---preiswert-low-cost

Scrape listings from Willhaben.at 🛒🏠🚗💼🔎 with a powerful Austria classifieds scraper. Extract titles, categories, prices, locations, attributes, sellers, descriptions, images, and URLs. Ideal for market research, price tracking, competitor analysis, and scalable structured datasets 📊🚀

Prime Scrape

5.0

Willhaben Scraper 💰 $0.89/1K — Austria’s Largest Job Portal

blackfalcondata/willhaben-scraper

Scrape willhaben.at - Austria's largest job portal. Structured salary fields and Austrian VAT/UID numbers for B2B outreach. Incremental mode with NEW/UPDATED/EXPIRED/REAPPEARED + repost detection.

Black Falcon Data

Willhaben Scraper | Marketplace, Real Estate, Cars & Jobs

solidcode/willhaben-scraper

[💰 $0.85 / 1K] Scrape willhaben.at — Austria's largest classifieds marketplace. Extract listings with prices, descriptions, seller info, images, and section-specific details for marketplace goods, real estate, cars, and jobs. Search by URL or plain-text terms.

SolidCode

willhaben Austria Car Scraper

devilscrapes/willhaben-austria-cars

Scrape used-car listings from willhaben.at, Austria's #1 marketplace — price, make, model, year, mileage, fuel, transmission, power, body type, colour, seller, location, and photos. Export to JSON or CSV; optionally enrich each listing with its full description.

DevilScrapes

Willhaben.at Marketplace Scraper

automation-lab/willhaben-at-marketplace-scraper

Scrape public Willhaben.at marketplace listings with prices, locations, seller IDs, images, timestamps, and source search context.

Stas Persiianenko

Willhaben Property Details Scraper

ecomscrape/willhaben-property-details-scraper

Access Austria's largest property marketplace data with our Willhaben.at scraper. Extract detailed property listings, pricing, locations, and contact information from millions of real estate ads. Perfect for market analysis, price comparison, and real estate research across Austrian regions.

ecomscrape

Willhaben.at — Austria's Largest Classifieds

Changelog

0.3.3 — 2026-05-03