Herold.at Scraper - Austrian Yellow Pages
Pricing
from $2.00 / 1,000 serp listing scrapeds
Herold.at Scraper - Austrian Yellow Pages
Scrape business listings from herold.at, Austria's Yellow Pages with 360K+ businesses. Extract names, phone numbers, emails, addresses directly from search results. HTTP-only, low memory, competitive pricing.
Pricing
from $2.00 / 1,000 serp listing scrapeds
Rating
0.0
(0)
Developer

Alessandro Santamaria
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Herold.at Scraper
Scrape business listings from herold.at, Austria's Yellow Pages with 360K+ businesses.
Features
- Phone + email on SERP — Direct
tel:andmailto:links, no detail page needed - ~30 results per page with automatic pagination
- Austrian address parsing — 4-digit postal codes + city names
- Optional detail pages — Website URLs, opening hours, geo coordinates, company details
- Qwik JSON extraction — Structured data from herold.at's Qwik framework (rating, industries, founding year, keywords)
- Social media links — Facebook, Instagram, LinkedIn, Xing, YouTube via vCard + HTML parsing
- HTTP-only — Low memory (~128MB), fast execution
- Pay-per-result pricing via Apify billing events
Input
| Field | Type | Default | Description |
|---|---|---|---|
keyword | string | required | Business type (e.g., "Elektriker", "Restaurant") |
location | string | optional | City/region (e.g., "Wien", "Salzburg"). Empty = all Austria |
maxResults | number | 100 | Maximum results to scrape |
includeDetails | boolean | false | Fetch detail pages for website/hours/geo |
proxyConfiguration | object | Apify proxy | Proxy settings |
Output Fields
From SERP (always available)
name— Business namephone— Phone number (directly from SERP!)email— Email address (directly from SERP!)address— Full addresscity,postal_code— Address componentscategory— Business categorydetail_url— Link to detail pagelogo_url— Business logo image URL (from SERP listing)
From Detail Page (when includeDetails=true)
website— Business website URLopening_hours— Business hoursdescription— Business description (from Qwik JSON or meta tags)latitude,longitude— Geo coordinateslogo_url— Business logo image URL (higher quality from Qwik JSON)social_links— Social media profiles as{ facebook: "url", instagram: "url", ... }branches— Comma-separated industry names (primary + secondary industries)rating_score— Average rating (0-5 scale)rating_count— Number of reviewsfounding_year— Company founding yearcompany_register_id— Austrian company register number (Firmenbuchnummer)ksv_url— KSV1870 credit check linkkeywords— Comma-separated business keywords
Pricing Events
| Event | Price |
|---|---|
directory-start | $0.05 |
directory-serp-result | $0.003 |
directory-detail-result | $0.005 |
Enrichment add-ons
After the scrape completes, this actor can automatically trigger AI-powered extraction on every website found in the results. Each add-on runs as a separate actor and produces its own dataset.
Contact extraction
Extracts team member names, email addresses, phone numbers, positions, and departments from company websites. Powered by the Website Contact Extractor.
Enable it by setting enableContactExtraction: true and providing at least one LLM API key. The sub-actor run ID is stored in the key-value store as CONTACT_EXTRACTOR_RUN_ID.
Job listing extraction
Extracts open positions, job titles, locations, departments, and career page URLs from company websites. Powered by the Website Job Extractor.
Enable it by setting enableJobExtraction: true and providing at least one LLM API key. The sub-actor run ID is stored in the key-value store as JOB_EXTRACTOR_RUN_ID.
Browser fallback
Some company websites are built with JavaScript frameworks (React, Vue, Angular) that require a full browser to render. When enableBrowserFallback is set to true, the contact/job extractors will automatically re-scrape these sites with Playwright. This catches ~25% more sites but increases cost and runtime. Only applies when contact or job extraction is enabled.
LLM API keys
Both add-ons use LLMs to extract structured data. Provide one or more API keys. When multiple keys are provided, the system uses them in priority order with automatic fallback:
- Gemini (recommended) -- Best quality-to-cost ratio. Free tier includes 1M tokens/min. Get a key at Google AI Studio.
- Groq (optional) -- Ultra-fast inference. Get a key at Groq Console.
- OpenRouter (optional) -- Access to 100+ models. Get a key at OpenRouter.
One key is sufficient. With multiple keys, if the primary provider hits a rate limit, the system falls back to the next available provider automatically.
Support
Found a bug or have a feature request? Open an issue.