Helgoland Spirits Scraper
Scrape spirits, wine, and beverage products from Onlineshop Helgoland. Extract prices, stock levels, bottle sizes, EANs, distillery details, descriptions, and images in structured JSON. Supports category-based and direct URL scraping.
Pricing
Pay per usage
Developer
ScrapySpider
Maintained by Community
What does Onlineshop Helgoland Spirits Scraper do?
This Actor scrapes spirits, wine, and beverage products from onlineshop-helgoland.de — one of Germany's unique duty-free island shops. It crawls listing pages across 35 categories (Whisky, Rum, Gin, Vodka, Wines & more), follows every product to its detail page, and outputs a structured JSON record per product — including prices, availability, bottle size, distillery, product info box data, description, and images.
Run it on the Apify platform for automatic scheduling, proxy rotation, cloud storage, and API/webhook integrations. Try it directly from the Apify Console input form — no coding required.
Why use this Actor?
- Price monitoring — track current and crossed-out list prices across the full catalog
- Inventory tracking — know which products are in stock and how many units remain
- Catalog enrichment — export structured product data (EAN, distillery, bottle size, age) for PIM or e-commerce systems
- Market research — analyse duty-free spirits pricing across categories and origins
- Competitor intelligence — monitor new arrivals, rare bottles, and gift sets
How to use it
- Open the Actor on Apify Console and click Try for free
- Choose your input method:
  - Spirit Categories: select one or more categories from the multi-select dropdown
  - Start URLs: paste specific listing or product page URLs directly
- Select the Website Language (English or German) to match the category slugs
- Optionally set Max Requests Per Crawl to limit the run size
- Click Save & Run — results appear in the Output tab as they arrive
Input
Configure the Actor via the Input tab in Apify Console or pass a JSON object.
| Field | Type | Default | Description |
|---|---|---|---|
| startUrls | array | [] | Direct listing or product URLs. Takes priority over Spirit Categories. |
| spiritCategories | array | ["whisky-single-malt"] | One or more category keys to scrape. Ignored when Start URLs are provided. |
| language | string | "en" | Website language: en or de. Affects URL prefix and category slugs. |
| maxRequestsPerCrawl | integer | 0 (unlimited) | Cap on total pages visited (listing + detail). |
| proxyConfiguration | object | (none) | Apify Proxy or custom proxy settings. Recommended for large runs. |
Available spirit categories (35 total):
- Whisky: Single Malt, Blended Whisky, Bourbon Whiskey, Scotch/Scotland, Ireland, USA, Japan
- Rum: Brown, Gold, White, Spiced, Cuba
- Gins: Dry, London Dry, New Western Dry, Flavoured, Germany
- Vodka: Poland, Germany, USA, Russia
- Other: Aquavit, Cognac & Brandy, Liqueur & Herbs, Fruit Brandy & Grappa, Tequila
- Wines & Co.: Wine, Port Wine & Sherry, Sparkling, Champagne, Large Bottles, Gift Sets, Rarities, Helgoland Spirits, Non-Alcoholic
Example input (category mode):
{
  "spiritCategories": ["whisky-single-malt", "whisky-bourbon"],
  "language": "en",
  "maxRequestsPerCrawl": 200
}
Example input (direct URL mode):
{
  "startUrls": [
    { "url": "https://www.onlineshop-helgoland.de/en/spirits/whisky/single-malt/" }
  ]
}
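The input rules in the table above can be sketched as a small helper that assembles the JSON object. This is only an illustration: `build_run_input` and its parameter names are hypothetical and not part of the Actor. Only the field names (`startUrls`, `spiritCategories`, `language`, `maxRequestsPerCrawl`, `proxyConfiguration`, `useApifyProxy`) come from this README.

```python
import json
from typing import Any, Optional

ALLOWED_LANGUAGES = {"en", "de"}  # the two values listed for the language field


def build_run_input(
    categories: Optional[list[str]] = None,
    start_urls: Optional[list[str]] = None,
    language: str = "en",
    max_requests: int = 0,
    use_apify_proxy: bool = False,
) -> dict[str, Any]:
    """Assemble an Actor input object (hypothetical helper, not part of the Actor)."""
    if language not in ALLOWED_LANGUAGES:
        raise ValueError(f"language must be 'en' or 'de', got {language!r}")
    if max_requests < 0:
        raise ValueError("max_requests must be 0 (unlimited) or a positive integer")

    run_input: dict[str, Any] = {"language": language}
    if start_urls:
        # Start URLs take priority, so category keys are left out entirely.
        run_input["startUrls"] = [{"url": url} for url in start_urls]
    else:
        # Fall back to the documented default category.
        run_input["spiritCategories"] = categories or ["whisky-single-malt"]
    if max_requests:
        run_input["maxRequestsPerCrawl"] = max_requests
    if use_apify_proxy:
        run_input["proxyConfiguration"] = {"useApifyProxy": True}
    return run_input


if __name__ == "__main__":
    # Reproduces the category-mode example input.
    example = build_run_input(["whisky-single-malt", "whisky-bourbon"], max_requests=200)
    print(json.dumps(example, indent=2))
```

The resulting dictionary can be pasted into the JSON view of the Input tab or passed as the run input when starting the Actor through the API.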
Output
Each scraped product is saved as one JSON item in the default dataset. You can download the dataset in various formats such as JSON, CSV, Excel, or HTML from the Storage tab.
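Besides the Storage tab, dataset items can usually be fetched over the Apify API. The sketch below only builds the download URL and makes no request. The dataset ID is a placeholder, `dataset_items_url` is a hypothetical helper, and a private dataset would additionally need an API token, which is omitted here.

```python
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"
# Format keys matching the export options named above (Excel is "xlsx").
EXPORT_FORMATS = {"json", "csv", "xlsx", "html"}


def dataset_items_url(dataset_id: str, fmt: str = "json", clean: bool = True) -> str:
    """Build the items-export URL for a dataset (hypothetical helper)."""
    if fmt not in EXPORT_FORMATS:
        raise ValueError(f"unsupported format: {fmt!r}")
    # clean=true asks the API to skip empty items and hidden fields.
    query = urlencode({"format": fmt, "clean": str(clean).lower()})
    return f"{API_BASE}/datasets/{dataset_id}/items?{query}"


if __name__ == "__main__":
    # "YOUR_DATASET_ID" is a placeholder, not a real dataset.
    print(dataset_items_url("YOUR_DATASET_ID", "csv"))
```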
Example output record:
{
  "name": "Lagavulin 16 Years 0.7 liters 43% vol.",
  "product_url": "https://www.onlineshop-helgoland.de/en/produkte/whisky/lagavulin-16-years-0-7-liters-43-vol",
  "canonical_url": "https://www.onlineshop-helgoland.de/en/produkte/whisky/lagavulin-16-years-0-7-liters-43-vol",
  "product_id": "5010496004339",
  "ean_code": "5010496004339",
  "shop_sku": "5010496004339",
  "main_image": "https://www.onlineshop-helgoland.de/media/image/product/xxx/lg/lagavulin-16.jpg",
  "images": ["https://..."],
  "price_current": 52.95,
  "price_old": 59.95,
  "price_per_liter": 75.64,
  "availability_enum": "in_stock",
  "total_stock": "12+",
  "bottle_size_text": "0.7 Liter",
  "bottle_size_ml": 700,
  "age_years": 16,
  "distillery": "Lagavulin",
  "product_info": {
    "Age class": "16 Years",
    "Type": "Peated, Single Malt",
    "Country / Region": "Scotland",
    "Content": "0.7 l"
  },
  "description": "Lagavulin 16 is one of the classic Islay single malts…",
  "breadcrumbs": ["Spirits", "Whisky", "Single Malt"],
  "scrape_run_id": "tpLehwBM52yEcMFvH",
  "timestamp_utc": "2026-05-29T07:03:25.664Z",
  "last_seen_utc": "2026-05-29T07:03:25.664Z"
}
Data fields
| Field | Type | Description |
|---|---|---|
| name | string | Full product name as shown on the page |
| product_url | string | URL of the product detail page |
| canonical_url | string | Canonical URL from <link rel="canonical"> |
| product_id | string | EAN / GTIN-13 barcode |
| ean_code | string | Same as product_id |
| shop_sku | string | Same as product_id |
| main_image | string | Primary product image URL |
| images | array | All gallery image URLs |
| price_current | number | Current sale price in EUR |
| price_old | number | Crossed-out list price in EUR (if shown) |
| price_per_liter | number | Price per litre in EUR (if shown) |
| availability_enum | string | in_stock, out_of_stock, limited, preorder, or unknown |
| total_stock | string | Stock quantity indicator (e.g. "12+") |
| bottle_size_text | string | Raw bottle size string (e.g. "0.7 Liter") |
| bottle_size_ml | number | Bottle volume normalised to millilitres |
| age_years | integer | Age statement in years (null if NAS or unstated) |
| distillery | string | Producing distillery from manufacturer logo |
| product_info | object | All raw key-value pairs from the product info box |
| description | string | Full product description text |
| breadcrumbs | array | Category path (e.g. ["Spirits", "Whisky", "Single Malt"]) |
| scrape_run_id | string | Apify Actor run ID |
| timestamp_utc | string | UTC timestamp when the item was scraped |
| last_seen_utc | string | UTC timestamp of the most recent successful scrape |
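For price monitoring, the price fields combine naturally into a discount figure and a per-litre sanity check. The following is a minimal sketch using the values from the example record. The helper names `discount_percent` and `expected_price_per_liter` are hypothetical, and both return None when a required field is null, since optional fields may be absent.

```python
from typing import Any, Optional


def discount_percent(record: dict[str, Any]) -> Optional[float]:
    """Discount of price_current against price_old, in percent."""
    current = record.get("price_current")
    old = record.get("price_old")
    if current is None or not old:
        return None  # no crossed-out price shown, so no discount to report
    return round((old - current) / old * 100, 1)


def expected_price_per_liter(record: dict[str, Any]) -> Optional[float]:
    """Per-litre price recomputed from price_current and bottle_size_ml."""
    price = record.get("price_current")
    size_ml = record.get("bottle_size_ml")
    if price is None or not size_ml:
        return None
    return round(price * 1000 / size_ml, 2)


# Price fields copied from the example output record.
record = {
    "price_current": 52.95,
    "price_old": 59.95,
    "price_per_liter": 75.64,
    "bottle_size_ml": 700,
}

print(discount_percent(record))          # 11.7
print(expected_price_per_liter(record))  # 75.64
```

Here the recomputed per-litre price matches the scraped `price_per_liter`, which makes it a cheap consistency check on `bottle_size_ml`.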
Cost estimation
Helgoland Spirits Scraper uses Puppeteer (headless Chrome), which consumes more compute units than a plain HTTP scraper. Typical performance:
- ~1–2 seconds per detail page
- ~50 products per listing page
- A full category (1–3 pages) = roughly 50–150 detail pages
Rough compute unit estimates:
| Run scope | Pages | Estimated CUs |
|---|---|---|
| 1 category (50 products) | ~51 | ~0.5 CU |
| 5 categories (~250 products) | ~260 | ~2–3 CU |
| Full catalog (~1 750 products) | ~1 800 | ~15–20 CU |
Apify's Free plan includes 5 CUs/month — enough for targeted category runs. Use maxRequestsPerCrawl to cap cost on test runs.
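The arithmetic behind the table can be sketched as follows. The figure of 50 products per listing page comes from the performance notes above. The per-page CU range of 0.008 to 0.011 is an assumption back-calculated from the table rows, not a measured or guaranteed rate, and the function names are hypothetical.

```python
import math

PRODUCTS_PER_LISTING_PAGE = 50      # from the performance notes above
CU_PER_PAGE_RANGE = (0.008, 0.011)  # assumption: ratio implied by the table rows


def estimate_pages(products: int) -> int:
    """Detail pages plus the listing pages needed to reach them."""
    listing_pages = math.ceil(products / PRODUCTS_PER_LISTING_PAGE)
    return products + listing_pages


def estimate_compute_units(products: int) -> tuple[float, float]:
    """Rough (low, high) compute unit range for a run of the given size."""
    pages = estimate_pages(products)
    low, high = CU_PER_PAGE_RANGE
    return (round(pages * low, 2), round(pages * high, 2))


print(estimate_pages(50))          # 51
print(estimate_compute_units(50))  # (0.41, 0.56)
```

When a run spans several categories, each category contributes at least one listing page of its own, so real page counts come out slightly higher than this single-listing estimate (compare the ~260 pages quoted for ~250 products).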
Tips
- Use Start URLs when you want to scrape a single specific product or a custom listing page
- Set maxRequestsPerCrawl: 50 for a quick test run before committing to a full crawl
- Enable Apify Proxy (useApifyProxy: true) for large runs to avoid rate limiting
- Products with size variants (e.g. 0.7L and 1L options) produce one dataset record per variant, each with its own price and images
- German language (language: "de") uses different category slugs, which is useful if you need DE-language descriptions
FAQ & Legal
Is scraping onlineshop-helgoland.de legal? Web scraping publicly available product data is generally permitted for personal or research use. Always comply with the site's Terms of Service and do not overload the server. This Actor implements rate limiting and retries to behave as a polite crawler.
What if some fields are null? Fields are null only when the information is genuinely absent from the product page. No values are inferred or generated.
Products with multiple sizes show duplicate records — is that correct? Yes. Each size variant (configurator option) produces its own dataset row with the correct price, image, and bottle size for that variant.
Something broken or a feature request? Open an issue in the Issues tab or contact us via the Apify Console.