Helgoland Spirits Scraper avatar

Helgoland Spirits Scraper

Pricing

Pay per usage

Go to Apify Store
Helgoland Spirits Scraper

Helgoland Spirits Scraper

Scrape spirits, wine, and beverage products from Onlineshop Helgoland. Extract prices, stock levels, bottle sizes, EANs, distillery details, descriptions, and images in structured JSON. Supports category-based and direct URL scraping.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

ScrapySpider

ScrapySpider

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

What does Onlineshop Helgoland Spirits Scraper do?

This Actor scrapes spirits, wine, and beverage products from onlineshop-helgoland.de — one of Germany's unique duty-free island shops. It crawls listing pages across 35 categories (Whisky, Rum, Gin, Vodka, Wines & more), follows every product to its detail page, and outputs a structured JSON record per product — including prices, availability, bottle size, distillery, product info box data, description, and images.

Run it on the Apify platform for automatic scheduling, proxy rotation, cloud storage, and API/webhook integrations. Try it directly from the Apify Console input form — no coding required.

Why use this Actor?

  • Price monitoring — track current and crossed-out list prices across the full catalog
  • Inventory tracking — know which products are in stock and how many units remain
  • Catalog enrichment — export structured product data (EAN, distillery, bottle size, age) for PIM or e-commerce systems
  • Market research — analyse duty-free spirits pricing across categories and origins
  • Competitor intelligence — monitor new arrivals, rare bottles, and gift sets

How to use it

  1. Open the Actor on Apify Console and click Try for free
  2. Choose your input method:
    • Spirit Categories — select one or more categories from the multi-select dropdown
    • Start URLs — paste specific listing or product page URLs directly
  3. Select the Website Language (English or German) to match the category slugs
  4. Optionally set Max Requests Per Crawl to limit the run size
  5. Click Save & Run — results appear in the Output tab as they arrive

Input

Configure the Actor via the Input tab in Apify Console or pass a JSON object.

FieldTypeDefaultDescription
startUrlsarray[]Direct listing or product URLs. Takes priority over Spirit Categories.
spiritCategoriesarray["whisky-single-malt"]One or more category keys to scrape. Ignored when Start URLs are provided.
languagestring"en"Website language — en or de. Affects URL prefix and category slugs.
maxRequestsPerCrawlinteger0 (unlimited)Cap on total pages visited (listing + detail).
proxyConfigurationobjectApify Proxy or custom proxy settings. Recommended for large runs.

Available spirit categories (35 total):

Whisky Single Malt, Blended Whisky, Bourbon Whiskey, Scotch/Scotland, Ireland, USA, Japan · Rum: Brown, Gold, White, Spiced, Cuba · Gins: Dry, London Dry, New Western Dry, Flavoured, Germany · Vodka: Poland, Germany, USA, Russia · Other: Aquavit, Cognac & Brandy, Liqueur & Herbs, Fruit Brandy & Grappa, Tequila · Wines & Co.: Wine, Port Wine & Sherry, Sparkling, Champagne, Large Bottles, Gift Sets, Rarities, Helgoland Spirits, Non-Alcoholic

Example input (category mode):

{
"spiritCategories": ["whisky-single-malt", "whisky-bourbon"],
"language": "en",
"maxRequestsPerCrawl": 200
}

Example input (direct URL mode):

{
"startUrls": [
{ "url": "https://www.onlineshop-helgoland.de/en/spirits/whisky/single-malt/" }
]
}

Output

Each scraped product is saved as one JSON item in the default dataset. You can download the dataset in various formats such as JSON, CSV, Excel, or HTML from the Storage tab.

Example output record:

{
"name": "Lagavulin 16 Years 0.7 liters 43% vol.",
"product_url": "https://www.onlineshop-helgoland.de/en/produkte/whisky/lagavulin-16-years-0-7-liters-43-vol",
"canonical_url": "https://www.onlineshop-helgoland.de/en/produkte/whisky/lagavulin-16-years-0-7-liters-43-vol",
"product_id": "5010496004339",
"ean_code": "5010496004339",
"shop_sku": "5010496004339",
"main_image": "https://www.onlineshop-helgoland.de/media/image/product/xxx/lg/lagavulin-16.jpg",
"images": ["https://..."],
"price_current": 52.95,
"price_old": 59.95,
"price_per_liter": 75.64,
"availability_enum": "in_stock",
"total_stock": "12+",
"bottle_size_text": "0.7 Liter",
"bottle_size_ml": 700,
"age_years": 16,
"distillery": "Lagavulin",
"product_info": {
"Age class": "16 Years",
"Type": "Peated, Single Malt",
"Country / Region": "Scotland",
"Content": "0.7 l"
},
"description": "Lagavulin 16 is one of the classic Islay single malts…",
"breadcrumbs": ["Spirits", "Whisky", "Single Malt"],
"scrape_run_id": "tpLehwBM52yEcMFvH",
"timestamp_utc": "2026-05-29T07:03:25.664Z",
"last_seen_utc": "2026-05-29T07:03:25.664Z"
}

Data fields

FieldTypeDescription
namestringFull product name as shown on the page
product_urlstringURL of the product detail page
canonical_urlstringCanonical URL from <link rel="canonical">
product_idstringEAN / GTIN-13 barcode
ean_codestringSame as product_id
shop_skustringSame as product_id
main_imagestringPrimary product image URL
imagesarrayAll gallery image URLs
price_currentnumberCurrent sale price in EUR
price_oldnumberCrossed-out list price in EUR (if shown)
price_per_liternumberPrice per litre in EUR (if shown)
availability_enumstringin_stock, out_of_stock, limited, preorder, or unknown
total_stockstringStock quantity indicator (e.g. "12+")
bottle_size_textstringRaw bottle size string (e.g. "0.7 Liter")
bottle_size_mlnumberBottle volume normalised to millilitres
age_yearsintegerAge statement in years (null if NAS or unstated)
distillerystringProducing distillery from manufacturer logo
product_infoobjectAll raw key-value pairs from the product info box
descriptionstringFull product description text
breadcrumbsarrayCategory path (e.g. ["Spirits", "Whisky", "Single Malt"])
scrape_run_idstringApify Actor run ID
timestamp_utcstringUTC timestamp when the item was scraped
last_seen_utcstringUTC timestamp of the most recent successful scrape

Cost estimation

Helgoland Spirits Scraper uses Puppeteer (headless Chrome) which consumes more compute units than plain HTTP scrapers. Typical performance:

  • ~1–2 seconds per detail page
  • ~50 products per listing page
  • A full category (1–3 pages) = roughly 50–150 detail pages

Rough compute unit estimates:

Run scopePagesEstimated CUs
1 category (50 products)~51~0.5 CU
5 categories (~250 products)~260~2–3 CU
Full catalog (~1 750 products)~1 800~15–20 CU

Apify's Free plan includes 5 CUs/month — enough for targeted category runs. Use maxRequestsPerCrawl to cap cost on test runs.

Tips

  • Use Start URLs when you want to scrape a single specific product or a custom listing page
  • Set maxRequestsPerCrawl: 50 for a quick test run before committing to a full crawl
  • Enable Apify Proxy (useApifyProxy: true) for large runs to avoid rate limiting
  • Products with size variants (e.g. 0.7L and 1L options) produce one dataset record per variant, each with its own price and images
  • German language (language: "de") uses different category slugs — useful if you need DE-language descriptions

Is scraping onlineshop-helgoland.de legal? Web scraping publicly available product data is generally permitted for personal or research use. Always comply with the site's Terms of Service and do not overload the server. This Actor implements rate limiting and retries to behave as a polite crawler.

What if some fields are null? Fields are null only when the information is genuinely absent from the product page. No values are inferred or generated.

Products with multiple sizes show duplicate records — is that correct? Yes. Each size variant (configurator option) produces its own dataset row with the correct price, image, and bottle size for that variant.

Something broken or a feature request? Open an issue in the Issues tab or contact us via the Apify Console.