Dutch Supermarket Price Scraper avatar
Dutch Supermarket Price Scraper

Pricing

$19.00/month + usage

Go to Apify Store
Dutch Supermarket Price Scraper

Dutch Supermarket Price Scraper

Developed by

Harvest Edge

Harvest Edge

Maintained by Community

Albert Heijn, Jumbo, and Dirk Supermarket Scraper extracts product data from all major Dutch supermarkets based on user-defined search terms. Collects product details like name, price, unit price, non-discounted price, and URLs, ideal for price comparison, product matching or market analysis.

0.0 (0)

Pricing

$19.00/month + usage

0

2

2

Last modified

7 days ago

🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🚜

🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽

Harvest Edge / Dutch Supermarkets Scraper

🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱

πŸŒ½πŸŒ½πŸŒ½πŸŒ½πŸ†πŸŒ½πŸŒ½πŸŒ½πŸŒ½πŸŒ½πŸŒ½πŸŒ½πŸšœπŸŒ±πŸŒ±πŸŒ±πŸŒ±πŸŒ±πŸŒ±πŸŒ±πŸŒ±πŸŒ±πŸŒ±

Harvest Edge makes business information available to everyone!

Overview

The Dutch Supermarkets Scraper extracts product data from major Dutch supermarkets (Albert Heijn, Jumbo, Dirk) based on user-defined search keyterms. It collects product details such as name, price, unit price, unit size, discounts, and URLs, ideal for price comparison or market analysis. The scraper implements responsible scraping with a configurable delay between requests (default: 700ms).

Nederland, Holland, Supermarkt, Groceries, Albert Heijn, Jumbo, Dirk, Prijzen, Boodschappen, Hoogvliet, Lidl, Tesco, Carrefour, Fast-Moving Consumer Goods.

Feel free to contact us via Apify for feature requests or bug reports.

Features

  • Scrapes product data from Albert Heijn, Jumbo, and Dirk based on multiple user-defined keyterms.
  • Extracts detailed product information: name, price (€), unit price (€), unit, unit size, discount, and product URL.
  • Supports limiting the total number of scraped products (default: 10,000).
  • Implements a configurable throttle delay between requests (minimum 100ms, default 700ms) to ensure responsible scraping.
  • Summarizes total products fetched and pages processed per keyterm and supermarket, with error logging for issues like timeouts.

Input

The Actor accepts the following input parameters via the Apify platform:

KeyTypeDescriptionDefaultRequired
keytermsArrayKeywords to search for products (e.g., Melk, Brood).["Melk", "Brood", "Spaghetti"]Yes
maxResultsIntegerMaximum number of products to scrape (minimum 1).10000Yes
supermarketsArraySupermarkets to scrape (ah, jumbo, dirk).["ah", "jumbo", "dirk"]No
throttleDelayIntegerDelay between page requests in milliseconds (minimum 100).700Yes

Available Supermarkets:

  • ah (Albert Heijn)
  • jumbo (Jumbo)
  • dirk (Dirk)

Note: The scraper processes each keyterm across selected supermarkets, stopping when the maxResults limit is reached or no more products are found.

Output

The Actor outputs a dataset in JSON format with the following fields for each product:

FieldTypeDescription
idStringUnique identifier for the product.
queryStringSearch keyterm used.
supermarketStringSupermarket name (Jumbo, AH, Dirk).
nameStringProduct name.
price_eurStringProduct price in euros (or empty if unavailable).
unit_price_eurStringPrice per unit in euros (or empty if unavailable).
unitStringUnit of measurement (e.g., kg, l).
unit_sizeStringProduct size (e.g., 1 l, 500 g).
discountStringDiscount information (or empty if none).
urlStringProduct page URL.
scrape_timestampStringTimestamp of when the product was scraped.

Example Output:

{
"id": "550e8400-e29b-41d4-a716-446655440000",
"query": "Melk",
"supermarket": "Jumbo",
"name": "Jumbo Volle Melk",
"price_eur": "1.09",
"unit_price_eur": "1.09",
"unit": "l",
"unit_size": "1 l",
"discount": "",
"url": "https://www.jumbo.com/producten/jumbo-volle-melk-1l",
"scrape_timestamp": "2025-09-25 15:12:00+0200"
}