
Albert Heijn Scraper
Pricing
$3.00 / 1,000 results

Albert Heijn Scraper
The Albert Heijn Price Scraper extracts product data from dutch supermarket Albert Heijn based on user-defined search keyterms. It collects product details such as name, price, unit price, unit size, discounts, and URLs, ideal for price comparison or market analysis.
0.0 (0)
Pricing
$3.00 / 1,000 results
0
1
1
Last modified
3 days ago
π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π
π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½π½
Harvest Edge / Albert Heijn Scraper
π±π±π±π±π±π±π±π±π±π±π±π±π±π±π±π±π±π±π±π±π±π±π±
π½π½π½π½ππ½π½π½π½π½π½π½ππ±π±π±π±π±π±π±π±π±π±
Harvest Edge makes business information available to everyone!
Overview
The Albert Heijn Scraper extracts product data from the Albert Heijn supermarket based on user-defined search keyterms. It collects product details such as name, price, unit price, unit size, discounts, and URLs, ideal for price comparison or market analysis. The scraper implements responsible scraping with a configurable delay between requests (default: 700ms).
Nederland, Holland, Supermarkt, Groceries, Albert Heijn, Prijzen, Boodschappen, Fast-Moving Consumer Goods.
Feel free to contact us via Apify for feature requests or bug reports.
Features
-
Scrapes product data from Albert Heijn based on multiple user-defined keyterms.
-
Extracts detailed product information: name, price (β¬), unit price (β¬), unit, unit size, and product URL.
-
Supports limiting the total number of scraped products (default: 10,000).
-
Implements a configurable throttle delay between requests (minimum 100ms, default 700ms) to ensure compliant scraping.
-
Summarizes total products fetched and pages processed per keyterm, with error logging for issues like timeouts.
Input
The Actor accepts the following input parameters via the Apify platform:
Key | Type | Description | Default | Required |
---|---|---|---|---|
keyterms | Array | Keywords to search for products (e.g., Melk, Brood). | ["Melk", "Brood", "Spaghetti"] | Yes |
maxResults | Integer | Maximum number of products to scrape (minimum 1). | 10000 | Yes |
throttleDelay | Integer | Delay between page requests in milliseconds (minimum 100). | 700 | Yes |
Output
The Actor outputs a dataset in JSON format with the following fields for each product:
Field | Type | Description |
---|---|---|
id | String | Unique identifier for the product. |
query | String | Search keyterm used. |
supermarket | String | Supermarket name (Jumbo, AH, Dirk). |
name | String | Product name. |
price_eur | String | Product price in euros (or empty if unavailable). |
unit_price_eur | String | Price per unit in euros (or empty if unavailable). |
unit | String | Unit of measurement (e.g., kg, l). |
unit_size | String | Product size (e.g., 1 l, 500 g). |
discount | String | Discount information (or empty if none). |
url | String | Product page URL. |
scrape_timestamp | String | Timestamp of when the product was scraped. |
Example Output:
{"id": "550e8400-e29b-41d4-a716-446655440000","query": "Melk","supermarket": "AH","name": "Volle Melk","price_eur": "1.09","unit_price_eur": "1.09","unit": "l","unit_size": "1 l","discount": "","url": "https://www.ah.nl/","scrape_timestamp": "2025-09-25 15:12:00+0200"}