# Jumbo Scraper — Dutch Supermarket Prices & Promotions (`studio-amba/jumbo-scraper`) Actor

Scrape products, prices, promotions, unit prices, brands and EAN barcodes from Jumbo (jumbo.com). Search by keyword. No login or cookies required.

- **URL**: https://apify.com/studio-amba/jumbo-scraper.md
- **Developed by:** [Studio Amba](https://apify.com/studio-amba) (community)
- **Categories:** E-commerce
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Jumbo Scraper — Dutch Supermarket Prices & Promotions

Scrape product data from **Jumbo** ([jumbo.com](https://www.jumbo.com)), the second-largest supermarket chain in the Netherlands. Search the full Jumbo assortment by keyword and get clean, structured JSON with names, prices, promotions, unit prices, brands and EAN barcodes.

No login. No cookies. No browser. Just give it a search term and it returns the matching products.

### What this Actor does

This Actor queries Jumbo's own product search backend and returns every matching product with its current pricing and promotion data. It is built for price monitoring, assortment research, competitive analysis and building grocery price datasets.

For each product you get:

- **Name** — full product title (e.g. "Jumbo's Beste Koffie Snelfiltermaling Aroma 500 g")
- **Brand** — the product brand
- **Price** — current shelf price in euros
- **Promotion price** — the price when a promotion applies (if any)
- **Promotion** — the promotion mechanism text (e.g. "1+1 gratis", "2e halve prijs")
- **Unit price** — normalized price per kg / litre / piece
- **Unit** — the package size (e.g. "500 g", "2 x 1 L")
- **Category** — the Jumbo product category
- **EAN** — the product barcode / GTIN
- **SKU** — Jumbo's internal product id
- **In stock** — whether the product is currently available
- **Image URL** — product image
- **URL** — link to the product page on jumbo.com
- **Scraped at** — ISO timestamp of the run

### How to scrape Jumbo data

1. Open this Actor in Apify Console (or call it via the Apify API).
2. Set a **Search Query** — a Dutch keyword such as `koffie`, `kaas`, `pasta`, `melk` or a brand name.
3. Set **Max Results** — how many products you want back (default 50).
4. Leave the **Proxy Configuration** on the default. Jumbo's API is behind Cloudflare and only answers Dutch IP addresses, so a Netherlands residential proxy is prefilled and required.
5. Click **Start**. The Actor paginates through the search results and pushes each product to the dataset.
6. Download the results as JSON, CSV, Excel or feed them straight into your own pipeline via the API.

That is the whole flow. One keyword in, a clean product dataset out.

#### Example input

```json
{
    "searchQuery": "koffie",
    "maxResults": 100,
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": ["RESIDENTIAL"],
        "apifyProxyCountry": "NL"
    }
}
````

#### Example output

```json
{
    "name": "Jumbo's Beste Koffie Snelfiltermaling Aroma 500 g",
    "brand": "Jumbo's",
    "ean": "8718452840120",
    "sku": "637315PAK",
    "price": 7.99,
    "currency": "EUR",
    "unit": "500 g",
    "unitPrice": "€15.98 / kg",
    "promotionPrice": null,
    "promotion": null,
    "category": "Koffie en thee",
    "url": "https://www.jumbo.com/producten/jumbo-s-beste-koffie-snelfiltermaling-aroma-500-g-637315PAK",
    "imageUrl": "https://www.jumbo.com/dam-images/fit-in/360x360/Products/...png",
    "inStock": true,
    "scrapedAt": "2026-06-24T10:00:00.000Z"
}
```

### Input reference

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `searchQuery` | string | no | Dutch keyword or brand to search for. Defaults to `koffie`. |
| `maxResults` | integer | no | Maximum number of products to return. Default 50. |
| `proxyConfiguration` | object | no | Proxy settings. Defaults to Apify residential proxy in the Netherlands, which is required. |

### Output reference

Each dataset item is a single product. Fields:

| Field | Type | Description |
| --- | --- | --- |
| `name` | string | Full product title |
| `brand` | string | Product brand |
| `ean` | string | Barcode / GTIN |
| `sku` | string | Jumbo internal product id |
| `price` | number | Current price in euros |
| `currency` | string | Always `EUR` |
| `unit` | string | Package size |
| `unitPrice` | string | Normalized price per unit |
| `promotionPrice` | number/null | Price under promotion, if any |
| `promotion` | string/null | Promotion mechanism text |
| `category` | string/null | Jumbo product category |
| `url` | string | Product page URL |
| `imageUrl` | string/null | Product image URL |
| `inStock` | boolean | Availability |
| `scrapedAt` | string | ISO timestamp |

### Why a Netherlands residential proxy is required

Jumbo serves its product API from a Dutch-only backend protected by Cloudflare. Requests from datacenter IPs or non-Dutch IPs are blocked. The default proxy configuration uses Apify's residential proxy pool scoped to the Netherlands, which is what makes the Actor work reliably. You normally do not need to change it.

### Common use cases

- **Price monitoring** — track Jumbo prices for a basket of products over time.
- **Competitive analysis** — compare Jumbo against Albert Heijn and other Dutch supermarkets.
- **Promotion tracking** — capture "1+1 gratis" and other promo mechanisms as they appear.
- **Assortment research** — see how many products Jumbo lists for a category or brand.
- **Dataset building** — collect grocery names, EANs and prices for analytics or ML.

### Notes and limits

- Search is keyword-based, one keyword per run. Run the Actor multiple times for multiple keywords.
- Prices are scraped live at run time and reflect Jumbo's national webshop pricing.
- The Actor only reads public catalog data. It does not log in or access any account.
- If a run returns zero items it fails on purpose, so you never get a silent empty dataset.

### Frequently asked questions

#### Do I need a Jumbo account?

No. The Actor reads the public product catalog. No login, no cookies.

#### Can I scrape a whole category instead of searching?

This version is keyword-based. Use a broad keyword or a brand name to pull a large slice of a category.

#### Why do I get fewer items than `maxResults`?

A search only has so many matching products. The Actor stops when Jumbo runs out of matches for your keyword.

#### Is the price in euros or cents?

Euros. The Actor converts Jumbo's internal cents value to a normal euro amount (499 becomes 4.99).

# Actor input Schema

## `searchQuery` (type: `string`):

Search for products by name or keyword (e.g. 'koffie', 'kaas', 'pasta'). One search per run.

## `maxResults` (type: `integer`):

Maximum number of products to return.

## `proxyConfiguration` (type: `object`):

Proxy settings. Jumbo's API is behind Cloudflare and only answers from Dutch IPs, so a Netherlands residential proxy is required. The prefilled Apify residential NL proxy works out of the box.

## Actor input object example

```json
{
  "searchQuery": "koffie",
  "maxResults": 50,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ],
    "apifyProxyCountry": "NL"
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "searchQuery": "koffie",
    "maxResults": 50,
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ],
        "apifyProxyCountry": "NL"
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("studio-amba/jumbo-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "searchQuery": "koffie",
    "maxResults": 50,
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
        "apifyProxyCountry": "NL",
    },
}

# Run the Actor and wait for it to finish
run = client.actor("studio-amba/jumbo-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "searchQuery": "koffie",
  "maxResults": 50,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ],
    "apifyProxyCountry": "NL"
  }
}' |
apify call studio-amba/jumbo-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=studio-amba/jumbo-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Jumbo Scraper — Dutch Supermarket Prices & Promotions",
        "description": "Scrape products, prices, promotions, unit prices, brands and EAN barcodes from Jumbo (jumbo.com). Search by keyword. No login or cookies required.",
        "version": "0.1",
        "x-build-id": "ctYvq0DY2kasvAGfk"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/studio-amba~jumbo-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-studio-amba-jumbo-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/studio-amba~jumbo-scraper/runs": {
            "post": {
                "operationId": "runs-sync-studio-amba-jumbo-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/studio-amba~jumbo-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-studio-amba-jumbo-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "searchQuery": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Search for products by name or keyword (e.g. 'koffie', 'kaas', 'pasta'). One search per run."
                    },
                    "maxResults": {
                        "title": "Max Results",
                        "minimum": 1,
                        "maximum": 10000,
                        "type": "integer",
                        "description": "Maximum number of products to return.",
                        "default": 50
                    },
                    "proxyConfiguration": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Proxy settings. Jumbo's API is behind Cloudflare and only answers from Dutch IPs, so a Netherlands residential proxy is required. The prefilled Apify residential NL proxy works out of the box."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
