# REMA 1000 Scraper - Danish Grocery Products & Prices (`blackfalcondata/rema1000-scraper`) Actor

Scrape rema1000.dk — Denmark's largest discount-grocery chain with 3900 SKUs. Search products by ingredient, monitor tilbud price changes, and track weekly basket costs over time for grocery planning and price alerts.

- **URL**: https://apify.com/blackfalcondata/rema1000-scraper.md
- **Developed by:** [Black Falcon Data](https://apify.com/blackfalcondata) (community)
- **Categories:** E-commerce, Lead generation, Automation
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $4.00 / 1,000 results

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### What does REMA 1000 Scraper do?

REMA 1000 Scraper extracts structured product data from [rema1000.dk](https://rema1000.dk) — including current shelf price and campaign flags, per-unit comparison pricing, bottle-deposit (pant) breakout, department taxonomy, product labels (organic, store-brand, …), temperature-zone flags, and multi-resolution product images. It supports keyword search and controllable result limits, so you can run the same query consistently over time.

**New to Apify?** [Sign up free](https://console.apify.com/sign-up?fpr=1h3gvi) and use the included $5 monthly platform credit to test this actor.

### Key features

<!-- KEY_FEATURES:START -->
- **🛍️ Shopping-basket monitor** — paste your weekly shopping list as `productIds` (or `productUrls` — URLs straight from your browser) and combine with Incremental Mode to get rows back only when something changes: price drop, campaign start, deposit change, or the SKU getting delisted. Weekly tracking of a 20-item basket runs around a cent per check.
- **🥗 Search by ingredient** — find products by what they contain, not just their name. The store's own search indexes product titles and categories but not ingredients, so a search for "pistacie" misses chocolate, ice cream, and granola bars that contain pistachios. The `ingredientFilter` input lets you substring-match on the ingredient list — pass a single term (`"pistacie"`) or an array of terms that must all appear (`["pistacie", "hasselnød"]` for products containing both). Useful for allergen tracking, premium-ingredient discovery, and preservative audits.
- **🔍 Keyword search** — search the catalog with `query` ("mælk", "økologisk", "pasta", brand names …) — matches across product names, descriptions, and category terms. Pair with Incremental Mode to monitor every product that matches a query, even as new SKUs enter the catalog.
- **💰 Current price + tilbud with discount %** — every product row carries the active shelf `price` plus `isCampaign` / `isAdvertised` flags, `priceValidFrom` / `priceValidUntil`, a `priceOverMaxQuantity` value when the chain caps the campaign at N units per customer, and a precomputed `discountPercent` field that ranks tilbud by % off the regular price (Bearnaise on tilbud reads `discountPercent: 61.5`, Tuborg 6-pak `31.5`, etc.). Sort the dataset by `discount-desc` to surface the week's deepest cuts directly.
- **🥗 Ingredients + structured nutrition** — every product row carries the full `ingredients` text (allergens already stripped of inline HTML markup) plus a structured `nutrition[]` array — name/value pairs for energy, fat, saturated fat, carbs, sugars, fibre, protein, salt — pulled straight from the compliance-grade source the chain itself publishes. No regex over marketing prose.
- **🏭 EU GPSR manufacturer + recall signal** — `manufacturer` block on every product: name, full address, country code, website, and `securityAlert` (non-null when the SKU is under active recall / safety warning). Required by EU GPSR regulation since 13 Dec 2024 — surfaced ready-made for compliance audits, recall monitoring, and supplier intelligence.
- **🏷️ Structured unit size + brand** — `unitSize`, `unitMeasure`, and `brand` parsed from the shelf-label string — so "1 LTR. / REMA 1000" becomes `1` + `ltr` + `REMA 1000` as separate fields. No regex on the subtitle. 14 unit tokens recognized across the full catalog (gr / ml / stk / cl / ltr / kg / par / mtr / pk / sæt / bakke / bdt / pose / rl) at 100% parse rate.
- **♻️ Incremental mode** — daily runs emit only products whose price, campaign flag, or validity window changed since the prior run — the rest stay UNCHANGED and are filtered out by default. Perfect for price-tracking apps and tilbud-alert pipelines. Saves 80–95% on recurring monitoring.
- **🔔 Notifications** — Telegram, Slack, Discord, WhatsApp Cloud API, generic webhook — out of the box. Pair with incremental + `notifyOnlyChanges` for "new tilbud" alerts the moment campaigns drop, without polling the flyer.
- **📦 Compact mode** — drop the heavy fields (ingredients text, full nutrition table, image variants, manufacturer block, label objects, price history) and keep shopping essentials: identity, current price, per-unit price, campaign flags + discount %, parsed unit size + brand, and basket line totals. Cuts each row by ~77% — ideal for AI agents, MCP servers, and LLM context windows where allergen text is noise.
<!-- KEY_FEATURES:END -->

### What data can you extract from rema1000.dk?

Each result includes Core listing fields (`productId`, `name`, `subtitle`, `unitSize`, `unitMeasure`, `brand`, `ingredients`, and `nutrition`, and more) and detail fields when enrichment is enabled (`description`). In standard mode, all fields are always present — unavailable data points are returned as `null`, never omitted. In compact mode, only core fields are returned.

### Input

The main inputs are a search keyword and a result limit. Additional filters and options are available in the input schema.

Key parameters:

- **`productIds`** — Track a specific list of products by REMA product ID (e.g. [100004, 60214]). Combine with Incremental Mode to monitor your weekly shopping basket — only get rows back when something changes (price, campaign, delisting). When using Incremental Mode you MUST also set 🔑 State Key (e.g. "my-rema-basket") so multiple baskets don't share state. (default: `[]`)
- **`productUrls`** — Same as productIds but accepts full rema1000.dk product URLs — paste straight from your browser. URLs are merged with productIds. (default: `[]`)
- **`quantities`** — Optional per-product counts for basket mode. Two accepted forms:
• Object map: `{"200313": 12, "21464": 6}` (compact, preferred for the Console editor)
• Array of pairs: `[{"id": 200313, "qty": 12}, {"id": 21464, "qty": 6}]` (useful from Make / n8n / Zapier flows)
Drives subtotal, pant, and max-quantity (over-cap) campaign math. Default qty=1 per product when omitted. Ignored outside basket mode. (default: `{}`)
- **`query`** — Keyword search across the REMA catalog (e.g. "mælk", "økologisk", "pasta"). Searches product names, brands, and category terms (NOT ingredients — use ingredientFilter for that). Mutually exclusive with productIds — if both are set, productIds wins.
- **`sortBy`** — Optional ordering of the dataset rows before push. Default is source-emit order (department + page for browse, search-relevance for search, basket-input order for basket). Null / missing values always sink to the bottom regardless of direction.
- **`ingredientFilter`** — Case-insensitive substring filter on the ingredient list. Accepts a single ingredient (`"pistacie"`) or a JSON array of terms that ALL must appear (`["pistacie", "hasselnød"]` finds products containing both). Useful for finding products with a specific ingredient that isn't in the product name — REMA's search doesn't index ingredients, so this filter runs over fetched results. Narrow your scope with `query` or `productIds` first to keep cost down. Only meaningful in basket and search modes.
- **`departmentIds`** — Limit scraping to specific departments. Leave empty to scrape all 15 departments. IDs: 10 Brød, 20 Frugt & grønt, 30 Kød/fisk/fjerkræ, 40 Køl, 50 Frost, 60 Mejeri, 70 Ost, 80 Kolonial, 90 Drikkevarer, 100 Husholdning, 110 Baby, 120 Pleje, 130 Slik, 140 Kiosk, 160 Nemt & hurtigt. (default: `[]`)
- **`campaignsOnly`** — Only emit products currently on campaign (tilbud) or in the advertised flyer. (default: `false`)
- **`maxResults`** — Maximum total products (0 = unlimited; full catalog ≈ 3,900). (default: `200`)
- **`compact`** — Drop heavy fields (ingredients text, full nutrition table, image variants, manufacturer block, label objects, price history) and keep only shopping essentials: identity, current price, per-unit price, campaign flags + discount %, basket line totals, lifecycle. Cuts each row from ~2,850 chars to ~650 (77% smaller) — ideal for AI agents, MCP servers, and LLM context windows where you don't need allergen text. Same dataset row count, same billing — just smaller payloads. (default: `false`)
- **`incrementalMode`** — Compare against previous run state — emit only NEW / UPDATED / EXPIRED products (price changes, new SKUs, delistings). In Basket mode you MUST also set stateKey. In Search / Browse modes stateKey auto-generates from the scope dimensions. (default: `false`)
- **`stateKey`** — Stable identifier for incremental tracking. In Basket mode this is REQUIRED — choose a name like "my-rema-basket" so multiple baskets don't share state. In Search / Browse mode it auto-generates from your query / departments if left empty.
- ...and 15 more parameters

### Input examples

**Weekly basket with quantities — 10 products, tilbud + pant** — Tracks a realistic 10-product weekly shop with per-item quantities. The mix includes two tilbud-with-pant items (Tuborg 6-pak), two tilbud-without-pant items (Bearnaise, kyllingelår), two normal-with-pant items (Icetea, Ribena), and four normal items. Combined with `incrementalMode: true`, only price changes / new campaigns / delistings get pushed.

→ 10 dataset rows (one per product, with `quantity`, `lineSubtotal`, `lineDeposit`, `lineSavings`, `lineTotal` populated on each). Run status message + `weekly-shopping-summary` KV record carry the basket roll-up: **products 10 · units 24 · subtotal 330.03 + pant 22.50 = total 352.53 DKK · saving 123.30 DKK on 8 tilbud-units**.

```json
{
  "productIds": [
    "200313",
    "110032",
    "250055",
    "10914",
    "21464",
    "100004",
    "100028",
    "100042",
    "11046",
    "100065"
  ],
  "quantities": {
    "10914": 6,
    "11046": 1,
    "21464": 3,
    "100004": 1,
    "100028": 2,
    "100042": 2,
    "100065": 1,
    "110032": 4,
    "200313": 2,
    "250055": 2
  },
  "incrementalMode": true,
  "stateKey": "weekly-shopping",
  "notifyOnlyChanges": true
}
````

**Basic search** — Keyword-driven search with a result cap.

→ Full payload per result — all standard fields populated where the source provides them.

```json
{
  "query": "mælk",
  "maxResults": 50
}
```

**Incremental tracking** — Only emit products that changed since the previous run with this `stateKey`.

→ First run builds the baseline state. Subsequent runs emit only records that are new or whose tracked content changed. Set `emitUnchanged: true` to include unchanged records as well.

```json
{
  "query": "mælk",
  "maxResults": 200,
  "incrementalMode": true,
  "stateKey": "mælk-tracker"
}
```

**Compact output for AI agents** — Return only core fields for AI-agent and MCP workflows.

→ Small payload with the most important fields — ideal for piping into LLMs without token overhead.

```json
{
  "query": "mælk",
  "maxResults": 50,
  "compact": true
}
```

### Output

Each run produces a dataset of structured product records. Results can be downloaded as JSON, CSV, or Excel from the Dataset tab in Apify Console.

### Example product record

```json
{
  "productId": 110032,
  "name": "BEARNAISE SAUCE",
  "subtitle": "66 GR. / KNORR",
  "unitSize": 66,
  "unitMeasure": "gr",
  "brand": "KNORR",
  "description": "BEARNAISE SAUCE KNORR\n\nVarenummer: 110032",
  "ingredients": "Stivelse (kartoffel-/potet- og risstivelse), vegetabilske olier/oljer (solsikke- og majsolie/olje), salt, MÆLKEPROTEIN/MELKEPROTEIN, HVEDEMEL/HVETEMEL, løg/løk¹, gær-/gjærekstrakt, MÆLKESUKKER/MELKESU...",
  "nutrition": [
    {
      "name": "Energi",
      "value": "562 KJ / 135 kcal",
      "sort": 1
    },
    {
      "name": "Fedt",
      "value": "11",
      "sort": 2
    },
    {
      "name": "Heraf mættede fedtsyrer",
      "value": "2,6",
      "sort": 3
    },
    {
      "name": "Kulhydrat",
      "value": "6,2",
      "sort": 4
    },
    {
      "name": "Heraf sukkerarter",
      "value": "3,9",
      "sort": 5
    },
    {
      "name": "Kostfibre",
      "value": "< 0,5",
      "sort": 6
    },
    {
      "name": "Protein",
      "value": "3,1",
      "sort": 7
    },
    {
      "name": "Salt",
      "value": "0,64",
      "sort": 8
    }
  ],
  "barcodes": [
    "8710604750950"
  ],
  "countryOfOrigin": "",
  "categoryId": 5243030,
  "categoryName": "Saucer & nem mad",
  "manufacturer": null,
  "warnings": [],
  "itemDisclaimer": null,
  "url": "https://shop.rema1000.dk/varer/110032",
  "portalUrl": "https://shop.rema1000.dk/varer/110032",
  "source": "rema1000.dk",
  "departmentId": 80,
  "departmentName": "Kolonial",
  "price": 10,
  "currency": "DKK",
  "isCampaign": true,
  "isAdvertised": true,
  "priceValidFrom": null,
  "priceValidUntil": "2026-05-16",
  "deposit": null,
  "compareUnit": "kg",
  "comparePrice": 151.52,
  "maxQuantity": 6,
  "priceOverMaxQuantity": 25.95,
  "priceHistory": [
    {
      "price": 10,
      "isCampaign": true,
      "isAdvertised": true,
      "startingAt": null,
      "endingAt": "2026-05-16",
      "deposit": null,
      "compareUnit": "kg",
      "comparePrice": 151.52,
      "maxQuantity": 6,
      "priceOverMaxQuantity": 25.95
    }
  ],
  "imageUrl": "https://rema-product-images.digital.rema1000.dk/110032/1-medium-nDPWJ6gK9W.webp",
  "images": [
    {
      "small": "https://rema-product-images.digital.rema1000.dk/110032/1-small-nDPWJ6gK9W.webp",
      "medium": "https://rema-product-images.digital.rema1000.dk/110032/1-medium-nDPWJ6gK9W.webp",
      "large": "https://rema-product-images.digital.rema1000.dk/110032/1-large-nDPWJ6gK9W.webp"
    }
  ],
  "labels": [
    {
      "id": null,
      "name": "on_discount",
      "image": null
    },
    {
      "id": null,
      "name": "avisvare",
      "image": null
    },
    {
      "id": null,
      "name": "discount",
      "image": null
    }
  ],
  "ageLimit": null,
  "hazardStatements": [],
  "temperatureZone": null,
  "isSelfScale": false,
  "isWeightItem": false,
  "isBatchItem": false,
  "isAvailableInAllStores": true,
  "scrapedAt": "2026-05-15T17:31:16.660Z",
  "contentHash": "681a8282c50048793ab21c355da5c1e639407764e1ae00e6fb090e7a89f002c7",
  "changeType": null,
  "firstSeenAt": null,
  "lastSeenAt": null,
  "expiredAt": null,
  "isRepost": false,
  "repostOfId": null,
  "repostDetectedAt": null,
  "discountPercent": 61.5,
  "quantity": null,
  "lineSubtotal": null,
  "lineDeposit": null,
  "lineSavings": null,
  "lineTotal": null
}
```

### Incremental fields

When `incremental: true`, each record also carries:

- `changeType` — one of `NEW`, `UPDATED`, `UNCHANGED`, `REAPPEARED`, `EXPIRED`. Default output covers `NEW` / `UPDATED` / `REAPPEARED`; set `emitUnchanged: true` or `emitExpired: true` to opt into the others.
- `firstSeenAt`, `lastSeenAt` — ISO-8601 timestamps tracking the listing across runs.

### How to scrape rema1000.dk

1. Go to [REMA 1000 Scraper](https://apify.com/blackfalcondata/rema1000-scraper?fpr=1h3gvi) in Apify Console.
2. Enter a search keyword.
3. Set `maxResults` to control how many results you need.
4. Click **Start** and wait for the run to finish.
5. Export the dataset as JSON, CSV, or Excel.

### Use cases

- Extract product data from rema1000.dk for market research and competitive analysis.
- Track pricing trends across regions and categories over time.
- Monitor new and changed listings on scheduled runs without processing the full dataset every time.
- Feed structured data into AI agents, MCP tools, and automated pipelines using compact mode.
- Export clean, structured data to dashboards, spreadsheets, or data warehouses.

### How much does it cost to scrape rema1000.dk?

REMA 1000 Scraper uses [pay-per-event](https://docs.apify.com/platform/actors/paid-actors/pay-per-event) pricing. You pay a small fee when the run starts and then for each result that is actually produced.

- **Run start:** $0.01 per run
- **Per result:** $0.004 per product record

Example costs:

- 10 results: **$0.05**
- 100 results: **$0.41**
- 500 results: **$2.01**

#### Example: recurring monitoring savings

These examples compare full re-scrapes with incremental runs at different churn rates. Churn is the share of listings that are new or whose tracked content changed since the previous run. Actual churn depends on your query breadth, source activity, and polling frequency — the scenarios below are examples, not predictions.

Example setup: 250 results per run, daily polling (30 runs/month). Event-pricing examples scale linearly with result count.

| Churn rate | Full re-scrape run cost | Incremental run cost | Savings vs full re-scrape | Monthly cost after baseline |
|---|---:|---:|---:|---:|
| 5% — stable niche query | $1.01 | $0.06 | $0.95 (94%) | $1.80 |
| 15% — moderate broad query | $1.01 | $0.16 | $0.85 (84%) | $4.80 |
| 30% — high-volume aggregator | $1.01 | $0.31 | $0.70 (69%) | $9.30 |

Full re-scrape monthly cost at daily polling: $30.30. First month with incremental costs $2.75 / $5.65 / $10.00 for the 5% / 15% / 30% scenarios because the first run builds baseline state at full cost before incremental savings apply.

Platform usage (compute and proxies) is billed separately by Apify based on actual consumption. Incremental runs consume less on result processing, though fixed per-run overhead stays the same.

### FAQ

#### How many results can I get from rema1000.dk?

The number of results depends on the search query and available listings on rema1000.dk. Use the `maxResults` parameter to control how many results are returned per run.

#### Does REMA 1000 Scraper support recurring monitoring?

Yes. Enable incremental mode to only receive new or changed listings on subsequent runs. This is ideal for scheduled monitoring where you want to track changes over time without re-processing the full dataset.

#### Can I integrate REMA 1000 Scraper with other apps?

Yes. REMA 1000 Scraper works with Apify's [integrations](https://apify.com/integrations?fpr=1h3gvi) to connect with tools like Zapier, Make, Google Sheets, Slack, and more. You can also use webhooks to trigger actions when a run completes.

#### Can I use REMA 1000 Scraper with the Apify API?

Yes. You can start runs, manage inputs, and retrieve results programmatically through the [Apify API](https://docs.apify.com/api/v2). Client libraries are available for JavaScript, Python, and other languages.

#### Can I use REMA 1000 Scraper through an MCP Server?

Yes. Apify provides an [MCP Server](https://apify.com/apify/actors-mcp-server?fpr=1h3gvi) that lets AI assistants and agents call this actor directly. Use compact mode and `descriptionMaxLength` to keep payloads manageable for LLM context windows.

#### Is it legal to scrape rema1000.dk?

This actor extracts publicly available data from rema1000.dk. Web scraping of public information is generally considered legal, but you should always review the target site's terms of service and ensure your use case complies with applicable laws and regulations, including GDPR where relevant.

#### Your feedback

If you have questions, need a feature, or found a bug, please [open an issue](https://apify.com/blackfalcondata/rema1000-scraper/issues?fpr=1h3gvi) on the actor's page in Apify Console. Your feedback helps us improve.

### You might also like

- [Actiris Brussels Job Scraper](https://apify.com/blackfalcondata/actiris-scraper?fpr=1h3gvi) — Scrape all active job listings from actiris.brussels — official Brussels public employment service..
- [Adzuna Job Scraper — Global Jobs with Salary & Coordinates](https://apify.com/blackfalcondata/adzuna-scraper?fpr=1h3gvi) — Scrape adzuna.com job listings across 19 country markets with structured salary data.
- [APEC.fr Scraper - French Executive Jobs](https://apify.com/blackfalcondata/apec-scraper?fpr=1h3gvi) — Scrape apec.fr - French executive job listings with salary ranges, company, location, skills,.
- [Arbeitsagentur Jobs Feed — German Federal Employment Agency](https://apify.com/blackfalcondata/arbeitsagentur-jobs-feed?fpr=1h3gvi) — Extract job listings from arbeitsagentur.de — Germany's official public employment portal with 1M+.
- [Arbeitsagentur Scraper - German Jobs](https://apify.com/blackfalcondata/arbeitsagentur-scraper?fpr=1h3gvi) — Scrape arbeitsagentur.de - Germany’s official employment portal with 1M+ listings. Contact data,.
- [Arbetsformedlingen Job Scraper](https://apify.com/blackfalcondata/arbetsformedlingen-scraper?fpr=1h3gvi) — Scrape arbetsformedlingen.se (Platsbanken) — Sweden's official employment portal. Returns 84.
- [AutoScout24 Scraper — European Car Listings with Dealer Data](https://apify.com/blackfalcondata/autoscout24-scraper?fpr=1h3gvi) — Scrape autoscout24.com - Europe's largest used car marketplace with 770K+ listings. Structured.
- [Bayt.com Scraper — MENA Jobs with Salary & Skills Filter](https://apify.com/blackfalcondata/bayt-scraper?fpr=1h3gvi) — Scrape bayt.com — the leading Middle East job board covering UAE, Saudi Arabia, Qatar, Egypt.

### Getting started with Apify

New to Apify? [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=1h3gvi) — no credit card required.

1. Sign up — $5 platform credit included
2. Open this actor and configure your input
3. Click **Start** — export results as JSON, CSV, or Excel

Need more later? [See Apify pricing](https://apify.com/pricing?fpr=1h3gvi).

# Actor input Schema

## `productIds` (type: `array`):

Track a specific list of products by REMA product ID (e.g. \[100004, 60214]). Combine with Incremental Mode to monitor your weekly shopping basket — only get rows back when something changes (price, campaign, delisting). When using Incremental Mode you MUST also set 🔑 State Key (e.g. "my-rema-basket") so multiple baskets don't share state.

## `productUrls` (type: `array`):

Same as productIds but accepts full rema1000.dk product URLs — paste straight from your browser. URLs are merged with productIds.

## `quantities` (type: `object,array,string`):

Optional per-product counts for basket mode. Two accepted forms:
• Object map: `{"200313": 12, "21464": 6}` (compact, preferred for the Console editor)
• Array of pairs: `[{"id": 200313, "qty": 12}, {"id": 21464, "qty": 6}]` (useful from Make / n8n / Zapier flows)
Drives subtotal, pant, and max-quantity (over-cap) campaign math. Default qty=1 per product when omitted. Ignored outside basket mode.

## `query` (type: `string`):

Keyword search across the REMA catalog (e.g. "mælk", "økologisk", "pasta"). Searches product names, brands, and category terms (NOT ingredients — use ingredientFilter for that). Mutually exclusive with productIds — if both are set, productIds wins.

## `sortBy` (type: `string`):

Optional ordering of the dataset rows before push. Default is source-emit order (department + page for browse, search-relevance for search, basket-input order for basket). Null / missing values always sink to the bottom regardless of direction.

## `ingredientFilter` (type: `string`):

Case-insensitive substring filter on the ingredient list. Accepts a single ingredient (`"pistacie"`) or a JSON array of terms that ALL must appear (`["pistacie", "hasselnød"]` finds products containing both). Useful for finding products with a specific ingredient that isn't in the product name — REMA's search doesn't index ingredients, so this filter runs over fetched results. Narrow your scope with `query` or `productIds` first to keep cost down. Only meaningful in basket and search modes.

## `departmentIds` (type: `array`):

Limit scraping to specific departments. Leave empty to scrape all 15 departments. IDs: 10 Brød, 20 Frugt & grønt, 30 Kød/fisk/fjerkræ, 40 Køl, 50 Frost, 60 Mejeri, 70 Ost, 80 Kolonial, 90 Drikkevarer, 100 Husholdning, 110 Baby, 120 Pleje, 130 Slik, 140 Kiosk, 160 Nemt & hurtigt.

## `campaignsOnly` (type: `boolean`):

Only emit products currently on campaign (tilbud) or in the advertised flyer.

## `maxResults` (type: `integer`):

Maximum total products (0 = unlimited; full catalog ≈ 3,900).

## `compact` (type: `boolean`):

Drop heavy fields (ingredients text, full nutrition table, image variants, manufacturer block, label objects, price history) and keep only shopping essentials: identity, current price, per-unit price, campaign flags + discount %, basket line totals, lifecycle. Cuts each row from ~2,850 chars to ~650 (77% smaller) — ideal for AI agents, MCP servers, and LLM context windows where you don't need allergen text. Same dataset row count, same billing — just smaller payloads.

## `incrementalMode` (type: `boolean`):

Compare against previous run state — emit only NEW / UPDATED / EXPIRED products (price changes, new SKUs, delistings). In Basket mode you MUST also set stateKey. In Search / Browse modes stateKey auto-generates from the scope dimensions.

## `stateKey` (type: `string`):

Stable identifier for incremental tracking. In Basket mode this is REQUIRED — choose a name like "my-rema-basket" so multiple baskets don't share state. In Search / Browse mode it auto-generates from your query / departments if left empty.

## `emitUnchanged` (type: `boolean`):

Include products with no detected change in the output (default: skip).

## `emitExpired` (type: `boolean`):

Include products that disappeared since the prior run (delisted / out of stock).

## `telegramToken` (type: `string`):

Telegram bot token (from @BotFather). Required for Telegram notifications.

## `telegramChatId` (type: `string`):

Telegram chat or channel ID (e.g. "-100123456789"). Required when telegramToken is set.

## `discordWebhookUrl` (type: `string`):

Discord incoming webhook URL.

## `slackWebhookUrl` (type: `string`):

Slack incoming webhook URL.

## `notificationLimit` (type: `integer`):

Maximum number of products included in each notification message (1–20).

## `notifyOnlyChanges` (type: `boolean`):

When Incremental Mode is on, only send notifications for products with a lifecycle event (NEW, UPDATED, or EXPIRED). Unchanged products are still pushed to the dataset but don't trigger Telegram/Slack/Discord/WhatsApp alerts.

## `whatsappAccessToken` (type: `string`):

WhatsApp Cloud API token. Recipient must have messaged the business number within last 24h.

## `whatsappPhoneNumberId` (type: `string`):

Your WhatsApp Business phone-number ID.

## `whatsappTo` (type: `string`):

Recipient phone in E.164 format without + (e.g. "4512345678").

## `webhookUrl` (type: `string`):

Receives a JSON POST with {metadata, items} after each run.

## `webhookHeaders` (type: `object`):

Optional JSON object of custom headers (e.g. {"Authorization":"Bearer ..."}).

## `descriptionFormat` (type: `string`):

Pick a single description representation. `all` keeps the description as-is; `text` returns only the plain-text variant.

## `excludeEmptyFields` (type: `boolean`):

Drop null, empty-string, and empty-array fields from each record before push. Smaller payloads for AI agents and dashboards.

## Actor input object example

```json
{
  "productIds": [],
  "productUrls": [],
  "quantities": {},
  "query": "mælk",
  "departmentIds": [],
  "campaignsOnly": false,
  "maxResults": 50,
  "compact": false,
  "incrementalMode": false,
  "emitUnchanged": false,
  "emitExpired": false,
  "notificationLimit": 5,
  "notifyOnlyChanges": false,
  "descriptionFormat": "all",
  "excludeEmptyFields": false
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "query": "mælk",
    "maxResults": 50,
    "compact": false,
    "descriptionFormat": "all",
    "excludeEmptyFields": false
};

// Run the Actor and wait for it to finish
const run = await client.actor("blackfalcondata/rema1000-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "query": "mælk",
    "maxResults": 50,
    "compact": False,
    "descriptionFormat": "all",
    "excludeEmptyFields": False,
}

# Run the Actor and wait for it to finish
run = client.actor("blackfalcondata/rema1000-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "query": "mælk",
  "maxResults": 50,
  "compact": false,
  "descriptionFormat": "all",
  "excludeEmptyFields": false
}' |
apify call blackfalcondata/rema1000-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=blackfalcondata/rema1000-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "REMA 1000 Scraper - Danish Grocery Products & Prices",
        "description": "Scrape rema1000.dk — Denmark's largest discount-grocery chain with 3900 SKUs. Search products by ingredient, monitor tilbud price changes, and track weekly basket costs over time for grocery planning and price alerts.",
        "version": "0.6",
        "x-build-id": "pcdkF5LAWjEPOEl2Z"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/blackfalcondata~rema1000-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-blackfalcondata-rema1000-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/blackfalcondata~rema1000-scraper/runs": {
            "post": {
                "operationId": "runs-sync-blackfalcondata-rema1000-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/blackfalcondata~rema1000-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-blackfalcondata-rema1000-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "productIds": {
                        "title": "🛍️ Basket — product IDs",
                        "type": "array",
                        "description": "Track a specific list of products by REMA product ID (e.g. [100004, 60214]). Combine with Incremental Mode to monitor your weekly shopping basket — only get rows back when something changes (price, campaign, delisting). When using Incremental Mode you MUST also set 🔑 State Key (e.g. \"my-rema-basket\") so multiple baskets don't share state.",
                        "items": {
                            "type": "string"
                        },
                        "default": []
                    },
                    "productUrls": {
                        "title": "🔗 Basket — product URLs",
                        "type": "array",
                        "description": "Same as productIds but accepts full rema1000.dk product URLs — paste straight from your browser. URLs are merged with productIds.",
                        "items": {
                            "type": "string"
                        },
                        "default": []
                    },
                    "quantities": {
                        "title": "🔢 Basket — quantities per product",
                        "description": "Optional per-product counts for basket mode. Two accepted forms:\n• Object map: `{\"200313\": 12, \"21464\": 6}` (compact, preferred for the Console editor)\n• Array of pairs: `[{\"id\": 200313, \"qty\": 12}, {\"id\": 21464, \"qty\": 6}]` (useful from Make / n8n / Zapier flows)\nDrives subtotal, pant, and max-quantity (over-cap) campaign math. Default qty=1 per product when omitted. Ignored outside basket mode.",
                        "default": {}
                    },
                    "query": {
                        "title": "🔍 Search query",
                        "type": "string",
                        "description": "Keyword search across the REMA catalog (e.g. \"mælk\", \"økologisk\", \"pasta\"). Searches product names, brands, and category terms (NOT ingredients — use ingredientFilter for that). Mutually exclusive with productIds — if both are set, productIds wins."
                    },
                    "sortBy": {
                        "title": "↕️ Sort order",
                        "enum": [
                            "price-asc",
                            "price-desc",
                            "compare-asc",
                            "compare-desc",
                            "line-asc",
                            "line-desc",
                            "savings-desc",
                            "discount-desc",
                            "name-asc"
                        ],
                        "type": "string",
                        "description": "Optional ordering of the dataset rows before push. Default is source-emit order (department + page for browse, search-relevance for search, basket-input order for basket). Null / missing values always sink to the bottom regardless of direction."
                    },
                    "ingredientFilter": {
                        "title": "🥗 Ingredient filter",
                        "type": "string",
                        "description": "Case-insensitive substring filter on the ingredient list. Accepts a single ingredient (`\"pistacie\"`) or a JSON array of terms that ALL must appear (`[\"pistacie\", \"hasselnød\"]` finds products containing both). Useful for finding products with a specific ingredient that isn't in the product name — REMA's search doesn't index ingredients, so this filter runs over fetched results. Narrow your scope with `query` or `productIds` first to keep cost down. Only meaningful in basket and search modes."
                    },
                    "departmentIds": {
                        "title": "🛒 Department IDs",
                        "type": "array",
                        "description": "Limit scraping to specific departments. Leave empty to scrape all 15 departments. IDs: 10 Brød, 20 Frugt & grønt, 30 Kød/fisk/fjerkræ, 40 Køl, 50 Frost, 60 Mejeri, 70 Ost, 80 Kolonial, 90 Drikkevarer, 100 Husholdning, 110 Baby, 120 Pleje, 130 Slik, 140 Kiosk, 160 Nemt & hurtigt.",
                        "items": {
                            "type": "string"
                        },
                        "default": []
                    },
                    "campaignsOnly": {
                        "title": "🏷️ Campaign / Advertised Only",
                        "type": "boolean",
                        "description": "Only emit products currently on campaign (tilbud) or in the advertised flyer.",
                        "default": false
                    },
                    "maxResults": {
                        "title": "💯 Max Results",
                        "minimum": 0,
                        "maximum": 10000,
                        "type": "integer",
                        "description": "Maximum total products (0 = unlimited; full catalog ≈ 3,900).",
                        "default": 200
                    },
                    "compact": {
                        "title": "📦 Compact Output",
                        "type": "boolean",
                        "description": "Drop heavy fields (ingredients text, full nutrition table, image variants, manufacturer block, label objects, price history) and keep only shopping essentials: identity, current price, per-unit price, campaign flags + discount %, basket line totals, lifecycle. Cuts each row from ~2,850 chars to ~650 (77% smaller) — ideal for AI agents, MCP servers, and LLM context windows where you don't need allergen text. Same dataset row count, same billing — just smaller payloads.",
                        "default": false
                    },
                    "incrementalMode": {
                        "title": "♻️ Incremental Mode",
                        "type": "boolean",
                        "description": "Compare against previous run state — emit only NEW / UPDATED / EXPIRED products (price changes, new SKUs, delistings). In Basket mode you MUST also set stateKey. In Search / Browse modes stateKey auto-generates from the scope dimensions.",
                        "default": false
                    },
                    "stateKey": {
                        "title": "🔑 State Key",
                        "type": "string",
                        "description": "Stable identifier for incremental tracking. In Basket mode this is REQUIRED — choose a name like \"my-rema-basket\" so multiple baskets don't share state. In Search / Browse mode it auto-generates from your query / departments if left empty."
                    },
                    "emitUnchanged": {
                        "title": "📤 Emit Unchanged",
                        "type": "boolean",
                        "description": "Include products with no detected change in the output (default: skip).",
                        "default": false
                    },
                    "emitExpired": {
                        "title": "🚮 Emit Expired",
                        "type": "boolean",
                        "description": "Include products that disappeared since the prior run (delisted / out of stock).",
                        "default": false
                    },
                    "telegramToken": {
                        "title": "🔑 Telegram Bot Token",
                        "type": "string",
                        "description": "Telegram bot token (from @BotFather). Required for Telegram notifications."
                    },
                    "telegramChatId": {
                        "title": "💬 Telegram Chat ID",
                        "type": "string",
                        "description": "Telegram chat or channel ID (e.g. \"-100123456789\"). Required when telegramToken is set."
                    },
                    "discordWebhookUrl": {
                        "title": "🎮 Discord Webhook URL",
                        "type": "string",
                        "description": "Discord incoming webhook URL."
                    },
                    "slackWebhookUrl": {
                        "title": "💼 Slack Webhook URL",
                        "type": "string",
                        "description": "Slack incoming webhook URL."
                    },
                    "notificationLimit": {
                        "title": "📊 Max Items Per Notification",
                        "minimum": 1,
                        "maximum": 20,
                        "type": "integer",
                        "description": "Maximum number of products included in each notification message (1–20).",
                        "default": 5
                    },
                    "notifyOnlyChanges": {
                        "title": "🔄 Notify Only Changes",
                        "type": "boolean",
                        "description": "When Incremental Mode is on, only send notifications for products with a lifecycle event (NEW, UPDATED, or EXPIRED). Unchanged products are still pushed to the dataset but don't trigger Telegram/Slack/Discord/WhatsApp alerts.",
                        "default": false
                    },
                    "whatsappAccessToken": {
                        "title": "📱 WhatsApp Access Token",
                        "type": "string",
                        "description": "WhatsApp Cloud API token. Recipient must have messaged the business number within last 24h."
                    },
                    "whatsappPhoneNumberId": {
                        "title": "📞 WhatsApp Phone Number ID",
                        "type": "string",
                        "description": "Your WhatsApp Business phone-number ID."
                    },
                    "whatsappTo": {
                        "title": "📲 WhatsApp Recipient",
                        "type": "string",
                        "description": "Recipient phone in E.164 format without + (e.g. \"4512345678\")."
                    },
                    "webhookUrl": {
                        "title": "🪝 Generic Webhook URL",
                        "type": "string",
                        "description": "Receives a JSON POST with {metadata, items} after each run."
                    },
                    "webhookHeaders": {
                        "title": "📋 Webhook Headers",
                        "type": "object",
                        "description": "Optional JSON object of custom headers (e.g. {\"Authorization\":\"Bearer ...\"})."
                    },
                    "descriptionFormat": {
                        "title": "Description format",
                        "enum": [
                            "all",
                            "text"
                        ],
                        "type": "string",
                        "description": "Pick a single description representation. `all` keeps the description as-is; `text` returns only the plain-text variant.",
                        "default": "all"
                    },
                    "excludeEmptyFields": {
                        "title": "Exclude empty fields from output",
                        "type": "boolean",
                        "description": "Drop null, empty-string, and empty-array fields from each record before push. Smaller payloads for AI agents and dashboards.",
                        "default": false
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
