# Colruyt Supermarket BE (`harvestedge/colruyt-supermarket-be`) Actor

The Colruyt Belgium Product Scraper extracts structured grocery product data directly from the Colruyt Belgium webshop (`colruyt.be`), including brand, name, GTIN, price, and promotion.

- **URL**: https://apify.com/harvestedge/colruyt-supermarket-be.md
- **Developed by:** [Harvest Edge](https://apify.com/harvestedge) (community)
- **Categories:** E-commerce, Lead generation, Automation
- **Stats:** 2 total users, 1 monthly user, 100.0% runs succeeded
- **User rating**: No ratings yet

## Pricing

from $10.00 / 1,000 results

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event
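
As a rough budgeting aid, the sketch below estimates the cost of a run from the listed starting price of $10.00 per 1,000 results. It assumes that each dataset item counts as one billed event at that price and that no other events are charged; both are assumptions, and `estimate_cost_usd` is a hypothetical helper rather than part of any Apify library.

```python
from decimal import Decimal

# Listed starting price from the Pricing section: $10.00 per 1,000 results.
PRICE_PER_1000_RESULTS_USD = Decimal("10.00")


def estimate_cost_usd(result_count: int) -> Decimal:
    """Estimate the charge for `result_count` results (hypothetical helper).

    Assumes one billed event per result at the listed starting price.
    """
    if result_count < 0:
        raise ValueError("result_count must be non-negative")
    cost = PRICE_PER_1000_RESULTS_USD * result_count / 1000
    # Round to whole cents for display.
    return cost.quantize(Decimal("0.01"))


if __name__ == "__main__":
    # Estimate for a run that returns the default maximum of 250 products.
    print(estimate_cost_usd(250))
```

Actual charges depend on the events the Actor defines, so treat the result as an estimate only.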

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows (PowerShell)
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🚜

🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽🌽

### Harvest Edge / Colruyt Belgium Product Scraper

🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱

🌽🌽🌽🌽🍆🌽🌽🌽🌽🌽🌽🌽🚜🌱🌱🌱🌱🌱🌱🌱🌱🌱🌱

Harvest Edge makes business information available to everyone!

---

### Overview

Suggested residential proxy countries are BE, FR, NL, and DE.

The Colruyt Belgium Product Scraper extracts structured grocery product data directly from the Colruyt Belgium webshop (`colruyt.be`).

The Actor performs product searches on Colruyt Belgium and collects detailed ecommerce product information from dynamically rendered product cards.

The scraper uses Playwright with realistic browser behavior simulation, residential proxies, randomized scrolling, and anti-bot mitigation techniques to improve scraping reliability.

This Actor is ideal for:

- Grocery price comparison
- FMCG market research
- Retail intelligence
- Promotion monitoring
- Ecommerce product collection
- Product catalog enrichment
- Supermarket assortment analysis
- Competitive pricing analysis

Keywords:

colruyt scraper, colruyt belgium scraper, colruyt.be scraper, belgium grocery scraper, supermarket scraper, grocery ecommerce scraper, retail scraping, FMCG data extraction, supermarket product scraper, grocery price scraper, ecommerce grocery data, retail intelligence, belgian supermarket data, promotion monitoring, online grocery scraper, product catalog scraper

boodschappen scraper, supermarkt scraper, colruyt data, prijsvergelijking, retail data extractie, supermarkt prijzen België, ecommerce scraping, FMCG analyse, promotie tracking, supermarkt assortiment analyse

scraper supermarché, scraping produits alimentaires, extraction données ecommerce, scraping prix produits, données retail Belgique, comparaison de prix, scraping FMCG, promotions supermarché, données produits alimentaires

---

### Features

- Search-based product scraping
- Residential proxy support
- Cookie banner handling
- Dynamic product loading support
- Automatic "load more" pagination
- Product deduplication
- Structured JSON dataset output

---

### Input

The Actor accepts the following input parameters:

| Key              | Type    | Description                                              | Default | Required |
|------------------|---------|----------------------------------------------------------|---------|----------|
| `search_term`    | String  | Product search query (e.g. `"cola"`, `"pizza"`)         | cola    | Yes      |
| `max_products`   | Integer | Maximum number of products to scrape                     | 250     | No       |
| `proxy_country`  | String  | Residential proxy country code                           | BE      | No       |

---

### Example Input

```json
{
  "search_term": "cola",
  "max_products": 100,
  "proxy_country": "BE"
}
````
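
To catch mistakes before a run is started, the input can be checked on the client side. The sketch below builds an input object from the three documented parameters. `build_input` is a hypothetical helper; the defaults follow the Actor's input schema (`250` and `"BE"`), and the two-letter check on `proxy_country` is an assumption about what the Actor accepts.

```python
from typing import Any


def build_input(
    search_term: str,
    max_products: int = 250,
    proxy_country: str = "BE",
) -> dict[str, Any]:
    """Build an input object for the Actor (hypothetical helper)."""
    term = search_term.strip()
    if not term:
        # `search_term` is the only required field.
        raise ValueError("search_term is required and must not be empty")
    if max_products < 1:
        raise ValueError("max_products must be a positive integer")
    country = proxy_country.strip().upper()
    # Assumption: country codes are two letters, as in BE, FR, NL, DE.
    if len(country) != 2 or not country.isalpha():
        raise ValueError("proxy_country must be a two-letter country code")
    return {
        "search_term": term,
        "max_products": max_products,
        "proxy_country": country,
    }


if __name__ == "__main__":
    print(build_input("cola", max_products=100, proxy_country="be"))
```

The returned dictionary has the same shape as the example input and can be passed as the run input to the Python client.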

***

### Output

The Actor outputs a dataset in JSON format with the following fields:

| Field                         | Type     | Description |
|--------------------------------|----------|-------------|
| `store`                        | String   | Store name ("Colruyt Belgium") |
| `search_term`                  | String   | Product search query |
| `position`                     | Integer  | Product position in search results |
| `product_id`                   | String   | Colruyt internal product ID |
| `technical_article_number`     | String   | Technical article identifier |
| `retail_product_number`        | String   | Retail product number |
| `brand`                        | String   | Product brand |
| `name`                         | String   | Product short name |
| `full_name`                    | String   | Full product name |
| `package_info`                 | String   | Packaging / quantity information |
| `price`                        | Number   | Product price in euros |
| `price_per_unit`               | String   | Unit price information |
| `promotion`                    | String   | Promotion label |
| `promotion_type`               | String   | Promotion type |
| `availability`                 | String   | Product availability |
| `nutriscore`                   | String   | Nutri-Score label |
| `country_of_origin`            | String   | Country of origin |
| `gtin`                         | Array    | GTIN / barcode identifiers |
| `image`                        | String   | Product image URL |
| `url`                          | String   | Full product URL |
| `scraped_at`                   | String   | UTC timestamp of scraping |

***

### Example Output

```json
{
  "store": "Colruyt Belgium",
  "search_term": "cola",
  "position": 1,
  "product_id": "307307",
  "technical_article_number": "123456",
  "retail_product_number": "987654",
  "brand": "Coca-Cola",
  "name": "Zero Sugar",
  "full_name": "Coca-Cola Zero Sugar 6 x 1.5 L",
  "package_info": "6 x 1.5 L",
  "price": 8.49,
  "price_per_unit": "€0.94/L",
  "promotion": "Promo",
  "promotion_type": "discount",
  "availability": "available",
  "nutriscore": "B",
  "country_of_origin": "Belgium",
  "gtin": [
    "5449000000996"
  ],
  "image": "https://images.colruyt.be/product.jpg",
  "url": "https://www.colruyt.be/nl/producten/frisdrank/cola",
  "scraped_at": "2026-05-11T21:00:00+00:00"
}
```
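
Because `price_per_unit` is a string, comparing unit prices across products requires parsing it first. The sketch below handles values shaped like the example (`"€0.94/L"`); the pattern is inferred from that single example, so other formats may occur and are returned as `None`. `parse_price_per_unit` is a hypothetical helper.

```python
import re
from typing import Optional

# Matches strings shaped like the example value "€0.94/L".
# Assumption: a euro sign, a decimal amount, a slash, then the unit.
_UNIT_PRICE = re.compile(r"^€\s*(\d+(?:[.,]\d+)?)\s*/\s*(\S+)$")


def parse_price_per_unit(value: str) -> Optional[tuple[float, str]]:
    """Split a unit price string into (amount, unit), or return None."""
    match = _UNIT_PRICE.match(value.strip())
    if match is None:
        return None
    # Accept a decimal comma as well as a decimal point.
    amount = float(match.group(1).replace(",", "."))
    return amount, match.group(2)


if __name__ == "__main__":
    item = {"name": "Zero Sugar", "price": 8.49, "price_per_unit": "€0.94/L"}
    print(parse_price_per_unit(item["price_per_unit"]))
```

In the example item, 8.49 euros for 6 x 1.5 L (9 L) gives roughly 0.94 euros per litre, which matches the `price_per_unit` value.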

***

### Notes

- Product prices and availability may change frequently.
- Promotions are only included when visible on the product card.
- Colruyt may update its website structure at any time, which can require scraper updates.
- Some products may not appear due to stock availability or regional assortment differences.
- Duplicate products are filtered automatically.
- Residential proxies are strongly recommended for stable scraping.
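
The Actor filters duplicates within a run, but when datasets from several runs are combined (for example, runs with different search terms), the same product may appear more than once. The sketch below shows one way to deduplicate on the client side. It is not the Actor's internal logic; `dedupe_products` is a hypothetical helper, and using `product_id` as the key is an assumption.

```python
from typing import Any, Iterable


def dedupe_products(items: Iterable[dict[str, Any]]) -> list[dict[str, Any]]:
    """Keep the first item seen for each `product_id`, preserving order."""
    seen: set[str] = set()
    unique: list[dict[str, Any]] = []
    for item in items:
        key = item.get("product_id")
        if key is None:
            # Without an ID the item cannot be matched, so keep it.
            unique.append(item)
            continue
        if key in seen:
            continue
        seen.add(key)
        unique.append(item)
    return unique


if __name__ == "__main__":
    merged = [
        {"product_id": "307307", "search_term": "cola"},
        {"product_id": "307307", "search_term": "zero sugar"},
        {"product_id": "111111", "search_term": "zero sugar"},
    ]
    print(len(dedupe_products(merged)))
```

Keeping the first occurrence means the `search_term` and `position` of the earliest run are retained for each product.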

***

### Support

Feel free to contact Harvest Edge at combineharvesterdata@gmail.com for:

- Feature requests
- Bug reports
- Enterprise grocery scraping solutions
- Custom retail datasets
- Ecommerce data extraction projects
- Large-scale supermarket intelligence pipelines

***

# Actor input Schema

## `search_term` (type: `string`):

Product search term

## `max_products` (type: `integer`):

Maximum amount of products to scrape

## `proxy_country` (type: `string`):

Country code for residential proxy

## Actor input object example

```json
{
  "search_term": "cola",
  "max_products": 250,
  "proxy_country": "BE"
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples for JavaScript, Python, and the CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

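// Prepare Actor input (`search_term` is required by the input schema)
const input = {
    "search_term": "cola",
    "max_products": 250,
    "proxy_country": "BE"
};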

// Run the Actor and wait for it to finish
const run = await client.actor("harvestedge/colruyt-supermarket-be").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

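# Prepare the Actor input (`search_term` is required by the input schema)
run_input = {
    "search_term": "cola",
    "max_products": 250,
    "proxy_country": "BE",
}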

# Run the Actor and wait for it to finish
run = client.actor("harvestedge/colruyt-supermarket-be").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{"search_term": "cola", "max_products": 250, "proxy_country": "BE"}' |
apify call harvestedge/colruyt-supermarket-be --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=harvestedge/colruyt-supermarket-be",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Colruyt Supermarket BE",
        "description": "The Colruyt Belgium Product Scraper extracts structured grocery product data directly from the Colruyt Belgium webshop (`colruyt.be`) for brand, name, gtin, price, promotion etc.",
        "version": "0.0",
        "x-build-id": "boYNUOqmuNJxESGsD"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/harvestedge~colruyt-supermarket-be/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-harvestedge-colruyt-supermarket-be",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/harvestedge~colruyt-supermarket-be/runs": {
            "post": {
                "operationId": "runs-sync-harvestedge-colruyt-supermarket-be",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/harvestedge~colruyt-supermarket-be/run-sync": {
            "post": {
                "operationId": "run-sync-harvestedge-colruyt-supermarket-be",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "search_term"
                ],
                "properties": {
                    "search_term": {
                        "title": "Search term",
                        "type": "string",
                        "description": "Product search term",
                        "default": "cola"
                    },
                    "max_products": {
                        "title": "Maximum products",
                        "type": "integer",
                        "description": "Maximum amount of products to scrape",
                        "default": 250
                    },
                    "proxy_country": {
                        "title": "Sticky residential proxy country",
                        "type": "string",
                        "description": "Country code for residential proxy",
                        "default": "BE"
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
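
For projects that do not use a client library, the `run-sync-get-dataset-items` path from the specification above can be called directly. The sketch below only builds the request with the Python standard library, using the server URL, path, and `token` query parameter from the specification. `build_run_request` is a hypothetical helper, and sending the request is left to the caller.

```python
import json
import urllib.parse
import urllib.request

# Server URL and path taken from the OpenAPI specification.
API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "/acts/harvestedge~colruyt-supermarket-be/run-sync-get-dataset-items"


def build_run_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build a POST request that runs the Actor (hypothetical helper)."""
    # The specification passes the API token as a query parameter.
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        url=f"{API_BASE}{ACTOR_PATH}?{query}",
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    request = build_run_request("<YOUR_API_TOKEN>", {"search_term": "cola"})
    print(request.get_method(), request.full_url)
    # To send it, pass the request to urllib.request.urlopen().
```

Because this endpoint waits for the run to finish, long searches may need a generous client timeout; for those cases the asynchronous `/runs` path may be a better fit.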
