# The Warehouse NZ Scraper (`parseforge/thewarehouse-nz-scraper`) Actor

Browse The Warehouse, New Zealand's largest general retailer, and pull product name, brand, price, rating, review count, image, SKU, stock, category, and dimensions. Handy for NZ price monitoring, brand visibility checks, and competitor research across home, electronics, and apparel.

- **URL**: https://apify.com/parseforge/thewarehouse-nz-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** E-commerce, Automation, Integrations
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $7.50 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## 🛒 The Warehouse NZ Product Scraper

> 🚀 **Export The Warehouse NZ product listings in seconds. Search by keyword, filter by category, and get clean structured pricing and stock data.**

> 🕒 **Last updated:** 2026-05-29 · **📊 13 fields** per record · New Zealand's largest general merchandise retailer

The Warehouse NZ Product Scraper turns [thewarehouse.co.nz](https://www.thewarehouse.co.nz) into a clean dataset. Search by keyword or browse by category, and the actor paginates and normalizes every listing.

Coverage spans The Warehouse's full assortment: appliances, electronics, fashion, home, garden, toys, kitchen, and seasonal categories across all NZ stores.

| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| 🛍️ Retail analysts | Track NZ pricing and assortment |
| 📦 Brand managers | Monitor brand placement and shelf share |
| 💼 Wholesalers | Identify pricing gaps |
| 📊 Data teams | Feed BI dashboards with NZ retail data |

### 📋 What the The Warehouse NZ Scraper does

- Calls The Warehouse public search endpoint.
- Paginates through results up to `maxItems`.
- Normalizes name, brand, price, rating, reviews, image, SKU, stock, category, dimensions.
- Exports as CSV, Excel, JSON, JSONL, XML, RSS, or HTML.

> 💡 **Why it matters:** The Warehouse is NZ's largest retailer. Manually tracking pricing is impractical at scale.

### 🎬 Full Demo

_🚧 Coming soon._

### ⚙️ Input

<table>
<tr><th>Field</th><th>Type</th><th>Required</th><th>Description</th></tr>
<tr><td><code>query</code></td><td>string</td><td>No</td><td>Search keyword.</td></tr>
<tr><td><code>maxItems</code></td><td>integer</td><td>No</td><td>Free users: 10. Paid users: up to 1,000,000.</td></tr>
<tr><td><code>category</code></td><td>string</td><td>No</td><td>Category slug.</td></tr>
<tr><td><code>sort</code></td><td>enum</td><td>No</td><td>relevance, price-asc, price-desc, newest, rating.</td></tr>
</table>

**Example 1 — Search kettles:**
```json
{ "query": "kettle", "maxItems": 50 }
````

**Example 2 — Category browse:**

```json
{ "category": "appliances", "sort": "price-asc", "maxItems": 100 }
```

> ⚠️ **Good to Know:** Prices are NZD. The actor uses only the public search endpoint, no login required.

### 📊 Output

| Field | Type | Description |
|---|---|---|
| 🖼️ `imageUrl` | string | Primary product image. |
| 🏷️ `name` | string | Product name. |
| 🏭 `brand` | string | Brand. |
| 🔖 `sku` | string | SKU / product ID. |
| 💲 `price` | number | Current price. |
| 💱 `currency` | string | Always NZD. |
| ⭐ `rating` | number | Average rating. |
| 💬 `reviews` | number | Review count. |
| 📦 `stock` | string | Stock status. |
| 🗂️ `category` | string | Category path. |
| 📐 `dimensions` | string | Size / dimensions if listed. |
| 🔗 `url` | string | Product URL. |
| 🕒 `scrapedAt` | string | When the row was fetched. |
| ❌ `error` | string | Set on upstream errors. |

**Sample record:**

```json
{
  "imageUrl": "https://www.thewarehouse.co.nz/.../img.jpg",
  "name": "Russell Hobbs 1.7L Kettle",
  "brand": "Russell Hobbs",
  "sku": "R3010152",
  "price": 49.0,
  "currency": "NZD",
  "rating": 4.5,
  "reviews": 312,
  "stock": "in_stock",
  "category": "Appliances > Kettles",
  "dimensions": "1.7L",
  "url": "https://www.thewarehouse.co.nz/p/russell-hobbs-1-7l-kettle/R3010152",
  "scrapedAt": "2026-05-29T12:00:00.000Z",
  "error": null
}
```

### ✨ Why choose this Actor

| 🇳🇿 | Built for The Warehouse NZ. |
| 🧹 | Clean snake-cased field names ready for BI. |
| 🔌 | Search + category + sort exposed. |
| 🛟 | Errors surface as clean records. |
| 💾 | Instant CSV / Excel / JSON / XML export. |

### 📈 How it compares to alternatives

| Approach | Setup | NZD pricing | Pagination | Stock data |
|---|---|---|---|---|
| Manual browsing | hours | ✅ | ❌ | ✅ |
| Custom scraper | 1 hr+ | partial | partial | partial |
| **This Actor** | 5 sec | ✅ | ✅ | ✅ |

### 🚀 How to use

1. Click **Try for free**.
2. Enter a `query` or `category`.
3. Set `sort` and `maxItems` if needed.
4. Click **Start**.

### 💼 Business use cases

**📊 Competitive pricing.** Track competitors against The Warehouse weekly.

**🏷️ Brand monitoring.** See where your brand sits in NZ's largest retailer.

**📦 Assortment analysis.** Map category depth.

**🛍️ Pricing intelligence.** Spot promo cycles.

### 🔌 Automating The Warehouse NZ Scraper

- **Make / Zapier**: trigger daily runs, push to Sheets.
- **Cron**: Apify scheduler.
- **Webhooks**: notify Slack on completion.
- **Pipe to BigQuery / Snowflake / Postgres**.

### 🌟 Beyond business use cases

**🎓 Academic.** Study NZ retail dynamics.

**🧪 Personal.** Track wishlist items.

**🤝 Consumer advocacy.** Audit pricing claims.

**🧰 Prototyping.** Demo dataset in seconds.

### 🤖 Ask an AI assistant about this scraper

Drop this README into ChatGPT or Claude and ask for a workflow mapping.

### ❓ Frequently Asked Questions

**❓ Login required?** No, public data only.

**❓ Pricing currency?** NZD.

**❓ How fresh?** Real-time at run.

**❓ Filter by category?** Yes.

**❓ Sort by price?** Yes.

**❓ Stock status?** Yes.

**❓ Pagination?** Automatic.

**❓ Schedule runs?** Yes.

**❓ Formats?** CSV, Excel, JSON, JSONL, XML, RSS, HTML.

**❓ Official?** No, ParseForge is independent.

### 🔌 Integrate with any app

Native integrations with Make, Zapier, Slack, Discord, Google Drive, Google Sheets, Gmail, Airbyte, Keboola, Telegram, GitHub, and any REST API or webhook.

### 🔗 Recommended Actors

| Actor | What it does |
|---|---|
| [ParseForge OurAirports Scraper](https://apify.com/parseforge/ourairports-scraper) | Global airport database. |
| [ParseForge Alpha Vantage Scraper](https://apify.com/parseforge/alpha-vantage-public-scraper) | Market data, FX, crypto. |
| [ParseForge collection](https://apify.com/parseforge) | 900+ production scrapers. |

> 💡 **Pro Tip:** browse the complete [ParseForge collection](https://apify.com/parseforge) for 900+ production-grade scrapers.

***

**Disclaimer:** This actor scrapes only publicly available data. ParseForge is not affiliated with, endorsed by, or sponsored by The Warehouse. Users are responsible for complying with the site's terms of service. [Create a free account w/ $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp).

# Actor input Schema

## `query` (type: `string`):

Keyword to search The Warehouse (e.g. 'kettle', 'kids shoes').

## `maxItems` (type: `integer`):

Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000

## `category` (type: `string`):

Optional category path slug from a Warehouse URL.

## `sort` (type: `string`):

Sort by

## Actor input object example

```json
{
  "query": "kettle",
  "maxItems": 10,
  "sort": "relevance"
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "query": "kettle",
    "maxItems": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/thewarehouse-nz-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "query": "kettle",
    "maxItems": 10,
}

# Run the Actor and wait for it to finish
run = client.actor("parseforge/thewarehouse-nz-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "query": "kettle",
  "maxItems": 10
}' |
apify call parseforge/thewarehouse-nz-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/thewarehouse-nz-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "The Warehouse NZ Scraper",
        "description": "Browse The Warehouse, New Zealand's largest general retailer, and pull product name, brand, price, rating, review count, image, SKU, stock, category, and dimensions. Handy for NZ price monitoring, brand visibility checks, and competitor research across home, electronics, and apparel.",
        "version": "0.1",
        "x-build-id": "lVspkJuAyCap3LJfg"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~thewarehouse-nz-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-thewarehouse-nz-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~thewarehouse-nz-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-thewarehouse-nz-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~thewarehouse-nz-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-thewarehouse-nz-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "query": {
                        "title": "Search query",
                        "type": "string",
                        "description": "Keyword to search The Warehouse (e.g. 'kettle', 'kids shoes')."
                    },
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000"
                    },
                    "category": {
                        "title": "Category path",
                        "type": "string",
                        "description": "Optional category path slug from a Warehouse URL."
                    },
                    "sort": {
                        "title": "Sort by",
                        "enum": [
                            "relevance",
                            "price-asc",
                            "price-desc",
                            "newest",
                            "rating"
                        ],
                        "type": "string",
                        "description": "Sort by",
                        "default": "relevance"
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
