# Zoot.cz Product Scraper (`shahidirfan/zoot-cz-product-scraper`) Actor

Extract real-time product data from Zoot.cz instantly. Get prices, descriptions, images, stock status & reviews. Monitor Czech fashion market, track competitor pricing & inventory changes. Perfect for price intelligence, market research & ecommerce analytics. Structured, ETL-ready output.

- **URL**: https://apify.com/shahidirfan/zoot-cz-product-scraper.md
- **Developed by:** [Shahid Irfan](https://apify.com/shahidirfan) (community)
- **Categories:** E-commerce, Automation, Developer tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## ZOOT.cz Product Scraper

Collect detailed ZOOT.cz product data from search, category, and subcategory pages. Build clean datasets with pricing, stock status, variant data, category metadata, and image coverage for research, merchandising, and catalog monitoring.

### Features

- **Structured product coverage** — Collect product IDs, variant IDs, EANs, brand data, category data, pricing, stock labels, and media links.
- **Search and category support** — Start from a direct ZOOT.cz URL or build a search from a keyword.
- **Variant-aware output** — Preserve color and size availability for each listed product variant.
- **Deduplicated results** — Skip repeated records across pages by tracking the listing variant key.
- **Clean datasets** — Remove empty fields and skip incomplete records before they reach the dataset.

### Use Cases

#### Catalog Monitoring
Track which products are currently listed, how they are categorized, and which variants remain available. This is useful for merchandising teams that need regular catalog snapshots.

#### Pricing Analysis
Compare current price, sale price, full price, and discount values across listings. This helps with promotion monitoring and competitor benchmarking.

#### Assortment Research
Capture brand, category, color, and size availability in one dataset. This supports assortment planning and product-mix analysis.

#### Image and Content Operations
Export product and gallery image URLs for downstream content workflows. This is useful when building internal catalogs or QA reports.

### Input Parameters

| Parameter | Type | Required | Default | Description |
|-----------|------|----------|---------|-------------|
| `url` | String | No | `"https://www.zoot.cz/vyhledavani/hledani:shirt/"` | Full ZOOT.cz search or category URL. If provided, it takes priority over `keyword`. |
| `keyword` | String | No | `"shirt"` | Search term used when `url` is not provided. |
| `results_wanted` | Integer | No | `20` | Maximum number of products to save. |
| `max_pages` | Integer | No | `20` | Maximum number of listing pages to visit. |
| `proxyConfiguration` | Object | No | — | Optional Apify proxy settings for more resilient collection. |

---

### Output Data

Each dataset item contains:

| Field | Type | Description |
|-------|------|-------------|
| `product_id` | String | ZOOT.cz product identifier. |
| `variant_id` | String | Variant identifier from the product page. |
| `listing_variant_key` | String | Listing-level key used for deduplication. |
| `product_group` | String | Shared group identifier across variants. |
| `ean` | String | Product barcode when available. |
| `name` | String | Product name. |
| `brand` | String | Brand name. |
| `brand_id` | String | Brand identifier. |
| `category_path` | String | Category breadcrumb path. |
| `category_id` | String | Category identifier. |
| `category_url` | String | Category landing page URL. |
| `color` | String | Primary color for the saved variant. |
| `colors_available` | Array | Distinct color values exposed for the listing item. |
| `colors_available_text` | String | Flat comma-separated color list. |
| `sizes_available` | Array | Sizes exposed directly on the listing card. |
| `sizes_available_text` | String | Flat comma-separated size list. |
| `in_stock` | Boolean | Whether the variant is currently in stock. |
| `stock_label` | String | Availability label from the source page. |
| `currency` | String | Currency code used for price fields. |
| `current_price_czk` | Number | Current public price in CZK. |
| `sale_price_czk` | Number | Sale price in CZK when available. |
| `full_price_czk` | Number | Full price in CZK before discount. |
| `discount_amount_czk` | Number | Difference between full price and current price. |
| `discount_percent` | Number | Percentage discount when available. |
| `lowest_30d_price_czk` | Number | Lowest listed price over the last 30 days when shown on the card. |
| `image` | String | Primary image URL. |
| `images` | Array | Additional gallery image URLs. |
| `url` | String | Product URL. |
| `listing_position` | Integer | Position on the listing page. |
| `listing_page` | Integer | Listing page number where the product was found. |
| `search_keyword` | String | Keyword used for the run when applicable. |
| `source` | String | Source marketplace identifier. |

---

### Usage Examples

#### Basic Search

Collect the first 20 results for a keyword:

```json
{
    "keyword": "shirt",
    "results_wanted": 20
}
````

#### Direct Listing URL

Start from a specific listing page:

```json
{
    "url": "https://www.zoot.cz/vyhledavani/hledani:shirt/",
    "results_wanted": 30,
    "max_pages": 3
}
```

#### Proxy-Assisted Run

Use a proxy configuration for more resilient collection:

```json
{
    "keyword": "shirt",
    "results_wanted": 20,
    "proxyConfiguration": {
        "useApifyProxy": true
    }
}
```

***

### Sample Output

```json
{
    "product_id": "5045959",
    "variant_id": "3024867",
    "listing_variant_key": "598799XXBéžová|||Béžová",
    "product_group": "598799",
    "ean": "1200150331330",
    "name": "Svrchní kostkovaná košile Oxford Shirt GAP",
    "brand": "GAP",
    "brand_id": "275",
    "category_path": "Oblečení / Košile / Košile dlouhý rukáv",
    "category_id": "10",
    "category_url": "https://www.zoot.cz/panske/",
    "color": "Béžová",
    "colors_available": [
        "Béžová"
    ],
    "sizes_available": [
        "XS",
        "S",
        "M",
        "L",
        "XL",
        "XXL"
    ],
    "sizes_available_text": "XS, S, M, L, XL, XXL",
    "in_stock": true,
    "stock_label": "Skladem",
    "currency": "CZK",
    "current_price_czk": 979,
    "sale_price_czk": 809.09,
    "full_price_czk": 1399,
    "discount_amount_czk": 420,
    "discount_percent": 30,
    "lowest_30d_price_czk": 778,
    "image": "https://d010202.zoot.cz/_galerie/varianty/373/3734185-d.jpg",
    "images": [
        "https://d010202.zoot.cz/_galerie/varianty/373/3734186-d.jpg",
        "https://d010202.zoot.cz/_galerie/varianty/373/3734187-d.jpg"
    ],
    "url": "https://www.zoot.cz/panske/detail-vyrobku/598799-gap-svchni-kostkovana-kosile-oxford-shirt-gap/vse/20406:bezova-bezova/",
    "listing_position": 1,
    "listing_page": 1,
    "search_keyword": "shirt",
    "source": "zoot.cz"
}
```

***

### Tips for Best Results

#### Start Small

- Begin with `results_wanted: 20` to validate the output quickly.
- Increase the limit after confirming the listing you want is returning the right products.

#### Prefer Stable Listing URLs

- Use a direct listing URL when you already know the exact category or search page you want.
- Keep `max_pages` low during testing to reduce unnecessary requests.

#### Use Proxies When Needed

- Enable Apify Proxy if you expect repeated runs or larger result sets.
- Residential proxy groups can help if the source becomes less tolerant of repeated requests.

***

### Integrations

Connect your data with:

- **Google Sheets** — Export pricing and assortment data for quick review.
- **Airtable** — Build searchable product databases for internal teams.
- **Make** — Route updated product datasets into automated workflows.
- **Zapier** — Trigger actions when new products or pricing changes appear.
- **Webhooks** — Send each run into your own systems for downstream processing.

#### Export Formats

- **JSON** — For downstream apps and APIs.
- **CSV** — For spreadsheet work and reporting.
- **Excel** — For business review and stakeholder sharing.
- **XML** — For systems that require structured exchange formats.

***

### Frequently Asked Questions

#### Can I scrape category pages instead of search results?

Yes. Provide the full category or subcategory URL in `url` and the actor will start from that page.

#### How many products can I collect?

You can collect as many products as the listing exposes, subject to your `results_wanted` and `max_pages` limits.

#### Why are some fields missing from certain products?

Some products simply do not expose every field on the source page. Empty values are removed from the final dataset to keep the output clean.

#### Does the actor remove duplicates?

Yes. Repeated listing variants are skipped before data is pushed to the dataset.

#### Can I use this for price monitoring?

Yes. The output includes current, sale, and full price fields when they are available, along with computed discount values.

***

### Support

For issues or feature requests, contact support through the Apify Console.

#### Resources

- [Apify Documentation](https://docs.apify.com/)
- [API Reference](https://docs.apify.com/api/v2)
- [Scheduling Runs](https://docs.apify.com/platform/schedules)

***

### Legal Notice

This actor is designed for legitimate data collection purposes. Users are responsible for ensuring compliance with website terms of service and applicable laws. Use collected data responsibly and respect the source website.

# Actor input Schema

## `url` (type: `string`):

Full Zoot.cz URL to scrape (search, category, or subcategory). Takes priority over keyword. Examples: https://www.zoot.cz/vyhledavani/hledani:shirt/ or https://www.zoot.cz/damske/

## `keyword` (type: `string`):

Search keyword if not using a direct URL. Ignored when URL is provided.

## `results_wanted` (type: `integer`):

Maximum number of products to collect.

## `max_pages` (type: `integer`):

Maximum number of pages to visit (48 products per page).

## `proxyConfiguration` (type: `object`):

Use Apify Proxy (recommended: residential) to avoid blocks.

## Actor input object example

```json
{
  "url": "https://www.zoot.cz/vyhledavani/hledani:shirt/",
  "keyword": "shirt",
  "results_wanted": 20,
  "max_pages": 20,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
```

# Actor output Schema

## `overview` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "url": "https://www.zoot.cz/vyhledavani/hledani:shirt/",
    "keyword": "shirt"
};

// Run the Actor and wait for it to finish
const run = await client.actor("shahidirfan/zoot-cz-product-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "url": "https://www.zoot.cz/vyhledavani/hledani:shirt/",
    "keyword": "shirt",
}

# Run the Actor and wait for it to finish
run = client.actor("shahidirfan/zoot-cz-product-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "url": "https://www.zoot.cz/vyhledavani/hledani:shirt/",
  "keyword": "shirt"
}' |
apify call shahidirfan/zoot-cz-product-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=shahidirfan/zoot-cz-product-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Zoot.cz Product Scraper",
        "description": "Extract real-time product data from Zoot.cz instantly. Get prices, descriptions, images, stock status & reviews. Monitor Czech fashion market, track competitor pricing & inventory changes. Perfect for price intelligence, market research & ecommerce analytics. Structured, ETL-ready output.",
        "version": "0.0",
        "x-build-id": "yuiwlSPl4onfOPnb1"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/shahidirfan~zoot-cz-product-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-shahidirfan-zoot-cz-product-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/shahidirfan~zoot-cz-product-scraper/runs": {
            "post": {
                "operationId": "runs-sync-shahidirfan-zoot-cz-product-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/shahidirfan~zoot-cz-product-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-shahidirfan-zoot-cz-product-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "url": {
                        "title": "URL",
                        "type": "string",
                        "description": "Full Zoot.cz URL to scrape (search, category, or subcategory). Takes priority over keyword. Examples: https://www.zoot.cz/vyhledavani/hledani:shirt/ or https://www.zoot.cz/damske/"
                    },
                    "keyword": {
                        "title": "Search Keyword",
                        "type": "string",
                        "description": "Search keyword if not using a direct URL. Ignored when URL is provided."
                    },
                    "results_wanted": {
                        "title": "Maximum Results",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of products to collect.",
                        "default": 20
                    },
                    "max_pages": {
                        "title": "Maximum Pages",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of pages to visit (48 products per page).",
                        "default": 20
                    },
                    "proxyConfiguration": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Use Apify Proxy (recommended: residential) to avoid blocks.",
                        "default": {
                            "useApifyProxy": false
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
