# Takealot Listings Scraper (`piotrv1001/takealot-listings-scraper`) Actor

The Takealot Listings Scraper extracts product listings and enriched details from Takealot.com, South Africa's largest marketplace — capturing titles, brands, prices, savings, stock, ratings, reviews, EAN/MPN barcodes, images, and category taxonomy. Ideal for price monitoring and product matching.

- **URL**: https://apify.com/piotrv1001/takealot-listings-scraper.md
- **Developed by:** [FalconScrape](https://apify.com/piotrv1001) (community)
- **Categories:** E-commerce, Automation, Developer tools
- **Stats:** 2 total users, 1 monthly users, 81.8% runs succeeded, 1 bookmarks
- **User rating**: No ratings yet

## Pricing

from $1.50 / 1,000 product listings

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### 🚀 Takealot Listings Scraper

Easily extract product data from [Takealot.com](https://www.takealot.com), South Africa's largest online marketplace. The **Takealot Listings Scraper** gathers structured product listings and full product details — straight from Takealot's public JSON API, with **no proxy and no browser required** — for price monitoring, catalog research, and product matching.

### ✨ Features

- 🏷️ **Comprehensive Product Data**: Retrieve titles, brands, prices, savings, ratings, reviews, stock, images, and category taxonomy.
- 🔎 **Three Ways to Seed**: Scrape by **search keyword**, by **category/department filter**, or by **direct product URL**.
- 🧬 **Optional Detail Enrichment**: Add **EAN/MPN barcodes, SKU, multi-warehouse stock, full review distribution, and HTML descriptions** for every product.
- 💱 **ZAR Market, Underserved Niche**: Clean, fast data for the South African market in Rands.
- ⚡ **Fast & Proxy-Free**: Hits `api.takealot.com` directly — no anti-bot, no browser, ~36 products per request.

### 🛠️ How It Works

1. **Choose Your Input** – Provide search terms (e.g. `playstation 5`), category filters (e.g. `Type:13`), and/or direct product URLs.
2. **Pick the Depth** – Toggle **Scrape full product details** on for rich PDP data, or off for fast, cheap listings.
3. **Run the Scraper** – Start the run and receive structured product data, ready to export as JSON, CSV, Excel, or HTML.

### ⚙️ Input

Provide **at least one** of `searchTerms`, `categoryFilters`, or `startUrls`.

| Field                  | Type    | Description                                                                          |
| ---------------------- | ------- | ------------------------------------------------------------------------------------ |
| `searchTerms`          | array   | Keywords to search (e.g. `["laptop", "playstation 5"]`).                             |
| `categoryFilters`      | array   | Facet filters to browse, e.g. `["Type:13"]` (Computers & Tablets) or `["Brand:HP"]`. |
| `startUrls`            | array   | Direct Takealot product URLs containing a `PLID`.                                    |
| `scrapeProductDetails` | boolean | Fetch full product details (EAN/MPN, stock, reviews). Default `true`.                |
| `maxItems`             | integer | Maximum products to scrape (0 = unlimited). Default `50`.                            |
| `imageSize`            | string  | Gallery image size token (`pdpxl`, `zoom`, …). Default `pdpxl`.                      |
| `sort`                 | string  | Listing sort order. Default `Relevance`.                                             |
| `proxyConfiguration`   | object  | Optional proxy. Not needed — leave disabled.                                         |

#### Example input

```json
{
    "searchTerms": ["playstation 5"],
    "scrapeProductDetails": true,
    "maxItems": 50
}
````

### 📊 Sample Output Data

The scraper provides structured JSON output with key product details. Example:

```json
[
    {
        "plid": "PLID95228805",
        "tsin": 96098813,
        "title": "PlayStation 5 Slim Console with disc drive (PS5 Slim)",
        "brand": "Sony Playstation",
        "price": 13398,
        "prettyPrice": "R 13,398",
        "listingPrice": 15999,
        "saving": "16%",
        "currency": "ZAR",
        "inStock": true,
        "stockStatus": "In stock",
        "ean": "711719577188",
        "mpn": "711719577188",
        "sku": 233384319,
        "starRating": 4.9,
        "reviewCount": 201,
        "reviewDistribution": { "5": 184, "4": 14, "3": 0, "2": 0, "1": 3 },
        "department": "Gaming",
        "categories": ["Consoles"],
        "distributionCentres": ["JHB"],
        "images": ["https://media.takealot.com/covers_images/<hash>/s-pdpxl.file"],
        "url": "https://www.takealot.com/playstation-5-slim-console-with-disc-drive-ps5-slim/PLID95228805"
    }
]
```

When `scrapeProductDetails` is `false`, only the listing-level fields are returned (no `ean`, `sku`, taxonomy, etc.).

### 💸 Pricing

This Actor uses Apify's **Pay Per Event** model — you pay per delivered result, by how much data it contains. Each dataset item is charged exactly once, and you only pay the higher rate when you opt into the detail enrichment pass.

| Event             | When it's charged                                                                                        | Price       | Per 1,000 |
| ----------------- | -------------------------------------------------------------------------------------------------------- | ----------- | --------- |
| `product-listing` | A shallow listing record is delivered (`scrapeProductDetails` = `false`).                                | **$0.0015** | $1.50     |
| `item-detail`     | A fully enriched product record is delivered (`scrapeProductDetails` = `true`, or a direct product URL). | **$0.005**  | $5.00     |

Set a **maximum total charge** on the run to cap your spend — the Actor stops cleanly once that budget is reached, so it never delivers data you haven't paid for. Start with a small `maxItems` to estimate, then scale up.

Get clean, proxy-free Takealot product data with the **Takealot Listings Scraper** today! 🚀

# Actor input Schema

## `searchTerms` (type: `array`):

Keywords to search on Takealot (e.g. "laptop", "playstation 5"). Each term is paginated through every result page up to Max items.

## `categoryFilters` (type: `array`):

Browse listings by facet filter instead of a keyword. Use facet syntax like "Type:13" (Computers & Tablets) or "Brand:HP". You can discover these from the facets in any search result.

## `startUrls` (type: `array`):

Direct Takealot product page URLs (must contain a PLID, e.g. .../PLID95228805). These are scraped via the product-details endpoint.

## `scrapeProductDetails` (type: `boolean`):

Run the enrichment pass: for every listed product, fetch the product-details endpoint to add EAN/MPN, SKU, full reviews, multi-warehouse stock, taxonomy and description. Triples request count but yields much richer data.

## `maxItems` (type: `integer`):

Maximum number of products to scrape across all search terms and categories (0 = unlimited).

## `imageSize` (type: `string`):

Size token substituted into the gallery image URL template.

## `sort` (type: `string`):

Sort order for search/browse listings.

## `proxyConfiguration` (type: `object`):

Proxy settings. The Takealot API has no bot protection, so no proxy is needed — leave disabled. A datacenter pool is optional insurance for very large crawls.

## Actor input object example

```json
{
  "searchTerms": [
    "playstation 5"
  ],
  "categoryFilters": [],
  "startUrls": [],
  "scrapeProductDetails": false,
  "maxItems": 50,
  "imageSize": "pdpxl",
  "sort": "Relevance",
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "searchTerms": [
        "playstation 5"
    ]
};

// Run the Actor and wait for it to finish
const run = await client.actor("piotrv1001/takealot-listings-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "searchTerms": ["playstation 5"] }

# Run the Actor and wait for it to finish
run = client.actor("piotrv1001/takealot-listings-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "searchTerms": [
    "playstation 5"
  ]
}' |
apify call piotrv1001/takealot-listings-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=piotrv1001/takealot-listings-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Takealot Listings Scraper",
        "description": "The Takealot Listings Scraper extracts product listings and enriched details from Takealot.com, South Africa's largest marketplace — capturing titles, brands, prices, savings, stock, ratings, reviews, EAN/MPN barcodes, images, and category taxonomy. Ideal for price monitoring and product matching.",
        "version": "0.0",
        "x-build-id": "ZiCHpWxGEVgfB33sa"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/piotrv1001~takealot-listings-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-piotrv1001-takealot-listings-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/piotrv1001~takealot-listings-scraper/runs": {
            "post": {
                "operationId": "runs-sync-piotrv1001-takealot-listings-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/piotrv1001~takealot-listings-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-piotrv1001-takealot-listings-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "searchTerms": {
                        "title": "Search terms",
                        "type": "array",
                        "description": "Keywords to search on Takealot (e.g. \"laptop\", \"playstation 5\"). Each term is paginated through every result page up to Max items.",
                        "default": [],
                        "items": {
                            "type": "string"
                        }
                    },
                    "categoryFilters": {
                        "title": "Category / department filters",
                        "type": "array",
                        "description": "Browse listings by facet filter instead of a keyword. Use facet syntax like \"Type:13\" (Computers & Tablets) or \"Brand:HP\". You can discover these from the facets in any search result.",
                        "default": [],
                        "items": {
                            "type": "string"
                        }
                    },
                    "startUrls": {
                        "title": "Product URLs",
                        "type": "array",
                        "description": "Direct Takealot product page URLs (must contain a PLID, e.g. .../PLID95228805). These are scraped via the product-details endpoint.",
                        "default": [],
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    },
                    "scrapeProductDetails": {
                        "title": "Scrape full product details",
                        "type": "boolean",
                        "description": "Run the enrichment pass: for every listed product, fetch the product-details endpoint to add EAN/MPN, SKU, full reviews, multi-warehouse stock, taxonomy and description. Triples request count but yields much richer data.",
                        "default": false
                    },
                    "maxItems": {
                        "title": "Max items",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Maximum number of products to scrape across all search terms and categories (0 = unlimited).",
                        "default": 50
                    },
                    "imageSize": {
                        "title": "Image size",
                        "enum": [
                            "pdpxl",
                            "zoom",
                            "fb",
                            "covers"
                        ],
                        "type": "string",
                        "description": "Size token substituted into the gallery image URL template.",
                        "default": "pdpxl"
                    },
                    "sort": {
                        "title": "Sort order",
                        "enum": [
                            "Relevance",
                            "Price",
                            "PriceHighToLow",
                            "TopSellers",
                            "NewToOld",
                            "Rating"
                        ],
                        "type": "string",
                        "description": "Sort order for search/browse listings.",
                        "default": "Relevance"
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Proxy settings. The Takealot API has no bot protection, so no proxy is needed — leave disabled. A datacenter pool is optional insurance for very large crawls.",
                        "default": {
                            "useApifyProxy": false
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
