# Subito Scraper - Italian Classifieds Data (`benthepythondev/subito-scraper`) Actor

Scrape Subito.it (Italy's largest marketplace) by keyword: title, description, category, price, size, location, advertiser (private or agency) with phone, image and URL. For real-estate, car and product research and lead generation. Fast & reliable via Subito's JSON API.

- **URL**: https://apify.com/benthepythondev/subito-scraper.md
- **Developed by:** [ben](https://apify.com/benthepythondev) (community)
- **Categories:** E-commerce, Real estate, Business
- **Stats:** 2 total users, 1 monthly user, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $5.00 / 1,000 results

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event
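
As a rough budgeting aid, the sketch below estimates the cost of a run from the listed starting rate of $5.00 per 1,000 results. It assumes a flat per-result price and ignores any other billed events. The name `estimate_cost_usd` is illustrative, and the actual charges are whatever the Actor's pricing page defines.

```python
from decimal import Decimal

# Listed starting rate: $5.00 per 1,000 results (assumed flat for this sketch).
PRICE_PER_1000_USD = Decimal("5.00")


def estimate_cost_usd(results: int, price_per_1000: Decimal = PRICE_PER_1000_USD) -> Decimal:
    """Estimate the cost in USD of a run that returns `results` listings."""
    if results < 0:
        raise ValueError("results must be non-negative")
    # Scale the per-1,000 rate and round to whole cents.
    return (price_per_1000 * results / 1000).quantize(Decimal("0.01"))


print(estimate_cost_usd(50))    # a run at the default maxResults
print(estimate_cost_usd(1000))  # a run at the schema maximum
```

Using `Decimal` rather than `float` keeps the cent rounding exact, which matters when many small runs are summed.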

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action that can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server that can be used as a website, API, or an MCP server.
"Actor" is always written with a capital "A".

## How to integrate an Actor?

Actors can be integrated into a project in whatever way suits its stack.
An integration should be safe, well documented, and production-ready.
The recommended approach for each environment is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## 🇮🇹 Subito Scraper

Scrape classified listings from **Subito.it** — Italy's largest marketplace — by keyword. Get title, description, category, price, size, town/province/region, advertiser (private or agency) with phone, image and listing URL as clean structured rows. Powered by Subito's own JSON search API, so it's fast and reliable: no browser, no login, no API key.

Built for Italian real-estate, car and product research, price monitoring, reselling and lead generation. Export to JSON/CSV/Excel, run on a schedule, call via API, or connect to Make, Zapier or n8n.

### 🔎 What is the Subito Scraper?

Give it a keyword (e.g. "appartamento", "auto", "iphone") and it returns matching listings as structured rows, paginated to your requested limit — with price, location and advertiser details.

#### What data does it extract?

- **Title** and **description**
- **Category** (e.g. apartments, land, cars)
- **Price** (EUR) and raw price text
- **Size** and **rooms** (for property)
- **Town**, **province** and **region**
- **Advertiser** name, **phone** and private-vs-agency flag
- **Image** URL, listing **URL** and date

### ⬇️ Input

| Field | Type | Description |
|-------|------|-------------|
| `query` | string | Keyword to search, e.g. `appartamento`. |
| `type` | string | `s` (for sale/offers) or `k` (wanted). |
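| `maxResults` | integer | Max listings to return, from `1` to `1000`. Default `50`. |
| `useApifyProxy` | boolean | Route requests through the Apify residential (Italy) proxy. Required, because Subito blocks datacenter IPs. Default `true`. |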

#### Example input

```json
{
  "query": "appartamento",
  "type": "s",
  "maxResults": 100
}
````

### ⬆️ Output

One record per listing:

```json
{
  "item_id": "urn:item:652272853",
  "title": "Appartamento trilocale con terrazzo",
  "description": "Luminoso trilocale ristrutturato...",
  "category": "Appartamenti",
  "price_eur": 150000.0,
  "price_raw": "150.000 €",
  "size": "85 mq",
  "rooms": "3",
  "town": "Verona",
  "province": "Verona",
  "region": "Veneto",
  "advertiser_name": "Agenzia Rossi",
  "advertiser_phone": "+39 045 ...",
  "is_company": true,
  "image_url": "https://images.subito.it/.../image.jpg",
  "url": "https://www.subito.it/appartamenti/...",
  "query": "appartamento"
}
```
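
The example record carries both `price_raw` and `price_eur`. The sketch below shows one way Italian-formatted price text could be turned into a number, which can help when re-checking rows downstream. It is an illustration only, not the Actor's own parser, and the function name `parse_price_eur` is hypothetical.

```python
import re
from typing import Optional


def parse_price_eur(price_raw: str) -> Optional[float]:
    """Convert Italian-formatted price text such as '150.000 €' to a float."""
    # Keep digits and separators only, dropping the currency symbol and spaces.
    cleaned = re.sub(r"[^\d.,]", "", price_raw)
    if not cleaned:
        return None
    # Italian convention: '.' groups thousands, ',' marks decimals.
    normalized = cleaned.replace(".", "").replace(",", ".")
    try:
        return float(normalized)
    except ValueError:
        return None


print(parse_price_eur("150.000 €"))   # 150000.0
print(parse_price_eur("1.250,50 €"))  # 1250.5
```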

### 💡 Use cases

- 🏠 **Real-estate research** — track property listings and prices across Italy.
- 🚗 **Car & product research** — monitor inventory and pricing.
- 📉 **Price monitoring** — schedule runs and detect changes.
- 🎯 **Lead generation** — find private sellers and agencies with phone numbers.
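
For the lead-generation use case, rows can be split into private sellers and agencies using the `is_company` flag from the output record. The sketch below runs on made-up sample rows; the helper name `split_by_advertiser` is illustrative, and rows without the flag are treated as private here by assumption.

```python
from collections import defaultdict

# Hypothetical sample rows using field names from the output example.
records = [
    {"item_id": "urn:item:1", "advertiser_name": "Agenzia Rossi", "is_company": True},
    {"item_id": "urn:item:2", "advertiser_name": "Mario", "is_company": False},
    {"item_id": "urn:item:3", "advertiser_name": "Luca"},  # flag missing
]


def split_by_advertiser(rows):
    """Group rows into 'agency' and 'private' buckets based on `is_company`."""
    groups = defaultdict(list)
    for row in rows:
        # A missing or false flag is counted as a private seller (assumption).
        key = "agency" if row.get("is_company") else "private"
        groups[key].append(row)
    return dict(groups)


groups = split_by_advertiser(records)
print({kind: len(rows) for kind, rows in groups.items()})
```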

### ❓ FAQ

**Do I need an API key or login?** No — it uses Subito's public JSON search API.

**Can I search any category?** Yes — any keyword works (property, cars, electronics, …).

**Do I get the price and location?** Yes — price in EUR plus town, province and region.

**Private vs agency?** The advertiser name, phone and a company flag are included.

**How many results?** Set `maxResults` — it paginates automatically.

**How does pricing work?** Pay per listing returned. No subscription.

**Is it legal?** It collects publicly available listings. Scrape responsibly, respect Subito's terms, and follow privacy/anti-spam laws (GDPR) when contacting advertisers.

### ⚙️ How it works

The Subito Scraper talks directly to Subito's own JSON search API and returns clean, structured rows. There is no headless browser, no login and no API key to juggle, which keeps each run fast, cheap and dependable; it's why the Actor keeps passing its automated daily health check instead of breaking on an anti-bot wall. Requests are routed through an Apify residential proxy in Italy by default (`useApifyProxy`), because Subito blocks datacenter IPs. You point it at a search (a keyword and a listing type), tell it how many results you want, and it paginates through the results automatically until it reaches your limit, de-duplicating as it goes. Whether you need 20 rows for a quick check or the full 1,000 that a single run allows, the same input shape works: only `maxResults` changes.
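
The paginate-until-limit and de-duplicate behaviour described above can be pictured with a small sketch. This is not the Actor's source code: `fetch_page` is a hypothetical callable standing in for one request to the search API, and the stop-on-empty-page rule is an assumption.

```python
from typing import Callable, Dict, Iterator, List


def collect_listings(
    fetch_page: Callable[[int], List[Dict]],  # hypothetical: page number -> rows
    max_results: int,
) -> Iterator[Dict]:
    """Yield up to `max_results` unique rows, fetching pages until exhausted."""
    seen = set()
    page = 0
    yielded = 0
    while yielded < max_results:
        rows = fetch_page(page)
        if not rows:  # an empty page is taken to mean nothing is left
            return
        for row in rows:
            if row["item_id"] in seen:
                continue  # skip listings that repeat across pages
            seen.add(row["item_id"])
            yield row
            yielded += 1
            if yielded >= max_results:
                return
        page += 1


# Stand-in data source: three pages, with item "b" repeated on page 1.
PAGES = [
    [{"item_id": "a"}, {"item_id": "b"}],
    [{"item_id": "b"}, {"item_id": "c"}],
    [{"item_id": "d"}],
]


def fake_fetch(page: int) -> List[Dict]:
    return PAGES[page] if page < len(PAGES) else []


print([row["item_id"] for row in collect_listings(fake_fetch, 3)])  # ['a', 'b', 'c']
```

Writing the collector as a generator means it stops requesting pages as soon as the limit is reached.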

### 👥 Who uses Subito data?

Listings from Subito across Italy are valuable to resellers, market researchers, and real-estate & car professionals. A market analyst tracks how prices and inventory move week over week; an agency builds targeted contact lists and enriches its CRM; an investor screens for under-priced stock; a proptech or AI product feeds the structured data into a model, dashboard or app. Because every record is plain JSON with consistent field names, it drops straight into whatever stack you already run — spreadsheets, a database, a BI tool or an LLM pipeline — with no custom parsing.

### 📤 Export, schedule & integrate

Every run is saved to a dataset you can export to **JSON, CSV, Excel, XML or RSS**, or pull through the **Apify API**. Wire it into **Make, Zapier, n8n, Google Sheets, Slack** or your **own database**, run it on a **schedule** (hourly, daily or weekly) so your data stays fresh, and call it from AI agents through the **Apify MCP server**. Combine it with the related scrapers below to cover several markets from one workflow.
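
When pulling a dataset through the Apify API, the export format is chosen with the `format` query parameter on the dataset items endpoint. The sketch below only builds the download URL. The helper name `dataset_items_url` is illustrative, the dataset ID is a placeholder, and depending on your storage access settings the request may also need your API token.

```python
from typing import Optional, Sequence
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"


def dataset_items_url(
    dataset_id: str,
    fmt: str = "csv",
    fields: Optional[Sequence[str]] = None,
) -> str:
    """Build the URL that downloads a dataset's items in the given format."""
    params = {"format": fmt}
    if fields:
        # Restrict the export to selected columns, e.g. title and price.
        params["fields"] = ",".join(fields)
    return f"{API_BASE}/datasets/{dataset_id}/items?{urlencode(params)}"


# "<DATASET_ID>" is a placeholder for run["defaultDatasetId"].
print(dataset_items_url("<DATASET_ID>", "xlsx", ["title", "price_eur", "url"]))
```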

### 💡 Tips for best results

- Start with a small `maxResults` to preview the shape of the data, then scale up.
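- Narrow the search with a more specific `query` and the appropriate `type` for more relevant, higher-quality rows. The input has no separate location or price-range filters, so apply those to the output rows afterwards.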
- Schedule recurring runs and diff the output to catch price drops and new listings early.
- Always respect the source's terms of use and local privacy/anti-spam law (e.g. GDPR) when storing or contacting anyone.
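
The "schedule and diff" tip can be sketched as a comparison of two runs keyed by `item_id`. The rows below are made-up samples and `diff_snapshots` is an illustrative helper, assuming `price_eur` is either a number or missing.

```python
def diff_snapshots(previous, current):
    """Compare two runs keyed by `item_id`; return new listings and price drops."""
    old_by_id = {row["item_id"]: row for row in previous}
    new_listings, price_drops = [], []
    for row in current:
        old = old_by_id.get(row["item_id"])
        if old is None:
            new_listings.append(row)  # not seen in the earlier run
            continue
        old_price, new_price = old.get("price_eur"), row.get("price_eur")
        # Only compare when both runs reported a numeric price.
        if old_price is not None and new_price is not None and new_price < old_price:
            price_drops.append(
                {"item_id": row["item_id"], "from": old_price, "to": new_price}
            )
    return new_listings, price_drops


# Hypothetical rows from two scheduled runs.
previous = [
    {"item_id": "urn:item:1", "price_eur": 150000.0},
    {"item_id": "urn:item:2", "price_eur": 98000.0},
]
current = [
    {"item_id": "urn:item:1", "price_eur": 145000.0},
    {"item_id": "urn:item:2", "price_eur": 98000.0},
    {"item_id": "urn:item:3", "price_eur": 120000.0},
]

new_listings, price_drops = diff_snapshots(previous, current)
print(new_listings)
print(price_drops)
```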

### ❓ More FAQ

**How fresh is the data?** It is scraped live on every run, so it reflects what is on Subito at that moment. Schedule runs to keep a fresh copy.

**Can I get more than the default number of results?** Yes. Raise `maxResults` (up to `1000` per run); the Actor paginates automatically up to the amount you ask for.

**Can I run it automatically?** Yes — use Apify Schedules (cron) to run it hourly, daily or weekly with no manual steps.

**Are duplicates removed?** Yes — records are de-duplicated within each run by their unique id.

**Which export formats are supported?** JSON, CSV, Excel, XML and RSS, plus programmatic access via the Apify API.

**Can I use it with AI agents or LLMs?** Yes — the dataset is available through the Apify API and the Apify MCP server, so agents can call it directly.

### 🔗 You might also like

- [Marktplaats Scraper](https://apify.com/benthepythondev/marktplaats-scraper) — Dutch marketplace data.
- [Kleinanzeigen Scraper](https://apify.com/benthepythondev/kleinanzeigen-scraper) — German marketplace data.
- [willhaben Scraper](https://apify.com/benthepythondev/willhaben-scraper) — Austrian marketplace listings.

***

**Keywords:** subito scraper, italian classifieds, subito api, italy marketplace, real estate italy, immobili subito, second hand italy, price monitoring, classified listings, subito data, lead generation italy, subito it, property listings italy, car listings italy

# Actor input Schema

## `query` (type: `string`):

Keyword to search Subito for, e.g. 'appartamento', 'auto', 'iphone'. Default `appartamento`.

## `type` (type: `string`):

Which listings to fetch: `s` (for sale/offers) or `k` (wanted). Default `s`.

## `maxResults` (type: `integer`):

Maximum listings to return, from `1` to `1000`. Default `50`.

## `useApifyProxy` (type: `boolean`):

Route requests through the Apify RESIDENTIAL (Italy) proxy. Required, because Subito blocks datacenter IPs. Default `true`.

## Actor input object example

```json
{
  "query": "appartamento",
  "type": "s",
  "maxResults": 50,
  "useApifyProxy": true
}
```
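
Before starting a run, the input can be checked locally against the constraints listed in the schema (allowed `type` values, the 1 to 1000 range for `maxResults`, and the defaults). The sketch below is a hand-written check, not an official validator. `validate_input` is a hypothetical helper, and rejecting an empty `query` is an extra assumption the schema does not state.

```python
# Defaults taken from the Actor's input schema.
DEFAULTS = {"query": "appartamento", "type": "s", "maxResults": 50, "useApifyProxy": True}


def validate_input(data: dict) -> dict:
    """Fill in schema defaults and raise ValueError on out-of-range values."""
    merged = {**DEFAULTS, **data}
    if not isinstance(merged["query"], str) or not merged["query"].strip():
        raise ValueError("query must be a non-empty string")
    if merged["type"] not in ("s", "k"):
        raise ValueError("type must be 's' (for sale/offers) or 'k' (wanted)")
    max_results = merged["maxResults"]
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(max_results, bool) or not isinstance(max_results, int):
        raise ValueError("maxResults must be an integer")
    if not 1 <= max_results <= 1000:
        raise ValueError("maxResults must be between 1 and 1000")
    if not isinstance(merged["useApifyProxy"], bool):
        raise ValueError("useApifyProxy must be a boolean")
    return merged


print(validate_input({"query": "iphone", "maxResults": 100}))
```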

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "query": "appartamento",
    "type": "s"
};

// Run the Actor and wait for it to finish
const run = await client.actor("benthepythondev/subito-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "query": "appartamento",
    "type": "s",
}

# Run the Actor and wait for it to finish
run = client.actor("benthepythondev/subito-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "query": "appartamento",
  "type": "s"
}' |
apify call benthepythondev/subito-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=benthepythondev/subito-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Subito Scraper - Italian Classifieds Data",
        "description": "Scrape Subito.it (Italy's largest marketplace) by keyword: title, description, category, price, size, location, advertiser (private or agency) with phone, image and URL. For real-estate, cars and product research and lead generation. Fast & reliable via Subito's JSON API.",
        "version": "1.0",
        "x-build-id": "V6disvSHp9aMd6FF5"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/benthepythondev~subito-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-benthepythondev-subito-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/benthepythondev~subito-scraper/runs": {
            "post": {
                "operationId": "runs-sync-benthepythondev-subito-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/benthepythondev~subito-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-benthepythondev-subito-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "query": {
                        "title": "Search query",
                        "type": "string",
                        "description": "Keyword to search Subito for, e.g. 'appartamento', 'auto', 'iphone'.",
                        "default": "appartamento"
                    },
                    "type": {
                        "title": "Listing type",
                        "enum": [
                            "s",
                            "k"
                        ],
                        "type": "string",
                        "description": "Which listings to fetch.",
                        "default": "s"
                    },
                    "maxResults": {
                        "title": "Max results",
                        "minimum": 1,
                        "maximum": 1000,
                        "type": "integer",
                        "description": "Maximum listings to return.",
                        "default": 50
                    },
                    "useApifyProxy": {
                        "title": "Use Apify proxy",
                        "type": "boolean",
                        "description": "Route requests through Apify RESIDENTIAL (Italy) proxy. Required — Subito blocks datacenter IPs.",
                        "default": true
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
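
The `run-sync-get-dataset-items` path in the specification above can also be called without a client library. The sketch below uses only the Python standard library and takes the server URL, path and `token` query parameter from the spec. The helper names are illustrative, and it assumes the response body is a JSON array of dataset items, which the spec itself does not describe.

```python
import json
import urllib.request
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "/acts/benthepythondev~subito-scraper/run-sync-get-dataset-items"


def build_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build the POST request that runs the Actor and returns its dataset items."""
    url = f"{API_BASE}{ACTOR_PATH}?{urlencode({'token': token})}"
    body = json.dumps(actor_input).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def run_actor(token: str, actor_input: dict, timeout: float = 300.0) -> list:
    """Send the request and decode the response (assumed to be a JSON array)."""
    with urllib.request.urlopen(build_request(token, actor_input), timeout=timeout) as resp:
        return json.load(resp)


# Building the request does not contact the API; call run_actor() to execute it.
request = build_request("<YOUR_API_TOKEN>", {"query": "appartamento", "type": "s"})
print(request.get_method(), request.full_url.split("?")[0])
```

Because this call waits for the run to finish, larger `maxResults` values may need a longer `timeout`, or the asynchronous `/runs` path instead.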
