# Stepstone.it Italy Job Scraper (`unfenced-group/stepstone-it-scraper`) Actor

Scrape job listings from Stepstone Italy. Filter by keyword, location and contract type. Returns salary, full descriptions, geocoordinates and repost detection. No API key required.

- **URL**: https://apify.com/unfenced-group/stepstone-it-scraper.md
- **Developed by:** [Unfenced Group](https://apify.com/unfenced-group) (community)
- **Categories:** Jobs
- **Stats:** 2 total users, 1 monthly user, 0.0% runs succeeded
- **User rating**: No ratings yet

## Pricing

from $1.50 / 1,000 result items

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
"Actor" is always written with a capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see the Apify documentation, available as a [Markdown index](https://docs.apify.com/llms.txt) and as [Markdown full text](https://docs.apify.com/llms-full.txt).


# README

## Stepstone.it Scraper

![Stepstone Italy Scraper](https://i.imgur.com/zdHdalw.png)

Scrape structured job listings from [Stepstone.it](https://www.stepstone.it) — Italy's leading employment platform. 100,000+ active listings. No API key required.

> **v2.0** — Upgraded to **JSON-HTML extraction** (`__NEXT_DATA__`). Stepstone.it is a Next.js SSR app; all search results are now extracted from the embedded JSON blob rather than CSS selectors, making parsing faster and more resilient to layout changes. Falls back to JSON-LD and CSS selectors automatically if the JSON path is absent.

---

### Why this scraper?

#### 🇮🇹 Stepstone.it — Italy's job market
Retrieve job data from Stepstone Italy covering all major industries, contract types, and regions.

#### 📄 Full job descriptions
Enable `fetchDetails` to retrieve complete job descriptions in HTML, plain text, and Markdown — ideal for AI pipelines and RAG systems.

#### 💰 Structured salary data
Salary ranges are returned as structured numeric fields (`salaryMin`, `salaryMax`, `salaryType`) alongside the formatted display string.

#### 🔁 Repost detection
The scraper detects duplicate listings posted multiple times by the same employer. Use `skipReposts: true` to exclude them or check the `isRepost` field to filter later.

#### 📍 Geocoding included
City names are automatically resolved to `latitude`/`longitude` coordinates via Nominatim — no extra cost.

#### 📋 Three description formats
Job descriptions are available as HTML (for rendering), plain text (for search/indexing), and Markdown (for LLMs/RAG).

---

### Input parameters

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| `searchQuery` | string | `developer` | Job title, skill, or keyword |
| `location` | string | `Milano` | City or region (e.g. Roma, Torino, Napoli) |
| `radius` | integer | `40` | Search radius in km |
| `maxItems` | integer | `5` | Maximum results to return |
| `daysOld` | integer | — | Only return listings posted within N days |
| `skipReposts` | boolean | `false` | Skip listings posted more than once |
| `fetchDetails` | boolean | `false` | Fetch full description from detail pages |
| `startUrls` | array | — | Specific Stepstone.it search or job URLs |

---

### Output schema

#### Always present

| Field | Type | Description |
|-------|------|-------------|
| `url` | string | Job listing URL |
| `jobId` | string | Unique Stepstone job ID |
| `source` | string | Always `stepstone.it` |
| `language` | string | Always `it` |
| `title` | string | Job title |
| `company` | string | Employer name |
| `companyUrl` | string\|null | Employer website |
| `companyLogoUrl` | string\|null | Company logo image URL (**new in v2.0**) |
| `city` | string\|null | City |
| `location` | string\|null | Normalised location string |
| `country` | string | Always `IT` |
| `contractType` | string\|null | e.g. `FULL_TIME`, `PART_TIME`, `TEMPORARY` |
| `workSchedule` | string\|null | Work schedule description |
| `salary` | string\|null | Formatted salary (e.g. `€35.000 – €45.000 (YEAR)`) |
| `salaryMin` | number\|null | Minimum salary (numeric) |
| `salaryMax` | number\|null | Maximum salary (numeric) |
| `salaryType` | string\|null | `YEAR`, `MONTH`, or `HOUR` |
| `salaryCurrency` | string\|null | Always `EUR` when present |
| `publishDate` | string\|null | Publication date (YYYY-MM-DD) |
| `publishDateISO` | string\|null | Publication date (ISO 8601) |
| `daysOld` | integer\|null | Days since publication |
| `expiryDate` | string\|null | Expiry date (YYYY-MM-DD) |
| `applyUrl` | string\|null | Direct apply URL when available via `__NEXT_DATA__` (**new in v2.0**) |
| `isRemote` | boolean | Whether the role is fully remote (**new in v2.0**) |
| `tags` | array\|null | Job tags / categories from Stepstone metadata (**new in v2.0**) |
| `snippet` | string\|null | Short description snippet from search results (**new in v2.0**) |
| `isRepost` | boolean | Whether this listing was posted before |
| `scrapedAt` | string | Scrape timestamp (ISO 8601) |
| `latitude` | number\|null | Geocoded latitude |
| `longitude` | number\|null | Geocoded longitude |

#### With `fetchDetails: true`

| Field | Type | Description |
|-------|------|-------------|
| `description` | string\|null | Full job description (HTML) |
| `descriptionText` | string\|null | Full job description (plain text) |
| `descriptionMarkdown` | string\|null | Full job description (Markdown) |
| `summary` | string\|null | First 200 characters of description |
| `skills` | array\|null | Skills / requirements list from job data (**new in v2.0**) |
| `benefits` | array\|null | Benefits / perks list from job data (**new in v2.0**) |
| `extractionMethod` | string | Which method extracted data: `nextdata`, `jsonld`, or `css` (**new in v2.0**) |

#### Example output

```json
{
  "url": "https://www.stepstone.it/offerte-lavoro/software-engineer-milano--12345678-inline.html",
  "jobId": "12345678",
  "source": "stepstone.it",
  "language": "it",
  "title": "Software Engineer",
  "company": "Tech Company SPA",
  "companyUrl": "https://www.techcompany.it",
  "companyLogoUrl": "https://www.stepstone.it/upload_de/logo/example.gif",
  "city": "Milano",
  "location": "Milano",
  "country": "IT",
  "contractType": "FULL_TIME",
  "workSchedule": "Full time",
  "salary": "€40.000 – €55.000 (YEAR)",
  "salaryMin": 40000,
  "salaryMax": 55000,
  "salaryType": "YEAR",
  "salaryCurrency": "EUR",
  "publishDate": "2026-05-01",
  "publishDateISO": "2026-05-01T00:00:00Z",
  "daysOld": 3,
  "expiryDate": "2026-06-01",
  "applyUrl": null,
  "isRemote": false,
  "tags": null,
  "snippet": null,
  "isRepost": false,
  "scrapedAt": "2026-05-04T10:00:00.000Z",
  "latitude": 45.4642,
  "longitude": 9.19
}
````
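
As a rough illustration of consuming this schema downstream, the sketch below filters dataset items in Python. It relies only on the documented field names (`isRepost`, `salaryMin`, `salaryType`); the function name `filter_jobs`, its parameters, and the sample records are hypothetical and not part of the Actor.

```python
from __future__ import annotations

from typing import Any, Iterable


def filter_jobs(
    items: Iterable[dict[str, Any]],
    min_yearly_salary: float | None = None,
    include_reposts: bool = False,
) -> list[dict[str, Any]]:
    """Keep items that pass the repost and salary checks.

    Items without salary data (salaryMin is None) are kept, because
    salary fields are nullable in the output schema.
    """
    kept = []
    for item in items:
        # Drop reposts unless the caller explicitly wants them.
        if item.get("isRepost") and not include_reposts:
            continue
        salary_min = item.get("salaryMin")
        # Only compare salaries that are present and expressed per year.
        if (
            min_yearly_salary is not None
            and salary_min is not None
            and item.get("salaryType") == "YEAR"
            and salary_min < min_yearly_salary
        ):
            continue
        kept.append(item)
    return kept


# Hypothetical sample records shaped like the output schema.
sample = [
    {"jobId": "1", "isRepost": False, "salaryMin": 40000, "salaryType": "YEAR"},
    {"jobId": "2", "isRepost": True, "salaryMin": 60000, "salaryType": "YEAR"},
    {"jobId": "3", "isRepost": False, "salaryMin": None, "salaryType": None},
    {"jobId": "4", "isRepost": False, "salaryMin": 30000, "salaryType": "YEAR"},
]
print([job["jobId"] for job in filter_jobs(sample, min_yearly_salary=35000)])
```

Keeping records with missing salary is a design choice for this sketch; a stricter pipeline might drop them instead.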

***

### Examples

**Basic search — developer jobs in Milan**

```json
{
  "searchQuery": "developer",
  "location": "Milano",
  "maxItems": 50
}
```

**Filter by listing age**

```json
{
  "searchQuery": "marketing",
  "location": "Roma",
  "daysOld": 7,
  "maxItems": 100
}
```

**Full descriptions for AI processing**

```json
{
  "searchQuery": "data scientist",
  "location": "Torino",
  "fetchDetails": true,
  "maxItems": 25
}
```

**Specific job URLs**

```json
{
  "startUrls": [
    { "url": "https://www.stepstone.it/offerte-lavoro/software-engineer-milano--12345678-inline.html" }
  ],
  "fetchDetails": true
}
```

***

### 💰 Pricing

**$1.50 per 1,000 results** (Pay-Per-Event)

| Action | Cost |
|--------|------|
| Actor start | $0.00005 |
| Per result | $0.0015 |
| 1,000 results | ~$1.50 |
| 10,000 results | ~$15.00 |

Unlike flat-rate scrapers, which charge the same whether you get 10 or 10,000 results, this Actor charges only for the results you actually receive.
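
The arithmetic behind the table can be sketched in a few lines of Python. This assumes the two event prices listed above are the only charges and that they apply unchanged to your plan; the function name `estimate_cost` is hypothetical.

```python
ACTOR_START_USD = 0.00005  # "Actor start" row in the pricing table
PER_RESULT_USD = 0.0015    # "Per result" row in the pricing table


def estimate_cost(results: int, runs: int = 1) -> float:
    """Estimate the pay-per-event cost in USD for a number of results."""
    if results < 0 or runs < 0:
        raise ValueError("results and runs must be non-negative")
    return runs * ACTOR_START_USD + results * PER_RESULT_USD


for n in (100, 1_000, 10_000):
    print(f"{n:>6} results: ${estimate_cost(n):.5f}")
```

The start fee is small enough that the per-result price dominates for any realistic run size, which is why the table rounds to roughly $1.50 per 1,000 results.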

***

### Performance

| Scenario | Speed | Cost |
|----------|-------|------|
| 100 listings (fetchDetails: false) | ~1 min | ~$0.15 |
| 100 listings (fetchDetails: true) | ~3 min | ~$0.15 + compute |
| 1,000 listings | ~10 min | ~$1.50 |

***

### Known limitations

- `applyUrl` is always `null` — Stepstone Italy uses a login modal for applications
- Salary data is not published by all employers — `null` values are normal
- Without a location, the scraper returns national results
- `fetchDetails: true` is slower and uses more compute credits

***

### Technical details

- HTTP-only scraper (no browser required) — fast and cost-efficient
- RESIDENTIAL proxy for reliable access
- Self-healing health monitoring with automatic failure detection
- Cross-run repost deduplication via Key-Value Store
- Nominatim geocoding for coordinates
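
Because each item carries `latitude` and `longitude`, distance filtering can also be done after the run. The following is a minimal sketch using the haversine formula; it is not the Actor's own radius logic, the helper names are hypothetical, and the Rome coordinates are approximate values used only for illustration.

```python
import math


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance between two points in kilometres."""
    earth_radius_km = 6371.0088  # mean Earth radius
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    a = (
        math.sin(d_phi / 2) ** 2
        + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2
    )
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def within_radius(item: dict, center_lat: float, center_lon: float, radius_km: float) -> bool:
    """Return True if the item has coordinates and lies inside the radius."""
    lat, lon = item.get("latitude"), item.get("longitude")
    if lat is None or lon is None:
        return False  # coordinates are nullable in the output schema
    return haversine_km(lat, lon, center_lat, center_lon) <= radius_km


# Milano coordinates are taken from the example output; Roma is approximate.
milano = {"city": "Milano", "latitude": 45.4642, "longitude": 9.19}
roma = {"city": "Roma", "latitude": 41.9028, "longitude": 12.4964}
unknown = {"city": None, "latitude": None, "longitude": None}

nearby = [job["city"] for job in (milano, roma, unknown) if within_radius(job, 45.4642, 9.19, 40)]
print(nearby)
```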

***

### Additional services

Need a custom scraper, scheduled runs, or data integration? Contact us at <info@unfencedgroup.nl>.

***

*Built by [unfenced-group](https://apify.com/unfenced-group) · Stepstone.it is a trademark of The Stepstone Group*

# Actor input Schema

## `searchQuery` (type: `string`):

Job title, skill, or keyword to search for.

## `location` (type: `string`):

City or region to search in (e.g. Milano, Roma, Torino).

## `radius` (type: `integer`):

Search radius in kilometres around the location.

## `maxItems` (type: `integer`):

Maximum number of job listings to return.

## `daysOld` (type: `integer`):

Only return jobs posted within this many days. Leave empty for all listings.

## `skipReposts` (type: `boolean`):

Skip job listings that have been posted before (cross-run deduplication).

## `fetchDetails` (type: `boolean`):

Visit each job detail page for full description (HTML, plain text, Markdown) and salary data. Slower but more complete.

## `startUrls` (type: `array`):

Optional: provide specific Stepstone.it search or job detail URLs to scrape instead of using the search parameters above.

## Actor input object example

```json
{
  "searchQuery": "developer",
  "location": "Milano",
  "radius": 40,
  "maxItems": 5,
  "skipReposts": false,
  "fetchDetails": false,
  "startUrls": []
}
```
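
Before starting a run, it can help to check an input object locally. The sketch below applies the types documented above plus the numeric bounds from the OpenAPI `inputSchema` in this document (`radius` 0 to 100, `maxItems` at least 1, `daysOld` at least 0). The helper `validate_input` is hypothetical, and the platform performs its own validation regardless.

```python
from __future__ import annotations

from typing import Any


def _is_int(value: Any) -> bool:
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool)


def validate_input(actor_input: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the checks passed."""
    problems: list[str] = []

    # (field, minimum, maximum) as stated in the OpenAPI inputSchema.
    bounds = [("radius", 0, 100), ("maxItems", 1, None), ("daysOld", 0, None)]
    for field, minimum, maximum in bounds:
        value = actor_input.get(field)
        if value is None:
            continue  # optional field left unset
        if not _is_int(value):
            problems.append(f"{field} must be an integer")
        elif value < minimum or (maximum is not None and value > maximum):
            problems.append(f"{field} is out of range")

    for field in ("searchQuery", "location"):
        if field in actor_input and not isinstance(actor_input[field], str):
            problems.append(f"{field} must be a string")

    for field in ("skipReposts", "fetchDetails"):
        if field in actor_input and not isinstance(actor_input[field], bool):
            problems.append(f"{field} must be a boolean")

    start_urls = actor_input.get("startUrls") or []
    if not isinstance(start_urls, list):
        problems.append("startUrls must be an array")
    elif any(not isinstance(entry, dict) or "url" not in entry for entry in start_urls):
        problems.append("each startUrls entry must be an object with a 'url' key")

    return problems


print(validate_input({"radius": 150, "maxItems": 0}))
```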

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {};

// Run the Actor and wait for it to finish
const run = await client.actor("unfenced-group/stepstone-it-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {}

# Run the Actor and wait for it to finish
run = client.actor("unfenced-group/stepstone-it-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{}' |
apify call unfenced-group/stepstone-it-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=unfenced-group/stepstone-it-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Stepstone.it Italy Job Scraper",
        "description": "Scrape job listings from Stepstone Italy. Filter by keyword, location and contract type. Returns salary, full descriptions, geocoordinates and repost detection. No API key required.",
        "version": "0.0",
        "x-build-id": "UFcApmGLJwy7w4xw5"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/unfenced-group~stepstone-it-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-unfenced-group-stepstone-it-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/unfenced-group~stepstone-it-scraper/runs": {
            "post": {
                "operationId": "runs-sync-unfenced-group-stepstone-it-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/unfenced-group~stepstone-it-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-unfenced-group-stepstone-it-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "searchQuery": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Job title, skill, or keyword to search for.",
                        "default": "developer"
                    },
                    "location": {
                        "title": "Location",
                        "type": "string",
                        "description": "City or region to search in (e.g. Milano, Roma, Torino).",
                        "default": "Milano"
                    },
                    "radius": {
                        "title": "Search Radius (km)",
                        "minimum": 0,
                        "maximum": 100,
                        "type": "integer",
                        "description": "Search radius in kilometres around the location.",
                        "default": 40
                    },
                    "maxItems": {
                        "title": "Maximum Results",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of job listings to return.",
                        "default": 5
                    },
                    "daysOld": {
                        "title": "Maximum Age (days)",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Only return jobs posted within this many days. Leave empty for all listings."
                    },
                    "skipReposts": {
                        "title": "Skip Reposts",
                        "type": "boolean",
                        "description": "Skip job listings that have been posted before (cross-run deduplication).",
                        "default": false
                    },
                    "fetchDetails": {
                        "title": "Fetch Full Descriptions",
                        "type": "boolean",
                        "description": "Visit each job detail page for full description (HTML, plain text, Markdown) and salary data. Slower but more complete.",
                        "default": false
                    },
                    "startUrls": {
                        "title": "Start URLs",
                        "type": "array",
                        "description": "Optional: provide specific Stepstone.it search or job detail URLs to scrape instead of using the search parameters above.",
                        "default": [],
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
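
For projects that do not use a client library, the `run-sync-get-dataset-items` path from the specification above can be called directly. This is a minimal sketch using only the Python standard library. It assumes the response body is a JSON array of dataset items and that the run finishes within the chosen timeout; the function names are hypothetical.

```python
from __future__ import annotations

import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"  # "servers" entry in the specification
ACTOR_PATH = "/acts/unfenced-group~stepstone-it-scraper/run-sync-get-dataset-items"


def build_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build the POST request: token in the query string, input as JSON body."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        url=f"{API_BASE}{ACTOR_PATH}?{query}",
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_actor(token: str, actor_input: dict, timeout: float = 300.0) -> list[dict]:
    """Send the request and decode the response (assumed to be a JSON array)."""
    request = build_request(token, actor_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Usage (performs a real, billable run, so it is left commented out):
# items = run_actor("<YOUR_API_TOKEN>", {"searchQuery": "developer", "maxItems": 5})
# print(len(items))
```

Long runs may exceed what a synchronous request can wait for; in that case the `/runs` path in the specification starts the run and returns its metadata instead.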
