# Gazzetta Ufficiale Scraper - Italian Official Gazette (`studio-amba/gazzetta-ufficiale-scraper`) Actor

Scrape laws, decrees, regulations, and public notices from Italy's official gazette (Gazzetta Ufficiale). Extract act titles, types, numbers, dates, issuing authorities, full text, and PDF links. Browse recent issues or search by keyword. No login or cookies required.

- **URL**: https://apify.com/studio-amba/gazzetta-ufficiale-scraper.md
- **Developed by:** [Studio Amba](https://apify.com/studio-amba) (community)
- **Categories:** Business, Lead generation
- **Stats:** 2 total users, 1 monthly users, 0.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Gazzetta Ufficiale Scraper - Italian Official Gazette Data

Extract laws, decrees, regulations, and public notices from Italy's official gazette (Gazzetta Ufficiale della Repubblica Italiana). Browse recent issues or search by keyword across all gazette series. Returns act titles, types, numbers, dates, issuing authorities, full text, and PDF links. No login or cookies required.

### How to scrape Gazzetta Ufficiale data

This actor scrapes the official Italian government gazette at gazzettaufficiale.it and extracts structured data from published legislative acts. The Gazzetta Ufficiale is Italy's official publication for all laws, legislative decrees, presidential decrees, ministerial orders, public competitions, and government notices.

#### Browse Recent Issues

Enable "Browse Recent Issues" to automatically scrape acts from the last 30 days of gazette publications. The actor navigates through each issue's table of contents (sommario) and extracts every published act with its full text.

#### Search by Keywords

Enter Italian keywords to search across act titles. For example, search for "sicurezza" (security), "ambiente" (environment), "lavoro" (employment), or "decreto legislativo" (legislative decree) to find specific legislation.

#### Filter by Act Type

Narrow results to specific types of legislative acts:
- **Legge** — Parliamentary law
- **Decreto Legislativo** — Legislative decree (delegated legislation)
- **Decreto-Legge** — Decree-law (emergency legislation)
- **Decreto del Presidente della Repubblica** — Presidential decree
- **Decreto** — Ministerial or interministerial decree
- **Delibera** — Resolution
- **Ordinanza** — Ordinance
- **Comunicato** — Official notice
- **Regolamento** — Regulation

#### Choose Gazette Series

The Gazzetta Ufficiale is published in seven series:
- **Serie Generale** — Main legislation (most common)
- **Corte Costituzionale** — Constitutional Court decisions
- **Unione Europea** — EU-related materials
- **Regioni** — Regional legislation
- **Concorsi ed Esami** — Public competitions and exams (government jobs)
- **Contratti Pubblici** — Public contracts and tenders
- **Parte II** — Notices and insertions

### What data does the Gazzetta Ufficiale Scraper extract?

| Field | Type | Description |
|-------|------|-------------|
| **title** | String | Full title of the legislative act |
| **actType** | String | Type (Legge, Decreto Legislativo, Decreto, etc.) |
| **actNumber** | String | Official act number (e.g. "83/2026") |
| **actDate** | String | Date the act was signed |
| **issuingAuthority** | String | Government body that issued the act |
| **publicationDate** | String | Date published in the Gazzetta Ufficiale |
| **gazetteNumber** | String | Issue number of the gazette |
| **series** | String | Gazette series (Serie Generale, Concorsi, etc.) |
| **codiceRedazionale** | String | Unique editorial code (e.g. "26A03143") |
| **fullText** | String | Full text content of the act (up to 50K characters) |
| **pdfUrl** | String | Direct link to the PDF version |
| **url** | String | Source URL on gazzettaufficiale.it |
| **scrapedAt** | String | ISO 8601 timestamp of extraction |

### Input Parameters

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| **Search Query** | String | — | Keywords to search in act titles (Italian) |
| **Act Type** | Select | All types | Filter by type of legislative act |
| **Gazette Series** | Select | Serie Generale | Which gazette series to search |
| **Browse Recent** | Boolean | true | Browse last 30 days instead of searching |
| **Date From** | String | — | Start date for search (dd/mm/yyyy) |
| **Date To** | String | — | End date for search (dd/mm/yyyy) |
| **Max Results** | Integer | 100 | Maximum number of acts to scrape |
| **Proxy Configuration** | Object | IT Residential | Proxy settings (Italian residential recommended) |

### Example Output

```json
{
    "title": "DECRETO LEGISLATIVO 16 aprile 2026, n. 83",
    "actType": "DECRETO LEGISLATIVO",
    "actNumber": "83",
    "actDate": "16 aprile 2026",
    "issuingAuthority": "PRESIDENZA DELLA REPUBBLICA",
    "publicationDate": "2026-05-20",
    "gazetteNumber": "115",
    "series": "Serie Generale",
    "codiceRedazionale": "26A03001",
    "fullText": "IL PRESIDENTE DELLA REPUBBLICA Visti gli articoli 76 e 87...",
    "pdfUrl": "https://www.gazzettaufficiale.it/do/gazzetta/downloadPdf?...",
    "url": "https://www.gazzettaufficiale.it/atto/serie_generale/caricaDettaglioAtto/originario?...",
    "scrapedAt": "2026-05-23T10:30:00.000Z"
}
````

### Use Cases

- **Legal research** — Monitor new legislation and regulatory changes in Italy
- **Compliance monitoring** — Track decrees and regulations affecting your industry
- **Government job tracking** — Scrape public competitions from the Concorsi series
- **Public procurement** — Monitor new tenders and contracts from the Contratti series
- **Academic research** — Build datasets of Italian legislative history
- **EU law transposition** — Track implementation of EU directives into Italian law

### Technical Notes

- The site uses a WAF that blocks raw HTTP requests, so this actor uses Playwright (headless browser)
- Italian residential proxies are strongly recommended for reliable operation
- The actor supports two modes: browsing recent issues (last 30 days) and keyword search
- Full text is capped at 50,000 characters per act to keep dataset sizes manageable
- Rate limiting is built in to avoid triggering anti-bot measures
- The Gazzetta Ufficiale textual archive covers publications from 1988 onwards

### Proxy Configuration

This actor requires Italian residential proxies for reliable operation. The default configuration uses Apify's residential proxy pool with Italy as the country. If you experience blocking, try reducing `maxResults` or increasing the delay between requests.

### Cost Estimation

- Browsing 30 days of gazette issues: ~500-1500 acts depending on series
- Each act detail page adds one request
- Typical run: 100 acts in ~5-10 minutes
- Uses Playwright (higher compute cost than HTTP-only actors)

### Legal Notice

This actor scrapes publicly available data from the official Italian government gazette. The Gazzetta Ufficiale is a public record. Users are responsible for complying with applicable laws and the website's terms of service.

# Actor input Schema

## `searchQuery` (type: `string`):

Search by keywords in act titles (e.g. 'decreto legislativo', 'sicurezza', 'ambiente'). Leave empty and enable 'Browse Recent' to scrape the latest gazette issues.

## `actType` (type: `string`):

Filter by type of act. Only used in search mode.

## `series` (type: `string`):

Which series of the Gazzetta Ufficiale to search.

## `browseRecent` (type: `boolean`):

Browse the last 30 days of gazette issues instead of searching. Recommended for getting the latest publications.

## `dateFrom` (type: `string`):

Start date for search (format: dd/mm/yyyy). Only used in search mode.

## `dateTo` (type: `string`):

End date for search (format: dd/mm/yyyy). Only used in search mode.

## `maxResults` (type: `integer`):

Maximum number of acts to scrape.

## `proxyConfiguration` (type: `object`):

Select proxies to use for the scraper.

## Actor input object example

```json
{
  "actType": "",
  "series": "serie_generale",
  "browseRecent": true,
  "maxResults": 100,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ],
    "apifyProxyCountry": "IT"
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ],
        "apifyProxyCountry": "IT"
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("studio-amba/gazzetta-ufficiale-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
        "apifyProxyCountry": "IT",
    } }

# Run the Actor and wait for it to finish
run = client.actor("studio-amba/gazzetta-ufficiale-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ],
    "apifyProxyCountry": "IT"
  }
}' |
apify call studio-amba/gazzetta-ufficiale-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=studio-amba/gazzetta-ufficiale-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Gazzetta Ufficiale Scraper - Italian Official Gazette",
        "description": "Scrape laws, decrees, regulations, and public notices from Italy's official gazette (Gazzetta Ufficiale). Extract act titles, types, numbers, dates, issuing authorities, full text, and PDF links. Browse recent issues or search by keyword. No login or cookies required.",
        "version": "0.1",
        "x-build-id": "10R9MeoYGcFxtYeGz"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/studio-amba~gazzetta-ufficiale-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-studio-amba-gazzetta-ufficiale-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/studio-amba~gazzetta-ufficiale-scraper/runs": {
            "post": {
                "operationId": "runs-sync-studio-amba-gazzetta-ufficiale-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/studio-amba~gazzetta-ufficiale-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-studio-amba-gazzetta-ufficiale-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "searchQuery": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Search by keywords in act titles (e.g. 'decreto legislativo', 'sicurezza', 'ambiente'). Leave empty and enable 'Browse Recent' to scrape the latest gazette issues."
                    },
                    "actType": {
                        "title": "Act Type",
                        "enum": [
                            "",
                            "LEGGE",
                            "DECRETO LEGISLATIVO",
                            "DECRETO-LEGGE",
                            "DECRETO DEL PRESIDENTE DELLA REPUBBLICA",
                            "DECRETO",
                            "DELIBERA",
                            "ORDINANZA",
                            "COMUNICATO",
                            "REGOLAMENTO"
                        ],
                        "type": "string",
                        "description": "Filter by type of act. Only used in search mode.",
                        "default": ""
                    },
                    "series": {
                        "title": "Gazette Series",
                        "enum": [
                            "serie_generale",
                            "corte_costituzionale",
                            "unione_europea",
                            "regioni",
                            "concorsi",
                            "contratti",
                            "parte_seconda"
                        ],
                        "type": "string",
                        "description": "Which series of the Gazzetta Ufficiale to search.",
                        "default": "serie_generale"
                    },
                    "browseRecent": {
                        "title": "Browse Recent Issues",
                        "type": "boolean",
                        "description": "Browse the last 30 days of gazette issues instead of searching. Recommended for getting the latest publications.",
                        "default": true
                    },
                    "dateFrom": {
                        "title": "Date From",
                        "type": "string",
                        "description": "Start date for search (format: dd/mm/yyyy). Only used in search mode."
                    },
                    "dateTo": {
                        "title": "Date To",
                        "type": "string",
                        "description": "End date for search (format: dd/mm/yyyy). Only used in search mode."
                    },
                    "maxResults": {
                        "title": "Max Results",
                        "minimum": 1,
                        "maximum": 5000,
                        "type": "integer",
                        "description": "Maximum number of acts to scrape.",
                        "default": 100
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Select proxies to use for the scraper.",
                        "default": {
                            "useApifyProxy": true,
                            "apifyProxyGroups": [
                                "RESIDENTIAL"
                            ],
                            "apifyProxyCountry": "IT"
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
