# Thegazelatteco Listings Spider (`getdataforme/thegazelatteco-listings-spider`) Actor

This Apify Actor automates scraping insolvency notices from The Gazette website, extracting key data like company names, notice IDs, dates, and abstracts....

- **URL**: https://apify.com/getdataforme/thegazelatteco-listings-spider.md
- **Developed by:** [GetDataForMe](https://apify.com/getdataforme) (community)
- **Categories:** AI, E-commerce, Other
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $9.00 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### Description

This Apify Actor automates scraping insolvency notices from The Gazette website, extracting key data like company names, notice IDs, dates, and abstracts....

---

## Thegazelatteco Listings Spider

### Introduction
The Thegazelatteco Listings Spider is a powerful Apify Actor designed to scrape and extract insolvency notices from The Gazette website (thegazette.co.uk). It automates the collection of detailed corporate insolvency data, including company names, notice IDs, publication dates, and abstracts, enabling efficient data gathering for legal, financial, and research purposes. This tool saves time and ensures reliable access to public insolvency records without manual browsing.

### Features
- **Comprehensive Data Extraction**: Scrapes key details from insolvency notices, such as notice IDs, URLs, publication dates, company names, categories, and abstracts.
- **Flexible URL Configuration**: Allows input of multiple URLs to target specific pages or sections on The Gazette.
- **High Reliability**: Built with robust error handling to manage website changes and ensure consistent data retrieval.
- **Structured Output**: Delivers clean, JSON-formatted data ready for integration into databases, analytics tools, or reports.
- **Scalable Performance**: Handles large volumes of notices efficiently, with support for pagination and batch processing.
- **Apify Integration**: Seamlessly runs on the Apify platform, with easy monitoring, scheduling, and export options.
- **Data Quality Assurance**: Filters and validates extracted data to minimize errors and duplicates.

### Input Parameters

| Parameter | Type | Required | Description | Example |
|-----------|------|----------|-------------|---------|
| Urls | array | No | An array of URLs to scrape insolvency notices from. Each URL must be a valid HTTP or HTTPS link. Defaults to the main corporate insolvency page. | `["https://m.thegazette.co.uk/corporate-insolvency"]` |

### Example Usage

#### Input Configuration
```json
{
  "Urls": [
    "https://m.thegazette.co.uk/corporate-insolvency"
  ]
}
````

#### Example Output

```json
[
  {
    "notice_id": "5115106",
    "notice_url": "https://m.thegazette.co.uk/notice/5115106",
    "publication_date": "17 April 2026",
    "company_name": "SUPREME LOSS PREVENTION LIMITED",
    "category": "Petitions to Wind Up (Companies)",
    "abstract": "In the High Court of Justice (Chancery Division) Companies Court No CR-2026-001908 of 2026 In the Matter of SUPREME LOSS PREVENTION LIMITED (Company Number 13726537 ) and in the Matter of the Insolven\u2026",
    "page_number": "1",
    "actor_id": "M7fU4vMecZnAZDdIC",
    "run_id": "xYXwQtLeH2UXqHrsb"
  },
  {
    "notice_id": "5115105",
    "notice_url": "https://m.thegazette.co.uk/notice/5115105",
    "publication_date": "17 April 2026",
    "company_name": "LENNON'S LEGACY SPV LIMITED",
    "category": "Petitions to Wind Up (Companies)",
    "abstract": "In the High Court of Justice (Chancery Division) Companies Court No CR-2026-001957 of 2026 In the Matter of LENNON'S LEGACY SPV LIMITED (Company Number 11212192 ) and in the Matter of the Insolvency A\u2026",
    "page_number": "1",
    "actor_id": "M7fU4vMecZnAZDdIC",
    "run_id": "xYXwQtLeH2UXqHrsb"
  },
  {
    "notice_id": "5115104",
    "notice_url": "https://m.thegazette.co.uk/notice/5115104",
    "publication_date": "17 April 2026",
    "company_name": "WESTERN VEHICLES LIMITED",
    "category": "Petitions to Wind Up (Companies)",
    "abstract": "In the High Court of Justice (Chancery Division) Companies Court No CR-2026-002129 of 2026 In the Matter of WESTERN VEHICLES LIMITED (Company Number 07331083 ) and in the Matter of the Insolvency Act\u2026",
    "page_number": "1",
    "actor_id": "M7fU4vMecZnAZDdIC",
    "run_id": "xYXwQtLeH2UXqHrsb"
  }
]
```

### Use Cases

- **Legal Research**: Gather insolvency notices for case studies, compliance checks, or due diligence in corporate law.
- **Financial Analysis**: Monitor company insolvencies for risk assessment, investment decisions, or market trend analysis.
- **Business Intelligence**: Track competitors or industry players facing insolvency to inform strategic planning.
- **Academic Studies**: Collect data for research on economic downturns, bankruptcy patterns, or corporate governance.
- **Regulatory Compliance**: Automate monitoring of public notices for regulatory bodies or auditors.
- **Data Aggregation**: Build datasets for reporting, dashboards, or integration with CRM systems.

### Installation and Usage

1. Search for "Thegazelatteco Listings Spider" in the Apify Store
2. Click "Try for free" or "Run"
3. Configure input parameters
4. Click "Start" to begin extraction
5. Monitor progress in the log
6. Export results in your preferred format (JSON, CSV, Excel)

### Output Format

The Actor outputs data in JSON format as an array of objects. Each object represents a single insolvency notice and includes fields like `notice_id` (unique identifier), `notice_url` (link to the full notice), `publication_date` (date of publication), `company_name` (name of the insolvent company), `category` (type of notice, e.g., "Petitions to Wind Up"), `abstract` (summary text), `page_number` (page where found), and metadata like `actor_id` and `run_id`. This structure ensures easy parsing and integration.

### Support

For custom/simplified outputs or bug reports, please contact:

- Email: support@getdataforme.com
- Subject line: "custom support"
- Contact form: https://getdataforme.com/contact/

We're here to help you get the most out of this Actor!

# Actor input Schema

## `Urls` (type: `array`):

The urls for the spider.

## `itemLimit` (type: `integer`):

Specifies the maximum number of items to process.

## Actor input object example

```json
{
  "Urls": [
    "https://m.thegazette.co.uk/corporate-insolvency"
  ],
  "itemLimit": 20
}
```

# Actor output Schema

## `results` (type: `string`):

Scraped data items from dataset

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "itemLimit": 20
};

// Run the Actor and wait for it to finish
const run = await client.actor("getdataforme/thegazelatteco-listings-spider").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "itemLimit": 20 }

# Run the Actor and wait for it to finish
run = client.actor("getdataforme/thegazelatteco-listings-spider").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "itemLimit": 20
}' |
apify call getdataforme/thegazelatteco-listings-spider --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=getdataforme/thegazelatteco-listings-spider",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Thegazelatteco Listings Spider",
        "description": "This Apify Actor automates scraping insolvency notices from The Gazette website, extracting key data like company names, notice IDs, dates, and abstracts....",
        "version": "0.0",
        "x-build-id": "IbF1whouY4KMiTz0W"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/getdataforme~thegazelatteco-listings-spider/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-getdataforme-thegazelatteco-listings-spider",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/getdataforme~thegazelatteco-listings-spider/runs": {
            "post": {
                "operationId": "runs-sync-getdataforme-thegazelatteco-listings-spider",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/getdataforme~thegazelatteco-listings-spider/run-sync": {
            "post": {
                "operationId": "run-sync-getdataforme-thegazelatteco-listings-spider",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "Urls": {
                        "title": "Urls",
                        "minItems": 1,
                        "type": "array",
                        "description": "The urls for the spider.",
                        "default": [
                            "https://m.thegazette.co.uk/corporate-insolvency"
                        ],
                        "items": {
                            "type": "string"
                        }
                    },
                    "itemLimit": {
                        "title": "Item Limit",
                        "type": "integer",
                        "description": "Specifies the maximum number of items to process.",
                        "default": 20
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
