# GLEIF Company & LEI Scraper (`benthepythondev/gleif-lei-scraper`) Actor

Search the GLEIF database for companies by name or LEI code and get clean legal-entity records: legal name, LEI, status, jurisdiction, legal form, registered and HQ address, plus registration metadata. Fast and reliable via the public GLEIF API, no key. For KYC, AML and B2B data.

- **URL**: https://apify.com/benthepythondev/gleif-lei-scraper.md
- **Developed by:** [ben](https://apify.com/benthepythondev) (community)
- **Categories:** Business, Lead generation, Other
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $3.00 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## 🏢 GLEIF Company & LEI Scraper

Search the global **GLEIF** database for **companies** by name or **LEI** code and get clean, structured legal-entity records — legal name, LEI, status, jurisdiction, legal form, registered address, headquarters address and full registration metadata. Powered by the public GLEIF API, so it's fast and reliable: no browser, no login, no API key.

Built for **KYC / AML**, counterparty due diligence, B2B data enrichment and compliance research. Export to JSON/CSV/Excel, run on a schedule, call via API, or connect to Make, Zapier or n8n.

### 🔎 What is the GLEIF Company & LEI Scraper?

The Legal Entity Identifier (LEI) is a 20-character global ID for companies that trade or report in financial markets. Give this actor a company name (e.g. "Apple Inc") or an LEI, and it returns matching, officially-registered legal entities — the legal name, where they're registered, their status and their address — as structured rows. Perfect for verifying counterparties and enriching a B2B list with authoritative data.

#### What data does it extract?

- **LEI** code and **legal name**
- **Other / trading names**
- **Entity status** (active, inactive) and **registration status**
- **Jurisdiction** and **legal form**
- **Registered-as** number (local company number)
- **Legal address** and **headquarters address** (lines, city, region, country, postcode)
- **Initial registration**, **last update** and **next renewal** dates
- **Managing LOU** and **BIC** codes (where available)
- A direct **GLEIF record URL**

### ⬇️ Input

| Field | Type | Description |
|-------|------|-------------|
| `searchTerms` | array | Company names or 20-character LEI codes. |
| `maxPerTerm` | integer | Max records per term. Default `25`. |

#### Example input

```json
{
  "searchTerms": ["Apple Inc", "Volkswagen AG"],
  "maxPerTerm": 25
}
````

### ⬆️ Output

One record per legal entity:

```json
{
  "lei": "HWUPKR0MPOU8FGXBT394",
  "legal_name": "APPLE INC.",
  "other_names": [],
  "status": "ACTIVE",
  "jurisdiction": "US-CA",
  "category": "GENERAL",
  "legal_form_code": "XTIQ",
  "registered_as": "C0806592",
  "legal_address": {
    "lines": ["ONE APPLE PARK WAY"],
    "city": "CUPERTINO",
    "region": "US-CA",
    "country": "US",
    "postal_code": "95014"
  },
  "headquarters_address": {
    "lines": ["ONE APPLE PARK WAY"],
    "city": "CUPERTINO",
    "region": "US-CA",
    "country": "US",
    "postal_code": "95014"
  },
  "registration_status": "ISSUED",
  "initial_registration_date": "2012-06-06T18:42:00Z",
  "last_update_date": "2024-05-21T09:00:00Z",
  "next_renewal_date": "2025-06-06T18:42:00Z",
  "managing_lou": "EVK05KS7XY1DEII3R011",
  "bic": [],
  "url": "https://search.gleif.org/#/record/HWUPKR0MPOU8FGXBT394",
  "query": "Apple Inc"
}
```

### 💡 Use cases

- ✅ **KYC / AML** — verify a counterparty's legal name, address and status.
- 🔎 **Due diligence** — confirm a company is a real, registered legal entity.
- 🏷️ **B2B enrichment** — append authoritative legal data to your CRM list.
- 📊 **Compliance datasets** — build a structured entity reference table.

### ❓ FAQ

**Do I need an API key or login?** No — it uses the public GLEIF API.

**Can I search by company name or LEI?** Both — pass either; LEIs are detected automatically.

**Is the address included?** Yes — both legal and headquarters addresses.

**What is an LEI?** A 20-character global identifier for legal entities in financial markets.

**Can I look up several companies at once?** Yes — pass an array of names or LEIs.

**Does it include registration dates?** Yes — initial, last update and next renewal.

**How does pricing work?** Pay per record returned. No subscription.

**Is it legal?** GLEIF data is open, public reference data on legal entities (not individuals). Use responsibly and within GLEIF's terms; for any personal data, follow GDPR/CCPA.

### ⚙️ How it works

The scraper calls the GLEIF LEI-records API directly — no browser and no key. It detects whether each term is an LEI or a company name, paginates through matches, de-duplicates by LEI, and normalizes the JSON:API response into flat, consistent fields (name, status, jurisdiction, addresses, registration dates). Runs are fast and dependable, which is why the actor keeps passing its daily health check. The same input shape works for one lookup or a long list of companies.

### 👥 Who uses company / LEI data?

Legal-entity data is valuable to compliance teams, fintechs, sales teams and analysts. A compliance officer screens counterparties for KYC; a fintech validates onboarding data; a sales team enriches accounts with verified legal names and addresses; an analyst maps corporate structures. Because every record is plain JSON with consistent fields, it drops straight into a spreadsheet, CRM, BI tool or compliance workflow with no custom parsing.

### 📤 Export, schedule & integrate

Every run is saved to a dataset you can export to **JSON, CSV, Excel, XML or RSS**, or pull through the **Apify API**. Wire it into **Make, Zapier, n8n, Google Sheets, Slack** or your **own database**, run it on a **schedule** to refresh entity data, and call it from AI agents through the **Apify MCP server**.

### 💡 Tips for best results

- Use the full legal name (e.g. "Volkswagen AG") for the tightest matches.
- Pass an LEI directly when you already have it for an exact record.
- Schedule a run to catch status or address changes for your counterparties.
- Combine with the SEC EDGAR scraper to enrich US companies with filings.

### ❓ More FAQ

**How fresh is the data?** It is fetched live on each run from GLEIF.

**Can I run it automatically?** Yes — use Apify Schedules (cron).

**Are duplicates removed?** Yes — by LEI within each run.

**Which export formats?** JSON, CSV, Excel, XML and RSS, plus the Apify API.

**Can AI agents use it?** Yes — via the Apify API and MCP server.

### 🔗 You might also like

- [SEC EDGAR Filings Scraper](https://apify.com/benthepythondev/sec-edgar-filings-scraper) — US company filings.
- [Website Contact Extractor](https://apify.com/benthepythondev/website-contact-extractor) — emails & phones from sites.
- [Real Estate Agent Lead Scraper](https://apify.com/benthepythondev/real-estate-agent-lead-scraper) — agent contact leads.

***

**Keywords:** gleif scraper, lei scraper, lei api, legal entity identifier, company data, kyc data, aml screening, company lookup, b2b enrichment, counterparty data, due diligence, compliance data, company registration, entity data, business data

# Actor input Schema

## `searchTerms` (type: `array`):

Company names or 20-character LEI codes to search.

## `maxPerTerm` (type: `integer`):

Max records per search term.

## Actor input object example

```json
{
  "searchTerms": [
    "Apple Inc"
  ],
  "maxPerTerm": 10
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "searchTerms": [
        "Apple Inc"
    ],
    "maxPerTerm": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("benthepythondev/gleif-lei-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "searchTerms": ["Apple Inc"],
    "maxPerTerm": 10,
}

# Run the Actor and wait for it to finish
run = client.actor("benthepythondev/gleif-lei-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "searchTerms": [
    "Apple Inc"
  ],
  "maxPerTerm": 10
}' |
apify call benthepythondev/gleif-lei-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=benthepythondev/gleif-lei-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "GLEIF Company & LEI Scraper",
        "description": "Search the GLEIF database for companies by name or LEI code and get clean legal-entity records: legal name, LEI, status, jurisdiction, legal form, registered and HQ address, plus registration metadata. Fast and reliable via the public GLEIF API, no key. For KYC, AML and B2B data.",
        "version": "1.0",
        "x-build-id": "6FcDfHq7eKIrhXFOt"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/benthepythondev~gleif-lei-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-benthepythondev-gleif-lei-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/benthepythondev~gleif-lei-scraper/runs": {
            "post": {
                "operationId": "runs-sync-benthepythondev-gleif-lei-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/benthepythondev~gleif-lei-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-benthepythondev-gleif-lei-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "searchTerms": {
                        "title": "Company names or LEIs",
                        "type": "array",
                        "description": "Company names or 20-character LEI codes to search.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxPerTerm": {
                        "title": "Max per term",
                        "minimum": 1,
                        "maximum": 200,
                        "type": "integer",
                        "description": "Max records per search term.",
                        "default": 25
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
