# New York Business Entity Scraper (`scrapers_lat/ny-dos-business-scraper`) Actor

Search and extract New York business entity records (corporations, LLCs, limited partnerships) from the NY Department of State public inquiry database by name or DOS ID.

- **URL**: https://apify.com/scrapers_lat/ny-dos-business-scraper.md
- **Developed by:** [Michael Flores](https://apify.com/scrapers_lat) (community)
- **Categories:** Business, Lead generation, Automation
- **Stats:** 2 total users, 0 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $8.00 / 1,000 results

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.
Because this Actor supports Apify Store discounts, the price decreases on higher subscription plans.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event
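
As a rough illustration of the arithmetic, the sketch below estimates the cost of a run from the listed starting price. It assumes every returned record is billed as one result event at $8.00 per 1,000 results; the events actually charged and the per-plan prices are defined on the Actor's pricing page, and `estimate_cost` is a hypothetical helper, not part of any Apify library.

```python
from decimal import Decimal

# Assumption: the listed starting price; a discounted plan price may differ.
PRICE_PER_1000_RESULTS = Decimal("8.00")


def estimate_cost(result_count: int, price_per_1000: Decimal = PRICE_PER_1000_RESULTS) -> Decimal:
    """Estimate the charge in USD, assuming one billed event per result."""
    if result_count < 0:
        raise ValueError("result_count must be non-negative")
    return (price_per_1000 * result_count / 1000).quantize(Decimal("0.01"))


# A search that returns 300 records
print(estimate_cost(300))  # prints 2.40
```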

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## New York Business Entity Scraper

Search and extract business entity records from the **New York Department of State, Division of Corporations** public inquiry database. Look up corporations, LLCs, limited partnerships and other registered entities by name or by DOS ID, and get a clean structured record for each match.

This Actor talks directly to the official NY DOS public inquiry JSON API. No login, API key, or proxy is required.

### What you get

For every matching entity:

| Field | Description |
| --- | --- |
| `dosId` | NY Department of State ID number |
| `name` | Entity legal name (or assumed name record name) |
| `entityType` | Full entity type, e.g. `DOMESTIC BUSINESS CORPORATION`, `FOREIGN LIMITED LIABILITY COMPANY` |
| `entityTypeCategory` | High-level category, e.g. `CORPORATION`, `LIMITED LIABILITY COMPANY` |
| `status` | `Active` or `Inactive` |
| `filingDate` | Initial filing date (`YYYY-MM-DD`) |
| `county` | County on record |
| `jurisdiction` | State / country of formation, e.g. `New York, United States`, `Delaware, United States` |
| `nameType` | `ACTUAL` for the entity's own name, `ASSUMED` for a DBA record |
| `assumedName` | Assumed (DBA) name, when the record is an assumed name |
| `assumedNameId` | Assumed name ID, when present |
| `url` | Link to the entity's public display page on the NY DOS site |
| `registeredAgentName`* | Registered agent on file |
| `registeredAgentAddress`* | Registered agent street, city, state and ZIP |
| `dosProcessName`* | Agent designated for service of process (DOS process agent) |
| `dosProcessAddress`* | Service-of-process street, city, state and ZIP |
| `chairmanOrCeoName`* | Chairman or chief executive officer on record |
| `chairmanOrCeoAddress`* | Chairman / CEO street, city, state and ZIP |
| `physicalLocationName`* | Principal physical location name |
| `physicalLocationAddress`* | Principal physical location street, city, state and ZIP |
| `detailCounty`* | County confirmed from the open data registry |
| `detailJurisdiction`* | Formation jurisdiction confirmed from the open data registry |
| `scrapedAt` | ISO timestamp of collection |

*\*These fields are added when Fetch full details (`withDetails`) is enabled, which is the default. They come from New York's open data registry and are populated when the State publishes them for an entity.*

Missing values are returned as `null`. Nothing is invented.

### Input

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| `searchValue` | string (required) | `google` | Entity name (or part of it), or a DOS ID when `searchBy` is `id`. |
| `searchBy` | `name` \| `id` | `name` | Search by entity name or by DOS ID number. |
| `match` | `beginsWith` \| `contains` \| `partial` | `contains` | How the name is matched. Ignored when searching by DOS ID. |
| `entityStatus` | `all` \| `active` \| `inactive` | `all` | Restrict results by entity status. |
| `maxItems` | integer (1-300) | `50` | Maximum records to collect. |
| `withDetails` | boolean | `true` | Fetch registered agent, service-of-process, chairman / CEO and physical location addresses for each entity from New York's open data registry. |

#### Example input

```json
{
  "searchValue": "google",
  "searchBy": "name",
  "match": "contains",
  "entityStatus": "all",
  "maxItems": 50
}
````

#### Example output

```json
{
  "dosId": "4809335",
  "name": "GOOGLE ACCESS LLC",
  "entityType": "FOREIGN LIMITED LIABILITY COMPANY",
  "entityTypeCategory": "LIMITED LIABILITY COMPANY",
  "status": "Active",
  "filingDate": "2015-08-24",
  "county": "Albany",
  "jurisdiction": "Delaware, United States",
  "nameType": "ACTUAL",
  "assumedName": null,
  "assumedNameId": null,
  "url": "https://apps.dos.ny.gov/publicInquiry/EntityDisplay?dosId=4809335",
  "registeredAgentName": "INCORP SERVICES, INC.",
  "registeredAgentAddress": {
    "street": "ONE COMMERCE PLAZA, 99 WASHINGTON AVE SUITE 805-A",
    "city": "ALBANY",
    "state": "NEW YORK",
    "zip": "12210-2822"
  },
  "dosProcessName": "INCORP SERVICES, INC.",
  "dosProcessAddress": {
    "street": "ONE COMMERCE PLAZA, 99 WASHINGTON AVE SUITE 805-A",
    "city": "ALBANY",
    "state": "NY",
    "zip": "12210-2822"
  },
  "chairmanOrCeoName": null,
  "chairmanOrCeoAddress": null,
  "physicalLocationName": null,
  "physicalLocationAddress": null,
  "detailCounty": "Queens",
  "detailJurisdiction": "New York",
  "scrapedAt": "2026-06-22T18:09:39.947Z"
}
```
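
Because address fields are either an object with `street`, `city`, `state` and `zip` keys or `null`, downstream code has to handle both cases. The following is a minimal sketch of one way to flatten an address into a single line; `format_address` is a hypothetical helper, and the sample record only reuses values from the example output above.

```python
from typing import Optional


def format_address(address: Optional[dict]) -> Optional[str]:
    """Join an address object into one line; null or empty addresses stay None."""
    if not address:
        return None
    parts = [address.get("street"), address.get("city"), address.get("state")]
    line = ", ".join(part for part in parts if part)
    zip_code = address.get("zip")
    if zip_code:
        return f"{line} {zip_code}".strip()
    return line or None


record = {
    "registeredAgentAddress": {
        "street": "ONE COMMERCE PLAZA, 99 WASHINGTON AVE SUITE 805-A",
        "city": "ALBANY",
        "state": "NEW YORK",
        "zip": "12210-2822",
    },
    "chairmanOrCeoAddress": None,
}

print(format_address(record["registeredAgentAddress"]))
print(format_address(record["chairmanOrCeoAddress"]))  # prints None
```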

### Notes and limits

- **Result cap.** The NY DOS database returns at most **300 matches per search**, in a single response. To narrow broad searches, use a more specific `searchValue`, set `match` to `beginsWith`, or filter by `entityStatus`.
- **Full contact details.** With `withDetails` enabled (default), each entity is enriched from New York's open data registry with the registered agent, the agent designated for service of process, the chairman / CEO and the principal physical location, each with a full mailing address. These fields are populated when the State publishes them for an entity and returned as `null` otherwise. Set `withDetails` to false to return search-level fields only.
- One clean record is produced per search result row, including assumed-name (DBA) rows where applicable.
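
The notes above suggest two small post-processing steps: checking whether a search may have hit the 300-match cap, and collapsing assumed-name rows onto their entity. The sketch below shows one possible approach. Both helpers and the sample rows are hypothetical, and it assumes that `assumedName` carries the DBA on rows where `nameType` is `ASSUMED`.

```python
from collections import defaultdict

RESULT_CAP = 300  # documented maximum number of matches per search


def may_be_truncated(items: list) -> bool:
    """True when a search returned the full cap, so more matches may exist.

    Only meaningful when the run used maxItems = 300.
    """
    return len(items) >= RESULT_CAP


def group_by_entity(items: list) -> dict:
    """Collect rows sharing a dosId, separating legal and assumed names."""
    grouped = defaultdict(lambda: {"legalName": None, "assumedNames": []})
    for row in items:
        entity = grouped[row["dosId"]]
        if row.get("nameType") == "ASSUMED":
            # Assumption: assumedName holds the DBA on assumed-name rows.
            if row.get("assumedName"):
                entity["assumedNames"].append(row["assumedName"])
        else:
            entity["legalName"] = row.get("name")
    return dict(grouped)


# Made-up rows for illustration only
rows = [
    {"dosId": "1", "name": "ACME LLC", "nameType": "ACTUAL", "assumedName": None},
    {"dosId": "1", "name": "ACME LLC", "nameType": "ASSUMED", "assumedName": "ACME TOOLS"},
]
print(group_by_entity(rows))
print(may_be_truncated(rows))  # prints False
```

If `may_be_truncated` returns true, narrowing the search as described above (a more specific `searchValue`, `beginsWith` matching, or an `entityStatus` filter) is the way to reach the remaining matches.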

### Use cases

- KYC, due diligence and compliance checks on New York entities.
- Lead generation and enrichment from a company name.
- Verifying entity existence, status, type and formation jurisdiction.

### Disclaimer

This Actor retrieves **public records** published by the New York Department of State, Division of Corporations. The data is provided by the State of New York for public inquiry. You are responsible for using the data lawfully, including for KYC, due diligence and lead-generation purposes, and for complying with applicable privacy and data-protection rules in your jurisdiction. This Actor is not affiliated with or endorsed by the New York Department of State.

### Related scrapers

Need data from the same space? Here are other scrapers we build and maintain:

- [USAspending Federal Award Scraper](https://apify.com/scrapers_lat/usaspending-awards-scraper): Scrape US federal prime awards from USAspending.gov. Get recipient, UEI, award amount, awarding...
- [Grants.gov Opportunity Scraper](https://apify.com/scrapers_lat/grants-gov-scraper): Extract U.S. federal grant opportunities from the official Grants.gov API. Scrape title, agency...
- [NPPES NPI Provider Scraper](https://apify.com/scrapers_lat/nppes-npi-scraper): Scrape US healthcare provider records from the official NPPES NPI Registry. Search by name...
- [BizQuest Businesses-for-Sale Listings Scraper](https://apify.com/scrapers_lat/bizquest-scraper): Extract US businesses-for-sale listings from BizQuest with the broker or seller contact (name...
- [California CSLB Contractor License Scraper](https://apify.com/scrapers_lat/california-cslb-scraper): Extract licensed California contractor records from the CSLB public data portal by classification...
- [Florida Professional License Scraper](https://apify.com/scrapers_lat/florida-dbpr-scraper): Scrape Florida DBPR professional license records by name, business, or license number. Get licensee...

### More scrapers at scrapers.lat

This Actor is built and maintained by [scrapers.lat](https://scrapers.lat), where we publish scrapers for Latin American and US public platforms: real estate, jobs, e-commerce, company registries and government data. Browse the full catalog, see live sample output for each one, or ask us for a custom scraper at [scrapers.lat](https://scrapers.lat).

# Actor input Schema

## `maxItems` (type: `integer`):

Maximum number of entity records to collect. The source database returns at most 300 matches per search.

## `withDetails` (type: `boolean`):

Fetch the registered agent, DOS process agent, chairman/CEO and physical location addresses for each entity from New York's open data registry. Adds one fast request per entity.

## `searchValue` (type: `string`):

The entity name (or part of it) to search for, or a DOS ID number when 'Search by' is set to DOS ID.

## `searchBy` (type: `string`):

Whether 'Search value' is an entity name or a DOS ID number.

## `match` (type: `string`):

How the search value is matched against entity names. 'Begins with' matches names starting with the value, 'Contains' matches names containing the value anywhere, 'Partial' matches partial words. Ignored when searching by DOS ID.

## `entityStatus` (type: `string`):

Restrict results to active entities, inactive entities, or include all statuses.

## Actor input object example

```json
{
  "maxItems": 50,
  "withDetails": true,
  "searchValue": "google",
  "searchBy": "name",
  "match": "contains",
  "entityStatus": "all"
}
```
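
Validating an input object locally before starting a run can catch out-of-range values early. The sketch below checks the constraints documented above (the allowed values for `searchBy`, `match` and `entityStatus`, and the 1 to 300 range for `maxItems`). `build_input` is a hypothetical helper, and its defaults mirror the ones listed in this document.

```python
ALLOWED_VALUES = {
    "searchBy": {"name", "id"},
    "match": {"beginsWith", "contains", "partial"},
    "entityStatus": {"all", "active", "inactive"},
}


def build_input(search_value, search_by="name", match="contains",
                entity_status="all", max_items=50, with_details=True):
    """Return an Actor input object, raising ValueError on out-of-schema values."""
    if not isinstance(search_value, str) or not search_value.strip():
        raise ValueError("searchValue is required")
    choices = {"searchBy": search_by, "match": match, "entityStatus": entity_status}
    for field, value in choices.items():
        if value not in ALLOWED_VALUES[field]:
            raise ValueError(f"{field} must be one of {sorted(ALLOWED_VALUES[field])}")
    # bool is a subclass of int in Python, so reject it explicitly
    if isinstance(max_items, bool) or not isinstance(max_items, int) or not 1 <= max_items <= 300:
        raise ValueError("maxItems must be an integer from 1 to 300")
    return {
        "searchValue": search_value.strip(),
        "searchBy": search_by,
        "match": match,
        "entityStatus": entity_status,
        "maxItems": max_items,
        "withDetails": bool(with_details),
    }


print(build_input("google", match="beginsWith", max_items=300))
```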

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "maxItems": 50,
    "searchValue": "google"
};

// Run the Actor and wait for it to finish
const run = await client.actor("scrapers_lat/ny-dos-business-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "maxItems": 50,
    "searchValue": "google",
}

# Run the Actor and wait for it to finish
run = client.actor("scrapers_lat/ny-dos-business-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "maxItems": 50,
  "searchValue": "google"
}' |
apify call scrapers_lat/ny-dos-business-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=scrapers_lat/ny-dos-business-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "New York Business Entity Scraper",
        "description": "Search and extract New York business entity records (corporations, LLCs, limited partnerships) from the NY Department of State public inquiry database by name or DOS ID.",
        "version": "0.1",
        "x-build-id": "dAfEEbi1kWwceXRe5"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/scrapers_lat~ny-dos-business-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-scrapers_lat-ny-dos-business-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/scrapers_lat~ny-dos-business-scraper/runs": {
            "post": {
                "operationId": "runs-sync-scrapers_lat-ny-dos-business-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/scrapers_lat~ny-dos-business-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-scrapers_lat-ny-dos-business-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "searchValue"
                ],
                "properties": {
                    "maxItems": {
                        "title": "Max items",
                        "minimum": 1,
                        "maximum": 300,
                        "type": "integer",
                        "description": "Maximum number of entity records to collect. The source database returns at most 300 matches per search."
                    },
                    "withDetails": {
                        "title": "Fetch full details",
                        "type": "boolean",
                        "description": "Fetch the registered agent, DOS process agent, chairman/CEO and physical location addresses for each entity from New York's open data registry. Adds one fast request per entity.",
                        "default": true
                    },
                    "searchValue": {
                        "title": "Search value",
                        "type": "string",
                        "description": "The entity name (or part of it) to search for, or a DOS ID number when 'Search by' is set to DOS ID."
                    },
                    "searchBy": {
                        "title": "Search by",
                        "enum": [
                            "name",
                            "id"
                        ],
                        "type": "string",
                        "description": "Whether 'Search value' is an entity name or a DOS ID number.",
                        "default": "name"
                    },
                    "match": {
                        "title": "Name match mode",
                        "enum": [
                            "beginsWith",
                            "contains",
                            "partial"
                        ],
                        "type": "string",
                        "description": "How the search value is matched against entity names. 'Begins with' matches names starting with the value, 'Contains' matches names containing the value anywhere, 'Partial' matches partial words. Ignored when searching by DOS ID.",
                        "default": "contains"
                    },
                    "entityStatus": {
                        "title": "Entity status",
                        "enum": [
                            "all",
                            "active",
                            "inactive"
                        ],
                        "type": "string",
                        "description": "Restrict results to active entities, inactive entities, or include all statuses.",
                        "default": "all"
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
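
For projects that do not use a client library, the `run-sync-get-dataset-items` path in the specification above can be called directly. The following is a minimal sketch using only the Python standard library. The helper names and the 300-second timeout are assumptions, and `fetch_items` assumes the endpoint returns the dataset items as a JSON array.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "scrapers_lat~ny-dos-business-scraper"


def build_sync_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build the POST request for the run-sync-get-dataset-items path."""
    query = urllib.parse.urlencode({"token": token})
    url = f"{API_BASE}/acts/{ACTOR_PATH}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_items(token: str, actor_input: dict, timeout: float = 300.0) -> list:
    """Send the request and decode the response body as JSON (makes a network call)."""
    request = build_sync_request(token, actor_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_items("<YOUR_API_TOKEN>", {"searchValue": "google", "maxItems": 50})` would start a run and wait for its results, so it is billed like any other run.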
