USA New York Company Registry Scraper — B2B Leads

Pricing

from $1.99 / 1,000 results

Scrape New York State's official Active Corporations registry (data.ny.gov). Get DOS ID, entity name, type, jurisdiction, county, filing date and registered agent name & address. Filter by full-text query, county & entity type. No login, no API key.

Developer

Logiover

Maintained by Community

📌 Overview

The New York Company Registry Scraper turns data.ny.gov — New York State's official Active Corporations open-data dataset — into an analysis-ready dataset. It reads directly from the Socrata open-data API (data.ny.gov/resource/n9v6-gdp6.json), the same open data New York publishes, so there is no login, no API key and no captcha.

For every active New York entity you get the DOS ID, current entity name, entity type, jurisdiction, county, initial DOS filing date and the registered agent (DOS process) name and full mailing address — ready-to-use B2B and compliance data.

Perfect for New York B2B lead generation, sales prospecting, market research, CRM enrichment, KYC / compliance and competitor mapping — exported to CSV, Excel, JSON or pushed to Google Sheets and your CRM via API.

✅ Combine a county + entity-type filter and raise maxResults to assemble a large, targeted New York prospect list in a single run.


❓ What does the New York company registry scraper do?

It queries the official Active Corporations Socrata API with your filters, flattens each record into clean spreadsheet-ready columns, de-dupes by DOS ID, paginates automatically with $limit / $offset and returns a structured dataset. You set filters in a simple form; you get a clean dataset back — no code, no registry account, no Socrata / SoQL integration work.
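
The paging and de-duplication described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the actor's actual code: `fetch_page` is a hypothetical callable standing in for the HTTP request, and the page size and ceiling are taken from the FAQ figures.

```python
from typing import Callable, Dict, Iterator, List

PAGE_SIZE = 1000        # page size quoted in the FAQ
MAX_OFFSET = 100_000    # deep-offset safety ceiling quoted in the FAQ

Record = Dict[str, str]


def iter_unique_records(
    fetch_page: Callable[[int, int], List[Record]],  # hypothetical: (limit, offset) -> rows
    max_results: int = 1000,
) -> Iterator[Record]:
    """Page with limit/offset and yield each DOS ID at most once."""
    seen = set()
    offset = 0
    emitted = 0
    while offset < MAX_OFFSET:
        page = fetch_page(PAGE_SIZE, offset)  # corresponds to $limit / $offset
        for record in page:
            dos_id = record.get("dos_id")
            if dos_id is None or dos_id in seen:
                continue  # skip records without an ID and repeats
            seen.add(dos_id)
            yield record
            emitted += 1
            if max_results and emitted >= max_results:
                return  # cap reached (0 means no cap below the ceiling)
        if len(page) < PAGE_SIZE:
            return  # a short page means the result set is exhausted
        offset += PAGE_SIZE
```

Keeping the fetch step behind a callable makes the loop easy to exercise with an in-memory list before pointing it at the live API.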


✨ Features

| Feature | Description |
| --- | --- |
| 🎯 New York B2B lead data | DOS ID, entity name, type, jurisdiction, county and filing date, plus the registered agent's name and mailing address. |
| 🔎 Full-text & field filtering | Free-text query ($q) plus county and entityType filters, ANDed into a SoQL $where. |
| 🆕 Find new companies | Sort by initial DOS filing date to surface the newest active New York businesses first. |
| 📦 Bulk export | Up to ~100,000 records per query, auto-paginated; export to JSON, CSV, Excel, HTML or pull via API & webhooks. |
| 🔓 No API key needed | Reads the official open NY State data.ny.gov API: no login, no key, no signup, no captcha. |
| 🧹 Clean, flat output | Socrata records flattened into tidy, spreadsheet-ready columns, de-duped by DOS ID. |
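
As an illustration of how the county and entityType filters could be ANDed into a SoQL $where, here is a minimal sketch. The column names `county` and `entity_type` are assumptions about the source dataset, and the helper names are hypothetical; the actor's real query builder may differ.

```python
from typing import Optional


def soql_literal(value: str) -> str:
    """Quote a string for SoQL, doubling any embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def build_where(
    county: Optional[str] = None,
    entity_type: Optional[str] = None,
) -> Optional[str]:
    """AND the supplied filters together; return None when no filter is set."""
    clauses = []
    if county:
        # "county" is an assumed column name
        clauses.append("county = " + soql_literal(county.strip().upper()))
    if entity_type:
        # "entity_type" is an assumed column name
        clauses.append("entity_type = " + soql_literal(entity_type.strip().upper()))
    return " AND ".join(clauses) if clauses else None
```

Values are upper-cased because the input fields expect uppercase county and entity type strings.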

📊 What data you get

One clean row per company. Verified output fields:

| Field | Description |
| --- | --- |
| dosId | NY Department of State entity ID (dos_id) |
| name | Current entity name (current_entity_name) |
| entityType | Entity type, e.g. DOMESTIC BUSINESS CORPORATION |
| jurisdiction | Jurisdiction of formation |
| county | New York county |
| filingDate | Initial DOS filing date (initial_dos_filing_date) |
| agentName | Registered agent / DOS process name (dos_process_name) |
| agentAddress | Registered agent mailing street address |
| agentCity / agentState / agentZip | Registered agent city, state and postal code |
| url | Direct data.ny.gov API URL for the company |
| scrapedAt | ISO timestamp of the run |
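
The flattening step can be pictured as a plain key mapping. In the sketch below, only `dos_id`, `current_entity_name`, `initial_dos_filing_date` and `dos_process_name` are source keys named in the table; every other source key is an assumption, marked as such in the comments, and `flatten_record` is a hypothetical name.

```python
from datetime import datetime, timezone
from typing import Any, Dict, Optional

API_URL = "https://data.ny.gov/resource/n9v6-gdp6.json"


def flatten_record(raw: Dict[str, Any], scraped_at: Optional[str] = None) -> Dict[str, Any]:
    """Map one raw Socrata record onto the flat output columns."""
    dos_id = raw.get("dos_id")
    return {
        "dosId": dos_id,
        "name": raw.get("current_entity_name"),
        "entityType": raw.get("entity_type"),              # assumed source key
        "jurisdiction": raw.get("jurisdiction"),            # assumed source key
        "county": raw.get("county"),                        # assumed source key
        "filingDate": raw.get("initial_dos_filing_date"),
        "agentName": raw.get("dos_process_name"),
        "agentAddress": raw.get("dos_process_address_1"),   # assumed source key
        "agentCity": raw.get("dos_process_city"),           # assumed source key
        "agentState": raw.get("dos_process_state"),         # assumed source key
        "agentZip": raw.get("dos_process_zip"),             # assumed source key
        "url": f"{API_URL}?dos_id={dos_id}" if dos_id else None,
        "scrapedAt": scraped_at or datetime.now(timezone.utc).isoformat(),
    }
```

Missing source keys come through as `None`, so every row has the same columns when exported to a spreadsheet.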

🚀 How to use

  1. Open the actor and set your filters — a full-text query, a county, and/or an entityType — or leave them empty to stream the whole registry.
  2. Pick an order (newest / oldest / name) and set maxResults.
  3. Run, then export the dataset to CSV, Excel, JSON or Google Sheets.

Input fields

| Field | Type | Description |
| --- | --- | --- |
| query | string | Free-text full-text search ($q), mostly matches the company name, e.g. consulting |
| county | string | NY county (uppercase), e.g. NEW YORK, KINGS, QUEENS, ERIE |
| entityType | string | Entity type (uppercase), e.g. DOMESTIC BUSINESS CORPORATION |
| order | select | newest / oldest (by filing date) / name (A→Z) |
| maxResults | integer | Cap (default 1000, 0 = unlimited; safety ceiling ~100k records) |
| useProxy | boolean | Route through Apify Proxy (datacenter). Default true |
| proxyConfiguration | object | Proxy override |
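
To make the defaults in the table concrete, here is a hedged sketch of how such an input could be normalized before a run. `normalize_input` is a hypothetical helper; treating values above the ceiling as the ceiling, and leaving `order` unset when omitted, are assumptions rather than documented behavior.

```python
from typing import Any, Dict, Optional

DEFAULT_MAX_RESULTS = 1000      # default from the input table
SAFETY_CEILING = 100_000        # "~100k records" ceiling from the input table
VALID_ORDERS = ("newest", "oldest", "name")


def normalize_input(raw: Dict[str, Any]) -> Dict[str, Any]:
    """Validate the input and apply the documented defaults."""
    order = raw.get("order")
    if order is not None and order not in VALID_ORDERS:
        raise ValueError(f"order must be one of {VALID_ORDERS}, got {order!r}")

    max_results = int(raw.get("maxResults", DEFAULT_MAX_RESULTS))
    if max_results < 0:
        raise ValueError("maxResults must be 0 (unlimited) or a positive integer")
    if max_results == 0 or max_results > SAFETY_CEILING:
        max_results = SAFETY_CEILING  # assumption: "unlimited" still stops at the ceiling

    def upper_or_none(key: str) -> Optional[str]:
        # county and entityType are expected in uppercase
        value = (raw.get(key) or "").strip()
        return value.upper() or None

    return {
        "query": (raw.get("query") or "").strip() or None,
        "county": upper_or_none("county"),
        "entityType": upper_or_none("entityType"),
        "order": order,
        "maxResults": max_results,
        "useProxy": bool(raw.get("useProxy", True)),
    }
```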

Example scenarios

1) Newest LLCs in Brooklyn (Kings County)

{
  "county": "KINGS",
  "entityType": "DOMESTIC LIMITED LIABILITY COMPANY",
  "order": "newest",
  "maxResults": 2000
}

2) Consulting companies across New York State

{
  "query": "consulting",
  "maxResults": 5000
}

3) Business corporations in Manhattan (New York County)

{
  "county": "NEW YORK",
  "entityType": "DOMESTIC BUSINESS CORPORATION",
  "order": "newest",
  "maxResults": 3000
}
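
For readers curious how a scenario like these might translate into a Socrata request, the sketch below builds one page URL. It is an assumption-laden illustration: the mapping from `order` to `$order` columns is inferred from the field descriptions, and `build_url` is a hypothetical helper, not part of the actor.

```python
from typing import Optional
from urllib.parse import urlencode

BASE_URL = "https://data.ny.gov/resource/n9v6-gdp6.json"

# Assumed mapping from the actor's "order" option to a SoQL $order clause.
ORDER_BY = {
    "newest": "initial_dos_filing_date DESC",
    "oldest": "initial_dos_filing_date ASC",
    "name": "current_entity_name ASC",
}


def build_url(
    where: Optional[str] = None,
    query: Optional[str] = None,
    order: str = "newest",
    limit: int = 1000,
    offset: int = 0,
) -> str:
    """Build the URL for one page of results."""
    params = {"$limit": limit, "$offset": offset, "$order": ORDER_BY[order]}
    if query:
        params["$q"] = query        # full-text search
    if where:
        params["$where"] = where    # pre-built SoQL filter expression
    return BASE_URL + "?" + urlencode(params)


# Third page of a county-filtered run, newest filings first.
print(build_url(where="county = 'KINGS'", order="newest", offset=2000))
```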

📦 Example output

{
  "dosId": "1234567",
  "name": "EXAMPLE CONSULTING LLC",
  "entityType": "DOMESTIC LIMITED LIABILITY COMPANY",
  "jurisdiction": "NEW YORK",
  "county": "NEW YORK",
  "filingDate": "2025-01-14T00:00:00.000",
  "agentName": "THE LLC",
  "agentAddress": "123 BROADWAY, SUITE 100",
  "agentCity": "NEW YORK",
  "agentState": "NY",
  "agentZip": "10001",
  "url": "https://data.ny.gov/resource/n9v6-gdp6.json?dos_id=1234567",
  "scrapedAt": "2026-06-08T00:00:00.000Z"
}

Export to CSV, Excel, JSON or XML, push to Google Sheets / your CRM, or pull via the Apify API and webhooks.
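
After export, the `filingDate` string can be parsed to narrow a dataset further. The sketch below keeps rows filed on or after a cutoff; it assumes the timestamp shape shown in the example output, and `filed_since` is a hypothetical helper for downstream use, not something the actor provides.

```python
from datetime import datetime
from typing import Any, Dict, Iterable, List


def filed_since(rows: Iterable[Dict[str, Any]], cutoff: datetime) -> List[Dict[str, Any]]:
    """Return rows whose filingDate is on or after the cutoff."""
    recent = []
    for row in rows:
        raw = row.get("filingDate")
        if not raw:
            continue  # skip rows without a filing date
        # The example output uses "2025-01-14T00:00:00.000" (no time zone suffix).
        if datetime.fromisoformat(raw) >= cutoff:
            recent.append(row)
    return recent
```

For example, `filed_since(rows, datetime(2025, 1, 1))` keeps only entities first filed in 2025 or later.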


🎯 Use cases

  • B2B lead generation — build targeted New York prospect lists by county + entity type with registered-agent contact details.
  • Sales prospecting — surface the newest active companies by filing date and reach the registered agent directly.
  • Market research — count and profile New York companies by entity type, county and filing date.
  • KYC / compliance & due diligence — verify DOS IDs, entity type, jurisdiction and active status.
  • Competitor mapping — survey every company of a given type in a county and benchmark formation activity.
  • CRM enrichment — look companies up by DOS ID and append official registry fields.
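
As a small example of the market-research use case, exported rows can be counted by county and entity type with the standard library. This is a sketch over the output fields listed earlier; `profile_by_segment` is a hypothetical name.

```python
from collections import Counter
from typing import Any, Dict, Iterable, Tuple


def profile_by_segment(rows: Iterable[Dict[str, Any]]) -> "Counter[Tuple[Any, Any]]":
    """Count companies per (county, entityType) pair."""
    return Counter((row.get("county"), row.get("entityType")) for row in rows)
```

Calling `.most_common(10)` on the result lists the ten largest segments in a run.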

❓ FAQ

Is there a free New York company registry API?

Yes. New York State publishes the Active Corporations dataset as a fully open, free Socrata JSON API (data.ny.gov/resource/n9v6-gdp6.json) — no key, no login, no signup. This actor is a friendly, filterable export layer on top of it: set filters in a form, run, and download a clean dataset with pagination handled and records flattened to columns.

How do I export New York company data to Excel?

Every run writes one row per company to a dataset you can export to Excel, CSV, JSON or HTML, or pull via the Apify API. Set your filters (county, entity type, query), run, and use the Export button to download an .xlsx / .csv of the whole New York company segment.

Does it include company emails and phone numbers?

The Active Corporations dataset does not publish company email or phone. It does include the registered agent (DOS process) name and full mailing address, which is the official contact-for-service on record — useful for outreach, mail and compliance. Pair it with an enrichment tool if you need email / phone.

This actor reads only the official open data that New York State publishes via its public Socrata API — no login or circumvention involved. As always, credit the source (data.ny.gov) and follow the State's open-data terms and applicable law for your specific use case.

How many records can I pull?

Socrata paginates with $limit / $offset; this actor pages at 1,000 records and stops at a 100,000-record deep-offset safety ceiling. For larger pulls, split your run by county or entity type and combine the datasets.
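
Combining the datasets from several split runs can be as simple as the sketch below, which merges rows and keeps the first row seen for each `dosId`. It assumes each dataset has already been downloaded as a list of dicts; `merge_runs` is a hypothetical helper.

```python
from typing import Any, Dict, Iterable, List


def merge_runs(*datasets: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Merge rows from several runs, de-duplicated by dosId (first one wins)."""
    merged: Dict[Any, Dict[str, Any]] = {}
    for rows in datasets:
        for row in rows:
            merged.setdefault(row["dosId"], row)
    return list(merged.values())
```

Because Python dicts preserve insertion order, the merged list keeps the order in which rows first appeared across the runs.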

Is this a Socrata / data.ny.gov API alternative for bulk download?

Yes. For non-developers it is a no-code alternative to calling the data.ny.gov API directly: pagination is automatic, records are flattened to columns, and the result exports straight to CSV / Excel / JSON.


📈 Notes on coverage & limits

  • Active corporations only. This dataset is New York's list of active entities; dissolved / inactive entities are out of scope.
  • No email / phone fields. The source publishes the registered agent name + mailing address, not company email or phone — enrich separately if you need those.
  • ~100,000 records per query. The actor caps at a 100k deep-offset safety ceiling. For larger segments, split your run by county or entity type and combine the datasets.