USA New York Company Registry Scraper — B2B Leads
Pricing
from $1.99 / 1,000 results
Scrape New York State's official Active Corporations registry (data.ny.gov). Get DOS ID, entity name, type, jurisdiction, county, filing date and registered agent name & address. Filter by full-text query, county & entity type. No login, no API key.
Developer
Logiover
Maintained by Community
📌 Overview
The New York Company Registry Scraper turns data.ny.gov — New York State's official Active Corporations open-data dataset — into an analysis-ready dataset. It reads directly from the Socrata open-data API (data.ny.gov/resource/n9v6-gdp6.json), the same open data New York publishes, so there is no login, no API key and no captcha.
For every active New York entity you get the DOS ID, current entity name, entity type, jurisdiction, county, initial DOS filing date and the registered agent (DOS process) name and full mailing address — ready-to-use B2B and compliance data.
Perfect for New York B2B lead generation, sales prospecting, market research, CRM enrichment, KYC / compliance and competitor mapping — exported to CSV, Excel, JSON or pushed to Google Sheets and your CRM via API.
✅ Combine a county + entity-type filter and raise `maxResults` to assemble a large, targeted New York prospect list in a single run.
❓ What does the New York company registry scraper do?
It queries the official Active Corporations Socrata API with your filters, flattens each record into clean spreadsheet-ready columns, de-dupes by DOS ID, paginates automatically with $limit / $offset and returns a structured dataset. You set filters in a simple form; you get a clean dataset back — no code, no registry account, no Socrata / SoQL integration work.
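The paging and de-duplication described above can be sketched as follows. This is a minimal illustration, not the actor's actual code: `fetch_page` is a hypothetical callable standing in for one request with `$limit` and `$offset`, and the 1,000-record page size and 100,000-record ceiling are the values quoted in the FAQ.

```python
from typing import Callable, Dict, Iterator, List

Record = Dict[str, str]


def paginate(
    fetch_page: Callable[[int, int], List[Record]],  # hypothetical: (limit, offset) -> records
    page_size: int = 1000,
    max_results: int = 0,      # 0 means "no cap other than the ceiling"
    ceiling: int = 100_000,    # deep-offset safety ceiling
) -> Iterator[Record]:
    """Yield records once each, keyed by dos_id, paging with limit/offset."""
    seen = set()
    yielded = 0
    offset = 0
    while offset < ceiling:
        page = fetch_page(page_size, offset)
        if not page:
            break
        for record in page:
            dos_id = record.get("dos_id")
            if dos_id in seen:
                continue  # duplicate DOS ID, skip it
            seen.add(dos_id)
            yield record
            yielded += 1
            if max_results and yielded >= max_results:
                return
        if len(page) < page_size:
            break  # a short page means the source is exhausted
        offset += page_size


# In-memory stand-in for the API: 2,600 records, 100 of them duplicates.
DATA = [{"dos_id": str(i % 2500)} for i in range(2600)]
unique = list(paginate(lambda limit, offset: DATA[offset:offset + limit]))
```

Passing the fetch function in as a parameter keeps the paging logic testable without network access; a real fetcher would issue the HTTP request and return the parsed JSON list.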
✨ Features
| | Feature | Description |
|---|---|---|
| 🎯 | New York B2B lead data | DOS ID, entity name, type, jurisdiction, county, filing date, plus the registered agent's name and mailing address. |
| 🔎 | Full-text & field filtering | Free-text query (`$q`) plus `county` and `entityType` filters, ANDed into a SoQL `$where`. |
| 🆕 | Find new companies | Sort by initial DOS filing date to surface the newest active New York businesses first. |
| 📦 | Bulk export | Up to ~100,000 records per query, auto-paginated; export to JSON, CSV, Excel, HTML or pull via API & webhooks. |
| 🔓 | No API key needed | Reads the official open NY State data.ny.gov API: no login, no key, no signup, no captcha. |
| 🧹 | Clean, flat output | Socrata records flattened into tidy, spreadsheet-ready columns, de-duped by DOS ID. |
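As a rough sketch of how the full-text and field filters in the table above could translate into Socrata query parameters: the function below ANDs the field filters into a `$where` clause and passes the free-text query as `$q`. The column names `county` and `entity_type` and the helper names are assumptions for illustration; the actor's real query builder is not published.

```python
from typing import Dict, Optional


def soql_literal(value: str) -> str:
    """Quote a string for SoQL, doubling any embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def build_params(
    query: Optional[str] = None,
    county: Optional[str] = None,
    entity_type: Optional[str] = None,
) -> Dict[str, str]:
    """Build Socrata query parameters from the three optional filters."""
    params: Dict[str, str] = {}
    if query:
        params["$q"] = query  # full-text search
    clauses = []
    if county:
        # assumed column name: county (values are uppercase in the dataset)
        clauses.append(f"county = {soql_literal(county.upper())}")
    if entity_type:
        # assumed column name: entity_type
        clauses.append(f"entity_type = {soql_literal(entity_type.upper())}")
    if clauses:
        params["$where"] = " AND ".join(clauses)
    return params


params = build_params(county="kings", entity_type="Domestic Limited Liability Company")
```

Leaving every filter empty yields an empty parameter set, which matches the "stream the whole registry" behaviour the listing describes.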
📊 What data you get
One clean row per company. Verified output fields:
| Field | Description |
|---|---|
| `dosId` | NY Department of State entity ID (`dos_id`) |
| `name` | Current entity name (`current_entity_name`) |
| `entityType` | Entity type, e.g. DOMESTIC BUSINESS CORPORATION |
| `jurisdiction` | Jurisdiction of formation |
| `county` | New York county |
| `filingDate` | Initial DOS filing date (`initial_dos_filing_date`) |
| `agentName` | Registered agent / DOS process name (`dos_process_name`) |
| `agentAddress` | Registered agent mailing street address |
| `agentCity` / `agentState` / `agentZip` | Registered agent city, state and postal code |
| `url` | Direct data.ny.gov API URL for the company |
| `scrapedAt` | ISO timestamp of the run |
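The mapping from a raw Socrata record to the flat row above might look like the sketch below. The source names `dos_id`, `current_entity_name`, `initial_dos_filing_date` and `dos_process_name` come from the table; the remaining source column names are assumptions and should be checked against the dataset before use.

```python
from datetime import datetime, timezone
from typing import Dict, Optional

BASE_URL = "https://data.ny.gov/resource/n9v6-gdp6.json"

# output field -> source column
FIELD_MAP = {
    "dosId": "dos_id",
    "name": "current_entity_name",
    "entityType": "entity_type",              # assumed column name
    "jurisdiction": "jurisdiction",           # assumed column name
    "county": "county",                       # assumed column name
    "filingDate": "initial_dos_filing_date",
    "agentName": "dos_process_name",
    "agentAddress": "dos_process_address_1",  # assumed column name
    "agentCity": "dos_process_city",          # assumed column name
    "agentState": "dos_process_state",        # assumed column name
    "agentZip": "dos_process_zip",            # assumed column name
}


def flatten(raw: Dict[str, str], scraped_at: Optional[str] = None) -> Dict[str, Optional[str]]:
    """Map one raw record to the flat output row; missing fields become None."""
    row: Dict[str, Optional[str]] = {out: raw.get(src) for out, src in FIELD_MAP.items()}
    row["url"] = f"{BASE_URL}?dos_id={row['dosId']}"
    row["scrapedAt"] = scraped_at or datetime.now(timezone.utc).isoformat()
    return row


row = flatten(
    {"dos_id": "1234567", "current_entity_name": "EXAMPLE CONSULTING LLC"},
    scraped_at="2026-06-08T00:00:00.000Z",
)
```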
🚀 How to use
- Open the actor and set your filters: a full-text `query`, a `county`, and/or an `entityType`. Leave them empty to stream the whole registry.
- Pick an `order` (newest / oldest / name) and set `maxResults`.
- Run, then export the dataset to CSV, Excel, JSON or Google Sheets.
Input fields
| Field | Type | Description |
|---|---|---|
| `query` | string | Full-text search (`$q`); mostly matches the company name, e.g. consulting |
| `county` | string | NY county (uppercase), e.g. NEW YORK, KINGS, QUEENS, ERIE |
| `entityType` | string | Entity type (uppercase), e.g. DOMESTIC BUSINESS CORPORATION |
| `order` | select | newest / oldest (by filing date) / name (A→Z) |
| `maxResults` | integer | Cap (default 1000, 0 = unlimited; safety ceiling ~100k records) |
| `useProxy` | boolean | Route through Apify Proxy (datacenter). Default true |
| `proxyConfiguration` | object | Proxy override |
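To make the input semantics concrete, here is a hedged sketch of how the fields above could be normalised before a run. The `$order` clauses use the column names quoted in the output table; the default `order` of `newest`, the function name and the returned keys are assumptions, not documented behaviour.

```python
from typing import Any, Dict

CEILING = 100_000  # safety ceiling quoted in the table (~100k records)

ORDER_CLAUSES = {
    "newest": "initial_dos_filing_date DESC",
    "oldest": "initial_dos_filing_date ASC",
    "name": "current_entity_name ASC",
}


def normalise_input(raw: Dict[str, Any]) -> Dict[str, Any]:
    """Validate the actor input and resolve defaults and caps."""
    order = raw.get("order") or "newest"  # assumption: newest is the default
    if order not in ORDER_CLAUSES:
        raise ValueError(f"order must be one of {sorted(ORDER_CLAUSES)}")
    max_results = int(raw.get("maxResults", 1000))  # documented default: 1000
    if max_results < 0:
        raise ValueError("maxResults must be 0 or a positive integer")
    # 0 means unlimited, which in practice still stops at the ceiling
    limit = CEILING if max_results == 0 else min(max_results, CEILING)
    return {
        "query": (raw.get("query") or "").strip(),
        "county": (raw.get("county") or "").strip().upper(),
        "entityType": (raw.get("entityType") or "").strip().upper(),
        "orderClause": ORDER_CLAUSES[order],
        "limit": limit,
    }


settings = normalise_input({"county": " kings ", "order": "name", "maxResults": 0})
```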
Example scenarios
1) Newest LLCs in Brooklyn (Kings County)
{"county": "KINGS", "entityType": "DOMESTIC LIMITED LIABILITY COMPANY", "order": "newest", "maxResults": 2000}
2) Consulting companies across New York State
{"query": "consulting", "maxResults": 5000}
3) Business corporations in Manhattan (New York County)
{"county": "NEW YORK", "entityType": "DOMESTIC BUSINESS CORPORATION", "order": "newest", "maxResults": 3000}
📦 Example output
{"dosId": "1234567", "name": "EXAMPLE CONSULTING LLC", "entityType": "DOMESTIC LIMITED LIABILITY COMPANY", "jurisdiction": "NEW YORK", "county": "NEW YORK", "filingDate": "2025-01-14T00:00:00.000", "agentName": "THE LLC", "agentAddress": "123 BROADWAY, SUITE 100", "agentCity": "NEW YORK", "agentState": "NY", "agentZip": "10001", "url": "https://data.ny.gov/resource/n9v6-gdp6.json?dos_id=1234567", "scrapedAt": "2026-06-08T00:00:00.000Z"}
Export to CSV, Excel, JSON or XML, push to Google Sheets / your CRM, or pull via the Apify API and webhooks.
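If you download the dataset as JSON and want to produce the CSV yourself, a sketch using only the Python standard library is shown below. The column order follows the output table; `rows_to_csv` is a hypothetical helper, not part of the actor.

```python
import csv
import io
from typing import Dict, Iterable

COLUMNS = [
    "dosId", "name", "entityType", "jurisdiction", "county", "filingDate",
    "agentName", "agentAddress", "agentCity", "agentState", "agentZip",
    "url", "scrapedAt",
]


def rows_to_csv(rows: Iterable[Dict[str, str]]) -> str:
    """Serialise output rows to CSV text with a fixed column order."""
    buffer = io.StringIO()
    # extrasaction="ignore" drops unknown keys; missing keys become empty cells
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


text = rows_to_csv([
    {"dosId": "1234567", "name": "EXAMPLE CONSULTING LLC",
     "agentAddress": "123 BROADWAY, SUITE 100"},
])
```

The `csv` module quotes values containing commas, so addresses such as `123 BROADWAY, SUITE 100` stay in a single cell.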
🎯 Use cases
- B2B lead generation — build targeted New York prospect lists by county + entity type with registered-agent contact details.
- Sales prospecting — surface the newest active companies by filing date and reach the registered agent directly.
- Market research — count and profile New York companies by entity type, county and filing date.
- KYC / compliance & due diligence — verify DOS IDs, entity type, jurisdiction and active status.
- Competitor mapping — survey every company of a given type in a county and benchmark formation activity.
- CRM enrichment — look companies up by DOS ID and append official registry fields.
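For the CRM enrichment case above, the lookup by DOS ID amounts to a simple keyed join. The sketch below assumes each CRM record already carries a `dosId` key; that record shape, the `registry_` prefix and the `enrich` helper are all hypothetical.

```python
from typing import Dict, Iterable, List, Sequence

DEFAULT_FIELDS = ("name", "entityType", "county", "filingDate")


def enrich(
    crm_records: Iterable[Dict[str, object]],
    registry_rows: Iterable[Dict[str, object]],
    fields: Sequence[str] = DEFAULT_FIELDS,
) -> List[Dict[str, object]]:
    """Return copies of the CRM records with registry fields appended."""
    by_id = {str(row["dosId"]): row for row in registry_rows}
    enriched = []
    for record in crm_records:
        out = dict(record)  # copy so the caller's records are not mutated
        match = by_id.get(str(record.get("dosId", "")))
        out["registry_matched"] = match is not None
        if match is not None:
            for field in fields:
                out[f"registry_{field}"] = match.get(field)
        enriched.append(out)
    return enriched


crm = [{"company": "Acme", "dosId": "1"}, {"company": "Zed", "dosId": "9"}]
registry = [{"dosId": "1", "name": "ACME INC", "county": "KINGS"}]
result = enrich(crm, registry)
```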
❓ FAQ
Is there a free New York company registry API?
Yes. New York State publishes the Active Corporations dataset as a fully open, free Socrata JSON API (data.ny.gov/resource/n9v6-gdp6.json) — no key, no login, no signup. This actor is a friendly, filterable export layer on top of it: set filters in a form, run, and download a clean dataset with pagination handled and records flattened to columns.
How do I export New York company data to Excel?
Every run writes one row per company to a dataset you can export to Excel, CSV, JSON or HTML, or pull via the Apify API. Set your filters (county, entity type, query), run, and use the Export button to download an .xlsx / .csv of the whole New York company segment.
Does it include company emails and phone numbers?
The Active Corporations dataset does not publish company email or phone. It does include the registered agent (DOS process) name and full mailing address, which is the official contact-for-service on record — useful for outreach, mail and compliance. Pair it with an enrichment tool if you need email / phone.
Is scraping data.ny.gov legal?
This actor reads only the official open data that New York State publishes via its public Socrata API — no login or circumvention involved. As always, credit the source (data.ny.gov) and follow the State's open-data terms and applicable law for your specific use case.
How many records can I pull?
Socrata paginates with $limit / $offset; this actor pages at 1,000 records and stops at a 100,000-record deep-offset safety ceiling. For larger pulls, split your run by county or entity type and combine the datasets.
Is this a Socrata / data.ny.gov API alternative for bulk download?
Yes. For non-developers it is a no-code alternative to calling the data.ny.gov API directly: automatic pagination, records flattened to columns, and direct export to CSV / Excel / JSON.
📈 Notes on coverage & limits
- Active corporations only. This dataset is New York's list of active entities; dissolved / inactive entities are out of scope.
- No email / phone fields. The source publishes the registered agent name + mailing address, not company email or phone — enrich separately if you need those.
- ~100,000 records per query. The actor caps at a 100k deep-offset safety ceiling. For larger segments, split your run by county or entity type and combine the datasets.
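The split-and-combine workaround mentioned above can be sketched as follows: generate one input per county, run each separately, then merge the resulting datasets while de-duplicating by `dosId`. Both helpers are hypothetical, and the input keys mirror the input table.

```python
from typing import Dict, Iterable, List


def split_inputs(counties: Iterable[str], entity_type: str = "") -> List[Dict[str, object]]:
    """Build one actor input per county so each run stays under the ceiling."""
    inputs = []
    for county in counties:
        run_input: Dict[str, object] = {"county": county.upper(), "maxResults": 0}
        if entity_type:
            run_input["entityType"] = entity_type.upper()
        inputs.append(run_input)
    return inputs


def merge_runs(*runs: Iterable[Dict[str, object]]) -> List[Dict[str, object]]:
    """Combine several result sets, keeping the first row seen per dosId."""
    merged: Dict[object, Dict[str, object]] = {}
    for run in runs:
        for row in run:
            merged.setdefault(row["dosId"], row)
    return list(merged.values())


inputs = split_inputs(["Kings", "Queens"], "Domestic Business Corporation")
combined = merge_runs(
    [{"dosId": "1"}, {"dosId": "2"}],
    [{"dosId": "2"}, {"dosId": "3"}],
)
```

If a single county and entity type still exceeds the ceiling, a finer split would be needed; this listing does not describe one.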