NY Business Registry Scraper avatar

NY Business Registry Scraper

Pricing

Pay per event

Go to Apify Store
NY Business Registry Scraper

NY Business Registry Scraper

Find and export official New York business registry records by entity name, DOS ID, assumed name, or assumed name ID. Get clear, structured results with status, filing date, county, and jurisdiction to support due diligence, compliance checks, partner screening, and market research.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

5

Total users

2

Monthly active users

16 days ago

Last modified

Share

ParseForge Banner

๐Ÿ›๏ธ NY Business Registry Scraper

๐Ÿš€ Pull New York business entity records in minutes. Entity names, DOS IDs, status, filing date, county, jurisdiction. Search by name, DOS ID, or assumed name. No login.

๐Ÿ•’ Last updated: 2026-05-09 ยท ๐Ÿ“Š 15+ fields per entity ยท ๐Ÿ” Name + DOS + assumed-name search ยท ๐Ÿšซ No auth required

Pull structured records from NY Business Registry โ€” clean fields ready as CSV, JSON, JSONL, Excel, or XML for downstream pipelines.

Copy to your AI assistant

Copy this block into ChatGPT, Claude, Cursor, or any LLM to start using this actor.

parseforge/apps-scraper on Apify. Call: ApifyClient("TOKEN").actor("parseforge/apps-scraper").call(run_input={...}), then client.dataset(run["defaultDatasetId"]).list_items().items for results. Key inputs: maxItems (integer, default 10), searchValue (string, default "APPLE"), searchByTypeIndicator (string, default "EntityName"), searchExpressionIndicator (string, default "Contains"), entityStatusIndicator (string, default "AllStatuses"), entityTypeIndicator (array, default ["Corporation", "LimitedLiabilityCompany", "LimitedPartnersh). Full actor spec: fetch build via GET https://api.apify.com/v2/acts/parseforge~apps-scraper (Bearer TOKEN). Get token: https://console.apify.com/account/integrations

Pull live business entity records from the New York Department of State (DOS) public registry. The actor accepts a search value (entity name, DOS ID, or assumed name) plus filters (status, county), walks the result pages, and returns one structured record per entity ready for compliance work, due-diligence, partner screening, or market research.

Every run fetches data live so you get the current state of the NY DOS registry at run time. Records include the entity name, DOS ID, entity type, status (active/inactive/suspended), filing date, county, jurisdiction, registered agent, and the canonical record URL.

๐Ÿ‘ฅ Built for๐ŸŽฏ Primary use cases
Compliance teamsValidate NY business entity status
Legal and KYCRun partner-screening before agreements
Sales teamsSource NY-registered businesses by industry
Lenders and financeValidate counterparty registration
Research and investigationTrace corporate structures
Lead-gen and CRMBuild prospect lists from NY registrations

๐Ÿ“‹ What the NY Business Registry Scraper does

  • ๐Ÿ” Four search modes. By Entity Name, DOS ID, Assumed Name, or Assumed Name ID.
  • ๐ŸŽฏ Match modes. Contains, Begins With, or Base Word.
  • ๐Ÿ“Š Status filter. All, Active, Inactive, Suspended.
  • ๐Ÿ†” DOS ID. Unique New York Department of State identifier.
  • ๐Ÿ“… Filing date. Initial registration date.
  • ๐Ÿ“ County and jurisdiction. NY county and state of incorporation.

The scraper walks the NY DOS public search, applies your filters, and pushes structured records to the dataset.

๐Ÿ’ก Why it matters: the NY DOS registry is the canonical source of NY corporate filings but its UI is slow and lacks bulk export. A live, structured pull beats manual lookup for compliance, KYC, and B2B sales.


๐ŸŽฌ Full Demo

๐Ÿšง Coming soon: a 3-minute walkthrough showing setup, a live run, and how to pipe results into Salesforce or HubSpot via Apify integrations.


โš™๏ธ Input

FieldTypeNameDescription
searchValuestringSearch ValueName, DOS ID, or assumed name to search.
searchByTypeIndicatorenumSearch ByEntityName, DosID, AssumedName, AssumedNameID.
searchExpressionIndicatorenumMatch ModeContains, BeginsWith, BaseWord.
entityStatusIndicatorenumEntity StatusAllStatuses, Active, Inactive, Suspended.
maxItemsintegerMax ItemsFree users: limited to 10 items (preview). Paid users: optional, max 1,000,000.

Example 1. Active entities matching "Apple".

{
"searchValue": "APPLE",
"searchByTypeIndicator": "EntityName",
"searchExpressionIndicator": "Contains",
"entityStatusIndicator": "Active",
"maxItems": 50
}

Example 2. Direct DOS ID lookup.

{
"searchValue": "12345678",
"searchByTypeIndicator": "DosID",
"maxItems": 1
}

โš ๏ธ Good to Know: match modes affect result count significantly. Contains returns the most matches, BeginsWith is more targeted, BaseWord matches root words.


๐Ÿ“Š Output

The dataset returns one structured record per entity. Each record carries identifiers, name, DOS ID, status, filing date, county, jurisdiction, registered agent, and a back-reference URL. Consume the dataset as JSON, CSV, Excel, XML, or RSS via the Apify console or API.

๐Ÿงพ Schema

FieldTypeExample
๐Ÿ†” dosIdstring12345678
๐Ÿข entityNamestringAPPLE INC.
๐Ÿท๏ธ entityTypestringForeign Business Corporation
๐Ÿ“Š statusstringActive
๐Ÿ“… filingDateISO date1980-04-09
๐Ÿ“ countystringNew York
๐ŸŒŽ jurisdictionstringCalifornia
๐Ÿ‘ค registeredAgentstring or nullCT Corporation System
๐Ÿ  agentAddressstring or null28 Liberty St, New York, NY 10005
๐Ÿท๏ธ dosCitystringNew York
๐Ÿ“œ assumedNamesarray[]
๐Ÿ”— entityUrlstring (url)https://apps.dos.ny.gov/.../entity/12345678
๐Ÿ“… scrapedAtISO datetime2026-05-09T12:00:00.000Z

๐Ÿ“ฆ Sample records

1. Public foreign corporation

{
"dosId": "12345678",
"entityName": "APPLE INC.",
"entityType": "Foreign Business Corporation",
"status": "Active",
"filingDate": "1980-04-09",
"county": "New York",
"jurisdiction": "California",
"registeredAgent": "CT Corporation System",
"agentAddress": "28 Liberty St, New York, NY 10005",
"dosCity": "New York",
"assumedNames": [],
"entityUrl": "https://apps.dos.ny.gov/publicInquiry/entity/12345678",
"scrapedAt": "2026-05-09T12:00:00.000Z"
}

2. Domestic LLC

{
"dosId": "23456789",
"entityName": "ACME VENTURES LLC",
"entityType": "Domestic Limited Liability Company",
"status": "Active",
"filingDate": "2020-06-15",
"county": "Kings",
"jurisdiction": "New York",
"registeredAgent": "John Smith",
"agentAddress": "123 Main St, Brooklyn, NY 11201",
"dosCity": "Brooklyn",
"entityUrl": "https://apps.dos.ny.gov/publicInquiry/entity/23456789",
"scrapedAt": "2026-05-09T12:00:00.000Z"
}

3. Sparse record (inactive entity)

{
"dosId": "99887766",
"entityName": "OLD COMPANY LTD",
"entityType": "Domestic Business Corporation",
"status": "Inactive",
"filingDate": "1995-03-22",
"county": "Bronx",
"jurisdiction": "New York",
"entityUrl": "https://apps.dos.ny.gov/publicInquiry/entity/99887766",
"scrapedAt": "2026-05-09T12:00:00.000Z"
}

โœจ Why choose this Actor

Capability
๐ŸŽฏBuilt for the job. Scoped specifically to NY DOS registry so you skip the parser engineering entirely.
๐Ÿ”–Structured output. Clean, typed fields ready for analysis, dashboards, or downstream pipelines.
โšกFast. Optimized request patterns return results in seconds, not minutes.
๐Ÿ”Always fresh. Every run pulls live data, so the dataset reflects the registry as of run time.
๐ŸŒNo infra to manage. Apify handles proxies, retries, scaling, scheduling, and storage.
๐Ÿ›ก๏ธReliable. Battle-tested across many runs and edge cases, with graceful error handling.
๐ŸšซNo code required. Configure in the UI, run from CLI, schedule via cron, or call from any language with the Apify SDK.

๐Ÿ“Š Production-grade structured corporate-registry data without the engineering overhead of building and maintaining your own scraper.


๐Ÿ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
โญ NY Business Registry Scraper (this Actor)$5 free credit, then pay-per-useFull NY DOS registryLive per runName, DOS ID, statusโšก 2 min
Build your own scraperEngineering hoursFull once builtWhenever you maintain itCustom code๐Ÿข Days to weeks
Paid corporate-registry platforms$$$ monthlyVendor-definedPeriodicVendor-definedโณ Hours
Manual searchesHours per checkOne at a timeStaleManual๐Ÿ•’ Variable

Pick this Actor when you want broad coverage, source-native filtering, and no pipeline maintenance.


๐Ÿš€ How to use

  1. ๐Ÿ“ Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. ๐ŸŒ Open the Actor. Go to the NY Business Registry Scraper page on the Apify Store.
  3. ๐ŸŽฏ Set search. Enter a search value, pick search type and match mode, then set maxItems.
  4. ๐Ÿš€ Run it. Click Start and let the Actor collect your data.
  5. ๐Ÿ“ฅ Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

โฑ๏ธ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.


๐Ÿ’ผ Business use cases

๐Ÿ“Š Compliance and KYC

  • Validate NY business entity status
  • Run partner-screening before agreements
  • Build watchlists for monitoring
  • Power vendor-onboarding workflows
  • Trace corporate structures across NY
  • Validate counterparty registration
  • Build entity-relationship maps
  • Power M&A due diligence

๐ŸŽฏ Sales and lead-gen

  • Source NY-registered businesses
  • Build prospect lists by entity type
  • Filter by filing date for new businesses
  • Power CRM enrichment with NY data

๐Ÿ› ๏ธ Engineering and product

  • Power corporate-registry products
  • Replace fragile in-house DOS scrapers
  • Wire datasets into your apps via the Apify API or webhooks
  • Skip the proxy, retry, and parsing maintenance entirely

๐ŸŒŸ Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

๐ŸŽ“ Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

๐ŸŽจ Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

๐Ÿค Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

๐Ÿงช Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

๐Ÿ”Œ Automating NY Business Registry Scraper

This Actor exposes a REST endpoint, so you can drive it from any language or workflow tool.

Schedules. Use Apify Scheduler to capture daily snapshots of new registrations. Combine with the Apify dataset diff tools to track new filings.


โ“ Frequently Asked Questions

๐Ÿ”Œ Integrate with any app

NY Business Registry Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications in your channels
  • Airbyte - Pipe results into your warehouse
  • GitHub - Trigger runs from commits and releases
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes.


๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.


๐Ÿ†˜ Need Help? Open our contact form to request a new scraper, propose a custom project, or report an issue.


โš ๏ธ Disclaimer. This Actor is an independent tool. The New York Department of State public registry is a public US state government data source. The Actor accesses only publicly available records and is intended for legitimate compliance, due-diligence, and research use. Users are responsible for compliance with the source site's Terms of Service and applicable law.