NY Business Registry Scraper

Pricing

Pay per event

Find and export official New York business registry records by entity name, DOS ID, assumed name, or assumed name ID. Get clear, structured results with status, filing date, county, and jurisdiction to support due diligence, compliance checks, partner screening, and market research.

Developer

ParseForge

Maintained by Community

9 days ago

Last modified

NY Business Registry Scraper

Pull New York business entity records in minutes: entity names, DOS IDs, status, filing date, county, and jurisdiction. Search by entity name, DOS ID, assumed name, or assumed name ID. No login required.

Last updated: 2026-05-09 · 15+ fields per entity · Name + DOS + assumed-name search · No auth required

Pull live business entity records from the New York Department of State (DOS) public registry. The actor accepts a search value (entity name, DOS ID, assumed name, or assumed name ID), a match mode, and a status filter. It walks the result pages and returns one structured record per entity, ready for compliance work, due diligence, partner screening, or market research.

Every run fetches data live so you get the current state of the NY DOS registry at run time. Records include the entity name, DOS ID, entity type, status (active/inactive/suspended), filing date, county, jurisdiction, registered agent, and the canonical record URL.

| Built for | Primary use cases |
| --- | --- |
| Compliance teams | Validate NY business entity status |
| Legal and KYC | Run partner screening before agreements |
| Sales teams | Source NY-registered businesses by industry |
| Lenders and finance | Validate counterparty registration |
| Research and investigation | Trace corporate structures |
| Lead-gen and CRM | Build prospect lists from NY registrations |

What the NY Business Registry Scraper does

  • Four search modes. By Entity Name, DOS ID, Assumed Name, or Assumed Name ID.
  • Match modes. Contains, Begins With, or Base Word.
  • Status filter. All, Active, Inactive, Suspended.
  • DOS ID. Unique New York Department of State identifier.
  • Filing date. Initial registration date.
  • County and jurisdiction. NY county and state of incorporation.

The scraper walks the NY DOS public search, applies your filters, and pushes structured records to the dataset.

Why it matters: the NY DOS registry is the canonical source of NY corporate filings, but its UI is slow and lacks bulk export. A live, structured pull beats manual lookup for compliance, KYC, and B2B sales.


Full Demo

Coming soon: a 3-minute walkthrough showing setup, a live run, and how to pipe results into Salesforce or HubSpot via Apify integrations.


Input

| Field | Type | Name | Description |
| --- | --- | --- | --- |
| searchValue | string | Search Value | Name, DOS ID, or assumed name to search. |
| searchByTypeIndicator | enum | Search By | EntityName, DosID, AssumedName, AssumedNameID. |
| searchExpressionIndicator | enum | Match Mode | Contains, BeginsWith, BaseWord. |
| entityStatusIndicator | enum | Entity Status | AllStatuses, Active, Inactive, Suspended. |
| maxItems | integer | Max Items | Free users: limited to 10 items (preview). Paid users: optional, max 1,000,000. |

Example 1. Active entities matching "Apple".

{
    "searchValue": "APPLE",
    "searchByTypeIndicator": "EntityName",
    "searchExpressionIndicator": "Contains",
    "entityStatusIndicator": "Active",
    "maxItems": 50
}

Example 2. Direct DOS ID lookup.

{
    "searchValue": "12345678",
    "searchByTypeIndicator": "DosID",
    "maxItems": 1
}

Good to know: match modes significantly affect the result count. Contains returns the most matches, BeginsWith is more targeted, and BaseWord matches root words.
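To catch typos in the enum fields before spending a run, the input can be assembled and checked locally. The following is a minimal sketch, not part of the Actor: the helper name `build_input` is hypothetical, the allowed values are copied from the input table above, and it assumes (as Example 2 suggests) that the match mode, status, and `maxItems` fields may be omitted.

```python
from typing import Optional

# Allowed values copied from the input table above.
SEARCH_BY = {"EntityName", "DosID", "AssumedName", "AssumedNameID"}
MATCH_MODES = {"Contains", "BeginsWith", "BaseWord"}
STATUSES = {"AllStatuses", "Active", "Inactive", "Suspended"}
MAX_ITEMS_LIMIT = 1_000_000  # documented upper bound for paid users


def build_input(
    search_value: str,
    search_by: str = "EntityName",
    match_mode: Optional[str] = None,
    status: Optional[str] = None,
    max_items: Optional[int] = None,
) -> dict:
    """Build an input object; raise ValueError for values outside the table.

    Hypothetical helper: optional fields are left out of the payload when
    not supplied, on the assumption that the Actor applies its own defaults.
    """
    if not search_value.strip():
        raise ValueError("searchValue must not be empty")
    if search_by not in SEARCH_BY:
        raise ValueError(f"unknown searchByTypeIndicator: {search_by!r}")

    payload = {"searchValue": search_value, "searchByTypeIndicator": search_by}

    if match_mode is not None:
        if match_mode not in MATCH_MODES:
            raise ValueError(f"unknown searchExpressionIndicator: {match_mode!r}")
        payload["searchExpressionIndicator"] = match_mode

    if status is not None:
        if status not in STATUSES:
            raise ValueError(f"unknown entityStatusIndicator: {status!r}")
        payload["entityStatusIndicator"] = status

    if max_items is not None:
        if not 1 <= max_items <= MAX_ITEMS_LIMIT:
            raise ValueError("maxItems must be between 1 and 1,000,000")
        payload["maxItems"] = max_items

    return payload
```

Calling `build_input("APPLE", "EntityName", "Contains", "Active", 50)` reproduces Example 1, and `build_input("12345678", "DosID", max_items=1)` reproduces Example 2.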


Output

The dataset returns one structured record per entity. Each record carries identifiers, name, DOS ID, status, filing date, county, jurisdiction, registered agent, and a back-reference URL. Consume the dataset as JSON, CSV, Excel, XML, or RSS via the Apify console or API.

Schema

| Field | Type | Example |
| --- | --- | --- |
| dosId | string | 12345678 |
| entityName | string | APPLE INC. |
| entityType | string | Foreign Business Corporation |
| status | string | Active |
| filingDate | ISO date | 1980-04-09 |
| county | string | New York |
| jurisdiction | string | California |
| registeredAgent | string or null | CT Corporation System |
| agentAddress | string or null | 28 Liberty St, New York, NY 10005 |
| dosCity | string | New York |
| assumedNames | array | [] |
| entityUrl | string (url) | https://apps.dos.ny.gov/.../entity/12345678 |
| scrapedAt | ISO datetime | 2026-05-09T12:00:00.000Z |

Sample records

1. Public foreign corporation

{
    "dosId": "12345678",
    "entityName": "APPLE INC.",
    "entityType": "Foreign Business Corporation",
    "status": "Active",
    "filingDate": "1980-04-09",
    "county": "New York",
    "jurisdiction": "California",
    "registeredAgent": "CT Corporation System",
    "agentAddress": "28 Liberty St, New York, NY 10005",
    "dosCity": "New York",
    "assumedNames": [],
    "entityUrl": "https://apps.dos.ny.gov/publicInquiry/entity/12345678",
    "scrapedAt": "2026-05-09T12:00:00.000Z"
}

2. Domestic LLC

{
    "dosId": "23456789",
    "entityName": "ACME VENTURES LLC",
    "entityType": "Domestic Limited Liability Company",
    "status": "Active",
    "filingDate": "2020-06-15",
    "county": "Kings",
    "jurisdiction": "New York",
    "registeredAgent": "John Smith",
    "agentAddress": "123 Main St, Brooklyn, NY 11201",
    "dosCity": "Brooklyn",
    "entityUrl": "https://apps.dos.ny.gov/publicInquiry/entity/23456789",
    "scrapedAt": "2026-05-09T12:00:00.000Z"
}

3. Sparse record (inactive entity)

{
    "dosId": "99887766",
    "entityName": "OLD COMPANY LTD",
    "entityType": "Domestic Business Corporation",
    "status": "Inactive",
    "filingDate": "1995-03-22",
    "county": "Bronx",
    "jurisdiction": "New York",
    "entityUrl": "https://apps.dos.ny.gov/publicInquiry/entity/99887766",
    "scrapedAt": "2026-05-09T12:00:00.000Z"
}
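As the samples show, sparse records may omit keys such as registeredAgent, agentAddress, dosCity, and assumedNames entirely, so downstream code should not index them directly. Below is a minimal sketch of a tolerant loader. The `EntityRecord` class and `parse_record` function are hypothetical names, the choice of which fields are required is an assumption based on the three samples, and the element shape of assumedNames is not documented here, so it is kept as an untyped list.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Optional


@dataclass
class EntityRecord:
    # Fields present in all three sample records (assumed always present).
    dos_id: str
    entity_name: str
    entity_type: str
    status: str
    filing_date: date
    # Fields that are null or missing in sparse records.
    county: Optional[str] = None
    jurisdiction: Optional[str] = None
    registered_agent: Optional[str] = None
    agent_address: Optional[str] = None
    dos_city: Optional[str] = None
    assumed_names: list = field(default_factory=list)
    entity_url: Optional[str] = None
    scraped_at: Optional[str] = None  # kept as the raw ISO string


def parse_record(raw: dict) -> EntityRecord:
    """Convert one dataset item into an EntityRecord.

    Missing keys and explicit nulls are treated the same way.
    """
    return EntityRecord(
        dos_id=raw["dosId"],
        entity_name=raw["entityName"],
        entity_type=raw["entityType"],
        status=raw["status"],
        filing_date=date.fromisoformat(raw["filingDate"]),
        county=raw.get("county"),
        jurisdiction=raw.get("jurisdiction"),
        registered_agent=raw.get("registeredAgent"),
        agent_address=raw.get("agentAddress"),
        dos_city=raw.get("dosCity"),
        assumed_names=list(raw.get("assumedNames") or []),
        entity_url=raw.get("entityUrl"),
        scraped_at=raw.get("scrapedAt"),
    )
```

Parsing the third sample above yields a record whose `registered_agent` is `None` and whose `assumed_names` is an empty list, rather than raising a `KeyError`.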

Why choose this Actor

  • Built for the job. Scoped specifically to the NY DOS registry, so you skip the parser engineering entirely.
  • Structured output. Clean, typed fields ready for analysis, dashboards, or downstream pipelines.
  • Fast. Optimized request patterns return results in seconds, not minutes.
  • Always fresh. Every run pulls live data, so the dataset reflects the registry as of run time.
  • No infra to manage. Apify handles proxies, retries, scaling, scheduling, and storage.
  • Reliable. Battle-tested across many runs and edge cases, with graceful error handling.
  • No code required. Configure in the UI, run from CLI, schedule via cron, or call from any language with the Apify SDK.

Production-grade structured corporate-registry data without the engineering overhead of building and maintaining your own scraper.


How it compares to alternatives

| Approach | Cost | Coverage | Refresh | Filters | Setup |
| --- | --- | --- | --- | --- | --- |
| NY Business Registry Scraper (this Actor) | $5 free credit, then pay-per-use | Full NY DOS registry | Live per run | Name, DOS ID, status | 2 min |
| Build your own scraper | Engineering hours | Full once built | Whenever you maintain it | Custom code | Days to weeks |
| Paid corporate-registry platforms | $$$ monthly | Vendor-defined | Periodic | Vendor-defined | Hours |
| Manual searches | Hours per check | One at a time | Stale | Manual | Variable |

Pick this Actor when you want broad coverage, source-native filtering, and no pipeline maintenance.


How to use

  1. Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. Open the Actor. Go to the NY Business Registry Scraper page on the Apify Store.
  3. Set search. Enter a search value, pick search type and match mode, then set maxItems.
  4. Run it. Click Start and let the Actor collect your data.
  5. Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

Total time from signup to downloaded dataset: 3-5 minutes. No coding required.


Business use cases

Compliance and KYC

  • Validate NY business entity status
  • Run partner-screening before agreements
  • Build watchlists for monitoring
  • Power vendor-onboarding workflows
  • Trace corporate structures across NY
  • Validate counterparty registration
  • Build entity-relationship maps
  • Power M&A due diligence

Sales and lead-gen

  • Source NY-registered businesses
  • Build prospect lists by entity type
  • Filter by filing date for new businesses
  • Power CRM enrichment with NY data

Engineering and product

  • Power corporate-registry products
  • Replace fragile in-house DOS scrapers
  • Wire datasets into your apps via the Apify API or webhooks
  • Skip the proxy, retry, and parsing maintenance entirely

Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

Automating NY Business Registry Scraper

This Actor exposes a REST endpoint, so you can drive it from any language or workflow tool.
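As a rough illustration of driving the Actor over REST, the sketch below builds (but does not send) a request to the Apify API's synchronous "run and return dataset items" endpoint using only the Python standard library. The Actor ID `parseforge~ny-business-registry-scraper` is an assumption made for this example; copy the real ID from the Actor's API tab. The function name `build_run_request` is likewise hypothetical.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"
# Assumed Actor ID for illustration only; check the Actor's API tab.
ACTOR_ID = "parseforge~ny-business-registry-scraper"


def build_run_request(
    token: str, actor_input: dict, actor_id: str = ACTOR_ID
) -> urllib.request.Request:
    """Build a POST request that runs the Actor and returns its dataset items.

    The request is only constructed here; nothing is sent until it is
    passed to urllib.request.urlopen().
    """
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


req = build_run_request(
    "YOUR_APIFY_TOKEN",
    {"searchValue": "APPLE", "searchByTypeIndicator": "EntityName", "maxItems": 10},
)
```

To execute the run, pass `req` to `urllib.request.urlopen` and decode the response body with `json.load`. The synchronous endpoint suits short runs; for large `maxItems` values, starting the run asynchronously and reading the dataset afterwards is likely the safer pattern.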

Schedules. Use Apify Scheduler to capture daily snapshots of new registrations. Combine with the Apify dataset diff tools to track new filings.
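If you export two scheduled snapshots as JSON, tracking new filings comes down to comparing them by dosId. The sketch below is plain Python under that assumption, not an Apify feature; the function names `new_filings` and `status_changes` are hypothetical, and the records follow the output schema above.

```python
def new_filings(previous: list, current: list) -> list:
    """Return records whose dosId appears in `current` but not in `previous`."""
    seen = {record["dosId"] for record in previous}
    return [record for record in current if record["dosId"] not in seen]


def status_changes(previous: list, current: list) -> list:
    """Return (dosId, old_status, new_status) for entities whose status changed."""
    old_status = {record["dosId"]: record.get("status") for record in previous}
    changes = []
    for record in current:
        dos_id = record["dosId"]
        if dos_id in old_status and old_status[dos_id] != record.get("status"):
            changes.append((dos_id, old_status[dos_id], record.get("status")))
    return changes


# Two tiny snapshots using IDs from the sample records above.
yesterday = [
    {"dosId": "12345678", "status": "Active"},
    {"dosId": "99887766", "status": "Active"},
]
today = [
    {"dosId": "12345678", "status": "Active"},
    {"dosId": "99887766", "status": "Inactive"},
    {"dosId": "23456789", "status": "Active"},
]

added = new_filings(yesterday, today)
changed = status_changes(yesterday, today)
```

Here `added` contains only the record for 23456789, and `changed` reports that 99887766 moved from Active to Inactive. Keying on dosId rather than entityName avoids false positives when an entity is renamed.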


How much does it cost?

Apify gives you $5 in free monthly credits on the Apify Free plan, enough to test NY Business Registry Scraper and pull a real sample dataset. For ongoing usage:

  • Starter plan ($49/month): Recommended for individuals running NY Business Registry Scraper regularly. Includes higher concurrency and larger datasets.
  • Scale plan ($499/month): Recommended for teams running NY Business Registry Scraper at production scale.

Pay-Per-Event pricing means you only pay for what you actually use. Failed runs are never charged. See the Pricing tab on this Actor's page for exact event prices.

Tips for using NY Business Registry Scraper

  • Start with a small maxItems (3-10) to validate output format before running larger jobs.
  • Use Apify Schedules to run NY Business Registry Scraper on a recurring basis and keep your dataset fresh.
  • Export via Integrations: Apify connects to Google Sheets, Airbyte, Make, Zapier, and direct webhooks, so you can pipe your data anywhere.
  • Monitor with webhooks: trigger downstream workflows the moment a run finishes.
  • Re-run failed items: if any individual records error out, re-run with their inputs only. Failed events are not charged.

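Is it legal to scrape the NY DOS business registry?

Generally, yes. NY Business Registry Scraper only collects publicly available data. In hiQ Labs v. LinkedIn, the Ninth Circuit held that scraping publicly accessible data likely does not violate the Computer Fraud and Abuse Act, and the practice is widely used for research, market analysis, and business intelligence.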

However, you are responsible for:

  • Respecting the source website's Terms of Service.
  • Complying with GDPR, CCPA, and other applicable data-protection laws when personal data is involved.
  • Not republishing copyrighted content without permission.

If you have specific compliance concerns, consult your legal team. See the Apify legal docs for more.

Frequently Asked Questions

Integrate with any app

NY Business Registry Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications in your channels
  • Airbyte - Pipe results into your warehouse
  • GitHub - Trigger runs from commits and releases
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes.


Pro tip: browse the complete ParseForge collection for more reference-data scrapers.


Need help? Open our contact form to request a new scraper, propose a custom project, or report an issue.


Disclaimer. This Actor is an independent tool. The New York Department of State public registry is a public US state government data source. The Actor accesses only publicly available records and is intended for legitimate compliance, due-diligence, and research use. Users are responsible for compliance with the source site's Terms of Service and applicable law.