Texas Biz Scraper

Pricing: from $25.00 / 1,000 results

Pulls Texas business entity records by date range from the Texas Open Data Portal (data.texas.gov) via the Socrata API.

Rating: 0.0 (0)
Developer: Jason (Maintained by Community)
Actor stats: 0 bookmarked, 2 total users, 0 monthly active users, 1.3 days issues response, last modified 5 days ago


Texas Secretary of State Business Scraper

Scrape Texas business entities from the Texas Open Data Portal using the official Socrata API.

🎯 What This Actor Does

This Apify Actor fetches business entity data from the Texas Comptroller's Active Franchise Tax Permit Holders dataset, which covers businesses that hold a Texas franchise tax permit.

Data Source: data.texas.gov

✨ Key Features

  • Fast API access - No web scraping, uses official Socrata API
  • Date range filtering - Get businesses registered in specific time periods
  • Entity type filtering - Filter by LLC, Corporation, Partnership, etc.
  • Name search - Find businesses by name
  • Location filtering - Filter by city or state
  • Active status filter - Get only businesses with active right to transact
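
As a rough illustration of how filters like these could translate into a Socrata query, the sketch below builds a SoQL `$where` expression. It is not the Actor's actual code: the column names are hypothetical placeholders, and the real dataset schema on data.texas.gov may differ in both names and column types.

```python
from datetime import date

# Hypothetical column names, used only for illustration. Check the
# dataset's schema on data.texas.gov for the real ones.
DATE_COLUMN = "responsibility_beginning_date"
NAME_COLUMN = "taxpayer_name"
CITY_COLUMN = "taxpayer_city"
ACTIVE_COLUMN = "right_to_transact_business_code"


def soql_quote(value: str) -> str:
    """Wrap a string as a SoQL literal, doubling embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def build_where(start_date=None, end_date=None, name_search=None,
                city=None, active_only=False) -> str:
    """Join the supplied filters into a single SoQL $where expression."""
    clauses = []
    if start_date:
        # fromisoformat() rejects malformed dates before they reach the API.
        day = date.fromisoformat(start_date).isoformat()
        clauses.append(f"{DATE_COLUMN} >= {soql_quote(day)}")
    if end_date:
        day = date.fromisoformat(end_date).isoformat()
        clauses.append(f"{DATE_COLUMN} <= {soql_quote(day)}")
    if name_search:
        # Case-insensitive substring match on the business name.
        pattern = "%" + name_search.upper() + "%"
        clauses.append(f"upper({NAME_COLUMN}) like {soql_quote(pattern)}")
    if city:
        clauses.append(f"{CITY_COLUMN} = {soql_quote(city)}")
    if active_only:
        clauses.append(f"{ACTIVE_COLUMN} = 'Y'")
    return " AND ".join(clauses)


print(build_where(start_date="2025-01-01", city="AUSTIN", active_only=True))
```

The resulting string would be passed as the `$where` parameter of a request to the dataset's Socrata endpoint. Entity type filtering is left out here because the mapping from input labels to type codes is not documented.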

🚀 Example Inputs

Get all new active LLCs from 2025

{
  "startDate": "2025-01-01",
  "endDate": "2025-12-31",
  "entityTypes": ["LLC"],
  "activeOnly": true,
  "maxResults": 10000
}

Search for businesses by name

{
  "nameSearch": "consulting",
  "activeOnly": true,
  "maxResults": 5000
}

Get businesses in Austin registered since 2025-01-01

{
  "city": "AUSTIN",
  "startDate": "2025-01-01",
  "maxResults": 10000
}

Get all new corporations from 2024 (any status)

{
  "startDate": "2024-01-01",
  "endDate": "2024-12-31",
  "entityTypes": ["Corporation"],
  "maxResults": 50000
}
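
Inputs like the ones above can also be submitted programmatically. The following is a minimal sketch using the `apify-client` Python package, assuming it is installed and that an API token is available in an `APIFY_TOKEN` environment variable. The Actor ID shown is a placeholder, not this Actor's real identifier.

```python
import os


def run_scraper(client, actor_id: str, run_input: dict) -> list:
    """Start the Actor, wait for it to finish, and return its dataset items."""
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("The Actor run did not return any run details")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


def main() -> None:
    """Example invocation against the live platform (needs network access)."""
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    items = run_scraper(
        client,
        "<username>/<actor-name>",  # placeholder: copy the ID from the Store page
        {"nameSearch": "consulting", "activeOnly": True, "maxResults": 5000},
    )
    print(f"Fetched {len(items)} records")
```

Calling `main()` starts a billable run, so it is left uninvoked here. Passing the client in as an argument keeps `run_scraper` easy to exercise with a stub.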

📊 Output Data

Each record includes:

| Field | Description |
|---|---|
| taxpayerNumber | Texas Comptroller taxpayer number (11 digits) |
| taxpayerName | Business legal name |
| address | Street address |
| city | City |
| state | State code |
| zip | ZIP code |
| countyCode | Texas county code |
| organizationalTypeCode | Entity type code (CI, LI, etc.) |
| organizationalType | Human-readable entity type |
| recordTypeCode | Record type |
| responsibilityBeginningDate | Date business became responsible for franchise tax |
| rightToTransactBusiness | "Active" or "Not Active" |
| rightToTransactBusinessCode | Y or N |
| scrapedAt | Timestamp of scrape |
| source | "texas_open_data_portal" |

🏢 Entity Type Codes

| Code | Entity Type |
|---|---|
| CI | Corporation - Domestic |
| CF | Corporation - Foreign |
| LI | LLC - Domestic |
| LF | LLC - Foreign |
| PI | Limited Partnership - Domestic |
| PF | Limited Partnership - Foreign |
| NI | Nonprofit Corporation - Domestic |
| NF | Nonprofit Corporation - Foreign |
| AI | Professional Association - Domestic |
| AF | Professional Association - Foreign |
| RI | Limited Liability Partnership - Domestic |
| RF | Limited Liability Partnership - Foreign |
| SI | Professional Corporation - Domestic |
| SF | Professional Corporation - Foreign |
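
For post-processing, the code table above can be kept as a simple lookup. The sketch below is illustrative only: `codes_for` shows one plausible way a label such as "LLC" could expand to its domestic and foreign codes, which is an assumption and not necessarily how the Actor resolves `entityTypes`.

```python
ENTITY_TYPES = {
    "CI": "Corporation - Domestic",
    "CF": "Corporation - Foreign",
    "LI": "LLC - Domestic",
    "LF": "LLC - Foreign",
    "PI": "Limited Partnership - Domestic",
    "PF": "Limited Partnership - Foreign",
    "NI": "Nonprofit Corporation - Domestic",
    "NF": "Nonprofit Corporation - Foreign",
    "AI": "Professional Association - Domestic",
    "AF": "Professional Association - Foreign",
    "RI": "Limited Liability Partnership - Domestic",
    "RF": "Limited Liability Partnership - Foreign",
    "SI": "Professional Corporation - Domestic",
    "SF": "Professional Corporation - Foreign",
}


def describe_entity_type(code: str) -> str:
    """Return the human-readable entity type, or 'Unknown' for unlisted codes."""
    return ENTITY_TYPES.get(code.strip().upper(), "Unknown")


def codes_for(label: str) -> list:
    """Return every code whose description is '<label> - Domestic/Foreign'."""
    prefix = label + " - "
    return [code for code, name in ENTITY_TYPES.items() if name.startswith(prefix)]


def is_foreign(code: str) -> bool:
    """In the table above, every foreign entity code ends in 'F'."""
    return code.strip().upper().endswith("F")


print(codes_for("LLC"), describe_entity_type("ni"), is_foreign("CF"))
```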

🔧 Input Configuration

| Field | Type | Required | Description |
|---|---|---|---|
| startDate | String | No | Start date (YYYY-MM-DD) |
| endDate | String | No | End date (YYYY-MM-DD) |
| entityTypes | Array | No | Entity types to include |
| nameSearch | String | No | Search text for business names |
| city | String | No | City filter (exact match) |
| state | String | No | State code filter (TX, CA, etc.) |
| activeOnly | Boolean | No | Only active businesses (default: false) |
| maxResults | Integer | No | Max results (default: 10000, 0=unlimited) |
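
Before submitting an input, it can help to apply the documented defaults and catch obvious mistakes locally. The helper below is a hypothetical client-side check based only on the table above. The `limit` key it adds is an illustrative convenience, not an Actor field.

```python
from datetime import date

# Defaults as documented in the input table.
DEFAULTS = {"activeOnly": False, "maxResults": 10000}


def normalize_input(raw: dict) -> dict:
    """Apply documented defaults and validate dates and maxResults."""
    cfg = {**DEFAULTS, **raw}

    parsed = {}
    for key in ("startDate", "endDate"):
        if cfg.get(key):
            # Raises ValueError if the value is not a valid ISO date.
            parsed[key] = date.fromisoformat(cfg[key])
    if len(parsed) == 2 and parsed["startDate"] > parsed["endDate"]:
        raise ValueError("startDate must not be after endDate")

    max_results = cfg["maxResults"]
    if not isinstance(max_results, int) or max_results < 0:
        raise ValueError("maxResults must be a non-negative integer")

    # 0 means unlimited, represented here as None.
    cfg["limit"] = None if max_results == 0 else max_results
    return cfg


print(normalize_input({"startDate": "2025-01-01", "entityTypes": ["LLC"]}))
```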

⚡ Performance

  • Speed: ~10,000 records per minute
  • API Limit: 1,000 records per batch (automatic pagination)
  • Memory: 1 GB is enough
  • Cost: Very low compute usage
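
The batching described above can be sketched as an offset-based loop. This is a generic illustration rather than the Actor's source: `fetch_page` is a hypothetical callable that would wrap a Socrata request using the `$limit` and `$offset` parameters, ideally with a stable `$order` so pages do not overlap.

```python
from typing import Callable, Iterator, Optional

BATCH_SIZE = 1000  # records per request, as stated above


def paginate(fetch_page: Callable[[int, int], list],
             max_results: Optional[int] = None) -> Iterator[dict]:
    """Yield records batch by batch until the source or the cap is exhausted."""
    offset = 0
    while True:
        limit = BATCH_SIZE
        if max_results is not None:
            # Shrink the final batch so the cap is never exceeded.
            limit = min(BATCH_SIZE, max_results - offset)
            if limit <= 0:
                return
        page = fetch_page(offset, limit)
        yield from page
        offset += len(page)
        if len(page) < limit:
            return  # a short page means there is nothing left to fetch


# Usage with an in-memory stand-in for the API.
fake_rows = [{"id": i} for i in range(2500)]
rows = list(paginate(lambda off, lim: fake_rows[off:off + lim], max_results=1500))
print(len(rows))
```

With `max_results=None` the loop keeps requesting full batches until a short page arrives, which corresponds to the `0=unlimited` setting.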

💰 Cost Estimate

  • 10,000 records: ~0.1 compute units
  • 100,000 records: ~0.5 compute units
  • 1,000,000 records: ~2-3 compute units

📈 Dataset Size

The full Active Franchise Tax Permit Holders dataset contains millions of records. Use filters to get manageable result sets:

  • All 2025 registrations: ~500,000-1,000,000 records
  • All LLCs from 2025: ~300,000-500,000 records
  • Specific city for one year: ~1,000-50,000 records

🐛 Debugging

Check the Key-Value store for:

  • summary - Run statistics and breakdowns
  • sample-data - First 10 records (preview)
  • error - Error details if the actor crashed
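
These records can also be read programmatically. The sketch below assumes a store object that behaves like the `apify-client` Python package's key-value store client, whose `get_record(key)` returns a dictionary with a `value` entry, or `None` when the key is missing.

```python
DEBUG_KEYS = ("summary", "sample-data", "error")


def read_debug_records(store) -> dict:
    """Return the value of every debug record present in the store."""
    found = {}
    for key in DEBUG_KEYS:
        record = store.get_record(key)  # None when the key does not exist
        if record is not None:
            found[key] = record["value"]
    return found


# Possible usage against a finished run (needs apify-client and a token):
#   from apify_client import ApifyClient
#   store = ApifyClient("<token>").run("<run-id>").key_value_store()
#   print(read_debug_records(store))
```

An `error` key in the result suggests the run crashed. If it is absent, `summary` is the place to check record counts.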

🚀 Deployment

# Install Apify CLI
npm install -g apify-cli
# Login
apify login
# Navigate to project
cd texas-sos-scraper
# Deploy
apify push

📝 Sample Output

{
  "taxpayerNumber": "32095586189",
  "taxpayerName": "ALBERTROSE TRANSPORTATION LLC",
  "address": "23 TROON DR",
  "city": "LAKEWAY",
  "state": "TX",
  "zip": "78738",
  "countyCode": "227",
  "organizationalTypeCode": "LI",
  "organizationalType": "LLC - Domestic",
  "recordTypeCode": "X",
  "responsibilityBeginningDate": "2023-02-09",
  "rightToTransactBusiness": "Not Active",
  "rightToTransactBusinessCode": "N",
  "scrapedAt": "2025-01-09T10:30:00.000Z",
  "source": "texas_open_data_portal"
}
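
When consuming records like this one, it may be convenient to convert the string fields into typed values. The dataclass below is a hypothetical consumer-side helper, not part of the Actor. It assumes the date and timestamp formats shown in the sample and keeps only a subset of fields.

```python
from dataclasses import dataclass
from datetime import date, datetime


@dataclass(frozen=True)
class BusinessRecord:
    taxpayer_number: str
    name: str
    city: str
    state: str
    entity_type_code: str
    responsibility_began: date
    active: bool
    scraped_at: datetime


def parse_record(raw: dict) -> BusinessRecord:
    """Convert one output item into a typed record."""
    return BusinessRecord(
        taxpayer_number=raw["taxpayerNumber"],
        name=raw["taxpayerName"],
        city=raw["city"],
        state=raw["state"],
        entity_type_code=raw["organizationalTypeCode"],
        responsibility_began=date.fromisoformat(raw["responsibilityBeginningDate"]),
        # "Y" means the business has an active right to transact business.
        active=raw["rightToTransactBusinessCode"] == "Y",
        # Replace the trailing "Z" so older Python versions can parse it.
        scraped_at=datetime.fromisoformat(raw["scrapedAt"].replace("Z", "+00:00")),
    )
```

Keeping the taxpayer number as a string avoids losing leading zeros or treating an identifier as a quantity.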

⚠️ Notes

  1. Date Field: The responsibilityBeginningDate is when the business became responsible for franchise tax, not necessarily the formation date.

  2. Data Freshness: The Texas Open Data Portal dataset is updated periodically by the state.

  3. Foreign Entities: Businesses registered in other states but operating in Texas are included with their original state code.

  4. Active Status: "Right to Transact Business" indicates whether the business is currently in good standing with the Texas Comptroller.