Probate & Foreclosure Leads Scraper avatar

Probate & Foreclosure Leads Scraper

Pricing

Pay per event

Go to Apify Store
Probate & Foreclosure Leads Scraper

Probate & Foreclosure Leads Scraper

Aggregate distressed-property events across US open-data portals: probate, foreclosure, sheriff sale, tax lien, tax sale. Unified schema with case numbers, addresses, parties, sale dates, and amounts.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

3

Monthly active users

4 hours ago

Last modified

Share

US Probate & Foreclosure Leads Scraper

Aggregate distressed-property events from US government open-data portals. Returns probate filings, foreclosure notices, sheriff sales, tax liens, and tax sales in one unified schema — sourced from the Socrata Open Data network that powers dozens of county and state data sites.


Probate & Foreclosure Leads Scraper Features

  • Aggregates five distinct distressed-property event types into one schema
  • Pulls from dozens of county and state open-data portals, no scraping per county
  • Normalizes heterogeneous source fields into a single output record — NOLA sheriff sales, NYC tax liens, King County foreclosures, and others line up automatically
  • Filters by state, county, event type, date range, owner-occupancy, and minimum estimated equity
  • Returns the raw source row alongside the normalized record, so nothing is lost in translation
  • Pure JSON API — no browser, no proxies, no captcha

Who Uses Probate & Foreclosure Data?

  • Real estate investors — Feed distressed-property leads into direct-mail campaigns and CRM pipelines
  • Wholesalers — Find motivated sellers before they hit the MLS; the early bird gets the cash deal
  • Title companies — Track lis pendens, sheriff sales, and tax-lien filings by county
  • Estate and probate attorneys — Monitor new probate records for estate-administration outreach
  • PropTech analytics — Build distressed-property scoring models from a unified, geocoded event feed

How Probate & Foreclosure Leads Scraper Works

  1. Discovers datasets by querying the cross-portal Socrata Discovery API with keywords like "foreclosure", "lis pendens", "tax lien", and "sheriff sale".
  2. Classifies each matching dataset as probate, foreclosure, sheriff sale, tax lien, or tax sale using title and description keywords.
  3. Pages through each dataset's SODA API with $limit / $offset until the record cap is reached.
  4. Normalizes the raw rows into a unified schema — field names like defendant, propertyaddress, saledate get mapped to canonical fields so you do not have to write 30 portal-specific parsers yourself.

Input

{
"eventTypes": ["foreclosure", "sheriff_sale", "tax_lien"],
"states": ["LA", "NY"],
"maxItems": 500,
"sp_intended_usage": "Lead gen for distressed-property direct mail",
"sp_improvement_suggestions": ""
}
FieldTypeDefaultDescription
eventTypesarray[] (all)One or more of probate, foreclosure, sheriff_sale, tax_lien, tax_sale.
statesarray[] (all)US two-letter state codes. Filters source portals by their geographic scope.
countiesarray[] (all)County names like King, Cook, Orleans. Case-insensitive.
dateFromstring""ISO date (YYYY-MM-DD). Keeps records with sale_date or filing_date on/after this date.
dateTostring""ISO date. Keeps records with sale_date or filing_date on/before this date.
minEstimatedEquityinteger0Drops records where estimated_value - mortgage_balance is below this USD amount. 0 disables the filter.
onlyOwnerOccupiedbooleanfalseWhen true, drops records where owner-occupancy cannot be verified from the source data.
maxItemsinteger100Hard cap on records returned, counted across all datasets.
proxyConfigurationobjectApify proxy offSocrata endpoints are open APIs and do not need proxies. Leave defaults.

State-filtered example

{
"eventTypes": ["tax_lien"],
"states": ["NY"],
"maxItems": 1000,
"sp_intended_usage": "Tax lien investor research",
"sp_improvement_suggestions": ""
}

Probate & Foreclosure Leads Scraper Output Fields

{
"event_id": "data.nola.gov:d52w-8nva:2012-5883",
"event_type": "sheriff_sale",
"source_url": "https://data.nola.gov/d/d52w-8nva",
"dataset_name": "Sheriff Sales - Lien Foreclosures",
"domain": "data.nola.gov",
"county": "Orleans",
"state": "LA",
"filing_date": null,
"sale_date": null,
"case_number": "2012-5883",
"decedent_name": null,
"executor_name": null,
"attorney_name": null,
"defendant_name": "GREGORY DELORIMIER",
"plaintiff_name": "CITY OF NEW ORLEANS",
"property_address": "5300 LAFAYE STREET",
"property_address_normalized": "5300 LAFAYE STREET",
"parcel_apn": null,
"property_type": null,
"estimated_value_usd": null,
"mortgage_balance_usd": null,
"lien_amount_usd": null,
"opening_bid_usd": null,
"winning_bid_usd": 0,
"owner_occupied": null,
"property_equity_estimate_usd": null,
"status": "Pending",
"lat": null,
"lng": null,
"raw_source_fields": "{\"cdccasenumber\":\"2012-5883\",\"defendant\":\"GREGORY DELORIMIER\",...}"
}
FieldTypeDescription
event_idstringStable per-event identifier: <domain>:<dataset_id>:<case_number_or_parcel>.
event_typestringOne of probate, foreclosure, sheriff_sale, tax_lien, tax_sale.
source_urlstringPermalink to the underlying open-data dataset.
dataset_namestringName of the Socrata dataset this record came from.
domainstringSource portal domain, e.g. data.nola.gov.
countystring|nullCounty name inferred from the portal domain. Null for state-level portals.
statestringUS two-letter state code.
filing_datestring|nullFiling, recording, or notice date — whichever the source provides.
sale_datestring|nullScheduled or completed sale/auction date.
case_numberstring|nullCourt case or docket number.
decedent_namestring|nullDecedent name (probate records).
executor_namestring|nullExecutor, administrator, or personal representative (probate).
attorney_namestring|nullFiling attorney name when present.
defendant_namestring|nullDefendant, debtor, owner, or mortgagor name.
plaintiff_namestring|nullPlaintiff, creditor, lender, or mortgagee name.
property_addressstring|nullStreet address as it appears in the source dataset.
property_address_normalizedstring|nullUppercased trimmed version of property_address for matching.
parcel_apnstring|nullAssessor parcel number or equivalent identifier.
property_typestring|nullProperty-use or building-class code from the source.
estimated_value_usdnumber|nullAssessed or estimated market value when the source provides it.
mortgage_balance_usdnumber|nullOutstanding mortgage or judgment balance.
lien_amount_usdnumber|nullLien amount or tax owed.
opening_bid_usdnumber|nullOpening, minimum, or upset bid at auction.
winning_bid_usdnumber|nullWinning or final sale amount.
owner_occupiedboolean|nullTrue when the source marks the property owner-occupied or homestead.
property_equity_estimate_usdnumber|nullestimated_value_usd - mortgage_balance_usd, null if either is missing.
statusstring|nullStatus string such as Pending, Sold, or Redeemed.
latnumber|nullLatitude when geocoded in the source.
lngnumber|nullLongitude when geocoded in the source.
raw_source_fieldsstringOriginal source row, JSON-encoded — preserves any fields not covered by the unified schema.

Field population varies by portal. NOLA sheriff sales include case numbers, parties, and addresses; NYC tax liens include borough/block/lot but not names; King County lists parcel IDs only. The raw_source_fields column is always populated, so nothing the source exposes is lost.


FAQ

How do I scrape probate and foreclosure records in the US?

Probate & Foreclosure Leads Scraper queries the Socrata Open Data network — the same backend used by data.cityofnewyork.us, data.nola.gov, datacatalog.cookcountyil.gov, and dozens of other county and state portals. It classifies matching datasets by keyword, pages through each one, and normalizes the heterogeneous columns into a single distressed-property schema.

How much does Probate & Foreclosure Leads Scraper cost to run?

Probate & Foreclosure Leads Scraper uses pay-per-event pricing: $0.10 per run plus $0.001 per record returned. A 100-record run costs $0.20. A 5,000-record county sweep costs $5.10.

What data can I get from this scraper?

Probate & Foreclosure Leads Scraper returns up to 30 normalized fields per record — case numbers, defendant and plaintiff names, property addresses, parcel IDs, sale dates, bid amounts, property types, and owner-occupancy flags. The exact field population depends on what each source dataset publishes. The raw source row is always included so you can dig deeper when the source has fields we did not normalize.

Does this scraper need proxies?

Probate & Foreclosure Leads Scraper does not need proxies. Socrata open-data APIs are public, unauthenticated, and do not rate-limit casually. Leaving the proxy configuration at its defaults is the right call.

Can I filter by state or event type?

Yes. Set states to one or more two-letter state codes to restrict the source portals that get queried. Set eventTypes to any subset of probate, foreclosure, sheriff_sale, tax_lien, tax_sale. Combine with counties, dateFrom/dateTo, and minEstimatedEquity to narrow the feed further.

Which counties does it cover?

Coverage follows the Socrata Open Data network. That means major counties and cities — NYC (New York), King (WA), Cook (IL), Orleans (LA), Prince George's (MD), Riverside (CA), Howard (MD), Montgomery (MD), Sonoma (CA), Middlesex (MA), and others — plus state-level portals for CT, NY, NJ, MD, CO, OR, IA, and WA. Coverage expands as counties publish new datasets; the scraper picks them up automatically on the next run.


Need More Features?

Need custom fields, a specific county portal, or an enrichment add-on like value/equity estimates? File an issue or get in touch.

Why Use Probate & Foreclosure Leads Scraper?

  • Five event types, one schema — Probate, foreclosure, sheriff sale, tax lien, and tax sale records land in the same 30-field record shape. No separate parsers, no reconciliation pipeline.
  • National reach without per-county scrapers — Adding a new county means the source portal joined Socrata; you do not rebuild anything. That is the whole point of picking an open-data backbone over scraping 500 court websites one at a time.
  • Affordable — $0.001 per record. A full state-wide sweep is a rounding error in a direct-mail budget.