DivorceNet US State Divorce & Family Law Guide Scraper avatar

DivorceNet US State Divorce & Family Law Guide Scraper

Pricing

Pay per event

Go to Apify Store
DivorceNet US State Divorce & Family Law Guide Scraper

DivorceNet US State Divorce & Family Law Guide Scraper

Scrapes DivorceNet.com state-by-state divorce and family law guide articles. Extracts body text, summaries, and article metadata for all 50 US states plus DC.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

7 days ago

Last modified

Categories

Share

Scrape DivorceNet.com — the Nolo/Internet Brands legal content network — for state-by-state divorce and family law guide articles covering all 50 US states plus DC. Returns one record per article with structured metadata and full body text.

What It Scrapes

DivorceNet.com publishes an extensive library of state-specific divorce guides covering:

  • Residency requirements before filing
  • Mandatory waiting periods
  • Grounds for divorce (no-fault and fault-based)
  • Alimony / spousal support rules by state
  • Property division (community property vs. equitable distribution)
  • The filing process and required court forms

Coverage: 215 /states/<state>/ articles across 34 states + 50 /topics/ hub articles = ~265 records total.

Output Fields

FieldDescription
stateState name extracted from URL (e.g., "California")
page_titleArticle headline
article_typestate_article or topic_article
residency_requirementSentence mentioning residency requirement (best-effort)
waiting_periodSentence mentioning mandatory waiting period (best-effort)
grounds_summarySentence on no-fault or grounds for divorce (best-effort)
alimony_summarySentence on alimony/spousal support (best-effort)
property_division_summarySentence on property division (best-effort)
filing_process_summarySentence on how to file (best-effort)
body_textFull article body text (up to 10,000 characters)
source_urlOriginal article URL

Note: The structured fields (residency_requirement, waiting_period, etc.) are extracted via keyword matching and are populated when the corresponding information appears in the article. Articles focused on a single topic will have other structured fields as null. The body_text field always carries the complete article content.

Input

ParameterTypeDefaultDescription
maxItemsinteger10Maximum number of articles to scrape. Set to 0 for no limit (scrapes all ~265 matching articles).

Sample Output

{
"state": "California",
"page_title": "Dividing the Debts in a California Divorce",
"article_type": "state_article",
"residency_requirement": null,
"waiting_period": null,
"grounds_summary": null,
"alimony_summary": null,
"property_division_summary": "California is a community property state, meaning assets or debts acquired during marriage are generally considered jointly owned.",
"filing_process_summary": "You can file for divorce yourself or have an attorney file the petition for divorce.",
"body_text": "While there are some basic rules surrounding California divorce debt division...",
"source_url": "https://www.divorcenet.com/states/california/dividing_the_debts_in_divorce"
}

How It Works

  1. Fetches DivorceNet's XML sitemap (sitemap.xml, 1101 total URLs)
  2. Filters to /states/ and /topics/ paths (~265 URLs)
  3. Crawls each article page with CheerioCrawler (no proxy needed — server-rendered HTML)
  4. Extracts body text from the #region-content Drupal content area
  5. Applies regex-based sentence matching for structured fields

Use Cases

  • Legal RAG datasets — feed state divorce law narratives into LLM legal Q&A systems
  • Divorce SaaS and paralegal tools — enrich state-by-state process guides
  • Content marketing — source authoritative state law summaries
  • Research — compare divorce law across states

Technical Notes

  • No proxy required — DivorceNet serves plain server-rendered HTML with no anti-bot measures
  • Polite crawl rate (3-8 concurrent requests, rate-limit handling enabled)
  • Full run (~265 articles) completes in a few minutes at default concurrency