SEC Form D Scraper — Capital-Raise Leads
Pricing
Pay per event
SEC Form D Scraper — Capital-Raise Leads
Scrape SEC EDGAR Form D (Regulation D capital-raise) filings into structured B2B leads, exported to JSON or CSV — every row carries company name, address, phone, industry, offering amounts, investor counts, officers, and exemption codes. No API key, no login.
Pricing
Pay per event
Rating
0.0
(0)
Developer
DevilScrapes
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
SEC Form D Scraper — Regulation D Capital-Raise Leads
We do the dirty work so your dataset stays clean. 😈
$5.05 / 1,000 rows — Turn the U.S. SEC EDGAR Form D filing stream into a structured B2B lead list. Every U.S. company raising private capital under Regulation D must file Form D with the SEC within 15 days of the first sale — making this a near-real-time feed of "this company just raised money" announcements. Every row carries the issuer's legal name, address, phone number, industry group, offering amount, investor count, and the named officers / directors / promoters. Pay only for results that land; no subscription, no annual contract.
🎯 What this scrapes
One ResultRow per qualifying Form D filing pulled from the SEC EDGAR Full-Text Search index and the EDGAR static-archive XML. Each row has 27 top-level fields plus a nested related_persons list of officers, directors, and promoters. Data is U.S. government public domain — there are no terms-of-service restrictions on redistribution.
| Field | Type | Description |
|---|---|---|
accession_number | string | SEC accession number with dashes |
cik | string | Zero-padded 10-character Central Index Key |
entity_name | string | Issuer legal name |
entity_type | string | null | Corporation, Limited Liability Company, Limited Partnership, etc. |
jurisdiction_of_incorporation | string | null | Full state / country name of incorporation |
year_of_incorporation | integer | null | Null when issuer reports overFiveYears=true |
issuer_street_1 / _2 / city / state_or_country / zip_code | strings | null | Issuer postal address |
issuer_phone_number | string | null | Issuer phone as filed (raw) |
industry_group_type | string | null | E.g. Other Technology, Pooled Investment Fund, Biotechnology |
total_offering_amount_usd | number | null | Null when issuer reports Indefinite |
total_amount_sold_usd | number | null | Amount sold to date in USD |
total_remaining_usd | number | null | Null when Indefinite |
is_indefinite_amount | boolean | True when issuer reported Indefinite |
total_number_already_invested | integer | null | Investor count to date |
minimum_investment_usd | number | null | Null when no minimum (raw "0") |
exemption_claimed | string[] | Federal exemption codes (e.g. ["06B", "3C", "3C.1"]) |
is_new_notice | boolean | True for new D, False for D/A amendment |
date_of_first_sale | string | null | ISO date; null when yetToOccur=true |
related_persons | object[] | Officers / directors / promoters with names and locations |
filing_date | string | EDGAR acceptance date (ISO) |
filing_url | string | Canonical EDGAR company-filings page |
scraped_at | string | ISO 8601 UTC timestamp |
🔥 Features
- Real-time SEC EDGAR Form D feed — every Reg D capital raise in the U.S. lands here within 15 days of the first sale.
- 27-field extraction including the structured B2B contact fields (phone, address, named officers) that generic EDGAR scrapers do not surface.
- Free-text query, date range, U.S. state filter, and minimum-raise-size filter — all four input axes optional, all four post-filtered before the per-row charge so you pay only for kept rows.
- Amendment toggle:
includeAmendments=false(default) skips Form D/A entries before fetching the XML — saves both bandwidth and rate-limit budget. - XML schema quirks handled correctly —
"Indefinite"offering amounts,overFiveYearsyear-of-inc,yetToOccurfirst-sale flag, raw"0"minimum investment → null. - EDGAR Fair Access Policy compliant — mandatory
User-Agentheader set on every request; 0.1 s inter-fetch sleep keeps the run under the 10 req/s/IP limit. - Pydantic v2 input validation: ISO date shape checked before any network call,
start_date <= end_dateenforced, state filter normalised to uppercase.
What we handle for you
SEC EDGAR applies rate limits, enforces User-Agent policy, and returns XML quirks that break naive parsers. We absorb all of it:
- We rotate browser fingerprints using
curl-cffi(Chrome / Firefox TLS impersonation) so every request looks like a real browser, not a script — robust against any future EDGAR fingerprint tightening. - We retry with exponential backoff on
429/503responses and honourRetry-Afterheaders. Up to 5 attempts per page before surfacing a clear failure. - We enforce EDGAR's rate-limit pacing — a 0.1 s inter-fetch sleep caps throughput at the Fair Access Policy ceiling; your run never triggers an IP ban.
- We keep the dataset clean — Pydantic-validated rows, ISO-8601 timestamps, stable IDs, null-safe numeric fields. No garbage rows, no silent empties.
- You pay only for results that land. Rows discarded by your state or minimum-raise filters are never charged. No data → no charge (only the small
actor-startwarm-up fee).
💡 Use cases
- B2B sales prospecting — pull every Form D filed in the last 7-30 days in your target state (e.g.
stateFilter=CA,minOfferingAmountUsd=1000000) for fresh trigger-event leads to SaaS / professional-services teams selling into newly-funded startups. - VC deal-flow tracking — query
industryGroupType=Biotechnologyand filter by state to surface every Series A-and-earlier biotech raise in your geography. - Compete-monitoring — pass a
queryterm matching a competitor's domain category to detect new entrants raising private capital. - Journalism + research — bulk-export Form D over a multi-month window to measure private-market capital flow by state, industry, or exemption code.
- Crunchbase / PitchBook alternative for early-stage — Form D is the original-source filing every paid private-markets database derives from, available here at $5/1,000 rows vs $299-999/month.
- Compliance + KYC — verify a counterparty's Reg D filing history by issuer name or CIK before signing a master service agreement.
⚙️ How to use it
- Open the Actor input form on the Apify Console.
- (Optional) Set
queryto a free-text search term (e.g."artificial intelligence") — passes through to EDGAR Full-Text Search. - (Optional) Set
startDateandendDate(ISOYYYY-MM-DD). Defaults: last 30 days. - (Optional) Set
stateFilterto a U.S. state ISO-2 code (e.g.CA,NY,TX) to keep only matching issuers. - (Optional) Set
minOfferingAmountUsdto filter out small raises (indefinite-amount filings always pass). - Set
maxResults(1-5000). Default 100. - Leave
includeAmendments=falseto skip Form D/A entries; settrueto include them withis_new_notice=false. - Toggle
useProxy=trueif you hit persistent429responses on high-volume runs. - Click Start. Results stream into the default dataset as JSON / CSV / Excel / XML.
California raises ≥ $1M in the last 30 days
{"stateFilter": "CA","minOfferingAmountUsd": 1000000,"maxResults": 500}
Last week's biotech filings nationally
{"query": "biotechnology","startDate": "2026-05-09","endDate": "2026-05-16","maxResults": 200}
Every Form D + amendments filed today
{"startDate": "2026-05-16","endDate": "2026-05-16","includeAmendments": true,"maxResults": 5000}
📥 Input
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
query | string | no | — | Free-text EDGAR search (1-200 chars) |
startDate | string | no | 30 days ago | Inclusive lower bound, ISO YYYY-MM-DD |
endDate | string | no | today | Inclusive upper bound, ISO YYYY-MM-DD |
stateFilter | string | no | — | Issuer state ISO-2 code (uppercased) |
minOfferingAmountUsd | integer | no | — | Discard filings below this amount (USD) |
maxResults | integer | no | 100 | Cap on emitted rows (1-5000) |
includeAmendments | boolean | no | false | Include Form D/A amendments |
useProxy | boolean | no | false | Route via Apify Proxy (BUYPROXIES94952) |
startDate and endDate are validated for ISO shape and ordering before any network call — invalid input raises immediately and the run exits non-zero.
📤 Output
One row per qualifying Form D filing, pushed to the default dataset and available as JSON, CSV, Excel, or XML.
{"accession_number": "0002122627-26-000001","cik": "0002122627","entity_name": "AM-0304 Fund II, a series of Delk-SPV, LP","entity_type": "Limited Partnership","jurisdiction_of_incorporation": "DELAWARE","year_of_incorporation": 2026,"issuer_street_1": "2006 196TH ST SW","issuer_street_2": "SUITE 114","issuer_city": "LYNNWOOD","issuer_state_or_country": "WA","issuer_zip_code": "98036","issuer_phone_number": "2068016359","industry_group_type": "Pooled Investment Fund","total_offering_amount_usd": 577807.0,"total_amount_sold_usd": 577807.0,"total_remaining_usd": 0.0,"is_indefinite_amount": false,"total_number_already_invested": 35,"minimum_investment_usd": 1000.0,"exemption_claimed": ["06B", "3C", "3C.1"],"is_new_notice": true,"date_of_first_sale": "2026-05-14","related_persons": [{"first_name": "N/A","last_name": "Fund GP, LLC","city": "Wilmington","state_or_country": "DE","relationships": ["Director"]}],"filing_date": "2026-05-15","filing_url": "https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=0002122627&type=D","scraped_at": "2026-05-16T12:00:00.000000+00:00"}
Export formats
- JSON — full fidelity, newline-delimited.
- CSV — flat; nested
related_personsserialised as JSON string per Apify convention. - Excel —
.xlsxvia the Apify dataset converter. - XML — structured per-item.
All formats are available via the Apify API: GET /datasets/{id}/items?format=csv&clean=true.
💰 Pricing
Pay-Per-Event (PPE) — you pay only for what you use:
| Event | Price (USD) | When |
|---|---|---|
actor-start | $0.05 | Once per run, at boot |
result-row | $0.005 | Per Form D row written to dataset (after filters) |
Example costs
| Run | Rows | Cost |
|---|---|---|
| 7-day default scan, ~50 filings | 50 | $0.30 |
| California ≥$1M, 30 days, ~200 filings | 200 | $1.05 |
| AI keyword, 90 days, ~500 filings | 500 | $2.55 |
| 1,000 filings | 1,000 | $5.05 |
| Maximum cap, 5,000 filings | 5,000 | $25.05 |
At scale, the per-row charge dominates: ~$5 per 1,000 Form D rows. Crunchbase, PitchBook, and DealRoom expose the same underlying SEC data inside subscriptions priced $299-999/month with annual contracts — this Actor delivers the raw filings on a pay-as-you-go basis with no minimums.
🚧 Limitations
- Public EDGAR Full-Text Search only. No EDGAR Online, no authenticated EDGAR features, no premium SEC data products.
- Form D and Form D/A only. Forms 3/4/5 (insider transactions), Form 10-K/Q (public-company financials), and Form S-1/424 (public offerings) are out of scope.
- No email address extraction. Form D does not contain email addresses; only the issuer phone and the named officers' city + state are filed.
- No use-of-proceeds narrative. Free-text fields beyond the structured XML elements listed above are out of scope.
- No sales-compensation extraction. Broker / finder fee data is present in the XML but excluded — out of scope for lead generation.
- No cross-run deduplication. Overlapping date ranges may surface the same filing multiple times across separate runs.
- EDGAR rate limit applies. The Actor enforces a 0.1 s inter-fetch sleep, capping the practical throughput at ~10 filings/second / ~600 filings/minute / 5,000 cap → ~9 minutes for a full run.
- 7-day default storage retention on the Apify FREE plan. Export the dataset immediately after the run or move to a paid plan for longer retention.
- U.S. government public domain. SEC EDGAR data has no copyright; attribution is optional but appreciated.
❓ FAQ
Do I need an API key or login?
No. The EDGAR Full-Text Search endpoint and the static EDGAR archive are both fully public U.S. government resources. The Actor sends a User-Agent header identifying Devil Scrapes per EDGAR Fair Access Policy §2.4 — this is a compliance requirement, not an authentication step.
Is there a Form D filings API I can query programmatically?
Yes — the Apify API exposes this Actor as a Form D filings API. Run the Actor via POST /v2/acts/DevilScrapes~sec-form-d-leads/runs with your filter parameters, then retrieve structured results from the dataset endpoint. Full Apify API docs at docs.apify.com.
Why does the cik URL not have leading zeros?
The EDGAR Archives URL path requires the integer form of the CIK (no padding), while the JSON / search APIs return the zero-padded 10-character form. The Actor normalises both — every output row's cik is the canonical zero-padded form ("0002122627"), and the internal URL builder strips the leading zeros for the XML fetch. Pre-padding the URL returns HTTP 404.
What's the difference between Form D and Form D/A?
Form D is the initial notice of a Reg D offering; Form D/A is an amendment (typically to update the total amount sold, add investors, or correct an error). When includeAmendments=false (default), only new D notices are returned. When true, amendments are included with is_new_notice=false.
Why is total_offering_amount_usd sometimes null?
The issuer reported the literal string "Indefinite" (common for evergreen funds and certain real-estate offerings). The is_indefinite_amount flag is True on those rows, and total_offering_amount_usd + total_remaining_usd are both null. Indefinite-amount rows always pass the minOfferingAmountUsd filter — we never discard them on a numeric comparison they can't satisfy.
Why is year_of_incorporation sometimes null?
The issuer reported overFiveYears=true in the XML — the SEC does not require an exact year when the entity is more than 5 years old. Issuers younger than 5 years report withinFiveYears=true and the specific <value>YYYY</value> we parse to an integer.
Can I redistribute the data?
Yes. SEC EDGAR data is U.S. government public domain — there are no licensing restrictions on redistribution or commercial use. Attribution to the U.S. Securities and Exchange Commission is optional but recommended.
How does this compare to paid Regulation D filings list providers?
Crunchbase ($299-999/mo), PitchBook ($30k+/yr), and CB Insights ($45k+/yr) all derive their private-placement data from SEC Form D as a primary signal, then mark it up substantially. This Actor delivers the same underlying Regulation D filings list at $5.05 per 1,000 rows with no subscription and no minimum spend.
💬 Your feedback
Found a bug, hit a rate limit, or need a new field on the output row? Open an issue on the Actor's Apify Store page or contact the Devil Scrapes team at apify.com/DevilScrapes. We ship updates within days of validated reports.
Funding Intelligence Suite
Cross-link this Actor with the rest of the Devil Scrapes funding + research vertical:
- EU CORDIS Grants Scraper — Horizon Europe + Horizon 2020 grant projects (EU public R&D funding).
- arXiv Papers Scraper — preprint metadata across all arXiv categories.
- PubMed Papers Scraper — biomedical literature from the NIH PubMed index.
- Equity Crowdfunding Leads — Wefunder, Republic, and StartEngine raises for early-stage deal flow.
All suite Actors share consistent PPE pricing ($0.001-$0.005 per row, $0.01-$0.05 per run) and scraped_at ISO 8601 UTC timestamps so cross-source joins work cleanly.