SEC EDGAR Scraper — 10-K, 8-K, Form 4 & 13F
Pricing
from $5.00 / 1,000 results
SEC EDGAR Scraper — 10-K, 8-K, Form 4 & 13F
Scrape SEC EDGAR — 10-K, 10-Q, 8-K, Form 4 insiders, 13F holdings, 13D/G stakes, DEF 14A, Form D, XBRL facts, full-text search. 9 modes auto-detected from ticker/CIK/accession/URL/keyword. MCP-ready, free official SEC API. $0.005/result.
Pricing
from $5.00 / 1,000 results
Rating
0.0
(0)
Developer
Khadin Akbar
Maintained by CommunityActor stats
0
Bookmarked
1
Total users
1
Monthly active users
a day ago
Last modified
Categories
Share
SEC EDGAR Scraper bundles nine modes in one actor and auto-detects ticker, CIK, accession, URL, or keyword. Built for AI agents, quant funds, equity analysts, and compliance teams who need every SEC EDGAR endpoint — 10-K, 10-Q, 8-K, Form 4 insider trades, 13F holdings, 13D/G activism, DEF 14A proxies, Form D and XBRL facts — without juggling nine separate actors.
What it does
Pass anything — a ticker (AAPL), a 10-digit CIK (0000320193), an accession number (0000320193-25-000123), a filing URL, or a free-text keyword — and the actor routes to the right SEC endpoint automatically. Or override with mode.
| Mode | Use it for | Example input |
|---|---|---|
auto | Default. Guesses from query shape. | AAPL |
company | Filings list for a company | MSFT, 0000789019 |
filing | Single filing details (+ optional full text) | 0000320193-25-000123 |
search | Full-text search across every filing | climate risk |
insider | Form 3/4/5 insider trades | TSLA |
holdings | 13F institutional positions | 0001067983 (Berkshire) |
activist | 13D/G beneficial ownership stakes | GME |
xbrl | XBRL financial facts (revenue, assets, EPS) | AAPL + xbrlConcept: Revenues |
form-d | Form D Reg D startup funding filings | AI startup |
recent | Latest global EDGAR feed | (leave query empty) |
Why this actor
- One actor instead of nine. Most SEC scrapers do filings OR insider trades OR XBRL OR search — never all four. This one does ten.
- MCP-ready. Designed for AI agents — Claude/GPT/Gemini call it as one tool, get structured JSON back, pay per result.
- Free SEC API. No proxy, no anti-bot, no API key. Just the official rate-limited public endpoints.
- Flat, agent-friendly schema. Every record has the same top-level keys regardless of mode (with mode-specific fields explicit on each row).
- Auto-detect input. Pass a ticker, a CIK, a URL, an accession, or a keyword — the actor figures it out.
Pricing — Premium PPE + Pay-Per-Usage
Both monetization models enabled. Pick at run time.
| Event | Price | When charged |
|---|---|---|
apify-actor-start | $0.00005 | Once per GB of memory at run start |
result | $0.005 | Per record written to dataset |
ai-summary | $0.03 | Per AI-generated filing summary (only if enableAiSummary=true) |
Typical run cost (50 filings, no AI): ~$0.25. Heavy run cost (1,000 results, no AI): ~$5.00.
Pay-Per-Usage is also enabled for massive workloads where compute-pricing beats per-record. Apify default compute + proxy passthrough rates apply.
Output fields (dataset schema)
Every record has _mode and the standard filing keys; mode-specific keys appear where applicable.
{"_mode": "company","cik": "0000320193","ticker": "AAPL","companyName": "Apple Inc.","accessionNumber": "0000320193-25-000123","formType": "10-K","filedAt": "2025-11-01","acceptedAt": "2025-11-01T18:04:32.000Z","periodOfReport": "2025-09-28","filingUrl": "https://www.sec.gov/Archives/edgar/data/320193/000032019325000123/0000320193-25-000123-index.htm","primaryDocUrl": "https://www.sec.gov/Archives/edgar/data/320193/000032019325000123/aapl-20250928.htm","items": null,"fileNumber": "001-36743","filmNumber": "251404123","size": 13845678,"isXBRL": true}
Mode-specific fields:
- search:
snippet,score - insider:
insiderName,insiderTitle,transactionDate,transactionCode,shares,pricePerShare,totalValue,sharesOwnedAfter - holdings:
institutionName,institutionCik,issuerName,issuerCusip,sharesHeld,valueHeld - activist:
ownershipPercent - xbrl:
concept,value,unit,fy,fp,startDate,endDate - form-d:
offeringAmount,amountSold,industry,executiveOfficers
Input
| Field | Type | Default | Notes |
|---|---|---|---|
query | string | AAPL | Ticker, CIK, accession, URL, or keyword (REQUIRED) |
mode | enum | auto | Override auto-detection (10 values) |
formTypes | string[] | [] | Filter to form types e.g. ["10-K", "10-Q"] |
dateFrom | string | "" | Filing date lower bound YYYY-MM-DD |
dateTo | string | "" | Filing date upper bound YYYY-MM-DD |
maxResults | int | 100 | Cap total dataset items (1-10000) |
includeFullText | bool | false | Fetch + strip HTML to plain text per filing |
enableAiSummary | bool | false | AI summary per filing (+$0.03 each) |
xbrlConcept | string | "" | XBRL concept tag (e.g. Revenues) — xbrl mode only |
Examples
Get all Apple 10-K and 10-Q since 2024
{"query": "AAPL","mode": "company","formTypes": ["10-K", "10-Q"],"dateFrom": "2024-01-01","maxResults": 50}
Full-text search every filing for "climate change risk"
{"query": "climate change risk","mode": "search","formTypes": ["10-K"],"maxResults": 100}
Berkshire Hathaway 13F holdings
{"query": "0001067983","mode": "holdings","maxResults": 200}
Apple revenue across all reported periods (XBRL)
{"query": "AAPL","mode": "xbrl","xbrlConcept": "Revenues","maxResults": 100}
Tesla insider trades this year
{"query": "TSLA","mode": "insider","dateFrom": "2026-01-01"}
Latest 100 SEC filings (any company)
{"query": "","mode": "recent","maxResults": 100}
Use cases
- AI agents — Drop into Claude/GPT/Gemini as one MCP tool covering every SEC endpoint.
- Quant funds — Pull XBRL company facts across the russell 3000 for backtests.
- Equity analysts — Monitor 8-K filings for material events; track Form 4 insider clusters.
- Compliance — Watch DEF 14A say-on-pay results; audit beneficial-ownership disclosures.
- VCs + sales — Mine Form D filings for fresh startup-funding leads.
- Journalists — Full-text search disclosures for ESG, litigation, executive comp signals.
- Backtesters — Survivorship-bias correction via delisted Form 25 / Form 15 (use
recent+ form filter). - LLM training — Bulk download structured filings for fine-tuning finance models.
How it works
- Reads input. If
mode=auto, detects shape with regex:- URL →
filing - 18-digit or
XXXXXXXXXX-XX-XXXXXX→filing - 1-10 digit number →
company(treated as CIK) - Alphanumeric ≤ 10 chars →
company(treated as ticker) - Anything else →
search
- URL →
- Resolves ticker → CIK via the official
company_tickers.jsonlookup (cached). - Hits the right SEC endpoint:
data.sec.gov/submissions/CIK{10-digit}.json— company filingsdata.sec.gov/api/xbrl/companyfacts/CIK{10-digit}.json— XBRL factsdata.sec.gov/api/xbrl/companyconcept/CIK{10-digit}/us-gaap/{concept}.json— single conceptefts.sec.gov/LATEST/search-index?q=...— full-text searchwww.sec.gov/cgi-bin/browse-edgar?action=getcurrent— recent feedwww.sec.gov/Archives/edgar/data/{cik}/{accession}/index.json— single filing
- Rate-limits to 8 req/s (under the SEC 10 req/s fair-access cap).
- Sends required
User-Agent: khadinakbar khadinakbaronline@gmail.comheader on every request. - Pushes flat records to the dataset, charges per result.
MCP integration
Expose via Apify MCP (mcp.apify.com) as tool apify--sec-edgar-all-in-one-scraper. Agents discovery-shop on the description; the tool description is tuned to the 5-part formula (verb + object + domain → when to use → when NOT to use → return shape → pricing).
// Claude Code exampleconst result = await client.callTool({name: 'apify--sec-edgar-all-in-one-scraper',arguments: { query: 'AAPL', mode: 'xbrl', xbrlConcept: 'Revenues', maxResults: 20 }});
FAQ
Q: Why one actor for nine modes? A: AI agents pick one tool at a time. Bundling related verbs into one well-described tool beats forcing the agent to choose between nine micro-tools. Also: one bill, one charge envelope.
Q: How do I get the full filing text?
A: Set includeFullText: true. The actor downloads the primary document, strips HTML, and returns it under the fullText field (truncated at 50K chars per filing, 200K for single-filing mode).
Q: Does it cover all 150+ SEC form types?
A: Yes — every form in data.sec.gov/submissions/ is fetchable. The formTypes filter accepts any of them.
Q: What about non-US filings? A: SEC EDGAR is US-only. For non-US filings use sources like CNINFO (China), SEDAR+ (Canada), or Companies House (UK). This actor returns an error on non-US queries.
Q: How fresh is the data? A: SEC publishes filings to EDGAR within minutes of acceptance. The actor hits live endpoints — no cache lag.
Q: AI summary — what model?
A: Placeholder for Gemini 2.5 Flash. Set GEMINI_API_KEY env var in your actor settings to populate (rolling out in v0.2).
Q: What if my CIK has fewer than 10 digits? A: Pass it however you have it. The actor zero-pads to 10 digits automatically.
Q: Does it monitor for new filings?
A: Schedule the actor with mode: recent (or company-mode with dateFrom: today) for daily checks. Diff against your previous run client-side.
Limits
- SEC rate limit: 10 req/s. Actor rate-limits itself to 8 req/s to stay safely under.
- Search pagination: SEC EDGAR full-text search caps at 10,000 hits per query. Use date narrowing to dig deeper.
- Submissions endpoint: Returns the 1,000 most recent filings per CIK. Older filings live in separate paginated files (planned for v0.2).
- XBRL coverage: Filings before 2009 mostly lack XBRL. Older companies will return fewer concepts.
Legal
SEC EDGAR is a US Securities and Exchange Commission public dataset. All filings are public-domain disclosures filed by registrants under the Securities Act of 1933 and Exchange Act of 1934. This actor accesses official SEC.gov endpoints with the required User-Agent header per SEC EDGAR fair-access policy. No personal data, no scraping prohibited, no robots.txt violation, no TOS issue. This actor is provided for informational and research purposes only and does not constitute investment advice.
Changelog
0.1 (2026-05-28)
- Initial release
- 9 modes auto-detected from query (company, filing, search, insider, holdings, activist, xbrl, form-d, recent)
- Free official SEC EDGAR API — no proxy, no key
- 8 req/s self-throttle under SEC 10 req/s cap
- ISO 8601 dates, flat per-record schema, <500 tokens/item
- Pay-per-event premium tier ($0.005/result, $0.00005 start, optional $0.03/AI summary)
- Pay-Per-Usage tier enabled in parallel
Related actors
- Google Finance Scraper — stock news + sentiment to layer on top of filings-driven research.
- Stock Price Tracker — real-time multi-asset prices (Yahoo Finance) for tickers you watch in EDGAR.
- Yahoo Finance Scraper — quotes, OHLC, financials, options & earnings to pair with 10-K/10-Q fundamentals.
- Crypto Price Tracker — 17K+ altcoins for digital-asset coverage alongside SEC filings.
- Google Patents Scraper — patents + citations to map IP exposure against issuer 10-Ks.