SEC EDGAR Scraper — 10-K, 8-K, Form 4 & 13F avatar

SEC EDGAR Scraper — 10-K, 8-K, Form 4 & 13F

Pricing

from $5.00 / 1,000 results

Go to Apify Store
SEC EDGAR Scraper — 10-K, 8-K, Form 4 & 13F

SEC EDGAR Scraper — 10-K, 8-K, Form 4 & 13F

Scrape SEC EDGAR — 10-K, 10-Q, 8-K, Form 4 insiders, 13F holdings, 13D/G stakes, DEF 14A, Form D, XBRL facts, full-text search. 9 modes auto-detected from ticker/CIK/accession/URL/keyword. MCP-ready, free official SEC API. $0.005/result.

Pricing

from $5.00 / 1,000 results

Rating

0.0

(0)

Developer

Khadin Akbar

Khadin Akbar

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

1

Monthly active users

a day ago

Last modified

Share

SEC EDGAR Scraper bundles nine modes in one actor and auto-detects ticker, CIK, accession, URL, or keyword. Built for AI agents, quant funds, equity analysts, and compliance teams who need every SEC EDGAR endpoint — 10-K, 10-Q, 8-K, Form 4 insider trades, 13F holdings, 13D/G activism, DEF 14A proxies, Form D and XBRL facts — without juggling nine separate actors.

What it does

Pass anything — a ticker (AAPL), a 10-digit CIK (0000320193), an accession number (0000320193-25-000123), a filing URL, or a free-text keyword — and the actor routes to the right SEC endpoint automatically. Or override with mode.

ModeUse it forExample input
autoDefault. Guesses from query shape.AAPL
companyFilings list for a companyMSFT, 0000789019
filingSingle filing details (+ optional full text)0000320193-25-000123
searchFull-text search across every filingclimate risk
insiderForm 3/4/5 insider tradesTSLA
holdings13F institutional positions0001067983 (Berkshire)
activist13D/G beneficial ownership stakesGME
xbrlXBRL financial facts (revenue, assets, EPS)AAPL + xbrlConcept: Revenues
form-dForm D Reg D startup funding filingsAI startup
recentLatest global EDGAR feed(leave query empty)

Why this actor

  • One actor instead of nine. Most SEC scrapers do filings OR insider trades OR XBRL OR search — never all four. This one does ten.
  • MCP-ready. Designed for AI agents — Claude/GPT/Gemini call it as one tool, get structured JSON back, pay per result.
  • Free SEC API. No proxy, no anti-bot, no API key. Just the official rate-limited public endpoints.
  • Flat, agent-friendly schema. Every record has the same top-level keys regardless of mode (with mode-specific fields explicit on each row).
  • Auto-detect input. Pass a ticker, a CIK, a URL, an accession, or a keyword — the actor figures it out.

Pricing — Premium PPE + Pay-Per-Usage

Both monetization models enabled. Pick at run time.

EventPriceWhen charged
apify-actor-start$0.00005Once per GB of memory at run start
result$0.005Per record written to dataset
ai-summary$0.03Per AI-generated filing summary (only if enableAiSummary=true)

Typical run cost (50 filings, no AI): ~$0.25. Heavy run cost (1,000 results, no AI): ~$5.00.

Pay-Per-Usage is also enabled for massive workloads where compute-pricing beats per-record. Apify default compute + proxy passthrough rates apply.

Output fields (dataset schema)

Every record has _mode and the standard filing keys; mode-specific keys appear where applicable.

{
"_mode": "company",
"cik": "0000320193",
"ticker": "AAPL",
"companyName": "Apple Inc.",
"accessionNumber": "0000320193-25-000123",
"formType": "10-K",
"filedAt": "2025-11-01",
"acceptedAt": "2025-11-01T18:04:32.000Z",
"periodOfReport": "2025-09-28",
"filingUrl": "https://www.sec.gov/Archives/edgar/data/320193/000032019325000123/0000320193-25-000123-index.htm",
"primaryDocUrl": "https://www.sec.gov/Archives/edgar/data/320193/000032019325000123/aapl-20250928.htm",
"items": null,
"fileNumber": "001-36743",
"filmNumber": "251404123",
"size": 13845678,
"isXBRL": true
}

Mode-specific fields:

  • search: snippet, score
  • insider: insiderName, insiderTitle, transactionDate, transactionCode, shares, pricePerShare, totalValue, sharesOwnedAfter
  • holdings: institutionName, institutionCik, issuerName, issuerCusip, sharesHeld, valueHeld
  • activist: ownershipPercent
  • xbrl: concept, value, unit, fy, fp, startDate, endDate
  • form-d: offeringAmount, amountSold, industry, executiveOfficers

Input

FieldTypeDefaultNotes
querystringAAPLTicker, CIK, accession, URL, or keyword (REQUIRED)
modeenumautoOverride auto-detection (10 values)
formTypesstring[][]Filter to form types e.g. ["10-K", "10-Q"]
dateFromstring""Filing date lower bound YYYY-MM-DD
dateTostring""Filing date upper bound YYYY-MM-DD
maxResultsint100Cap total dataset items (1-10000)
includeFullTextboolfalseFetch + strip HTML to plain text per filing
enableAiSummaryboolfalseAI summary per filing (+$0.03 each)
xbrlConceptstring""XBRL concept tag (e.g. Revenues) — xbrl mode only

Examples

Get all Apple 10-K and 10-Q since 2024

{
"query": "AAPL",
"mode": "company",
"formTypes": ["10-K", "10-Q"],
"dateFrom": "2024-01-01",
"maxResults": 50
}

Full-text search every filing for "climate change risk"

{
"query": "climate change risk",
"mode": "search",
"formTypes": ["10-K"],
"maxResults": 100
}

Berkshire Hathaway 13F holdings

{
"query": "0001067983",
"mode": "holdings",
"maxResults": 200
}

Apple revenue across all reported periods (XBRL)

{
"query": "AAPL",
"mode": "xbrl",
"xbrlConcept": "Revenues",
"maxResults": 100
}

Tesla insider trades this year

{
"query": "TSLA",
"mode": "insider",
"dateFrom": "2026-01-01"
}

Latest 100 SEC filings (any company)

{
"query": "",
"mode": "recent",
"maxResults": 100
}

Use cases

  • AI agents — Drop into Claude/GPT/Gemini as one MCP tool covering every SEC endpoint.
  • Quant funds — Pull XBRL company facts across the russell 3000 for backtests.
  • Equity analysts — Monitor 8-K filings for material events; track Form 4 insider clusters.
  • Compliance — Watch DEF 14A say-on-pay results; audit beneficial-ownership disclosures.
  • VCs + sales — Mine Form D filings for fresh startup-funding leads.
  • Journalists — Full-text search disclosures for ESG, litigation, executive comp signals.
  • Backtesters — Survivorship-bias correction via delisted Form 25 / Form 15 (use recent + form filter).
  • LLM training — Bulk download structured filings for fine-tuning finance models.

How it works

  1. Reads input. If mode=auto, detects shape with regex:
    • URL → filing
    • 18-digit or XXXXXXXXXX-XX-XXXXXXfiling
    • 1-10 digit number → company (treated as CIK)
    • Alphanumeric ≤ 10 chars → company (treated as ticker)
    • Anything else → search
  2. Resolves ticker → CIK via the official company_tickers.json lookup (cached).
  3. Hits the right SEC endpoint:
    • data.sec.gov/submissions/CIK{10-digit}.json — company filings
    • data.sec.gov/api/xbrl/companyfacts/CIK{10-digit}.json — XBRL facts
    • data.sec.gov/api/xbrl/companyconcept/CIK{10-digit}/us-gaap/{concept}.json — single concept
    • efts.sec.gov/LATEST/search-index?q=... — full-text search
    • www.sec.gov/cgi-bin/browse-edgar?action=getcurrent — recent feed
    • www.sec.gov/Archives/edgar/data/{cik}/{accession}/index.json — single filing
  4. Rate-limits to 8 req/s (under the SEC 10 req/s fair-access cap).
  5. Sends required User-Agent: khadinakbar khadinakbaronline@gmail.com header on every request.
  6. Pushes flat records to the dataset, charges per result.

MCP integration

Expose via Apify MCP (mcp.apify.com) as tool apify--sec-edgar-all-in-one-scraper. Agents discovery-shop on the description; the tool description is tuned to the 5-part formula (verb + object + domain → when to use → when NOT to use → return shape → pricing).

// Claude Code example
const result = await client.callTool({
name: 'apify--sec-edgar-all-in-one-scraper',
arguments: { query: 'AAPL', mode: 'xbrl', xbrlConcept: 'Revenues', maxResults: 20 }
});

FAQ

Q: Why one actor for nine modes? A: AI agents pick one tool at a time. Bundling related verbs into one well-described tool beats forcing the agent to choose between nine micro-tools. Also: one bill, one charge envelope.

Q: How do I get the full filing text? A: Set includeFullText: true. The actor downloads the primary document, strips HTML, and returns it under the fullText field (truncated at 50K chars per filing, 200K for single-filing mode).

Q: Does it cover all 150+ SEC form types? A: Yes — every form in data.sec.gov/submissions/ is fetchable. The formTypes filter accepts any of them.

Q: What about non-US filings? A: SEC EDGAR is US-only. For non-US filings use sources like CNINFO (China), SEDAR+ (Canada), or Companies House (UK). This actor returns an error on non-US queries.

Q: How fresh is the data? A: SEC publishes filings to EDGAR within minutes of acceptance. The actor hits live endpoints — no cache lag.

Q: AI summary — what model? A: Placeholder for Gemini 2.5 Flash. Set GEMINI_API_KEY env var in your actor settings to populate (rolling out in v0.2).

Q: What if my CIK has fewer than 10 digits? A: Pass it however you have it. The actor zero-pads to 10 digits automatically.

Q: Does it monitor for new filings? A: Schedule the actor with mode: recent (or company-mode with dateFrom: today) for daily checks. Diff against your previous run client-side.

Limits

  • SEC rate limit: 10 req/s. Actor rate-limits itself to 8 req/s to stay safely under.
  • Search pagination: SEC EDGAR full-text search caps at 10,000 hits per query. Use date narrowing to dig deeper.
  • Submissions endpoint: Returns the 1,000 most recent filings per CIK. Older filings live in separate paginated files (planned for v0.2).
  • XBRL coverage: Filings before 2009 mostly lack XBRL. Older companies will return fewer concepts.

SEC EDGAR is a US Securities and Exchange Commission public dataset. All filings are public-domain disclosures filed by registrants under the Securities Act of 1933 and Exchange Act of 1934. This actor accesses official SEC.gov endpoints with the required User-Agent header per SEC EDGAR fair-access policy. No personal data, no scraping prohibited, no robots.txt violation, no TOS issue. This actor is provided for informational and research purposes only and does not constitute investment advice.

Changelog

0.1 (2026-05-28)

  • Initial release
  • 9 modes auto-detected from query (company, filing, search, insider, holdings, activist, xbrl, form-d, recent)
  • Free official SEC EDGAR API — no proxy, no key
  • 8 req/s self-throttle under SEC 10 req/s cap
  • ISO 8601 dates, flat per-record schema, <500 tokens/item
  • Pay-per-event premium tier ($0.005/result, $0.00005 start, optional $0.03/AI summary)
  • Pay-Per-Usage tier enabled in parallel