SEC EDGAR Data Scraper avatar

SEC EDGAR Data Scraper

Pricing

from $1.00 / 1,000 results

Go to Apify Store
SEC EDGAR Data Scraper

SEC EDGAR Data Scraper

High-speed, browserless extraction of SEC EDGAR filings (10-K, 10-Q, 8-K, Form 4) by ticker symbol. Get structured company data, document manifests, and historical records in seconds without the overhead of a headless browser.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

Inus Grobler

Inus Grobler

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

SEC EDGAR filings scraper for Apify

Extract structured SEC EDGAR filing data for one or more stock tickers using direct SEC JSON and archive endpoints. This actor resolves tickers to CIKs, expands historical filing archives when needed, filters filings by date and filing focus, and writes normalized filing records to the default dataset.

It is designed for analysts, investors, quants, compliance teams, and data engineers who need SEC filing data for forms such as 10-K, 10-Q, 8-K, DEF 14A, DEFA14A, S-1, Form 4, and related filings.

Why use this actor

  • Uses direct SEC endpoints instead of a browser, which keeps runs simpler and more stable.
  • Resolves tickers such as AAPL, MSFT, and BRK-B to official SEC CIK identifiers automatically.
  • Expands paginated submission history when a company has older filing archive pages.
  • Supports simple filters for filing focus, filing categories, explicit form types, and filing dates.
  • Offers two enrichment modes:
    • filing-detail for filing metadata and document manifests
    • full for parsed SEC complete submission text when text extraction is available

Quick start

Run the actor with only a ticker list for a fast default scrape:

{
"tickers": ["AAPL", "MSFT"]
}

Default behavior:

  • filingFocus: investor
  • enrichmentMode: filing-detail
  • dateFrom / dateTo: the most recent December window
  • maxFilingsPerTicker: not capped unless you provide a value

For a broader, slower run with parsed submission text:

{
"tickers": ["AAPL", "MSFT", "NVDA"],
"filingFocus": "investor_plus_governance",
"dateFrom": "2025-01-01",
"dateTo": "2025-12-01",
"enrichmentMode": "full"
}

Input reference

tickers

  • Required.
  • Array of stock tickers.
  • Tickers are normalized to uppercase before lookup.

filingFocus

  • Optional high-level filing filter.
  • Supported values:
    • investor
    • investor_plus_governance
    • company_filings
    • ownership
    • all

dateFrom and dateTo

  • Optional filing date range in YYYY-MM-DD format.
  • When omitted, the actor defaults to the most recent December date range for a quick maintenance-style scrape.

enrichmentMode

  • filing-detail Returns filing metadata, filing detail URLs, primary document URLs, and normalized document manifests.
  • full Also fetches and parses the SEC complete submission text file when the filing content is text-based and extractable.

maxFilingsPerTicker

  • Optional positive integer.
  • Leave empty to return all matching filings for each ticker.

formTypes

  • Optional advanced override for exact SEC form types such as 10-K, 10-Q, 8-K, or DEF 14A.
  • When provided, this overrides the high-level focus selection.

filingCategories

  • Optional advanced override for normalized categories:
    • financial_reports
    • company_updates
    • governance
    • capital_markets
    • ownership
    • other

Output

Each dataset item represents one matched filing. Output fields include:

  • Ticker, company name, CIK, form type, filing category, filing date, report date
  • Filing detail URL, primary document URL, filing header URL
  • Normalized list of filing documents and data files
  • Submission text status and error details
  • In full mode, structured submissionText data and extracted primary-document text when available

Sample dataset item:

{
"ticker": "AAPL",
"companyName": "Apple Inc.",
"cik": "0000320193",
"formType": "10-K",
"filingCategory": "financial_reports",
"filingDate": "2025-10-31",
"reportDate": "2025-09-27",
"acceptedAt": "2025-10-31 06:01:26",
"periodOfReport": "2025-09-27",
"accessionNumber": "0000320193-25-000079",
"filingDetailUrl": "https://www.sec.gov/Archives/edgar/data/320193/000032019325000079/0000320193-25-000079-index.html",
"primaryDocumentUrl": "https://www.sec.gov/Archives/edgar/data/320193/000032019325000079/aapl-20250927.htm",
"filingHeaderUrl": "https://www.sec.gov/Archives/edgar/data/320193/000032019325000079/0000320193-25-000079.hdr.sgml",
"documentCount": 16,
"dataFileCount": 8,
"submissionTextStatus": "parsed",
"submissionTextTextTruncated": true
}

Dataset items can be exported from Apify in JSON, CSV, Excel, XML, and other supported formats.

Pricing model

This actor uses Apify pay-per-event pricing.

The exact prices should be taken from the Apify Pricing tab because pricing may change over time. The charging model is:

  • ticker-search Charged once per successfully resolved ticker in filing-detail mode.
  • ticker-search-premium Charged once per successfully resolved ticker in full mode.
  • apify-default-dataset-item Charged once per dataset item written to the default dataset.
  • apify-actor-start Synthetic Apify start event handled by the platform.

Important charging notes:

  • Invalid or unresolved tickers are not charged as ticker search events.
  • Internal pagination and retry requests are not charged as separate search events.
  • Result charges are tied to items actually written to the default dataset.

Reliability and SEC handling

  • Uses a conservative global throttle for direct SEC requests.
  • Retries common transient SEC and upstream failures such as 403, 429, and 503.
  • Sends a proper SEC User-Agent header.
  • Avoids proxy-specific behavior and works through direct SEC access on Apify.

Known limitations

  • Some filings, especially annual reports in PDF form such as ARS, may be returned with submissionTextStatus: "not_extractable" in full mode. The filing metadata and document URLs are still included.
  • full mode is materially slower and more expensive than filing-detail because it fetches and parses complete submission text files.
  • Very broad date ranges across many tickers can produce large datasets and longer runtimes.

Best use cases

  • SEC EDGAR filing search by ticker
  • Historical 10-K, 10-Q, and 8-K extraction
  • Proxy statement and governance filing collection
  • Ownership and insider filing monitoring
  • Financial research datasets for backtesting and analysis