Investor Presentation Scraper avatar

Investor Presentation Scraper

Pricing

from $3.50 / 1,000 scraped results

Go to Apify Store
Investor Presentation Scraper

Investor Presentation Scraper

Investor Presentation Scraper extracts investor decks, earnings presentations, SEC filings, annual reports, shareholder documents, and financial news from SEC EDGAR, company IR websites, Yahoo Finance, and Google News. Supports PDF discovery, deduplication, and structured dataset export.

Pricing

from $3.50 / 1,000 scraped results

Rating

0.0

(0)

Developer

Data Pilot

Data Pilot

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Categories

Share

๐Ÿ“Š Investor Presentation Scraper

An Apify Actor that aggregates Investor Presentation data, financial filings, and corporate news from multiple authoritative sources โ€” SEC EDGAR, Yahoo Finance, and Google News. Enter a company name and get back structured Investor Presentation records, earnings reports, SEC filings, and related financial news in one unified dataset.

Whether you're conducting due diligence, tracking a competitor's Investor Presentation activity, or building a financial intelligence pipeline, this actor delivers comprehensive data efficiently.


๐Ÿ“‹ Table of Contents


๐Ÿ”ฅ Features

  • โœ… Multi-Source Aggregation โ€” Collects Investor Presentation data from SEC EDGAR, Yahoo Finance, and Google News in one run
  • โœ… SEC EDGAR Integration โ€” Searches 8 filing types (8-K, 10-K, 10-Q, DEF14A, S-1, 424B4, SC13G, SC13D) for Investor Presentation documents
  • โœ… Investor Keyword Search โ€” Targets "investor presentation", "earnings presentation", "annual report", and "roadshow" keywords on EDGAR
  • โœ… Yahoo Finance News โ€” Fetches financial news and press releases per company ticker
  • โœ… Google News RSS โ€” Searches 10 financial keyword combinations per company
  • โœ… Deduplication โ€” Removes duplicate records by URL + title across all sources
  • โœ… Configurable Sources โ€” Enable or disable each data source independently
  • โœ… Rate Limiting โ€” Polite delays between API requests to avoid blocks
  • โœ… Error Handling โ€” Graceful per-source error recovery with detailed logging
  • โœ… Dataset Integration โ€” Pushes all Investor Presentation records to Apify dataset in real time

๐ŸŒ Data Sources

SourceTypeCoverage
SEC EDGAROfficial filings8-K, 10-K, 10-Q, DEF14A, S-1, 424B4, SC13G, SC13D + investor presentations
Yahoo FinanceNews & press releasesCompany ticker news, financial announcements
Google News RSSNews articlesEarnings, roadshows, shareholder updates, IPO news

SEC EDGAR Search Keywords

  • investor presentation
  • earnings presentation
  • annual report
  • roadshow

Google News Search Combinations

  • {company} investor presentation
  • {company} annual report
  • {company} earnings
  • {company} quarterly results
  • {company} financial results
  • {company} investor day
  • {company} shareholder
  • {company} SEC filing
  • {company} IPO
  • {company} roadshow

โš™๏ธ How It Works

  1. Input Parsing โ€” Read company name, max results, and selected sources
  2. SEC EDGAR Search โ€” Query EDGAR full-text search for 8 filing types + investor presentation keywords + company browse
  3. Yahoo Finance Search โ€” Search company name, extract tickers, fetch news per ticker
  4. Google News Search โ€” Query RSS feed for 10 financial keyword combinations
  5. Aggregation โ€” Merge results from all selected sources into one list
  6. Deduplication โ€” Remove duplicates by URL + title combination
  7. Trim to Limit โ€” Apply maxResults cap
  8. Dataset Push โ€” Push all unique Investor Presentation records with timestamps
  9. Completion โ€” Log total records saved

๐Ÿ“ฅ Input

FieldTypeDefaultDescription
companystringRequiredCompany name to search Investor Presentation data for
maxResultsinteger1000Maximum total records to return across all sources
sourcesarray["edgar", "yahoo", "google_news"]Which sources to enable
{
"company": "Tesla",
"maxResults": 1000,
"sources": ["edgar", "yahoo", "google_news"]
}

Example Input โ€” SEC Only

{
"company": "Apple Inc",
"maxResults": 500,
"sources": ["edgar"]
}

Example Input โ€” News Only

{
"company": "Microsoft",
"maxResults": 300,
"sources": ["yahoo", "google_news"]
}

๐Ÿ“ค Output

Each Investor Presentation or filing record contains:

FieldTypeDescription
sourcestringData source (e.g., SEC EDGAR (8-K), Yahoo Finance, Google News)
companystringCompany name
filing_typestringFiling or content type (e.g., 8-K, 10-K, News Article)
titlestringTitle of the filing or Investor Presentation
datestringPublication or filing date
periodstringPeriod of report (SEC filings only)
descriptionstringBrief description or publisher info
urlstringDirect link to the filing or article
cikstringSEC CIK identifier (EDGAR records only)
accessionstringSEC accession number or ticker symbol
scraped_atstringISO 8601 UTC timestamp

Example Output โ€” SEC EDGAR

{
"source": "SEC EDGAR (8-K)",
"company": "Tesla, Inc.",
"filing_type": "8-K",
"title": "8-K โ€” Tesla, Inc.",
"date": "2024-10-23",
"period": "2024-09-30",
"description": "SEC 8-K filing",
"url": "https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=1318605&type=8-K",
"cik": "1318605",
"accession": "0001318605-24-000056",
"scraped_at": "2025-03-22T12:34:56Z"
}

Example Output โ€” Investor Presentation (EDGAR)

{
"source": "SEC EDGAR (Presentation)",
"company": "Tesla, Inc.",
"filing_type": "8-K",
"title": "Investor Presentation โ€” Tesla, Inc.",
"date": "2024-01-15",
"period": "",
"description": "Investor document: investor presentation",
"url": "https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=1318605&type=8-K",
"cik": "1318605",
"accession": "0001318605-24-000012",
"scraped_at": "2025-03-22T12:34:56Z"
}

Example Output โ€” Yahoo Finance

{
"source": "Yahoo Finance (TSLA)",
"company": "Tesla",
"filing_type": "News",
"title": "Tesla Q3 Earnings Beat Expectations",
"date": "2024-10-24",
"period": "",
"description": "Reuters",
"url": "https://finance.yahoo.com/news/...",
"cik": "",
"accession": "TSLA",
"scraped_at": "2025-03-22T12:34:56Z"
}

Example Output โ€” Google News

{
"source": "Google News",
"company": "Tesla",
"filing_type": "News Article",
"title": "Tesla Investor Day 2024: Key Takeaways",
"date": "2024-03-01 09",
"period": "",
"description": "Source: Bloomberg | Query: Tesla investor presentation",
"url": "https://news.google.com/...",
"cik": "",
"accession": "",
"scraped_at": "2025-03-22T12:34:56Z"
}

๐ŸŽฏ Use Cases

  • ๐Ÿ’ผ Due Diligence โ€” Gather all Investor Presentation filings and news before investment decisions
  • ๐Ÿ“ˆ Competitive Intelligence โ€” Track competitor Investor Presentation activity and earnings announcements
  • ๐Ÿฆ Financial Research โ€” Aggregate SEC filings and Investor Presentation documents for analysis
  • ๐Ÿค– Financial Data Pipelines โ€” Feed structured Investor Presentation data into BI tools or AI models
  • ๐Ÿ“ฐ News Monitoring โ€” Track company-specific financial news across multiple sources
  • ๐Ÿงพ Regulatory Research โ€” Collect SEC filings for compliance and regulatory analysis
  • ๐Ÿ“Š Market Intelligence โ€” Monitor roadshow activity, IPOs, and shareholder communications

๐Ÿš€ Quick Start

  1. Open on Apify โ€” Visit the actor page and click Try for free
  2. Set Company Name โ€” Enter the company name in the company field
  3. Choose Sources โ€” Select which sources to include (edgar, yahoo, google_news)
  4. Set Limit โ€” Configure maxResults based on how much data you need
  5. Run the Actor โ€” Click Start and monitor progress in the logs
  6. Download Results โ€” Export the Investor Presentation dataset as JSON, CSV, or Excel

Sample Log Output

Starting scrape for: 'Tesla' | limit=1000
[SEC EDGAR] Searching: Tesla
EDGAR total: 412
[Yahoo Finance] Searching: Tesla
Yahoo total: 280
[Google News] Searching: Tesla
Google News total: 198
Pushing 890 unique items to dataset...
Done!

๐Ÿงฐ Technical Stack

ComponentTechnology
HTTP Requestsrequests library
HTML Parsing(lxml) + XML parser
SEC EDGARFull-text search API + Browse API
Yahoo FinanceQuery API v1 (/v1/finance/search)
Google NewsRSS feed (news.google.com/rss)
Asyncasyncio
Rate Limiting0.3โ€“0.5s delays between requests
PlatformApify Actor (serverless, scalable)

๐Ÿ“ฆ Changelog

v1.0.0 โ€” Initial Release

  • SEC EDGAR integration: 8 filing types (8-K, 10-K, 10-Q, DEF14A, S-1, 424B4, SC13G, SC13D)
  • EDGAR investor presentation keyword search (4 keywords)
  • EDGAR company browse scraping
  • Yahoo Finance news search with multi-ticker support
  • Google News RSS search with 10 financial keyword combinations
  • Multi-source aggregation into unified Investor Presentation dataset
  • Deduplication by URL + title
  • Configurable source selection (edgar, yahoo, google_news)
  • Configurable result limit (up to 1,000+)
  • ISO 8601 timestamp on all records
  • Rate limiting (0.3โ€“0.5s delays)
  • Per-source error handling and logging

๐Ÿง‘โ€๐Ÿ’ป Support & Feedback

  • Issues & Ideas โ€” Open a ticket on the Apify Actor issue tracker
  • Documentation โ€” Visit Apify Docs for platform guides
  • Tips โ€” Use the exact registered company name for best EDGAR results
  • SEC Search โ€” For private companies, use google_news and yahoo sources only

โš ๏ธ Disclaimer: This actor collects publicly available Investor Presentation data and financial filings. Please ensure your usage complies with the terms of service of each data source. Intended for research and informational purposes only.