Investor Presentation Scraper
Pricing
from $3.50 / 1,000 scraped results
Investor Presentation Scraper
Investor Presentation Scraper extracts investor decks, earnings presentations, SEC filings, annual reports, shareholder documents, and financial news from SEC EDGAR, company IR websites, Yahoo Finance, and Google News. Supports PDF discovery, deduplication, and structured dataset export.
Pricing
from $3.50 / 1,000 scraped results
Rating
0.0
(0)
Developer
Data Pilot
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
3 days ago
Last modified
Share
๐ Investor Presentation Scraper
An Apify Actor that aggregates Investor Presentation data, financial filings, and corporate news from multiple authoritative sources โ SEC EDGAR, Yahoo Finance, and Google News. Enter a company name and get back structured Investor Presentation records, earnings reports, SEC filings, and related financial news in one unified dataset.
Whether you're conducting due diligence, tracking a competitor's Investor Presentation activity, or building a financial intelligence pipeline, this actor delivers comprehensive data efficiently.
๐ Table of Contents
- Features
- Data Sources
- How It Works
- Input
- Output
- Use Cases
- Quick Start
- Technical Stack
- Changelog
- Support
๐ฅ Features
- โ Multi-Source Aggregation โ Collects Investor Presentation data from SEC EDGAR, Yahoo Finance, and Google News in one run
- โ SEC EDGAR Integration โ Searches 8 filing types (8-K, 10-K, 10-Q, DEF14A, S-1, 424B4, SC13G, SC13D) for Investor Presentation documents
- โ Investor Keyword Search โ Targets "investor presentation", "earnings presentation", "annual report", and "roadshow" keywords on EDGAR
- โ Yahoo Finance News โ Fetches financial news and press releases per company ticker
- โ Google News RSS โ Searches 10 financial keyword combinations per company
- โ Deduplication โ Removes duplicate records by URL + title across all sources
- โ Configurable Sources โ Enable or disable each data source independently
- โ Rate Limiting โ Polite delays between API requests to avoid blocks
- โ Error Handling โ Graceful per-source error recovery with detailed logging
- โ Dataset Integration โ Pushes all Investor Presentation records to Apify dataset in real time
๐ Data Sources
| Source | Type | Coverage |
|---|---|---|
| SEC EDGAR | Official filings | 8-K, 10-K, 10-Q, DEF14A, S-1, 424B4, SC13G, SC13D + investor presentations |
| Yahoo Finance | News & press releases | Company ticker news, financial announcements |
| Google News RSS | News articles | Earnings, roadshows, shareholder updates, IPO news |
SEC EDGAR Search Keywords
investor presentationearnings presentationannual reportroadshow
Google News Search Combinations
{company} investor presentation{company} annual report{company} earnings{company} quarterly results{company} financial results{company} investor day{company} shareholder{company} SEC filing{company} IPO{company} roadshow
โ๏ธ How It Works
- Input Parsing โ Read company name, max results, and selected sources
- SEC EDGAR Search โ Query EDGAR full-text search for 8 filing types + investor presentation keywords + company browse
- Yahoo Finance Search โ Search company name, extract tickers, fetch news per ticker
- Google News Search โ Query RSS feed for 10 financial keyword combinations
- Aggregation โ Merge results from all selected sources into one list
- Deduplication โ Remove duplicates by URL + title combination
- Trim to Limit โ Apply
maxResultscap - Dataset Push โ Push all unique Investor Presentation records with timestamps
- Completion โ Log total records saved
๐ฅ Input
| Field | Type | Default | Description |
|---|---|---|---|
company | string | Required | Company name to search Investor Presentation data for |
maxResults | integer | 1000 | Maximum total records to return across all sources |
sources | array | ["edgar", "yahoo", "google_news"] | Which sources to enable |
Example Input โ Full Search
{"company": "Tesla","maxResults": 1000,"sources": ["edgar", "yahoo", "google_news"]}
Example Input โ SEC Only
{"company": "Apple Inc","maxResults": 500,"sources": ["edgar"]}
Example Input โ News Only
{"company": "Microsoft","maxResults": 300,"sources": ["yahoo", "google_news"]}
๐ค Output
Each Investor Presentation or filing record contains:
| Field | Type | Description |
|---|---|---|
source | string | Data source (e.g., SEC EDGAR (8-K), Yahoo Finance, Google News) |
company | string | Company name |
filing_type | string | Filing or content type (e.g., 8-K, 10-K, News Article) |
title | string | Title of the filing or Investor Presentation |
date | string | Publication or filing date |
period | string | Period of report (SEC filings only) |
description | string | Brief description or publisher info |
url | string | Direct link to the filing or article |
cik | string | SEC CIK identifier (EDGAR records only) |
accession | string | SEC accession number or ticker symbol |
scraped_at | string | ISO 8601 UTC timestamp |
Example Output โ SEC EDGAR
{"source": "SEC EDGAR (8-K)","company": "Tesla, Inc.","filing_type": "8-K","title": "8-K โ Tesla, Inc.","date": "2024-10-23","period": "2024-09-30","description": "SEC 8-K filing","url": "https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=1318605&type=8-K","cik": "1318605","accession": "0001318605-24-000056","scraped_at": "2025-03-22T12:34:56Z"}
Example Output โ Investor Presentation (EDGAR)
{"source": "SEC EDGAR (Presentation)","company": "Tesla, Inc.","filing_type": "8-K","title": "Investor Presentation โ Tesla, Inc.","date": "2024-01-15","period": "","description": "Investor document: investor presentation","url": "https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=1318605&type=8-K","cik": "1318605","accession": "0001318605-24-000012","scraped_at": "2025-03-22T12:34:56Z"}
Example Output โ Yahoo Finance
{"source": "Yahoo Finance (TSLA)","company": "Tesla","filing_type": "News","title": "Tesla Q3 Earnings Beat Expectations","date": "2024-10-24","period": "","description": "Reuters","url": "https://finance.yahoo.com/news/...","cik": "","accession": "TSLA","scraped_at": "2025-03-22T12:34:56Z"}
Example Output โ Google News
{"source": "Google News","company": "Tesla","filing_type": "News Article","title": "Tesla Investor Day 2024: Key Takeaways","date": "2024-03-01 09","period": "","description": "Source: Bloomberg | Query: Tesla investor presentation","url": "https://news.google.com/...","cik": "","accession": "","scraped_at": "2025-03-22T12:34:56Z"}
๐ฏ Use Cases
- ๐ผ Due Diligence โ Gather all Investor Presentation filings and news before investment decisions
- ๐ Competitive Intelligence โ Track competitor Investor Presentation activity and earnings announcements
- ๐ฆ Financial Research โ Aggregate SEC filings and Investor Presentation documents for analysis
- ๐ค Financial Data Pipelines โ Feed structured Investor Presentation data into BI tools or AI models
- ๐ฐ News Monitoring โ Track company-specific financial news across multiple sources
- ๐งพ Regulatory Research โ Collect SEC filings for compliance and regulatory analysis
- ๐ Market Intelligence โ Monitor roadshow activity, IPOs, and shareholder communications
๐ Quick Start
- Open on Apify โ Visit the actor page and click Try for free
- Set Company Name โ Enter the company name in the
companyfield - Choose Sources โ Select which sources to include (
edgar,yahoo,google_news) - Set Limit โ Configure
maxResultsbased on how much data you need - Run the Actor โ Click Start and monitor progress in the logs
- Download Results โ Export the Investor Presentation dataset as JSON, CSV, or Excel
Sample Log Output
Starting scrape for: 'Tesla' | limit=1000[SEC EDGAR] Searching: TeslaEDGAR total: 412[Yahoo Finance] Searching: TeslaYahoo total: 280[Google News] Searching: TeslaGoogle News total: 198Pushing 890 unique items to dataset...Done!
๐งฐ Technical Stack
| Component | Technology |
|---|---|
| HTTP Requests | requests library |
| HTML Parsing | (lxml) + XML parser |
| SEC EDGAR | Full-text search API + Browse API |
| Yahoo Finance | Query API v1 (/v1/finance/search) |
| Google News | RSS feed (news.google.com/rss) |
| Async | asyncio |
| Rate Limiting | 0.3โ0.5s delays between requests |
| Platform | Apify Actor (serverless, scalable) |
๐ฆ Changelog
v1.0.0 โ Initial Release
- SEC EDGAR integration: 8 filing types (8-K, 10-K, 10-Q, DEF14A, S-1, 424B4, SC13G, SC13D)
- EDGAR investor presentation keyword search (4 keywords)
- EDGAR company browse scraping
- Yahoo Finance news search with multi-ticker support
- Google News RSS search with 10 financial keyword combinations
- Multi-source aggregation into unified Investor Presentation dataset
- Deduplication by URL + title
- Configurable source selection (
edgar,yahoo,google_news) - Configurable result limit (up to 1,000+)
- ISO 8601 timestamp on all records
- Rate limiting (0.3โ0.5s delays)
- Per-source error handling and logging
๐งโ๐ป Support & Feedback
- Issues & Ideas โ Open a ticket on the Apify Actor issue tracker
- Documentation โ Visit Apify Docs for platform guides
- Tips โ Use the exact registered company name for best EDGAR results
- SEC Search โ For private companies, use
google_newsandyahoosources only
โ ๏ธ Disclaimer: This actor collects publicly available Investor Presentation data and financial filings. Please ensure your usage complies with the terms of service of each data source. Intended for research and informational purposes only.