SEC Investor Alerts Scraper
Pricing
Pay per event
SEC Investor Alerts Scraper
Scrape SEC Investor.gov alerts and bulletins with dates, summaries, categories, full text, and source URLs for compliance monitoring.
Pricing
Pay per event
Rating
0.0
(0)
Developer
Stas Persiianenko
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
4 days ago
Last modified
Categories
Share
Scrape SEC Investor.gov investor alerts and bulletins into a clean dataset for compliance monitoring, fraud-risk research, investor education feeds, and internal knowledge bases.
The actor crawls the public Investor.gov Alerts & Bulletins pages, follows pagination, optionally opens each detail page, and returns structured records with dates, summaries, categories, full body text, source URLs, and scrape metadata.
What does SEC Investor Alerts Scraper do?
SEC Investor Alerts Scraper turns Investor.gov alerts and bulletins into exportable data.
It can collect:
- 🛡️ SEC investor alerts
- 📌 SEC investor bulletins
- 📅 publication dates
- 🔗 canonical source URLs
- 📝 summaries from the listing page
- 📄 full article body text from detail pages
- 🏷️ categories and tags when available
- ⏱️ scrape timestamps for monitoring workflows
Who is it for?
This scraper is designed for teams that need a repeatable SEC investor-protection feed.
- Compliance teams monitoring new fraud and investor-protection notices
- Fraud-risk analysts tracking scam themes and enforcement warnings
- Investor-relations teams watching official SEC investor education updates
- Fintech content teams feeding alerts into newsletters or apps
- Legal researchers building archives of investor guidance
- Knowledge-base teams indexing public SEC investor education content
Why use this actor?
Investor.gov is public, but the Alerts & Bulletins pages are formatted for reading, not for automated workflows.
This actor gives you:
- A structured dataset instead of manual copy and paste
- Detail-page body text for search and RAG workflows
- Consistent dates and source URLs for audit trails
- A low-cost HTTP crawler without browser overhead
- Inputs that work for scheduled monitoring or one-off exports
Data extracted
| Field | Description |
|---|---|
title | Alert or bulletin headline |
type | Investor Alert, Investor Bulletin, or Other |
publishedDate | Normalized ISO date when parseable |
publishedDateText | Raw date text shown by Investor.gov |
sourceUrl | Absolute source URL |
summary | Listing summary or short detail fallback |
bodyText | Full article text when detail extraction is enabled |
category | Source category when present |
tags | Category and breadcrumb-style labels |
scrapedAt | Timestamp when the record was saved |
How much does it cost to scrape SEC investor alerts?
This actor uses pay-per-event pricing.
- A small start fee is charged once per run.
- A per-item event is charged for each alert or bulletin saved.
- Higher Apify usage tiers receive lower per-item prices.
You can control spend with maxItems. For a quick test, use 10-25 items. For a monitoring export, use 100+ items or let the actor crawl the available pages.
How to use SEC Investor Alerts Scraper
- Open the actor on Apify.
- Keep the default Investor.gov Alerts & Bulletins URL or add specific detail URLs.
- Set
maxItemsto the number of records you need. - Keep
includeBodyenabled if you need full article text. - Run the actor.
- Export results as JSON, CSV, Excel, XML, RSS, or through the Apify API.
Input options
Start URLs
Use the main listing URL for normal monitoring:
https://www.investor.gov/introduction-investing/general-resources/news-alerts/alerts-bulletins
You can also provide detail URLs when you only need specific alerts or bulletins.
Maximum alerts/bulletins
maxItems limits the number of dataset rows. This is the main cost-control input.
Alert types
Use types to limit output to Investor Alerts, Investor Bulletins, or Other pages.
Include full body text
When includeBody is true, the actor opens detail pages and extracts article text. Disable it for a faster metadata-only run.
Request delay
Investor.gov pages are static and public, so no delay is usually needed. Add a delay if you are running very large scheduled jobs.
Example input
{"startUrls": [{ "url": "https://www.investor.gov/introduction-investing/general-resources/news-alerts/alerts-bulletins" }],"maxItems": 100,"includeBody": true,"maxBodyChars": 20000,"requestDelayMs": 0}
Example output
{"title": "Updated Investor Alert: Identity Theft, Data Breaches and Your Investment Accounts","type": "Investor Alert","publishedDate": "2026-04-23","publishedDateText": "April 23, 2026","sourceUrl": "https://www.investor.gov/introduction-investing/general-resources/news-alerts/alerts-bulletins/investor-alerts/investor-59","summary": "In accordance with the President’s Executive Order...","bodyText": "In accordance with the President’s Executive Order...","category": "Enforcement","tags": ["Enforcement"],"scrapedAt": "2026-06-30T00:00:00.000Z"}
Tips for better results
- Use the default listing URL to collect the newest alerts and bulletins.
- Use detail URLs for a small curated list.
- Keep
includeBodyenabled for compliance archives and search indexes. - Lower
maxItemsfor smoke tests. - Schedule the actor daily or weekly to monitor changes.
Integrations
Use the output with:
- Google Sheets for compliance review queues
- Slack or email alerts for new Investor.gov warnings
- Notion, Airtable, or databases for internal research libraries
- Vector databases for fraud-risk retrieval workflows
- BI dashboards that track investor-protection themes over time
- RSS or webhook automations for internal monitoring feeds
API usage with Node.js
import { ApifyClient } from 'apify-client';const client = new ApifyClient({ token: process.env.APIFY_TOKEN });const run = await client.actor('automation-lab/sec-investor-alerts-scraper').call({startUrls: [{ url: 'https://www.investor.gov/introduction-investing/general-resources/news-alerts/alerts-bulletins' }],maxItems: 100,includeBody: true});const { items } = await client.dataset(run.defaultDatasetId).listItems();console.log(items);
API usage with Python
from apify_client import ApifyClientclient = ApifyClient('MY-APIFY-TOKEN')run = client.actor('automation-lab/sec-investor-alerts-scraper').call(run_input={'startUrls': [{'url': 'https://www.investor.gov/introduction-investing/general-resources/news-alerts/alerts-bulletins'}],'maxItems': 100,'includeBody': True,})items = client.dataset(run['defaultDatasetId']).list_items().itemsprint(items)
API usage with cURL
curl -X POST "https://api.apify.com/v2/acts/automation-lab~sec-investor-alerts-scraper/runs?token=$APIFY_TOKEN" \-H 'Content-Type: application/json' \-d '{"maxItems":100,"includeBody":true}'
MCP usage
Connect the actor to Claude Desktop, Claude Code, or other MCP clients through Apify MCP Server:
https://mcp.apify.com/?tools=automation-lab/sec-investor-alerts-scraper
Claude Code CLI setup:
$claude mcp add apify-sec-investor-alerts "https://mcp.apify.com/?tools=automation-lab/sec-investor-alerts-scraper"
Claude Desktop JSON config:
{"mcpServers": {"apify-sec-investor-alerts": {"url": "https://mcp.apify.com/?tools=automation-lab/sec-investor-alerts-scraper"}}}
Example prompts:
- "Run the SEC investor alerts scraper and summarize new scam warnings."
- "Find the latest Investor.gov bulletins about identity theft."
- "Export SEC investor alerts into a compliance monitoring table."
Scheduling and monitoring
The actor is useful as a scheduled monitor.
Recommended schedules:
- Daily for fraud-risk monitoring teams
- Weekly for compliance knowledge bases
- Monthly for investor education archives
Store previous results and compare sourceUrl values to detect newly published alerts.
Legality
This actor collects public Investor.gov pages. Respect Apify platform rules, Investor.gov terms, and applicable laws. Do not use the data for deceptive activity. Keep source attribution when redistributing public guidance.
FAQ
Can I scrape only SEC Investor Alerts?
Yes. Set types to ["Investor Alert"] to filter the dataset to alert records only.
Can I use the scraper for recurring monitoring?
Yes. Schedule the actor and compare sourceUrl values between runs to detect newly published alerts or bulletins.
Troubleshooting
Why did I receive fewer records than expected?
Check maxItems, types, and whether your start URL is a listing page or a detail page. The source may also have a finite number of current listing pages.
Why is bodyText empty?
Make sure includeBody is true. Some unusual pages may not use the standard Investor.gov article body template; the actor will still save listing metadata when possible.
How do I reduce cost?
Lower maxItems or disable includeBody for metadata-only runs.
Related scrapers
Explore related public-data actors by Automation Lab:
- https://apify.com/automation-lab/sec-form-adv-scraper
- https://apify.com/automation-lab/sec-investment-adviser-scraper
- https://apify.com/automation-lab/website-contact-finder
Changelog
0.1
- Initial release with listing pagination, detail extraction, type inference, categories, tags, and full body text.
Support
If the source page changes or you need additional Investor.gov fields, open an Apify issue with the run URL and desired output fields.