
Senator Financial Disclosures Scraper
Pricing
$10.00 / 1,000 results

This actor scrapes and downloads U.S. House of Representatives financial disclosure PDFs by member last name and/or filing year. Perfect for journalists, researchers, and compliance professionals seeking fast, structured access to official disclosure documents.
- Total users: 3
- Monthly users: 3
- Runs succeeded: >99%
- Last modified: 24 days ago
Senator Disclosures Scraper
This project is an Apify Actor that scrapes financial disclosure PDFs from the U.S. House of Representatives Financial Disclosure Reports site, filtered by member last name and/or filing year.
Features
- Automated scraping of disclosure PDFs by member last name and/or filing year
- Form-based input UI in Apify Console (no need to edit JSON)
- Outputs results to the Apify dataset for further processing
- Built with Apify SDK, got, and jsdom
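As a rough illustration of the scraping step, the sketch below collects absolute PDF URLs from an HTML fragment. It is not the Actor's code: the helper name `extractPdfUrls`, the sample markup, and the base URL (inferred from the sample output URL in this README) are all assumptions. It also swaps in a plain regular expression for jsdom so that it runs without dependencies.

```javascript
// Assumption: base URL inferred from the sample output URL in this README.
const BASE_URL = 'https://disclosures-clerk.house.gov/';

// Hypothetical helper: collect absolute PDF URLs from an HTML fragment.
// The Actor itself uses jsdom; a regex keeps this sketch dependency-free.
function extractPdfUrls(html, baseUrl = BASE_URL) {
    const urls = new Set(); // a Set drops duplicate links
    const hrefPattern = /href\s*=\s*"([^"]+)"/gi;
    for (const match of html.matchAll(hrefPattern)) {
        // Resolve relative hrefs against the site root.
        const absolute = new URL(match[1], baseUrl);
        if (absolute.pathname.toLowerCase().endsWith('.pdf')) {
            urls.add(absolute.href);
        }
    }
    return [...urls];
}

// Made-up markup for illustration only; the real page structure may differ.
const sampleHtml = `
  <tr><td><a href="public_disc/ptr-pdfs/2025/20026590.pdf">Pelosi, Hon.. Nancy</a></td></tr>
  <tr><td><a href="/help">Help</a></td></tr>
`;

console.log(extractPdfUrls(sampleHtml));
```

In a real run, the HTML would come from an HTTP response rather than a string literal, and a DOM parser such as jsdom is generally more robust than a regex against markup changes.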
Inputs
The Actor expects the following input (via Apify input UI or JSON):
Field | Type | Description | Required |
---|---|---|---|
lastName | string | Member's last name | No |
year | integer | Filing year (1994–2025) | No |
You can fill in either field or both. If both are left empty, the Actor exits with a warning.
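The input rules above can be sketched as a small validation function. This is a minimal sketch, not the Actor's actual logic, which this README does not show. The name `normalizeInput` and the shape of its return value are hypothetical. The year bounds come from the table above.

```javascript
// Hypothetical helper: the Actor's real validation code is not shown in this README.
const MIN_YEAR = 1994; // bounds taken from the input table
const MAX_YEAR = 2025;

function normalizeInput(input) {
    const raw = input ?? {};
    const lastName = typeof raw.lastName === 'string' ? raw.lastName.trim() : '';
    const hasYear = raw.year !== undefined && raw.year !== null && raw.year !== '';

    // Reject a year that is not an integer inside the documented range.
    if (hasYear && (!Number.isInteger(raw.year) || raw.year < MIN_YEAR || raw.year > MAX_YEAR)) {
        return { ok: false, reason: `year must be an integer between ${MIN_YEAR} and ${MAX_YEAR}` };
    }
    // Both fields are optional, but at least one must be present.
    if (!lastName && !hasYear) {
        return { ok: false, reason: 'Provide lastName, year, or both.' };
    }
    return { ok: true, lastName: lastName || null, year: hasYear ? raw.year : null };
}

console.log(normalizeInput({ lastName: ' Pelosi ' }));
console.log(normalizeInput({}));
```

Returning a result object instead of throwing lets the caller log the reason as a warning and exit cleanly, which matches the behavior described above.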
Outputs
The Actor pushes results to the default dataset. Each item has the following structure:
{"senator": "Pelosi, Hon.. Nancy","year": 2025,"url": "https://disclosures-clerk.house.gov/public_disc/ptr-pdfs/2025/20026590.pdf"}
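For downstream processing, a consumer might group dataset items by member and derive a local file name for each PDF. The sketch below assumes only the item structure shown above. The helper names `pdfFileName` and `groupByMember` are hypothetical and are not part of the Actor.

```javascript
// Sample item copied from the output structure above.
const items = [
    {
        senator: 'Pelosi, Hon.. Nancy',
        year: 2025,
        url: 'https://disclosures-clerk.house.gov/public_disc/ptr-pdfs/2025/20026590.pdf',
    },
];

// Hypothetical helper: build a local file name from the filing year and the PDF URL.
function pdfFileName(item) {
    const { pathname } = new URL(item.url);
    const base = pathname.split('/').pop(); // last path segment, e.g. "20026590.pdf"
    return `${item.year}-${base}`;
}

// Hypothetical helper: group items by the "senator" field.
function groupByMember(list) {
    const groups = new Map();
    for (const item of list) {
        const bucket = groups.get(item.senator) ?? [];
        bucket.push(item);
        groups.set(item.senator, bucket);
    }
    return groups;
}

console.log(pdfFileName(items[0]));
console.log(groupByMember(items));
```

Note that the `senator` field holds the member name as reported by the source site, so grouping on it is only as consistent as the upstream spelling.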
Getting Started
- Install dependencies: `npm install`
- Run locally: `apify run` (or `npm start`)
- Deploy to Apify:
  - Log in: `apify login`
  - Deploy: `apify push`
Deploy to GitHub
- Create a new GitHub repository (e.g., using GitHub CLI):
  `gh repo create sen-disclosures-scraper --public --source=. --remote=origin --push`
- Push your code:
  `git add .`
  `git commit -m "Initial commit"`
  `git push -u origin main`
Project Structure
- src/main.js: Main Actor logic
- .actor/actor.json: Apify Actor definition
- .actor/input_schema.json: Input schema for Apify UI
- storage/: Local Apify/Crawlee storage (git-ignored)
- README.md: Project documentation
Notes
- The project uses Node.js 18+.
- Memory requirements are modest; the default 256–512 MB is sufficient.
- The input UI is now form-based in Apify Console for easy use.