GSA Site Scanning Scraper — US Federal Website Inventory
Pricing
from $3.00 / 1,000 results
Extract the GSA Site Scanning inventory of U.S. federal government websites — tech stack, security headers, HTTPS/HSTS posture, USWDS design-system usage, DAP analytics presence, and more. Filter by agency, base domain, or status code.
Developer
Compute Edge
Maintained by Community
GSA Site Scanning Scraper
Extract the GSA Site Scanning inventory of U.S. federal government websites. This is the authoritative GSA-maintained dataset used to track HTTPS/HSTS adoption, USWDS (U.S. Web Design System) usage, DAP analytics deployment, and technology-stack signals across .gov properties.
Why it matters
- GovTech vendors — identify federal agencies still on legacy CMSes
- Compliance auditors — measure 21st Century IDEA / OMB M-22-09 adoption
- Security researchers — flag federal subdomains with weak HTTPS posture
- Open-source advocates — quantify USWDS rollout
Output fields
| Field | Description |
|---|---|
| url / final_url | Site URL |
| base_domain | Registrable domain |
| agency / bureau | Owning agency |
| live | Whether the site responds |
| status_code | HTTP status |
| https, hsts, hsts_max_age | Transport security signals |
| dap_detected, dap_parameters | Digital Analytics Program presence |
| uswds_semantic_version | USWDS design-system version |
| cms | Detected CMS (WordPress, Drupal, ...) |
| analytics_used | Analytics vendors detected |
| javascript_libraries | JS libraries fingerprinted |
| third_party_service_domains | External services loaded |
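As a rough sketch of how these fields might be used downstream, the Python snippet below flags live sites with weak transport security. It assumes each record is a dict keyed by the field names in the table, that `live`, `https`, and `hsts` are booleans, and that `hsts_max_age` is an integer number of seconds or missing. The one-year threshold, the function name `weak_https_posture`, and the sample records are illustrative and not part of the Actor.

```python
from typing import Any

# Illustrative threshold: one year, in seconds. Not defined by the Actor.
MIN_HSTS_MAX_AGE = 31_536_000


def weak_https_posture(records: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Return live records that lack HTTPS, lack HSTS, or have a short HSTS max-age."""
    flagged = []
    for rec in records:
        if not rec.get("live"):
            continue  # skip sites that did not respond
        max_age = rec.get("hsts_max_age") or 0  # treat a missing value as 0
        if not rec.get("https") or not rec.get("hsts") or max_age < MIN_HSTS_MAX_AGE:
            flagged.append(rec)
    return flagged


# Hypothetical sample records, shaped like the output fields table.
sample = [
    {"url": "a.example.gov", "live": True, "https": True, "hsts": True, "hsts_max_age": 31536000},
    {"url": "b.example.gov", "live": True, "https": True, "hsts": False, "hsts_max_age": None},
    {"url": "c.example.gov", "live": False, "https": False, "hsts": False, "hsts_max_age": None},
]
print([r["url"] for r in weak_https_posture(sample)])
```

If the actual output encodes these signals differently (for example as strings), the truthiness checks would need adjusting.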
Input
{"apiKey": "DEMO_KEY", "baseDomain": "nasa.gov", "liveOnly": true, "maxResults": 500}
DEMO_KEY works but is rate-limited to 30 requests/hour. For real use, get a free key at https://api.data.gov/signup/.
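A minimal sketch of assembling this input in Python follows. The helper `build_input` and the environment variable `DATA_GOV_API_KEY` are hypothetical names chosen for illustration; only the four keys shown in the JSON above come from this README.

```python
import json
import os


def build_input(base_domain: str, max_results: int = 500, live_only: bool = True) -> dict:
    """Build an Actor input dict using only the keys documented above."""
    if max_results < 1:
        raise ValueError("max_results must be a positive integer")
    return {
        # Fall back to the rate-limited DEMO_KEY when no personal key is set.
        # DATA_GOV_API_KEY is a hypothetical variable name.
        "apiKey": os.environ.get("DATA_GOV_API_KEY", "DEMO_KEY"),
        "baseDomain": base_domain.strip().lower(),
        "liveOnly": live_only,
        "maxResults": max_results,
    }


print(json.dumps(build_input("NASA.gov"), indent=2))
```

Reading the key from the environment keeps it out of source control; the resulting dict can then be supplied as the run input however you start the Actor.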
Pricing
Pay-per-result at $0.003/record. Apify compute is billed separately.
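For budgeting, the per-result charge is simple arithmetic: the number of records multiplied by $0.003. The sketch below shows that calculation in Python. The function name is illustrative, and compute charges are left out because this README gives no rate for them.

```python
from decimal import Decimal

PRICE_PER_RECORD = Decimal("0.003")  # USD, from the pricing stated above


def estimate_result_cost(num_records: int) -> Decimal:
    """Estimate the pay-per-result charge in USD, excluding Apify compute."""
    if num_records < 0:
        raise ValueError("num_records must be non-negative")
    return PRICE_PER_RECORD * num_records


print(estimate_result_cost(500))   # 1.500
print(estimate_result_cost(1000))  # 3.000
```

A run capped at `maxResults` of 500 would therefore cost at most $1.50 in result charges.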
Legal
The GSA Site Scanning dataset is U.S. federal public-domain data published openly through api.data.gov. This Actor only fetches publicly available data; no authentication is bypassed.