GSA Site Scanning Scraper — US Federal Website Inventory avatar

GSA Site Scanning Scraper — US Federal Website Inventory

Pricing

from $3.00 / 1,000 results

Go to Apify Store
GSA Site Scanning Scraper — US Federal Website Inventory

GSA Site Scanning Scraper — US Federal Website Inventory

Extract the GSA Site Scanning inventory of U.S. federal government websites — tech stack, security headers, HTTPS/HSTS posture, USWDS design-system usage, DAP analytics presence, and more. Filter by agency, base domain, or status code.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

Compute Edge

Compute Edge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Categories

Share

GSA Site Scanning Scraper

Extract the GSA Site Scanning inventory of U.S. federal government websites. This is the authoritative GSA-maintained dataset used to track HTTPS/HSTS adoption, USWDS (U.S. Web Design System) usage, DAP analytics deployment, and technology-stack signals across .gov properties.

Why it matters

  • GovTech vendors — identify federal agencies still on legacy CMSes
  • Compliance auditors — measure 21st Century IDEA / OMB M-22-09 adoption
  • Security researchers — flag federal subdomains with weak HTTPS posture
  • Open-source advocates — quantify USWDS rollout

Output fields

FieldDescription
url / final_urlSite URL
base_domainRegistrable domain
agency / bureauOwning agency
liveWhether the site responds
status_codeHTTP status
https, hsts, hsts_max_ageTransport security signals
dap_detected, dap_parametersDigital Analytics Program presence
uswds_semantic_versionUSWDS design-system version
cmsDetected CMS (WordPress, Drupal, ...)
analytics_usedAnalytics vendors detected
javascript_librariesJS libraries fingerprinted
third_party_service_domainsExternal services loaded

Input

{
"apiKey": "DEMO_KEY",
"baseDomain": "nasa.gov",
"liveOnly": true,
"maxResults": 500
}

DEMO_KEY works but is rate-limited to 30 requests/hour. For real use, get a free key at https://api.data.gov/signup/.

Pricing

Pay-per-result at $0.003/record. Apify compute is billed separately.

The GSA Site Scanning dataset is U.S. federal public-domain data published openly under api.data.gov. This Actor only fetches publicly-available data — no authentication is bypassed.