Pricing

from $13.00 / 1,000 result items

W3C Standards Catalog Scraper

Scrape W3C standards catalog: title, status, type, date, editors, abstract, shortname, group, deliverer, errata, and specification URL. Covers Recommendations, Working Drafts, Notes, and Candidate Recommendations. Export web standards to JSON, CSV, or Excel for developer tooling.

Developer

ParseForge

Last modified: 6 days ago

📐 W3C Standards Catalog Scraper

🚀 Export the full W3C Web standards catalog in seconds. Pull 1,696 specifications including HTML, CSS, ARIA, WebSocket, Web Components, and every other open Web standard with maturity status, deliverers, and full version history.

The W3C Standards Catalog Scraper exports the official W3C specifications corpus, returning 15 fields per record, including shortname, title, maturity status, description, latest version URL, first version URL, working-group deliverer shortnames, and full version history when requested. The dataset is the authoritative catalog of Web standards published by the World Wide Web Consortium since 1994.

The catalog covers 1,696 specifications across HTML, CSS, the DOM, Web APIs, ARIA accessibility standards, WebSocket, Web Components, payment APIs, internationalization, security, privacy, and dozens of other working groups. A second mode enumerates W3C working groups and community groups themselves, returning the org chart of the open Web.

🎯 Target Audience: Web developers, browser engineers, standards researchers, accessibility auditors, technical writers, conformance teams, framework authors
💡 Primary Use Cases: Conformance audits, "supported standards" dashboards, browser feature trackers, accessibility coverage, framework spec mapping, standards research

📋 What the W3C Standards Catalog Scraper does

Four workflows in a single Actor:

📚 Full specifications catalog. Every W3C spec from Recommendation to Working Draft to Retired, with shortname, title, status, and links.
🏛️ Working groups directory. Switch to mode: "groups" to enumerate the W3C organisational chart of working groups and community groups.
🔖 Status and group filters. Narrow to one maturity level (Recommendation, Candidate Recommendation, Working Draft, Group Note, Retired, Superseded, Rescinded, Proposed Recommendation) or to one working-group shortname (css, webapps, html, aria).
🗂️ Optional version history. Toggle includeVersions to pull the per-spec version list with one extra call per record.

Each record carries the canonical shortname, the human-readable title, the maturity status, the editor's draft URL, the latest and first version URLs, the deliverers (working-group shortnames), and a stable API URL back to the W3C catalog.
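
As a rough illustration of how the mode, filter, and version-history options described above might combine, the Python sketch below assembles a run input. The keys mode, includeVersions, and maxItems are named on this page; the status and group keys, the exact status strings the Actor accepts, and the helper build_input itself are assumptions made for illustration, so check the Actor's input schema before relying on them.

```python
from typing import Any, Dict, Optional

# Maturity levels as they are spelled on this page. Whether the Actor expects
# exactly these strings is an assumption.
KNOWN_STATUSES = {
    "Recommendation", "Candidate Recommendation", "Working Draft",
    "Group Note", "Retired", "Superseded", "Rescinded",
    "Proposed Recommendation",
}


def build_input(
    mode: str = "specifications",
    status: Optional[str] = None,
    group: Optional[str] = None,
    include_versions: bool = False,
    max_items: int = 100,
) -> Dict[str, Any]:
    """Build a run-input dictionary (hypothetical helper, not part of the Actor)."""
    if mode not in ("specifications", "groups"):
        raise ValueError(f"unknown mode: {mode!r}")
    if status is not None and status not in KNOWN_STATUSES:
        raise ValueError(f"unknown status: {status!r}")
    if max_items < 1:
        raise ValueError("max_items must be at least 1")

    run_input: Dict[str, Any] = {
        "mode": mode,                        # named on this page
        "includeVersions": include_versions,  # named on this page
        "maxItems": max_items,               # named on this page
    }
    if status is not None:
        run_input["status"] = status  # hypothetical key name
    if group is not None:
        run_input["group"] = group    # hypothetical key name
    return run_input


if __name__ == "__main__":
    # Example: CSS Recommendations only, with version history.
    print(build_input(status="Recommendation", group="css", include_versions=True))
```

Validating the values locally is a design choice of this sketch: it surfaces typos in a status or mode before a paid run starts.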

💡 Why it matters: the Web is an open platform because standards are public, traceable, and versioned. Building a conformance dashboard, browser feature tracker, or framework spec map on top of them otherwise means parsing inconsistent HTML, scraping multiple pages, and stitching the org chart together by hand. This Actor gives you the structured catalog in one run.

📊 Data fields

The dataset schema defines 20 fields: apiUrl, creationDate, description, editorDraftUrl, firstVersionUrl, groupShortnames, groupType, homepageUrl, isClosed, latestVersionUrl, mode, scrapedAt, seriesShortname, seriesVersion, shortlink, shortname, status, title, versionHistory, versionsCount. These field names come straight from the Actor's dataset schema, so what you see here is what lands in your dataset.
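
To show how these fields might feed a conformance dashboard, here is a minimal Python sketch that tallies records by status and lists the specs delivered by one group. The field names status, shortname, and groupShortnames come from the list above; treating groupShortnames as a list of strings is an assumption, the two helper functions are hypothetical, and the sample rows are placeholders rather than real catalog data.

```python
from collections import Counter
from typing import Any, Dict, Iterable, List

Record = Dict[str, Any]


def count_by_status(records: Iterable[Record]) -> Counter:
    """Tally records per maturity status; missing values count as 'unknown'."""
    return Counter(rec.get("status") or "unknown" for rec in records)


def specs_for_group(records: Iterable[Record], group: str) -> List[str]:
    """Return the sorted shortnames of records delivered by `group`.

    Assumes groupShortnames is a list of strings (not confirmed by the page).
    """
    return sorted(
        rec["shortname"]
        for rec in records
        if rec.get("shortname") and group in (rec.get("groupShortnames") or [])
    )


# Placeholder rows for illustration only; these are not real catalog entries.
SAMPLE = [
    {"shortname": "example-a", "status": "Recommendation", "groupShortnames": ["css"]},
    {"shortname": "example-b", "status": "Working Draft", "groupShortnames": ["css", "webapps"]},
    {"shortname": "example-c", "status": None, "groupShortnames": ["aria"]},
]

if __name__ == "__main__":
    print(count_by_status(SAMPLE))
    print(specs_for_group(SAMPLE, "css"))
```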

🚀 How to use

📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
🌐 Open the Actor. Go to the W3C Standards Catalog Scraper page on the Apify Store.
🎯 Set input. Pick a mode (specifications or groups), optionally filter by status or group, and set maxItems.
🚀 Run it. Click Start and let the Actor collect your data.
📥 Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to a downloaded catalog: 3-5 minutes. No coding required.
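
For readers who do want to post-process the download in code, the following Python sketch loads a JSON export and flattens a few columns to CSV. It assumes the JSON download is a top-level array of record objects and that list-valued fields such as groupShortnames hold plain strings; the function names and the file name in the usage note are hypothetical.

```python
import csv
import io
import json
from typing import Any, Dict, List, Sequence

Record = Dict[str, Any]


def load_export(path: str) -> List[Record]:
    """Read a JSON export; assumes a top-level array of objects."""
    with open(path, encoding="utf-8") as handle:
        data = json.load(handle)
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of records")
    return data


def records_to_csv(records: Sequence[Record], fields: Sequence[str]) -> str:
    """Render the chosen fields as CSV text, joining list values with ';'."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(fields), lineterminator="\n")
    writer.writeheader()
    for rec in records:
        row = {}
        for name in fields:
            value = rec.get(name)
            if isinstance(value, list):
                value = ";".join(str(item) for item in value)
            row[name] = "" if value is None else value
        writer.writerow(row)
    return buffer.getvalue()
```

A call such as records_to_csv(load_export("w3c-specs.json"), ["shortname", "status", "groupShortnames"]) would then yield one row per record, where w3c-specs.json stands in for whatever file name the download was saved under.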

🔗 Recommended Actors

📨 IETF Datatracker Drafts Scraper - Internet standards drafts, RFCs, and charters
📚 arXiv Scraper - Open-access research papers across all fields
📊 OEC Economic Complexity Trade Scraper - International trade flows by country and product
📈 Indexmundi Scraper - Global demographic and economic indicators
🌐 Nominatim OSM Scraper - Geocode addresses via OpenStreetMap

💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.

⚠️ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the W3C or its member organisations. All trademarks mentioned are the property of their respective owners. Only publicly available W3C catalog data is collected.

🆘 Need Help?

If you hit a bug, have questions about setup, or need a scraper we haven't built yet, open our contact form or write to parseforge@protonmail.com. We also take on paid custom data projects.

For faster answers, join our Discord. It's the best place to get support and suggest new actors.
