IETF Datatracker Documents Scraper
Pricing
from $14.00 / 1,000 result items
IETF Datatracker Documents Scraper
Pull IETF Datatracker internet drafts and RFCs: document name, title, authors, abstract, working group, area, status, revision, dates, related drafts, and PDF or text URL. Export internet engineering standards to JSON, CSV, or Excel for protocol research and developer tooling.
Pricing
from $14.00 / 1,000 result items
Rating
0.0
(0)
Developer
ParseForge
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
7 hours ago
Last modified
Categories
Share

๐จ IETF Datatracker Drafts Scraper
๐ Export the IETF standards lifecycle in seconds. Pull 42,000+ Internet Drafts, 9,000+ RFCs, working-group charters, meeting agendas, BoF requests, and the full Internet engineering corpus with state, stream, and group metadata.
๐ Last updated: 2026-05-23 ยท ๐ 17 fields per record ยท ๐ 42k+ drafts ยท ๐ 9k+ RFCs ยท ๐๏ธ All IETF working groups
The IETF Datatracker Drafts Scraper exports the authoritative catalog of Internet engineering standards published through the IETF. It returns 17 fields per record, including draft name, title, RFC number when applicable, abstract, revision, stream, standardization level, working-group acronym, state slugs, page count, word count, keywords, last-updated timestamp, expiry date, and a canonical Datatracker URL.
The corpus covers 14 document types including Internet Drafts, RFCs, Charters, BoF Requests, Meeting Agendas, Minutes, Slides, Conflict Reviews, Status Changes, Reviews, Shepherd Writeups, Liaison Statements, Best Current Practices, and For Your Information notes. Together they trace the lifecycle of every Internet protocol from initial proposal through stable RFC.
| ๐ฏ Target Audience | ๐ก Primary Use Cases |
|---|---|
| Network engineers, protocol researchers, infrastructure security teams, standards trackers, technical writers, policy analysts, RFC authors, working-group chairs | Standards-tracking dashboards, RFC archives, draft monitoring, working-group reports, protocol research, conformance audits |
๐ What the IETF Datatracker Drafts Scraper does
Five workflows in a single run:
- ๐ Document-type selector. Internet Draft, RFC, Charter, BoF Request, Meeting Agenda, Minutes, Slides, Conflict Review, Status Change, Review, Shepherd Writeup, Liaison Statement, Best Current Practice, For Your Information.
- ๐ State filter. Restrict to Active, Expired, Published as RFC, Replaced, Author Withdrew, or IETF Withdrew. Leave empty for any.
- ๐๏ธ Working-group filter. Scope to one IETF working group by acronym (e.g.
httpbis,tls,quic,oauth). - ๐
Date window. Pull only documents updated within a custom
dateFrom/dateTorange. - ๐งพ Rich metadata. Each record carries the abstract, page count, word count, keywords, stream, standardization level, and a stable Datatracker link.
๐ก Why it matters: the IETF runs by rough consensus and running code, but its catalog of 42,000+ drafts and 9,000+ RFCs is the documentation backbone of the open Internet. A queryable copy is the foundation of every standards dashboard, conformance audit, and protocol research project.
๐ฌ Full Demo
๐ง Coming soon: a 3-minute walkthrough showing how to pull a working group's active drafts and load them into a tracking dashboard.
โ๏ธ Input
| Input | Type | Default | Behavior |
|---|---|---|---|
| maxItems | integer | 10 | Records to return. Free plan caps at 10, paid plan at 1,000,000. |
| type | string | "draft" | One of 14 IETF document types. |
| state | string | "" | State slug (active, expired, rfc, repl, auth-rm, ietf-rm). Empty = any. |
| groupAcronym | string | "" | IETF working-group acronym (e.g. httpbis, tls, quic). |
| dateFrom, dateTo | string (YYYY-MM-DD) | "" | Updated-on window. |
Example: every active HTTPBIS working-group draft.
{"maxItems": 100,"type": "draft","state": "active","groupAcronym": "httpbis"}
Example: every RFC published since 2024.
{"maxItems": 1000,"type": "rfc","dateFrom": "2024-01-01"}
โ ๏ธ Good to Know: state filtering is applied after fetching each document's state metadata. For working-group filters, the Actor resolves the acronym to a group ID server-side before paging, so large group queries stay efficient.
๐ Output
Each record contains 17 fields. Download the dataset as CSV, Excel, JSON, or XML.
๐งพ Schema
| Field | Type | Example |
|---|---|---|
๐ name | string | null | "draft-ietf-httpbis-http2bis" |
๐ title | string | null | "HTTP/2" |
๐ท๏ธ type | string | "draft" |
๐ข rfcNumber | number | null | 9113 |
๐ revision | string | null | "05" |
๐ abstract | string | null | "This document describes HTTP/2..." |
๐ stream | string | null | "ietf" |
โญ stdLevel | string | null | "std" |
๐ฏ intendedStdLevel | string | null | "std" |
๐๏ธ groupAcronym | string | null | "httpbis" |
๐ stateSlugs | string[] | null | ["rfc"] |
๐ pages | number | null | 96 |
๐ค words | number | null | 42150 |
๐ท๏ธ keywords | string[] | null | ["http","web","framing"] |
๐ timeUpdated | ISO 8601 | "2024-06-12T13:43:00" |
โณ expires | ISO 8601 | null | "2025-01-01T00:00:00" |
๐ datatrackerUrl | string | "https://datatracker.ietf.org/doc/draft-ietf-httpbis-http2bis/" |
๐ scrapedAt | ISO 8601 | "2026-05-23T00:00:00.000Z" |
๐ฆ Sample records
โจ Why choose this Actor
| Capability | |
|---|---|
| ๐จ | Full corpus. 42,000+ drafts, 9,000+ RFCs, plus charters, agendas, BoF requests, and 10 more document types. |
| ๐ | State and stream filters. Active, Expired, Replaced, Withdrawn, Published as RFC. |
| ๐๏ธ | Working-group scoping. Acronym-based filter for httpbis, tls, quic, oauth, and every other IETF group. |
| ๐ | Date windows. Pull only what changed since your last run. |
| ๐ | Rich metadata. Pages, words, keywords, abstract, stream, std level, intended std level. |
| โก | Fast. 100 documents in under 30 seconds. |
| ๐ซ | No authentication. Public IETF Datatracker. No login or token needed. |
๐ The IETF catalog is the documentation source-of-truth for the protocols that run the open Internet. A clean, queryable copy makes every standards dashboard, conformance audit, and research project faster.
๐ How it compares to alternatives
| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| โญ IETF Datatracker Drafts Scraper (this Actor) | $5 free credit, then pay-per-use | 42k drafts, 9k RFCs | Live per run | type, state, group, date | โก 2 min |
| RFC Editor static index | Free | RFCs only | Periodic | None | ๐ข No structure |
| Hand-parsed Datatracker HTML | Free | Same | Manual | Custom code | ๐ Days |
| Tools.ietf.org legacy scraping | Free | Subset | Stale | None | ๐ Variable |
Pick this Actor when you need the full IETF lifecycle (not just published RFCs) with structured filters and stable fields.
๐ How to use
- ๐ Sign up. Create a free account with $5 credit (takes 2 minutes).
- ๐ Open the Actor. Go to the IETF Datatracker Drafts Scraper page on the Apify Store.
- ๐ฏ Set input. Pick a
type, optionally filter bystate,groupAcronym, or date window, and setmaxItems. - ๐ Run it. Click Start and let the Actor collect your data.
- ๐ฅ Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.
โฑ๏ธ Total time from signup to a downloaded dataset: 3-5 minutes. No coding required.
๐ผ Business use cases
๐ Automating IETF Datatracker Drafts Scraper
Control the scraper programmatically for scheduled runs and pipeline integrations:
- ๐ข Node.js. Install the
apify-clientNPM package. - ๐ Python. Use the
apify-clientPyPI package. - ๐ See the Apify API documentation for full details.
The Apify Schedules feature lets you trigger this Actor on any cron interval. Daily or weekly working-group monitors are a common pattern, with Slack or email notifications on new draft revisions.
๐ Beyond business use cases
Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.
๐ค Ask an AI assistant about this scraper
Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:
- ๐ฌ ChatGPT
- ๐ง Claude
- ๐ Perplexity
- ๐ Copilot
โ Frequently Asked Questions
๐งฉ How does it work?
Pick a document type, optionally narrow by state, working group, or date window, and click Start. The Actor pages through the Datatracker catalog and emits a clean structured record per document.
๐จ How many documents are in the catalog?
The IETF Datatracker tracks 42,000+ Internet Drafts and 9,000+ RFCs, plus charters, BoF requests, and 10 more document types. The Actor pages through the whole catalog when maxItems is high enough.
๐๏ธ Can I filter by working group?
Yes. Set groupAcronym to the group's acronym (for example httpbis, tls, quic, oauth). The Actor resolves the acronym to the internal group ID and applies the filter server-side.
๐ What states are supported?
active, expired, rfc (Published as RFC), repl (Replaced), auth-rm (Author Withdrew), ietf-rm (IETF Withdrew). Leave the field empty for any state.
๐ Can I pull only what changed?
Yes. Use dateFrom and dateTo (YYYY-MM-DD) to scope the run to a custom updated-on window. Common pattern: pull yesterday's changes for a daily monitor.
โฐ Can I schedule regular runs?
Yes. Use Apify Schedules to run on any cron interval. Daily monitors of one working group, weekly RFC digests, and monthly full-catalog refreshes are all common patterns.
โ๏ธ Is this data legal to use?
Yes. The IETF Datatracker is publicly accessible and the underlying IETF documents are published for open use.
๐ผ Can I use this commercially?
Yes. Standards dashboards, conformance reports, and protocol-research products are all valid commercial use cases.
๐ณ Do I need a paid Apify plan?
No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you access to scheduling, higher concurrency, and larger datasets.
๐ What happens if a run fails partway?
Apify retries transient errors automatically. Records already pushed are preserved. The Actor pages through the catalog deterministically, so a re-run picks up cleanly.
๐ What if I need help?
Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.
๐ Integrate with any app
IETF Datatracker Drafts Scraper connects to any cloud service via Apify integrations:
- Make - Automate multi-step workflows
- Zapier - Connect with 5,000+ apps
- Slack - Get run notifications in your channels
- Airbyte - Pipe IETF data into your warehouse
- GitHub - Trigger runs from repo commits
- Google Drive - Export datasets straight to Sheets
You can also use webhooks to fire downstream actions when a run finishes. Push a daily diff of HTTPBIS drafts into Slack, or load every new RFC into your knowledge base.
๐ Recommended Actors
- ๐ W3C Standards Catalog Scraper - Open Web specifications catalog
- ๐ arXiv Scraper - Open-access research papers across all fields
- ๐ FIRST.org CSIRT Teams Scraper - Incident response teams and EPSS scores
- ๐ OEC Economic Complexity Trade Scraper - International trade flows
- ๐ Nominatim OSM Scraper - Geocode addresses via OpenStreetMap
๐ก Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.
๐ Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.
โ ๏ธ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the IETF or the Internet Society. All trademarks mentioned are the property of their respective owners. Only publicly available Datatracker data is collected.