IETF Datatracker Documents Scraper avatar

IETF Datatracker Documents Scraper

Pricing

from $14.00 / 1,000 result items

Go to Apify Store
IETF Datatracker Documents Scraper

IETF Datatracker Documents Scraper

Pull IETF Datatracker internet drafts and RFCs: document name, title, authors, abstract, working group, area, status, revision, dates, related drafts, and PDF or text URL. Export internet engineering standards to JSON, CSV, or Excel for protocol research and developer tooling.

Pricing

from $14.00 / 1,000 result items

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

7 hours ago

Last modified

Share

ParseForge Banner

๐Ÿ“จ IETF Datatracker Drafts Scraper

๐Ÿš€ Export the IETF standards lifecycle in seconds. Pull 42,000+ Internet Drafts, 9,000+ RFCs, working-group charters, meeting agendas, BoF requests, and the full Internet engineering corpus with state, stream, and group metadata.

๐Ÿ•’ Last updated: 2026-05-23 ยท ๐Ÿ“Š 17 fields per record ยท ๐Ÿ“œ 42k+ drafts ยท ๐Ÿ“„ 9k+ RFCs ยท ๐Ÿ›๏ธ All IETF working groups

The IETF Datatracker Drafts Scraper exports the authoritative catalog of Internet engineering standards published through the IETF. It returns 17 fields per record, including draft name, title, RFC number when applicable, abstract, revision, stream, standardization level, working-group acronym, state slugs, page count, word count, keywords, last-updated timestamp, expiry date, and a canonical Datatracker URL.

The corpus covers 14 document types including Internet Drafts, RFCs, Charters, BoF Requests, Meeting Agendas, Minutes, Slides, Conflict Reviews, Status Changes, Reviews, Shepherd Writeups, Liaison Statements, Best Current Practices, and For Your Information notes. Together they trace the lifecycle of every Internet protocol from initial proposal through stable RFC.

๐ŸŽฏ Target Audience๐Ÿ’ก Primary Use Cases
Network engineers, protocol researchers, infrastructure security teams, standards trackers, technical writers, policy analysts, RFC authors, working-group chairsStandards-tracking dashboards, RFC archives, draft monitoring, working-group reports, protocol research, conformance audits

๐Ÿ“‹ What the IETF Datatracker Drafts Scraper does

Five workflows in a single run:

  • ๐Ÿ“œ Document-type selector. Internet Draft, RFC, Charter, BoF Request, Meeting Agenda, Minutes, Slides, Conflict Review, Status Change, Review, Shepherd Writeup, Liaison Statement, Best Current Practice, For Your Information.
  • ๐Ÿ”– State filter. Restrict to Active, Expired, Published as RFC, Replaced, Author Withdrew, or IETF Withdrew. Leave empty for any.
  • ๐Ÿ›๏ธ Working-group filter. Scope to one IETF working group by acronym (e.g. httpbis, tls, quic, oauth).
  • ๐Ÿ“… Date window. Pull only documents updated within a custom dateFrom/dateTo range.
  • ๐Ÿงพ Rich metadata. Each record carries the abstract, page count, word count, keywords, stream, standardization level, and a stable Datatracker link.

๐Ÿ’ก Why it matters: the IETF runs by rough consensus and running code, but its catalog of 42,000+ drafts and 9,000+ RFCs is the documentation backbone of the open Internet. A queryable copy is the foundation of every standards dashboard, conformance audit, and protocol research project.


๐ŸŽฌ Full Demo

๐Ÿšง Coming soon: a 3-minute walkthrough showing how to pull a working group's active drafts and load them into a tracking dashboard.


โš™๏ธ Input

InputTypeDefaultBehavior
maxItemsinteger10Records to return. Free plan caps at 10, paid plan at 1,000,000.
typestring"draft"One of 14 IETF document types.
statestring""State slug (active, expired, rfc, repl, auth-rm, ietf-rm). Empty = any.
groupAcronymstring""IETF working-group acronym (e.g. httpbis, tls, quic).
dateFrom, dateTostring (YYYY-MM-DD)""Updated-on window.

Example: every active HTTPBIS working-group draft.

{
"maxItems": 100,
"type": "draft",
"state": "active",
"groupAcronym": "httpbis"
}

Example: every RFC published since 2024.

{
"maxItems": 1000,
"type": "rfc",
"dateFrom": "2024-01-01"
}

โš ๏ธ Good to Know: state filtering is applied after fetching each document's state metadata. For working-group filters, the Actor resolves the acronym to a group ID server-side before paging, so large group queries stay efficient.


๐Ÿ“Š Output

Each record contains 17 fields. Download the dataset as CSV, Excel, JSON, or XML.

๐Ÿงพ Schema

FieldTypeExample
๐Ÿ†” namestring | null"draft-ietf-httpbis-http2bis"
๐Ÿ“œ titlestring | null"HTTP/2"
๐Ÿท๏ธ typestring"draft"
๐Ÿ”ข rfcNumbernumber | null9113
๐Ÿ”„ revisionstring | null"05"
๐Ÿ“ abstractstring | null"This document describes HTTP/2..."
๐ŸŒŠ streamstring | null"ietf"
โญ stdLevelstring | null"std"
๐ŸŽฏ intendedStdLevelstring | null"std"
๐Ÿ›๏ธ groupAcronymstring | null"httpbis"
๐Ÿ”– stateSlugsstring[] | null["rfc"]
๐Ÿ“„ pagesnumber | null96
๐Ÿ”ค wordsnumber | null42150
๐Ÿท๏ธ keywordsstring[] | null["http","web","framing"]
๐Ÿ•’ timeUpdatedISO 8601"2024-06-12T13:43:00"
โณ expiresISO 8601 | null"2025-01-01T00:00:00"
๐Ÿ”— datatrackerUrlstring"https://datatracker.ietf.org/doc/draft-ietf-httpbis-http2bis/"
๐Ÿ•’ scrapedAtISO 8601"2026-05-23T00:00:00.000Z"

๐Ÿ“ฆ Sample records


โœจ Why choose this Actor

Capability
๐Ÿ“จFull corpus. 42,000+ drafts, 9,000+ RFCs, plus charters, agendas, BoF requests, and 10 more document types.
๐Ÿ”–State and stream filters. Active, Expired, Replaced, Withdrawn, Published as RFC.
๐Ÿ›๏ธWorking-group scoping. Acronym-based filter for httpbis, tls, quic, oauth, and every other IETF group.
๐Ÿ“…Date windows. Pull only what changed since your last run.
๐Ÿ“ŠRich metadata. Pages, words, keywords, abstract, stream, std level, intended std level.
โšกFast. 100 documents in under 30 seconds.
๐ŸšซNo authentication. Public IETF Datatracker. No login or token needed.

๐Ÿ“Š The IETF catalog is the documentation source-of-truth for the protocols that run the open Internet. A clean, queryable copy makes every standards dashboard, conformance audit, and research project faster.


๐Ÿ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
โญ IETF Datatracker Drafts Scraper (this Actor)$5 free credit, then pay-per-use42k drafts, 9k RFCsLive per runtype, state, group, dateโšก 2 min
RFC Editor static indexFreeRFCs onlyPeriodicNone๐Ÿข No structure
Hand-parsed Datatracker HTMLFreeSameManualCustom code๐Ÿ•’ Days
Tools.ietf.org legacy scrapingFreeSubsetStaleNone๐Ÿ•’ Variable

Pick this Actor when you need the full IETF lifecycle (not just published RFCs) with structured filters and stable fields.


๐Ÿš€ How to use

  1. ๐Ÿ“ Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. ๐ŸŒ Open the Actor. Go to the IETF Datatracker Drafts Scraper page on the Apify Store.
  3. ๐ŸŽฏ Set input. Pick a type, optionally filter by state, groupAcronym, or date window, and set maxItems.
  4. ๐Ÿš€ Run it. Click Start and let the Actor collect your data.
  5. ๐Ÿ“ฅ Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

โฑ๏ธ Total time from signup to a downloaded dataset: 3-5 minutes. No coding required.


๐Ÿ’ผ Business use cases

๐ŸŒ Network & Infrastructure Engineering

  • Track new drafts in HTTPBIS, QUIC, TLS, OAUTH
  • Monitor protocol updates that affect your stack
  • Build internal "protocols we implement" dashboards
  • Alert on state transitions (draft to RFC)

๐Ÿ” Security & Conformance

  • Audit TLS, IPsec, SASL, and PKIX drafts
  • Track BCP and security-area working groups
  • Build compliance reports against current RFCs
  • Detect deprecated specs in your codebase

๐Ÿ“š Standards Research

  • Map working-group activity over time
  • Trace draft-to-RFC lifecycle data
  • Build citation graphs across documents
  • Academic papers on Internet governance

๐Ÿ“ฐ Technical Writing & DevRel

  • Auto-update docs links to latest RFCs
  • Generate "implements" pages for OSS projects
  • Newsletter content on standards updates
  • Internal "RFC of the week" digests

๐Ÿ”Œ Automating IETF Datatracker Drafts Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

  • ๐ŸŸข Node.js. Install the apify-client NPM package.
  • ๐Ÿ Python. Use the apify-client PyPI package.
  • ๐Ÿ“š See the Apify API documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Daily or weekly working-group monitors are a common pattern, with Slack or email notifications on new draft revisions.


๐ŸŒŸ Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

๐ŸŽ“ Research and academia

  • Internet governance research with reproducible pulls
  • Protocol-evolution studies for CS papers
  • Network coursework anchored to real RFCs
  • Standards-process measurement studies

๐ŸŽจ Personal and creative

  • Build a personal RFC reading queue
  • Side projects mapping the Internet's plumbing
  • Hobbyist protocol implementations
  • "On this day in IETF history" feeds

๐Ÿค Non-profit and civic

  • Internet-policy advocacy with citation data
  • Open-source contribution dashboards
  • Digital-rights research with primary sources
  • Standards transparency for civil society

๐Ÿงช Experimentation

  • Train document-classification models on RFC text
  • Prototype agent pipelines that read IETF specs
  • Build "what changed since last quarter" workflows
  • Test embeddable RFC lookup widgets

๐Ÿค– Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:


โ“ Frequently Asked Questions

๐Ÿงฉ How does it work?

Pick a document type, optionally narrow by state, working group, or date window, and click Start. The Actor pages through the Datatracker catalog and emits a clean structured record per document.

๐Ÿ“จ How many documents are in the catalog?

The IETF Datatracker tracks 42,000+ Internet Drafts and 9,000+ RFCs, plus charters, BoF requests, and 10 more document types. The Actor pages through the whole catalog when maxItems is high enough.

๐Ÿ›๏ธ Can I filter by working group?

Yes. Set groupAcronym to the group's acronym (for example httpbis, tls, quic, oauth). The Actor resolves the acronym to the internal group ID and applies the filter server-side.

๐Ÿ”– What states are supported?

active, expired, rfc (Published as RFC), repl (Replaced), auth-rm (Author Withdrew), ietf-rm (IETF Withdrew). Leave the field empty for any state.

๐Ÿ“… Can I pull only what changed?

Yes. Use dateFrom and dateTo (YYYY-MM-DD) to scope the run to a custom updated-on window. Common pattern: pull yesterday's changes for a daily monitor.

โฐ Can I schedule regular runs?

Yes. Use Apify Schedules to run on any cron interval. Daily monitors of one working group, weekly RFC digests, and monthly full-catalog refreshes are all common patterns.

Yes. The IETF Datatracker is publicly accessible and the underlying IETF documents are published for open use.

๐Ÿ’ผ Can I use this commercially?

Yes. Standards dashboards, conformance reports, and protocol-research products are all valid commercial use cases.

๐Ÿ’ณ Do I need a paid Apify plan?

No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you access to scheduling, higher concurrency, and larger datasets.

๐Ÿ” What happens if a run fails partway?

Apify retries transient errors automatically. Records already pushed are preserved. The Actor pages through the catalog deterministically, so a re-run picks up cleanly.

๐Ÿ†˜ What if I need help?

Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.


๐Ÿ”Œ Integrate with any app

IETF Datatracker Drafts Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications in your channels
  • Airbyte - Pipe IETF data into your warehouse
  • GitHub - Trigger runs from repo commits
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to fire downstream actions when a run finishes. Push a daily diff of HTTPBIS drafts into Slack, or load every new RFC into your knowledge base.


๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.


๐Ÿ†˜ Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.


โš ๏ธ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the IETF or the Internet Society. All trademarks mentioned are the property of their respective owners. Only publicly available Datatracker data is collected.