IETF RFC Editor Index Scraper avatar

IETF RFC Editor Index Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
IETF RFC Editor Index Scraper

IETF RFC Editor Index Scraper

Extract every published IETF RFC with metadata: title, authors, status, stream, obsoletes/updates relationships, DOI, and abstract. ~9,700 RFCs from RFC 1 to today, fully filterable.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

Compute Edge

Compute Edge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Categories

Share

Extract structured metadata for every published IETF Request for Comments (RFC) — the foundational standards documents of the Internet, from RFC 1 (1969) through today's TLS, HTTP/3, OAuth, and DNS specifications. This Actor parses the official RFC Editor XML index and turns ~9,700 RFCs into clean, filterable JSON.

The IETF RFC corpus is the canonical source-of-truth for how the Internet works. Engineers, standards consultants, protocol researchers, compliance auditors, patent prosecutors, and AI agents reasoning about networking all need it as structured data — not a 20MB XML file. This Actor solves that.

Key Features

  • Every RFC — From RFC 1 to the latest publication, including obsoleted documents
  • Status & stream filters — Filter by Proposed Standard, Internet Standard, Best Current Practice, Informational, Experimental, Historic; or by stream (IETF / IAB / IRTF / Independent / Legacy)
  • Year range — Pull RFCs by publication year window
  • Keyword filter — Substring match across title, keywords, and abstract
  • Current-only mode — Skip RFCs that have been obsoleted by a newer document
  • Standards relationships — Captures obsoletes, obsoleted-by, updates, updated-by, and also-known-as (BCP/STD/FYI) cross-references
  • No authentication — Public IETF data source

Output Data Fields

FieldDescription
rfcNumberInteger RFC number
docIdDocument ID (e.g., RFC9293)
titleRFC title
authorsList of author names
month / yearPublication month and year
formatsAvailable file formats (ASCII, HTML, PDF, XML)
pageCountPage count
currentStatusCurrent status (Proposed Standard, Internet Standard, etc.)
publicationStatusOriginal publication status
streamPublishing stream (IETF, IAB, IRTF, Independent, Legacy)
doiDigital Object Identifier
keywordsAuthor-supplied keywords
abstractPlain-text abstract
obsoletesDoc-IDs of RFCs this one obsoletes
obsoletedByDoc-IDs of RFCs that obsolete this one
updatesDoc-IDs this one updates
updatedByDoc-IDs that update this one
alsoKnownAsBCP / STD / FYI document IDs
hasErrataWhether errata exist
rfcUrlCanonical rfc-editor.org URL

How to Scrape the RFC Index

  1. Open the RFC Editor Index Scraper on Apify Store
  2. (Optional) Filter by status, stream, year range, or keyword
  3. (Optional) Enable "Current RFCs Only" to skip obsoleted documents
  4. Click Start — every matching RFC is written to the default dataset

Pricing

This Actor uses pay-per-result pricing. A full ~9,700-RFC extract finishes in well under a minute since it parses a single XML file.

Use Cases

  • Standards tracking — Subscribe to a filter (e.g., new TLS RFCs) for compliance updates
  • Patent prior-art search — Filter by year/keyword to find pre-existing protocol disclosures
  • AI protocol assistants — Build RAG pipelines that answer questions about IETF standards
  • Vendor compliance audits — Confirm products implement current (not obsoleted) RFCs
  • Academic / education — Bulk metadata for citation tooling and curriculum design

The IETF RFC series is published by the RFC Editor and is freely available under the IETF Trust Legal Provisions. This Actor reads the public rfc-index.xml from rfc-editor.org and parses it locally. No authentication is bypassed. Data is provided "as is" without warranty.