SCOTUS Oyez Oral Arguments Scraper avatar

SCOTUS Oyez Oral Arguments Scraper

Pricing

Pay per event

Go to Apify Store
SCOTUS Oyez Oral Arguments Scraper

SCOTUS Oyez Oral Arguments Scraper

Scrape Supreme Court oral argument transcripts, case metadata, and speaker-attributed text from Oyez.org. Extract 5,000+ cases dating back to 1955 with full transcripts (justice/advocate speech turns), audio links, decisions, dissents, and concurrences.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

9 days ago

Last modified

Share

Scrape Supreme Court oral argument transcripts, case metadata, and speaker-attributed speech from Oyez (the IIT Chicago-Kent canonical SCOTUS index). Covers 5,000+ cases dating back to 1955 with full transcripts showing justice and advocate turns, audio links, decisions, votes, and advocate details.

What You Get

Each output record contains:

  • Case metadata: docket number, case name, parties, term year, lower court, manner of jurisdiction, citation
  • Dates: date argued and date decided (ISO 8601)
  • People: which justices heard the case, which decided it, and all advocates with their roles
  • Decisions: majority/dissenting vote counts, winning party, decision type
  • Oral argument sessions: title, audio MP3 URL, duration (when fetchTranscripts is enabled)
  • Full transcripts: speaker-attributed segments with justice/advocate name, role, spoken text, and start/end timestamps in seconds
  • Opinion announcements: post-decision audio and transcripts
  • Written opinions: author, type (majority, concurring, dissenting), Justia link
  • Case summaries: facts of the case, conclusion (HTML-stripped)
  • Cross-references: Justia URL, Oyez canonical URL, Oyez API URL

Input Options

FieldDescriptionDefault
termStartFirst SCOTUS term year to scrape (e.g. 2020)2020
termEndLast SCOTUS term year (inclusive). Omit to scrape only termStart2023
docketNumberScrape a single case by docket number (e.g. 19-1392). Requires termStart.
caseNameFilter cases whose name contains this string (case-insensitive, e.g. Dobbs)
fetchTranscriptsFetch full speaker-attributed transcripts for each oral argumenttrue
maxItemsMaximum case records to return (0 = unlimited)15

Example: Single Case with Transcript

{
"termStart": 2021,
"docketNumber": "19-1392",
"fetchTranscripts": true,
"maxItems": 1
}

Returns Dobbs v. Jackson Women's Health Organization with the full Oyez-normalized transcript, audio link, advocate list, and decision metadata.

Example: Full Recent Term (No Transcripts)

{
"termStart": 2022,
"termEnd": 2022,
"fetchTranscripts": false,
"maxItems": 0
}

Returns all ~62 October 2022 Term cases with metadata, advocate lists, and decision records — no transcript audio fetches.

{
"termStart": 2021,
"termEnd": 2021,
"caseName": "New York",
"fetchTranscripts": true,
"maxItems": 5
}

Transcript Format

When fetchTranscripts is enabled, oral_argument_transcript contains a JSON-serialized array:

[
{
"speaker": "John G. Roberts, Jr.",
"role": "Chief Justice of the United States",
"text": "We will hear argument this morning in Case 19-1392...",
"start_sec": 0.08,
"end_sec": 9.2,
"section": 0
},
{
"speaker": "Scott Stewart",
"role": "Counsel for Petitioner",
"text": "Mr. Chief Justice, and may it please the Court...",
"start_sec": 9.2,
"end_sec": 45.1,
"section": 0
}
]

Audio MP3 links are in oral_argument_sessions[].media_url.

Data Source & Attribution

Data is sourced from the Oyez Project, operated by IIT Chicago-Kent College of Law. Oyez's normalized case metadata and annotation work (justice attribution, summaries) is licensed CC BY-NC 4.0 — commercial redistribution of Oyez's annotated content requires attribution. The underlying oral argument audio and verbatim transcripts are US federal government works and are public domain.

Performance

  • Rate-limited to 2 requests/second out of respect for Oyez's non-profit infrastructure
  • Memory: 256 MB (no browser)
  • Typical run: ~1-2 seconds per case without transcripts; ~3-5 seconds per case with transcripts (one additional API call per oral argument session)

Why Oyez?

Oyez is the only public source with speaker-attributed SCOTUS oral argument transcripts. supremecourt.gov publishes raw transcript PDFs but without justice attribution or audio alignment. Oyez normalizes these into a structured JSON API with justice/advocate identification, making it the canonical source for constitutional law research, legal-tech RAG pipelines, and AI training datasets.