Pricing

from $23.63 / 1,000 results

Hansard UK Parliament Debates Scraper

Export the official transcripts of UK Parliament debates and speeches from Hansard. Filter by House (Commons or Lords), search term, member, and date range. Each record includes the full speech text, speaker, debate section, and a permalink to the official Hansard transcript.

Developer

ParseForge

🗣️ Hansard UK Parliament Debates Scraper

🚀 Export UK Parliament debate transcripts in seconds. Pull every spoken contribution from the House of Commons and House of Lords, filtered by topic, member, date, or department. Each record is a clean structured speech with full text, speaker, debate section, and a permalink to the official Hansard transcript. No sign-up, no manual paging, no parser to maintain.

🕒 Last updated: 2026-05-15 · 📊 17 fields per record · 🗣️ Millions of contributions · 📜 200+ years of debates · 🇬🇧 Both Houses

The Hansard UK Parliament Debates Scraper queries the official Hansard transcript catalogue and returns up to 17 structured fields per record, including the contribution ID, speaker name and member ID, House, debate section, sitting date, full speech text, word count, ordering metadata, and a deep permalink back to the official Hansard page.

The catalogue covers the official record of every spoken contribution in the UK Parliament, including ministerial statements, backbench speeches, oral questions, urgent questions, and full debates. Hansard has recorded the proceedings of the UK Parliament since 1803 and is the canonical record cited by historians, journalists, and political researchers.

🎯 Target Audience	💡 Primary Use Cases
Political analysts and researchers, journalists, NLP and machine-learning teams, public-affairs and lobbying firms, civic-tech projects, academic political scientists, content creators	Speech and rhetoric analysis, member voting-context research, topic mining, NLP training corpora, ministerial statement monitoring, lobbyist due diligence, civic-tech transparency tools

📋 What the Hansard UK Debates Scraper does

Six filtering workflows in a single run:

🔎 Free-text search. Match a keyword or phrase across every spoken contribution (e.g. "climate change", "NHS funding", "AUKUS").
🏛️ House filter. Restrict to House of Commons, House of Lords, or both.
👤 Member filter. Substring match on the speaker name (e.g. "Keir Starmer", "Lord Hannan").
📅 Date range. Scope to any sitting-date window with startDate and endDate.
🏛️ Department filter. Substring match on the responsible government department (e.g. "Treasury", "Department for Education").
🔢 Page-driven sample. Pull the latest contributions across all topics when no query is set.

Each record includes the contribution ID, the member's name and (when available) their member ID, the House, the section ("Commons Chamber", "Westminster Hall", etc.), the debate section title, the Hansard internal section code, the sitting date, the timecode of the contribution, the full speech text (HTML preserved), a word count, the order in the debate, the paragraph tag, and a deep permalink back to the official Hansard transcript page.

💡 Why it matters: Hansard transcripts power policy analysis, NLP corpora, civic transparency, and political journalism. Building your own pipeline means writing a paginated client, mapping debate identifiers to permalinks, normalising HTML across sittings, and refreshing daily. This Actor skips all of that and gives you a clean refreshed snapshot on every run.

🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded Hansard dataset.

⚙️ Input

Input	Type	Default	Behavior
searchTerm	string	""	Keyword or phrase to search across UK Parliament transcripts. Empty = most recent contributions across all topics.
house	string	"Both"	One of Both, Commons, or Lords.
memberName	string	""	Substring match on the speaker name.
startDate	string	""	Earliest sitting date (YYYY-MM-DD).
endDate	string	""	Latest sitting date (YYYY-MM-DD).
department	string	""	Substring match on the responsible department.
maxItems	integer	10	Records to return. Free plan caps at 10, paid plan at 1,000,000.

Example: every Commons contribution mentioning "Heathrow" since 2026-01-01.

{
    "maxItems": 200,
    "searchTerm": "Heathrow",
    "house": "Commons",
    "startDate": "2026-01-01"
}

Example: latest 50 contributions by Keir Starmer in either House.

{
    "maxItems": 50,
    "memberName": "Keir Starmer"
}
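
The two inputs above can also be built in code. The following is a minimal Python sketch, not part of the Actor: `build_input` and `parse_sitting_date` are hypothetical helper names, and the check that `startDate` is not after `endDate` is an assumed client-side sanity check rather than documented Actor behaviour. The field names and the allowed `house` values come from the Input table.

```python
import re
from datetime import date

VALID_HOUSES = ("Both", "Commons", "Lords")
DATE_PATTERN = re.compile(r"\d{4}-\d{2}-\d{2}")


def parse_sitting_date(value):
    """Parse a strict YYYY-MM-DD string, raising ValueError otherwise."""
    if not DATE_PATTERN.fullmatch(value):
        raise ValueError(f"expected YYYY-MM-DD, got {value!r}")
    # fromisoformat also rejects impossible dates such as 2026-02-30.
    return date.fromisoformat(value)


def build_input(search_term="", house="Both", member_name="",
                start_date="", end_date="", department="", max_items=10):
    """Return an input dict keyed by the field names in the Input table."""
    if house not in VALID_HOUSES:
        raise ValueError(f"house must be one of {VALID_HOUSES}")
    start = parse_sitting_date(start_date) if start_date else None
    end = parse_sitting_date(end_date) if end_date else None
    # ASSUMPTION: an inverted window is treated as a caller mistake.
    if start and end and start > end:
        raise ValueError("startDate must not be after endDate")
    candidate = {
        "searchTerm": search_term,
        "house": house,
        "memberName": member_name,
        "startDate": start_date,
        "endDate": end_date,
        "department": department,
    }
    # Drop empty strings so the documented defaults apply.
    run_input = {key: value for key, value in candidate.items() if value}
    run_input["maxItems"] = max_items
    return run_input
```

Calling `build_input(search_term="Heathrow", house="Commons", start_date="2026-01-01", max_items=200)` yields a dict equivalent to the first example above.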

⚠️ Good to Know: the text field preserves the original Hansard markup, including column-number <span> tags and inline subscripts. That keeps the record faithful to the official transcript. If you need plain text, strip the HTML downstream.
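
One way to do that downstream step is with the Python standard library alone. This is a sketch under simple assumptions: `strip_hansard_html` is a hypothetical helper name, it removes tags but keeps all text inside them, and it collapses runs of whitespace into single spaces.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect only the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__(convert_charrefs=True)  # decode entities like &amp;
        self._chunks = []

    def handle_data(self, data):
        self._chunks.append(data)

    def text(self):
        # Join without separators so "CO<sub>2</sub>" stays "CO2",
        # then normalise whitespace.
        return " ".join("".join(self._chunks).split())


def strip_hansard_html(html_text):
    """Return the plain text of a `text` field value."""
    parser = _TextExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.text()
```

Because only the tags are dropped, any text inside a column-number span stays in the output. Filtering it out would require checking how those spans are marked up in real records.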

📊 Output

Each contribution record carries up to 17 fields. Download the dataset as CSV, Excel, JSON, or XML.

🧾 Schema

Field	Type	Example
🆔 `contributionId`	string	`"0D8CEA45-19F1-4BF6-83D3-6688C26C01B9"`
👤 `memberName`	string	`"Sarah Olney"`
👤 `attributedTo`	string	`"Sarah Olney"`
🆔 `memberId`	number	`4591`
🏛️ `house`	string	`"Commons"`
📂 `section`	string	`"Commons Chamber"`
📂 `debateSection`	string	`" Heathrow Airport: Third Runway"`
🆔 `debateSectionId`	string	`"15106B6A-3101-426D-89E3-0544452BD096"`
📂 `hansardSection`	string	`"CP-CR1"`
📅 `sittingDate`	string (YYYY-MM-DD)	`"2026-05-14"`
🕒 `timecode`	string	`"2026-05-14T15:03:57"`
📝 `text`	string	`"The hon. Gentleman is absolutely right that we need to see the economic case..."`
🔢 `wordCount`	number	`806`
🔢 `orderInDebate`	number	`6`
🏷️ `paragraphTag`	string	`"hs_Para"`
🔗 `url`	string	`"https://hansard.parliament.uk/Commons/2026-05-14/debates/.../HeathrowAirport%3AThirdRunway#contribution-..."`
🕒 `scrapedAt`	string (ISO 8601)	`"2026-05-15T20:10:51.113Z"`

📦 Sample record
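
The record below is assembled from the example values in the Schema table, so it illustrates the shape of a row rather than reproducing a real export. The elisions in `text` and `url` are the table's own. The `Contribution` dataclass and `parse_contribution` are hypothetical names for a typed wrapper you might write yourself. Treating `memberId` as optional follows the "when available" note above.

```python
from dataclasses import dataclass
from datetime import date, datetime
from typing import Optional

SAMPLE_RECORD = {
    "contributionId": "0D8CEA45-19F1-4BF6-83D3-6688C26C01B9",
    "memberName": "Sarah Olney",
    "attributedTo": "Sarah Olney",
    "memberId": 4591,
    "house": "Commons",
    "section": "Commons Chamber",
    "debateSection": " Heathrow Airport: Third Runway",
    "debateSectionId": "15106B6A-3101-426D-89E3-0544452BD096",
    "hansardSection": "CP-CR1",
    "sittingDate": "2026-05-14",
    "timecode": "2026-05-14T15:03:57",
    "text": "The hon. Gentleman is absolutely right that we need to see the economic case...",
    "wordCount": 806,
    "orderInDebate": 6,
    "paragraphTag": "hs_Para",
    "url": "https://hansard.parliament.uk/Commons/2026-05-14/debates/.../HeathrowAirport%3AThirdRunway#contribution-...",
    "scrapedAt": "2026-05-15T20:10:51.113Z",
}


@dataclass(frozen=True)
class Contribution:
    contribution_id: str
    member_name: str
    member_id: Optional[int]
    house: str
    debate_section: str
    sitting_date: date
    spoken_at: datetime
    scraped_at: datetime
    word_count: int
    url: str


def parse_contribution(record):
    """Convert one dataset row into a typed Contribution."""
    return Contribution(
        contribution_id=record["contributionId"],
        member_name=record["memberName"],
        member_id=record.get("memberId"),  # may be missing
        house=record["house"],
        debate_section=record["debateSection"].strip(),
        sitting_date=date.fromisoformat(record["sittingDate"]),
        spoken_at=datetime.fromisoformat(record["timecode"]),
        # Swap the trailing "Z" for an explicit offset so that older
        # Python versions can parse the timestamp.
        scraped_at=datetime.fromisoformat(record["scrapedAt"].replace("Z", "+00:00")),
        word_count=record["wordCount"],
        url=record["url"],
    )
```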

✨ Why choose this Actor

	Capability
🗣️	Both Houses, full text. Spoken contributions from Commons and Lords with the complete speech body.
🎯	Multi-dimensional filters. Search term, House, member, date range, and department combine freely.
🔗	Permalinks per row. Every record links back to the canonical Hansard page anchor for citation.
📜	Historic depth. Indexed transcripts spanning decades of UK parliamentary debate.
⚡	Fast. 100 contributions in seconds, 10,000 records in a few minutes.
🔁	Always fresh. Every run hits the live transcript catalogue, so the dataset reflects the latest sittings.
🚫	No authentication. Public open-government data. No login needed.

📊 Searchable Hansard transcripts are the foundation of every political-journalism dashboard, NLP corpus on UK politics, and lobbyist briefing pack.

📈 How it compares to alternatives

Approach	Cost	Coverage	Refresh	Filters	Setup
⭐ Hansard UK Debates Scraper (this Actor)	$5 free credit, then pay-per-use	Both Houses, full text	Live per run	search term, House, member, date, department	⚡ 2 min
Commercial parliamentary monitoring	$10k - $80k/year	Comparable + voting records	Daily	Many	🐢 Weeks (procurement)
TheyWorkForYou scraping	Free	Commons-leaning, derived	Daily	Few	🕒 Days
Manual hansard.parliament.uk browsing	Free	Whole catalogue	Live	Site-side	⏳ Forever

Pick this Actor when you want structured speech-level records with permalinks and zero pipeline maintenance.

🚀 How to use

📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
🌐 Open the Actor. Go to the Hansard UK Parliament Debates Scraper page on the Apify Store.
🎯 Set input. Type a search term or member name, optionally pick a House and date range, and set maxItems.
🚀 Run it. Click Start and let the Actor collect your contributions.
📥 Download. Grab your dataset in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to a downloaded Hansard dataset: 3-5 minutes. No coding required.

💼 Business use cases

🏛️ Public Affairs & Lobbying

Monitor every mention of your client's industry across Parliament
Build briefing packs from a member's recent contributions
Track ministerial statements by department
Surface debate momentum on niche policy areas

📰 Political Journalism

Search every Commons speech for a specific quote
Trace how a policy has been debated over years
Build interactive dashboards of speech topics
Cross-reference Hansard with member directory data

🤖 NLP & Machine Learning

Train domain-specific UK political language models
Build topic-classification corpora with member metadata
Sentiment analysis on individual MPs over time
Question-answering systems that cite primary sources

📊 Civic Analytics & Research

Quantitative speech-pattern studies for academic papers
Word-frequency tracking on policy themes per session
Comparative analysis of Commons vs Lords language
Public dashboards showing debate volume by topic
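
As an illustration of the analytics use cases above, debate volume can be summarised from the `wordCount` field with a few lines of Python. This is a sketch only: `summarise_by` is a hypothetical helper, and grouping rows that lack the chosen field under "(unknown)" is an assumption made for the example.

```python
from collections import defaultdict


def summarise_by(records, key):
    """Return {group: (contribution_count, total_words)} for one field."""
    totals = defaultdict(lambda: [0, 0])
    for record in records:
        group = record.get(key) or "(unknown)"
        totals[group][0] += 1
        totals[group][1] += record.get("wordCount") or 0
    return {group: tuple(counts) for group, counts in totals.items()}
```

Pass `"house"` to compare Commons and Lords, `"memberName"` for per-speaker volume, or `"debateSection"` for volume per debate.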

🔌 Automating Hansard Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

🟢 Node.js. Install the apify-client NPM package.
🐍 Python. Use the apify-client PyPI package.
📚 See the Apify API documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Hourly or daily refreshes keep your political monitoring dashboards in sync with each new sitting.
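
A programmatic run with the Python client might look like the sketch below. It assumes a version of `apify-client` in which `call()` returns the run as a dict, and it reads the API token from an `APIFY_TOKEN` environment variable. The Actor ID is a placeholder because this page does not state it, so copy the real one from the Store page. The helper names are hypothetical.

```python
import os


def fetch_contributions(client, actor_id, run_input):
    """Start the Actor, wait for it to finish, and return its dataset rows."""
    run = client.actor(actor_id).call(run_input=run_input)
    if not run:
        raise RuntimeError("the Actor run did not return a run object")
    dataset = client.dataset(run["defaultDatasetId"])
    return list(dataset.iterate_items())


def make_client():
    """Build a real client; requires `pip install apify-client`."""
    from apify_client import ApifyClient
    return ApifyClient(os.environ["APIFY_TOKEN"])


def main():
    # ASSUMPTION: placeholder ID, replace with the Actor's real ID.
    actor_id = "<username>/<actor-name>"
    rows = fetch_contributions(
        make_client(),
        actor_id,
        {"searchTerm": "Heathrow", "house": "Commons", "maxItems": 50},
    )
    for row in rows:
        print(row["sittingDate"], row["memberName"], row["url"])
```

Passing the client in as an argument keeps `fetch_contributions` easy to test with a stub, without network access.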

🌟 Beyond business use cases

Open parliamentary transcripts power more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

Quantitative discourse analysis for political-science theses
Coursework on parliamentary procedure and rhetoric
Reproducible debate corpora for NLP research papers
Historical archives of policy framing over decades

🎨 Personal and creative

Side projects that visualise speech patterns by party
Newsletters that summarise yesterday's notable contributions
Word clouds of a session's most-debated topics
Hobbyist explorations of parliamentary humour

🤝 Non-profit and civic

Civic-tech tools that surface debates on a topic to citizens
Watchdog dashboards tracking member contribution rates
Investigative journalism on lobbying-aligned speeches
Accessibility projects that simplify parliamentary language

🧪 Experimentation

Train summarisation models on parliamentary debates
Build agent pipelines that brief journalists on yesterday's speeches
Prototype semantic-search tools across decades of debate
Stress-test NLP infrastructure with real, long-form text

❓ Frequently Asked Questions

🧩 How does it work?

Type a search term, optionally pick a member or date window, click Start, and the Actor pages through the official Hansard transcript catalogue, applies your filters, and emits a clean structured row per spoken contribution. No browser automation, no captchas, no setup.

📏 How accurate is the data?

Every record comes from the official Hansard catalogue used by hansard.parliament.uk itself, so the speech text, member, and debate references match the canonical record line for line.

🔁 How often is the dataset refreshed?

Hansard is updated as sittings are transcribed and published, typically within hours of a debate. Every run hits the live catalogue.

🏛️ Does it cover both Houses?

Yes. Set house to Both (default), Commons, or Lords.

👤 Can I get every speech by a single MP or peer?

Yes. Set the memberName filter to a substring of their name (e.g. "Starmer", "Lord Hannan"). Combine with startDate and endDate for a session-bounded view.

📅 How far back does the catalogue go?

The official Hansard archive runs back to the early 19th century, with full digital coverage of recent sessions and increasingly complete coverage going back decades.

📝 Why does the speech text contain HTML tags?

The text field preserves Hansard's original markup (column-number <span>s, inline subscripts, paragraph anchors) so the record stays faithful to the official transcript. Strip HTML downstream if you need plain text.

⏰ Can I schedule regular runs?

Yes. Use Apify Schedules to run this Actor on any cron interval (hourly during sittings, daily otherwise) and keep your political monitoring dashboards in sync.

⚖️ Is this data legal to use?

Hansard transcripts are published under the Open Parliament Licence, which permits commercial reuse with attribution. Review the licence terms for your specific application.

💼 Can I use this data commercially?

Yes. The Open Parliament Licence explicitly allows commercial reuse with attribution. You remain responsible for following the licence in your product.

💳 Do I need a paid Apify plan to use this Actor?

No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you scheduling, higher concurrency, and larger datasets.

🆘 What if I need help?

Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.

🔌 Integrate with any app

Hansard UK Debates Scraper connects to any cloud service via Apify integrations:

Make - Automate multi-step monitoring workflows
Zapier - Connect with 5,000+ apps
Slack - Get debate-mention alerts in your channels
Airbyte - Pipe transcripts into your warehouse
GitHub - Trigger runs from commits and releases
Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh transcripts into your NLP pipeline, or alert your political-research team in Slack.
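
On the receiving side, a webhook handler mostly needs the ID of the dataset the finished run wrote to. The sketch below assumes Apify's default webhook payload, in which `eventType` names the event and `resource` holds the run object. If you configure a custom payload template, adjust the keys. `dataset_id_from_webhook` is a hypothetical helper name.

```python
import json


def dataset_id_from_webhook(body):
    """Return the dataset ID from a succeeded-run webhook body, else None."""
    payload = json.loads(body)
    if payload.get("eventType") != "ACTOR.RUN.SUCCEEDED":
        return None  # ignore failed, aborted, or timed-out runs
    resource = payload.get("resource") or {}
    return resource.get("defaultDatasetId")
```

The returned ID can then be used to download the new rows and forward them to your NLP pipeline or a Slack alert.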

🔗 Recommended Actors

🏛️ UK Parliament Members Scraper - MPs and Lords with biographies, committees, and contact details
🇬🇧 GOV.UK Content Search Scraper - Search the entire UK government publications catalogue
🛡️ OpenSanctions Sanctions & PEP Scraper - Sanctioned entities and politically exposed persons
⚡ Carbon Intensity UK Scraper - National Grid carbon intensity feed
📰 GovTrack U.S. Congress Scraper - U.S. legislative bills and votes

💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.

🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.

⚠️ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the UK Parliament, the House of Commons, the House of Lords, or the Hansard Society. All trademarks mentioned are the property of their respective owners. Only publicly available open Hansard transcript data is collected, under the Open Parliament Licence.
