Pricing

from $22.87 / 1,000 results

GOV.UK Content Search Scraper

Scrape GOV.UK: search the entire UK government publications catalogue (policies, guidance, news, statistics). Filter by query, organisation, format or date. Returns titles, descriptions, URLs, organisations and publication dates.

Pricing

from $22.87 / 1,000 results

Rating

0.0

(0)

Developer

ParseForge

Actor stats

Bookmarked

Total users

Monthly active users

3 days ago

Last modified

🇬🇧 GOV.UK Content Search Scraper

🚀 Search the entire UK government catalogue in seconds. Pull 600,000+ pages, publications, news stories, statistics, consultations, and services straight from GOV.UK. Filter by query, organisation, format, or date. No sign-up, no manual paging, no parser to maintain.

🕒 Last updated: 2026-05-15 · 📊 11 fields per record · 📚 600,000+ pages · 🏛️ 1,400+ government bodies · 📂 130+ document formats

The GOV.UK Content Search Scraper queries the official UK government content index and returns up to 11 structured fields per record, including titles, descriptions, URLs, document formats, publication dates, and the publishing organisations and world locations. GOV.UK is the canonical home for almost every UK government publication, news story, statistical release, consultation, and citizen service.

The catalogue covers the entire central UK government estate, from the Cabinet Office and HM Treasury to HMRC, the Ministry of Defence, the Department for Transport, and over a thousand executive agencies, arms-length bodies, and tribunals. This Actor makes that data downloadable as CSV, Excel, JSON, or XML in under five minutes. Filters run server-side, so you skip the parser engineering entirely.

🎯 Target Audience	💡 Primary Use Cases
Policy analysts, regulatory and compliance teams, journalists, lobbyists, GovTech vendors, academic researchers, market-intelligence firms	Policy monitoring, regulatory horizon-scanning, consultation tracking, FOI release feeds, ministerial speech analysis, statistics release pipelines, press monitoring

📋 What the GOV.UK Content Search Scraper does

Six filtering workflows in a single run:

🔎 Free-text search. Query any keyword or phrase across titles, descriptions, and body text of every GOV.UK page.
📂 Format filter. Restrict to a single GOV.UK document format from a list of 130+ (news_story, press_release, guidance, official_statistics, consultation_outcome, statutory_guidance, FOI release, and many more).
🏛️ Organisation filter. Restrict to a single department or agency by slug (e.g. cabinet-office, hm-revenue-customs, department-for-transport).
📅 Date range filter. publishedAfter and publishedBefore scope to any window on the public timestamp.
🔢 Sort order. Relevance (default), newest first, oldest first, title A-Z, or most popular.
🌍 World locations and topical events. Returned per record so you can pivot by country or government event.

Each record includes the document title, description, canonical GOV.UK URL, content ID, format and document type, publication date, every publishing organisation (with slug and acronym), associated world locations, and topical events.

💡 Why it matters: UK government publications drive regulation, market opportunities, public-sector procurement, and the news cycle. Building your own GOV.UK pipeline means writing a paginated search client, mapping 130+ formats, joining organisation slugs, and refreshing daily. This Actor skips all of that and gives you a clean refreshed snapshot on every run.

🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded GOV.UK dataset.

⚙️ Input

Input	Type	Default	Behavior
query	string	"vehicle tax"	Free-text search across the full GOV.UK catalogue. Empty = browse all pages of the chosen format / organisation.
format	string	""	One of 130+ GOV.UK document formats. Empty = all formats.
organisation	string	""	Organisation slug (e.g. cabinet-office, hm-revenue-customs). Empty = all organisations.
publishedAfter	string	""	Earliest publication date (YYYY-MM-DD).
publishedBefore	string	""	Latest publication date (YYYY-MM-DD).
orderBy	string	""	Sort order: relevance, newest, oldest, title A-Z, or most popular.
maxItems	integer	10	Records to return. Free plan caps at 10, paid plan at 1,000,000.

Example: 50 most recent HMRC press releases.

{
    "maxItems": 50,
    "format": "press_release",
    "organisation": "hm-revenue-customs",
    "orderBy": "-public_timestamp"
}

Example: every Department for Transport publication mentioning "low traffic neighbourhood" since 2024.

{
    "maxItems": 200,
    "query": "low traffic neighbourhood",
    "organisation": "department-for-transport",
    "publishedAfter": "2024-01-01"
}

⚠️ Good to Know: GOV.UK formats evolve over time. Older content may carry a legacy format value while newer content uses documentType. Both are emitted per record so you can pivot on whichever is most useful for your analysis. The url field always returns the canonical absolute GOV.UK link.

📊 Output

Each record carries up to 11 fields. Download the dataset as CSV, Excel, JSON, or XML.

🧾 Schema

Field	Type	Example
📌 `title`	string	`"Tax your vehicle"`
📝 `description`	string	`"Renew or tax your vehicle for the first time using a reminder letter..."`
🔗 `url`	string	`"https://www.gov.uk/vehicle-tax"`
🆔 `contentId`	string	`"fa748fae-3de4-4266-ae85-0797ada3f40c"`
📂 `format`	string	`"transaction"`
📂 `documentType`	string	`"transaction"`
📅 `publishedAt`	ISO 8601	`"2017-12-07T12:54:39Z"`
🏛️ `organisations`	array	`[{"title": "Driver and Vehicle Licensing Agency", "slug": "driver-and-vehicle-licensing-agency", "acronym": "DVLA"}]`
🌍 `worldLocations`	array	`[{"title": "France", "slug": "france"}]`
🏷️ `topicalEvents`	array	`[{"title": "Spring Budget 2024", "slug": "spring-budget-2024"}]`
🕒 `scrapedAt`	ISO 8601	`"2026-05-15T18:29:40.375Z"`

📦 Sample record

✨ Why choose this Actor

	Capability
📚	Whole-of-government coverage. 600,000+ pages from 1,400+ central government bodies, agencies, and tribunals.
🎯	Multi-dimensional filters. Query, format, organisation, date range, and sort order combine freely.
🏛️	Organisation joins. Each record names every publishing body with slug and acronym for clean joins to your CRM.
⚡	Fast. 50 pages in seconds, 10,000 records in a few minutes.
🌐	Authoritative source. Cited by policy researchers, lobbyists, and regulatory teams across the UK.
🔁	Always fresh. Every run hits the live catalogue, so your dataset reflects current publications.
🚫	No authentication. Works with public open-government data. No login needed.

📊 Searchable government publications are the foundation of every regulatory horizon-scanning tool, policy newsletter, and procurement dashboard in the UK.

📈 How it compares to alternatives

Approach	Cost	Coverage	Refresh	Filters	Setup
⭐ GOV.UK Content Search Scraper (this Actor)	$5 free credit, then pay-per-use	600,000+ pages, 130+ formats	Live per run	query, format, organisation, date range, sort	⚡ 2 min
Commercial policy-monitoring platforms	$10k - $100k/year	Comparable + summaries	Daily	Many	🐢 Weeks (procurement)
RSS feeds per organisation	Free	Limited per feed	Hourly	Few	🕒 Hours (one feed at a time)
Manual GOV.UK browsing	Free	Whole site	Live	Same as the website	⏳ Forever (no automation)

Pick this Actor when you want server-side cross-organisation search, structured records, and zero pipeline maintenance.

🚀 How to use

📝 Sign up. Create a free account w/ $5 credit (takes 2 minutes).
🌐 Open the Actor. Go to the GOV.UK Content Search Scraper page on the Apify Store.
🎯 Set input. Type a query (or leave empty), pick an organisation or format, set a date window if you need one, and set maxItems.
🚀 Run it. Click Start and let the Actor collect your results.
📥 Download. Grab your dataset in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to a downloaded GOV.UK dataset: 3-5 minutes. No coding required.

💼 Business use cases

🏛️ Policy & Regulatory Monitoring

Daily horizon-scanning across every UK department
Filter by consultation_outcome to track ended consultations
Pivot on topicalEvents for budget and statement coverage
Build a structured policy-change feed for compliance teams

📰 News, PR & Communications

Press monitoring for ministerial speeches and statements
Track press releases by department for media briefings
Catch FOI releases the moment they go public
Power newsroom dashboards with structured GOV.UK feeds

📊 Statistics & Open Data

Pull every official_statistics release for analytics pipelines
Track statistics_announcement for upcoming publications
Cross-join with publishing organisation for sector views
Replace brittle RSS parsing with structured records

🛒 Public Procurement & GovTech

Monitor procurement and contract notices by organisation
Spot upcoming consultations that signal policy direction
Build vendor dashboards from corporate_report releases
Feed CRM enrichment with department slugs and acronyms

🔌 Automating GOV.UK Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

🟢 Node.js. Install the apify-client NPM package.
🐍 Python. Use the apify-client PyPI package.
📚 See the Apify API documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Hourly or daily refreshes keep downstream policy monitors and dashboards in sync automatically.

🌟 Beyond business use cases

Open government data powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

Quantitative studies of policy publication patterns
Public-administration coursework on government output
Historical archives of consultations and outcomes
Reproducible datasets for political-science research

🎨 Personal and creative

Side projects mapping departmental publication volumes
Newsletter generators that summarise weekly releases
Personal RSS-style feeds across multiple organisations
Visualisations of policy themes over time

🤝 Non-profit and civic

Civic-tech tools that surface relevant consultations to citizens
Investigative journalism on department-level disclosure patterns
Watchdog dashboards tracking ministerial communications
Accessibility projects that reformat GOV.UK content

🧪 Experimentation

Train classifiers that auto-tag policy areas
Build agent pipelines that summarise daily releases
Prototype recommender systems for citizen services
Stress-test search infrastructure with real volume data

🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

❓ Frequently Asked Questions

🧩 How does it work?

Type a query, optionally pick an organisation or format, click Start, and the Actor pages through the official GOV.UK content index, applies your filters, and emits a clean structured record per page. No browser automation, no captchas, no setup.

📏 How accurate is the data?

Every record comes from the canonical GOV.UK content index used by the gov.uk website itself, so titles, descriptions, dates, and organisation references match what you see on the page.

🔁 How often is the dataset refreshed?

GOV.UK is updated continuously as departments publish new pages. Every run of this Actor hits the live catalogue.

📂 Which document formats are supported?

130+ GOV.UK formats including news_story, press_release, guidance, official_statistics, consultation_outcome, statutory_guidance, FOI release, statistical_data_set, transparency, corporate_report, decision, and many more. Use the format filter to scope to one type.

🏛️ How do I find an organisation slug?

The slug is the last segment of the organisation page URL. For example, the Driver and Vehicle Licensing Agency lives at https://www.gov.uk/government/organisations/driver-and-vehicle-licensing-agency, so its slug is driver-and-vehicle-licensing-agency.

📅 Can I scope to a date range?

Yes. Use publishedAfter and publishedBefore (YYYY-MM-DD). Both are inclusive and run against the document's public timestamp.

⏰ Can I schedule regular runs?

Yes. Use Apify Schedules to run this Actor on any cron interval (hourly, daily, weekly) and keep downstream policy monitors in sync.

⚖️ Is this data legal to use?

GOV.UK content is published under the Open Government Licence v3.0, which permits commercial reuse with attribution. Review the licence terms for your specific application.

💼 Can I use this data commercially?

Yes. Open Government Licence v3.0 explicitly allows commercial reuse with attribution. You remain responsible for following the licence terms in your product.

💳 Do I need a paid Apify plan to use this Actor?

No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you scheduling, higher concurrency, and larger datasets.

🔁 What happens if a run fails or gets interrupted?

Apify automatically retries transient errors. If a run still fails, you can inspect the log in the Runs tab, fix the input, and re-run. Partial datasets from failed runs are preserved so you never lose progress.

🆘 What if I need help?

Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.

🔌 Integrate with any app

GOV.UK Content Search Scraper connects to any cloud service via Apify integrations:

Make - Automate multi-step monitoring workflows
Zapier - Connect with 5,000+ apps
Slack - Get new-publication alerts in your channels
Airbyte - Pipe GOV.UK pages into your warehouse
GitHub - Trigger runs from commits and releases
Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh policy publications into your CRM, or alert your communications team in Slack.

🔗 Recommended Actors

🏛️ UK Parliament Members Scraper - MPs and Lords with biographies, committees, and contact details
🗣️ Hansard UK Parliament Debates Scraper - Full transcripts of Commons and Lords debates
🛡️ OpenSanctions Sanctions & PEP Scraper - 280k+ sanctioned entities and PEPs
⚡ Carbon Intensity UK Scraper - National Grid carbon intensity forecasts
🚂 OurAirports Global Airport Database Scraper - 85,000+ airports worldwide

💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.

🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.

⚠️ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the UK Government, the Government Digital Service, or any UK department. All trademarks mentioned are the property of their respective owners. Only publicly available open-government content is collected, under the Open Government Licence.

GOV UK Companies Register

harvestedge/gov-uk-companies-register

The UK Companies Scraper for UK Chamber of Commerce. Ideal for lead generation, compliance checks, KYC enrichment, and UK company intelligence workflows. GOV.UK

Harvest Edge

Data.gov.uk Scraper

parseforge/data-gov-uk-scraper

Collect UK government open data effortlessly. Extract datasets, publishers, formats, topics, licenses, and download links from data.gov.uk — the official UK open data portal. Perfect for researchers, policy analysts, and developers building data catalogs.

ParseForge

5.0

Data.gov.uk Scraper - Low-cost💲🔥📚🇬🇧

delectable_incubator/data-gov-uk-scraper-low-cost

Scrape data.gov.uk dataset listings 🔎📊 with a powerful open data scraper. Extract dataset titles, publishers, update dates, descriptions, tags, and dataset URLs from search results. Ideal for government data monitoring, open data research, dataset discovery, and structured data catalog creation 🚀

Prime Scrape

Company House GOV.UK

nocodeventure/company-house-gov-uk

Get all the information you need about UK companies in seconds! This scraper grabs data from the official Companies House website - that's where the UK government keeps records of every company.

No-Code Venture

Company House GOV.UK (PPE)

nocodeventure/company-house-gov-uk-ppe

Get all the information you need about UK companies in seconds! This scraper grabs data from the official Companies House website - that's where the UK government keeps records of every company.

No-Code Venture

UK Companies House

artificially/uk-companies-house

Search and extract company data from UK Companies House. Get company details, officers, filing history, and more from the official UK government registry.

Artificially

UK Contracts Finder Scraper

crawlerbros/uk-contracts-finder-scraper

Scrape the UK Government Contracts Finder - a free public database of government procurement contracts and tenders. Search by keyword, filter by status or date, and extract contract titles, buyers, values, dates, and descriptions.

Crawler Bros

Legislation.gov.uk Scraper

crawlerbros/legislation-gov-uk-scraper

Scrape UK statute law from legislation.gov.uk. Fetch specific Acts and Statutory Instruments, browse by type and year, get the newest legislation, or search by title. Returns clean metadata plus official XML/PDF links.

Crawler Bros

Reed.co.uk Jobs Scraper - UK Job Listings

parseforge/reed-co-uk-scraper

Scrape UK jobs from Reed.co.uk by keyword, location, salary, sector, contract type, remote option or date posted. Returns title, employer, salary, full description and application URL.

ParseForge

Indeed UK Jobs Scraper — Full Descriptions & Salary

totaka/indeed-uk-jobs-scraper

Scrape job listings from Indeed UK (uk.indeed.com) — job title, company, location, salary, and full description. Ideal for UK recruitment intelligence and salary benchmarking.

Thomas Gharbi

GOV.UK Content Search Scraper

🇬🇧 GOV.UK Content Search Scraper

📋 What the GOV.UK Content Search Scraper does

🎬 Full Demo

⚙️ Input

📊 Output

🧾 Schema

📦 Sample record

✨ Why choose this Actor

📈 How it compares to alternatives

🚀 How to use

💼 Business use cases

🏛️ Policy & Regulatory Monitoring

📰 News, PR & Communications

📊 Statistics & Open Data

🛒 Public Procurement & GovTech

🔌 Automating GOV.UK Scraper

🌟 Beyond business use cases

🎓 Research and academia

🎨 Personal and creative

🤝 Non-profit and civic

🧪 Experimentation

🤖 Ask an AI assistant about this scraper

❓ Frequently Asked Questions

🧩 How does it work?

📏 How accurate is the data?

🔁 How often is the dataset refreshed?

📂 Which document formats are supported?

🏛️ How do I find an organisation slug?

📅 Can I scope to a date range?

⏰ Can I schedule regular runs?

⚖️ Is this data legal to use?

💼 Can I use this data commercially?

💳 Do I need a paid Apify plan to use this Actor?

🔁 What happens if a run fails or gets interrupted?

🆘 What if I need help?

🔌 Integrate with any app

🔗 Recommended Actors

You might also like

GOV UK Companies Register

Data.gov.uk Scraper

Data.gov.uk Scraper - Low-cost💲🔥📚🇬🇧

Company House GOV.UK

Company House GOV.UK (PPE)

UK Companies House

UK Contracts Finder Scraper

Legislation.gov.uk Scraper

Reed.co.uk Jobs Scraper - UK Job Listings

Indeed UK Jobs Scraper — Full Descriptions & Salary