PR Newswire Scraper avatar

PR Newswire Scraper

Pricing

from $2.00 / 1,000 results

Go to Apify Store
PR Newswire Scraper

PR Newswire Scraper

Extract press releases from PR Newswire by keyword, company, industry, or date range. Get full text, publication dates, media contacts, and more.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

Harish Garg

Harish Garg

Maintained by Community

Actor stats

1

Bookmarked

3

Total users

2

Monthly active users

19 hours ago

Last modified

Share

Search and export press releases from PR Newswire in seconds. This Actor gives you a full-text search engine over a continuously updated index of PR Newswire press releases, with date filtering, BM25 relevance ranking, and clean structured output ready for CSV, Excel, JSON, or API consumption.

Use it to monitor competitors, track M&A and earnings announcements, build financial datasets, power media monitoring tools, or feed downstream LLM pipelines with grounded, sourceable press release data.

Why this Actor

Get clean, structured press releases from PR Newswire in seconds — searchable by keyword, filterable by date, and ready to drop into a spreadsheet, database, or LLM pipeline.

  • Press releases, on demand. Pull the latest releases, a date range, or every release matching a keyword — without running a browser or parsing HTML yourself.
  • Search like Google, not like grep. Prefix matching, diacritic folding, and BM25 ranking mean cafe merger finds Café Holdings announces merger without you wrestling with regex.
  • Title-weighted relevance. Headline matches rank 10× higher than body matches, so the most relevant releases surface first.
  • Date range filtering. Pull every release between any two dates with one parameter — perfect for quarterly research, event windows, or backfilling datasets.
  • Structured output. Every row is clean JSON with stable field names. Pipe straight into Sheets, BigQuery, a vector DB, or your own ETL.
  • No scraping fragility on your side. You don't run a browser, manage proxies, or parse HTML. You query an index. Runs finish in seconds, not hours.

Common use cases

  • Competitive intelligence — track product launches, leadership changes, and partnership announcements across an industry.
  • Financial & investor research — surface earnings, guidance updates, M&A activity, and IPO filings on a date range.
  • Media monitoring & PR analytics — measure share-of-voice for a brand, executive, or topic over time.
  • Dataset building for AI/ML — bulk-export press releases as a labeled corpus for fine-tuning, RAG, or sentiment analysis.
  • Lead generation — find companies announcing funding rounds, expansions, or hiring sprees.
  • Compliance & legal discovery — pull every public statement from a company in a defined window.

Output: what you get back

Each row in the dataset contains:

FieldDescription
idInternal press release ID
urlCanonical PR Newswire URL
titlePress release headline
pubDatePublication date (ISO YYYY-MM-DD)
timestampPublication timestamp as scraped from the source
bodyFull press release body text

Export with one click as JSON, CSV, Excel, XML, RSS, or HTML from the Apify dataset view, or pull programmatically via the Apify API.

Input parameters

All fields are optional — combine them freely.

FieldTypeDescription
querystringFull-text search across title and content. Tokens are prefix-matched and AND-combined. Leave empty to browse by date.
titleOnlybooleanRestrict query to the title column. Default false.
fromDatestringLower bound for pubDate (inclusive), YYYY-MM-DD.
toDatestringUpper bound for pubDate (inclusive), YYYY-MM-DD.
maxRecordsintegerCap on rows returned. Default 100, max 1000.

Example queries

Browse the 50 most recent press releases:

{ "maxRecords": 50 }

Search for earnings news in 2026:

{ "query": "earnings", "fromDate": "2026-01-01", "maxRecords": 20 }

Find M&A headlines only (title match):

{ "query": "merger acquisition", "titleOnly": true, "maxRecords": 10 }

Pull every release mentioning a company in Q1:

{ "query": "tesla", "fromDate": "2026-01-01", "toDate": "2026-03-31", "maxRecords": 1000 }

Get full body text for downstream LLM/RAG use:

{ "query": "guidance raised", "maxRecords": 25 }

How search works

  • No query → results sorted by pubDate descending (newest first).
  • With query → results sorted by BM25 relevance, with title hits weighted 10× heavier than body hits.
  • Prefix matchingtrump rally matches rows containing both trump* and rally*, in any order, in any column.
  • Diacritic-insensitivecafe matches Café. The tokenizer is unicode61 with diacritic folding.
  • Date filters apply to the ISO pubDate field; rows with a missing publication date are excluded when a date filter is set.

Limits & pricing

  • maxRecords is capped at 1,000 rows per run. To pull larger archives, slice by date range across multiple runs.
  • Runs are typically sub-second on the index — you pay for compute time, which is minimal.

FAQ

Can I integrate this with my app? Yes — every Apify Actor exposes a REST API and webhooks. Trigger runs from your backend and read results from the dataset endpoint.


Questions, feature requests, or want a custom actor? Reach out via the Apify Console — feedback drives the roadmap.