PR Newswire Scraper
Pricing
from $2.00 / 1,000 results
PR Newswire Scraper
Extract press releases from PR Newswire by keyword, company, industry, or date range. Get full text, publication dates, media contacts, and more.
Pricing
from $2.00 / 1,000 results
Rating
0.0
(0)
Developer
Harish Garg
Actor stats
1
Bookmarked
3
Total users
2
Monthly active users
19 hours ago
Last modified
Categories
Share
Search and export press releases from PR Newswire in seconds. This Actor gives you a full-text search engine over a continuously updated index of PR Newswire press releases, with date filtering, BM25 relevance ranking, and clean structured output ready for CSV, Excel, JSON, or API consumption.
Use it to monitor competitors, track M&A and earnings announcements, build financial datasets, power media monitoring tools, or feed downstream LLM pipelines with grounded, sourceable press release data.
Why this Actor
Get clean, structured press releases from PR Newswire in seconds — searchable by keyword, filterable by date, and ready to drop into a spreadsheet, database, or LLM pipeline.
- Press releases, on demand. Pull the latest releases, a date range, or every release matching a keyword — without running a browser or parsing HTML yourself.
- Search like Google, not like grep. Prefix matching, diacritic folding, and BM25 ranking mean
cafe mergerfindsCafé Holdings announces mergerwithout you wrestling with regex. - Title-weighted relevance. Headline matches rank 10× higher than body matches, so the most relevant releases surface first.
- Date range filtering. Pull every release between any two dates with one parameter — perfect for quarterly research, event windows, or backfilling datasets.
- Structured output. Every row is clean JSON with stable field names. Pipe straight into Sheets, BigQuery, a vector DB, or your own ETL.
- No scraping fragility on your side. You don't run a browser, manage proxies, or parse HTML. You query an index. Runs finish in seconds, not hours.
Common use cases
- Competitive intelligence — track product launches, leadership changes, and partnership announcements across an industry.
- Financial & investor research — surface earnings, guidance updates, M&A activity, and IPO filings on a date range.
- Media monitoring & PR analytics — measure share-of-voice for a brand, executive, or topic over time.
- Dataset building for AI/ML — bulk-export press releases as a labeled corpus for fine-tuning, RAG, or sentiment analysis.
- Lead generation — find companies announcing funding rounds, expansions, or hiring sprees.
- Compliance & legal discovery — pull every public statement from a company in a defined window.
Output: what you get back
Each row in the dataset contains:
| Field | Description |
|---|---|
id | Internal press release ID |
url | Canonical PR Newswire URL |
title | Press release headline |
pubDate | Publication date (ISO YYYY-MM-DD) |
timestamp | Publication timestamp as scraped from the source |
body | Full press release body text |
Export with one click as JSON, CSV, Excel, XML, RSS, or HTML from the Apify dataset view, or pull programmatically via the Apify API.
Input parameters
All fields are optional — combine them freely.
| Field | Type | Description |
|---|---|---|
query | string | Full-text search across title and content. Tokens are prefix-matched and AND-combined. Leave empty to browse by date. |
titleOnly | boolean | Restrict query to the title column. Default false. |
fromDate | string | Lower bound for pubDate (inclusive), YYYY-MM-DD. |
toDate | string | Upper bound for pubDate (inclusive), YYYY-MM-DD. |
maxRecords | integer | Cap on rows returned. Default 100, max 1000. |
Example queries
Browse the 50 most recent press releases:
{ "maxRecords": 50 }
Search for earnings news in 2026:
{ "query": "earnings", "fromDate": "2026-01-01", "maxRecords": 20 }
Find M&A headlines only (title match):
{ "query": "merger acquisition", "titleOnly": true, "maxRecords": 10 }
Pull every release mentioning a company in Q1:
{ "query": "tesla", "fromDate": "2026-01-01", "toDate": "2026-03-31", "maxRecords": 1000 }
Get full body text for downstream LLM/RAG use:
{ "query": "guidance raised", "maxRecords": 25 }
How search works
- No query → results sorted by
pubDatedescending (newest first). - With query → results sorted by BM25 relevance, with title hits weighted 10× heavier than body hits.
- Prefix matching —
trump rallymatches rows containing bothtrump*andrally*, in any order, in any column. - Diacritic-insensitive —
cafematchesCafé. The tokenizer isunicode61with diacritic folding. - Date filters apply to the ISO
pubDatefield; rows with a missing publication date are excluded when a date filter is set.
Limits & pricing
maxRecordsis capped at 1,000 rows per run. To pull larger archives, slice by date range across multiple runs.- Runs are typically sub-second on the index — you pay for compute time, which is minimal.
FAQ
Can I integrate this with my app? Yes — every Apify Actor exposes a REST API and webhooks. Trigger runs from your backend and read results from the dataset endpoint.
Questions, feature requests, or want a custom actor? Reach out via the Apify Console — feedback drives the roadmap.