News Across the Web Scraper
Pricing
Pay per usage
News Across the Web Scraper
Sweep the latest stories from Semafor, Axios, Medium, The Intercept, The Verge, TechCrunch and Wired in one feed. Get headline, author, date, summary, full body and direct article link. Built for media monitoring, PR and competitive intelligence.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
ParseForge
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share

📰 News Across the Web Scraper
🚀 All-in-one bundle: queries 7 sources in one run. Semafor, Axios, Medium, The Intercept, The Verge, TechCrunch, Wired.
🕒 Last updated: 2026-05-27 · 📊 10 fields per article · 7 publications · global tech & politics coverage
News Across the Web Scraper is a single combo Actor that pulls fresh headlines from seven major publications in parallel and returns one unified dataset. Each article is tagged with its source so newsrooms, researchers, and content aggregators can build cross-publication briefings without juggling seven RSS readers.
The bundle covers Semafor (front-page), Axios (RSS), Medium (Towards Data Science feed), The Intercept (RSS), The Verge (Atom), TechCrunch (RSS), and Wired (RSS).
| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| News aggregators | Cross-publication briefings |
| Media monitoring teams | Brand and keyword surveillance |
| Researchers | Topic discovery across outlets |
| Content marketers | Trend spotting |
📋 What this Scraper does
- Hits each publication's public RSS/Atom/HTML feed in parallel.
- Extracts title, URL, author, publication date, summary, categories, image.
- Normalizes into one schema with a
sourcefield. - Optional
querysubstring filter on title and summary.
💡 Why it matters: seven feeds, one schema, one schedule - perfect for daily digests.
🎬 Full Demo
🚧 Coming soon
⚙️ Input
| Field | Type | Description |
|---|---|---|
query | string | Optional case-insensitive substring filter |
sources | array | Subset of the 7 publications |
maxItems | integer | Free: 10 · Paid: up to 1,000,000 |
proxyConfiguration | object | Optional Apify Proxy config |
{ "maxItems": 50 }
{ "query": "ai", "sources": ["techcrunch", "wired", "theverge"], "maxItems": 100 }
⚠️ Good to Know: Medium's free RSS only exposes ~10 latest articles per feed.
📊 Output
| Field | Type | Description |
|---|---|---|
🖼 imageUrl | string | Hero image when published in the feed |
🌐 source | string | Publication slug |
📌 title | string | Headline |
🔗 url | string | Article URL |
✍️ author | string | Byline |
📅 publishedAt | string | Publication date |
📝 summary | string | First 500-1000 chars of description/content |
🏷️ categories | array | Tags / categories |
🕒 scrapedAt | string | ISO timestamp |
❌ error | string | Populated only on failure |
✨ Why choose this Actor
| 📰 Seven publications, one schema | One Actor, one dataset |
| ⚡ Parallel fetch | Promise.allSettled keeps slow feeds from blocking fast ones |
| 🌍 Public data only | No accounts, no API keys |
| 📥 tabular / spreadsheet / tabular / tabular | Native Apify pull |
📈 How it compares to alternatives
| This Actor | Per-source actors | Paid news APIs | |
|---|---|---|---|
| Inputs | 1 | 7 | 1 |
| Free tier | ✅ 10 items | ✅ | ❌ |
| No API key | ✅ | ✅ | ❌ |
🚀 How to use
- Create a free account w/ $5 credit.
- Open the Actor and pick which publications to query.
- Optional: set
queryto filter by keyword. - Run and pull tabular/spreadsheet/tabular/tabular.
- Schedule hourly or daily for fresh feeds.
💼 Business use cases
Media monitoring
Track brand or executive mentions across major tech and politics outlets.
Daily briefings
Generate a single morning digest from seven publications.
Content marketing
Spot trending topics before competitors do.
Sentiment research
Build cross-publication corpora.
🔌 Automating this Scraper
Use Make, Zapier, Slack, Airbyte, GitHub Actions, Google Drive, Webhooks.
🌟 Beyond business use cases
Research
Cross-publication framing analysis.
Personal
Build your own minimalist news reader.
Non-profit
Track coverage of civic-tech and watchdog stories.
Experimentation
Train LLM summarizers on diverse outlets.
🤖 Ask an AI assistant about this scraper
Pipe the dataset into ChatGPT, Claude, Perplexity, or Copilot and ask "summarize today's AI coverage across The Verge, TechCrunch and Wired."
❓ Frequently Asked Questions
Q: Is this real-time? A: As real-time as each source's RSS update cadence.
Q: Do you store article text? A: Only the summary/description published in the feed.
Q: Can I get full article body? A: Not in this Actor; use a dedicated article-parser actor downstream.
Q: How fresh is the data? A: Fetched on each run from live feeds.
Q: Are paywalled publications included? A: Only public feeds. Paywalled bodies are not bypassed.
Q: Will you add more outlets? A: Yes - request via the contact form.
Q: Does it deduplicate? A: No, raw items per source. De-dupe downstream by url.
Q: Schema stable? A: Yes, fields are guaranteed even if null.
Q: Free limits? A: 10 items per run for free users.
Q: Languages? A: All seven feeds are English.
🔌 Integrate with any app
Make, Zapier, Slack, Airbyte, Google Drive, GitHub, Webhooks, REST API, n8n.
🔗 Recommended Actors
| Actor | What it does |
|---|---|
| Semafor Articles | Semafor only |
| Axios Articles | Axios only |
| Medium Publication Articles | Medium only |
| The Intercept Articles | The Intercept only |
| Verge / TechCrunch / Wired | Tech-trio RSS only |
💡 Pro Tip: browse the complete ParseForge collection.
🆘 Need Help? Open our contact form
⚠️ Disclaimer: independent tool, not affiliated with any publication. Only publicly available feeds are collected.