📰 Crunchbase News Scraper — Daily Funding & M&A Headlines
Pricing
from $100.00 / 1,000 news articles
📰 Crunchbase News Scraper — Daily Funding & M&A Headlines
Daily VC funding rounds, M&A, IPO, startup news from Crunchbase News. Structured JSON with entity extraction, funding amount parsing, round-type classification. Bloomberg / Reuters Eikon / Refinitiv / Mergermarket / TechCrunch alternative. Pay-per-article.
Pricing
from $100.00 / 1,000 news articles
Rating
0.0
(0)
Developer
NexGenData
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
10 days ago
Last modified
Share
VC funding rounds. Acquisitions. IPOs. Quarterly recaps. Startup launches. One Actor, the full firehose from
news.crunchbase.com— the editorial side of Crunchbase, not the locked-down data platform. Pay-per-article, no $30K/seat Bloomberg or $20K/year Mergermarket contract.
📊 Sample Output
▶️ Try this Actor → — first article on us, $0.10 per article after.
🎯 Who this is for
- VC associates running daily intel — every Series A/B/C ann, every M&A whisper, in one structured JSON feed.
- Sales / BD teams chasing founders who just raised. Trigger an outreach the morning the round is announced.
- Corporate development tracking acquisitions in their sector — see who Adobe, Salesforce, Cisco, Snowflake just bought.
- Investor relations running competitive intel for their portfolio companies.
- Journalists & analysts building newsletters, Substacks, sector reports.
- Quant funds wiring "headline alpha" feeds into event-driven strategies.
If you've ever paid $1,800/year for Term Sheet, $20K/year for Mergermarket, or $24K/seat for Reuters Eikon just to get the same headlines that Crunchbase News publishes for free — this Actor turns that page into structured rows with entity extraction, funding amount parsing, and round-type classification.
💸 Pricing
| What you pay | When |
|---|---|
| $0.01 actor start | Once per run (covers compute kickoff) |
| $0.10 per article | Each article record pushed to dataset (the primary unit) |
A typical daily intel run pulling the last 24 hours across all categories (~15 articles) is $1.51. A weekly bulk run pulling 100 articles is $10.01. A monthly back-fill of 500 articles is $50.01.
That's <1% of what a Bloomberg Terminal seat costs to surface the same M&A and funding stories — and ours is structured, queryable, and merges cleanly into your data warehouse.
📥 Inputs
| Field | Type | Description |
|---|---|---|
limit | int | Max articles to return (1–500). Default 25. |
category | array | Restrict to Crunchbase News sections (venture, ma, startups, business, public-markets, ai, fintech, cybersecurity, crypto, climate-sustainability, health-wellness-biotech, quarterly-and-annual-reports, sales-marketing). Empty = ALL. |
date_range | string | today (last 24h), week (last 7d), month (last 30d), or all. Default week. |
keyword_filter | array | Case-insensitive substrings matched against title + excerpt. Examples: ["Series A","Series B"], ["IPO","SPAC"], ["acquires","acquired"]. |
include_full_text | boolean | If true, body_markdown contains the full article body and full_content=true. Default true. |
All fields are optional. An empty input runs the default "last 7 days, 25 articles, with full text" preset.
📤 Output schema
Each article record (one dataset row):
| Field | Type | Example |
|---|---|---|
title | string | "Exclusive: Xpanner Lands $18M To Offer 'Automation As A Service' To Construction Sites" |
slug | string | "xpanner-automation-as-a-service-for-construction-sites-..." |
url | string | https://news.crunchbase.com/real-estate-property-tech/xpanner-... |
published_at | ISO 8601 | "2026-05-14T14:00:23+00:00" |
author | string | "Marlize van Romburgh" |
category | string | "Artificial intelligence" |
tags | array | ["Manufacturing","Real estate & property tech","Robotics","Startups","unicorn"] |
excerpt | string | First 200 chars of body text |
mentioned_companies | array | ["Xpanner"] — entity-extracted from anchor links to crunchbase.com/organization/* |
mentioned_investors | array | ["Korea Investment Partners","KB Investment Co."] — entity-extracted using investor heuristics |
funding_amount_usd | int/null | 18000000 — parsed from headline + first 400 chars |
round_type | string | "Series B" — Pre-Seed / Seed / Series A–G / Bridge / IPO / M&A |
full_content | boolean | true if body_markdown is populated |
body_markdown | string | Full article body in markdown (paragraphs, links, headings) |
data_source | string | "news.crunchbase.com/feed (RSS)" or "news.crunchbase.com (HTML)" |
🆚 Crunchbase News vs the alternatives
| Tool | Cost | Funding rounds | M&A | IPO | Entity extraction | Structured JSON | API access |
|---|---|---|---|---|---|---|---|
| This Actor | $0.10/article | ✅ Daily | ✅ Daily | ✅ | ✅ Yes | ✅ Yes | ✅ Apify |
| Bloomberg Terminal | ~$30K/yr/seat | ✅ | ✅ | ✅ | ✅ | ❌ Terminal-only | Limited |
| Reuters Eikon / LSEG | ~$24K/yr/seat | ✅ | ✅ | ✅ | ✅ | ❌ Terminal-only | Limited |
| Refinitiv Workspace | ~$22K/yr/seat | ✅ | ✅ | ✅ | ✅ | ❌ Terminal-only | Limited |
| Mergermarket | ~$20K/yr/seat | Partial | ✅✅✅ | ✅ | ✅ | ❌ Web-only | ❌ |
| TechCrunch (free) | $0 | ✅ | ✅ | ✅ | ❌ Unstructured | ❌ HTML only | ❌ |
| This Actor | $0.10/article | ✅ | ✅ | ✅ | ✅ | ✅ JSON | ✅ REST |
You pay per useful row. No per-seat licence, no annual lock-in, no terminal install. Wire it into your warehouse on day one.
🧪 Quick-start examples
Daily morning brief — every story in the last 24h (~12 articles, ~$1.21):
{ "limit": 25, "date_range": "today" }
M&A radar — weekly acquisitions (~30 articles, ~$3.01):
{ "category": ["ma","business"], "date_range": "week", "keyword_filter": ["acquires","acquired","acquisition","merger"], "limit": 50 }
Series A/B watcher (~40 articles, ~$4.01):
{ "keyword_filter": ["Series A","Series B"], "date_range": "week", "limit": 50 }
AI funding firehose (~60 articles, ~$6.01):
{ "category": ["ai"], "date_range": "month", "include_full_text": true, "limit": 100 }
Headlines-only monitoring (no body — cheapest, ~$2.51 for 25 rows):
{ "limit": 25, "date_range": "today", "include_full_text": false }
Monthly backfill — entire month, all categories (~150 articles, ~$15.01):
{ "limit": 500, "date_range": "month" }
🧱 How extraction works
- Funding amount: regex over
title + first 400 charsof body, matching patterns like$5M,$1.2 billion,$500K. Converted to USD integers ($18M→18000000). - Round type: pattern-matched against
Pre-Seed,Seed,Series AthroughSeries G,Bridge,IPO,M&A. First match wins, ordered by specificity. - Mentioned companies vs investors: anchor links inside the body pointing at
crunchbase.com/organization/*are classified by sentence context — anchors nearled by,backed by,invested, or whose link text containsVentures/Capital/Partners/Fundare tagged as investors; everything else is the operating company. - Categories & tags: pulled from the WordPress taxonomy attached to each post (multiple per article — an AI fintech story might be tagged
AI,Fintech,Startups). - Date filtering: applied against the published-at timestamp from RSS /
<meta property="article:published_time">. Articles without parseable dates are kept (rather than silently dropped).
🪜 Source strategy & rate-limit posture
- Primary —
news.crunchbase.com/feed/RSS. Server-rendered, 10 freshest articles, full body incontent:encoded. Zero anti-bot, polite User-Agent. - Pagination — when
limit > 10, we walk/sections/<slug>/page/N/HTML pages, harvest article URLs, fetch each one individually with a 400ms delay. - Headers — desktop Chrome User-Agent, no JS execution required (WordPress is server-rendered).
- Failure modes — if a category index 404s, we fall back to the all-category feed and filter client-side. Articles with missing fields keep best-effort partial records (no silent drops).
Crunchbase News is a WordPress site, not the locked-down Crunchbase data platform. The data platform requires a $999+/mo enterprise contract and aggressive anti-bot. The news site is editorially free, publicly indexable, and the legal grey-zone risk is the same as scraping any open WordPress blog.
🔗 Sister actors in the NexGenData fleet
If Crunchbase News headlines are useful to you, these adjacent Actors are the natural pair:
- Startup Funding Tracker — round-by-round funding events with lead investor, valuation, post-money. The #1 companion to this Actor: news headlines tell you what happened, this tells you how much and who led.
- YC Companies Directory Scraper — Y Combinator alumni, every batch since S05 (5,000+ companies). When a YC company shows up in Crunchbase News, this Actor gives you the rest of their profile.
- Techstars Companies Directory — Techstars accelerator alumni (5,591 companies, 128 programs). Same play as YC: enrich news mentions with full founder/program/cohort context.
- SEC Form 8-K Material Events Scraper — real-time SEC 8-K filings (acquisitions, departures, material events). Crunchbase News covers private-market events; 8-K covers the public-market regulatory disclosures.
- IPO Tracker — upcoming + recent IPOs with lockup expirations, pricing, valuations. Crunchbase News announces the IPO; this Actor gives you the structured pricing data.
- Finance MCP Server — Claude / ChatGPT MCP server bundling the entire NexGenData finance fleet so an LLM can call any of these Actors as a tool.
Stack four of these and you've replicated a Bloomberg + PitchBook + CB Insights workflow for under $100/month.
🤝 Affiliate / support
Built and maintained by NexGenData — leave a star or a review on Apify Console.
Need a custom slice (Crunchbase News + Form D + LinkedIn enrichment for outbound)? Email scrapers@thenextgennexus.com — bulk pricing, white-label feeds, and webhook delivery available.
▶️ Try this Actor → — pay-per-article, $0.10 each.
📰 The NexGenData Newswire & News Suite
Don't monitor one wire — cover them all. Pair this with the rest of the suite for complete PR, press-release, and news coverage from a single vendor with one consistent output schema.
Press-release wires
- PR Newswire — US corporate announcements & earnings releases
- PR Newswire Asia — APAC corporate announcements
- Business Wire — company press releases & disclosures
- GlobeNewswire — listed-company news & regulatory filings
- EIN Presswire — broad-distribution press releases
- PR Web — SMB & small-business press releases
News & headlines
- AP News — Associated Press breaking news & articles
- BBC News — global BBC headlines & articles
- Google News — aggregated headlines & trending topics
- Hacker News — tech & startup stories and discussion
- Crunchbase News — funding rounds, M&A & startup headlines (← you are here)
Regional / regulatory
- Investegate RNS — UK LSE/AIM regulatory (RNS) announcements
About NexGenData
NexGenData publishes 220+ buyer-intent actors covering SEC filings, YC alumni, Delaware DOC, global stock screeners across 30+ exchanges, IPO calendars, IP and patent intelligence, FDA approvals, B2B lead generation, and more. Every actor is pay-per-result with no seat licensing.
Apify affiliate program — free credits + 30% off
Sign up to Apify via our referral link and you'll get:
- Free starter credits to test this actor and the rest of our 220+ actor fleet
- 30% off platform fees for the life of your account
Browse the full NexGenData catalog and sign up here — same Apify, same actors, just cheaper for you.
Built and maintained by NexGenData.
