📰 Crunchbase News Scraper — Daily Funding & M&A Headlines
Pricing
from $100.00 / 1,000 news articles
📰 Crunchbase News Scraper — Daily Funding & M&A Headlines
Daily VC funding rounds, M&A, IPO, startup news from Crunchbase News. Structured JSON with entity extraction, funding amount parsing, round-type classification. Bloomberg / Reuters Eikon / Refinitiv / Mergermarket / TechCrunch alternative. Pay-per-article.
Pricing
from $100.00 / 1,000 news articles
Rating
0.0
(0)
Developer
NexGenData
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
4 days ago
Last modified
Share
VC funding rounds. Acquisitions. IPOs. Quarterly recaps. Startup launches. One Actor, the full firehose from
news.crunchbase.com— the editorial side of Crunchbase, not the locked-down data platform. Pay-per-article, no $30K/seat Bloomberg or $20K/year Mergermarket contract.
▶️ Try this Actor → — first article on us, $0.10 per article after.
🎯 Who this is for
- VC associates running daily intel — every Series A/B/C ann, every M&A whisper, in one structured JSON feed.
- Sales / BD teams chasing founders who just raised. Trigger an outreach the morning the round is announced.
- Corporate development tracking acquisitions in their sector — see who Adobe, Salesforce, Cisco, Snowflake just bought.
- Investor relations running competitive intel for their portfolio companies.
- Journalists & analysts building newsletters, Substacks, sector reports.
- Quant funds wiring "headline alpha" feeds into event-driven strategies.
If you've ever paid $1,800/year for Term Sheet, $20K/year for Mergermarket, or $24K/seat for Reuters Eikon just to get the same headlines that Crunchbase News publishes for free — this Actor turns that page into structured rows with entity extraction, funding amount parsing, and round-type classification.
💸 Pricing
| What you pay | When |
|---|---|
| $0.01 actor start | Once per run (covers compute kickoff) |
| $0.10 per article | Each article record pushed to dataset (the primary unit) |
A typical daily intel run pulling the last 24 hours across all categories (~15 articles) is $1.51. A weekly bulk run pulling 100 articles is $10.01. A monthly back-fill of 500 articles is $50.01.
That's <1% of what a Bloomberg Terminal seat costs to surface the same M&A and funding stories — and ours is structured, queryable, and merges cleanly into your data warehouse.
📥 Inputs
| Field | Type | Description |
|---|---|---|
limit | int | Max articles to return (1–500). Default 25. |
category | array | Restrict to Crunchbase News sections (venture, ma, startups, business, public-markets, ai, fintech, cybersecurity, crypto, climate-sustainability, health-wellness-biotech, quarterly-and-annual-reports, sales-marketing). Empty = ALL. |
date_range | string | today (last 24h), week (last 7d), month (last 30d), or all. Default week. |
keyword_filter | array | Case-insensitive substrings matched against title + excerpt. Examples: ["Series A","Series B"], ["IPO","SPAC"], ["acquires","acquired"]. |
include_full_text | boolean | If true, body_markdown contains the full article body and full_content=true. Default true. |
All fields are optional. An empty input runs the default "last 7 days, 25 articles, with full text" preset.
📤 Output schema
Each article record (one dataset row):
| Field | Type | Example |
|---|---|---|
title | string | "Exclusive: Xpanner Lands $18M To Offer 'Automation As A Service' To Construction Sites" |
slug | string | "xpanner-automation-as-a-service-for-construction-sites-..." |
url | string | https://news.crunchbase.com/real-estate-property-tech/xpanner-... |
published_at | ISO 8601 | "2026-05-14T14:00:23+00:00" |
author | string | "Marlize van Romburgh" |
category | string | "Artificial intelligence" |
tags | array | ["Manufacturing","Real estate & property tech","Robotics","Startups","unicorn"] |
excerpt | string | First 200 chars of body text |
mentioned_companies | array | ["Xpanner"] — entity-extracted from anchor links to crunchbase.com/organization/* |
mentioned_investors | array | ["Korea Investment Partners","KB Investment Co."] — entity-extracted using investor heuristics |
funding_amount_usd | int/null | 18000000 — parsed from headline + first 400 chars |
round_type | string | "Series B" — Pre-Seed / Seed / Series A–G / Bridge / IPO / M&A |
full_content | boolean | true if body_markdown is populated |
body_markdown | string | Full article body in markdown (paragraphs, links, headings) |
data_source | string | "news.crunchbase.com/feed (RSS)" or "news.crunchbase.com (HTML)" |
🆚 Crunchbase News vs the alternatives
| Tool | Cost | Funding rounds | M&A | IPO | Entity extraction | Structured JSON | API access |
|---|---|---|---|---|---|---|---|
| This Actor | $0.10/article | ✅ Daily | ✅ Daily | ✅ | ✅ Yes | ✅ Yes | ✅ Apify |
| Bloomberg Terminal | ~$30K/yr/seat | ✅ | ✅ | ✅ | ✅ | ❌ Terminal-only | Limited |
| Reuters Eikon / LSEG | ~$24K/yr/seat | ✅ | ✅ | ✅ | ✅ | ❌ Terminal-only | Limited |
| Refinitiv Workspace | ~$22K/yr/seat | ✅ | ✅ | ✅ | ✅ | ❌ Terminal-only | Limited |
| Mergermarket | ~$20K/yr/seat | Partial | ✅✅✅ | ✅ | ✅ | ❌ Web-only | ❌ |
| TechCrunch (free) | $0 | ✅ | ✅ | ✅ | ❌ Unstructured | ❌ HTML only | ❌ |
| This Actor | $0.10/article | ✅ | ✅ | ✅ | ✅ | ✅ JSON | ✅ REST |
You pay per useful row. No per-seat licence, no annual lock-in, no terminal install. Wire it into your warehouse on day one.
🧪 Quick-start examples
Daily morning brief — every story in the last 24h (~12 articles, ~$1.21):
{ "limit": 25, "date_range": "today" }
M&A radar — weekly acquisitions (~30 articles, ~$3.01):
{ "category": ["ma","business"], "date_range": "week", "keyword_filter": ["acquires","acquired","acquisition","merger"], "limit": 50 }
Series A/B watcher (~40 articles, ~$4.01):
{ "keyword_filter": ["Series A","Series B"], "date_range": "week", "limit": 50 }
AI funding firehose (~60 articles, ~$6.01):
{ "category": ["ai"], "date_range": "month", "include_full_text": true, "limit": 100 }
Headlines-only monitoring (no body — cheapest, ~$2.51 for 25 rows):
{ "limit": 25, "date_range": "today", "include_full_text": false }
Monthly backfill — entire month, all categories (~150 articles, ~$15.01):
{ "limit": 500, "date_range": "month" }
🧱 How extraction works
- Funding amount: regex over
title + first 400 charsof body, matching patterns like$5M,$1.2 billion,$500K. Converted to USD integers ($18M→18000000). - Round type: pattern-matched against
Pre-Seed,Seed,Series AthroughSeries G,Bridge,IPO,M&A. First match wins, ordered by specificity. - Mentioned companies vs investors: anchor links inside the body pointing at
crunchbase.com/organization/*are classified by sentence context — anchors nearled by,backed by,invested, or whose link text containsVentures/Capital/Partners/Fundare tagged as investors; everything else is the operating company. - Categories & tags: pulled from the WordPress taxonomy attached to each post (multiple per article — an AI fintech story might be tagged
AI,Fintech,Startups). - Date filtering: applied against the published-at timestamp from RSS /
<meta property="article:published_time">. Articles without parseable dates are kept (rather than silently dropped).
🪜 Source strategy & rate-limit posture
- Primary —
news.crunchbase.com/feed/RSS. Server-rendered, 10 freshest articles, full body incontent:encoded. Zero anti-bot, polite User-Agent. - Pagination — when
limit > 10, we walk/sections/<slug>/page/N/HTML pages, harvest article URLs, fetch each one individually with a 400ms delay. - Headers — desktop Chrome User-Agent, no JS execution required (WordPress is server-rendered).
- Failure modes — if a category index 404s, we fall back to the all-category feed and filter client-side. Articles with missing fields keep best-effort partial records (no silent drops).
Crunchbase News is a WordPress site, not the locked-down Crunchbase data platform. The data platform requires a $999+/mo enterprise contract and aggressive anti-bot. The news site is editorially free, publicly indexable, and the legal grey-zone risk is the same as scraping any open WordPress blog.
🔗 Sister actors in the NexGenData fleet
If Crunchbase News headlines are useful to you, these adjacent Actors are the natural pair:
- Startup Funding Tracker — round-by-round funding events with lead investor, valuation, post-money. The #1 companion to this Actor: news headlines tell you what happened, this tells you how much and who led.
- YC Companies Directory Scraper — Y Combinator alumni, every batch since S05 (5,000+ companies). When a YC company shows up in Crunchbase News, this Actor gives you the rest of their profile.
- Techstars Companies Directory — Techstars accelerator alumni (5,591 companies, 128 programs). Same play as YC: enrich news mentions with full founder/program/cohort context.
- SEC Form 8-K Material Events Scraper — real-time SEC 8-K filings (acquisitions, departures, material events). Crunchbase News covers private-market events; 8-K covers the public-market regulatory disclosures.
- IPO Tracker — upcoming + recent IPOs with lockup expirations, pricing, valuations. Crunchbase News announces the IPO; this Actor gives you the structured pricing data.
- Finance MCP Server — Claude / ChatGPT MCP server bundling the entire NexGenData finance fleet so an LLM can call any of these Actors as a tool.
Stack four of these and you've replicated a Bloomberg + PitchBook + CB Insights workflow for under $100/month.
🤝 Affiliate / support
Built and maintained by NexGenData — leave a star or a review on Apify Console.
Need a custom slice (Crunchbase News + Form D + LinkedIn enrichment for outbound)? Email scrapers@thenextgennexus.com — bulk pricing, white-label feeds, and webhook delivery available.
▶️ Try this Actor → — pay-per-article, $0.10 each.
About NexGenData
NexGenData publishes 220+ buyer-intent actors covering SEC filings, YC alumni, Delaware DOC, global stock screeners across 30+ exchanges, IPO calendars, IP and patent intelligence, FDA approvals, B2B lead generation, and more. Every actor is pay-per-result with no seat licensing.
Apify affiliate program — free credits + 30% off
Sign up to Apify via our referral link and you'll get:
- Free starter credits to test this actor and the rest of our 220+ actor fleet
- 30% off platform fees for the life of your account
Browse the full NexGenData catalog and sign up here — same Apify, same actors, just cheaper for you.
Built and maintained by NexGenData.