🏒 Company Data Aggregator - Crunchbase API Alternative

Pricing

$30.00 / 1,000 company profiles aggregated (one profile per domain)

Bulk company profile lookup. Aggregates WHOIS, DNS, GitHub org, SSL certs, tech stack headers, and robots/sitemap data, with zero auth and zero paid APIs. Replaces the Crunchbase Free API (killed 2023).

Rating: 1.0 (1)

Developer: NexGenData (maintained by Community)

Actor stats: 0 bookmarks, 7 total users, 4 monthly active users, last modified 13 days ago

Aggregate company data from multiple public sources into one record. Built for sales enrichment and competitive research without paid Crunchbase or Clearbit subscriptions.

For each company name or domain you pass in, the actor returns one aggregated profile with:

  • Company name, description, industry
  • Founded year
  • Headquarters / address
  • Employee band (LinkedIn-style: 1-10, 11-50, 51-200, 201-500, 501-1000, 1001-5000, 5001-10000, 10001+)
  • Total funding raised + last round (when includeFunding is enabled), pulled from public Crunchbase pages and news-search snippets
  • Tech stack (when includeTechStack is enabled), detected from HTTP headers and inline script fingerprints
  • Social handles: LinkedIn, Twitter/X, Facebook, GitHub, YouTube, Instagram
  • Key people (CEO, CTO, CFO, COO, Founders) parsed from /about and /team pages
  • sources array tagging each source that contributed to the record
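
The employee bands listed above can be derived from a raw headcount with a simple lookup. The following is a minimal sketch, not the actor's actual code: the function name `employee_band` and the module-level constants are hypothetical, and only the band labels come from the list above.

```python
from bisect import bisect_left

# Upper bounds of the bands listed above; anything larger falls into "10001+".
_BAND_UPPER_BOUNDS = [10, 50, 200, 500, 1000, 5000, 10000]
_BAND_LABELS = [
    "1-10", "11-50", "51-200", "201-500",
    "501-1000", "1001-5000", "5001-10000", "10001+",
]


def employee_band(employee_count: int) -> str:
    """Map a headcount to its LinkedIn-style band label (hypothetical helper)."""
    if employee_count < 1:
        raise ValueError("employee_count must be at least 1")
    # bisect_left returns the index of the first upper bound >= employee_count,
    # or len(bounds) when the count exceeds every bound (the open-ended band).
    return _BAND_LABELS[bisect_left(_BAND_UPPER_BOUNDS, employee_count)]
```

Under this mapping, a headcount of 8000 lands in the "5001-10000" band.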

Input

{
  "companies": ["Stripe", "airbnb.com"],
  "includeFunding": true,
  "includeTechStack": false
}

| Field | Type | Default | Notes |
| --- | --- | --- | --- |
| companies | array of strings | (required) | Company names or domains. Names will be resolved to a domain via DuckDuckGo. |
| includeFunding | boolean | true | Attempt funding-round + total-raised lookups. Adds 1–2 HTTP calls per company. |
| includeTechStack | boolean | false | Run header + script fingerprinting against the homepage. |
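
To make the defaults concrete, here is a rough sketch of how an input object like the one above could be validated before a run. The helper names `normalize_input` and `looks_like_domain` and the domain regex are assumptions for illustration; the actor's real name-versus-domain detection is not documented here.

```python
import re
from typing import Any

# Heuristic (an assumption): a bare hostname with at least one dot and an
# alphabetic TLD counts as a domain; everything else is treated as a name.
_DOMAIN_RE = re.compile(
    r"^(?:[a-z0-9](?:[a-z0-9-]*[a-z0-9])?\.)+[a-z]{2,}$", re.IGNORECASE
)


def looks_like_domain(value: str) -> bool:
    """Return True when the entry can skip the name-to-domain lookup."""
    return bool(_DOMAIN_RE.match(value.strip()))


def normalize_input(raw: dict[str, Any]) -> dict[str, Any]:
    """Apply the documented defaults and reject a missing companies list."""
    companies = raw.get("companies")
    if not isinstance(companies, list):
        raise ValueError("'companies' is required and must be an array of strings")
    # Drop non-string and blank entries rather than failing the whole run.
    cleaned = [c.strip() for c in companies if isinstance(c, str) and c.strip()]
    if not cleaned:
        raise ValueError("'companies' must contain at least one name or domain")
    return {
        "companies": cleaned,
        "includeFunding": bool(raw.get("includeFunding", True)),
        "includeTechStack": bool(raw.get("includeTechStack", False)),
    }
```

With this sketch, "airbnb.com" is recognized as a domain, while "Stripe" would go through the name-resolution step first.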

Output

One dataset record per input company:

{
  "company_name": "Stripe",
  "domain": "stripe.com",
  "description": "Stripe is a financial infrastructure platform for businesses…",
  "industry": "Financial Services",
  "founded": 2010,
  "headquarters": "San Francisco, CA",
  "employee_count": 8000,
  "employee_band": "5001-10000",
  "total_funding_usd": 8800000000,
  "last_round": {"type": "Series I", "amount_usd": 6500000000, "date": "2023"},
  "tech_stack": ["Cloudflare", "React", "Stripe", "Segment"],
  "social": {
    "linkedin": "https://www.linkedin.com/company/stripe",
    "twitter": "https://twitter.com/stripe",
    "github": "https://github.com/stripe"
  },
  "key_people": [
    {"name": "Patrick Collison", "title": "CEO, Co-founder"},
    {"name": "John Collison", "title": "President, Co-founder"}
  ],
  "sources": ["input", "homepage", "/about", "crunchbase"],
  "errors": [],
  "scrapedAt": "2026-05-17T00:00:00+00:00"
}

When a source can't be reached (anti-bot blocks, DNS failure, blank pages), the actor still emits a record and tags the failure in errors. Nothing is silently dropped.
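
The "always emit, tag failures" behavior described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the actor's implementation: `build_record`, the `Fetcher` type, the fake fetchers, and the format of the error strings are all hypothetical. Only the `sources`, `errors`, and `scrapedAt` field names come from the output example.

```python
from datetime import datetime, timezone
from typing import Callable

# Hypothetical fetcher signature: takes a company, returns extracted fields.
Fetcher = Callable[[str], dict]


def build_record(company: str, fetchers: dict[str, Fetcher]) -> dict:
    """Merge fields from each source; record failures instead of raising."""
    record: dict = {"company_name": company, "sources": ["input"], "errors": []}
    for source, fetch in fetchers.items():
        try:
            fields = fetch(company)
        except Exception as exc:
            # Assumed error format: "<source>: <ExceptionType>: <message>".
            record["errors"].append(f"{source}: {type(exc).__name__}: {exc}")
            continue
        if fields:
            record.update(fields)
            record["sources"].append(source)
    record["scrapedAt"] = datetime.now(timezone.utc).isoformat(timespec="seconds")
    return record


# Example with two fake sources: one succeeds, one is blocked.
def fake_homepage(company: str) -> dict:
    return {"domain": "stripe.com"}


def fake_crunchbase(company: str) -> dict:
    raise ConnectionError("blocked by anti-bot page")


record = build_record(
    "Stripe", {"homepage": fake_homepage, "crunchbase": fake_crunchbase}
)
```

In this example the record still carries the homepage data, `sources` lists only the sources that contributed, and the blocked lookup appears as a single entry in `errors`.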

Pricing

Pay-Per-Event: one company-profile charge per emitted profile.
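
As a quick budgeting aid, the listed price of $30.00 per 1,000 profiles works out to $0.03 per emitted profile. The sketch below simply applies that arithmetic; the function name `estimate_cost_usd` is hypothetical, and it assumes no other platform fees or discounts apply.

```python
from decimal import Decimal

# Listed price on the store page: $30.00 per 1,000 company profiles.
PRICE_PER_1000_PROFILES_USD = Decimal("30.00")


def estimate_cost_usd(emitted_profiles: int) -> Decimal:
    """Estimate the Pay-Per-Event charge for a number of emitted profiles."""
    if emitted_profiles < 0:
        raise ValueError("emitted_profiles must be non-negative")
    cost = PRICE_PER_1000_PROFILES_USD * emitted_profiles / 1000
    # Round to whole cents for display.
    return cost.quantize(Decimal("0.01"))
```

For instance, a batch of 250 companies would come to an estimated $7.50. Because the charge is per emitted profile, records emitted with entries in errors presumably count toward the total as well.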

Architecture

  • httpx + BeautifulSoup (lxml parser)
  • Apify residential proxy, with automatic fallback to a direct connection if unavailable
  • Concurrent fetch of homepage, /about, and /careers
  • Schema.org JSON-LD parsing for canonical name/founding/HQ when available
  • Crunchbase free org page + DuckDuckGo news search for funding signals
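
The Schema.org JSON-LD step in the list above can be illustrated with a small, dependency-free sketch. Note the swap: the actor is described as using BeautifulSoup with the lxml parser, whereas this example uses the standard library's `html.parser` so it runs without extra packages. The names `JsonLdCollector` and `extract_organization` are hypothetical, and the sketch only handles top-level objects or arrays (not `@graph` wrappers).

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect the raw text of every <script type="application/ld+json">."""

    def __init__(self) -> None:
        super().__init__()
        self._in_jsonld = False
        self._buffer: list[str] = []
        self.blocks: list[str] = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def extract_organization(html: str) -> dict:
    """Return name, founding date, and HQ locality from the first Organization."""
    parser = JsonLdCollector()
    parser.feed(html)
    parser.close()
    for raw in parser.blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue  # Malformed JSON-LD is skipped, not fatal.
        for item in data if isinstance(data, list) else [data]:
            if not isinstance(item, dict):
                continue
            types = item.get("@type")
            types = types if isinstance(types, list) else [types]
            if "Organization" in types or "Corporation" in types:
                address = item.get("address")
                locality = (
                    address.get("addressLocality")
                    if isinstance(address, dict) else None
                )
                return {
                    "company_name": item.get("name"),
                    "founded": item.get("foundingDate"),
                    "headquarters": locality,
                }
    return {}
```

Pages without usable JSON-LD yield an empty dict, which leaves room for other sources (such as the /about page) to fill in the same fields.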