Equity Crowdfunding Leads avatar

Equity Crowdfunding Leads

Pricing

Pay per event

Go to Apify Store
Equity Crowdfunding Leads

Equity Crowdfunding Leads

Unified Wefunder + Republic + StartEngine founder leads in one schema. Founder name, company, tagline, region, raise progress, valuation. Built for VC scouts and SDRs targeting funded startups.

Pricing

Pay per event

Rating

0.0

(0)

Developer

DevilScrapes

DevilScrapes

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Categories

Share

Equity Crowdfunding Leads

Equity Crowdfunding Leads

We do the dirty work so your dataset stays clean. 😈

$5.05 / 1,000 rows — 6x cheaper than Crunchbase scrapers. Unify currently-raising and recently-funded campaigns from the three biggest US equity-crowdfunding platforms (Wefunder, Republic, StartEngine) into one flat founder-lead dataset. Built for VC scouts, SDRs targeting founders, and competitive-intel analysts who today need to hand-visit three platforms with no shared schema.

This Actor hits the source endpoints directly — Wefunder's internal JSON API, Republic's SSR shell, StartEngine's offering sitemap — parses each with Pydantic-validated models, and pushes a single normalized dataset to Apify. No login. No API key. No Crunchbase ToS exposure. Wefunder alone surfaces 4,800+ currently-listed companies with founder name, tagline, raise progress, investor count, and pre-money valuation in one call.

🎯 What this scrapes

Three equity-crowdfunding platforms, one schema:

  1. Wefunderwefunder.com (primary source; full founder + tagline + raise progress + valuation payload)
  2. Republicrepublic.com (secondary; slug + company name from trending carousel + JSON-LD breadcrumb)
  3. StartEnginestartengine.com (secondary; slug + company name from sitemap; financials nullable in v1 — detail pages are JS-gated)

Per-source coverage matrix (v1):

SourceSlugCompany nameTaglineFounderRaised $Valuation $Investors
Wefunderyesyesyesyesyesyesyes
Republicyesyes
StartEngineyesyes

Output fields:

FieldTypeDescription
sourcestringSource platform (wefunder, republic, startengine)
campaign_slugstringURL slug (e.g. riserobotics, atari-hotels, ai-frontier-fund)
company_namestringDisplay name
taglinestring | nullShort pitch one-liner
industrystring | nullVertical / category label
locationstring | nullRegion, state, or city
foundersarrayList of founder names (empty when none disclosed)
website_urlstring | nullExternal company website when linked
target_amount_usdnumber | nullFunding goal in USD
raised_amount_usdnumber | nullRunning total raised in USD
num_investorsinteger | nullCurrent investor count
valuation_usdnumber | nullPre-money valuation in USD
revenue_usdnumber | nullLatest annual revenue (Form C) when published
funding_stagestring | nullDerived stage label (raising, funded, closed)
campaign_urlstringCanonical campaign detail URL
scraped_atstringISO 8601 UTC timestamp

🔥 Features

  • Three platforms, one schema — drop straight into your CRM, spreadsheet, or BI tool with no per-source normalization.
  • Wefunder primary path — full founder + tagline + raise progress + valuation per row via the internal JSON API the Wefunder SPA itself calls.
  • Pre-money valuation parsing — Wefunder's $62.1M-style shorthand auto-converted to a numeric USD float.
  • Per-source isolation — one source failure (Cloudflare block, page restructure) does not abort the run; the other two still produce data.
  • Pydantic v2 validation — both input and dataset rows are model-validated; invalid input fails fast with a clear error before any network call.
  • Filter knobs — restrict by source list, funding status (active / funded / all), industry substring, or hard row cap per source.
  • No browser — pure HTTP with curl-cffi browser TLS fingerprint impersonation; low compute footprint and fast runs.
  • Exponential backoff on 429 / 503 with Retry-After honoured; max 5 attempts.
  • Apify Proxy on by defaultBUYPROXIES94952 group clears Wefunder + Republic CF fingerprinting from datacenter IPs.

💡 Use cases

  • VC scout pipelines — schedule weekly runs, enrich with LinkedIn lookups on founders[], build a "first dollar in" tracker for sub-seed deals raising on Wefunder.
  • SDR founder outreach — filter by industry substring (e.g. "fintech", "climate", "AI"), drop founders into your email tool, message before a Series A bump.
  • Competitive intelligence — track which sectors are over-raising vs under-raising quarter-over-quarter; spot whitespace before incumbents do.
  • Crowdfunding leaderboards — publish a public site ranking the fastest-growing campaigns this week; updates daily on a scheduled run.
  • Cap-table benchmarks — pre-money valuation distribution by sector at the Reg CF stage, a niche cleanly covered by Wefunder's terms.nb field.
  • Form C deep dives — pair this Actor with sec-edgar-filings-scraper to follow the slug from Wefunder back to the canonical Form C / Form C-AR PDF on EDGAR for revenue, expense, and SAFE term extraction.

⚙️ How to use it

  1. Open the Actor input form.
  2. (Optional) Pick Sources — leave empty to scrape all three, or list a subset like ["wefunder"].
  3. (Optional) Set Max rows per source — default 50, cap 500.
  4. (Optional) Pick Status filteractive (default) for currently raising, funded for recently funded, all for both.
  5. (Optional) Set Industry filter — a case-insensitive substring matched against tagline or industry.
  6. Leave Use Apify Proxy on (default) — Wefunder and Republic block plain datacenter IPs.
  7. Click Start. Results stream into the default dataset.

Quick examples

All three sources, default settings:

{
"sources": [],
"maxPerSource": 50,
"statusFilter": "active",
"useProxy": true
}

Wefunder only, fintech rows, currently raising (the highest-signal default for SDR pipelines):

{
"sources": ["wefunder"],
"maxPerSource": 100,
"statusFilter": "active",
"industryFilter": "fintech",
"useProxy": true
}

QA fixture — Wefunder + Republic, 3 rows each, no proxy:

{
"sources": ["wefunder", "republic"],
"maxPerSource": 3,
"statusFilter": "active",
"useProxy": false
}

📥 Input

FieldTypeRequiredDefaultDescription
sourcesarray of source literalsno[] (all three)Subset of wefunder, republic, startengine
maxPerSourceintegerno50Hard cap per source; range 1..500
statusFilterenum stringno"active"One of active / funded / all
industryFilterstring | nullnonullCase-insensitive substring filter over tagline/industry
useProxybooleannotrueRoute via Apify Proxy (BUYPROXIES94952)

📤 Output

One dataset row per discovered campaign. Example record — Wefunder RISE Robotics as of 2026-05-16:

{
"source": "wefunder",
"campaign_slug": "riserobotics",
"company_name": "RISE Robotics",
"tagline": "Electrifying heavy machines",
"industry": null,
"location": "MA",
"founders": ["Hiten Sonpal"],
"website_url": null,
"target_amount_usd": null,
"raised_amount_usd": 17448682.0,
"num_investors": 417,
"valuation_usd": 62100000.0,
"revenue_usd": null,
"funding_stage": "raising",
"campaign_url": "https://wefunder.com/riserobotics",
"scraped_at": "2026-05-16T13:40:00.000Z"
}

Download the dataset as JSON, CSV, Excel, or XML from the Export button on the run page.

💰 Pricing

Pay-Per-Event (PPE):

EventRate (USD)Trigger
actor-start$0.05Once per Actor run at boot
result-row$0.005Per campaign row pushed

A typical run (all three sources, default 50/source = 150 rows) costs ~$0.80. Per-1,000-row extrapolation: $5.05 — roughly 6x cheaper than Crunchbase-style scrapers at $30/1k, while sourcing from public crowdfunding campaigns (cleaner ToS posture than Crunchbase scraping).

🚧 Limitations

  • StartEngine detail pages are JavaScript-gated. v1 emits one row per offering slug from the public sitemap-private-offerings.xml with name derived from the slug; live raise progress, valuation, and investor count stay nullable on this source. A v2 Camoufox-backed full-render path is planned.
  • Republic financials are client-rendered. v1 surfaces ~10 trending campaign slugs per run from the SSR shell with company name from the JSON-LD breadcrumb. Raised amount, valuation, and investor count are out of reach without a real browser.
  • Wefunder is the data-rich source. Run Wefunder-only (sources: ["wefunder"]) when you need the most fields per row.
  • Authoritative campaign data only. No SEC EDGAR Form C parsing (use sec-edgar-filings-scraper), no investor identity scraping (privacy), no comment threads or campaign updates.
  • 7-day default-storage retention on the Apify free plan. Schedule runs and export to your own storage for time-series.
  • No historical tracking. Every run is a fresh snapshot. Pipeline runs into BigQuery / S3 / Snowflake to build deltas.

❓ FAQ

Why only these three platforms? They are the three largest US equity-crowdfunding portals by total capital deployed: Wefunder, Republic, and StartEngine. NextSeed, MicroVentures, and the LATAM portals are deliberately out of scope for v1 — open a feature request if you need them.

Why do Republic and StartEngine return so few fields compared to Wefunder? Republic and StartEngine are SPAs whose investment data is rendered client-side and gated behind PerimeterX/Datadome challenges that require a real browser to clear. We emit slug + canonical company name + campaign URL from the unauthenticated surface (carousel anchors / sitemap) so the data is still useful as a discovery feed; v2 plans Camoufox full-render for the rich fields.

Where does the Wefunder valuation come from? The Wefunder /-/companies/explore endpoint embeds pre-money valuation as dollar-shorthand text ("$62.1M", "$700K", "$1.2B") in the terms.nb field. This Actor parses that shorthand into a numeric USD float in valuation_usd. Malformed values emit null rather than crashing.

Does this Actor track price / valuation changes across runs? No — every run is a fresh snapshot. Schedule runs and export to your own storage to build a time series. Apify's default run-scoped storage is purged after 7 days on the free plan.

Companion Actor? Yes — sec-edgar-filings-scraper is the natural follow-up for any campaign you want a deep dive on. Take the slug from this Actor, look up the issuer's CIK on EDGAR, and parse the Form C / Form C-AR PDFs for revenue, expense, share count, and SAFE-term detail.

💬 Your feedback

Found a parser that broke after Wefunder, Republic, or StartEngine restructured their page? Want a fourth platform added (NextSeed, MicroVentures, etc.)? Open an issue on the Actor's Apify Store page or contact us at apify.com/DevilScrapes. We monitor publish-day QA failures and ship patches the same week.