Pricing

Pay per event

Failory Startups Scraper

Scrape Failory startup directory pages into clean startup profiles with websites, industries, founders, funding details, and investors.

Developer

Stas Persiianenko

Last modified

18 days ago

What does Failory Startups Scraper do?

Failory Startups Scraper turns public Failory directory pages into clean dataset rows.

It fetches Failory pages over HTTP, parses the startup cards and company detail tables, and exports one row per startup profile.

Typical rows include:

🚀 Startup name
🔗 Website URL
🧭 Failory source URL
🏷️ Industries and category badges
📝 Company description
📍 Headquarters, city, and country
👥 Founders
💰 Funding amount and latest funding status
🏦 Top investors
🖼️ Logo URL
⏱️ Scrape timestamp

Who is it for?

This actor is useful for teams that need startup intelligence without manual copy-paste.

🧲 Lead generation teams building startup prospect lists
💼 B2B sales teams targeting funded companies
📊 Market researchers mapping startup categories
🏦 Investors and accelerators screening startup ecosystems
🧪 Product marketers researching competitor categories
🧰 Data teams feeding startup records into CRMs or BI tools

Why use this actor?

Failory pages are useful, but browsing them manually is slow.

This actor gives you structured, exportable records that can be filtered, enriched, deduplicated, and joined with other datasets.

Benefits:

No browser automation needed for normal runs
Public HTTP pages only
Country and category page support
Source metadata included for traceability
One dataset row per startup
Works with Apify datasets, webhooks, APIs, and integrations

What Failory pages can I scrape?

Use public URLs under https://www.failory.com/startups.

Examples:

https://www.failory.com/startups/united-states
https://www.failory.com/startups/saas
https://www.failory.com/startups/artificial-intelligence
https://www.failory.com/startups/fintech
https://www.failory.com/startups/united-kingdom

You can also provide slugs like united-states, saas, or artificial-intelligence.
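
As a rough illustration of how a slug relates to a full URL, the sketch below turns either form into a directory URL. The helper name `to_directory_url` and the lowercasing and space-to-hyphen handling are assumptions for this example; the actor's own slug resolution is not documented here.

```python
from urllib.parse import urlparse

# Base path taken from the examples above.
BASE_URL = "https://www.failory.com/startups"


def to_directory_url(target: str) -> str:
    """Return a Failory directory URL for a slug, or pass a full URL through."""
    target = target.strip()
    # Anything that already looks like an http(s) URL is kept as given.
    if urlparse(target).scheme in ("http", "https"):
        return target.rstrip("/")
    # Assumption: slugs are lowercase and hyphen-separated.
    slug = target.strip("/").lower().replace(" ", "-")
    return f"{BASE_URL}/{slug}"


targets = ["saas", "United States", "https://www.failory.com/startups/fintech"]
urls = [to_directory_url(t) for t in targets]
```

This can be handy for building a `startUrls` list from a spreadsheet column of country or category names.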

Data table

Field	Description
`startupName`	Startup or company name
`rank`	Rank/order on the Failory page
`websiteUrl`	External website linked by Failory
`sourceUrl`	Failory page where the startup was found
`sourcePageTitle`	Page title or heading
`sourcePageSlug`	Failory slug after `/startups/`
`sourcePageType`	Country, category, directory, or unknown
`sourcePageLabel`	Human-readable page label
`description`	Startup description paragraph
`industries`	Array of Failory badges
`industryText`	Comma-separated industries for CSV tools
`headquarters`	Headquarters text
`city`	Parsed city from headquarters
`country`	Parsed country from headquarters
`yearFounded`	Founded year
`founders`	Founder names
`fundingAmount`	Funding amount text
`startupSize`	Size bucket from Failory
`lastFundingStatus`	Latest funding stage/status
`topInvestors`	Investor names
`logoUrl`	Logo image URL
`scrapedAt`	ISO timestamp
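
The derived fields in the table (`industryText`, `city`, `country`) can be approximated from the raw fields. The sketch below is one plausible way to do it, not the actor's actual parsing: it assumes headquarters text follows a "City, Region, Country" pattern and that industries are joined with a comma and a space.

```python
def industry_text(industries):
    """Join an industries array into one string for CSV tools."""
    return ", ".join(industries or [])


def split_headquarters(headquarters):
    """Guess (city, country) from text such as 'City, Region, Country'."""
    if not headquarters:
        return None, None
    parts = [p.strip() for p in headquarters.split(",") if p.strip()]
    if not parts:
        return None, None
    if len(parts) == 1:
        # Assumption: a single value is treated as the country.
        return None, parts[0]
    # First part is taken as the city, last part as the country.
    return parts[0], parts[-1]


city, country = split_headquarters("San Francisco, California, United States")
```

Headquarters strings that do not follow this pattern would need their own handling.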

How much does it cost to scrape Failory startup profiles?

The default pay-per-event setup is designed for affordable startup lead generation.

Pricing uses:

A small run-start event
A per-profile result event for each startup saved

At the BRONZE-tier price, 1,000 startup profiles cost about $0.03 plus the small run-start fee. Higher Apify plans get tiered discounts.
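
To budget a run, the two events can be combined into a simple estimate. This is a sketch based only on the $0.03 per 1,000 profiles figure quoted here; the run-start fee is not stated on this page, so it is a parameter you must supply, and the function name is made up for the example.

```python
# BRONZE figure quoted in this section: about $0.03 per 1,000 profiles.
PRICE_PER_1000_PROFILES_USD = 0.03


def estimate_run_cost(profiles: int, start_fee_usd: float) -> float:
    """Estimate the cost of one run in USD.

    start_fee_usd is an assumption supplied by the caller; check the
    actor's pricing tab for the real run-start fee.
    """
    if profiles < 0:
        raise ValueError("profiles must be non-negative")
    return start_fee_usd + profiles * PRICE_PER_1000_PROFILES_USD / 1000


# Example with a hypothetical start fee of $0.01.
estimate = estimate_run_cost(50_000, start_fee_usd=0.01)
```

Since `maxItems` caps the number of saved profiles, passing the same number here gives an upper bound for the per-profile part of the bill.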

How to run it

Open the actor on Apify.
Add one or more Failory startup directory URLs.
Optionally add slugs such as saas or united-states.
Set maxItems to the number of startup profiles you need.
Set maxPages if you start from a broad directory page.
Click Start.
Export the dataset as JSON, CSV, Excel, XML, RSS, or HTML.

Input example

{
  "startUrls": [
    { "url": "https://www.failory.com/startups/united-states" },
    { "url": "https://www.failory.com/startups/saas" }
  ],
  "slugs": ["artificial-intelligence"],
  "maxItems": 100,
  "maxPages": 5
}

Output example

{
  "startupName": "Perplexity",
  "rank": 2,
  "websiteUrl": "https://www.perplexity.ai/?ref=failory",
  "sourceUrl": "https://www.failory.com/startups/united-states",
  "sourcePageType": "country",
  "sourcePageLabel": "United States",
  "description": "Perplexity has developed an AI-powered answer engine...",
  "industries": ["AI", "Chatbot", "Generative AI"],
  "headquarters": "San Francisco, California, United States",
  "country": "United States",
  "yearFounded": 2022,
  "fundingAmount": "$1.5B",
  "lastFundingStatus": "Venture Round"
}
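
Because `fundingAmount` is exported as text, sorting or filtering by funding needs a numeric conversion. The helper below is a sketch that assumes values look like `$1.5B`, `$200M`, or `$750K` in US dollars; other currencies or formats return `None` rather than a guess.

```python
import re

# Assumed suffixes: thousands, millions, billions.
_MULTIPLIERS = {"K": 1_000, "M": 1_000_000, "B": 1_000_000_000}
_PATTERN = re.compile(r"\$?\s*([\d,]*\.?\d+)\s*([KMB])?", re.IGNORECASE)


def parse_funding_amount(text):
    """Convert text such as '$1.5B' to a float, or None if it does not match."""
    if not text:
        return None
    match = _PATTERN.fullmatch(text.strip())
    if not match:
        return None
    number = float(match.group(1).replace(",", ""))
    suffix = (match.group(2) or "").upper()
    return number * _MULTIPLIERS.get(suffix, 1)


amount = parse_funding_amount("$1.5B")
```

Rows where the result is `None` can be kept with their original text so nothing is lost.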

Tips for best results

Start with one or two specific pages before scraping many categories.
Use maxItems to control dataset size and cost.
Use sourcePageLabel to group records by country or category.
Deduplicate by startupName and websiteUrl when combining pages.
Use industryText for spreadsheet filters.
Use industries when processing JSON programmatically.
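
The deduplication and grouping tips can be sketched as follows. The matching rule (lowercased name plus website host, ignoring `www.` and any query string such as `?ref=failory`) is an assumption chosen for this example; adjust it if your data needs stricter or looser matching.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def dedupe_key(item):
    """Build a comparison key from startupName and the websiteUrl host."""
    name = (item.get("startupName") or "").strip().lower()
    host = urlsplit(item.get("websiteUrl") or "").netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    return name, host


def deduplicate(items):
    """Keep the first occurrence of each startup across combined pages."""
    seen = set()
    unique = []
    for item in items:
        key = dedupe_key(item)
        if key not in seen:
            seen.add(key)
            unique.append(item)
    return unique


def group_by_label(items):
    """Group records by sourcePageLabel (country or category)."""
    groups = defaultdict(list)
    for item in items:
        groups[item.get("sourcePageLabel") or "unknown"].append(item)
    return dict(groups)
```

Note that deduplicating first keeps only one source page label per startup, so group before deduplicating if you need per-page counts.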

Integrations

You can connect the dataset to common workflows:

📇 Send startup records to a CRM
📬 Trigger outreach workflows with Apify webhooks
📊 Load startup datasets into BigQuery, Snowflake, or Sheets
🔎 Enrich websites with separate email or SEO actors
🧠 Feed profiles into research assistants or scoring models

API usage with Node.js

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: process.env.APIFY_TOKEN });
const run = await client.actor('automation-lab/failory-startups-scraper').call({
  startUrls: [{ url: 'https://www.failory.com/startups/saas' }],
  maxItems: 50,
  maxPages: 2,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

API usage with Python

import os

from apify_client import ApifyClient

# Read the API token from the environment instead of hard-coding it.
client = ApifyClient(os.environ['APIFY_TOKEN'])

# Start the actor and wait for the run to finish.
run = client.actor('automation-lab/failory-startups-scraper').call(run_input={
    'startUrls': [{'url': 'https://www.failory.com/startups/united-states'}],
    'maxItems': 50,
    'maxPages': 2,
})

# Fetch the rows saved to the run's default dataset.
items = client.dataset(run['defaultDatasetId']).list_items().items
print(items)
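
If you want a local CSV instead of the console output, the rows can be written with the standard library. This is a minimal sketch that assumes `items` is a list of plain dictionaries like the output example; the helper name and the choice to join list fields with a comma and a space are illustrative.

```python
import csv
import io


def items_to_csv(items, fields):
    """Render selected fields of dataset items as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields)
    writer.writeheader()
    for item in items:
        row = {}
        for field in fields:
            value = item.get(field)
            # Flatten array fields such as industries or founders.
            if isinstance(value, list):
                value = ", ".join(str(v) for v in value)
            # Missing values become empty cells.
            row[field] = "" if value is None else value
        writer.writerow(row)
    return buffer.getvalue()


sample = [{
    "startupName": "Perplexity",
    "websiteUrl": "https://www.perplexity.ai/?ref=failory",
    "industries": ["AI", "Chatbot"],
}]
csv_text = items_to_csv(sample, ["startupName", "websiteUrl", "industries"])
```

Writing `csv_text` to a file opened with `newline=""` avoids doubled line endings on Windows.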

API usage with cURL

curl -X POST "https://api.apify.com/v2/acts/automation-lab~failory-startups-scraper/runs?token=$APIFY_TOKEN" \
  -H 'Content-Type: application/json' \
  -d '{"startUrls":[{"url":"https://www.failory.com/startups/saas"}],"maxItems":50,"maxPages":2}'

MCP integration

Use the actor from MCP-compatible tools through Apify MCP Server.

MCP URL:

https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper

JSON configuration example:

{
  "mcpServers": {
    "apify-failory-startups": {
      "url": "https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper"
    }
  }
}

Example prompts:

"Use the Failory Startups Scraper MCP tool to scrape 50 SaaS startups and summarize the most common industries."
"Run automation-lab/failory-startups-scraper for United States startups and format the top funded companies as a table."
"Find Failory AI startups with the MCP tool and identify founders and investors mentioned in the data."

Claude Desktop MCP setup

Add Apify MCP Server to Claude Desktop and include this actor in the tools query.

Use your Apify token for authentication. Then ask Claude to run automation-lab/failory-startups-scraper with a Failory URL and a small maxItems value.

Claude Code MCP setup

Configure the Apify MCP endpoint with:

https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper

Add it from Claude Code with a command like:

claude mcp add --transport http apify-failory-startups "https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper"

Then call the actor from your coding workflow to create fixtures, update prospect files, or refresh market datasets.

Example MCP prompts:

"Use the Failory Startups Scraper tool to get 20 SaaS startups and save the names and websites as CSV."
"Run the Failory actor for united-states with maxItems 10 and summarize funding stages."
"Collect AI startup profiles from Failory and return founder and investor columns."

Data quality notes

The actor extracts what Failory publishes in the page HTML.

Some records may not include every field. For example, some pages may omit funding amount, investor names, or founder names. Empty values are exported as null or empty arrays depending on field type.
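
Before relying on a field, it can help to measure how often it is actually filled. The sketch below treats `None`, empty strings, and empty lists as missing, which matches the description here; the helper names are made up for the example.

```python
def is_missing(value):
    """Treat null, empty text, and empty arrays as missing."""
    return value is None or value == "" or value == []


def field_coverage(items, fields):
    """Return the share of items (0.0 to 1.0) with a value for each field."""
    total = len(items)
    coverage = {}
    for field in fields:
        filled = sum(1 for item in items if not is_missing(item.get(field)))
        coverage[field] = filled / total if total else 0.0
    return coverage


sample = [
    {"fundingAmount": "$1.5B", "founders": []},
    {"fundingAmount": None, "founders": ["Example Founder"]},
]
report = field_coverage(sample, ["fundingAmount", "founders", "topInvestors"])
```

A low share for a field on one page type is a hint to filter on it only after checking the source page.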

Limitations

It does not log in to Failory.
It does not bypass private data walls.
It does not infer emails or phone numbers.
It does not guarantee that Failory's public data is current.
It extracts startup profiles from list pages, not unrelated blog posts.

FAQ

Can I use it without a Failory account?

Yes. This actor only uses public Failory startup directory pages that are visible without an account.

Does it extract emails or phone numbers?

No. Failory startup directory pages do not consistently publish emails or phone numbers, so the actor does not invent or infer them.

Why do some pages return only a small number of records?

Some Failory pages expose a curated top list in the HTML. The actor exports the records available in that public page markup.

Can I start from the main `/startups` page?

Yes. The actor can discover country and category links from the main directory page. Increase maxPages if you want it to follow more discovered pages.

Troubleshooting

If you get zero records, check that your URL is a Failory startup directory page.

Good URL pattern:

https://www.failory.com/startups/<country-or-category-slug>

If a broad /startups page returns fewer records than expected, increase maxPages so the actor can follow more discovered directory links.
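
A quick pre-flight check on input URLs can catch most zero-record runs. The sketch below classifies a URL against the pattern shown in this section; the return labels are hypothetical names for this example and are not the actor's `sourcePageType` values.

```python
from urllib.parse import urlsplit

# Hosts accepted by this sketch (assumption: with or without "www.").
FAILORY_HOSTS = ("failory.com", "www.failory.com")


def classify_failory_url(url: str) -> str:
    """Return 'main-directory', 'listing-page', or 'invalid' for a URL."""
    parts = urlsplit(url.strip())
    if parts.scheme not in ("http", "https"):
        return "invalid"
    if parts.netloc.lower() not in FAILORY_HOSTS:
        return "invalid"
    segments = [s for s in parts.path.split("/") if s]
    if segments == ["startups"]:
        # Broad page: consider a higher maxPages value.
        return "main-directory"
    if len(segments) == 2 and segments[0] == "startups":
        return "listing-page"
    return "invalid"


kind = classify_failory_url("https://www.failory.com/startups/saas")
```

URLs classified as `invalid` here, such as blog posts, are the ones most likely to produce an empty dataset.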

Legality

This actor is designed to scrape publicly available Failory pages. It does not access private accounts or restricted data. You are responsible for using the exported data in accordance with applicable laws, Failory's terms, and privacy rules that apply to your use case.

Changelog

0.1

Initial version with HTTP extraction for public Failory startup directory pages.

Support

If a Failory page layout changes or a specific startup category stops parsing, open an Apify issue with the input URL and run ID so we can reproduce it quickly.
