Failory Startups Scraper
Pricing
Pay per event
Failory Startups Scraper
Scrape Failory startup directory pages into clean startup profiles with websites, industries, founders, funding details, and investors.
Pricing
Pay per event
Rating
0.0
(0)
Developer
Stas Persiianenko
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Extract structured startup profiles from public Failory startup directory pages.
Use this actor to collect startup names, websites, industries, descriptions, founders, funding snippets, headquarters, investor names, and source-page metadata from country and category lists such as United States startups, SaaS startups, fintech startups, and AI startups.
What does Failory Startups Scraper do?
Failory Startups Scraper turns public Failory directory pages into clean dataset rows.
It fetches Failory pages over HTTP, parses the startup cards and company detail tables, and exports one row per startup profile.
Typical rows include:
- ๐ Startup name
- ๐ Website URL
- ๐งญ Failory source URL
- ๐ท๏ธ Industries and category badges
- ๐ Company description
- ๐ Headquarters, city, and country
- ๐ฅ Founders
- ๐ฐ Funding amount and latest funding status
- ๐ฆ Top investors
- ๐ผ๏ธ Logo URL
- โฑ๏ธ Scrape timestamp
Who is it for?
This actor is useful for teams that need startup intelligence without manual copy-paste.
- ๐งฒ Lead generation teams building startup prospect lists
- ๐ผ B2B sales teams targeting funded companies
- ๐ Market researchers mapping startup categories
- ๐ฆ Investors and accelerators screening startup ecosystems
- ๐งช Product marketers researching competitor categories
- ๐งฐ Data teams feeding startup records into CRMs or BI tools
Why use this actor?
Failory pages are useful but manual browsing is slow.
This actor gives you structured, exportable records that can be filtered, enriched, deduplicated, and joined with other datasets.
Benefits:
- No browser automation needed for normal runs
- Public HTTP pages only
- Country and category page support
- Source metadata included for traceability
- One dataset row per startup
- Works with Apify datasets, webhooks, APIs, and integrations
What Failory pages can I scrape?
Use public URLs under https://www.failory.com/startups.
Examples:
https://www.failory.com/startups/united-stateshttps://www.failory.com/startups/saashttps://www.failory.com/startups/artificial-intelligencehttps://www.failory.com/startups/fintechhttps://www.failory.com/startups/united-kingdom
You can also provide slugs like united-states, saas, or artificial-intelligence.
Data table
| Field | Description |
|---|---|
startupName | Startup or company name |
rank | Rank/order on the Failory page |
websiteUrl | External website linked by Failory |
sourceUrl | Failory page where the startup was found |
sourcePageTitle | Page title or heading |
sourcePageSlug | Failory slug after /startups/ |
sourcePageType | Country, category, directory, or unknown |
sourcePageLabel | Human-readable page label |
description | Startup description paragraph |
industries | Array of Failory badges |
industryText | Comma-separated industries for CSV tools |
headquarters | Headquarters text |
city | Parsed city from headquarters |
country | Parsed country from headquarters |
yearFounded | Founded year |
founders | Founder names |
fundingAmount | Funding amount text |
startupSize | Size bucket from Failory |
lastFundingStatus | Latest funding stage/status |
topInvestors | Investor names |
logoUrl | Logo image URL |
scrapedAt | ISO timestamp |
How much does it cost to scrape Failory startup profiles?
The default pay-per-event setup is designed for affordable startup lead generation.
Pricing uses:
- A small run-start event
- A per-profile
resultevent for each startup saved
At the BRONZE price, 1,000 startup profiles cost $0.50 plus the small start fee. Tiered discounts apply on higher Apify plans.
How to run it
- Open the actor on Apify.
- Add one or more Failory startup directory URLs.
- Optionally add slugs such as
saasorunited-states. - Set
maxItemsto the number of startup profiles you need. - Set
maxPagesif you start from a broad directory page. - Click Start.
- Export the dataset as JSON, CSV, Excel, XML, RSS, or HTML.
Input example
{"startUrls": [{ "url": "https://www.failory.com/startups/united-states" },{ "url": "https://www.failory.com/startups/saas" }],"slugs": ["artificial-intelligence"],"maxItems": 100,"maxPages": 5}
Output example
{"startupName": "Perplexity","rank": 2,"websiteUrl": "https://www.perplexity.ai/?ref=failory","sourceUrl": "https://www.failory.com/startups/united-states","sourcePageType": "country","sourcePageLabel": "United States","description": "Perplexity has developed an AI-powered answer engine...","industries": ["AI", "Chatbot", "Generative AI"],"headquarters": "San Francisco, California, United States","country": "United States","yearFounded": 2022,"fundingAmount": "$1.5B","lastFundingStatus": "Venture Round"}
Tips for best results
- Start with one or two specific pages before scraping many categories.
- Use
maxItemsto control dataset size and cost. - Use
sourcePageLabelto group records by country or category. - Deduplicate by
startupNameandwebsiteUrlwhen combining pages. - Use
industryTextfor spreadsheet filters. - Use
industrieswhen processing JSON programmatically.
Integrations
You can connect the dataset to common workflows:
- ๐ Send startup records to a CRM
- ๐ฌ Trigger outreach workflows with Apify webhooks
- ๐ Load startup datasets into BigQuery, Snowflake, or Sheets
- ๐ Enrich websites with separate email or SEO actors
- ๐ง Feed profiles into research assistants or scoring models
API usage with Node.js
import { ApifyClient } from 'apify-client';const client = new ApifyClient({ token: process.env.APIFY_TOKEN });const run = await client.actor('automation-lab/failory-startups-scraper').call({startUrls: [{ url: 'https://www.failory.com/startups/saas' }],maxItems: 50,maxPages: 2,});const { items } = await client.dataset(run.defaultDatasetId).listItems();console.log(items);
API usage with Python
from apify_client import ApifyClientimport osclient = ApifyClient(os.environ['APIFY_TOKEN'])run = client.actor('automation-lab/failory-startups-scraper').call(run_input={'startUrls': [{'url': 'https://www.failory.com/startups/united-states'}],'maxItems': 50,'maxPages': 2,})items = client.dataset(run['defaultDatasetId']).list_items().itemsprint(items)
API usage with cURL
curl -X POST "https://api.apify.com/v2/acts/automation-lab~failory-startups-scraper/runs?token=$APIFY_TOKEN" \-H 'Content-Type: application/json' \-d '{"startUrls":[{"url":"https://www.failory.com/startups/saas"}],"maxItems":50,"maxPages":2}'
MCP integration
Use the actor from MCP-compatible tools through Apify MCP Server.
MCP URL:
https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper
Add it from Claude Code with a command like:
$claude mcp add apify-failory-startups "https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper"
JSON configuration example:
{"mcpServers": {"apify-failory-startups": {"url": "https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper"}}}
Example prompts:
- "Use the Failory Startups Scraper MCP tool to scrape 50 SaaS startups and summarize the most common industries."
- "Run automation-lab/failory-startups-scraper for United States startups and format the top funded companies as a table."
- "Find Failory AI startups with the MCP tool and identify founders and investors mentioned in the data."
Claude Desktop MCP setup
Add Apify MCP Server to Claude Desktop and include this actor in the tools query.
Use your Apify token for authentication. Then ask Claude to run automation-lab/failory-startups-scraper with a Failory URL and a small maxItems value.
Claude Code MCP setup
Configure the Apify MCP endpoint with:
https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper
Add it from Claude Code with a command like:
$claude mcp add apify-failory-startups "https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper"
You can also use a JSON-style MCP server configuration:
{"mcpServers": {"apify-failory-startups": {"url": "https://mcp.apify.com/?tools=automation-lab/failory-startups-scraper"}}}
Then call the actor from your coding workflow to create fixtures, update prospect files, or refresh market datasets.
Example MCP prompts:
- "Use the Failory Startups Scraper tool to get 20 SaaS startups and save the names and websites as CSV."
- "Run the Failory actor for united-states with maxItems 10 and summarize funding stages."
- "Collect AI startup profiles from Failory and return founder and investor columns."
Data quality notes
The actor extracts what Failory publishes in the page HTML.
Some records may not include every field. For example, some pages may omit funding amount, investor names, or founder names. Empty values are exported as null or empty arrays depending on field type.
Limitations
- It does not log in to Failory.
- It does not bypass private data walls.
- It does not infer emails or phone numbers.
- It does not guarantee that Failory's public data is current.
- It extracts startup profiles from list pages, not unrelated blog posts.
FAQ
Can I scrape Failory without a login?
Yes. This actor only uses public Failory startup directory pages that are visible without an account.
Does it extract emails or phone numbers?
No. Failory startup directory pages do not consistently publish emails or phone numbers, so the actor does not invent or infer them.
Why do some pages return only a small number of records?
Some Failory pages expose a curated top list in the HTML. The actor exports the records available in that public page markup.
Can I start from the main /startups page?
Yes. The actor can discover country and category links from the main directory page. Increase maxPages if you want it to follow more discovered pages.
Troubleshooting
If you get zero records, check that your URL is a Failory startup directory page.
Good URL pattern:
https://www.failory.com/startups/<country-or-category-slug>
If a broad /startups page returns fewer records than expected, increase maxPages so the actor can follow more discovered directory links.
Legality
This actor is designed to scrape publicly available Failory pages. It does not access private accounts or restricted data. You are responsible for using the exported data in accordance with applicable laws, Failory's terms, and privacy rules that apply to your use case.
Related scrapers
Other automation-lab actors that may complement this workflow:
- https://apify.com/automation-lab/ycombinator-scraper
- https://apify.com/automation-lab/linktree-profile-scraper
- https://apify.com/automation-lab/sitejabber-reviews-scraper
- https://apify.com/automation-lab/npi-registry-provider-scraper
Changelog
0.1
Initial version with HTTP extraction for public Failory startup directory pages.
Support
If a Failory page layout changes or a specific startup category stops parsing, open an Apify issue with the input URL and run ID so we can reproduce it quickly.