Pricing

from $15.00 / 1,000 results

Company Deep Research — AI Agent Dossier API

One-call company intelligence dossier for AI agents: website meta, tech stack, socials, contacts, SEO basics, news signals, competitors and blog/RSS from a company name or domain. No API key, no browser.

Pricing

from $15.00 / 1,000 results

Rating

0.0

(0)

Developer

Logiover

Actor stats

Bookmarked

Total users

Monthly active users

7 days ago

Last modified

Company Deep Research Scraper — AI Company Intelligence & Enrichment API (No API Key)

Turn a company name or domain into a complete, structured intelligence dossier in one Actor run. Give it stripe.com (or just "Stripe") and get back a clean JSON record with website meta, tech stack, social profiles, contact emails & phones, SEO basics, recent news signals, competitor domains, JSON-LD firmographics (founded year, employee range, industry, HQ) and RSS/blog feeds — aggregated from the site itself plus keyless public sources. Batch up to 500 companies per run. No API key, no login, no headless browser.

🏆 Why this company research scraper?

~40 fields per company · hundreds of dossiers per run · direct HTTP + lightweight HTML parsing (no browser) · name→domain resolution · export to JSON / CSV / Excel. The unofficial company enrichment API alternative for AI agents, sales/rev-tech, CRM enrichment and market mapping — one call replaces five separate scrapers.

✨ What this Actor does / Key features

🧠 Company identity — name, description, logo, favicon, Open Graph / Twitter Card metadata, detected languages, ogType, canonical website URL.
🛠️ Tech-stack fingerprinting — 40+ technologies detected from HTML, script/stylesheet paths and HTTP headers: Next.js, React, Vue, Angular, Svelte, WordPress, Shopify, Webflow, Wix, Cloudflare, Vercel, Netlify, Google Analytics, GTM, HubSpot, Intercom, Segment, Stripe, Sentry, Datadog, Algolia, Tailwind and more.
🔗 Social profiles — LinkedIn, X/Twitter, GitHub, Facebook, Instagram, YouTube, TikTok, Discord and Telegram, from homepage links + schema.org sameAs.
✉️ Contact channels — emails (from mailto: + text regex, junk filtered) and phone numbers (from tel: + international regex).
📈 SEO basics — title tag, meta description, H1/H2 headings, plus robots.txt parsed for discovered sitemapUrls.
📰 News signals — recent company news (last ~30 days) via Google News RSS, each with title, URL, publish date and source.
⚔️ Competitor discovery — alternative/competitor domains mined from public search result pages, with junk domains (Wikipedia, social, etc.) filtered out.
🏢 Firmographics from JSON-LD — schema.org Organization/Corporation blocks parsed for foundedYear, employeesRange, industry, country and city.
📡 RSS / blog feeds — <link rel=alternate> feed tags, blog links and common feed-path guesses (/feed, /rss, /blog/feed).
⚡ Direct HTTP + JSON extraction — pure fetch + HTML parsing on a small Node 20 container; no Playwright to keep warm, no GraphQL token.
🛡️ Proxy support — Apify datacenter proxy by default (avoids per-IP rate limits); optional RESIDENTIAL for bot-walled targets.

🚀 Quick start (3 steps)

Configure — pick a mode (domain, name or single) and paste your targets: a list of domains, a list of company names, or a single domain. Trim sections if you only need part of the dossier.
Run — click Start. The Actor resolves each company, enriches it in parallel and streams one dossier per company into your dataset.
Get your data — open the Output tab and export to JSON, CSV, Excel or XML, or pull it via the Apify API. Feed it straight into an LLM prompt, a vector DB or your CRM.

📥 Input

Pick a mode, then provide targets for that mode (domains / domain / companyNames). Everything else is optional.

Example — enrich a batch of domains (highest volume)

{
  "mode": "domain",
  "domains": ["stripe.com", "linear.app", "figma.com", "notion.so", "vercel.com"],
  "sections": ["meta", "techStack", "socials", "contacts", "seo"],
  "concurrency": 5,
  "useApifyProxy": true
}

Example — resolve company names → website → dossier (messy lead lists)

{
  "mode": "name",
  "companyNames": ["Notion", "Figma", "Linear", "Vercel", "Supabase"],
  "sections": ["meta", "techStack", "socials", "contacts", "news", "competitors"],
  "concurrency": 4
}

Example — deep single dossier (all sections, max depth)

{
  "mode": "single",
  "domain": "openai.com",
  "maxNews": 15,
  "maxCompetitors": 10
}

Field	Type	Description
`mode`	string	`domain` (batch enrich), `name` (resolve names → website → enrich), or `single` (one deep dossier). Required.
`domains`	array	Domains/URLs for `domain` mode, e.g. `stripe.com`, `https://openai.com`. Normalized automatically (protocol + `www` stripped).
`domain`	string	Single domain or URL for `single` mode.
`companyNames`	array	Free-text company names for `name` mode, e.g. `Notion`, `Figma`. Resolved to a website first.
`sections`	array	Dossier sections to collect: `meta`, `techStack`, `socials`, `contacts`, `seo`, `news`, `competitors`, `rss`. Fewer = faster & cheaper per company.
`maxNews`	integer	Max news items per company (0–50).
`maxCompetitors`	integer	Max competitor domains per company (0–30).
`concurrency`	integer	Parallel companies in batch modes (1–20). Lower for fragile targets.
`useApifyProxy`	boolean	Route requests through Apify datacenter proxy (AUTO). Recommended — avoids per-IP rate limits.
`proxyGroups`	array	Override proxy group, e.g. `["RESIDENTIAL"]` for bot-walled targets. Leave empty for datacenter.

Which mode? Use domain when you already have websites (fastest, batch up to 500). Use name when you only have messy company names from a form or CRM. Use single for one company you want fully enriched at maximum depth. The output schema is identical across all three.

📤 Output

One row per company — a complete ~40-field intelligence dossier, exportable to JSON, CSV, Excel or XML. Here is a trimmed sample record:

{
  "input": "stripe.com",
  "resolvedDomain": "stripe.com",
  "websiteUrl": "https://stripe.com/",
  "companyName": "Stripe",
  "title": "Stripe | Financial Infrastructure to Grow Your Revenue",
  "description": "Stripe is a suite of APIs powering online payment processing...",
  "logo": "https://stripe.com/img/v3/home/social.png",
  "favicon": "https://www.google.com/s2/favicons?domain=stripe.com&sz=64",
  "languages": ["en-US"],
  "ogType": "website",
  "techStack": ["Next.js", "React", "Cloudflare", "Stripe", "Google Analytics", "Segment", "Sentry"],
  "socials": [
    { "platform": "twitter", "url": "https://twitter.com/stripe" },
    { "platform": "linkedin", "url": "https://www.linkedin.com/company/stripe" },
    { "platform": "github", "url": "https://github.com/stripe" }
  ],
  "emails": ["support@stripe.com"],
  "phones": [],
  "linkedinUrl": "https://www.linkedin.com/company/stripe",
  "twitterUrl": "https://twitter.com/stripe",
  "githubUrl": "https://github.com/stripe",
  "youtubeUrl": "https://www.youtube.com/stripe",
  "employeesRange": "5001-10000",
  "foundedYear": 2010,
  "industry": "Fintech",
  "country": "US",
  "city": "South San Francisco",
  "seoTitleTag": "Stripe | Financial Infrastructure to Grow Your Revenue",
  "seoMetaDescription": "Millions of businesses of all sizes use Stripe...",
  "seoHeadings": ["Payments", "Online payments", "In-person payments"],
  "sitemapUrls": ["https://stripe.com/sitemap.xml"],
  "news": [
    { "title": "Stripe announces new product...", "url": "https://...", "publishedAt": "2026-06-28", "source": "TechCrunch" }
  ],
  "competitors": ["paypal.com", "adyen.com", "braintreepayments.com", "square.com"],
  "rssFeeds": ["https://stripe.com/blog/feed"],
  "blogUrl": "https://stripe.com/blog",
  "httpsValid": true,
  "scrapedAt": "2026-07-06T12:00:00.000Z"
}

Tip: use the Overview dataset view to scan many companies at a glance, the Social & contact view for outreach lists, and the News signals view for monitoring.

💡 Use cases

CRM / lead enrichment — feed a column of domains or company names and enrich each with industry, employee range, tech stack, LinkedIn and contact channels.
AI-agent grounding (RAG) — give agents current, structured company facts instead of stale training-set knowledge; embed each dossier into a vector DB.
Sales & account research — qualify accounts by stack, size and news momentum, then build a competitor set before outreach.
Market mapping — build a landscape of hundreds of companies in a niche and cluster, filter and rank them as structured rows.
Vendor / tool evaluation — compare candidate SaaS vendors by website, tech stack, docs and recent news in one pass.
Competitive & news monitoring — schedule recurring runs over a watchlist and diff datasets to track tech, news and competitor movement.

👥 Who uses it

AI-agent & LLM-app builders (Claude, ChatGPT, Cursor, LangGraph, CrewAI, AutoGen tools) · sales & rev-tech / GTM teams enriching lead lists · growth and demand-gen agencies · market researchers & data journalists building company landscapes · investors & analysts doing lightweight due diligence · developers building an internal company database or MCP tool.

💰 Pricing

This Actor runs on a simple pay-per-result model — you're charged one result event per company dossier produced, with no separate Apify platform fees to calculate. Runs that yield zero companies (bad input, blocked target) are not billed. Try it on the free tier first, then scale up. See the Pricing tab on this page for the current rate.

❓ Frequently Asked Questions

Is this a company enrichment / research API alternative? Yes. Instead of paying for a closed enrichment API, this Actor aggregates a company's own public homepage plus keyless public sources into one structured dossier — an unofficial company enrichment API alternative that returns machine-readable JSON with no API key.

Does this Actor need an API key or a login? No. It uses only keyless public endpoints (public search for name→domain resolution, Google News RSS) and direct HTTP fetches of company websites. You just need an Apify account — no company logins, no third-party API keys.

How do I export the data to CSV or JSON? Run the Actor, then export the resulting dataset as CSV, JSON, Excel or XML from the Apify console or via the Apify API. Each row is one company dossier.

Is it legal to use this Actor? It fetches only publicly available web pages and keyless public APIs — it does not authenticate, bypass access controls or scrape behind logins. You are responsible for using the data in compliance with each website's Terms of Service and applicable laws such as GDPR.

How much data can I get / how many companies per run? Practically unlimited — the Actor streams results and charges per dossier. domain mode accepts batches up to 500; concurrency is capped at 20 to stay polite. A few hundred companies per run is comfortable; for thousands, split into batches.

How do I research a company when I only have its name, not the website? Use name mode. Pass free-text names like "Figma" or "Linear" and the Actor resolves each to its website first (via public search + favicon lookup), then enriches — perfect for messy CRM or form data.

How accurate is the tech-stack detection? The fingerprint library covers 40+ of the most common technologies from client-side signals (script paths, framework markers) and server headers. It will miss server-only tech (databases, backend languages) and deeply bundled apps, but for vendor reconnaissance and quick triage it's reliable. For exhaustive Wappalyzer-grade detection, pair it with a dedicated tech-stack detector.

Will it work on sites behind Cloudflare or a login wall? Homepages rarely block. If a target returns 403/503, the dossier is still produced with whatever fields could be collected and the domain recorded. For systematically walled targets, set proxyGroups: ["RESIDENTIAL"].

Does it find personal emails of employees? No. It collects contact emails published on the company homepage (support, info, press). For employee/person emails, use a dedicated LinkedIn or person-email Actor.

Can AI agents call this directly? Yes — this is the primary design target. Expose it through an MCP server or an Apify tool integration; the agent passes a company name/domain and gets structured JSON back, with no browsing or HTML parsing on the agent side.

🔗 More AI & research scrapers by logiover

Building an AI research or company-intelligence pipeline? Pair this Actor with the rest of the ai-research-intel suite:

Actor	What it does
AI Deep Research	Multi-source, agentic deep-research reports on any topic
AI Web Extract	LLM-powered structured extraction from any web page
AI Web Search	Programmatic web search results for agents & RAG
News Intelligence Scraper	Topic-level multi-source news + signals
Google News Scraper	Google News articles by keyword, with dates & sources
Competitor Ad Intelligence	Track competitors' ad creatives & spend signals
GitHub Repository Scraper	Repo metadata, activity and stats for a company's OSS
npm Package Intelligence	npm package stats, deps and maintainer signals
arXiv Paper Scraper	Academic papers, authors and abstracts for research
SERP Keyword Research	Search-engine result pages & keyword data
CVE Security Advisory Monitor	Track CVEs & security advisories for a vendor

👉 Browse all logiover scrapers on Apify Store — 180+ actors across real estate, jobs, crypto, social media & B2B data.

⭐ Support & feedback

Found a bug or need an extra field? Open an issue on the Issues tab — response is usually fast. If this Actor saves you time, a ★★★★★ review on the Store page genuinely helps and is hugely appreciated. 🙏

⚖️ Legal

This Actor fetches only publicly available web pages and keyless public APIs. It does not authenticate, bypass access controls or scrape behind logins. It is intended for legitimate research, sales intelligence and AI-agent grounding on data that is already public. You are responsible for complying with each website's Terms of Service, robots.txt, GDPR and any applicable local laws.

📝 Changelog

2026-07-06

✨ README overhaul: badge row, ready-to-run example scenarios per mode, trimmed real-schema output sample with a collapsible full field reference, keyword-gap FAQ, and the ai-research-intel cross-promo grid.

2026-07-02 — v1.0

Initial release.
3 modes: domain (batch), name (resolve → enrich), single (deep).
8 dossier sections: meta, techStack, socials, contacts, seo, news, competitors, rss.
40+ tech-stack fingerprints.
schema.org Organization JSON-LD parsing.
Apify datacenter proxy by default, residential opt-in.
Pay-per-result (result event per dossier).

AI Company Research Agent

constant_quadruped/ai-company-research-agent

Get comprehensive company intelligence in seconds. Research any company for tech stack, key employees, competitors, news sentiment & AI insights.

Domain Intelligence API: whois, DNS, SSL, tech stack

hereditary_model/domain-intelligence

One call returns a complete domain dossier: registration age, DNS, SSL, tech stack, site size, socials, and a trust score. Built for due diligence agents.

Aaron Marxsen

Company Enrichment from Domain

heenalr/company-enrichment

Enrich a company domain into name, description, emails, phones, social profiles, and tech stack — from the company's own public website. No LinkedIn. Built for AI agents.

Heenal Rajani

Deep Intelligence Lookup

startuphub/deep-intelligence

A deep intelligence dossier on any person or company from a name, company, domain, LinkedIn URL, or email. Free redacted preview or full unredacted report.

StartupHub

AI Company Researcher

sthiven_r/ai-company-researcher

Turn a domain into a sales dossier: what the company does, their tech stack (technographics), contacts, hiring & e-commerce signals, social profiles — plus optional AI summary & outreach email. For SDRs and sales teams.

Wilker Sthiven Rangel Manrique

Company Tech-Stack & Domain Intelligence API

technicaldost/company-intelligence-api

Domain to company tech stack, founded year, headcount, industry, funding, socials & logo. Open-data company enrichment API. A BuiltWith + Clearbit alternative.

Technical Dost Solutions

Company Research Scraper — Deep Company Intelligence Data

scrapepilot/aiendn

Extract deep company intelligence data from any domain. Get company name, description, employees, tech stack, social links, GitHub stats, Wikipedia summary, executive leadership, recent news, and contact emails — from 8 sources in parallel. $0.06 per company.

Scrape Pilot

Startup Intelligence MCP Server

ritindersingh/startup-intelligence-mcp

MCP server for AI agents. Given a company name or domain, returns funding stage, team size, tech stack, news, and hiring signals in one tool call.

Ritinder Singh

WebSight API — AI-Native Web Intelligence

george.the.developer/websight-api

One API call for 7 types of web intelligence from any URL: clean markdown, tech stack, SEO audit, contacts, structured data, AI score, domain intel. Token-optimized for AI agents. Cached.

George Kioko

Company Data Enrichment — Domain to Company Profile

jimmywhyte/company-enrichment

Feed a list of domains and get a company profile for each: name, description, industry, logo, location, founded year, employee size, tech stack, emails, and socials — pulled from the company's own website. No LinkedIn, no cookies. Pipeline-ready for bulk CRM enrichment.