# Public Business Lead Extractor (`express_kingfisher/public-business-lead-extractor`) Actor

Extract public business contact information from company websites: emails, phone numbers, social profiles, team pages, and company metadata.

- **URL**: https://apify.com/express\_kingfisher/public-business-lead-extractor.md
- **Developed by:** [Prince Raj](https://apify.com/express_kingfisher) (community)
- **Categories:** Lead generation, Automation, Developer tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $0.01 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Public Business Lead Page Extractor

Extract public business contact information from company websites. Built for B2B sales teams, agencies, and freelancers who need to build prospect lists from public data.

### What It Extracts

- **Company info**: Business name, domain, inferred category
- **Contact data**: Public email addresses, phone numbers, contact page URLs
- **Social profiles**: LinkedIn, Twitter, Facebook, Instagram, GitHub
- **Decision pages**: About, Team, Careers, Pricing, Contact page links
- **Lead quality score**: Composite score based on data completeness

### Why It's Better

Focuses on public business contact data only - no personal data scraping, no gray areas. Extracts from multiple page types (contact, about, team) and scores lead quality automatically.

### Input

| Field | Type | Default | Description |
|-------|------|---------|-------------|
| `urls` | string[] | required | Business website URLs to scan |
| `maxPagesPerDomain` | integer | 5 | Pages to scan per domain (1-20) |
| `extractEmails` | boolean | true | Extract public email addresses |
| `extractPhones` | boolean | true | Extract public phone numbers |
| `extractSocials` | boolean | true | Extract social media profiles |
| `extractDecisionPages` | boolean | true | Find About, Team, Careers pages |

### Output Example

```json
{
  "domain": "acme.com",
  "companyName": "Acme Corp",
  "emails": ["info@acme.com", "sales@acme.com"],
  "phones": ["+1 (555) 123-4567"],
  "socialProfiles": [
    { "platform": "linkedin", "url": "https://linkedin.com/company/acme" },
    { "platform": "twitter", "url": "https://twitter.com/acmecorp" }
  ],
  "contactPage": "https://acme.com/contact",
  "aboutPage": "https://acme.com/about",
  "careersPage": "https://acme.com/careers",
  "pricingPage": "https://acme.com/pricing",
  "leadScore": 78,
  "enrichmentNotes": "Has pricing page and careers page - active business",
  "scrapedAt": "2025-01-15T10:30:00Z"
}
````

### Use Cases

- **B2B prospecting**: Build lead lists from company websites
- **Sales research**: Gather contact info before outreach
- **Agency client research**: Quick company profiling
- **Partnership outreach**: Find the right contact channels
- **Market mapping**: Catalog businesses in a specific industry

### PPE Pricing

| Event | Description | Suggested Price |
|-------|-------------|-----------------|
| `domain-processed` | One domain fully processed | $0.005 |
| `lead-extracted` | Lead with contact data found | $0.003 |

### Limitations

- Only extracts publicly visible contact information
- Cannot find personal emails (only generic business emails)
- Phone number detection works for common formats
- Some sites may block automated scraping
- Lead scoring is heuristic-based

### Legal/Ethical Use

This actor only extracts publicly visible business contact information. It does not scrape personal data, infer private information, or bypass authentication. Users are responsible for compliance with applicable data protection laws (GDPR, CCPA, etc.).

### Local Run

```bash
cd actors/public-business-lead-extractor
apify run --input-file .actor/sample_input.json
```

### Deploy

```bash
cd actors/public-business-lead-extractor
apify push
```

### FAQ

**Q: Does it find personal email addresses?**
A: No, it only extracts generic business emails (info@, sales@, contact@). It does not attempt to find personal email addresses.

**Q: How does lead scoring work?**
A: Score is based on data completeness (emails, phones, social profiles, decision pages). Higher scores indicate more complete, actionable leads.

**Q: Can I scan multiple pages per domain?**
A: Yes, set `maxPagesPerDomain` to crawl additional pages (contact, about, team, careers).

### Tags

business lead extractor, B2B prospecting, contact finder, company research, lead generation, email extractor, phone finder, social profile finder, business intelligence

# Actor input Schema

## `urls` (type: `array`):

Company website URLs to extract leads from.

## `maxPagesPerDomain` (type: `integer`):

Number of pages to crawl per domain for contact discovery.

## `extractEmails` (type: `boolean`):

Find public email addresses on the website.

## `extractPhones` (type: `boolean`):

Find public phone numbers on the website.

## `extractSocials` (type: `boolean`):

Find social media profile links (LinkedIn, Twitter, etc.)

## `extractDecisionPages` (type: `boolean`):

Locate team, about, and leadership pages.

## Actor input object example

```json
{
  "urls": [
    "https://apify.com"
  ],
  "maxPagesPerDomain": 5,
  "extractEmails": true,
  "extractPhones": true,
  "extractSocials": true,
  "extractDecisionPages": true
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "urls": [
        "https://apify.com"
    ]
};

// Run the Actor and wait for it to finish
const run = await client.actor("express_kingfisher/public-business-lead-extractor").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "urls": ["https://apify.com"] }

# Run the Actor and wait for it to finish
run = client.actor("express_kingfisher/public-business-lead-extractor").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "urls": [
    "https://apify.com"
  ]
}' |
apify call express_kingfisher/public-business-lead-extractor --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=express_kingfisher/public-business-lead-extractor",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Public Business Lead Extractor",
        "description": "Extract public business contact information from company websites: emails, phone numbers, social profiles, team pages, and company metadata.",
        "version": "1.0",
        "x-build-id": "w4hn1WtC6ZSuZf4UQ"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/express_kingfisher~public-business-lead-extractor/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-express_kingfisher-public-business-lead-extractor",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/express_kingfisher~public-business-lead-extractor/runs": {
            "post": {
                "operationId": "runs-sync-express_kingfisher-public-business-lead-extractor",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/express_kingfisher~public-business-lead-extractor/run-sync": {
            "post": {
                "operationId": "run-sync-express_kingfisher-public-business-lead-extractor",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "urls"
                ],
                "properties": {
                    "urls": {
                        "title": "Website URLs",
                        "maxItems": 50,
                        "type": "array",
                        "description": "Company website URLs to extract leads from.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxPagesPerDomain": {
                        "title": "Max pages per domain",
                        "minimum": 1,
                        "maximum": 15,
                        "type": "integer",
                        "description": "Number of pages to crawl per domain for contact discovery.",
                        "default": 5
                    },
                    "extractEmails": {
                        "title": "Extract email addresses",
                        "type": "boolean",
                        "description": "Find public email addresses on the website.",
                        "default": true
                    },
                    "extractPhones": {
                        "title": "Extract phone numbers",
                        "type": "boolean",
                        "description": "Find public phone numbers on the website.",
                        "default": true
                    },
                    "extractSocials": {
                        "title": "Extract social profiles",
                        "type": "boolean",
                        "description": "Find social media profile links (LinkedIn, Twitter, etc.)",
                        "default": true
                    },
                    "extractDecisionPages": {
                        "title": "Find decision-maker pages",
                        "type": "boolean",
                        "description": "Locate team, about, and leadership pages.",
                        "default": true
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
