Decision Maker Name & Email Extractor

Developed by

Dominic M. Quaiser

Maintained by Community

An actor that crawls a website to identify key decision‑maker names and job titles, then uses NER‑powered matching to extract and pair their email addresses for streamlined lead generation and B2B data enrichment.

[0.2.0—alpha]

This actor is designed to crawl websites and extract contact information for key decision-makers. It navigates through pages, identifies potential names and job titles, and extracts email addresses.

The actor uses a machine learning model (NER - Named Entity Recognition) to validate that a found name is likely a person, and then attempts to intelligently match that person with a found email address.
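The matching step can be illustrated with a small sketch. This is not the actor's actual algorithm, which is not documented here; it assumes a simple heuristic that compares the local part of an email address against common name patterns. The function names `candidate_local_parts`, `match_score`, and `best_email_for`, and the 0.8 threshold, are hypothetical. The NER validation step is not shown.

```python
from difflib import SequenceMatcher


def candidate_local_parts(full_name):
    """Generate common email local-part patterns for a person's name (assumed patterns)."""
    # Keep purely alphabetic tokens only; a simplification for this sketch.
    parts = [p.lower() for p in full_name.split() if p.isalpha()]
    if len(parts) < 2:
        return parts
    first, last = parts[0], parts[-1]
    return [
        f"{first}.{last}",     # jane.doe
        f"{first[0]}.{last}",  # j.doe
        f"{first[0]}{last}",   # jdoe
        f"{first}{last}",      # janedoe
        first,
        last,
    ]


def match_score(full_name, email):
    """Best similarity (0.0 to 1.0) between the email's local part and the name patterns."""
    local = email.split("@", 1)[0].lower()
    candidates = candidate_local_parts(full_name)
    if not candidates:
        return 0.0
    return max(SequenceMatcher(None, local, c).ratio() for c in candidates)


def best_email_for(full_name, emails, threshold=0.8):
    """Return (email, score) for the best match, or (None, score) if below the threshold."""
    scored = [(match_score(full_name, e), e) for e in emails]
    score, email = max(scored, default=(0.0, None))
    return (email, score) if score >= threshold else (None, score)
```

With this heuristic, a generic address such as info@example.com scores low against "Jane Doe" and stays unmatched, while j.doe@example.com matches one of the generated patterns exactly.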

⚠️ Disclaimer: This Apify actor is currently under development. Its features and behavior may change frequently.

Input

The actor requires the following input:

  • Start URLs (startUrls): The initial web page URL where the crawl will begin. The actor will automatically stay on the same domain as the Start URL.
  • Proxy Configuration (proxyConfiguration): Standard Apify proxy settings to be used for the crawl. It is highly recommended to use a proxy.
  • Memory: Although the new machine learning model can run on 2 GB of RAM, I recommend at least 4 GB for significantly better performance.
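The same-domain restriction on Start URLs can be pictured with a short sketch. How the actor treats subdomains is not stated here, so this example assumes an exact host match that ignores a leading "www."; the helper names `_host` and `same_domain_links` are hypothetical.

```python
from urllib.parse import urljoin, urlparse


def _host(url):
    """Lower-cased hostname without a leading 'www.' (an assumption of this sketch)."""
    host = (urlparse(url).hostname or "").lower()
    return host[4:] if host.startswith("www.") else host


def same_domain_links(start_url, hrefs):
    """Resolve hrefs against start_url and keep only http(s) links on the same host."""
    start_host = _host(start_url)
    kept = []
    for href in hrefs:
        absolute = urljoin(start_url, href)  # resolves relative links such as "/about-us"
        parsed = urlparse(absolute)
        if parsed.scheme in ("http", "https") and _host(absolute) == start_host:
            kept.append(absolute)
    return kept
```

Links to other hosts and non-web schemes such as mailto: are dropped, so a crawl seeded this way never leaves the site it started on.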

Example Input:

{
  "proxyConfiguration": {
    "useApifyProxy": true
  },
  "startUrls": [
    {
      "url": "https://example.com/",
      "method": "GET"
    }
  ]
}
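If the input is assembled programmatically, a small helper along these lines could produce the same structure as the example input. The function name `build_run_input` and the URL check are assumptions made for illustration; only the field names come from the example.

```python
import json
from urllib.parse import urlparse


def build_run_input(start_url, use_apify_proxy=True):
    """Build an input object shaped like the example input for a single start URL."""
    parsed = urlparse(start_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"start_url must be an absolute http(s) URL, got {start_url!r}")
    return {
        "proxyConfiguration": {"useApifyProxy": use_apify_proxy},
        "startUrls": [{"url": start_url, "method": "GET"}],
    }


if __name__ == "__main__":
    print(json.dumps(build_run_input("https://example.com/"), indent=2))
```

With the `apify-client` Python package, a dictionary like this can be passed as `run_input` to `ApifyClient(token).actor(actor_id).call(...)`. The actor ID is not given in this document, so it is left as a placeholder.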

Output

The actor provides a single, aggregated result for each domain crawled. The output contains a list of all found contacts, categorized by how the information was found.

Each item in the contacts list will have a contact_type which can be:

  • person_with_email: A person's name and title were found and successfully matched to an email address.
  • person_only: A person's name and title were found, but no corresponding email could be matched.
  • email_only: An email address was found, but it could not be matched to a specific person.

Example Output Snippet:

[{
  "url": "example.com",
  "status_code": 200,
  "emails": [
    "info@example.com",
    "j.doe@example.com"
  ],
  "people": [
    {
      "name": "Jane Doe",
      "title": "Chief Executive Officer"
    }
  ],
  "source_urls": [
    "https://example.com/",
    "https://example.com/about-us",
    "https://example.com/contact"
  ],
  "contacts": [
    {
      "name": "Jane Doe",
      "title": "Chief Executive Officer",
      "email": "j.doe@example.com",
      "match_score": 0.95,
      "contact_type": "person_with_email"
    },
    {
      "name": null,
      "title": null,
      "email": "info@example.com",
      "match_score": 0,
      "contact_type": "email_only"
    }
  ],
  "is_aggregated": true
}]
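Downstream code will typically split the contacts list by contact_type. The following is a minimal sketch that groups contacts and keeps only confidently matched people. It assumes match_score ranges from 0 to 1, as the example suggests; the function names and the 0.8 cutoff are hypothetical.

```python
from collections import defaultdict


def group_contacts(items):
    """Group all contacts from the dataset items by their contact_type."""
    grouped = defaultdict(list)
    for item in items:
        for contact in item.get("contacts", []):
            grouped[contact.get("contact_type", "unknown")].append(contact)
    return dict(grouped)


def confident_leads(items, min_score=0.8):
    """Return person_with_email contacts whose match_score is at least min_score."""
    matched = group_contacts(items).get("person_with_email", [])
    return [c for c in matched if (c.get("match_score") or 0) >= min_score]


# Trimmed copy of the example output, used as sample data.
sample_items = [{
    "url": "example.com",
    "contacts": [
        {"name": "Jane Doe", "title": "Chief Executive Officer",
         "email": "j.doe@example.com", "match_score": 0.95,
         "contact_type": "person_with_email"},
        {"name": None, "title": None, "email": "info@example.com",
         "match_score": 0, "contact_type": "email_only"},
    ],
}]

if __name__ == "__main__":
    for lead in confident_leads(sample_items):
        print(lead["name"], lead["title"], lead["email"])
```

Keeping the threshold as a parameter makes it easy to trade precision for coverage once real match scores have been inspected.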

Limitations

  • Crawl Limit: The scraping process is limited to approximately 50 pages per run. This may not be sufficient for very large websites.
  • English Title Limitation: The actor currently recognizes English job titles only.
  • Dynamic Content Limitation: The actor does not currently work for websites that rely heavily on dynamically loaded JavaScript content.

Pricing

The cost of a run will vary based on the complexity of the target website and the number of pages crawled. As a rough estimate, a run that yields a significant number of results might cost around $0.25. However, this is not a guarantee and the actual cost may be higher or lower.

Future Improvements

  • Support for bulk URL input.
  • Customizable job title keywords and support for multilingual job titles.
  • Headless browser for JavaScript-heavy sites.
  • Custom machine learning model.

You are solely responsible for determining the legality of your use of this actor and the data it generates. The scraping and handling of data, particularly personal information, is subject to complex legal frameworks, including but not limited to the General Data Protection Regulation (GDPR/DSGVO), copyright laws, and the terms of service of the websites you scrape.

It is your responsibility to ensure that your specific use case is compliant with all applicable laws. This includes obtaining any necessary consents, having a legitimate interest, and adhering to all data protection principles if you process personal data.

The developer of this actor provides no warranty and assumes no liability for any misuse of this tool or for any legal issues that may arise from your use of the data. This text does not constitute legal advice.

Maintainer

For any questions, issues, or feature requests, please contact me.