Recruiter Lead Finder – HR & Talent Acquisition Contacts avatar

Recruiter Lead Finder – HR & Talent Acquisition Contacts

Pricing

from $2.50 / 1,000 recruiter lead finders

Go to Apify Store
Recruiter Lead Finder – HR & Talent Acquisition Contacts

Recruiter Lead Finder – HR & Talent Acquisition Contacts

Crawl company websites to find recruiters, HR managers, talent acquisition leads. Extract emails, LinkedIn, phones, Twitter. Smart prioritization, dedup across domains. Pay only for pages with new contacts ($2.50/1K)

Pricing

from $2.50 / 1,000 recruiter lead finders

Rating

0.0

(0)

Developer

Scrape Pilot

Scrape Pilot

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

2

Monthly active users

11 days ago

Last modified

Share


🎯 Recruiter Lead Finder – Extract HR & Talent Acquisition Contacts (Emails, LinkedIn, Phone)

Crawl company websites to find recruiters, HR managers, talent acquisition specialists, and people ops contacts. Extract email addresses, LinkedIn profiles, phone numbers, Twitter handles – automatically deduplicated across domains.
Perfect for recruitment agencies, sales teams, B2B lead generation, and talent sourcing. Smart crawling prioritises contact pages, team directories, and “people” sections. Includes Playwright fallback for JavaScript‑heavy sites and checkpoint/resume.


💡 What is the Recruiter Lead Finder?

The Recruiter Lead Finder is an intelligent Apify actor that automatically discovers HR and talent acquisition contacts from any company website. It:

  • Crawls a domain (configurable depth, default 40 pages) while staying on the same site
  • Prioritises pages likely to contain contact info: /contact, /team, /about, /people, /leadership, /careers, /recruit, /talent
  • Extracts email addresses (validated, de‑obfuscated, filtered from file extensions and year ranges)
  • Extracts LinkedIn profiles (individual /in/ and company /company/ URLs)
  • Extracts phone numbers (validated, rejecting dates and copyright years)
  • Extracts Twitter/X handles (skipping share/intent links)
  • Detects job titles matching HR/TA/People roles (Recruiter, Talent Acquisition, HR Manager, People Partner, Sourcer, etc.)
  • Matches page content against target keywords (customisable)
  • Outputs two types of items:
    1. Page‑scan items – one per crawled page, including all contacts found on that page.
    2. Contact items – one per unique contact (email, LinkedIn, phone, Twitter), deduplicated globally across all start URLs in the same run.

The actor uses curl_cffi with Chrome impersonation for fast HTTP requests and falls back to Playwright (headless Chromium) for JavaScript‑rendered pages. It respects maxPagesPerDomain, supports checkpoint/resume, and automatically skips locale‑specific pages (e.g., /de/, /fr/) to avoid duplicate content.

Pricing: Pay only for successfully crawled pages that contain new contacts (deduplicated). $2.50 per 1,000 contact‑bearing pages + $0.02 actor start. Failed pages cost nothing.


🚀 Key Features

FeatureDescription
Smart prioritisationCrawls contact, team, about, people, leadership, careers, recruit, talent pages first.
HR/TA keyword matchingDetects pages mentioning recruiter, talent acquisition, HR, people ops, sourcing, etc.
Contact extractionEmails (mailto + obfuscated), LinkedIn profiles (in/company), phone numbers, Twitter handles.
Job title detectionExtracts specific HR roles (Recruiter, Talent Acquisition Specialist, HR Manager, People Partner, etc.).
Global deduplicationAcross multiple start URLs – the same contact appears only once (saves money).
Dual fetch engineFast HTTP (curl_cffi) + Playwright fallback for JS‑heavy sites.
Checkpoint & resumeSaves progress after each page; resumes from last state if interrupted.
Skip locale pagesIgnores /de/, /fr/, /zh-Hans/ variants to avoid crawling the same content multiple times.
Residential proxy readyBypass IP blocks (recommended for large crawls).
Pay‑per‑result pricing$0.02 actor start + $2.50 per 1,000 pages that yield new contacts. Failed pages are free.

📥 Input Parameters

ParameterTypeRequiredDefaultDescription
startUrlsarray of objectsYesList of starting URLs, e.g., [{"url": "https://stripe.com"}].
targetKeywordsarray of stringsNoBuilt‑in list (recruiter, talent, hr, human resources, hiring, talent acquisition, people, people team, sourcer, recruiting, talent partner, people ops, hr manager)Custom keywords to prioritise pages.
maxPagesPerDomainintegerNo40Maximum pages to crawl per domain.
maxConcurrencyintegerNo20Number of concurrent HTTP requests.
maxContactsPerPageintegerNo100Limit extracted contacts per page.
usePlaywrightFallbackbooleanNotrueUse headless browser for pages that fail HTTP or have little content.
skipLocalizedPagesbooleanNotrueSkip URLs containing locale prefixes (/de/, /fr/).
checkpointKeystringNo""Custom checkpoint key to isolate state for different runs.
proxyConfigurationobjectNoApify proxy configuration. Residential strongly recommended.

Example Input

{
"startUrls": [
{"url": "https://stripe.com"},
{"url": "https://shopify.com"}
],
"maxPagesPerDomain": 30,
"maxConcurrency": 15,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

📤 Output Fields

The actor pushes two types of items to the dataset:

1. Page‑scan item (record_type: "page_scan")

FieldTypeDescription
record_typestring"page_scan"
company_domainstringDomain of the crawled website.
source_urlstringURL of the crawled page.
page_titlestringExtracted page title.
meta_descriptionstringMeta description (if any).
matched_keywordsarrayWhich target keywords were found on this page.
job_titlesarrayExtracted HR/TA job titles from the page.
emailsarrayEmail addresses found.
linkedin_profilesarrayLinkedIn profile URLs.
phone_numbersarrayPhone numbers.
twitter_handlesarrayTwitter/X profile URLs.
new_contacts_foundintegerNumber of new contacts discovered on this page (globally deduplicated).
page_previewstringFirst 800 characters of cleaned page text.
status_codeintegerHTTP status code (if any).
errorstringError message if failed.

2. Contact item (record_type: "contact")

FieldTypeDescription
record_typestring"contact"
contact_typestringemail, linkedin_profile, phone, twitter_handle.
contact_valuestringThe actual contact (email, URL, phone number, Twitter handle).
job_titles_on_pagearrayHR/TA job titles extracted from the page where the contact was found.
company_domainstringCompany domain.
source_urlstringURL where the contact was discovered.
page_titlestringTitle of that page.
matched_keywordsarrayKeywords matched on the page.

Example Output (Page‑scan)

{
"record_type": "page_scan",
"company_domain": "stripe.com",
"source_url": "https://stripe.com/jobs",
"page_title": "Jobs at Stripe",
"meta_description": "Join Stripe and help build the economic infrastructure of the internet.",
"matched_keywords": ["recruiter", "talent", "hiring", "careers"],
"job_titles": ["Recruiter", "Talent Acquisition Partner"],
"emails": ["talent@stripe.com"],
"linkedin_profiles": ["https://linkedin.com/company/stripe"],
"phone_numbers": [],
"twitter_handles": [],
"new_contacts_found": 2,
"page_preview": "We are hiring...",
"status_code": 200,
"error": ""
}

Example Output (Contact)

{
"record_type": "contact",
"contact_type": "email",
"contact_value": "talent@stripe.com",
"job_titles_on_page": ["Recruiter", "Talent Acquisition Partner"],
"company_domain": "stripe.com",
"source_url": "https://stripe.com/jobs",
"page_title": "Jobs at Stripe",
"matched_keywords": ["recruiter", "talent", "hiring", "careers"]
}

💰 Pricing

ComponentPrice
Actor start (per run)$0.02
Per successful page that yields new contacts$0.0025
Per 1,000 such pages$2.50
  • You are charged only for pages that are successfully fetched, parsed, and contain at least one new contact (globally deduplicated across all start URLs in the same run).
  • Pages that return errors, have no contacts, or contain only already‑seen contacts are free.
  • Example: 500 pages with new contacts = $0.02 + (500 × $0.0025) = $1.27.
  • Example: 2,500 pages with new contacts = $0.02 + (2,500 × $0.0025) = $6.27.

Checkpoint/resume: If the actor stops (spending limit, timeout, manual stop), you can restart – it will resume from the last saved state and only charge for new pages with new contacts that haven't been processed before.


🛠 How to Use on Apify

  1. Create a task with this actor.
  2. Provide one or more start URLs – these are the company websites you want to find recruiters for.
  3. Adjust maxPagesPerDomain – 30–50 is usually enough to find all contact pages.
  4. Enable residential proxies – strongly recommended to avoid being blocked.
  5. Run – the actor crawls each domain, prioritising contact/team pages, extracts contacts, and pushes results.
  6. Export – download the dataset as JSON, CSV, or Excel.

Running via API

curl -X POST "https://api.apify.com/v2/acts/your-username~recruiter-lead-finder/runs" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-d '{
"startUrls": [{"url": "https://stripe.com"}],
"maxPagesPerDomain": 30,
"proxyConfiguration": {"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]}
}'

🎯 Use Cases

Use CaseHow It Helps
Recruitment agenciesBuild lists of in‑house recruiters and TA professionals to pitch your staffing services.
Sales teams (HR tech)Find decision‑makers for HR software, payroll, benefits, or talent platforms.
B2B lead generationEnrich CRM with verified HR contact emails and LinkedIn profiles.
Sourcing automationAutomatically discover the right people to contact for candidate placements.
Market researchAnalyse how companies structure their TA teams (titles, contact methods).

❓ Frequently Asked Questions

1. What exactly counts as a “successful page” for pricing?
A page is considered successful if:

  • HTTP status is 200 (or successful redirect).
  • The page is parsed and at least one new, unique contact (email, LinkedIn, phone, Twitter) is discovered that has not been seen in any previous page of the same run (across all start URLs).

2. Can I run the actor without a proxy?
For a small number of domains (1–5) with few pages, datacenter IPs may work. For larger runs, residential proxies are strongly recommended to avoid rate limiting (429) and blocking (403).

3. What does the “skipLocalizedPages” option do?
It prevents crawling URLs like https://example.com/de/, https://example.com/fr/, etc., which often contain the same content in different languages. This avoids duplicate processing and saves cost.

4. How does the actor detect HR‑related job titles?
It scans the page text for patterns like:

  • Recruiter
  • Talent Acquisition (Specialist/Manager/Partner/Lead)
  • HR Manager / Human Resources (Manager/Director/Business Partner)
  • People Partner / People Operations
  • Sourcer
  • Hiring Manager
  • Talent Partner

5. Are phone numbers validated?
Yes. The actor rejects numbers that look like:

  • Copyright year ranges (2020-2024)
  • Bare 4‑digit years
  • Date formats (12/12/2024)
  • All‑digit strings without separators (e.g., 12345678 – too short/long or no formatting) Only well‑formatted numbers with 7–15 digits and appropriate punctuation are kept.

6. What does “checkpoint/resume” mean?
The actor saves its state (which URLs have been processed, which contacts have been seen) to the key‑value store. If the run stops (e.g., spending limit reached), you can restart the same actor with the same input – it will continue from where it left off without re‑charging for already processed pages.

7. Can I customise the target keywords?
Yes. Provide a targetKeywords array in the input. Only pages containing at least one of these keywords will be prioritised (though all pages are scanned for contacts).

8. Why am I charged only for pages with new contacts?
To avoid paying for duplicate results (e.g., the same email appears on multiple pages). Once a contact is discovered, any subsequent page that contains it will not trigger a charge.

9. What happens if the actor encounters a page that requires login?
The actor will likely receive a 403 or 302 redirect to a login page, which will be classified as a failure – you will not be charged.

10. Can I use this actor to scrape individual LinkedIn profiles?
It extracts LinkedIn profile URLs (e.g., linkedin.com/in/username), but does not scrape those profiles themselves. For full LinkedIn profile scraping, see our link.



🔍 SEO Keywords

recruiter lead finder, HR contact extractor, talent acquisition email scraper, find recruiter emails, HR manager LinkedIn finder, people ops contact scraper, sourcer email extractor, talent partner discovery, B2B lead generation HR, recruitment automation, Apify HR scraper, company recruiter finder




Start finding recruiter contacts today – $0.02 per run + $2.50 per 1,000 pages with new contacts. Smart crawling, global dedup, proxy ready.