Recruiter Lead Finder – HR & Talent Acquisition Contacts
Pricing
from $2.50 / 1,000 recruiter lead finders
Crawl company websites to find recruiters, HR managers, talent acquisition leads. Extract emails, LinkedIn, phones, Twitter. Smart prioritization, dedup across domains. Pay only for pages with new contacts ($2.50/1K)
Developer
Scrape Pilot
11 days ago
Last modified
🎯 Recruiter Lead Finder – Extract HR & Talent Acquisition Contacts (Emails, LinkedIn, Phone)
Crawl company websites to find recruiters, HR managers, talent acquisition specialists, and people ops contacts. Extract email addresses, LinkedIn profiles, phone numbers, Twitter handles – automatically deduplicated across domains.
Perfect for recruitment agencies, sales teams, B2B lead generation, and talent sourcing. Smart crawling prioritises contact pages, team directories, and “people” sections. Includes Playwright fallback for JavaScript‑heavy sites and checkpoint/resume.
💡 What is the Recruiter Lead Finder?
The Recruiter Lead Finder is an intelligent Apify actor that automatically discovers HR and talent acquisition contacts from any company website. It:
- Crawls a domain up to a configurable page limit (`maxPagesPerDomain`, default 40) while staying on the same site
- Prioritises pages likely to contain contact info: `/contact`, `/team`, `/about`, `/people`, `/leadership`, `/careers`, `/recruit`, `/talent`
- Extracts email addresses (validated, de‑obfuscated, filtered from file extensions and year ranges)
- Extracts LinkedIn profiles (individual `/in/` and company `/company/` URLs)
- Extracts phone numbers (validated, rejecting dates and copyright years)
- Extracts Twitter/X handles (skipping share/intent links)
- Detects job titles matching HR/TA/People roles (Recruiter, Talent Acquisition, HR Manager, People Partner, Sourcer, etc.)
- Matches page content against target keywords (customisable)
- Outputs two types of items:
  - Page‑scan items – one per crawled page, including all contacts found on that page.
  - Contact items – one per unique contact (email, LinkedIn, phone, Twitter), deduplicated globally across all start URLs in the same run.
The actor uses curl_cffi with Chrome impersonation for fast HTTP requests and falls back to Playwright (headless Chromium) for JavaScript‑rendered pages. It respects maxPagesPerDomain, supports checkpoint/resume, and automatically skips locale‑specific pages (e.g., /de/, /fr/) to avoid duplicate content.
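The prioritisation step can be pictured as a scoring pass over the links discovered on a site. The sketch below is an assumption about how such ordering might work, not the actor's actual code; `PRIORITY_SEGMENTS`, `priority_score`, and `order_links` are hypothetical names introduced for illustration.

```python
from urllib.parse import urlparse

# Path fragments this README lists as likely to hold contact info.
PRIORITY_SEGMENTS = (
    "contact", "team", "about", "people",
    "leadership", "careers", "recruit", "talent",
)


def priority_score(url: str) -> int:
    """Return 1 if the URL path contains a priority fragment, else 0."""
    path = urlparse(url).path.lower()
    return int(any(segment in path for segment in PRIORITY_SEGMENTS))


def order_links(links: list[str], base_domain: str, max_pages: int = 40) -> list[str]:
    """Keep same-site links, put priority pages first, cap at max_pages."""
    same_site = [
        link for link in links
        if urlparse(link).netloc.lower().removeprefix("www.") == base_domain
    ]
    # sorted() is stable, so links with equal scores keep discovery order.
    ranked = sorted(same_site, key=priority_score, reverse=True)
    return ranked[:max_pages]
```

With a cap of 40 pages, ordering like this means a contact or team page is fetched before the budget is spent on blog or pricing pages.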
Pricing: Pay only for successfully crawled pages that contain new contacts (deduplicated). $2.50 per 1,000 contact‑bearing pages + $0.02 actor start. Failed pages cost nothing.
🚀 Key Features
| Feature | Description |
|---|---|
| Smart prioritisation | Crawls contact, team, about, people, leadership, careers, recruit, talent pages first. |
| HR/TA keyword matching | Detects pages mentioning recruiter, talent acquisition, HR, people ops, sourcing, etc. |
| Contact extraction | Emails (mailto + obfuscated), LinkedIn profiles (in/company), phone numbers, Twitter handles. |
| Job title detection | Extracts specific HR roles (Recruiter, Talent Acquisition Specialist, HR Manager, People Partner, etc.). |
| Global deduplication | Across multiple start URLs – the same contact appears only once (saves money). |
| Dual fetch engine | Fast HTTP (curl_cffi) + Playwright fallback for JS‑heavy sites. |
| Checkpoint & resume | Saves progress after each page; resumes from last state if interrupted. |
| Skip locale pages | Ignores /de/, /fr/, /zh-Hans/ variants to avoid crawling the same content multiple times. |
| Residential proxy ready | Bypass IP blocks (recommended for large crawls). |
| Pay‑per‑result pricing | $0.02 actor start + $2.50 per 1,000 pages that yield new contacts. Failed pages are free. |
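To make the email row above more concrete, here is a minimal sketch of de‑obfuscation and file‑extension filtering. It is an illustration under stated assumptions, not the actor's implementation: the regular expressions, the `ASSET_SUFFIXES` list, and the function names are all hypothetical.

```python
import re

EMAIL_RE = re.compile(
    r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)*\.[A-Za-z]{2,}"
)
# Suffixes that mark an "@" string as an asset name, not an address (assumed list).
ASSET_SUFFIXES = (".png", ".jpg", ".jpeg", ".gif", ".svg", ".webp", ".css", ".js")


def deobfuscate(text: str) -> str:
    """Undo common ' [at] ' and ' (dot) ' style obfuscation."""
    text = re.sub(r"\s*[\[\(]\s*at\s*[\]\)]\s*", "@", text, flags=re.IGNORECASE)
    text = re.sub(r"\s*[\[\(]\s*dot\s*[\]\)]\s*", ".", text, flags=re.IGNORECASE)
    return text


def extract_emails(text: str) -> list[str]:
    """Return unique, lower-cased emails in order of first appearance."""
    found: list[str] = []
    for match in EMAIL_RE.findall(deobfuscate(text)):
        email = match.lower()
        if email.endswith(ASSET_SUFFIXES):
            continue  # e.g. "logo@2x.png" is an image file name
        if email not in found:
            found.append(email)
    return found
```

Lower‑casing before comparison is a design choice of this sketch; it keeps `HR@Example.com` and `hr@example.com` from being counted as two contacts.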
📥 Input Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| startUrls | array of objects | Yes | – | List of starting URLs, e.g., [{"url": "https://stripe.com"}]. |
| targetKeywords | array of strings | No | Built‑in list (recruiter, talent, hr, human resources, hiring, talent acquisition, people, people team, sourcer, recruiting, talent partner, people ops, hr manager) | Custom keywords to prioritise pages. |
| maxPagesPerDomain | integer | No | 40 | Maximum pages to crawl per domain. |
| maxConcurrency | integer | No | 20 | Number of concurrent HTTP requests. |
| maxContactsPerPage | integer | No | 100 | Limit extracted contacts per page. |
| usePlaywrightFallback | boolean | No | true | Use headless browser for pages that fail HTTP or have little content. |
| skipLocalizedPages | boolean | No | true | Skip URLs containing locale prefixes (/de/, /fr/). |
| checkpointKey | string | No | "" | Custom checkpoint key to isolate state for different runs. |
| proxyConfiguration | object | No | – | Apify proxy configuration. Residential strongly recommended. |
Example Input
    {
      "startUrls": [
        {"url": "https://stripe.com"},
        {"url": "https://shopify.com"}
      ],
      "maxPagesPerDomain": 30,
      "maxConcurrency": 15,
      "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": ["RESIDENTIAL"]
      }
    }
📤 Output Fields
The actor pushes two types of items to the dataset:
1. Page‑scan item (record_type: "page_scan")
| Field | Type | Description |
|---|---|---|
| record_type | string | "page_scan" |
| company_domain | string | Domain of the crawled website. |
| source_url | string | URL of the crawled page. |
| page_title | string | Extracted page title. |
| meta_description | string | Meta description (if any). |
| matched_keywords | array | Which target keywords were found on this page. |
| job_titles | array | Extracted HR/TA job titles from the page. |
| emails | array | Email addresses found. |
| linkedin_profiles | array | LinkedIn profile URLs. |
| phone_numbers | array | Phone numbers. |
| twitter_handles | array | Twitter/X profile URLs. |
| new_contacts_found | integer | Number of new contacts discovered on this page (globally deduplicated). |
| page_preview | string | First 800 characters of cleaned page text. |
| status_code | integer | HTTP status code (if any). |
| error | string | Error message if failed. |
2. Contact item (record_type: "contact")
| Field | Type | Description |
|---|---|---|
| record_type | string | "contact" |
| contact_type | string | email, linkedin_profile, phone, twitter_handle. |
| contact_value | string | The actual contact (email, URL, phone number, Twitter handle). |
| job_titles_on_page | array | HR/TA job titles extracted from the page where the contact was found. |
| company_domain | string | Company domain. |
| source_url | string | URL where the contact was discovered. |
| page_title | string | Title of that page. |
| matched_keywords | array | Keywords matched on the page. |
Example Output (Page‑scan)
    {
      "record_type": "page_scan",
      "company_domain": "stripe.com",
      "source_url": "https://stripe.com/jobs",
      "page_title": "Jobs at Stripe",
      "meta_description": "Join Stripe and help build the economic infrastructure of the internet.",
      "matched_keywords": ["recruiter", "talent", "hiring", "careers"],
      "job_titles": ["Recruiter", "Talent Acquisition Partner"],
      "emails": ["talent@stripe.com"],
      "linkedin_profiles": ["https://linkedin.com/company/stripe"],
      "phone_numbers": [],
      "twitter_handles": [],
      "new_contacts_found": 2,
      "page_preview": "We are hiring...",
      "status_code": 200,
      "error": ""
    }
Example Output (Contact)
    {
      "record_type": "contact",
      "contact_type": "email",
      "contact_value": "talent@stripe.com",
      "job_titles_on_page": ["Recruiter", "Talent Acquisition Partner"],
      "company_domain": "stripe.com",
      "source_url": "https://stripe.com/jobs",
      "page_title": "Jobs at Stripe",
      "matched_keywords": ["recruiter", "talent", "hiring", "careers"]
    }
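Because both record types land in the same dataset, downstream code usually needs to filter on `record_type` first. The following is a small sketch of one way to do that on an exported JSON list; `group_contacts` is a hypothetical helper, and only the field names come from the tables above.

```python
from collections import defaultdict


def group_contacts(items: list[dict]) -> dict[str, dict[str, list[str]]]:
    """Group contact items as {company_domain: {contact_type: [values]}}."""
    grouped: dict[str, dict[str, list[str]]] = defaultdict(lambda: defaultdict(list))
    for item in items:
        if item.get("record_type") != "contact":
            continue  # skip page_scan items
        domain = item["company_domain"]
        grouped[domain][item["contact_type"]].append(item["contact_value"])
    # Convert nested defaultdicts to plain dicts for easy serialisation.
    return {domain: dict(by_type) for domain, by_type in grouped.items()}
```

The result maps each company domain to its emails, LinkedIn URLs, phone numbers, and Twitter handles, which is a convenient shape for a CRM import.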
💰 Pricing
| Component | Price |
|---|---|
| Actor start (per run) | $0.02 |
| Per successful page that yields new contacts | $0.0025 |
| Per 1,000 such pages | $2.50 |
- You are charged only for pages that are successfully fetched, parsed, and contain at least one new contact (globally deduplicated across all start URLs in the same run).
- Pages that return errors, have no contacts, or contain only already‑seen contacts are free.
- Example: 500 pages with new contacts = $0.02 + (500 × $0.0025) = $1.27.
- Example: 2,500 pages with new contacts = $0.02 + (2,500 × $0.0025) = $6.27.
Checkpoint/resume: If the actor stops (spending limit, timeout, manual stop), you can restart – it will resume from the last saved state and only charge for new pages with new contacts that haven't been processed before.
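The pricing arithmetic above can be reproduced with a short helper for budgeting runs. This is a sketch based only on the two prices in the table; `estimate_cost` and `pages_for_budget` are hypothetical names, and `Decimal` is used to avoid floating‑point rounding in currency sums.

```python
from decimal import Decimal

ACTOR_START_FEE = Decimal("0.02")            # charged once per run
PRICE_PER_BILLABLE_PAGE = Decimal("0.0025")  # $2.50 per 1,000 pages


def estimate_cost(billable_pages: int) -> Decimal:
    """Cost of one run that yields `billable_pages` pages with new contacts."""
    return ACTOR_START_FEE + PRICE_PER_BILLABLE_PAGE * billable_pages


def pages_for_budget(budget: Decimal) -> int:
    """Largest number of billable pages a single run can afford."""
    remaining = budget - ACTOR_START_FEE
    if remaining <= 0:
        return 0
    return int(remaining // PRICE_PER_BILLABLE_PAGE)
```

For example, `estimate_cost(500)` gives 1.27 and `estimate_cost(2500)` gives 6.27, matching the worked examples in the pricing list.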
🛠 How to Use on Apify
- Create a task with this actor.
- Provide one or more start URLs – these are the company websites you want to find recruiters for.
- Adjust `maxPagesPerDomain` – 30–50 is usually enough to find all contact pages.
- Enable residential proxies – strongly recommended to avoid being blocked.
- Run – the actor crawls each domain, prioritising contact/team pages, extracts contacts, and pushes results.
- Export – download the dataset as JSON, CSV, or Excel.
Running via API
    curl -X POST "https://api.apify.com/v2/acts/your-username~recruiter-lead-finder/runs" \
      -H "Content-Type: application/json" \
      -H "Authorization: Bearer YOUR_API_TOKEN" \
      -d '{"startUrls": [{"url": "https://stripe.com"}], "maxPagesPerDomain": 30, "proxyConfiguration": {"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]}}'
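The same request can be built from Python using only the standard library. The sketch below mirrors the curl call above and constructs the request without sending it; `build_run_request` is a hypothetical helper, and `your-username` and `YOUR_API_TOKEN` remain placeholders to replace with real values.

```python
import json
import urllib.request

API_URL = "https://api.apify.com/v2/acts/your-username~recruiter-lead-finder/runs"


def build_run_request(token: str, run_input: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request that starts a run."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


if __name__ == "__main__":
    request = build_run_request(
        "YOUR_API_TOKEN",
        {
            "startUrls": [{"url": "https://stripe.com"}],
            "maxPagesPerDomain": 30,
            "proxyConfiguration": {
                "useApifyProxy": True,
                "apifyProxyGroups": ["RESIDENTIAL"],
            },
        },
    )
    print(request.get_method(), request.full_url)
    # To actually start a run (needs a real token and actor ID):
    # with urllib.request.urlopen(request) as response:
    #     print(response.read().decode("utf-8"))
```

Keeping request construction separate from sending makes the payload easy to inspect or log before any credits are spent.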
🎯 Use Cases
| Use Case | How It Helps |
|---|---|
| Recruitment agencies | Build lists of in‑house recruiters and TA professionals to pitch your staffing services. |
| Sales teams (HR tech) | Find decision‑makers for HR software, payroll, benefits, or talent platforms. |
| B2B lead generation | Enrich CRM with verified HR contact emails and LinkedIn profiles. |
| Sourcing automation | Automatically discover the right people to contact for candidate placements. |
| Market research | Analyse how companies structure their TA teams (titles, contact methods). |
❓ Frequently Asked Questions
1. What exactly counts as a “successful page” for pricing?
A page is considered successful if:
- HTTP status is 200 (or successful redirect).
- The page is parsed and at least one new, unique contact (email, LinkedIn, phone, Twitter) is discovered that has not been seen in any previous page of the same run (across all start URLs).
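The two conditions above can be sketched as a run‑wide registry of seen contacts plus a billing check. This is an illustration of the rule as described here, not the actor's code: `ContactRegistry` and `is_billable` are hypothetical names, lower‑casing contact values before comparison is an assumption, and redirects are left out for simplicity.

```python
class ContactRegistry:
    """Tracks contacts seen so far in a run, across all start URLs."""

    def __init__(self) -> None:
        self.seen: set[tuple[str, str]] = set()

    def register_page(self, contacts: dict[str, list[str]]) -> int:
        """Record a page's contacts and return how many were new."""
        new_count = 0
        for contact_type, values in contacts.items():
            for value in values:
                # Assumed normalisation: trim and lower-case before comparing.
                key = (contact_type, value.strip().lower())
                if key not in self.seen:
                    self.seen.add(key)
                    new_count += 1
        return new_count


def is_billable(status_code: int, new_contacts_found: int) -> bool:
    """Billable only if the page was fetched OK and yielded a new contact."""
    return status_code == 200 and new_contacts_found > 0
```

A second page that repeats `talent@stripe.com` would register zero new contacts and therefore not be billable under this rule.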
2. Can I run the actor without a proxy?
For a small number of domains (1–5) with few pages, datacenter IPs may work. For larger runs, residential proxies are strongly recommended to avoid rate limiting (429) and blocking (403).
3. What does the “skipLocalizedPages” option do?
It prevents crawling URLs like https://example.com/de/, https://example.com/fr/, etc., which often contain the same content in different languages. This avoids duplicate processing and saves cost.
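One plausible way to detect such URLs is to test whether the first path segment is shaped like a locale code. The sketch below is an assumption about the approach, with a hypothetical `is_localized` helper and pattern; the actor's real rules may differ.

```python
import re
from urllib.parse import urlparse

# First path segment shaped like a locale: de, fr, pt-BR, zh-Hans.
LOCALE_SEGMENT_RE = re.compile(r"^[a-z]{2}(?:-[A-Za-z]{2,4})?$")


def is_localized(url: str) -> bool:
    """Return True if the first path segment looks like a locale prefix."""
    segments = [s for s in urlparse(url).path.split("/") if s]
    return bool(segments) and LOCALE_SEGMENT_RE.match(segments[0]) is not None
```

A purely shape‑based check like this would also skip a two‑letter section such as `/hr/`, so a production filter would likely need an explicit list of locale codes or exceptions.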
4. How does the actor detect HR‑related job titles?
It scans the page text for patterns like:
- Recruiter
- Talent Acquisition (Specialist/Manager/Partner/Lead)
- HR Manager / Human Resources (Manager/Director/Business Partner)
- People Partner / People Operations
- Sourcer
- Hiring Manager
- Talent Partner
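A pattern list like the one above maps naturally onto a single case‑insensitive regular expression. The sketch below covers only the titles listed in this answer; `TITLE_RE` and `find_job_titles` are hypothetical, and the actor's actual patterns are not published here.

```python
import re

TITLE_RE = re.compile(
    r"\b("
    r"talent acquisition(?: (?:specialist|manager|partner|lead))?"
    r"|human resources(?: (?:manager|director|business partner))?"
    r"|hr manager"
    r"|people (?:partner|operations)"
    r"|hiring manager"
    r"|talent partner"
    r"|recruiter"
    r"|sourcer"
    r")\b",
    re.IGNORECASE,
)


def find_job_titles(text: str) -> list[str]:
    """Return unique matched titles as written, in order of appearance."""
    normalised = " ".join(text.split())  # collapse runs of whitespace
    titles: list[str] = []
    seen: set[str] = set()
    for match in TITLE_RE.finditer(normalised):
        title = match.group(1)
        if title.lower() not in seen:
            seen.add(title.lower())
            titles.append(title)
    return titles
```

Longer alternatives are listed before shorter ones so that "Talent Acquisition Partner" is captured whole rather than as a bare "Talent Acquisition".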
5. Are phone numbers validated?
Yes. The actor rejects numbers that look like:
- Copyright year ranges (`2020-2024`)
- Bare 4‑digit years
- Date formats (`12/12/2024`)
- All‑digit strings without separators (e.g., `12345678` – too short/long or no formatting)

Only well‑formatted numbers with 7–15 digits and appropriate punctuation are kept.
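The rejection rules above can be expressed as a small validator. This is a sketch under assumptions, not the actor's code: the patterns and the `is_valid_phone` name are hypothetical, and a leading `+` is treated here as sufficient formatting.

```python
import re

ALLOWED_RE = re.compile(r"^\+?[\d\s().-]+$")
YEAR_RANGE_RE = re.compile(r"^(?:19|20)\d{2}\s*-\s*(?:19|20)\d{2}$")
DATE_RE = re.compile(r"^\d{1,2}[/.-]\d{1,2}[/.-]\d{2,4}$")


def is_valid_phone(candidate: str) -> bool:
    """Apply the rejection rules listed above to one candidate string."""
    candidate = candidate.strip()
    if not ALLOWED_RE.match(candidate):
        return False  # stray characters, e.g. the "/" in 12/12/2024
    digits = re.sub(r"\D", "", candidate)
    if not 7 <= len(digits) <= 15:
        return False  # also rejects bare 4-digit years
    if candidate.isdigit():
        return False  # all digits, no separators or "+"
    if YEAR_RANGE_RE.match(candidate) or DATE_RE.match(candidate):
        return False  # 2020-2024 or 12-12-2024
    return True
```

Ordering the cheap checks first (allowed characters, digit count) means the year and date patterns only run on strings that already look phone‑like.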
6. What does “checkpoint/resume” mean?
The actor saves its state (which URLs have been processed, which contacts have been seen) to the key‑value store. If the run stops (e.g., spending limit reached), you can restart the same actor with the same input – it will continue from where it left off without re‑charging for already processed pages.
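As a rough illustration of what such saved state might look like, the sketch below persists processed URLs and seen contacts as JSON. It uses a local file as a stand‑in for the key‑value store, and the function names and state layout are assumptions rather than the actor's real format.

```python
import json
from pathlib import Path


def save_checkpoint(path: Path, processed_urls: set[str], seen_contacts: set[str]) -> None:
    """Write crawl state atomically: write a temp file, then rename it."""
    state = {
        "processed_urls": sorted(processed_urls),
        "seen_contacts": sorted(seen_contacts),
    }
    tmp_path = path.with_suffix(".tmp")
    tmp_path.write_text(json.dumps(state), encoding="utf-8")
    tmp_path.replace(path)


def load_checkpoint(path: Path) -> tuple[set[str], set[str]]:
    """Return saved state, or empty sets when no checkpoint exists."""
    if not path.exists():
        return set(), set()
    state = json.loads(path.read_text(encoding="utf-8"))
    return set(state["processed_urls"]), set(state["seen_contacts"])
```

On restart, URLs already in `processed_urls` would be skipped and contacts in `seen_contacts` would not count as new, which is what keeps a resumed run from charging twice.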
7. Can I customise the target keywords?
Yes. Provide a `targetKeywords` array in the input. Pages containing at least one of these keywords are prioritised, but every crawled page is still scanned for contacts.
8. Why am I charged only for pages with new contacts?
To avoid paying for duplicate results (e.g., the same email appears on multiple pages). Once a contact is discovered, any subsequent page that contains it will not trigger a charge.
9. What happens if the actor encounters a page that requires login?
The actor will likely receive a 403 or 302 redirect to a login page, which will be classified as a failure – you will not be charged.
10. Can I use this actor to scrape individual LinkedIn profiles?
It extracts LinkedIn profile URLs (e.g., linkedin.com/in/username), but does not scrape those profiles themselves. For full LinkedIn profile scraping, see the LinkedIn Profile Scraper listed under Related Actors.
🔍 SEO Keywords
recruiter lead finder, HR contact extractor, talent acquisition email scraper, find recruiter emails, HR manager LinkedIn finder, people ops contact scraper, sourcer email extractor, talent partner discovery, B2B lead generation HR, recruitment automation, Apify HR scraper, company recruiter finder
🔗 Related Actors
- Company Info Scraper – Contacts, Tech Stack, Social Profiles – More detailed company profiling.
- LinkedIn Profile Scraper – Extract full public LinkedIn profiles.
- Social Profile Extractor – Find Facebook, LinkedIn, Twitter – Discover social media links.
Start finding recruiter contacts today – $0.02 per run + $2.50 per 1,000 pages with new contacts. Smart crawling, global dedup, proxy ready.