LinkedIn Company Associated Members Scraper
Under maintenancePricing
from $3.99 / 1,000 results
LinkedIn Company Associated Members Scraper
Under maintenanceScrape LinkedIn company pages for aggregated people insights — top employers, locations, schools, service categories — plus affiliated pages and associated member profiles. Bulk-friendly with smart proxy fallback.
Pricing
from $3.99 / 1,000 results
Rating
0.0
(0)
Developer
ScraperX
Maintained by CommunityActor stats
0
Bookmarked
1
Total users
0
Monthly active users
12 days ago
Last modified
Categories
Share
🏢 LinkedIn Company Associated Members Scraper
Bulk-scrape LinkedIn company pages for aggregated people insights, affiliated pages, and associated member profiles — at scale, in real time.
Provide a list of LinkedIn company URLs (e.g. https://www.linkedin.com/company/google) and the actor returns:
- 📊 Aggregated people facets — top employers, locations, schools, service categories, network distance breakdown
- 🏷️ Affiliated pages — showcase pages, subsidiaries, acquired companies
- 👥 Associated member profiles — name, headline, location, LinkedIn URL
Every record streams to the dataset the moment it finishes scraping, so partial results are always available even if the run is interrupted.
✨ Why Choose This Actor?
- ⚡ Bulk lookup — feed hundreds of company URLs in one run
- 🌐 Smart proxy ladder — starts direct, auto-escalates 🟢 Direct → 🟡 Datacenter → 🔴 Residential only when blocked
- 🛡️ Browser-grade TLS impersonation —
curl_cffiwith Chrome 131 fingerprint sneaks past basic anti-bot defenses without spawning a headless browser - 📡 Real-time table output — each company is a row, every section a column
- 🍪 Cookie-based auth — bring your own
li_at, no scraping farms - 🔄 Self-healing queryIds — discovers LinkedIn's rotating GraphQL hashes from the live page; baked-in fallbacks if discovery is blocked
🔑 Key Features
| Feature | Description |
|---|---|
| 🔗 Bulk URL input | Paste any number of company URLs — one per line |
| 🌐 Auto proxy fallback | 🟢 Direct → 🟡 Datacenter → 🔴 Residential (3 retries) |
| 📡 Real-time output | Each company is pushed live to the dataset |
| 🧮 Per-section columns | Locations, Schools, Service Categories, Affiliated Pages, Profiles — each as its own table column |
| 🍪 Session-cookie auth | Bring your own li_at cookie |
| ⏱️ Configurable delay | Random jitter on top of your throttle |
| 🛟 Graceful errors | A failed company doesn't kill the run — it lands in the dataset with an error field |
📥 Input
{"urls": ["https://www.linkedin.com/company/google","https://www.linkedin.com/company/amazon"],"liAt": "AQEDAU...your_cookie...","getAffiliatedPages": true,"getAssociatedMembersProfiles": true,"maxProfiles": 10,"requestDelaySeconds": 1.5,"proxyConfiguration": { "useApifyProxy": false }}
| Field | Type | Description |
|---|---|---|
urls | array<string> | Required. LinkedIn company / showcase / school URLs. |
liAt | string | Required. Your li_at session cookie from a logged-in LinkedIn account. |
getAffiliatedPages | boolean | Include affiliated / showcase / subsidiary pages. Default true. |
getAssociatedMembersProfiles | boolean | Include associated profile listings. Default true. |
maxProfiles | integer | Max profiles per company. Default 10. |
requestDelaySeconds | number | Delay between LinkedIn requests (seconds). Default 1.5. |
proxyConfiguration | object | Override the proxy ladder. Default: no proxy — actor escalates automatically. |
🍪 How to grab your li_at cookie
- Log into linkedin.com.
- Open DevTools → Application (Chrome) or Storage (Firefox).
- Cookies →
https://www.linkedin.com→ copy the value ofli_at. - Paste it into the actor input. Treat it like a password — it gives access to your account.
📤 Output
Each dataset row is one company. Example (abridged):
{"company_name": "Google","url": "https://www.linkedin.com/company/google","people_url": "https://www.linkedin.com/company/google/people/","Current company": [{ "name": "Google", "count": 11873 },{ "name": "Microsoft", "count": 390 }],"Locations": [{ "name": "India", "count": 5122 },{ "name": "United States", "count": 3623 }],"Connections": [{ "name": "1st", "count": 0 },{ "name": "2nd", "count": 0 },{ "name": "3rd+", "count": 0 }],"School": [{ "name": "Stanford University", "count": 224 }],"Current Function": [],"Skill Explicit": [],"Service categories": [{ "name": "Coaching & Mentoring", "count": 348 }],"Field of Study": [],"affiliated_pages": [{"category": "Information Technology & Services","url": "https://www.linkedin.com/showcase/androidenterprise/","company": "Android Enterprise"}],"profiles": [{"profile_fullname": "Sundar Pichai","profile_headline": "CEO at Google","profile_location": "Mountain View, CA","profile_linkedin": "https://www.linkedin.com/in/sundarpichai"}]}
| Field | Description |
|---|---|
🏢 company_name | Resolved display name |
🔗 url | Original company URL |
👥 people_url | Direct link to the People tab |
🏭 Current company | Top current employers of associated members |
📍 Locations | Top member locations |
🔁 Connections | 1st / 2nd / 3rd+ degree breakdown |
🎓 School | Top schools of associated members |
💼 Current Function | Top job functions (LinkedIn may not expose this) |
🛠️ Skill Explicit | Top declared skills (LinkedIn may not expose this) |
🧩 Service categories | Service categories the members offer |
📚 Field of Study | Top fields of study (LinkedIn may not expose this) |
🏷️ affiliated_pages | Showcase / subsidiary / acquired pages |
👤 profiles | Associated member profile previews |
⚠️ error | Present only when a company failed to scrape |
Note — LinkedIn no longer exposes
Current Function,Skill Explicit, orField of Studythrough their public people-search facets. Those columns are emitted as empty arrays for forward compatibility.
🚀 How to Use (Apify Console)
- Log in at console.apify.com → Actors.
- Open LinkedIn Company Associated Members Scraper.
- Paste your company URLs in the 🔗 Company URLs field.
- Paste your 🍪
li_atcookie in the cookie field. - (Optional) Tune profiles, delay, proxy.
- Click ▶ Start.
- Watch the logs — each company is logged with proxy tier, facet samples, and a save confirmation.
- Open the Output tab to see the table with per-section columns.
🤖 Use via API
Run synchronously and get results in one call:
curl -X POST \"https://api.apify.com/v2/acts/<ACTOR_ID>/run-sync-get-dataset-items?token=$APIFY_TOKEN" \-H "Content-Type: application/json" \-d '{"urls": ["https://www.linkedin.com/company/google","https://www.linkedin.com/company/amazon"],"liAt": "AQEDAU...","getAffiliatedPages": true,"getAssociatedMembersProfiles": true,"maxProfiles": 10}'
Or asynchronously (returns a run ID; poll for status):
curl -X POST \"https://api.apify.com/v2/acts/<ACTOR_ID>/runs?token=$APIFY_TOKEN" \-H "Content-Type: application/json" \-d '{"urls":["https://www.linkedin.com/company/openai"],"liAt":"AQEDAU..."}'
🌐 Proxy Strategy
The actor uses a sticky tiered fallback:
1. 🟢 Direct (no proxy) ← start here2. 🟡 Apify Datacenter Proxy ← on first block3. 🔴 Apify Residential Proxy ← if datacenter blocked too (3 retries)
Once the actor escalates to a higher tier, it stays there for the rest of the run — no flapping back down. Each escalation is logged with the exact block reason (HTTP status, network error, etc.) so you can audit what happened.
🎯 Best Use Cases
- 🧑💼 Sales & lead enrichment — qualify a list of target accounts by their team makeup
- 📊 Competitive intel — see where competitors hire from and which schools dominate
- 🤝 Recruiting & sourcing — discover talent pools at target employers
- 🔬 Market research — affiliated-page graphs reveal corporate structure at a glance
💰 Pricing (Pay-Per-Event)
| Event | What it covers |
|---|---|
apify-actor-start | One-time charge per run (covers startup overhead) |
company-result | Charged once per company successfully scraped |
Failed companies still appear in the dataset (with an error field) for transparency, but they do not bill the company-result event.
❓ Frequently Asked Questions
Q: Why do I need to provide a li_at cookie?
LinkedIn's Voyager API requires an authenticated session. The cookie is the simplest way to authorize the actor as you. We never persist or log it.
Q: Is the cookie safe to share? It is masked in the input UI and sent over HTTPS only. Still, treat it like a password. Rotate it from LinkedIn's session-management page if you suspect leakage.
Q: Will I get rate-limited?
Possible if you push the delay too low or scrape many companies in one run. The actor handles 429/503 gracefully — it backs off, then escalates to a paid proxy tier. Default requestDelaySeconds=1.5 is a safe starting point.
Q: Why are some facet columns empty?
LinkedIn no longer exposes Current Function, Skill Explicit, and Field of Study through their public people-search facets. We keep the columns for forward compatibility with their API.
Q: Can the actor scrape profile email addresses? No — this actor only collects public people-search results. Email scraping requires a different actor and stronger legal cover.
⚖️ Cautions / Legal
- Data is collected only from publicly available sections of LinkedIn that your account can access.
- Do not scrape private accounts or content behind closed groups / paywalls.
- The end user is responsible for legal compliance (GDPR, CCPA, anti-spam laws, LinkedIn's ToS).
- Respect rate limits — being a good citizen also reduces blocks.
💬 Support & Feedback
Issues, feature requests, or custom integrations? Reach out at dev.scraperengine@gmail.com or open a message in the Apify Console.