LinkedIn Company Associated Members Scraper
Status: Under maintenance
Pricing: from $3.99 / 1,000 results
Developer: ScrapeLabs (maintained by Community)
Bulk-scrape LinkedIn company pages for aggregated people insights, affiliated pages, and associated member profiles, at scale and in real time.
Provide a list of LinkedIn company URLs (e.g. https://www.linkedin.com/company/google) and the actor returns:
- Aggregated people facets: top employers, locations, schools, service categories, network distance breakdown
- Affiliated pages: showcase pages, subsidiaries, acquired companies
- Associated member profiles: name, headline, location, LinkedIn URL
Every record streams to the dataset the moment it finishes scraping, so partial results are always available even if the run is interrupted.
Why Choose This Actor?
- Bulk lookup: feed hundreds of company URLs in one run
- Smart proxy ladder: starts direct and auto-escalates 🟢 Direct → 🟡 Datacenter → 🔴 Residential only when blocked
- Browser-grade TLS impersonation: `curl_cffi` with a Chrome 131 fingerprint gets past basic anti-bot defenses without spawning a headless browser
- Real-time table output: each company is a row, every section a column
- Cookie-based auth: bring your own `li_at`, no scraping farms
- Self-healing queryIds: discovers LinkedIn's rotating GraphQL hashes from the live page, with baked-in fallbacks if discovery is blocked
Key Features
| Feature | Description |
|---|---|
| Bulk URL input | Paste any number of company URLs, one per line |
| Auto proxy fallback | 🟢 Direct → 🟡 Datacenter → 🔴 Residential (3 retries) |
| Real-time output | Each company is pushed live to the dataset |
| Per-section columns | Locations, Schools, Service Categories, Affiliated Pages, Profiles, each as its own table column |
| Session-cookie auth | Bring your own `li_at` cookie |
| Configurable delay | Random jitter on top of your throttle |
| Graceful errors | A failed company doesn't kill the run; it lands in the dataset with an `error` field |
Input

    {
        "urls": [
            "https://www.linkedin.com/company/google",
            "https://www.linkedin.com/company/amazon"
        ],
        "liAt": "AQEDAU...your_cookie...",
        "getAffiliatedPages": true,
        "getAssociatedMembersProfiles": true,
        "maxProfiles": 10,
        "requestDelaySeconds": 1.5,
        "proxyConfiguration": { "useApifyProxy": false }
    }

| Field | Type | Description |
|---|---|---|
| `urls` | array<string> | Required. LinkedIn company / showcase / school URLs. |
| `liAt` | string | Required. Your `li_at` session cookie from a logged-in LinkedIn account. |
| `getAffiliatedPages` | boolean | Include affiliated / showcase / subsidiary pages. Default `true`. |
| `getAssociatedMembersProfiles` | boolean | Include associated profile listings. Default `true`. |
| `maxProfiles` | integer | Max profiles per company. Default `10`. |
| `requestDelaySeconds` | number | Delay between LinkedIn requests (seconds). Default `1.5`. |
| `proxyConfiguration` | object | Override the proxy ladder. Default: no proxy; the actor escalates automatically. |
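
The input fields in the table above can be assembled and sanity-checked locally before a run. The following is a minimal sketch, not part of the actor: `build_input` is a hypothetical helper, and its URL check is an assumption about what a reasonable input looks like rather than the actor's own validation.

```python
import json
from urllib.parse import urlparse


def build_input(urls, li_at, max_profiles=10, request_delay_seconds=1.5,
                get_affiliated_pages=True, get_associated_members_profiles=True):
    """Assemble an input payload using the field names documented above."""
    if not li_at:
        raise ValueError("liAt is required")
    cleaned = []
    for url in urls:
        url = url.strip()
        host = urlparse(url).netloc
        # Local sanity check only (assumption); the actor may accept other shapes.
        if not (host == "linkedin.com" or host.endswith(".linkedin.com")):
            raise ValueError(f"not a LinkedIn URL: {url!r}")
        cleaned.append(url)
    if not cleaned:
        raise ValueError("urls must contain at least one entry")
    return {
        "urls": cleaned,
        "liAt": li_at,
        "getAffiliatedPages": get_affiliated_pages,
        "getAssociatedMembersProfiles": get_associated_members_profiles,
        "maxProfiles": max_profiles,
        "requestDelaySeconds": request_delay_seconds,
        "proxyConfiguration": {"useApifyProxy": False},
    }


payload = build_input(["https://www.linkedin.com/company/google"], "AQEDAU...")
print(json.dumps(payload, indent=2))
```

The defaults mirror the ones listed in the table, so only `urls` and `liAt` need to be supplied.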
How to grab your li_at cookie
- Log into linkedin.com.
- Open DevTools → Application (Chrome) or Storage (Firefox).
- Under Cookies → https://www.linkedin.com, copy the value of `li_at`.
- Paste it into the actor input. Treat it like a password: it gives access to your account.
Output
Each dataset row is one company. Example (abridged):

    {
        "company_name": "Google",
        "url": "https://www.linkedin.com/company/google",
        "people_url": "https://www.linkedin.com/company/google/people/",
        "Current company": [
            { "name": "Google", "count": 11873 },
            { "name": "Microsoft", "count": 390 }
        ],
        "Locations": [
            { "name": "India", "count": 5122 },
            { "name": "United States", "count": 3623 }
        ],
        "Connections": [
            { "name": "1st", "count": 0 },
            { "name": "2nd", "count": 0 },
            { "name": "3rd+", "count": 0 }
        ],
        "School": [
            { "name": "Stanford University", "count": 224 }
        ],
        "Current Function": [],
        "Skill Explicit": [],
        "Service categories": [
            { "name": "Coaching & Mentoring", "count": 348 }
        ],
        "Field of Study": [],
        "affiliated_pages": [
            {
                "category": "Information Technology & Services",
                "url": "https://www.linkedin.com/showcase/androidenterprise/",
                "company": "Android Enterprise"
            }
        ],
        "profiles": [
            {
                "profile_fullname": "Sundar Pichai",
                "profile_headline": "CEO at Google",
                "profile_location": "Mountain View, CA",
                "profile_linkedin": "https://www.linkedin.com/in/sundarpichai"
            }
        ]
    }

| Field | Description |
|---|---|
| `company_name` | Resolved display name |
| `url` | Original company URL |
| `people_url` | Direct link to the People tab |
| `Current company` | Top current employers of associated members |
| `Locations` | Top member locations |
| `Connections` | 1st / 2nd / 3rd+ degree breakdown |
| `School` | Top schools of associated members |
| `Current Function` | Top job functions (LinkedIn may not expose this) |
| `Skill Explicit` | Top declared skills (LinkedIn may not expose this) |
| `Service categories` | Service categories the members offer |
| `Field of Study` | Top fields of study (LinkedIn may not expose this) |
| `affiliated_pages` | Showcase / subsidiary / acquired pages |
| `profiles` | Associated member profile previews |
| `error` | Present only when a company failed to scrape |
Note: LinkedIn no longer exposes `Current Function`, `Skill Explicit`, or `Field of Study` through its public people-search facets. Those columns are emitted as empty arrays for forward compatibility.
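
Because each dataset row nests its profile previews under `profiles`, downstream tools often want one flat record per person. The sketch below shows one way to do that, assuming rows shaped like the abridged example above and assuming a failed company is recognizable by the presence of an `error` key. `flatten_profiles` is a hypothetical helper name.

```python
def flatten_profiles(rows):
    """Yield one flat dict per member profile, skipping rows that failed to scrape."""
    for row in rows:
        if "error" in row:
            # Failed companies carry an error field and no usable profiles.
            continue
        for profile in row.get("profiles") or []:
            yield {
                "company_name": row.get("company_name"),
                "fullname": profile.get("profile_fullname"),
                "headline": profile.get("profile_headline"),
                "location": profile.get("profile_location"),
                "linkedin": profile.get("profile_linkedin"),
            }


sample_rows = [
    {
        "company_name": "Google",
        "profiles": [
            {
                "profile_fullname": "Sundar Pichai",
                "profile_headline": "CEO at Google",
                "profile_location": "Mountain View, CA",
                "profile_linkedin": "https://www.linkedin.com/in/sundarpichai",
            }
        ],
    },
    # Illustrative failed row; the real error message format is not documented here.
    {"url": "https://www.linkedin.com/company/amazon", "error": "blocked"},
]

for record in flatten_profiles(sample_rows):
    print(record)
```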
How to Use (Apify Console)
- Log in at console.apify.com → Actors.
- Open LinkedIn Company Associated Members Scraper.
- Paste your company URLs in the Company URLs field.
- Paste your `li_at` cookie in the cookie field.
- (Optional) Tune profiles, delay, proxy.
- Click Start.
- Watch the logs: each company is logged with proxy tier, facet samples, and a save confirmation.
- Open the Output tab to see the table with per-section columns.
Use via API
Run synchronously and get results in one call:

    curl -X POST \
      "https://api.apify.com/v2/acts/<ACTOR_ID>/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{
        "urls": [
          "https://www.linkedin.com/company/google",
          "https://www.linkedin.com/company/amazon"
        ],
        "liAt": "AQEDAU...",
        "getAffiliatedPages": true,
        "getAssociatedMembersProfiles": true,
        "maxProfiles": 10
      }'

Or asynchronously (returns a run ID; poll for status):

    curl -X POST \
      "https://api.apify.com/v2/acts/<ACTOR_ID>/runs?token=$APIFY_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{"urls":["https://www.linkedin.com/company/openai"],"liAt":"AQEDAU..."}'

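
The synchronous call can also be made from Python with only the standard library. This is a sketch that mirrors the curl command above: `build_sync_request` and `run_sync` are hypothetical helper names, the 300-second timeout is an arbitrary assumption, and the actor ID and token must be supplied by you.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_sync_request(actor_id, token, payload):
    """Build the POST request for the run-sync-get-dataset-items endpoint."""
    query = urllib.parse.urlencode({"token": token})
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_sync(actor_id, token, payload, timeout=300):
    """Send the request and return the dataset items parsed from the JSON response."""
    request = build_sync_request(actor_id, token, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call such as `run_sync("<ACTOR_ID>", token, {"urls": [...], "liAt": "AQEDAU..."})` would then return the same rows the curl command prints. Building the request separately from sending it keeps the URL and body easy to inspect without touching the network.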
Proxy Strategy
The actor uses a sticky tiered fallback:
1. 🟢 Direct (no proxy): start here
2. 🟡 Apify Datacenter Proxy: on first block
3. 🔴 Apify Residential Proxy: if datacenter is blocked too (3 retries)

Once the actor escalates to a higher tier, it stays there for the rest of the run, with no flapping back down. Each escalation is logged with the exact block reason (HTTP status, network error, etc.) so you can audit what happened.
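
The sticky escalation described above can be illustrated with a short sketch. This is not the actor's code: `ProxyLadder`, `Blocked`, and the `fetch_via` callback are hypothetical names, and a single attempt on the direct and datacenter tiers is an assumption. Only the tier order and the three residential retries come from the description.

```python
TIERS = ["direct", "datacenter", "residential"]
RESIDENTIAL_RETRIES = 3


class Blocked(Exception):
    """Raised by a fetch function when a request is blocked."""


class ProxyLadder:
    def __init__(self):
        self.level = 0  # index into TIERS; only ever increases (sticky)

    def fetch(self, url, fetch_via):
        """Try the current tier, escalating on blocks and never stepping back down."""
        while True:
            tier = TIERS[self.level]
            attempts = RESIDENTIAL_RETRIES if tier == "residential" else 1
            last_error = None
            for _ in range(attempts):
                try:
                    return fetch_via(tier, url)
                except Blocked as exc:
                    last_error = exc
            if self.level == len(TIERS) - 1:
                raise last_error  # top tier exhausted; give up on this URL
            # Log the block reason before moving up one tier.
            print(f"escalating {tier} -> {TIERS[self.level + 1]}: {last_error}")
            self.level += 1


calls = []


def fake_fetch(tier, url):
    """Stand-in fetcher: everything below the residential tier is blocked."""
    calls.append(tier)
    if tier != "residential":
        raise Blocked(f"HTTP 429 via {tier}")
    return f"ok:{url}"


ladder = ProxyLadder()
first = ladder.fetch("https://www.linkedin.com/company/google", fake_fetch)
second = ladder.fetch("https://www.linkedin.com/company/amazon", fake_fetch)
```

Because `level` is stored on the ladder object and never decremented, the second request goes straight to the residential tier instead of retrying the tiers that already failed.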
Best Use Cases
- Sales & lead enrichment: qualify a list of target accounts by their team makeup
- Competitive intel: see where competitors hire from and which schools dominate
- Recruiting & sourcing: discover talent pools at target employers
- Market research: affiliated-page graphs reveal corporate structure at a glance
Pricing (Pay-Per-Event)
| Event | What it covers |
|---|---|
| `apify-actor-start` | One-time charge per run (covers startup overhead) |
| `company-result` | Charged once per company successfully scraped |

Failed companies still appear in the dataset (with an `error` field) for transparency, but they do not bill the `company-result` event.
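
To reconcile a run against the two events above, the billable events can be counted from the dataset rows. This is a rough sketch that assumes a failed company is identified by the presence of an `error` key. `count_billable_events` is a hypothetical helper, and no per-event prices are assumed.

```python
def count_billable_events(rows):
    """Count pay-per-event charges for one run from its dataset rows."""
    successful = sum(1 for row in rows if "error" not in row)
    return {
        "apify-actor-start": 1,        # charged once per run
        "company-result": successful,  # failed rows are not billed
    }


run_rows = [
    {"company_name": "Google"},
    {"company_name": "Amazon"},
    {"url": "https://www.linkedin.com/company/openai", "error": "blocked"},
]
print(count_billable_events(run_rows))
```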
Frequently Asked Questions
Q: Why do I need to provide a li_at cookie?
LinkedIn's Voyager API requires an authenticated session. The cookie is the simplest way to authorize the actor as you. We never persist or log it.
Q: Is the cookie safe to share?
It is masked in the input UI and sent over HTTPS only. Still, treat it like a password. Rotate it from LinkedIn's session-management page if you suspect leakage.
Q: Will I get rate-limited?
Possibly, if you set the delay too low or scrape many companies in one run. The actor handles 429/503 responses gracefully: it backs off, then escalates to a paid proxy tier. The default `requestDelaySeconds` of 1.5 is a safe starting point.
Q: Why are some facet columns empty?
LinkedIn no longer exposes `Current Function`, `Skill Explicit`, and `Field of Study` through its public people-search facets. We keep the columns for forward compatibility with LinkedIn's API.
Q: Can the actor scrape profile email addresses?
No. This actor only collects public people-search results. Email scraping requires a different actor and stronger legal cover.
Cautions / Legal
- Data is collected only from publicly available sections of LinkedIn that your account can access.
- Do not scrape private accounts or content behind closed groups / paywalls.
- The end user is responsible for legal compliance (GDPR, CCPA, anti-spam laws, LinkedIn's ToS).
- Respect rate limits. Being a good citizen also reduces blocks.
Support & Feedback
Issues, feature requests, or custom integrations? Reach out at dev.scraperengine@gmail.com or open a message in the Apify Console.