LinkedIn Company Associated Members Scraper avatar

LinkedIn Company Associated Members Scraper

Under maintenance

Pricing

from $3.99 / 1,000 results

Go to Apify Store
LinkedIn Company Associated Members Scraper

LinkedIn Company Associated Members Scraper

Under maintenance

Scrape LinkedIn company pages for aggregated people insights — top employers, locations, schools, service categories — plus affiliated pages and associated member profiles. Bulk-friendly with smart proxy fallback.

Pricing

from $3.99 / 1,000 results

Rating

0.0

(0)

Developer

ScraperX

ScraperX

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

12 days ago

Last modified

Share

🏢 LinkedIn Company Associated Members Scraper

Bulk-scrape LinkedIn company pages for aggregated people insights, affiliated pages, and associated member profiles — at scale, in real time.

Provide a list of LinkedIn company URLs (e.g. https://www.linkedin.com/company/google) and the actor returns:

  • 📊 Aggregated people facets — top employers, locations, schools, service categories, network distance breakdown
  • 🏷️ Affiliated pages — showcase pages, subsidiaries, acquired companies
  • 👥 Associated member profiles — name, headline, location, LinkedIn URL

Every record streams to the dataset the moment it finishes scraping, so partial results are always available even if the run is interrupted.


✨ Why Choose This Actor?

  • Bulk lookup — feed hundreds of company URLs in one run
  • 🌐 Smart proxy ladder — starts direct, auto-escalates 🟢 Direct → 🟡 Datacenter → 🔴 Residential only when blocked
  • 🛡️ Browser-grade TLS impersonationcurl_cffi with Chrome 131 fingerprint sneaks past basic anti-bot defenses without spawning a headless browser
  • 📡 Real-time table output — each company is a row, every section a column
  • 🍪 Cookie-based auth — bring your own li_at, no scraping farms
  • 🔄 Self-healing queryIds — discovers LinkedIn's rotating GraphQL hashes from the live page; baked-in fallbacks if discovery is blocked

🔑 Key Features

FeatureDescription
🔗 Bulk URL inputPaste any number of company URLs — one per line
🌐 Auto proxy fallback🟢 Direct → 🟡 Datacenter → 🔴 Residential (3 retries)
📡 Real-time outputEach company is pushed live to the dataset
🧮 Per-section columnsLocations, Schools, Service Categories, Affiliated Pages, Profiles — each as its own table column
🍪 Session-cookie authBring your own li_at cookie
⏱️ Configurable delayRandom jitter on top of your throttle
🛟 Graceful errorsA failed company doesn't kill the run — it lands in the dataset with an error field

📥 Input

{
"urls": [
"https://www.linkedin.com/company/google",
"https://www.linkedin.com/company/amazon"
],
"liAt": "AQEDAU...your_cookie...",
"getAffiliatedPages": true,
"getAssociatedMembersProfiles": true,
"maxProfiles": 10,
"requestDelaySeconds": 1.5,
"proxyConfiguration": { "useApifyProxy": false }
}
FieldTypeDescription
urlsarray<string>Required. LinkedIn company / showcase / school URLs.
liAtstringRequired. Your li_at session cookie from a logged-in LinkedIn account.
getAffiliatedPagesbooleanInclude affiliated / showcase / subsidiary pages. Default true.
getAssociatedMembersProfilesbooleanInclude associated profile listings. Default true.
maxProfilesintegerMax profiles per company. Default 10.
requestDelaySecondsnumberDelay between LinkedIn requests (seconds). Default 1.5.
proxyConfigurationobjectOverride the proxy ladder. Default: no proxy — actor escalates automatically.
  1. Log into linkedin.com.
  2. Open DevTools → Application (Chrome) or Storage (Firefox).
  3. Cookies → https://www.linkedin.com → copy the value of li_at.
  4. Paste it into the actor input. Treat it like a password — it gives access to your account.

📤 Output

Each dataset row is one company. Example (abridged):

{
"company_name": "Google",
"url": "https://www.linkedin.com/company/google",
"people_url": "https://www.linkedin.com/company/google/people/",
"Current company": [
{ "name": "Google", "count": 11873 },
{ "name": "Microsoft", "count": 390 }
],
"Locations": [
{ "name": "India", "count": 5122 },
{ "name": "United States", "count": 3623 }
],
"Connections": [
{ "name": "1st", "count": 0 },
{ "name": "2nd", "count": 0 },
{ "name": "3rd+", "count": 0 }
],
"School": [
{ "name": "Stanford University", "count": 224 }
],
"Current Function": [],
"Skill Explicit": [],
"Service categories": [
{ "name": "Coaching & Mentoring", "count": 348 }
],
"Field of Study": [],
"affiliated_pages": [
{
"category": "Information Technology & Services",
"url": "https://www.linkedin.com/showcase/androidenterprise/",
"company": "Android Enterprise"
}
],
"profiles": [
{
"profile_fullname": "Sundar Pichai",
"profile_headline": "CEO at Google",
"profile_location": "Mountain View, CA",
"profile_linkedin": "https://www.linkedin.com/in/sundarpichai"
}
]
}
FieldDescription
🏢 company_nameResolved display name
🔗 urlOriginal company URL
👥 people_urlDirect link to the People tab
🏭 Current companyTop current employers of associated members
📍 LocationsTop member locations
🔁 Connections1st / 2nd / 3rd+ degree breakdown
🎓 SchoolTop schools of associated members
💼 Current FunctionTop job functions (LinkedIn may not expose this)
🛠️ Skill ExplicitTop declared skills (LinkedIn may not expose this)
🧩 Service categoriesService categories the members offer
📚 Field of StudyTop fields of study (LinkedIn may not expose this)
🏷️ affiliated_pagesShowcase / subsidiary / acquired pages
👤 profilesAssociated member profile previews
⚠️ errorPresent only when a company failed to scrape

Note — LinkedIn no longer exposes Current Function, Skill Explicit, or Field of Study through their public people-search facets. Those columns are emitted as empty arrays for forward compatibility.


🚀 How to Use (Apify Console)

  1. Log in at console.apify.comActors.
  2. Open LinkedIn Company Associated Members Scraper.
  3. Paste your company URLs in the 🔗 Company URLs field.
  4. Paste your 🍪 li_at cookie in the cookie field.
  5. (Optional) Tune profiles, delay, proxy.
  6. Click ▶ Start.
  7. Watch the logs — each company is logged with proxy tier, facet samples, and a save confirmation.
  8. Open the Output tab to see the table with per-section columns.

🤖 Use via API

Run synchronously and get results in one call:

curl -X POST \
"https://api.apify.com/v2/acts/<ACTOR_ID>/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"urls": [
"https://www.linkedin.com/company/google",
"https://www.linkedin.com/company/amazon"
],
"liAt": "AQEDAU...",
"getAffiliatedPages": true,
"getAssociatedMembersProfiles": true,
"maxProfiles": 10
}'

Or asynchronously (returns a run ID; poll for status):

curl -X POST \
"https://api.apify.com/v2/acts/<ACTOR_ID>/runs?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{"urls":["https://www.linkedin.com/company/openai"],"liAt":"AQEDAU..."}'

🌐 Proxy Strategy

The actor uses a sticky tiered fallback:

1. 🟢 Direct (no proxy) ← start here
2. 🟡 Apify Datacenter Proxy ← on first block
3. 🔴 Apify Residential Proxy ← if datacenter blocked too (3 retries)

Once the actor escalates to a higher tier, it stays there for the rest of the run — no flapping back down. Each escalation is logged with the exact block reason (HTTP status, network error, etc.) so you can audit what happened.


🎯 Best Use Cases

  • 🧑‍💼 Sales & lead enrichment — qualify a list of target accounts by their team makeup
  • 📊 Competitive intel — see where competitors hire from and which schools dominate
  • 🤝 Recruiting & sourcing — discover talent pools at target employers
  • 🔬 Market research — affiliated-page graphs reveal corporate structure at a glance

💰 Pricing (Pay-Per-Event)

EventWhat it covers
apify-actor-startOne-time charge per run (covers startup overhead)
company-resultCharged once per company successfully scraped

Failed companies still appear in the dataset (with an error field) for transparency, but they do not bill the company-result event.


❓ Frequently Asked Questions

Q: Why do I need to provide a li_at cookie? LinkedIn's Voyager API requires an authenticated session. The cookie is the simplest way to authorize the actor as you. We never persist or log it.

Q: Is the cookie safe to share? It is masked in the input UI and sent over HTTPS only. Still, treat it like a password. Rotate it from LinkedIn's session-management page if you suspect leakage.

Q: Will I get rate-limited? Possible if you push the delay too low or scrape many companies in one run. The actor handles 429/503 gracefully — it backs off, then escalates to a paid proxy tier. Default requestDelaySeconds=1.5 is a safe starting point.

Q: Why are some facet columns empty? LinkedIn no longer exposes Current Function, Skill Explicit, and Field of Study through their public people-search facets. We keep the columns for forward compatibility with their API.

Q: Can the actor scrape profile email addresses? No — this actor only collects public people-search results. Email scraping requires a different actor and stronger legal cover.


  • Data is collected only from publicly available sections of LinkedIn that your account can access.
  • Do not scrape private accounts or content behind closed groups / paywalls.
  • The end user is responsible for legal compliance (GDPR, CCPA, anti-spam laws, LinkedIn's ToS).
  • Respect rate limits — being a good citizen also reduces blocks.

💬 Support & Feedback

Issues, feature requests, or custom integrations? Reach out at dev.scraperengine@gmail.com or open a message in the Apify Console.