LinkedIn Company Associated Members Scraper avatar

LinkedIn Company Associated Members Scraper

Under maintenance

Pricing

from $2.99 / 1,000 results

Go to Apify Store
LinkedIn Company Associated Members Scraper

LinkedIn Company Associated Members Scraper

Under maintenance

Pricing

from $2.99 / 1,000 results

Rating

0.0

(0)

Developer

ScrapAPI

ScrapAPI

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

6 days ago

Last modified

Share

๐Ÿข LinkedIn Company Associated Members Scraper

Bulk-scrape LinkedIn company pages for aggregated people insights, affiliated pages, and associated member profiles โ€” at scale, in real time.

Provide a list of LinkedIn company URLs (e.g. https://www.linkedin.com/company/google) and the actor returns:

  • ๐Ÿ“Š Aggregated people facets โ€” top employers, locations, schools, service categories, network distance breakdown
  • ๐Ÿท๏ธ Affiliated pages โ€” showcase pages, subsidiaries, acquired companies
  • ๐Ÿ‘ฅ Associated member profiles โ€” name, headline, location, LinkedIn URL

Every record streams to the dataset the moment it finishes scraping, so partial results are always available even if the run is interrupted.


โœจ Why Choose This Actor?

  • โšก Bulk lookup โ€” feed hundreds of company URLs in one run
  • ๐ŸŒ Smart proxy ladder โ€” starts direct, auto-escalates ๐ŸŸข Direct โ†’ ๐ŸŸก Datacenter โ†’ ๐Ÿ”ด Residential only when blocked
  • ๐Ÿ›ก๏ธ Browser-grade TLS impersonation โ€” curl_cffi with Chrome 131 fingerprint sneaks past basic anti-bot defenses without spawning a headless browser
  • ๐Ÿ“ก Real-time table output โ€” each company is a row, every section a column
  • ๐Ÿช Cookie-based auth โ€” bring your own li_at, no scraping farms
  • ๐Ÿ”„ Self-healing queryIds โ€” discovers LinkedIn's rotating GraphQL hashes from the live page; baked-in fallbacks if discovery is blocked

๐Ÿ”‘ Key Features

FeatureDescription
๐Ÿ”— Bulk URL inputPaste any number of company URLs โ€” one per line
๐ŸŒ Auto proxy fallback๐ŸŸข Direct โ†’ ๐ŸŸก Datacenter โ†’ ๐Ÿ”ด Residential (3 retries)
๐Ÿ“ก Real-time outputEach company is pushed live to the dataset
๐Ÿงฎ Per-section columnsLocations, Schools, Service Categories, Affiliated Pages, Profiles โ€” each as its own table column
๐Ÿช Session-cookie authBring your own li_at cookie
โฑ๏ธ Configurable delayRandom jitter on top of your throttle
๐Ÿ›Ÿ Graceful errorsA failed company doesn't kill the run โ€” it lands in the dataset with an error field

๐Ÿ“ฅ Input

{
"urls": [
"https://www.linkedin.com/company/google",
"https://www.linkedin.com/company/amazon"
],
"liAt": "AQEDAU...your_cookie...",
"getAffiliatedPages": true,
"getAssociatedMembersProfiles": true,
"maxProfiles": 10,
"requestDelaySeconds": 1.5,
"proxyConfiguration": { "useApifyProxy": false }
}
FieldTypeDescription
urlsarray<string>Required. LinkedIn company / showcase / school URLs.
liAtstringRequired. Your li_at session cookie from a logged-in LinkedIn account.
getAffiliatedPagesbooleanInclude affiliated / showcase / subsidiary pages. Default true.
getAssociatedMembersProfilesbooleanInclude associated profile listings. Default true.
maxProfilesintegerMax profiles per company. Default 10.
requestDelaySecondsnumberDelay between LinkedIn requests (seconds). Default 1.5.
proxyConfigurationobjectOverride the proxy ladder. Default: no proxy โ€” actor escalates automatically.
  1. Log into linkedin.com.
  2. Open DevTools โ†’ Application (Chrome) or Storage (Firefox).
  3. Cookies โ†’ https://www.linkedin.com โ†’ copy the value of li_at.
  4. Paste it into the actor input. Treat it like a password โ€” it gives access to your account.

๐Ÿ“ค Output

Each dataset row is one company. Example (abridged):

{
"company_name": "Google",
"url": "https://www.linkedin.com/company/google",
"people_url": "https://www.linkedin.com/company/google/people/",
"Current company": [
{ "name": "Google", "count": 11873 },
{ "name": "Microsoft", "count": 390 }
],
"Locations": [
{ "name": "India", "count": 5122 },
{ "name": "United States", "count": 3623 }
],
"Connections": [
{ "name": "1st", "count": 0 },
{ "name": "2nd", "count": 0 },
{ "name": "3rd+", "count": 0 }
],
"School": [
{ "name": "Stanford University", "count": 224 }
],
"Current Function": [],
"Skill Explicit": [],
"Service categories": [
{ "name": "Coaching & Mentoring", "count": 348 }
],
"Field of Study": [],
"affiliated_pages": [
{
"category": "Information Technology & Services",
"url": "https://www.linkedin.com/showcase/androidenterprise/",
"company": "Android Enterprise"
}
],
"profiles": [
{
"profile_fullname": "Sundar Pichai",
"profile_headline": "CEO at Google",
"profile_location": "Mountain View, CA",
"profile_linkedin": "https://www.linkedin.com/in/sundarpichai"
}
]
}
FieldDescription
๐Ÿข company_nameResolved display name
๐Ÿ”— urlOriginal company URL
๐Ÿ‘ฅ people_urlDirect link to the People tab
๐Ÿญ Current companyTop current employers of associated members
๐Ÿ“ LocationsTop member locations
๐Ÿ” Connections1st / 2nd / 3rd+ degree breakdown
๐ŸŽ“ SchoolTop schools of associated members
๐Ÿ’ผ Current FunctionTop job functions (LinkedIn may not expose this)
๐Ÿ› ๏ธ Skill ExplicitTop declared skills (LinkedIn may not expose this)
๐Ÿงฉ Service categoriesService categories the members offer
๐Ÿ“š Field of StudyTop fields of study (LinkedIn may not expose this)
๐Ÿท๏ธ affiliated_pagesShowcase / subsidiary / acquired pages
๐Ÿ‘ค profilesAssociated member profile previews
โš ๏ธ errorPresent only when a company failed to scrape

Note โ€” LinkedIn no longer exposes Current Function, Skill Explicit, or Field of Study through their public people-search facets. Those columns are emitted as empty arrays for forward compatibility.


๐Ÿš€ How to Use (Apify Console)

  1. Log in at console.apify.com โ†’ Actors.
  2. Open LinkedIn Company Associated Members Scraper.
  3. Paste your company URLs in the ๐Ÿ”— Company URLs field.
  4. Paste your ๐Ÿช li_at cookie in the cookie field.
  5. (Optional) Tune profiles, delay, proxy.
  6. Click โ–ถ Start.
  7. Watch the logs โ€” each company is logged with proxy tier, facet samples, and a save confirmation.
  8. Open the Output tab to see the table with per-section columns.

๐Ÿค– Use via API

Run synchronously and get results in one call:

curl -X POST \
"https://api.apify.com/v2/acts/<ACTOR_ID>/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"urls": [
"https://www.linkedin.com/company/google",
"https://www.linkedin.com/company/amazon"
],
"liAt": "AQEDAU...",
"getAffiliatedPages": true,
"getAssociatedMembersProfiles": true,
"maxProfiles": 10
}'

Or asynchronously (returns a run ID; poll for status):

curl -X POST \
"https://api.apify.com/v2/acts/<ACTOR_ID>/runs?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{"urls":["https://www.linkedin.com/company/openai"],"liAt":"AQEDAU..."}'

๐ŸŒ Proxy Strategy

The actor uses a sticky tiered fallback:

1. ๐ŸŸข Direct (no proxy) โ† start here
2. ๐ŸŸก Apify Datacenter Proxy โ† on first block
3. ๐Ÿ”ด Apify Residential Proxy โ† if datacenter blocked too (3 retries)

Once the actor escalates to a higher tier, it stays there for the rest of the run โ€” no flapping back down. Each escalation is logged with the exact block reason (HTTP status, network error, etc.) so you can audit what happened.


๐ŸŽฏ Best Use Cases

  • ๐Ÿง‘โ€๐Ÿ’ผ Sales & lead enrichment โ€” qualify a list of target accounts by their team makeup
  • ๐Ÿ“Š Competitive intel โ€” see where competitors hire from and which schools dominate
  • ๐Ÿค Recruiting & sourcing โ€” discover talent pools at target employers
  • ๐Ÿ”ฌ Market research โ€” affiliated-page graphs reveal corporate structure at a glance

๐Ÿ’ฐ Pricing (Pay-Per-Event)

EventWhat it covers
apify-actor-startOne-time charge per run (covers startup overhead)
company-resultCharged once per company successfully scraped

Failed companies still appear in the dataset (with an error field) for transparency, but they do not bill the company-result event.


โ“ Frequently Asked Questions

Q: Why do I need to provide a li_at cookie? LinkedIn's Voyager API requires an authenticated session. The cookie is the simplest way to authorize the actor as you. We never persist or log it.

Q: Is the cookie safe to share? It is masked in the input UI and sent over HTTPS only. Still, treat it like a password. Rotate it from LinkedIn's session-management page if you suspect leakage.

Q: Will I get rate-limited? Possible if you push the delay too low or scrape many companies in one run. The actor handles 429/503 gracefully โ€” it backs off, then escalates to a paid proxy tier. Default requestDelaySeconds=1.5 is a safe starting point.

Q: Why are some facet columns empty? LinkedIn no longer exposes Current Function, Skill Explicit, and Field of Study through their public people-search facets. We keep the columns for forward compatibility with their API.

Q: Can the actor scrape profile email addresses? No โ€” this actor only collects public people-search results. Email scraping requires a different actor and stronger legal cover.


  • Data is collected only from publicly available sections of LinkedIn that your account can access.
  • Do not scrape private accounts or content behind closed groups / paywalls.
  • The end user is responsible for legal compliance (GDPR, CCPA, anti-spam laws, LinkedIn's ToS).
  • Respect rate limits โ€” being a good citizen also reduces blocks.

๐Ÿ’ฌ Support & Feedback

Issues, feature requests, or custom integrations? Reach out at dev.scraperengine@gmail.com or open a message in the Apify Console.