# LinkedIn Company Scraper (`ramman/linkedin-company-scraper`) Actor

Scrapes public LinkedIn company profile data, including name, website, industry, size, headquarters, Job Urls, founded year, specialties, followers, and about text.

- **URL**: https://apify.com/ramman/linkedin-company-scraper.md
- **Developed by:** [ramman](https://apify.com/ramman) (community)
- **Categories:** Automation, Developer tools, Lead generation
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $3.00 / 1,000 results

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### What does LinkedIn Company Scraper do?

**LinkedIn Company Scraper** extracts company profile details from **LinkedIn company pages** (for example `https://www.linkedin.com/company/microsoft/`). You provide one or more company URLs and the Actor saves structured records into an **Apify dataset**.

Typical extracted fields include company name, website, industry, company size, headquarters, founded year, specialties, followers, about text — and (optionally) job posting URLs linked from the company jobs page.

This Actor is designed to run on the Apify platform, so you can use **API access, scheduling, integrations, proxy rotation, and run monitoring**.

### Why use LinkedIn Company Scraper?

- **Lead generation / enrichment**: build firmographic datasets for outbound and CRM enrichment
- **Market research**: compare companies by industry, size, location, and description text
- **Automation**: replace manual copy/paste with repeatable, exportable runs
- **Easy exports**: download results as JSON, CSV, Excel, or HTML from the Dataset tab

### How to use LinkedIn Company Scraper (step-by-step)

1. Open the Actor in Apify Console and go to the **Input** tab.
2. Paste your LinkedIn company page URLs into **Company URLs**.
3. Keep **Use Apify Proxy** enabled (recommended for LinkedIn).
4. If LinkedIn shows a login wall, add your **`li_at` session cookie** (optional but helps a lot).
5. Click **Start** and wait for the run to finish.
6. Open the **Dataset** tab to view, export, or download your results.

### Input

You configure the run from the **Input** tab in Apify Console.

#### Main input fields

- **`companyUrls` (required)**: LinkedIn company profile URLs to scrape.
- **`useApifyProxy` (recommended)**: turns on Apify Proxy to reduce blocks.
- **`apifyProxyGroups`**: which proxy group(s) to use (default: `RESIDENTIAL`).
- **`linkedInSessionCookie` (`li_at`, optional)**: paste your `li_at` cookie value from a logged-in browser session to improve access when LinkedIn blocks unauthenticated users.
- **`linkedInCookies` (advanced, optional)**: pass full cookie objects if you know what you’re doing.
- **`maxRequestRetries`**: how many times to retry a failed company page.
- **`requestDelaySecs`**: delay between requests to reduce blocking risk.
- **`maxJobUrls`**: maximum number of job posting URLs to extract per company.

#### Input example

```json
{
  "companyUrls": [
    "https://www.linkedin.com/company/microsoft/",
    "https://www.linkedin.com/company/google/"
  ],
  "useApifyProxy": true,
  "apifyProxyGroups": ["RESIDENTIAL"],
  "linkedInSessionCookie": "",
  "maxRequestRetries": 3,
  "requestDelaySecs": 3,
  "maxJobUrls": 50
}
````

### Output

The Actor stores results in the **default dataset**. Each dataset item is one company.

Example output (simplified):

```json
{
  "companyName": "Microsoft",
  "linkedinUrl": "https://www.linkedin.com/company/microsoft/",
  "website": "https://news.microsoft.com/",
  "industry": "Software Development",
  "companySize": "10,001+ employees",
  "headquarters": "Redmond, Washington",
  "founded": "1975",
  "specialties": "Business Software, Developer Tools, ...",
  "followers": "25,000,000 followers",
  "about": "Every company has a mission...",
  "jobUrls": [
    "https://www.linkedin.com/jobs/view/123...",
    "https://www.linkedin.com/jobs/view/456..."
  ]
}
```

You can download the dataset in various formats such as **JSON, HTML, CSV, or Excel**.

### Data table (main fields)

| Field | Description |
| --- | --- |
| `companyName` | Official company name |
| `linkedinUrl` | LinkedIn company profile URL |
| `website` | Company website (if available) |
| `industry` | LinkedIn industry label |
| `companySize` | Employee count range |
| `headquarters` | HQ location |
| `founded` | Year founded |
| `specialties` | Listed specialties |
| `followers` | Follower count text (as shown) |
| `about` | Company description / overview |
| `jobUrls` | Job posting URLs (up to `maxJobUrls`) |

### Pricing / Cost estimation

How much does it cost to scrape LinkedIn company pages?

- The main cost driver is **how many company URLs** you scrape and whether LinkedIn requires additional retries / slower browsing.
- If you enable **Residential proxies**, success rates are usually better, but proxy usage can increase cost.
- To estimate cost quickly, run a small test (e.g. 10 URLs) and extrapolate based on your average time per company.

### Tips & advanced options

- If you get blocked or see missing fields, **increase `requestDelaySecs`** and keep concurrency low.
- For the best success rate, use **Apify Proxy + `li_at` cookie** (optional).
- If you only need company profiles (not job URLs), consider lowering `maxJobUrls` to reduce extra requests.

### FAQ, disclaimers, and support

#### Is scraping LinkedIn allowed?

Scraping LinkedIn is subject to LinkedIn’s Terms of Service and applicable laws. Use this Actor responsibly, respect rate limits, and only collect data you are allowed to process.

#### Why do I see partial results?

LinkedIn sometimes shows a login wall or rate-limits requests. Using Apify Proxy and (optionally) a valid `li_at` cookie usually improves access, but LinkedIn page structure can change over time.

#### Support

For issues or feature requests, please use the Actor **Issues** tab in Apify Console.

### For developers (optional)

Local development:

```bash
npm install
apify run
```

# Actor input Schema

## `companyUrls` (type: `array`):

LinkedIn company profile URLs to scrape (e.g. https://www.linkedin.com/company/microsoft/).

## `useApifyProxy` (type: `boolean`):

Recommended for LinkedIn. Uses Apify residential proxies to reduce login blocks.

## `apifyProxyGroups` (type: `array`):

Proxy groups to use. RESIDENTIAL is recommended for LinkedIn.

## `linkedInSessionCookie` (type: `string`):

Optional li\_at cookie value from a logged-in LinkedIn browser session. Improves success rate when LinkedIn shows a login wall.

## `linkedInCookies` (type: `array`):

Optional full cookie objects to pass to the browser context.

## `maxRequestRetries` (type: `integer`):

How many times to retry a failed company page request.

## `requestDelaySecs` (type: `integer`):

Minimum delay between requests to linkedin.com to reduce blocking risk.

## `maxJobUrls` (type: `integer`):

Maximum number of job posting URLs to extract per company from the jobs page.

## Actor input object example

```json
{
  "companyUrls": [
    "https://www.linkedin.com/company/microsoft/",
    "https://www.linkedin.com/company/google/"
  ],
  "useApifyProxy": true,
  "apifyProxyGroups": [
    "RESIDENTIAL"
  ],
  "linkedInCookies": [],
  "maxRequestRetries": 3,
  "requestDelaySecs": 3,
  "maxJobUrls": 50
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "companyUrls": [
        "https://www.linkedin.com/company/microsoft/",
        "https://www.linkedin.com/company/google/"
    ],
    "apifyProxyGroups": [
        "RESIDENTIAL"
    ],
    "linkedInCookies": []
};

// Run the Actor and wait for it to finish
const run = await client.actor("ramman/linkedin-company-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "companyUrls": [
        "https://www.linkedin.com/company/microsoft/",
        "https://www.linkedin.com/company/google/",
    ],
    "apifyProxyGroups": ["RESIDENTIAL"],
    "linkedInCookies": [],
}

# Run the Actor and wait for it to finish
run = client.actor("ramman/linkedin-company-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "companyUrls": [
    "https://www.linkedin.com/company/microsoft/",
    "https://www.linkedin.com/company/google/"
  ],
  "apifyProxyGroups": [
    "RESIDENTIAL"
  ],
  "linkedInCookies": []
}' |
apify call ramman/linkedin-company-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=ramman/linkedin-company-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "LinkedIn Company Scraper",
        "description": "Scrapes public LinkedIn company profile data, including name, website, industry, size, headquarters, Job Urls, founded year, specialties, followers, and about text.",
        "version": "0.1",
        "x-build-id": "y3ETdoEb6weDcrR3C"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/ramman~linkedin-company-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-ramman-linkedin-company-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/ramman~linkedin-company-scraper/runs": {
            "post": {
                "operationId": "runs-sync-ramman-linkedin-company-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/ramman~linkedin-company-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-ramman-linkedin-company-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "companyUrls"
                ],
                "properties": {
                    "companyUrls": {
                        "title": "Company URLs",
                        "type": "array",
                        "description": "LinkedIn company profile URLs to scrape (e.g. https://www.linkedin.com/company/microsoft/).",
                        "items": {
                            "type": "string"
                        }
                    },
                    "useApifyProxy": {
                        "title": "Use Apify Proxy",
                        "type": "boolean",
                        "description": "Recommended for LinkedIn. Uses Apify residential proxies to reduce login blocks.",
                        "default": true
                    },
                    "apifyProxyGroups": {
                        "title": "Apify Proxy groups",
                        "type": "array",
                        "description": "Proxy groups to use. RESIDENTIAL is recommended for LinkedIn.",
                        "default": [
                            "RESIDENTIAL"
                        ],
                        "items": {
                            "type": "string"
                        }
                    },
                    "linkedInSessionCookie": {
                        "title": "LinkedIn session cookie (li_at)",
                        "type": "string",
                        "description": "Optional li_at cookie value from a logged-in LinkedIn browser session. Improves success rate when LinkedIn shows a login wall."
                    },
                    "linkedInCookies": {
                        "title": "LinkedIn cookies (advanced)",
                        "type": "array",
                        "description": "Optional full cookie objects to pass to the browser context."
                    },
                    "maxRequestRetries": {
                        "title": "Max request retries",
                        "minimum": 0,
                        "maximum": 10,
                        "type": "integer",
                        "description": "How many times to retry a failed company page request.",
                        "default": 3
                    },
                    "requestDelaySecs": {
                        "title": "Delay between requests (seconds)",
                        "minimum": 0,
                        "maximum": 60,
                        "type": "integer",
                        "description": "Minimum delay between requests to linkedin.com to reduce blocking risk.",
                        "default": 3
                    },
                    "maxJobUrls": {
                        "title": "Max job URLs",
                        "minimum": 1,
                        "maximum": 200,
                        "type": "integer",
                        "description": "Maximum number of job posting URLs to extract per company from the jobs page.",
                        "default": 50
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
