# Similarweb Scraper – Website Traffic & Analytics Data (`logiover/similarweb-scraper`) Actor

Similarweb scraper and API alternative. Export website traffic, ranks and analytics to CSV/JSON; scrape Top Websites data without login or API key.

- **URL**: https://apify.com/logiover/similarweb-scraper.md
- **Developed by:** [Logiover](https://apify.com/logiover) (community)
- **Categories:** Business, Marketing, SEO tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $3.50 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Similarweb Scraper — Website Traffic & Analytics Data

Scrape **Similarweb website traffic and analytics** for any domain, or pull the public **Top Websites rankings** in bulk. This actor is a **Similarweb scraper** and **website traffic checker** that turns Similarweb's public data into a clean, analysis-ready dataset for competitor research and SEO.

It runs in two modes — a high-volume ranking mode (hundreds to thousands of ranked sites per run) and a per-domain enrichment mode with full traffic snapshots. No login, no cookies, no API key. Export CSV, JSON or Excel.

---

### What you get

**Ranking mode** — one record per ranked site:

- `rank`, `domain`, `category`
- `rankChange`
- `avgVisitDuration`, `pagesPerVisit`, `bounceRate`
- `listCategory`, `listCountry`, `listUrl`, `websiteUrl`, `scrapedAt`

**Enrichment mode** — one full analytics record per domain:

- `domain`, `totalVisits`
- `globalRank`, `countryRank`, `countryRankCountry`, `categoryRank`, `category`
- `bounceRate`, `pagesPerVisit`, `avgVisitDuration`
- `topTrafficSource`, `topTrafficSourceShare`, `trafficSources` (full breakdown)
- `topCountries` (share + change per country)
- `dataPeriod`, `scrapedAt`

Leave categories and countries empty in ranking mode to scrape the global list plus every top category — 1,000+ ranked sites in a single run.

---

### Use cases

- **Competitor traffic analysis** — pull total visits, ranks and engagement for any competitor domain.
- **Market & niche research** — scrape an entire category's Top Websites list to map the players.
- **SEO & content benchmarking** — compare bounce rate, pages per visit and visit duration across sites.
- **Lead and prospect lists** — build a ranked dataset of the top sites per country or vertical.
- **Traffic-source insight** — see each domain's top traffic source and top countries.
- **Analytics products** — feed ranked, fresh website datasets into your own dashboard.

---

### Input reference

| Field | Type | Description |
|-------|------|-------------|
| `categories` | array | Ranking mode. Category slugs (`finance`, `games`, …). 50 sites each. Empty + no countries scrapes ALL top categories. |
| `countries` | array | Ranking mode. Country slugs (`united-states`, `germany`, …). 50 sites each. |
| `global` | boolean | Ranking mode. Also scrape the global Top 50. |
| `maxResults` | integer | Cap total output records (0 = no cap). |
| `domains` | array | Enrichment mode. Domains to look up individually. |
| `concurrency` | integer | Parallel browsers for enrichment (1–10). |
| `proxyConfiguration` | object | US residential proxy (required — Similarweb is WAF-protected). |

> Combined country + category lists are limited by Similarweb to ~5 rows on the free pages; single-category and single-country lists return the full 50.

---

### How to use

1. **Ranking mode:** set **Categories** (e.g. `e-commerce-and-shopping`, `finance`) and/or **Countries** (e.g. `united-states`, `germany`), or enable **global**. Leave both empty to scrape all top categories.
2. **Enrichment mode:** add **Domains** (e.g. `apify.com`, `shopify.com`) to fetch full per-domain analytics. You can use either mode or both.
3. Set **Max results** to cap large runs and keep the default US residential **proxy** (required — Similarweb is WAF-protected).
4. Run, then export to CSV, JSON or Excel — or pull via the Apify API.

#### Example — ranking mode (one category)

```json
{ "categories": ["e-commerce-and-shopping"] }
````

#### Example — all top categories, capped

```json
{ "global": true, "maxResults": 2000 }
```

#### Example — enrichment mode

```json
{ "domains": ["apify.com", "github.com", "shopify.com"] }
```

#### Sample enrichment output

```json
{
  "domain": "apify.com",
  "totalVisits": "4.1M",
  "globalRank": "#8,929",
  "countryRank": "#9,123",
  "categoryRank": "#268",
  "bounceRate": "36.35%",
  "pagesPerVisit": "8.04",
  "avgVisitDuration": "00:04:52",
  "topTrafficSource": "Direct",
  "dataPeriod": "April 2026"
}
```

#### How it works

- Ranking mode reads `similarweb.com/top-websites/{country?}/{category?}/` pages with a fast HTTP client over a US residential proxy and parses the ranking table — no headless browser, so it is cheap and scales.
- Enrichment mode opens each `similarweb.com/website/{domain}/` page in a headless browser, clears the AWS WAF challenge, and parses the rendered analytics. Domains run concurrently.

***

### FAQ

#### How many records can ranking mode return?

Up to 50 per list. Each top category and each country is its own list, so scraping all top categories yields 1,000+ records, and adding countries multiplies that. Use `maxResults` to cap a run.

#### Why is there no total-visits number in ranking mode?

Similarweb's public Top Websites table only shows rank, category, rank change, average visit duration, pages per visit and bounce rate — no numeric visit count. To get total visits for a site, run it through enrichment mode.

#### Do I need a Similarweb login or API key?

No. Both modes read public Similarweb pages — no account, cookies or developer key.

#### Why are residential proxies required?

Similarweb is protected by AWS WAF, so US residential proxies are needed to load the pages reliably. The default proxy configuration handles this for you.

#### Can I export to CSV, JSON or Excel?

Yes. Both modes write to an Apify dataset you can export to CSV, JSON, Excel, XML or JSONL from the run page or via the API.

#### Is this a Similarweb API alternative?

Yes. It reads Similarweb's public pages directly, so you get traffic, ranks and analytics data without the official Similarweb API, paid plan or developer key.

#### How do I scrape Similarweb without login?

Both ranking and enrichment modes read public Similarweb pages over a US residential proxy — no account, cookies or sign-in needed. Just add domains or categories and run.

#### How do I export Similarweb traffic data to CSV or JSON?

Run the actor, then download the dataset as CSV, JSON or Excel from the run page, or pull the website traffic data via the Apify API for your own pipeline.

***

### Changelog

#### 2026-06-07

- Docs: added coverage for Similarweb API alternative, scraping without login, and exporting website traffic data to CSV/JSON.

***

*Collects publicly available data only. Not affiliated with or endorsed by Similarweb. You are responsible for compliance with Similarweb's Terms and applicable laws.*

# Actor input Schema

## `categories` (type: `array`):

Similarweb category slugs to pull Top Websites rankings for, e.g. 'e-commerce-and-shopping', 'finance', 'news-and-media', 'ai-chatbots-and-tools'. 50 sites per category. Leave empty to scrape ALL top categories (1,000+ sites).

## `countries` (type: `array`):

Country slugs for country-level Top Websites rankings, e.g. 'united-states', 'germany', 'united-kingdom', 'turkey'. 50 sites per country. Combine with categories for country+category lists (note: Similarweb limits combined lists to ~5 rows on the free pages).

## `global` (type: `boolean`):

Also scrape the global Top Websites list (top 50 sites worldwide).

## `maxResults` (type: `integer`):

Cap total output records (0 = no cap). Useful to limit large all-category runs.

## `domains` (type: `array`):

Website domains to look up individually on Similarweb. One full-analytics record per domain (total visits, ranks, bounce rate, pages/visit, visit duration, traffic sources, top countries). Protocol and www are stripped automatically.

## `concurrency` (type: `integer`):

Parallel browsers for domain-enrichment mode (1-10). Higher finishes large domain lists faster but uses more memory.

## `proxyConfiguration` (type: `object`):

Similarweb is protected by AWS WAF. US residential proxies are required. Leave the default (Apify Proxy · RESIDENTIAL · US).

## Actor input object example

```json
{
  "categories": [
    "e-commerce-and-shopping",
    "finance",
    "games"
  ],
  "countries": [
    "united-states",
    "germany"
  ],
  "global": false,
  "maxResults": 10,
  "domains": [
    "apify.com",
    "nytimes.com",
    "shopify.com"
  ],
  "concurrency": 5,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ],
    "apifyProxyCountry": "US"
  }
}
```

# Actor output Schema

## `rank` (type: `string`):

Rank in list

## `domain` (type: `string`):

Domain

## `category` (type: `string`):

Category

## `totalVisits` (type: `string`):

Total visits

## `globalRank` (type: `string`):

Global rank

## `bounceRate` (type: `string`):

Bounce rate

## `avgVisitDuration` (type: `string`):

Avg visit duration

## `pagesPerVisit` (type: `string`):

Pages per visit

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "categories": [
        "e-commerce-and-shopping"
    ],
    "maxResults": 10,
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ],
        "apifyProxyCountry": "US"
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("logiover/similarweb-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "categories": ["e-commerce-and-shopping"],
    "maxResults": 10,
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
        "apifyProxyCountry": "US",
    },
}

# Run the Actor and wait for it to finish
run = client.actor("logiover/similarweb-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "categories": [
    "e-commerce-and-shopping"
  ],
  "maxResults": 10,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ],
    "apifyProxyCountry": "US"
  }
}' |
apify call logiover/similarweb-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=logiover/similarweb-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Similarweb Scraper – Website Traffic & Analytics Data",
        "description": "Similarweb scraper and API alternative. Export website traffic, ranks and analytics to CSV/JSON; scrape Top Websites data without login or API key.",
        "version": "1.0",
        "x-build-id": "7cDcBNCxpfgclTtaj"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/logiover~similarweb-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-logiover-similarweb-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/logiover~similarweb-scraper/runs": {
            "post": {
                "operationId": "runs-sync-logiover-similarweb-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/logiover~similarweb-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-logiover-similarweb-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "categories": {
                        "title": "Categories (ranking mode)",
                        "type": "array",
                        "description": "Similarweb category slugs to pull Top Websites rankings for, e.g. 'e-commerce-and-shopping', 'finance', 'news-and-media', 'ai-chatbots-and-tools'. 50 sites per category. Leave empty to scrape ALL top categories (1,000+ sites).",
                        "items": {
                            "type": "string"
                        }
                    },
                    "countries": {
                        "title": "Countries (ranking mode)",
                        "type": "array",
                        "description": "Country slugs for country-level Top Websites rankings, e.g. 'united-states', 'germany', 'united-kingdom', 'turkey'. 50 sites per country. Combine with categories for country+category lists (note: Similarweb limits combined lists to ~5 rows on the free pages).",
                        "items": {
                            "type": "string"
                        }
                    },
                    "global": {
                        "title": "Include global Top 50 (ranking mode)",
                        "type": "boolean",
                        "description": "Also scrape the global Top Websites list (top 50 sites worldwide).",
                        "default": false
                    },
                    "maxResults": {
                        "title": "Max results",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Cap total output records (0 = no cap). Useful to limit large all-category runs.",
                        "default": 0
                    },
                    "domains": {
                        "title": "Domains (enrichment mode)",
                        "type": "array",
                        "description": "Website domains to look up individually on Similarweb. One full-analytics record per domain (total visits, ranks, bounce rate, pages/visit, visit duration, traffic sources, top countries). Protocol and www are stripped automatically.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "concurrency": {
                        "title": "Enrichment concurrency",
                        "minimum": 1,
                        "maximum": 10,
                        "type": "integer",
                        "description": "Parallel browsers for domain-enrichment mode (1-10). Higher finishes large domain lists faster but uses more memory.",
                        "default": 5
                    },
                    "proxyConfiguration": {
                        "title": "Proxy",
                        "type": "object",
                        "description": "Similarweb is protected by AWS WAF. US residential proxies are required. Leave the default (Apify Proxy · RESIDENTIAL · US).",
                        "default": {
                            "useApifyProxy": true,
                            "apifyProxyGroups": [
                                "RESIDENTIAL"
                            ],
                            "apifyProxyCountry": "US"
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
