SimilarWeb Analytics Scraper avatar
SimilarWeb Analytics Scraper

Pricing

Pay per event

Go to Apify Store
SimilarWeb Analytics Scraper

SimilarWeb Analytics Scraper

Gain insights from web traffic analysis from SimilarWeb, include sources and geographical breakdown.

Pricing

Pay per event

Rating

0.0

(0)

Developer

Zen Studio

Zen Studio

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Extract website traffic, rankings, and traffic sources from SimilarWeb: comprehensive competitive intelligence including global rankings, monthly visits, engagement metrics, geographic distribution, and traffic source breakdown for any website.

No login required. No cookies needed. Just enter your domains and start extracting valuable insights.

Why SimilarWeb Data?

SimilarWeb is the industry-leading web analytics platform trusted by Fortune 500 companies, marketers, and researchers worldwide. With this actor, you can:

  • Track competitor traffic: Monitor monthly visits and growth trends
  • Analyze traffic sources: Understand where visitors come from (Search, Direct, Social, Referrals)
  • Benchmark rankings: Compare global and country-specific rankings
  • Identify audience geography: See which countries drive the most traffic
  • Measure engagement: Analyze bounce rates, pages per visit, and time on site

Quick Start

Basic: Analyze a Single Domain

{
"domains": ["apify.com"]
}

Bulk Analysis: Multiple Domains

{
"domains": [
"apify.com",
"selenium.dev",
"playwright.dev",
"phpscraper.de"
]
}

Flexible Input: URLs and Domains

The actor accepts both plain domains and full URLs - it automatically extracts the domain:

{
"domains": [
"example.com",
"https://www.test.com/some/page",
"http://another-site.org:8080/path"
]
}

Input Parameters

ParameterTypeRequiredDescription
domainsarrayYesList of domains or URLs to analyze

Input Flexibility:

  • Plain domains: example.com
  • With subdomain: www.example.com, blog.example.com
  • Full URLs: https://example.com/page?query=1
  • URLs with ports: http://example.com:8080

All inputs are automatically normalized, deduplicated, and lowercased.

Output Example

Each domain returns comprehensive SimilarWeb analytics data:

{
"domain": "apify.com",
"data": {
"site_name": "apify.com",
"description": "Cloud platform for web scraping, browser automation, AI agents, and data for AI.",
"title": "Apify: Full-stack web scraping and data extraction platform",
"global_rank": {
"rank": 18630
},
"country_rank": {
"country": 840,
"country_code": "US",
"rank": 16326
},
"category_rank": {
"rank": 441,
"category": "Computers_Electronics_and_Technology"
},
"estimated_monthly_visits": {
"2025-07-01": 2199161,
"2025-08-01": 2089977,
"2025-09-01": 1911397
},
"engagements": {
"bounce_rate": 0.345,
"month": 9,
"year": 2025,
"page_per_visit": 9.49,
"visits": 1911397,
"time_on_site": 362.21
},
"top_country_shares": [
{
"country": 840,
"country_code": "US",
"value": 0.194
},
{
"country": 356,
"country_code": "IN",
"value": 0.119
},
{
"country": 826,
"country_code": "GB",
"value": 0.046
}
],
"traffic_sources": {
"social": 0.017,
"paid_referrals": 0.005,
"mail": 0.001,
"referrals": 0.052,
"search": 0.444,
"direct": 0.482
}
},
"error": null
}

Key Data Points

FieldDescription
global_rankWorldwide ranking among all websites
country_rankRanking within the primary country
category_rankRanking within the website's category
estimated_monthly_visitsLast 3 months of traffic data
engagementsBounce rate, pages/visit, time on site
top_country_sharesGeographic distribution of visitors
traffic_sourcesBreakdown by channel (search, direct, social, etc.)

Failed Domains

Domains that can't be analyzed (too small, blocked, etc.) return with an error:

{
"domain": "tiny-unknown-site.com",
"data": null,
"error": "HTTP 404"
}

Pricing: Pay As You Go

This actor uses Apify's pay-per-event billing model for cost-effective, usage-based pricing.

EventCostDescription
domain$1.99 / 1,000Charged per successfully analyzed domain

Volume Discounts

TierPrice per 1,000
Base$1.99
Bronze$1.95
Silver$1.90
Gold$1.80

Key billing details:

  • Only successful results are charged: Failed domains (404, no data) are free
  • No minimum commitment: Pay only for what you use
  • Budget controls: Set spending limits in your Apify account

The Actor start is always charged at $0.25/1k starts.

API Integration

Call via Apify API

curl -X POST "https://api.apify.com/v2/acts/zen-studio/similarweb-scraper/runs?token=YOUR_API_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"domains": ["google.com", "facebook.com", "amazon.com"]
}'

Python Integration

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("zen-studio/similarweb-scraper").call(
run_input={
"domains": ["apify.com", "scrapy.org"]
}
)
# Fetch results
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(f"{item['domain']}: Rank #{item['data']['global_rank']['rank']}")

JavaScript/Node.js Integration

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });
const run = await client.actor('zen-studio/similarweb-scraper').call({
domains: ['apify.com', 'scrapy.org']
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach(item => {
console.log(`${item.domain}: Rank #${item.data?.global_rank?.rank}`);
});

Use Cases

  • Competitive Intelligence: Monitor competitor traffic and identify opportunities to capture market share
  • SEO & Content Strategy: Analyze traffic sources to understand what channels work best
  • Market Research: Identify rising players and track market dynamics over time
  • Lead Qualification (B2B:) Enrich your CRM and prioritize leads based on website performance
  • Investment Analysis: Evaluate company performance through web traffic as alternative data
  • Affiliate Marketing: Identify high-traffic websites for partnership opportunities
  • E-commerce & Retail: Track competitor storefronts and identify effective marketing channels

Advanced Usage

Scheduled Monitoring

Set up recurring runs to track competitors over time:

  1. Go to the actor's Schedules tab
  2. Create a new schedule (e.g., weekly on Mondays)
  3. Configure your input with target domains
  4. Results are stored in datasets for trend analysis

Integration with Google Sheets

Export results directly to Google Sheets using Apify's integration:

  1. Run the actor with your domains
  2. Go to the dataset
  3. Click ExportGoogle Sheets

Technical Details

Rate Limits & Performance

  • Automatic batching: We chunk your workload into batches for faster parallel processing.
  • Parallel processing: Up to 4 concurrent API requests for large lists
  • Fault tolerance: Failed batches don't stop other batches from processing

Error Handling

The actor handles errors gracefully:

Error TypeBehavior
Invalid domainSkipped with validation error
No SimilarWeb dataReturns with error message
Rate limit (429)Automatic retry with backoff
Server error (5xx)Automatic retry (3 attempts)
Auth failure (401)Actor fails with clear message

Data Export

Results can be exported in multiple formats:

  • JSON: Full structured data
  • CSV: Flattened for spreadsheets
  • Excel: Direct .xlsx download
  • XML: For legacy system integration

Access exports via the Apify Console or API:

# JSON export
curl "https://api.apify.com/v2/datasets/DATASET_ID/items?format=json&token=YOUR_TOKEN"
# CSV export
curl "https://api.apify.com/v2/datasets/DATASET_ID/items?format=csv&token=YOUR_TOKEN"

FAQ

How accurate is SimilarWeb data?

SimilarWeb uses a combination of direct measurement, partnerships, and statistical modeling. Data is most accurate for high-traffic sites (100K+ monthly visits). Smaller sites may have less precise estimates.

What happens if a domain has no data?

Domains with insufficient traffic or that are too new may not have SimilarWeb data. These are returned with data: null and an error message. You are not charged for failed domains.

Can I analyze subdomains?

SimilarWeb typically aggregates data at the root domain level. Subdomains like blog.example.com may be redirected to the main domain's data.

How many domains can I analyze?

There's no hard limit. The actor efficiently processes lists of any size using batching. For very large lists (10,000+), consider using scheduled runs.

Is the data real-time?

No. SimilarWeb data is aggregated monthly. The most recent data point is typically 1-2 months old.

Disclaimer

This actor retrieves publicly available data from SimilarWeb. The accuracy and availability of data depends on SimilarWeb's coverage and methodology. Use the data in compliance with applicable laws and SimilarWeb's terms of service.

The actor does not store any data beyond the immediate run results. All data is delivered directly to your Apify dataset.


Need help? Open an issue on the actor's GitHub repository or contact support through Apify Console.