SimilarWeb Analytics Scraper
Pricing
Pay per event
SimilarWeb Analytics Scraper
Gain insights from web traffic analysis from SimilarWeb, include sources and geographical breakdown.
Pricing
Pay per event
Rating
0.0
(0)
Developer

Zen Studio
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Share
Extract website traffic, rankings, and traffic sources from SimilarWeb: comprehensive competitive intelligence including global rankings, monthly visits, engagement metrics, geographic distribution, and traffic source breakdown for any website.
No login required. No cookies needed. Just enter your domains and start extracting valuable insights.
Why SimilarWeb Data?
SimilarWeb is the industry-leading web analytics platform trusted by Fortune 500 companies, marketers, and researchers worldwide. With this actor, you can:
- Track competitor traffic: Monitor monthly visits and growth trends
- Analyze traffic sources: Understand where visitors come from (Search, Direct, Social, Referrals)
- Benchmark rankings: Compare global and country-specific rankings
- Identify audience geography: See which countries drive the most traffic
- Measure engagement: Analyze bounce rates, pages per visit, and time on site
Quick Start
Basic: Analyze a Single Domain
{"domains": ["apify.com"]}
Bulk Analysis: Multiple Domains
{"domains": ["apify.com","selenium.dev","playwright.dev","phpscraper.de"]}
Flexible Input: URLs and Domains
The actor accepts both plain domains and full URLs - it automatically extracts the domain:
{"domains": ["example.com","https://www.test.com/some/page","http://another-site.org:8080/path"]}
Input Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
domains | array | Yes | List of domains or URLs to analyze |
Input Flexibility:
- Plain domains:
example.com - With subdomain:
www.example.com,blog.example.com - Full URLs:
https://example.com/page?query=1 - URLs with ports:
http://example.com:8080
All inputs are automatically normalized, deduplicated, and lowercased.
Output Example
Each domain returns comprehensive SimilarWeb analytics data:
{"domain": "apify.com","data": {"site_name": "apify.com","description": "Cloud platform for web scraping, browser automation, AI agents, and data for AI.","title": "Apify: Full-stack web scraping and data extraction platform","global_rank": {"rank": 18630},"country_rank": {"country": 840,"country_code": "US","rank": 16326},"category_rank": {"rank": 441,"category": "Computers_Electronics_and_Technology"},"estimated_monthly_visits": {"2025-07-01": 2199161,"2025-08-01": 2089977,"2025-09-01": 1911397},"engagements": {"bounce_rate": 0.345,"month": 9,"year": 2025,"page_per_visit": 9.49,"visits": 1911397,"time_on_site": 362.21},"top_country_shares": [{"country": 840,"country_code": "US","value": 0.194},{"country": 356,"country_code": "IN","value": 0.119},{"country": 826,"country_code": "GB","value": 0.046}],"traffic_sources": {"social": 0.017,"paid_referrals": 0.005,"mail": 0.001,"referrals": 0.052,"search": 0.444,"direct": 0.482}},"error": null}
Key Data Points
| Field | Description |
|---|---|
global_rank | Worldwide ranking among all websites |
country_rank | Ranking within the primary country |
category_rank | Ranking within the website's category |
estimated_monthly_visits | Last 3 months of traffic data |
engagements | Bounce rate, pages/visit, time on site |
top_country_shares | Geographic distribution of visitors |
traffic_sources | Breakdown by channel (search, direct, social, etc.) |
Failed Domains
Domains that can't be analyzed (too small, blocked, etc.) return with an error:
{"domain": "tiny-unknown-site.com","data": null,"error": "HTTP 404"}
Pricing: Pay As You Go
This actor uses Apify's pay-per-event billing model for cost-effective, usage-based pricing.
| Event | Cost | Description |
|---|---|---|
domain | $1.99 / 1,000 | Charged per successfully analyzed domain |
Volume Discounts
| Tier | Price per 1,000 |
|---|---|
| Base | $1.99 |
| Bronze | $1.95 |
| Silver | $1.90 |
| Gold | $1.80 |
Key billing details:
- Only successful results are charged: Failed domains (404, no data) are free
- No minimum commitment: Pay only for what you use
- Budget controls: Set spending limits in your Apify account
The Actor start is always charged at $0.25/1k starts.
API Integration
Call via Apify API
curl -X POST "https://api.apify.com/v2/acts/zen-studio/similarweb-scraper/runs?token=YOUR_API_TOKEN" \-H "Content-Type: application/json" \-d '{"domains": ["google.com", "facebook.com", "amazon.com"]}'
Python Integration
from apify_client import ApifyClientclient = ApifyClient("YOUR_API_TOKEN")run = client.actor("zen-studio/similarweb-scraper").call(run_input={"domains": ["apify.com", "scrapy.org"]})# Fetch resultsfor item in client.dataset(run["defaultDatasetId"]).iterate_items():print(f"{item['domain']}: Rank #{item['data']['global_rank']['rank']}")
JavaScript/Node.js Integration
import { ApifyClient } from 'apify-client';const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });const run = await client.actor('zen-studio/similarweb-scraper').call({domains: ['apify.com', 'scrapy.org']});const { items } = await client.dataset(run.defaultDatasetId).listItems();items.forEach(item => {console.log(`${item.domain}: Rank #${item.data?.global_rank?.rank}`);});
Use Cases
- Competitive Intelligence: Monitor competitor traffic and identify opportunities to capture market share
- SEO & Content Strategy: Analyze traffic sources to understand what channels work best
- Market Research: Identify rising players and track market dynamics over time
- Lead Qualification (B2B:) Enrich your CRM and prioritize leads based on website performance
- Investment Analysis: Evaluate company performance through web traffic as alternative data
- Affiliate Marketing: Identify high-traffic websites for partnership opportunities
- E-commerce & Retail: Track competitor storefronts and identify effective marketing channels
Advanced Usage
Scheduled Monitoring
Set up recurring runs to track competitors over time:
- Go to the actor's Schedules tab
- Create a new schedule (e.g., weekly on Mondays)
- Configure your input with target domains
- Results are stored in datasets for trend analysis
Integration with Google Sheets
Export results directly to Google Sheets using Apify's integration:
- Run the actor with your domains
- Go to the dataset
- Click Export → Google Sheets
Technical Details
Rate Limits & Performance
- Automatic batching: We chunk your workload into batches for faster parallel processing.
- Parallel processing: Up to 4 concurrent API requests for large lists
- Fault tolerance: Failed batches don't stop other batches from processing
Error Handling
The actor handles errors gracefully:
| Error Type | Behavior |
|---|---|
| Invalid domain | Skipped with validation error |
| No SimilarWeb data | Returns with error message |
| Rate limit (429) | Automatic retry with backoff |
| Server error (5xx) | Automatic retry (3 attempts) |
| Auth failure (401) | Actor fails with clear message |
Data Export
Results can be exported in multiple formats:
- JSON: Full structured data
- CSV: Flattened for spreadsheets
- Excel: Direct .xlsx download
- XML: For legacy system integration
Access exports via the Apify Console or API:
# JSON exportcurl "https://api.apify.com/v2/datasets/DATASET_ID/items?format=json&token=YOUR_TOKEN"# CSV exportcurl "https://api.apify.com/v2/datasets/DATASET_ID/items?format=csv&token=YOUR_TOKEN"
FAQ
How accurate is SimilarWeb data?
SimilarWeb uses a combination of direct measurement, partnerships, and statistical modeling. Data is most accurate for high-traffic sites (100K+ monthly visits). Smaller sites may have less precise estimates.
What happens if a domain has no data?
Domains with insufficient traffic or that are too new may not have SimilarWeb data. These are returned with data: null and an error message. You are not charged for failed domains.
Can I analyze subdomains?
SimilarWeb typically aggregates data at the root domain level. Subdomains like blog.example.com may be redirected to the main domain's data.
How many domains can I analyze?
There's no hard limit. The actor efficiently processes lists of any size using batching. For very large lists (10,000+), consider using scheduled runs.
Is the data real-time?
No. SimilarWeb data is aggregated monthly. The most recent data point is typically 1-2 months old.
Disclaimer
This actor retrieves publicly available data from SimilarWeb. The accuracy and availability of data depends on SimilarWeb's coverage and methodology. Use the data in compliance with applicable laws and SimilarWeb's terms of service.
The actor does not store any data beyond the immediate run results. All data is delivered directly to your Apify dataset.
Need help? Open an issue on the actor's GitHub repository or contact support through Apify Console.