Sephora Scraper
Pricing
$30.00/month + usage
Pricing
$30.00/month + usage
Rating
5.0
(1)
Developer
Richard Feng
Actor stats
11
Bookmarked
305
Total users
10
Monthly active users
0.87 hours
Issues response
13 days ago
Last modified
Categories
Share
Sephora Scraper is a professional-grade Apify actor designed to extract high-fidelity product data from Sephora's international stores. Built for reliability and scale, it intelligently handles anti-scraping measures, manages sessions, and delivers detailed structured data for your e-commerce analytics.
π Features
- Deep Product Extraction: Retrieves detailed product specifications, including:
- Multiple variants (shades, sizes) with individual pricing and stock status.
- High-resolution image galleries and media assets.
- Rich HTML descriptions, ingredients, and "how to use" guides.
- Review counts, ratings, and "loves" metrics.
- AI-generated sentiment summary and tagged pros/cons from customer reviews.
- Smart URL Parsing: Automatically converts standard web URLs (e.g.,
www.sephora.com/shop/...) into efficient API calls. - Multi-Region Support: Seamlessly supports
en-US,en-CA, andfr-CAstores based on input URLs. - Anti-Blocking Architecture:
- Built-in Apify Proxy integration with session rotation.
- Smart retries (up to 5 times) for failed requests.
- Mimics real-user behavior to bypass security checks.
- Scalable Performance: Configurable concurrency to balance speed and stability.
π Input Parameters
The actor accepts a JSON object with the following configuration:
| Field | Type | Description | Default |
|---|---|---|---|
startUrls | Array | Required. A list of Sephora URLs to scrape. Supports: β’ Category Pages: https://www.sephora.com/shop/...β’ Product Pages: https://www.sephora.com/product/...β’ International URLs: sephora.com/ca/en/... | [] |
proxy | Object | Proxy configuration. Residential proxies are highly recommended for Sephora. | { "useApifyProxy": true } |
maxConcurrency | Integer | Maximum number of parallel requests. Reduce this if you encounter blocking. | 5 |
maxRequestsPerCrawl | Integer | Limit the total number of requests. Set to 0 for unlimited crawling. | 0 |
Input Example
{"startUrls": [{ "url": "https://www.sephora.com/shop/clean-makeup" },{ "url": "https://www.sephora.com/product/glossier-cloud-paint-gel-cream-blush-P468600" }],"proxy": {"useApifyProxy": true,"apifyProxyGroups": ["RESIDENTIAL"]},"maxConcurrency": 2,"maxRequestsPerCrawl": 100}
π¦ Output Data
Data is stored in the default dataset in JSON format. Each item represents a unique product.
Data Schema
| Field | Type | Description |
|---|---|---|
source | Object | Metadata about the crawl (ID, URL, Retailer, Currency). |
brand | String | Product brand name. |
title | String | Product title. |
description | String | Full HTML description. |
variants | Array | List of all SKUs/variants for this product. |
price | Object | Pricing details inside each variant (current, original, stock). |
medias | Array | All images and media associated with the product. |
stats | Object | Review count, rating, and loves count. |
sentiments | Object | AI-generated sentiment summary and tagged pros/cons items from customer reviews. |
π§ Advanced Configuration
Proxy Configuration
Sephora employs strict anti-scraping measures.
- Residential Proxies: Mandatory for consistent success. Datacenter proxies are often blocked.
- Session Persistence: The scraper uses intelligent session rotation (cookies & headers) to maintain access.
Performance Tuning
- Default Concurrency: Set to
5to be safe. - Scaling Up: If using high-quality residential proxies, you can increase
maxConcurrencyto10-20for faster speeds. - Debugging: Set
maxRequestsPerCrawlto a low number (e.g.,10) to verify your configuration before a full run.
β FAQ & Troubleshooting
Q: I'm getting 403 Forbidden errors. A: This usually means your IP is blocked. Ensure you are using Residential Proxies. If the issue persists, try reducing concurrency.
Q: Can I scrape reviews? A: Currently, this actor fetches review stats (count, average rating) but not individual review text. Review text extraction is on the roadmap.
Q: Does it support other countries?
A: Yes! The scraper automatically detects the region from your input URL (e.g., sephora.com/ca).
TODO / Roadmap
- Video Extraction: Fetch product video URLs.
- Reviews: Extract full text of user reviews.
- Ingredients Analysis: Structured breakdown of ingredients.