Sephora Scraper avatar
Sephora Scraper

Pricing

$20.00/month + usage

Go to Apify Store
Sephora Scraper

Sephora Scraper

Sephora scraper to fetch product or products of category.

Pricing

$20.00/month + usage

Rating

5.0

(2)

Developer

Autofactor

Autofactor

Maintained by Community

Actor stats

8

Bookmarked

254

Total users

5

Monthly active users

34 days

Issues response

3 days ago

Last modified

Share

Sephora Scraper is a professional-grade Apify actor designed to extract high-fidelity product data from Sephora's international stores. Built for reliability and scale, it intelligently handles anti-scraping measures, manages sessions, and delivers detailed structured data for your e-commerce analytics.


🚀 Features

  • Deep Product Extraction: Retrieves detailed product specifications, including:
    • Multiple variants (shades, sizes) with individual pricing and stock status.
    • High-resolution image galleries and media assets.
    • Rich HTML descriptions, ingredients, and "how to use" guides.
    • Review counts, ratings, and "loves" metrics.
  • Smart URL Parsing: Automatically converts standard web URLs (e.g., www.sephora.com/shop/...) into efficient API calls.
  • Multi-Region Support: Seamlessly supports en-US, en-CA, and fr-CA stores based on input URLs.
  • Anti-Blocking Architecture:
    • Built-in Apify Proxy integration with session rotation.
    • Smart retries (up to 5 times) for failed requests.
    • Mimics real-user behavior to bypass security checks.
  • Scalable Performance: Configurable concurrency to balance speed and stability.

📋 Input Parameters

The actor accepts a JSON object with the following configuration:

FieldTypeDescriptionDefault
startUrlsArrayRequired. A list of Sephora URLs to scrape. Supports:
Category Pages: https://www.sephora.com/shop/...
Product Pages: https://www.sephora.com/product/...
International URLs: sephora.com/ca/en/...
[]
proxyObjectProxy configuration. Residential proxies are highly recommended for Sephora.{ "useApifyProxy": true }
maxConcurrencyIntegerMaximum number of parallel requests. Reduce this if you encounter blocking.5
maxRequestsPerCrawlIntegerLimit the total number of requests. Set to 0 for unlimited crawling.0

Input Example

{
"startUrls": [
{ "url": "https://www.sephora.com/shop/clean-makeup" },
{ "url": "https://www.sephora.com/product/glossier-cloud-paint-gel-cream-blush-P468600" }
],
"proxy": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
},
"maxConcurrency": 2,
"maxRequestsPerCrawl": 100
}

📦 Output Data

Data is stored in the default dataset in JSON format. Each item represents a unique product.

Data Schema

FieldTypeDescription
sourceObjectMetadata about the crawl (ID, URL, Retailer, Currency).
brandStringProduct brand name.
titleStringProduct title.
descriptionStringFull HTML description.
variantsArrayList of all SKUs/variants for this product.
priceObjectPricing details inside each variant (current, original, stock).
mediasArrayAll images and media associated with the product.
statsObjectReview count, rating, and loves count.

🔧 Advanced Configuration

Proxy Configuration

Sephora employs strict anti-scraping measures.

  • Residential Proxies: Mandatory for consistent success. Datacenter proxies are often blocked.
  • Session Persistence: The scraper uses intelligent session rotation (cookies & headers) to maintain access.

Performance Tuning

  • Default Concurrency: Set to 5 to be safe.
  • Scaling Up: If using high-quality residential proxies, you can increase maxConcurrency to 10-20 for faster speeds.
  • Debugging: Set maxRequestsPerCrawl to a low number (e.g., 10) to verify your configuration before a full run.

❓ FAQ & Troubleshooting

Q: I'm getting 403 Forbidden errors. A: This usually means your IP is blocked. Ensure you are using Residential Proxies. If the issue persists, try reducing concurrency.

Q: Can I scrape reviews? A: Currently, this actor fetches review stats (count, average rating) but not individual review text. Review text extraction is on the roadmap.

Q: Does it support other countries? A: Yes! The scraper automatically detects the region from your input URL (e.g., sephora.com/ca).


TODO / Roadmap

  • Video Extraction: Fetch product video URLs.
  • Reviews: Extract full text of user reviews.
  • Ingredients Analysis: Structured breakdown of ingredients.