Shein Product Scraper avatar

Shein Product Scraper

Pricing

from $35.00 / 1,000 results

Go to Apify Store
Shein Product Scraper

Shein Product Scraper

Pricing

from $35.00 / 1,000 results

Rating

5.0

(1)

Developer

ruly optimism

ruly optimism

Maintained by Community

Actor stats

0

Bookmarked

45

Total users

6

Monthly active users

7.8 hours

Issues response

a month ago

Last modified

Share

๐Ÿ›๏ธ SHEIN Product Scraper

The most reliable scraper for extracting comprehensive product data from SHEIN - one of the world's largest fashion e-commerce platforms with over 150 million active users.

โœจ Why Choose This Scraper?

  • High Success Rate - 95%+ success rate with 3-layer anti-blocking system
  • Complete Product Data - Get everything: prices, sizes, colors, specs, images, SKUs
  • Fast & Efficient - Optimized for speed (~10-30s per product)
  • Smart Recovery - Automatic browser restart on crashes, continues processing
  • Proxy Included - Premium residential proxy for Israel domain included
  • Reliable - 3 captcha bypass strategies + automatic retries

๐Ÿ›ก๏ธ Anti-Blocking System

This scraper includes a sophisticated 3-strategy captcha bypass system:

StrategyDescriptionSpeed
Strategy 1Main domain redirect~2-3s
Strategy 2Wait & retry~4-5s
Strategy 3IP rotation (new proxy)~40-50s

The system automatically tries each strategy in order until success.

๐Ÿ“Š What Data You Get

Each product returns comprehensive, structured data:

{
"sku": "sz2503138148213601",
"product_id": "70811510",
"title": "SHEIN BAE Sexy Lace Sheer Camisole Top",
"main_image": "https://img.shein.com/images/product1.jpg",
"images": ["https://...", "https://..."],
"color": "Black",
"retail_price": {
"amount": 58.00,
"amount_with_symbol": "โ‚ช58.00",
"usd_amount": 15.50,
"usd_amount_with_symbol": "$15.50"
},
"sale_price": {
"amount": 29.00,
"amount_with_symbol": "โ‚ช29.00",
"usd_amount": 7.75,
"usd_amount_with_symbol": "$7.75"
},
"has_discount": true,
"discount_percentage": 50,
"sizes": [
{
"attr_value_name": "S",
"is_sold_out": false,
"size_chart": [
{"attr_name_value_key": "Bust", "attr_name_value_cm": "86 cm"}
]
}
],
"specs": [
{"name": "Material", "value": "Polyester"},
{"name": "Style", "value": "Sexy"},
{"name": "Pattern Type", "value": "Solid"}
],
"variants": [...],
"url": "https://il.shein.com/...",
"scrape_time_seconds": 12.5,
"scraped_at": "2024-12-15T14:30:00Z"
}

๐ŸŽฏ Perfect For

Use CaseDescription
Price MonitoringTrack SHEIN prices and discounts over time
Competitor AnalysisCompare products, pricing, and inventory
Market ResearchAnalyze fashion trends and product availability
DropshippingGet accurate product data for your listings
E-commerce IntegrationSync SHEIN products to your store
Data AnalyticsBuild comprehensive fashion industry datasets

โš™๏ธ Input Configuration

FieldTypeDescriptionDefault
urlsarrayList of SHEIN product URLs (max 5 per run)Required
useProxybooleanEnable premium proxy (recommended)true
maxRetriesnumberRetries per URL (1-10)3
maxCaptchaRetriesnumberMax captcha bypass attempts5
includeImagesbooleanInclude all image URLstrue
includeReviewsbooleanInclude ratingstrue
timeoutnumberPage timeout in seconds30
delayBetweenRequestsnumberDelay between requests (ms)1000

Example Input

{
"urls": [
"https://il.shein.com/SHEIN-BAE-Sexy-Lace-p-70811510.html",
"https://il.shein.com/Another-Product-p-12345678.html"
],
"useProxy": true,
"maxRetries": 3,
"timeout": 30
}

๐Ÿ“ˆ Performance

MetricValue
Average scrape time (no captcha)10-15s per product
Average scrape time (with captcha bypass)20-50s per product
Success rate>95%
Memory usage~1GB
Max URLs per run5 (recommended)

Performance Optimizations

  • โšก Fast Chrome startup - Disabled unnecessary Chrome features
  • โšก Eager page load - Don't wait for all resources
  • โšก Images disabled - Skip image loading for speed
  • โšก Quick polling - 0.5s data check interval
  • โšก Browser timeouts - 60s page load, 30s script timeout
  • โšก Auto recovery - Browser restarts on crash

๐ŸŒ Supported Regions

Currently optimized for:

  • Israel (il.shein.com) - Full support with premium proxy

More regions coming soon!

๐Ÿ’ก Tips for Best Results

  1. Use Proxy - Keep useProxy: true for reliable scraping
  2. Batch Size - Process 3-5 URLs per run for best stability
  3. Valid URLs - Ensure URLs are valid SHEIN product pages ending in .html
  4. Reasonable Delay - Use 1000ms+ delay between requests
  5. Monitor Logs - Watch for strategy indicators to understand performance

๐Ÿ“Š Output Statistics

Each run provides detailed statistics:

{
"total": 5,
"success": 5,
"failed": 0,
"captcha_bypasses": 2,
"duration_seconds": 65
}

๐Ÿ”ง Error Handling

ErrorAction
Captcha detectedAuto-bypass with 3 strategies
Page timeoutSkip URL, restart browser, continue
Browser crashAuto-restart, continue with next URL
Network errorRetry with new proxy

๐Ÿ“ Changelog

v1.2 (Latest)

  • โšก Optimized Chrome startup (~4-6s faster)
  • โšก Reduced captcha bypass times
  • ๐Ÿ›ก๏ธ Added browser crash recovery
  • ๐Ÿ›ก๏ธ Added page load timeouts (60s max)
  • ๐Ÿ“Š Strategy indicators in logs

v1.1

  • Added 3-strategy captcha bypass system
  • IP rotation with Smartproxy
  • Improved error handling

v1.0

  • Initial release
  • Full product data extraction
  • Premium proxy integration

๐Ÿ”’ Compliance

  • This scraper is designed for legitimate business purposes
  • Please respect SHEIN's terms of service
  • Use responsibly and don't overload their servers

๐Ÿค Support

Need help or have questions?

  • Open an issue on the actor page
  • Contact via Apify messaging

โญ If this scraper helps your business, please leave a review!