Shein Product Scraper
Under maintenance

Pricing

from $20.00 / 1,000 results

Rating: 0.0 (0)

Developer: ruly optimism

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 3
  • Monthly active users: 1
  • Last modified: 8 hours ago

🛍️ SHEIN Product Scraper

A scraper for extracting comprehensive product data from SHEIN, one of the world's largest fashion e-commerce platforms, with over 150 million active users.

✨ Why Choose This Scraper?

  • High Success Rate - 95%+ success rate with a 3-strategy captcha bypass system
  • Complete Product Data - Prices, sizes, colors, specs, images, and SKUs
  • Fast & Efficient - Typically 10-15s per product without a captcha, 20-50s when a captcha bypass is needed
  • Smart Recovery - Restarts the browser automatically after a crash and continues processing
  • Proxy Included - Premium residential proxy for the Israel domain (il.shein.com)
  • Reliable - 3 captcha bypass strategies plus automatic retries

🛡️ Anti-Blocking System

This scraper includes a sophisticated 3-strategy captcha bypass system:

| Strategy | Description | Speed |
| --- | --- | --- |
| Strategy 1 | Main domain redirect | ~2-3s |
| Strategy 2 | Wait & retry | ~4-5s |
| Strategy 3 | IP rotation (new proxy) | ~40-50s |

The system automatically tries each strategy in order, fastest first, until one succeeds.
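The ordered fallback described here can be pictured with a minimal sketch. This is not the actor's actual code: the function names and the strategy bodies below are hypothetical stand-ins, and only the "try in order, stop at the first success" logic is taken from the description.

```python
from typing import Callable, Optional, Sequence, Tuple

# A strategy is a (name, function) pair; the function returns True on success.
Strategy = Tuple[str, Callable[[str], bool]]


def bypass_captcha(url: str, strategies: Sequence[Strategy]) -> Optional[str]:
    """Try each strategy in order and return the name of the first that succeeds."""
    for name, attempt in strategies:
        try:
            if attempt(url):
                return name
        except Exception:
            # A strategy that raises is treated as failed; move on to the next one.
            continue
    return None  # every strategy failed


# Hypothetical stand-ins for the three strategies (cheapest first).
def main_domain_redirect(url: str) -> bool:
    return False  # pretend the fast path did not clear the captcha


def wait_and_retry(url: str) -> bool:
    return True  # pretend waiting and retrying worked


def rotate_ip(url: str) -> bool:
    return True  # slowest option, only reached if the others fail


STRATEGIES = [
    ("main_domain_redirect", main_domain_redirect),
    ("wait_and_retry", wait_and_retry),
    ("ip_rotation", rotate_ip),
]

if __name__ == "__main__":
    print(bypass_captcha("https://il.shein.com/example-p-1.html", STRATEGIES))
```

Ordering the strategies from cheapest to most expensive means the slow IP rotation is only paid for when the quicker options have already failed.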

📊 What Data You Get

Each product returns comprehensive, structured data:

{
  "sku": "sz2503138148213601",
  "product_id": "70811510",
  "title": "SHEIN BAE Sexy Lace Sheer Camisole Top",
  "main_image": "https://img.shein.com/images/product1.jpg",
  "images": ["https://...", "https://..."],
  "color": "Black",
  "retail_price": {
    "amount": 58.00,
    "amount_with_symbol": "₪58.00",
    "usd_amount": 15.50,
    "usd_amount_with_symbol": "$15.50"
  },
  "sale_price": {
    "amount": 29.00,
    "amount_with_symbol": "₪29.00",
    "usd_amount": 7.75,
    "usd_amount_with_symbol": "$7.75"
  },
  "has_discount": true,
  "discount_percentage": 50,
  "sizes": [
    {
      "attr_value_name": "S",
      "is_sold_out": false,
      "size_chart": [
        {"attr_name_value_key": "Bust", "attr_name_value_cm": "86 cm"}
      ]
    }
  ],
  "specs": [
    {"name": "Material", "value": "Polyester"},
    {"name": "Style", "value": "Sexy"},
    {"name": "Pattern Type", "value": "Solid"}
  ],
  "variants": [...],
  "url": "https://il.shein.com/...",
  "scrape_time_seconds": 12.5,
  "scraped_at": "2024-12-15T14:30:00Z"
}
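As a rough illustration of consuming a record like this one, the sketch below recomputes the discount from the two price objects and lists the sizes still in stock. The helper names are made up for this example, and rounding to the nearest integer is an assumption; the page does not say how `discount_percentage` is rounded.

```python
def discount_percentage(retail: float, sale: float) -> int:
    """Percentage saved, rounded to the nearest integer (rounding is an assumption)."""
    if retail <= 0:
        raise ValueError("retail price must be positive")
    return round((retail - sale) / retail * 100)


def available_sizes(product: dict) -> list:
    """Names of the sizes that are not marked as sold out."""
    return [s["attr_value_name"] for s in product.get("sizes", []) if not s["is_sold_out"]]


# Trimmed copy of the sample record: only the fields used here.
product = {
    "retail_price": {"amount": 58.00},
    "sale_price": {"amount": 29.00},
    "has_discount": True,
    "discount_percentage": 50,
    "sizes": [{"attr_value_name": "S", "is_sold_out": False}],
}

if __name__ == "__main__":
    computed = discount_percentage(
        product["retail_price"]["amount"], product["sale_price"]["amount"]
    )
    # (58 - 29) / 58 = 0.5, so the sample's reported 50% matches.
    print(computed == product["discount_percentage"])
    print(available_sizes(product))
```

A check like this can be useful in price monitoring to flag records whose reported discount disagrees with the prices.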

🎯 Perfect For

| Use Case | Description |
| --- | --- |
| Price Monitoring | Track SHEIN prices and discounts over time |
| Competitor Analysis | Compare products, pricing, and inventory |
| Market Research | Analyze fashion trends and product availability |
| Dropshipping | Get accurate product data for your listings |
| E-commerce Integration | Sync SHEIN products to your store |
| Data Analytics | Build comprehensive fashion industry datasets |

⚙️ Input Configuration

| Field | Type | Description | Default |
| --- | --- | --- | --- |
| urls | array | List of SHEIN product URLs (max 5 per run) | Required |
| useProxy | boolean | Enable premium proxy (recommended) | true |
| maxRetries | number | Retries per URL (1-10) | 3 |
| maxCaptchaRetries | number | Max captcha bypass attempts | 5 |
| includeImages | boolean | Include all image URLs | true |
| includeReviews | boolean | Include ratings | true |
| timeout | number | Page timeout in seconds | 30 |
| delayBetweenRequests | number | Delay between requests (ms) | 1000 |

Example Input

{
  "urls": [
    "https://il.shein.com/SHEIN-BAE-Sexy-Lace-p-70811510.html",
    "https://il.shein.com/Another-Product-p-12345678.html"
  ],
  "useProxy": true,
  "maxRetries": 3,
  "timeout": 30
}
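Before starting a run, it may help to check an input object against the documented limits. The sketch below is a hypothetical client-side check, not part of the actor: `validate_input` is an invented name, and the rules it applies (at most 5 URLs, the il.shein.com host, a path ending in .html, maxRetries between 1 and 10) are read off this page rather than taken from the actor's own schema.

```python
from typing import List
from urllib.parse import urlparse

MAX_URLS_PER_RUN = 5  # documented limit per run


def validate_input(run_input: dict) -> List[str]:
    """Return a list of problems found in the input; an empty list means it looks valid."""
    errors = []
    urls = run_input.get("urls") or []
    if not urls:
        errors.append("urls is required and must not be empty")
    if len(urls) > MAX_URLS_PER_RUN:
        errors.append(f"at most {MAX_URLS_PER_RUN} URLs per run, got {len(urls)}")
    for url in urls:
        parsed = urlparse(url)
        if parsed.hostname != "il.shein.com":
            errors.append(f"unsupported host in {url!r}")
        elif not parsed.path.endswith(".html"):
            errors.append(f"not a product page (.html) in {url!r}")
    retries = run_input.get("maxRetries", 3)
    if not 1 <= retries <= 10:
        errors.append("maxRetries must be between 1 and 10")
    return errors


EXAMPLE_INPUT = {
    "urls": [
        "https://il.shein.com/SHEIN-BAE-Sexy-Lace-p-70811510.html",
        "https://il.shein.com/Another-Product-p-12345678.html",
    ],
    "useProxy": True,
    "maxRetries": 3,
    "timeout": 30,
}

if __name__ == "__main__":
    print(validate_input(EXAMPLE_INPUT))
```

Returning a list of messages instead of raising on the first problem lets a caller report every issue in one pass.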

📈 Performance

| Metric | Value |
| --- | --- |
| Average scrape time (no captcha) | 10-15s per product |
| Average scrape time (with captcha bypass) | 20-50s per product |
| Success rate | >95% |
| Memory usage | ~1GB |
| Max URLs per run | 5 (recommended) |

Performance Optimizations

  • Fast Chrome startup - Disabled unnecessary Chrome features
  • Eager page load - Don't wait for all resources
  • Images disabled - Skip image loading for speed
  • Quick polling - 0.5s data check interval
  • Browser timeouts - 60s page load, 30s script timeout
  • Auto recovery - Browser restarts on crash

🌍 Supported Regions

Currently optimized for:

  • Israel (il.shein.com) - Full support with premium proxy

More regions coming soon!

💡 Tips for Best Results

  1. Use Proxy - Keep useProxy: true for reliable scraping
  2. Batch Size - Process 3-5 URLs per run for best stability
  3. Valid URLs - Ensure URLs are valid SHEIN product pages ending in .html
  4. Reasonable Delay - Use 1000ms+ delay between requests
  5. Monitor Logs - Watch for strategy indicators to understand performance
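For longer URL lists, the batch-size tip can be applied by splitting the list into per-run inputs. The following is a minimal sketch under that assumption; `batches` and `build_run_inputs` are illustrative names, and how each input is submitted to the actor is left out.

```python
from typing import Iterator, List, Sequence


def batches(urls: Sequence[str], size: int = 5) -> Iterator[List[str]]:
    """Yield consecutive chunks of at most `size` URLs."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(urls), size):
        yield list(urls[start:start + size])


def build_run_inputs(urls: Sequence[str], size: int = 5) -> List[dict]:
    """One input object per run, following the proxy and delay tips."""
    return [
        {"urls": chunk, "useProxy": True, "delayBetweenRequests": 1000}
        for chunk in batches(urls, size)
    ]


if __name__ == "__main__":
    # Placeholder URLs for illustration only.
    all_urls = [f"https://il.shein.com/item-p-{n}.html" for n in range(12)]
    for run_input in build_run_inputs(all_urls):
        print(len(run_input["urls"]), "URLs in this run")
```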

📊 Output Statistics

Each run provides detailed statistics:

{
  "total": 5,
  "success": 5,
  "failed": 0,
  "captcha_bypasses": 2,
  "duration_seconds": 65
}
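Two derived figures are easy to compute from these statistics: the success rate and the average time per URL. The sketch below assumes the field names shown in the sample; `summarize` is a hypothetical helper, not something the actor provides.

```python
def summarize(stats: dict) -> dict:
    """Derive the success rate and average seconds per URL from run statistics."""
    total = stats["total"]
    if total == 0:
        # Avoid dividing by zero for an empty run.
        return {"success_rate": 0.0, "avg_seconds_per_url": 0.0}
    return {
        "success_rate": stats["success"] / total,
        "avg_seconds_per_url": stats["duration_seconds"] / total,
    }


SAMPLE_STATS = {
    "total": 5,
    "success": 5,
    "failed": 0,
    "captcha_bypasses": 2,
    "duration_seconds": 65,
}

if __name__ == "__main__":
    # For the sample run: 5 / 5 = 1.0 success rate, 65 / 5 = 13.0 seconds per URL.
    print(summarize(SAMPLE_STATS))
```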

🔧 Error Handling

| Error | Action |
| --- | --- |
| Captcha detected | Auto-bypass with 3 strategies |
| Page timeout | Skip URL, restart browser, continue |
| Browser crash | Auto-restart, continue with next URL |
| Network error | Retry with new proxy |
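The difference between "retry" and "skip" in these rules can be sketched as a small loop. This is an illustration only: `scrape_with_retries` and the `fetch` callable are hypothetical, Python's built-in `ConnectionError` and `TimeoutError` stand in for the actor's network and timeout errors, and treating `max_retries` as the total number of attempts is an assumption.

```python
from typing import Callable, Optional


def scrape_with_retries(
    url: str, fetch: Callable[[str], dict], max_retries: int = 3
) -> Optional[dict]:
    """Retry on network errors, skip the URL on a page timeout."""
    for _attempt in range(max_retries):
        try:
            return fetch(url)
        except ConnectionError:
            # Network error: try again (the real actor would switch to a new proxy here).
            continue
        except TimeoutError:
            # Page timeout: give up on this URL so the run can continue with the next one.
            return None
    return None  # all attempts failed with network errors


if __name__ == "__main__":
    attempts = []

    def flaky_fetch(url: str) -> dict:
        """Fails twice with a network error, then succeeds."""
        attempts.append(url)
        if len(attempts) < 3:
            raise ConnectionError("simulated network error")
        return {"url": url}

    print(scrape_with_retries("https://il.shein.com/example-p-1.html", flaky_fetch))
```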

📝 Changelog

v1.2 (Latest)

  • ⚡ Optimized Chrome startup (~4-6s faster)
  • ⚡ Reduced captcha bypass times
  • 🛡️ Added browser crash recovery
  • 🛡️ Added page load timeouts (60s max)
  • 📊 Strategy indicators in logs

v1.1

  • Added 3-strategy captcha bypass system
  • IP rotation with Smartproxy
  • Improved error handling

v1.0

  • Initial release
  • Full product data extraction
  • Premium proxy integration

🔒 Compliance

  • This scraper is designed for legitimate business purposes
  • Please respect SHEIN's terms of service
  • Use responsibly and don't overload their servers

🤝 Support

Need help or have questions?

  • Open an issue on the actor page
  • Contact via Apify messaging

If this scraper helps your business, please leave a review!