Shein Product Scraper
Pricing
Pay per usage
Shein Product Scraper
Extract product data from Shein including prices, images, descriptions, sizes, and ratings. Perfect for market research, price comparison, inventory monitoring, and competitive analysis. Get structured data in JSON or CSV format with reliable parsing.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
Shahid Irfan
Actor stats
0
Bookmarked
5
Total users
3
Monthly active users
6 days ago
Last modified
Categories
Share
Extract product data from Shein links in a structured format. Use category, selection, search, or tag-style Shein URLs as input and collect product titles, URLs, prices, images, and merchandising metadata for reporting and monitoring.
Features
- Flexible URL Support — Accepts Shein category, selection, search, and tag-style URLs.
- Structured Product Dataset — Collect normalized product records with pricing, category, and merchandising metadata.
- Duplicate Protection — Automatically avoids duplicate records during a run.
- Configurable Collection Size — Control how many products to save with a single input.
- Null-Free Output — Automatically removes empty fields so your dataset stays clean.
Use Cases
Product Research
Track what products appear in major Shein category feeds and identify assortment trends over time.
Pricing Intelligence
Collect sale and original prices to monitor price movement and discount positioning.
Merchandising Analysis
Compare product visibility across categories and evaluate which items are frequently surfaced.
Catalog Monitoring
Build recurring snapshots of listing pages for internal reporting and audit workflows.
Input Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
startUrl | String | Yes | https://www.shein.com/RecommendSelection/Women-Clothing-sc-017172961.html | Shein URL (category, selection, search, or tag format). |
results_wanted | Integer | No | 20 | Maximum number of products to save. |
proxyConfiguration | Object | No | Apify Proxy Residential | Proxy settings for stable extraction. |
Output Data
Each dataset item can contain:
| Field | Type | Description |
|---|---|---|
product_id | String | Unique product identifier. |
sku | String | Product SKU or goods code. |
title | String | Product title. |
url | String | Product page URL. |
image_url | String | Main product image URL. |
currency | String | Currency code. |
sale_price | Number | Current sale price. |
original_price | Number | Original or list price. |
discount_text | String | Discount label text. |
brand | String | Brand name when available. |
rating | Number | Product rating score. |
reviews_count | Number | Number of reviews. |
category_id | String | Category identifier when available. |
goods_id | String | Source goods ID from listing feed. |
goods_name | String | Source product name from listing feed. |
goods_img | String | Source primary image URL. |
detail_image | Array | Additional gallery image URLs. |
salePrice | Object | Original nested sale-price object from source. |
retailPrice | Object | Original nested retail-price object from source. |
flashPrice | Object | Flash-sale pricing object when available. |
homeBadge | Object | Badge data shown on listing cards. |
homeBelt | Object | Belt/label metadata shown on product cards. |
homeBorder | Object | Card border/decoration metadata. |
priceColor | String | Source color hint for price rendering. |
appTraceInfo | String | Source ranking and tracing metadata. |
superDealsGoodsType | String | Source deals classification flag. |
Only non-empty values are stored, so fields with no data are omitted.
Usage Examples
Basic Run
{"startUrl": "https://www.shein.com/RecommendSelection/Women-Clothing-sc-017172961.html","results_wanted": 20}
Larger Collection
{"startUrl": "https://www.shein.com/RecommendSelection/Women-Clothing-sc-017172961.html","results_wanted": 150,"proxyConfiguration": {"useApifyProxy": true,"apifyProxyGroups": ["RESIDENTIAL"]}}
Proxy-Tuned Run
{"startUrl": "https://www.shein.com/RecommendSelection/Women-Clothing-sc-017172961.html","results_wanted": 50,"proxyConfiguration": {"useApifyProxy": true,"apifyProxyGroups": ["RESIDENTIAL"]}}
Sample Output
{"product_id": "381589670","title": "Psalm 91 Shirt, Bible Verse-Shirt, Christian Shirt, Women's Religious Shirt","url": "https://m.shein.com/us/-p-381589670.html","image_url": "https://img.ltwebstatic.com/v4/j/spmp/2025/12/12/57/17655032678738734c78dbcef4a0fa386250e0798a_square_thumbnail_405x552.jpg","sale_price": 2.93,"original_price": 39.99,"discount_text": "-93%","category_id": "1738","goods_id": "381589670","goods_name": "Psalm 91 Shirt, Bible Verse-Shirt, Christian Shirt, Women's Religious Shirt","goods_img": "https://img.ltwebstatic.com/v4/j/spmp/2025/12/12/57/17655032678738734c78dbcef4a0fa386250e0798a_square_thumbnail_405x552.jpg","detail_image": ["https://img.ltwebstatic.com/v4/j/spmp/2025/12/12/4b/1765503269ca7f496d588e370aaaea4d2f6a5e7172_square_thumbnail_405x552.jpg"],"salePrice": {"amount": "2.93","amountWithSymbol": "$2.93"},"retailPrice": {"amount": "39.99","amountWithSymbol": "$39.99"},"flashPrice": {"amount": "2.93","amountWithSymbol": "$2.93"},"homeBadge": {"text": "-93%","type": "discount"},"homeBelt": {"text": "Flash Sale","type": "flashsale"},"priceColor": "#F93A00","superDealsGoodsType": "0"}
Tips For Best Results
Use Stable Listing URLs
- Start with direct category or selection URLs.
- Avoid temporary redirect links when possible.
Keep QA Runs Fast
- Use
results_wanted: 20for quick validation runs. - Increase limits for production exports.
Improve Reliability
- Use residential proxies for protected pages.
- Keep
results_wantedrealistic for faster completion.
Integrations
- Google Sheets — Build live product tracking sheets.
- Airtable — Store and filter product snapshots.
- Make — Trigger automated workflows from fresh runs.
- Zapier — Route data to downstream tools.
- Webhooks — Forward output to your own services.
Export Formats
- JSON — Best for APIs and programmatic processing.
- CSV — Easy spreadsheet analysis.
- Excel — Business reporting workflows.
- XML — Legacy system integrations.
Frequently Asked Questions
How many products can I collect?
You can collect as many as the page can load, up to your results_wanted value.
Why should I use proxies?
Proxies improve stability on protected pages and reduce request blocking.
Why are some fields missing from certain items?
Some products do not expose all fields. Empty values are removed from output by design.
Can I run this actor on a schedule?
Yes. You can schedule runs in Apify to produce recurring catalog snapshots.
Does this actor support other Shein links?
Yes. Category, selection, search, and tag-style Shein URLs can be used as startUrl.
Support
For issues or feature requests, use the actor issue channel in Apify Console.
Resources
Legal Notice
This actor is provided for legitimate data collection workflows. You are responsible for ensuring your use complies with applicable laws, website terms, and data usage requirements.