Shein Product Scraper

Pricing

Pay per usage

Extract product data from Shein including prices, images, descriptions, sizes, and ratings. Perfect for market research, price comparison, inventory monitoring, and competitive analysis. Get structured data in JSON or CSV format with reliable parsing.

Rating

0.0

(0)

Developer

Shahid Irfan

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 5
  • Monthly active users: 3
  • Last modified: 6 days ago

Extract product data from Shein links in a structured format. Use category, selection, search, or tag-style Shein URLs as input and collect product titles, URLs, prices, images, and merchandising metadata for reporting and monitoring.


Features

  • Flexible URL Support — Accepts Shein category, selection, search, and tag-style URLs.
  • Structured Product Dataset — Collect normalized product records with pricing, category, and merchandising metadata.
  • Duplicate Protection — Automatically avoids duplicate records during a run.
  • Configurable Collection Size — Control how many products to save with a single input.
  • Null-Free Output — Automatically removes empty fields so your dataset stays clean.

Use Cases

Product Research

Track what products appear in major Shein category feeds and identify assortment trends over time.

Pricing Intelligence

Collect sale and original prices to monitor price movement and discount positioning.

Merchandising Analysis

Compare product visibility across categories and evaluate which items are frequently surfaced.

Catalog Monitoring

Build recurring snapshots of listing pages for internal reporting and audit workflows.


Input Parameters

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| startUrl | String | Yes | https://www.shein.com/RecommendSelection/Women-Clothing-sc-017172961.html | Shein URL (category, selection, search, or tag format). |
| results_wanted | Integer | No | 20 | Maximum number of products to save. |
| proxyConfiguration | Object | No | Apify Proxy Residential | Proxy settings for stable extraction. |
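The parameters above can be assembled and sanity-checked before a run is started. The sketch below is illustrative only: `build_run_input` is a hypothetical helper, and the host check (accepting `shein.com` and its subdomains) is an assumption, not a rule documented by the actor.

```python
from urllib.parse import urlparse


def build_run_input(start_url: str, results_wanted: int = 20,
                    use_residential_proxy: bool = True) -> dict:
    """Build an input object using the parameter names from the table above."""
    parsed = urlparse(start_url)
    host = (parsed.hostname or "").lower()
    # Assumption: only shein.com and its subdomains are meaningful inputs.
    if parsed.scheme not in ("http", "https") or not (
        host == "shein.com" or host.endswith(".shein.com")
    ):
        raise ValueError(f"Not a Shein URL: {start_url!r}")
    if not isinstance(results_wanted, int) or results_wanted < 1:
        raise ValueError("results_wanted must be a positive integer")

    run_input = {"startUrl": start_url, "results_wanted": results_wanted}
    if use_residential_proxy:
        run_input["proxyConfiguration"] = {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        }
    return run_input
```

Failing early on a malformed URL or a non-positive limit is cheaper than discovering the problem after a paid run has started.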

Output Data

Each dataset item can contain:

| Field | Type | Description |
|---|---|---|
| product_id | String | Unique product identifier. |
| sku | String | Product SKU or goods code. |
| title | String | Product title. |
| url | String | Product page URL. |
| image_url | String | Main product image URL. |
| currency | String | Currency code. |
| sale_price | Number | Current sale price. |
| original_price | Number | Original or list price. |
| discount_text | String | Discount label text. |
| brand | String | Brand name when available. |
| rating | Number | Product rating score. |
| reviews_count | Number | Number of reviews. |
| category_id | String | Category identifier when available. |
| goods_id | String | Source goods ID from listing feed. |
| goods_name | String | Source product name from listing feed. |
| goods_img | String | Source primary image URL. |
| detail_image | Array | Additional gallery image URLs. |
| salePrice | Object | Original nested sale-price object from source. |
| retailPrice | Object | Original nested retail-price object from source. |
| flashPrice | Object | Flash-sale pricing object when available. |
| homeBadge | Object | Badge data shown on listing cards. |
| homeBelt | Object | Belt/label metadata shown on product cards. |
| homeBorder | Object | Card border/decoration metadata. |
| priceColor | String | Source color hint for price rendering. |
| appTraceInfo | String | Source ranking and tracing metadata. |
| superDealsGoodsType | String | Source deals classification flag. |

Only non-empty values are stored, so fields with no data are omitted.
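To make the "non-empty only" rule concrete, here is one way such a cleanup could be written. This is a sketch under stated assumptions: the source does not show the actor's actual logic, and `strip_empty` and `_is_empty` are hypothetical names. It treats `None`, empty strings, empty lists, and empty objects as empty, and keeps `0` and `False`.

```python
def _is_empty(value) -> bool:
    """Return True for None, '', [] and {}; 0 and False are kept as real values."""
    return value is None or value == "" or value == [] or value == {}


def strip_empty(value):
    """Recursively remove empty values from dicts and lists."""
    if isinstance(value, dict):
        cleaned = {k: strip_empty(v) for k, v in value.items()}
        return {k: v for k, v in cleaned.items() if not _is_empty(v)}
    if isinstance(value, list):
        cleaned = [strip_empty(v) for v in value]
        return [v for v in cleaned if not _is_empty(v)]
    return value
```

On the consuming side, the practical consequence is that any field may be absent, so reading with `item.get("brand")` is safer than `item["brand"]`.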


Usage Examples

Basic Run

{
  "startUrl": "https://www.shein.com/RecommendSelection/Women-Clothing-sc-017172961.html",
  "results_wanted": 20
}

Larger Collection

{
  "startUrl": "https://www.shein.com/RecommendSelection/Women-Clothing-sc-017172961.html",
  "results_wanted": 150,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}

Proxy-Tuned Run

{
  "startUrl": "https://www.shein.com/RecommendSelection/Women-Clothing-sc-017172961.html",
  "results_wanted": 50,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
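The same inputs can be submitted programmatically. The sketch below assumes the third-party `apify-client` package for Python is installed and uses its `ApifyClient`, `actor(...).call(...)`, and `dataset(...).iterate_items()` calls. The actor ID is not stated on this page, so `actor_id` is left as a parameter you must fill in from Apify Console; `run_shein_scraper` and `EXAMPLE_RUN_INPUT` are illustrative names.

```python
from typing import List

# Same shape as the "Proxy-Tuned Run" example above.
EXAMPLE_RUN_INPUT = {
    "startUrl": "https://www.shein.com/RecommendSelection/Women-Clothing-sc-017172961.html",
    "results_wanted": 50,
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}


def run_shein_scraper(token: str, actor_id: str, run_input: dict) -> List[dict]:
    """Start a run, wait for it to finish, and return the dataset items.

    actor_id is a placeholder argument: copy the real value from Apify Console.
    """
    # Imported lazily so this module loads even without the package installed.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("The actor run did not return run details")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A call would look like `run_shein_scraper(my_token, my_actor_id, EXAMPLE_RUN_INPUT)`, where both `my_token` and `my_actor_id` come from your own Apify account.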

Sample Output

{
  "product_id": "381589670",
  "title": "Psalm 91 Shirt, Bible Verse-Shirt, Christian Shirt, Women's Religious Shirt",
  "url": "https://m.shein.com/us/-p-381589670.html",
  "image_url": "https://img.ltwebstatic.com/v4/j/spmp/2025/12/12/57/17655032678738734c78dbcef4a0fa386250e0798a_square_thumbnail_405x552.jpg",
  "sale_price": 2.93,
  "original_price": 39.99,
  "discount_text": "-93%",
  "category_id": "1738",
  "goods_id": "381589670",
  "goods_name": "Psalm 91 Shirt, Bible Verse-Shirt, Christian Shirt, Women's Religious Shirt",
  "goods_img": "https://img.ltwebstatic.com/v4/j/spmp/2025/12/12/57/17655032678738734c78dbcef4a0fa386250e0798a_square_thumbnail_405x552.jpg",
  "detail_image": [
    "https://img.ltwebstatic.com/v4/j/spmp/2025/12/12/4b/1765503269ca7f496d588e370aaaea4d2f6a5e7172_square_thumbnail_405x552.jpg"
  ],
  "salePrice": {
    "amount": "2.93",
    "amountWithSymbol": "$2.93"
  },
  "retailPrice": {
    "amount": "39.99",
    "amountWithSymbol": "$39.99"
  },
  "flashPrice": {
    "amount": "2.93",
    "amountWithSymbol": "$2.93"
  },
  "homeBadge": {
    "text": "-93%",
    "type": "discount"
  },
  "homeBelt": {
    "text": "Flash Sale",
    "type": "flashsale"
  },
  "priceColor": "#F93A00",
  "superDealsGoodsType": "0"
}
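For pricing analysis, the discount can be recomputed from `sale_price` and `original_price` and compared with `discount_text`. The sketch below uses the values from the sample item; `discount_percent` is a hypothetical helper, and rounding to the nearest whole percent is an assumption about how the label is derived, so other items may differ by a point.

```python
def discount_percent(sale_price: float, original_price: float) -> int:
    """Return the discount as a whole-number percentage of the original price."""
    if original_price <= 0:
        raise ValueError("original_price must be positive")
    return round((1 - sale_price / original_price) * 100)


sample = {"sale_price": 2.93, "original_price": 39.99, "discount_text": "-93%"}
computed = discount_percent(sample["sale_price"], sample["original_price"])
print(computed)  # 93, matching the "-93%" label in the sample
```

Items without an `original_price` (the field is omitted when empty) should be skipped before calling the helper.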

Tips For Best Results

Use Stable Listing URLs

  • Start with direct category or selection URLs.
  • Avoid temporary redirect links when possible.

Keep QA Runs Fast

  • Use results_wanted: 20 for quick validation runs.
  • Increase limits for production exports.

Improve Reliability

  • Use residential proxies for protected pages.
  • Keep results_wanted realistic for faster completion.

Integrations

  • Google Sheets — Build live product tracking sheets.
  • Airtable — Store and filter product snapshots.
  • Make — Trigger automated workflows from fresh runs.
  • Zapier — Route data to downstream tools.
  • Webhooks — Forward output to your own services.

Export Formats

  • JSON — Best for APIs and programmatic processing.
  • CSV — Easy spreadsheet analysis.
  • Excel — Business reporting workflows.
  • XML — Legacy system integrations.
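If you post-process the JSON export yourself, keep in mind that items can have different sets of keys because empty fields are omitted. A minimal sketch of flattening items into fixed CSV columns with the standard library follows; `items_to_csv` and the column selection are illustrative choices, not part of the actor.

```python
import csv
import io
from typing import List


def items_to_csv(items: List[dict], columns: List[str]) -> str:
    """Render items as CSV text with a fixed column order.

    Missing fields become empty cells; fields not listed in `columns`
    (for example nested objects such as salePrice) are ignored.
    """
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=columns, restval="", extrasaction="ignore"
    )
    writer.writeheader()
    for item in items:
        writer.writerow(item)
    return buffer.getvalue()
```

For example, `items_to_csv(items, ["product_id", "title", "sale_price", "original_price"])` yields one row per product even when some products lack `original_price`.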

Frequently Asked Questions

How many products can I collect?

You can collect up to your results_wanted value, or fewer if the listing page loads fewer products than that.

Why should I use proxies?

Proxies improve stability on protected pages and reduce request blocking.

Why are some fields missing from certain items?

Some products do not expose all fields. Empty values are removed from output by design.

Can I run this actor on a schedule?

Yes. You can schedule runs in Apify to produce recurring catalog snapshots.

Can I use different types of Shein URLs?

Yes. Category, selection, search, and tag-style Shein URLs can be used as startUrl.


Support

For issues or feature requests, use the actor issue channel in Apify Console.

This actor is provided for legitimate data collection workflows. You are responsible for ensuring your use complies with applicable laws, website terms, and data usage requirements.