SHEIN Scraper (All Page Types)

Pricing

from $20.00 / 1,000 results

Extracts product listings and product detail data from [SHEIN US](https://us.shein.com/). You can use it to collect products from the homepage, search result pages, supported SHEIN category pages, and direct product pages, then export the results as JSON, CSV, Excel, or HTML from the Dataset tab.

Developer

Hypebridge

Maintained by Community

What does SHEIN Product Listing Scraper do?

SHEIN Product Listing Scraper extracts listing-level product data from SHEIN. It accepts URLs from any SHEIN country subdomain and collects products from the homepage, search result pages, supported category pages, other product-listing pages, and direct product pages.

The actor focuses on fields proven reliable across live runs: product name, price, URL, images, category, ranking position, and source metadata.

Why scrape SHEIN?

  • Track products and pricing: Monitor current prices, discounts, and product availability across search or category pages.
  • Build product datasets: Export structured fashion catalog data for analysis, research, or internal merchandising workflows.
  • Compare categories and search demand: See what products appear for a keyword or within a supported SHEIN category page.
  • Normalize country-subdomain inputs: Pass URLs from any SHEIN country subdomain; the actor extracts them through its US catalog session, so prices are reported in USD.

What data can SHEIN Product Listing Scraper extract?

| Field | Type | Description |
| --- | --- | --- |
| sourceUrl | String | The page URL that produced the product |
| sourceType | String | Homepage, search, category selection, direct product, or generic listing source |
| goodsId | String | SHEIN product identifier |
| title | String | Product name |
| productUrl | String | Product page URL |
| imageUrl | String | Main product image |
| imageUrls | Array | Listing image URLs when available |
| price | Number | Current sale price |
| priceText | String | Raw price text when available |
| listPrice | Number | Original price when available |
| discountPrice | Number | Separately exposed discount value when available |
| discountPercent | Number | Discount percentage when available |
| categoryId | String | SHEIN category ID when available |
| searchKeyword | String | Search term used for search-result runs or direct product indirection |
| selectId | String | Category selection identifier for supported category pages |
| rank | Number | Product position within the current page response |
| extractionStatus | String | Set to partial when a direct product URL could not be resolved through search indirection |
| extractionErrorMessage | String | Reason a direct product URL only produced a URL-level fallback record |

The full field list is available in the Output tab.

Supported URLs

| Type | Example |
| --- | --- |
| Homepage | https://us.shein.com/ |
| Search | https://au.shein.com/pdsearch/shorts/?search_source=4&search_type=all&source=hotWord |
| Category page | https://de.shein.com/RecommendSelection/Men-Clothing-sc-017172963.html |
| Recommend category page | https://ca.shein.com/recommend/New-in-Blouses-sc-10050028999.html |
| Standard category page | https://us.shein.com/Women-Dresses-c-1727.html |
| Direct product page | https://ca.shein.com/SHEIN-EZwear-Ripped-Raw-Trim-Denim-Shorts-Y2k-p-11133529.html?mallCode=1 |

Known listing URL shapes use optimized routes. Direct product pages are resolved through the working search API when the product is discoverable there. Other SHEIN pages use a warmed browser fallback that extracts embedded or visible product cards when present. Product data and prices are extracted through the US catalog session, so currency is reported as USD.
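To make the routing description more concrete, here is a minimal Python sketch that sorts a URL into one of the page types above and rewrites its host to the US subdomain. The regular expressions, the label strings, and the function names `classify_shein_url` and `to_us_catalog` are assumptions inferred from the example URLs in the table; the actor's real routing rules are not documented here and may differ.

```python
import re
from urllib.parse import urlsplit, urlunsplit

# Hypothetical patterns inferred from the example URLs in the table above.
_PATTERNS = [
    ("search", re.compile(r"^/pdsearch/")),
    ("product", re.compile(r"-p-\d+\.html$")),
    ("category", re.compile(r"^/(RecommendSelection|recommend)/.+-sc-\d+\.html$")),
    ("category", re.compile(r"-c-\d+\.html$")),
]


def classify_shein_url(url: str) -> str:
    """Return a coarse page type for a SHEIN URL, or "generic" if no pattern matches."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    if not host.endswith(".shein.com"):
        raise ValueError(f"not a SHEIN country-subdomain URL: {url}")
    if parts.path in ("", "/"):
        return "homepage"
    for label, pattern in _PATTERNS:
        if pattern.search(parts.path):
            return label
    return "generic"


def to_us_catalog(url: str) -> str:
    """Swap the country subdomain for us.shein.com, keeping path, query, and fragment."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, "us.shein.com", parts.path, parts.query, parts.fragment))
```

A URL that matches none of the patterns falls through to "generic", which corresponds to the browser fallback described above.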

How to scrape SHEIN

  1. Open SHEIN Product Listing Scraper on Apify.
  2. Add one or more supported startUrls.
  3. Set maxItems to control the size of the run.
  4. Set maxPagesPerStartUrl if you want to limit pagination.
  5. Start the run and download results from the Dataset tab.
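The same steps can also be run from code. The sketch below assumes the third-party `apify-client` Python package and an `APIFY_TOKEN` environment variable. The actor ID is a placeholder to be copied from the Actor page, and `run_scraper` and `main` are illustrative names, not part of the actor.

```python
from typing import Any


def run_scraper(client: Any, actor_id: str, run_input: dict) -> list[dict]:
    """Start the actor, wait for it to finish, and return its dataset items.

    `client` is expected to behave like apify_client.ApifyClient.
    """
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("actor run did not return a run record")
    dataset = client.dataset(run["defaultDatasetId"])
    return list(dataset.iterate_items())


def main() -> None:
    import os

    from apify_client import ApifyClient  # third-party: pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    # Placeholder: replace with the real actor ID shown on the Actor page.
    actor_id = "<username>/<actor-name>"
    run_input = {
        "startUrls": [{"url": "https://us.shein.com/Women-Dresses-c-1727.html"}],
        "maxItems": 20,  # keep test runs small
        "maxPagesPerStartUrl": 1,
    }
    for item in run_scraper(client, actor_id, run_input):
        print(item.get("title"), item.get("price"))
```

Calling `main()` starts a real, billable run, so it is left for the reader to invoke explicitly.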

Input

See the Input tab for the full schema. The main fields are:

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| startUrls | Array | [] | Homepage, search, category, product, or product-listing URLs from SHEIN country subdomains. An empty array exits successfully with no rows |
| maxItems | Number | 500 | Maximum number of products to output. Use 0 for unlimited |
| maxPagesPerStartUrl | Number | 10 | Maximum number of pages to process per input URL |
| maxConcurrency | Number | 2 | Number of browser pages processed in parallel |
| debugMode | Boolean | false | Saves extra run artifacts for troubleshooting |

Example input

{
  "startUrls": [
    { "url": "https://au.shein.com/pdsearch/shorts/?search_source=4&search_type=all&source=hotWord" },
    { "url": "https://de.shein.com/RecommendSelection/Men-Clothing-sc-017172963.html" },
    { "url": "https://ca.shein.com/SHEIN-EZwear-Ripped-Raw-Trim-Denim-Shorts-Y2k-p-11133529.html?mallCode=1" }
  ],
  "maxItems": 100,
  "maxPagesPerStartUrl": 2
}
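If you generate inputs programmatically, a small helper can assemble the same structure. This is a sketch only: `build_run_input` is a hypothetical name, its defaults mirror the parameter table above, and the rejection of negative `maxItems` is an assumption rather than documented actor behavior.

```python
def build_run_input(
    start_urls,
    max_items=500,
    max_pages_per_start_url=10,
    max_concurrency=2,
    debug_mode=False,
):
    """Build an input object using the documented field names and defaults."""
    if max_items < 0:
        # Assumption: only 0 (unlimited) and positive values are meaningful.
        raise ValueError("max_items must be 0 (unlimited) or a positive number")
    return {
        "startUrls": [{"url": url} for url in start_urls],
        "maxItems": max_items,
        "maxPagesPerStartUrl": max_pages_per_start_url,
        "maxConcurrency": max_concurrency,
        "debugMode": debug_mode,
    }
```

Passing `json.dumps(build_run_input([...]))` output into the Input tab's JSON editor should be equivalent to filling in the form fields by hand.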

Output

Results are stored in the Dataset tab and can be downloaded as JSON, CSV, Excel, or HTML.

Each item can include:

  • Listing data: title, product URL, images, price, discount, category, ranking
  • Source metadata: source URL, source type, search keyword, category selection ID
  • Fallback metadata: partial status and error message for direct product URLs that cannot be resolved through search
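When post-processing a dataset, it is often useful to separate fully resolved rows from fallback rows. The sketch below relies only on the `extractionStatus` value `partial` from the field table; it assumes that resolved rows either omit the field or carry some other value, which is not confirmed here. `split_by_status` is an illustrative name.

```python
def split_by_status(items):
    """Return (complete, partial) lists based on the extractionStatus field."""
    complete, partial = [], []
    for item in items:
        if item.get("extractionStatus") == "partial":
            partial.append(item)
        else:
            # Assumption: anything not marked partial is a normal listing row.
            complete.append(item)
    return complete, partial
```

The partial list can then be retried later or reported using each row's `extractionErrorMessage`.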

Sample output

{
  "sourceUrl": "https://us.shein.com/pdsearch/shorts?page=1",
  "sourceType": "search",
  "goodsId": "11133529",
  "title": "SHEIN EZwear Ripped Raw Trim Denim Shorts Y2k",
  "productUrl": "https://us.shein.com/SHEIN-EZwear-Ripped-Raw-Trim-Denim-Shorts-Y2k-p-11133529.html?mallCode=1",
  "imageUrl": "https://img.ltwebstatic.com/images3_pi/2023/02/01/example_thumbnail_405x552.jpg",
  "price": 23.24,
  "listPrice": 36.52,
  "discountPercent": 36,
  "categoryId": "1935",
  "searchKeyword": "shorts",
  "rank": 1
}
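As a sanity check on the price fields, the discount percentage in the sample can be reproduced from `price` and `listPrice`. The formula and rounding below are an assumption about how the three fields relate; the actor may simply pass through the value SHEIN reports.

```python
def discount_percent(price: float, list_price: float) -> int:
    """Percentage saved relative to the list price, rounded to the nearest integer."""
    if list_price <= 0:
        raise ValueError("list_price must be positive")
    return round((list_price - price) / list_price * 100)
```

For the sample row, `discount_percent(23.24, 36.52)` gives 36, which matches the `discountPercent` value shown.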

Tips

  • Use smaller maxItems values when testing new inputs.
  • Prefer search and supported category URLs when you need consistent result volume.
  • Direct product URLs return listing-level data only when the product is discoverable through the working search API.

FAQ

Can this scrape product pages?

You can provide a direct product page as a start URL. The actor resolves it through SHEIN search and returns listing-level data when the matching goodsId appears in search results. If it cannot be resolved, the actor saves a partial URL-level row instead of failing the run.
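The resolution logic described here can be sketched roughly as follows. The `-p-<digits>.html` pattern is inferred from the example product URL, and `goods_id_from_url`, `resolve_product`, and the error message text are hypothetical; only the field names come from the field table above.

```python
import re
from urllib.parse import urlsplit

# Inferred from the example URL ending in "-p-11133529.html".
_GOODS_ID = re.compile(r"-p-(\d+)\.html$")


def goods_id_from_url(product_url: str):
    """Extract the numeric goodsId from a product URL path, or return None."""
    match = _GOODS_ID.search(urlsplit(product_url).path)
    return match.group(1) if match else None


def resolve_product(product_url: str, search_results: list) -> dict:
    """Return the matching search row, or a partial URL-level fallback row."""
    goods_id = goods_id_from_url(product_url)
    if goods_id is not None:
        for row in search_results:
            if str(row.get("goodsId")) == goods_id:
                return row
    return {
        "productUrl": product_url,
        "extractionStatus": "partial",
        # Hypothetical wording; the actor's real message is not documented here.
        "extractionErrorMessage": "goodsId not found in search results",
    }
```

Returning a fallback row instead of raising mirrors the stated behavior that an unresolved product URL does not fail the run.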

Does it support every SHEIN URL?

It accepts SHEIN country-subdomain URLs and uses optimized handling for homepage, search, RecommendSelection, and product pages. Other SHEIN pages use a generic product-card fallback; pages without embedded or visible products may return no rows instead of failing the run.

Can I export the results?

Yes. Results are stored in the Dataset tab and can be downloaded as JSON, CSV, Excel, or HTML.
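For scripted downloads, the Apify API serves dataset items in the same formats. The sketch below builds the items URL for a given dataset ID. `dataset_export_url` is an illustrative helper, the mapping of "Excel" to the `xlsx` format value is my reading of the Apify API, and authentication is left out; check the API tab for the exact URL of your run.

```python
from urllib.parse import urlencode

# Export formats named on this page, mapped to Apify API format values.
_FORMATS = {"json": "json", "csv": "csv", "excel": "xlsx", "html": "html"}


def dataset_export_url(dataset_id: str, fmt: str = "json") -> str:
    """Build the dataset items URL for one of the export formats listed above."""
    key = fmt.lower()
    if key not in _FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    query = urlencode({"format": _FORMATS[key]})
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"
```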

Support

  • Questions or feature requests: use the Actor page or your internal issue tracker
  • Programmatic access: use the API tab in Apify