SHEIN Scraper (All Page Types)
Pricing
from $20.00 / 1,000 results
Extracts product listings and product detail data from [SHEIN US](https://us.shein.com/). You can use it to collect products from the homepage, search result pages, supported SHEIN category pages, and direct product pages, then export the results as JSON, CSV, Excel, or HTML from the Dataset tab.
Developer: Hypebridge (maintained by Community)
What does SHEIN Product Listing Scraper do?
SHEIN Product Listing Scraper extracts listing-level product data from SHEIN. Use it to collect products from the homepage, search result pages, supported category pages, other product-listing pages, and direct product page URLs from SHEIN country subdomains.
The actor focuses on fields proven reliable across live runs: product name, price, URL, images, category, ranking position, and source metadata.
Why scrape SHEIN?
- Track products and pricing: Monitor current prices, discounts, and product availability across search or category pages.
- Build product datasets: Export structured fashion catalog data for analysis, research, or internal merchandising workflows.
- Compare categories and search demand: See what products appear for a keyword or within a supported SHEIN category page.
- Normalize country-subdomain inputs: Accept URLs from any SHEIN country subdomain. Extraction runs through the US catalog session, so prices are reported in USD.
What data can SHEIN Product Listing Scraper extract?
| Field | Type | Description |
|---|---|---|
| sourceUrl | String | The page URL that produced the product |
| sourceType | String | Homepage, search, category selection, direct product, or generic listing source |
| goodsId | String | SHEIN product identifier |
| title | String | Product name |
| productUrl | String | Product page URL |
| imageUrl | String | Main product image |
| imageUrls | Array | Listing image URLs when available |
| price | Number | Current sale price |
| priceText | String | Raw price text when available |
| listPrice | Number | Original price when available |
| discountPrice | Number | Separately exposed discount value when available |
| discountPercent | Number | Discount percentage when available |
| categoryId | String | SHEIN category ID when available |
| searchKeyword | String | Search term used for search-result runs or direct product indirection |
| selectId | String | Category selection identifier for supported category pages |
| rank | Number | Product position within the current page response |
| extractionStatus | String | `partial` when a direct product URL could not be resolved through search indirection |
| extractionErrorMessage | String | Reason a direct product URL only produced a URL-level fallback record |
The full field list is available in the Output tab.
Supported URLs
| Type | Example |
|---|---|
| Homepage | https://us.shein.com/ |
| Search | https://au.shein.com/pdsearch/shorts/?search_source=4&search_type=all&source=hotWord |
| Category page | https://de.shein.com/RecommendSelection/Men-Clothing-sc-017172963.html |
| Recommend category page | https://ca.shein.com/recommend/New-in-Blouses-sc-10050028999.html |
| Standard category page | https://us.shein.com/Women-Dresses-c-1727.html |
| Direct product page | https://ca.shein.com/SHEIN-EZwear-Ripped-Raw-Trim-Denim-Shorts-Y2k-p-11133529.html?mallCode=1 |
Known listing URL shapes use optimized routes. Direct product pages are resolved through the working search API when the product is discoverable there. Other SHEIN pages use a warmed browser fallback that extracts embedded or visible product cards when present. Product data and prices are extracted through the US catalog session, so currency is reported as USD.
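The URL shapes in the table above can be told apart by their path alone. The following is a minimal sketch of such a classifier, inferred only from the example URLs. It is not the actor's actual routing code, and the function name, the type labels, and the `id` and `keyword` keys are hypothetical.

```python
import re
from urllib.parse import unquote, urlparse

# Path patterns inferred from the example URLs above (assumptions).
PRODUCT_RE = re.compile(r"-p-(\d+)\.html$")
SELECTION_RE = re.compile(r"-sc-(\w+)\.html$")
CATEGORY_RE = re.compile(r"-c-(\d+)\.html$")


def classify_shein_url(url: str) -> dict:
    """Guess the page type of a SHEIN URL from the shape of its path."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    if not host.endswith(".shein.com"):
        raise ValueError(f"not a SHEIN country-subdomain URL: {url}")

    path = parsed.path
    if path in ("", "/"):
        return {"type": "homepage"}
    if path.startswith("/pdsearch/"):
        # The keyword is the path segment right after /pdsearch/.
        return {"type": "search", "keyword": unquote(path.split("/")[2])}
    if (m := PRODUCT_RE.search(path)):
        return {"type": "product", "id": m.group(1)}
    if (m := SELECTION_RE.search(path)):
        return {"type": "category_selection", "id": m.group(1)}
    if (m := CATEGORY_RE.search(path)):
        return {"type": "category", "id": m.group(1)}
    # Anything else would go to the generic product-card fallback.
    return {"type": "generic_listing"}
```

A pre-check like this can be useful for sorting a large URL list before a run, for example to separate direct product URLs, which may yield partial rows, from search and category URLs.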
How to scrape SHEIN
- Open SHEIN Product Listing Scraper on Apify.
- Add one or more supported `startUrls`.
- Set `maxItems` to control the size of the run.
- Set `maxPagesPerStartUrl` if you want to limit pagination.
- Start the run and download results from the Dataset tab.
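The same steps can also be run from code. Below is a minimal sketch that assumes the `apify-client` Python package is installed and an API token is available in an `APIFY_TOKEN` environment variable. The helper name `run_shein_scraper` is hypothetical, and the actor ID is left as a placeholder because this page does not state it.

```python
import os

# Input mirroring the steps above: one start URL and small limits for a test run.
RUN_INPUT = {
    "startUrls": [{"url": "https://us.shein.com/Women-Dresses-c-1727.html"}],
    "maxItems": 100,
    "maxPagesPerStartUrl": 2,
}


def run_shein_scraper(actor_id: str, run_input: dict) -> list[dict]:
    """Start the Actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so this module loads even without the client installed.
    from apify_client import ApifyClient

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage (replace the placeholder with the Actor ID shown on its Apify page):
# items = run_shein_scraper("<username>/<actor-name>", RUN_INPUT)
```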
Input
See the Input tab for the full schema. The main fields are:
| Parameter | Type | Default | Description |
|---|---|---|---|
| startUrls | Array | [] | Homepage, search, category, product, or product-listing URLs from SHEIN country subdomains. Empty input exits successfully with no rows |
| maxItems | Number | 500 | Maximum number of products to output. Use 0 for unlimited |
| maxPagesPerStartUrl | Number | 10 | Maximum number of pages to process per input URL |
| maxConcurrency | Number | 2 | Number of browser pages processed in parallel |
| debugMode | Boolean | false | Saves extra run artifacts for troubleshooting |
Example input
    {
      "startUrls": [
        { "url": "https://au.shein.com/pdsearch/shorts/?search_source=4&search_type=all&source=hotWord" },
        { "url": "https://de.shein.com/RecommendSelection/Men-Clothing-sc-017172963.html" },
        { "url": "https://ca.shein.com/SHEIN-EZwear-Ripped-Raw-Trim-Denim-Shorts-Y2k-p-11133529.html?mallCode=1" }
      ],
      "maxItems": 100,
      "maxPagesPerStartUrl": 2
    }
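When inputs are generated by a script, it can help to fill in the documented defaults and reject obviously wrong values before starting a run. The sketch below assumes the defaults listed in the parameter table. The helper name `build_input` and its validation rules are illustrative and not part of the actor.

```python
from urllib.parse import urlparse

# Defaults copied from the parameter table above.
DEFAULTS = {
    "maxItems": 500,
    "maxPagesPerStartUrl": 10,
    "maxConcurrency": 2,
    "debugMode": False,
}


def build_input(urls: list[str], **overrides) -> dict:
    """Build an input object from plain URL strings plus optional overrides."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    for url in urls:
        host = urlparse(url).hostname or ""
        if not host.endswith(".shein.com"):
            raise ValueError(f"not a SHEIN country-subdomain URL: {url}")

    run_input = {"startUrls": [{"url": u} for u in urls], **DEFAULTS, **overrides}
    # 0 means unlimited, so only negative values are rejected.
    if run_input["maxItems"] < 0:
        raise ValueError("maxItems must be 0 (unlimited) or a positive number")
    return run_input


example = build_input(
    ["https://de.shein.com/RecommendSelection/Men-Clothing-sc-017172963.html"],
    maxItems=100,
    maxPagesPerStartUrl=2,
)
```

Here `example` carries the two overridden limits and keeps the table defaults for `maxConcurrency` and `debugMode`.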
Output
Results are stored in the Dataset tab and can be downloaded as JSON, CSV, Excel, or HTML.
Each item can include:
- Listing data: title, product URL, images, price, discount, category, ranking
- Source metadata: source URL, source type, search keyword, category selection ID
- Fallback metadata: partial status and error message for direct product URLs that cannot be resolved through search
Sample output
    {
      "sourceUrl": "https://us.shein.com/pdsearch/shorts?page=1",
      "sourceType": "search",
      "goodsId": "11133529",
      "title": "SHEIN EZwear Ripped Raw Trim Denim Shorts Y2k",
      "productUrl": "https://us.shein.com/SHEIN-EZwear-Ripped-Raw-Trim-Denim-Shorts-Y2k-p-11133529.html?mallCode=1",
      "imageUrl": "https://img.ltwebstatic.com/images3_pi/2023/02/01/example_thumbnail_405x552.jpg",
      "price": 23.24,
      "listPrice": 36.52,
      "discountPercent": 36,
      "categoryId": "1935",
      "searchKeyword": "shorts",
      "rank": 1
    }
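The price fields in the sample are consistent with each other: the discount percentage matches the gap between `price` and `listPrice`. The sketch below recomputes it as a sanity check. The rounding rule is an assumption that happens to match this sample, not a documented behavior of the actor or of SHEIN.

```python
def discount_percent(price: float, list_price: float) -> int:
    """Percentage saved relative to the original price, rounded to a whole number."""
    if list_price <= 0:
        raise ValueError("list_price must be positive")
    return round((1 - price / list_price) * 100)


sample = {"price": 23.24, "listPrice": 36.52, "discountPercent": 36}

# (36.52 - 23.24) / 36.52 is about 0.3636, which rounds to 36.
recomputed = discount_percent(sample["price"], sample["listPrice"])
```

A check like this can be handy when `discountPercent` is missing from a row but both prices are present.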
Tips
- Use smaller `maxItems` values when testing new inputs.
- Prefer search and supported category URLs when you need consistent result volume.
- Direct product URLs return listing-level data only when the product is discoverable through the working search API.
FAQ
Can this scrape product pages?
You can provide a direct product page as a start URL. The actor resolves it through SHEIN search and returns listing-level data when the matching goodsId appears in search results. If it cannot be resolved, the actor saves a partial URL-level row instead of failing the run.
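Because unresolved product URLs produce partial rows rather than errors, downstream code usually needs to separate them from full records. This is a minimal sketch that relies only on the `extractionStatus` field described in the output table. The helper name and the sample rows, including the error message text, are made up for illustration.

```python
def split_by_status(items: list[dict]) -> tuple[list[dict], list[dict]]:
    """Separate fully extracted rows from URL-level fallback rows."""
    complete, partial = [], []
    for item in items:
        if item.get("extractionStatus") == "partial":
            partial.append(item)
        else:
            complete.append(item)
    return complete, partial


# Hypothetical rows: one resolved product and one fallback record.
rows = [
    {"goodsId": "11133529", "title": "SHEIN EZwear Ripped Raw Trim Denim Shorts Y2k", "price": 23.24},
    {"sourceUrl": "https://ca.shein.com/example-p-1.html", "extractionStatus": "partial",
     "extractionErrorMessage": "example message"},
]
complete, partial = split_by_status(rows)
```

The partial rows keep their source URL, so they can be retried later or reported separately.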
Does it support every SHEIN URL?
It accepts SHEIN country-subdomain URLs and uses optimized handling for homepage, search, RecommendSelection, and product pages. Other SHEIN pages use a generic product-card fallback; pages without embedded or visible products may return no rows instead of failing the run.
Can I export the results?
Yes. Results are stored in the Dataset tab and can be downloaded as JSON, CSV, Excel, or HTML.
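The same formats are also available over HTTP through the Apify API's dataset items endpoint, which takes a `format` query parameter. The sketch below only builds the download URL. The helper name and the dataset ID in the usage line are placeholders, and a private dataset would additionally need an API token.

```python
from urllib.parse import urlencode

# Maps the format names used on this page to Apify API format values.
FORMATS = {"JSON": "json", "CSV": "csv", "Excel": "xlsx", "HTML": "html"}


def dataset_export_url(dataset_id: str, fmt: str = "JSON") -> str:
    """Build the download URL for a dataset in one of the listed formats."""
    if fmt not in FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    query = urlencode({"format": FORMATS[fmt]})
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"


# "abc123" is a placeholder dataset ID.
excel_url = dataset_export_url("abc123", "Excel")
```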
Support
- Questions or feature requests: use the Actor page or your internal issue tracker
- Programmatic access: use the API tab in Apify


