Fashion Product Page Scraper
Pricing
from $0.00 / result
This Actor extracts detailed product information from valid fashion e-commerce URLs, including product name, description, price, image URLs, available sizes, and brand metadata. It verifies whether a link is a genuine product page and determines real-time stock availability. Read more at www.flashi.ai
Developer: Flash Intelligence
👗 Fashion Product Page Scraper
Built by Flash Intelligence
Extract detailed product data, real-time pricing, and stock availability from fashion e-commerce websites.
This Actor is a specialized scraping tool designed for the fashion and apparel industry. Unlike generic scrapers, it understands the unique structure of fashion product pages. It automatically detects product details, validates if a link is a genuine product page (filtering out blogs or collections), and returns structured JSON data ready for analysis.
✨ Key Features
- Real-Time Stock Check: Instantly verifies if an item is currently in stock.
- Size Intelligence: Distinguishes between Available Sizes and Out-of-Stock Sizes.
- Smart Validation: Automatically validates URLs. If you input a non-product page (like a homepage or blog), the scraper flags it immediately rather than returning incorrect data.
- Rich Metadata: Extracts deep details including Brand, Material, Color, Condition, Gender, and Category.
- Image Extraction: Retrieves high-quality product image URLs.
- Price Monitoring: Captures the current selling price.
🚀 Use Cases
- Competitor Price Monitoring: Track how competitors are pricing similar fashion items.
- Inventory Tracking: Monitor when specific sizes go out of stock or are replenished.
- Dropshipping & Retail Arbitrage: Find underpriced items and verify availability instantly.
- Trend Analysis: Aggregate data on materials, colors, and descriptions across different brands.
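As a rough illustration of the price-monitoring and inventory-tracking use cases, the sketch below compares two results for the same product taken at different times. It assumes both records follow the shape shown under Output Data (product_details.product_price and product_details.product_sizes_available); the function name diff_snapshots and the sample values are hypothetical and not part of the Actor.

```python
from typing import Any


def diff_snapshots(old: dict, new: dict) -> dict:
    """Compare two scrape results for the same product URL."""
    old_details: dict[str, Any] = old.get("product_details", {})
    new_details: dict[str, Any] = new.get("product_details", {})

    old_sizes = set(old_details.get("product_sizes_available", []))
    new_sizes = set(new_details.get("product_sizes_available", []))

    old_price = old_details.get("product_price")
    new_price = new_details.get("product_price")

    return {
        # Sizes purchasable now that were not purchasable before.
        "restocked_sizes": sorted(new_sizes - old_sizes),
        # Sizes that were purchasable before and no longer are.
        "sold_out_sizes": sorted(old_sizes - new_sizes),
        # Signed price difference, or None if either price is missing.
        "price_change": (
            round(new_price - old_price, 2)
            if old_price is not None and new_price is not None
            else None
        ),
    }


# Made-up sample records for illustration only.
yesterday = {"product_details": {"product_price": 29.99,
                                 "product_sizes_available": ["S", "M", "L"]}}
today = {"product_details": {"product_price": 24.99,
                             "product_sizes_available": ["M", "L", "XL"]}}

changes = diff_snapshots(yesterday, today)
print(changes)
```

Storing one result per URL per day and running a comparison like this over consecutive days is one simple way to build a restock or price-drop alert.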
🔌 Input Parameters
The Actor takes a simplified input object. You only need to provide the target url.
Example Input
{"url": "https://theminnesotan.com/products/bears-trucker"}
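When preparing many inputs programmatically, it can help to reject obviously malformed URLs before starting a run. The following is a minimal client-side sketch, assuming only that the input object has the single url field shown above; build_run_input is a hypothetical helper, and this check is separate from whatever validation the Actor performs itself.

```python
import json
from urllib.parse import urlparse


def build_run_input(url: str) -> dict:
    """Return the Actor input object for one product URL.

    Raises ValueError if the string is not an absolute http(s) URL.
    """
    cleaned = url.strip()
    parsed = urlparse(cleaned)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"Not an absolute http(s) URL: {url!r}")
    return {"url": cleaned}


run_input = build_run_input("https://theminnesotan.com/products/bears-trucker")
print(json.dumps(run_input))
```

The resulting dictionary can be pasted into the Actor's input form as JSON or passed as the run input through an API client.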
📦 Output Data
Results are stored in the default Apify Dataset and can be exported as JSON, CSV, Excel, or HTML.
1. Successful Scrape (Status: 200)
Valid product pages return detailed metadata and stock status.
{
  "input_url": "https://theminnesotan.com/products/bears-trucker",
  "scrape_status": "success",
  "product_in_stock": true,
  "is_valid_product_page": true,
  "product_details": {
    "product_title": "Bears Trucker Hat",
    "product_description": "Classic trucker hat featuring the Bears logo...",
    "product_price": 29.99,
    "product_sizes_available": ["S", "M", "L", "XL"],
    "product_sizes_out_of_stock": ["XXL"],
    "product_images": [
      "https://cdn.shopify.com/s/files/1/image_1.jpg",
      "https://cdn.shopify.com/s/files/1/image_2.jpg"
    ],
    "product_color": "Navy/Orange",
    "product_brand": "The Minnesotan",
    "product_material": "Cotton/Polyester Mesh",
    "product_sex": "Unisex",
    "product_category": "Accessories > Hats",
    "product_condition": "New"
  }
}
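For downstream analysis it is often convenient to flatten a record like the one above into a small typed object. The sketch below is one possible way to do that; the ProductSummary class and the summarize function are hypothetical names, and the code assumes only the field names visible in the example output (trimmed here to the fields it uses).

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ProductSummary:
    """Hypothetical flat view of one successful result."""
    title: str
    price: Optional[float]
    in_stock: bool
    sizes_available: list = field(default_factory=list)
    sizes_out_of_stock: list = field(default_factory=list)

    def size_status(self, size: str) -> str:
        """Classify a size using the two size lists from the result."""
        if size in self.sizes_available:
            return "available"
        if size in self.sizes_out_of_stock:
            return "out_of_stock"
        return "not_listed"


def summarize(record: dict) -> ProductSummary:
    """Build a ProductSummary, tolerating missing optional fields."""
    details = record.get("product_details", {})
    return ProductSummary(
        title=details.get("product_title", ""),
        price=details.get("product_price"),
        in_stock=bool(record.get("product_in_stock", False)),
        sizes_available=list(details.get("product_sizes_available", [])),
        sizes_out_of_stock=list(details.get("product_sizes_out_of_stock", [])),
    )


# Trimmed copy of the example output record.
record = {
    "scrape_status": "success",
    "product_in_stock": True,
    "product_details": {
        "product_title": "Bears Trucker Hat",
        "product_price": 29.99,
        "product_sizes_available": ["S", "M", "L", "XL"],
        "product_sizes_out_of_stock": ["XXL"],
    },
}

summary = summarize(record)
print(summary.title, summary.price, summary.size_status("XXL"))
```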
⚠️ Error States & Validation
This Actor is designed to be robust. It distinguishes between a Technical Failure (which stops the run) and a Validation Failure (which logs the error and continues).
1. Invalid Page (Soft Failure)
If the URL provided is reachable but is not a fashion product page (e.g., a blog post, a homepage, or a privacy policy), the Actor will not fail.
Instead, it returns a successful response with an invalid_page status. This allows you to process bulk URLs without stopping the entire run because of one bad link.
{
  "input_url": "https://theminnesotan.com/blogs/news",
  "scrape_status": "invalid_page",
  "error": "The provided URL does not appear to be a product page."
}
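Because invalid pages arrive as ordinary dataset items, a bulk consumer has to separate them from real products itself. The sketch below shows one way to do so, assuming the scrape_status values "success" and "invalid_page" shown in the examples; partition_results is a hypothetical helper, and any other status value is set aside rather than guessed at.

```python
def partition_results(items: list) -> tuple:
    """Split dataset items into (products, invalid_pages, other)."""
    products, invalid_pages, other = [], [], []
    for item in items:
        status = item.get("scrape_status")
        if status == "success":
            products.append(item)
        elif status == "invalid_page":
            invalid_pages.append(item)
        else:
            # Unknown or missing status: keep it for manual inspection.
            other.append(item)
    return products, invalid_pages, other


# Sample items modeled on the two example responses.
items = [
    {"input_url": "https://theminnesotan.com/products/bears-trucker",
     "scrape_status": "success"},
    {"input_url": "https://theminnesotan.com/blogs/news",
     "scrape_status": "invalid_page",
     "error": "The provided URL does not appear to be a product page."},
]

products, invalid_pages, other = partition_results(items)
print(len(products), len(invalid_pages), len(other))
```

Keeping the invalid items, instead of discarding them, makes it easy to report which input URLs were rejected and why.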
Included features
- Apify SDK for Python - a toolkit for building Apify Actors and scrapers in Python
- Input schema - define and easily validate a schema for your Actor's input
- Request queue - queues into which you can put the URLs you want to scrape
- Dataset - store structured data where each object stored has the same attributes
How it works
Insert your own code into the async with Actor: block. You can use the Apify SDK together with any other Python library.
Resources
- Python tutorials in Academy
- Video guide on getting data using Apify API
- Integration with Make, GitHub, Zapier, Google Drive, and other apps
- A short guide on how to build web scrapers using code templates
Getting started
For complete information see this article. In short, you will:
- Build the Actor
- Run the Actor
Pull the Actor for local development
If you would like to develop locally, you can pull the existing Actor from Apify console using Apify CLI:
- Install apify-cli

Using Homebrew
$ brew install apify-cli

Using NPM
$ npm -g install apify-cli

- Pull the Actor by its unique <ActorId>, which is one of the following:
  - unique name of the Actor to pull (e.g. "apify/hello-world")
  - or ID of the Actor to pull (e.g. "E2jjCZBezvAZnX8Rb")

You can find both by clicking on the Actor title at the top of the page, which will open a modal containing both the Actor unique name and the Actor ID.
This command will copy the Actor into the current directory on your local machine.
$ apify pull <ActorId>
Documentation reference
To learn more about Apify and Actors, take a look at the following resources: