Fashion Product Page Scraper avatar
Fashion Product Page Scraper
Under maintenance

Pricing

from $0.00 / result

Go to Apify Store
Fashion Product Page Scraper

Fashion Product Page Scraper

Under maintenance

This actor extracts detailed product information from valid fashion e-commerce URLs—including product name, description, price, image urls, available sizes, and brand metadata. It verifies if a link is a genuine product page and determines real-time stock availability. Read more at www.flashi.ai

Pricing

from $0.00 / result

Rating

0.0

(0)

Developer

Flash Intelligence

Flash Intelligence

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

1

Monthly active users

7 days ago

Last modified

Share

👗 Fashion Product Page Scraper

Built by Flash Intelligence Extract detailed product data, real-time pricing, and stock availability from fashion e-commerce websites.

This Actor is a specialized scraping tool designed for the fashion and apparel industry. Unlike generic scrapers, it understands the unique structure of fashion product pages. It automatically detects product details, validates if a link is a genuine product page (filtering out blogs or collections), and returns structured JSON data ready for analysis.

✨ Key Features

  • Real-Time Stock Check: Instantly verifies if an item is currently in stock.
  • Size Intelligence: Distinguishes between Available Sizes and Out-of-Stock Sizes.
  • Smart Validation: Automatically validates URLs. If you input a non-product page (like a homepage or blog), the scraper flags it immediately rather than returning incorrect data.
  • Rich Metadata: Extracts deep details including Brand, Material, Color, Condition, Gender, and Category.
  • Image Extraction: Retrieves high-quality product image URLs.
  • Price Monitoring: Captures the current selling price.

🚀 Use Cases

  • Competitor Price Monitoring: Track how competitors are pricing similar fashion items.
  • Inventory Tracking: Monitor when specific sizes go out of stock or are replenished.
  • Dropshipping & Retail Arbitrage: Find underpriced items and verify availability instantly.
  • Trend Analysis: Aggregate data on materials, colors, and descriptions across different brands.

🔌 Input Parameters

The Actor takes a simplified input object. You only need to provide the target url.

Example Input

{
"url": "[https://theminnesotan.com/products/bears-trucker](https://theminnesotan.com/products/bears-trucker)"
}

Start a new web scraping project quickly and easily in Python with our empty project template. It provides a basic structure for the Actor with Apify SDK and allows you to easily add your own functionality.

📦 Output Data

Results are stored in the default Apify Dataset (JSON, CSV, Excel, HTML).

1. Successful Scrape (Status: 200)

Valid product pages return detailed metadata and stock status.

{
"input_url": "[https://theminnesotan.com/products/bears-trucker](https://theminnesotan.com/products/bears-trucker)",
"scrape_status": "success",
"product_in_stock": true,
"is_valid_product_page": true,
"product_details": {
"product_title": "Bears Trucker Hat",
"product_description": "Classic trucker hat featuring the Bears logo...",
"product_price": 29.99,
"product_sizes_available": ["S", "M", "L", "XL"],
"product_sizes_out_of_stock": ["XXL"],
"product_images": [
"[https://cdn.shopify.com/s/files/1/image_1.jpg](https://cdn.shopify.com/s/files/1/image_1.jpg)",
"[https://cdn.shopify.com/s/files/1/image_2.jpg](https://cdn.shopify.com/s/files/1/image_2.jpg)"
],
"product_color": "Navy/Orange",
"product_brand": "The Minnesotan",
"product_material": "Cotton/Polyester Mesh",
"product_sex": "Unisex",
"product_category": "Accessories > Hats",
"product_condition": "New"
}
}

⚠️ Error States & Validation

This Actor is designed to be robust. It distinguishes between a Technical Failure (which stops the run) and a Validation Failure (which logs the error and continues).

1. Invalid Page (Soft Failure)

If the URL provided is reachable but is not a fashion product page (e.g., a blog post, a homepage, or a privacy policy), the Actor will not fail.

Instead, it returns a successful response with an invalid_page status. This allows you to process bulk URLs without stopping the entire run because of one bad link.

{
"input_url": "[https://theminnesotan.com/blogs/news](https://theminnesotan.com/blogs/news)",
"scrape_status": "invalid_page",
"error": "The provided URL does not appear to be a product page."
}

Included features

  • Apify SDK for Python - a toolkit for building Apify Actors and scrapers in Python
  • Input schema - define and easily validate a schema for your Actor's input
  • Request queue - queues into which you can put the URLs you want to scrape
  • Dataset - store structured data where each object stored has the same attributes

How it works

Insert your own code to async with Actor: block. You can use the Apify SDK with any other Python library.

Resources

Getting started

For complete information see this article. In short, you will:

  1. Build the Actor
  2. Run the Actor

Pull the Actor for local development

If you would like to develop locally, you can pull the existing Actor from Apify console using Apify CLI:

  1. Install apify-cli

    Using Homebrew

    $brew install apify-cli

    Using NPM

    $npm -g install apify-cli
  2. Pull the Actor by its unique <ActorId>, which is one of the following:

    • unique name of the Actor to pull (e.g. "apify/hello-world")
    • or ID of the Actor to pull (e.g. "E2jjCZBezvAZnX8Rb")

    You can find both by clicking on the Actor title at the top of the page, which will open a modal containing both Actor unique name and Actor ID.

    This command will copy the Actor into the current directory on your local machine.

    $apify pull <ActorId>

Documentation reference

To learn more about Apify and Actors, take a look at the following resources: