Puma Product Scraper avatar

Puma Product Scraper

Pricing

Pay per usage

Go to Apify Store
Puma Product Scraper

Puma Product Scraper

Extract Puma product data at scale. Scrape prices, descriptions, images, and reviews from Puma.com. Real-time monitoring, zero blocks. Perfect for price tracking, competitive analysis, and inventory management.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

Extract Puma product listing data from category pages such as men's shoes.
Collect pricing, ratings, color variants, size options, badges, and product links in a clean dataset ready for analysis and monitoring.

Features

  • Fast category extraction - Collect products directly from Puma category pages.
  • Rich product coverage - Includes pricing, ratings, color data, badges, and size groups.
  • Pagination support - Automatically continues through result pages until target count is reached.
  • Clean dataset output - Omits empty fields for compact, analysis-ready records.
  • Flexible controls - Set result limits, page size, and URL target.

Use Cases

Competitive Pricing Tracking

Monitor shoe pricing and promotional deltas across Puma categories over time. Build snapshots for daily, weekly, or seasonal comparisons.

Merchandising Analysis

Measure assortment depth by category, color coverage, and orderable size availability. Identify which product lines have broad or narrow variant coverage.

Product Intelligence

Track review counts, average ratings, and badge presence to understand which products are most visible and marketable.

Catalog Monitoring

Detect newly listed products, removed variants, and changes in product metadata by comparing recurring runs.


Input Parameters

ParameterTypeRequiredDefaultDescription
start_urlStringNohttps://us.puma.com/us/en/men/shoesPuma listing URL (category/search/tag/country).
results_wantedIntegerNo20Maximum number of products to save.
page_sizeIntegerNo24Number of products requested per page.
proxyConfigurationObjectNo{ \"useApifyProxy\": false }Optional Apify Proxy configuration.

Output Data

Each dataset item can include:

FieldTypeDescription
product_idStringProduct variant identifier.
variant_idStringVariant ID.
master_idStringMaster product ID.
nameStringProduct name.
brandStringBrand label.
color_nameStringColor name.
color_codeStringColor code.
price_regularNumberRegular price.
price_saleNumberSale price.
price_promotionNumberPromotion price when available.
price_bestNumberBest available price field.
ratingNumberAverage rating.
review_countNumberNumber of reviews.
product_urlStringProduct detail URL.
image_urlStringPrimary image URL.
all_image_urlsArrayAll available variant image URLs.
sizesArraySize group and size-level availability details.
badge_labelsArrayProduct badge labels.
variant_promotion_messagesArrayVariant-level promotion messages.
master_promotion_messagesArrayMaster-level promotion messages.
category_nameStringCategory title.
category_pathStringCategory path extracted from URL.
scraped_atStringISO timestamp of extraction.

Usage Examples

Basic Category Run

{
"start_url": "https://us.puma.com/us/en/men/shoes",
"results_wanted": 20
}

Larger Pull

{
"start_url": "https://us.puma.com/us/en/men/shoes",
"results_wanted": 120,
"page_size": 24
}

Search URL Run

{
"start_url": "https://us.puma.com/us/en/search?q=running",
"results_wanted": 60
}

Sample Output

{
"source": "puma",
"category_url": "https://us.puma.com/us/en/men/shoes",
"category_path": "/men/shoes",
"category_id": "mens-shoes",
"category_name": "Men's Shoes and Sneakers",
"product_id": "198556306179",
"variant_id": "198556306179",
"master_id": "308762",
"sku": "308762_07",
"product_url": "https://us.puma.com/us/en/pd/scuderia-ferrari-trinity-2-mens-sneakers/308762?swatch=07",
"name": "Scuderia Ferrari Trinity 2 Men's Sneakers",
"brand": "Ferrari",
"color_name": "PUMA Black-Speed Yellow",
"color_code": "07",
"orderable": true,
"price_regular": 103,
"price_sale": 103,
"rating": 0,
"review_count": 0,
"badge_labels": ["New"],
"image_url": "https://images.puma.com/image/upload/f_auto,q_auto,b_rgb:fafafa,w_2000,h_2000/global/308762/07/sv01/fnd/PNA/fmt/png/Scuderia-Ferrari-Trinity-2-Men's-Sneakers",
"scraped_at": "2026-04-01T10:55:22.102Z"
}

Tips for Best Results

Start with QA-Friendly Limits

  • Use results_wanted: 20 for quick checks.
  • Increase gradually for larger production snapshots.

Keep URL Category-Specific

  • Use direct category URLs instead of home pages.
  • Validate that the URL resolves to a listing page.

Handle Scale with Proxy

  • Enable proxy when running frequent or high-volume schedules.
  • Keep page size moderate for stable runs.

Integrations

Connect scraped data with:

  • Google Sheets - Track price and assortment changes over time.
  • Airtable - Build searchable merchandising datasets.
  • Make - Trigger downstream workflows after each run.
  • Zapier - Send updates to CRMs, dashboards, or alerts.
  • Webhooks - Push run results to your own services.

Export Formats

  • JSON - For APIs and automation pipelines.
  • CSV - For spreadsheet workflows.
  • Excel - For reporting and business review.
  • XML - For legacy integrations.

Frequently Asked Questions

How many products can I scrape?

You can request any count, but practical limits depend on page availability and run time constraints.

Can I scrape categories other than men's shoes?

Yes. Provide a different Puma category URL in start_url.

Why are some optional fields missing in output?

The actor excludes empty values by design, so records remain compact and clean.

Does the actor support pagination automatically?

Yes. It paginates until your target count is reached or no more products are available.

Can I run this on a schedule?

Yes. Use Apify schedules for recurring data snapshots.

What if the target page structure changes?

Run a small test and then adjust input/filter settings as needed.


Support

For issues or feature requests, use your Apify Console issue flow.

Resources


This actor is intended for lawful data collection and analysis. You are responsible for complying with website terms, local regulations, and internal data governance policies.