Puma Product Scraper
Pricing
Pay per usage
Puma Product Scraper
Extract Puma product data at scale. Scrape prices, descriptions, images, and reviews from Puma.com. Real-time monitoring, zero blocks. Perfect for price tracking, competitive analysis, and inventory management.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
Shahid Irfan
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
4 days ago
Last modified
Categories
Share
Extract Puma product listing data from category pages such as men's shoes.
Collect pricing, ratings, color variants, size options, badges, and product links in a clean dataset ready for analysis and monitoring.
Features
- Fast category extraction - Collect products directly from Puma category pages.
- Rich product coverage - Includes pricing, ratings, color data, badges, and size groups.
- Pagination support - Automatically continues through result pages until target count is reached.
- Clean dataset output - Omits empty fields for compact, analysis-ready records.
- Flexible controls - Set result limits, page size, and URL target.
Use Cases
Competitive Pricing Tracking
Monitor shoe pricing and promotional deltas across Puma categories over time. Build snapshots for daily, weekly, or seasonal comparisons.
Merchandising Analysis
Measure assortment depth by category, color coverage, and orderable size availability. Identify which product lines have broad or narrow variant coverage.
Product Intelligence
Track review counts, average ratings, and badge presence to understand which products are most visible and marketable.
Catalog Monitoring
Detect newly listed products, removed variants, and changes in product metadata by comparing recurring runs.
Input Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
start_url | String | No | https://us.puma.com/us/en/men/shoes | Puma listing URL (category/search/tag/country). |
results_wanted | Integer | No | 20 | Maximum number of products to save. |
page_size | Integer | No | 24 | Number of products requested per page. |
proxyConfiguration | Object | No | { \"useApifyProxy\": false } | Optional Apify Proxy configuration. |
Output Data
Each dataset item can include:
| Field | Type | Description |
|---|---|---|
product_id | String | Product variant identifier. |
variant_id | String | Variant ID. |
master_id | String | Master product ID. |
name | String | Product name. |
brand | String | Brand label. |
color_name | String | Color name. |
color_code | String | Color code. |
price_regular | Number | Regular price. |
price_sale | Number | Sale price. |
price_promotion | Number | Promotion price when available. |
price_best | Number | Best available price field. |
rating | Number | Average rating. |
review_count | Number | Number of reviews. |
product_url | String | Product detail URL. |
image_url | String | Primary image URL. |
all_image_urls | Array | All available variant image URLs. |
sizes | Array | Size group and size-level availability details. |
badge_labels | Array | Product badge labels. |
variant_promotion_messages | Array | Variant-level promotion messages. |
master_promotion_messages | Array | Master-level promotion messages. |
category_name | String | Category title. |
category_path | String | Category path extracted from URL. |
scraped_at | String | ISO timestamp of extraction. |
Usage Examples
Basic Category Run
{"start_url": "https://us.puma.com/us/en/men/shoes","results_wanted": 20}
Larger Pull
{"start_url": "https://us.puma.com/us/en/men/shoes","results_wanted": 120,"page_size": 24}
Search URL Run
{"start_url": "https://us.puma.com/us/en/search?q=running","results_wanted": 60}
Sample Output
{"source": "puma","category_url": "https://us.puma.com/us/en/men/shoes","category_path": "/men/shoes","category_id": "mens-shoes","category_name": "Men's Shoes and Sneakers","product_id": "198556306179","variant_id": "198556306179","master_id": "308762","sku": "308762_07","product_url": "https://us.puma.com/us/en/pd/scuderia-ferrari-trinity-2-mens-sneakers/308762?swatch=07","name": "Scuderia Ferrari Trinity 2 Men's Sneakers","brand": "Ferrari","color_name": "PUMA Black-Speed Yellow","color_code": "07","orderable": true,"price_regular": 103,"price_sale": 103,"rating": 0,"review_count": 0,"badge_labels": ["New"],"image_url": "https://images.puma.com/image/upload/f_auto,q_auto,b_rgb:fafafa,w_2000,h_2000/global/308762/07/sv01/fnd/PNA/fmt/png/Scuderia-Ferrari-Trinity-2-Men's-Sneakers","scraped_at": "2026-04-01T10:55:22.102Z"}
Tips for Best Results
Start with QA-Friendly Limits
- Use
results_wanted: 20for quick checks. - Increase gradually for larger production snapshots.
Keep URL Category-Specific
- Use direct category URLs instead of home pages.
- Validate that the URL resolves to a listing page.
Handle Scale with Proxy
- Enable proxy when running frequent or high-volume schedules.
- Keep page size moderate for stable runs.
Integrations
Connect scraped data with:
- Google Sheets - Track price and assortment changes over time.
- Airtable - Build searchable merchandising datasets.
- Make - Trigger downstream workflows after each run.
- Zapier - Send updates to CRMs, dashboards, or alerts.
- Webhooks - Push run results to your own services.
Export Formats
- JSON - For APIs and automation pipelines.
- CSV - For spreadsheet workflows.
- Excel - For reporting and business review.
- XML - For legacy integrations.
Frequently Asked Questions
How many products can I scrape?
You can request any count, but practical limits depend on page availability and run time constraints.
Can I scrape categories other than men's shoes?
Yes. Provide a different Puma category URL in start_url.
Why are some optional fields missing in output?
The actor excludes empty values by design, so records remain compact and clean.
Does the actor support pagination automatically?
Yes. It paginates until your target count is reached or no more products are available.
Can I run this on a schedule?
Yes. Use Apify schedules for recurring data snapshots.
What if the target page structure changes?
Run a small test and then adjust input/filter settings as needed.
Support
For issues or feature requests, use your Apify Console issue flow.
Resources
Legal Notice
This actor is intended for lawful data collection and analysis. You are responsible for complying with website terms, local regulations, and internal data governance policies.