Under maintenance

Pricing

$9.00 / 1,000 results

Try for free

Go to Store

Macy's Scraper

Under maintenance

Try for free

Developed by

Gustavo Rudiger

Macy's web scraper to crawl product information including price and sale price, color, and images. Extract all data in a dataset in structured formats.

0.0 (0)

Pricing

$9.00 / 1,000 results

Total users

Monthly users

Runs succeeded

89%

Last modified

4 days ago

E-commerce

Back to issues Create new issue

Run ID: oqNnuH2MjE1MmkM1t

Closed

82society opened this issue

Hi Gustavo, https://www.macys.com/shop/womens-clothing/all-womens-clothing/Pageindex/246?id=188851

I indicated the link above for scraping page. All women's page 246. However, the log indicates as follows: https://www.macys.com/shop/mens-clothing/all-mens-clothing/Pageindex/1217?id=197651 https://www.macys.com/shop/mens-clothing/all-mens-clothing/Pageindex/1216?id=197651 https://www.macys.com/shop/mens-clothing/all-mens-clothing/Pageindex/1215?id=197651 https://www.macys.com/shop/mens-clothing/all-mens-clothing/Pageindex/1214?id=197651 https://www.macys.com/shop/womens-clothing/all-womens-clothing/Pageindex/247?id=188851

For some reason it went over to men's and to page 1217, 1216, 1215, 1214. Why is that?

82society

Also Run ID f2lwGO06ELesgyJrS and deuQAydeWf0rgKT0J are having the same issue. Some pages are scraped from mens.

82society

Run ID: sLTYaK3ytuzSbQa6n I set the item https://www.macys.com/shop/mens-clothing/all-mens-clothing/Pageindex/881?id=197651 Page 881. However, it's the from link that it started scraping was from https://www.macys.com/shop/mens-clothing/all-mens-clothing/Pageindex/1267?id=197651 I haven't check everything yet. But so far, every run I checked, it's got a weird pattern.

Gustavo Rudiger (trudax)

Those are failed urls from previous runs.

82society

"those are failed URLs from previous runs" Question: 1. If I set https://www.macys.com/shop/womens-clothing/all-womens-clothing/Pageindex/246?id=188851 (women's page 246), why is it attempting to scrape from men's page 1214?

Run ID: v9PI0NTpbxve7G6Nu

I ran this task from page 600 and it only obtain 16 results. Question: 2. do I get charged for running a task when there are failed urls due to collected previously? 3. I recall you mentioning that sometime the Run stops as Succeeded maybe because Macy's page may be blocking it from proceeding. Is there a way to fix or bypass that?

Gustavo Rudiger (trudax)

If you run once for all-mens-clothing, all failed url for this run will be stored to be retryed in the next run. Then the second run you run for all-womens-clothing but the previous failed URLs from all-mens-clothing are also added to the queue. I will take a closer look at this run, seems that some products are returning error and are not being scraped. I didn't understood your second question, Apify charges for the use of resources spent running the actor. Is not possible to bypass 100% all the anti-scraping set by any website, you need to retry the request with different sessions (which Apify does automatically) to eventually bypass it.

Gustavo Rudiger (trudax)

There was a product page with a collection that wasn't being scrapped since the layout was composed of multiple products. I have added this new layout to the actor and the product from this URLs will be also scrapped now.

Add comment

Nordstrom Scraper

trudax/actor-nordstrom-scraper

Nordstrom web scraper to crawl product information including price and sale price, color, and images. Extract all data in a dataset in multiple formats.

Gustavo Rudiger

214

Kabum Scraper

trudax/kabum

Gustavo Rudiger

Macys Product Search

pintostudio/macys-product-search

The Macy's Product Search Scraper is a reliable tool to extract product data directly from Macy's search results. Whether you're conducting market research, tracking pricing trends, or building a product database, this actor provides comprehensive product information in an easy-to-use format.

Pinto Studio

Reddit Scraper

trudax/reddit-scraper

Unlimited Reddit web scraper to crawl posts, comments, communities, and users without login. Limit web scraping by number of posts or items and extract all data in a dataset in multiple formats.

Gustavo Rudiger

7.1K

3.9

Macys Product Scraper

getdataforme/macys-product-scraper

Scrape product data from Macy's using the Macys Scraper Apify actor. Extract detailed information like product names, prices, images, descriptions, and SKUs from Macy's product pages. Automate your e-commerce data extraction with high accuracy and efficiency. Proxy support included.

GetDataForMe

Ulta Scraper

autofacts/ulta-scraper

Ulta web scraper to crawl product information including price and sale price, color, and images.

Autofactor

5.0

Farfetch Scraper

autofacts/farfetch

Farfetch web scraper to crawl product information including price and sale price, color, and images.

Autofactor

208

5.0

Yellow Pages US Scraper

trudax/yellow-pages-us-scraper

Scrape addresses, phone numbers, categories, and names from Yellow Pages US listings. Customizable Yellow Pages API to crawl and download all contact data.

Gustavo Rudiger

4.1K

Target Search Scraper

axlymxp/target-search-scraper

A web scraper that searches Target's website for products based on a keyword and store ID. Extracts detailed product information including name, price, description, images, availability and store details. Results are saved to a structured dataset.

axly

Reddit Scraper Lite

trudax/reddit-scraper-lite

Pay Per Result, unlimited Reddit web scraper to crawl posts, comments, communities, and users without login. Limit web scraping by number of posts or items and extract all data in a dataset in multiple formats.

Gustavo Rudiger

7.2K

3.9