Net-A-Porter Scraper avatar
Net-A-Porter Scraper
Under maintenance

Pricing

$45.00/month + usage

Go to Store
Net-A-Porter Scraper

Net-A-Porter Scraper

Under maintenance
runtime/net-a-porter-scraper

Developed by

Runtime

Maintained by Community

Net-A-Porter Scraper is designed to extract product data such as prices, designers, colors, and codes from Net-A-Porter. It efficiently handles pagination and proxies.

0.0 (0)

Pricing

$45.00/month + usage

1

Monthly users

1

Runs succeeded

>99%

Last modified

4 days ago

This Apify Actor scrapes product data from Net-A-Porter.
It navigates through listing pages (with pagination) and then visits each product’s detail page to extract key information such as:

  • Designer
  • Product Name
  • Price
  • Color
  • Product Code

Features

  • Scraping of both listing pages and product detail pages.
  • Automatic pagination based on the pagination_max parameter.
  • Limitation on the total number of scraped products via the products_max parameter.
  • Use of proxies (Apify) to reduce the risk of blocking.

Input Parameters

ParameterTypeDescriptionDefault Value
startUrlsarrayStart URLs for scraping.["https://www.net-a-porter.com/fr-fr/shop/chaussures"]
pagination_maxintegerMaximum number of pages to scrape.5
products_maxintegerMaximum number of products to scrape.50
proxyConfigurationobjectProxy configuration for scraping (using Apify proxies is recommended).{ "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] }

Output

The Actor generates a JSON dataset containing, for each product, an object similar to:

{ "url": "https://www.net-a-porter.com/fr-fr/shop/product/gianvito-rossi/...", "designer": "GIANVITO ROSSI", "productName": "Flat shoes with pointed toe in suede", "price": "690", "color": "Black", "productCode": "1647597357846081" }

This project is intended for educational and research purposes only. Please ensure you comply with the Net-A-Porter website’s terms of use and applicable scraping laws.


Additional Explanations

  • Pagination and Limitation:
    The routes.js file checks on each listing page if the page number (defined via the pageNumber parameter in the URL) is less than pagination_max and adds the next page. Additionally, before adding a product detail page for processing, it verifies that the total number of scraped products does not exceed products_max.

  • Page Differentiation:
    The logic in handleRequest determines whether the URL corresponds to a listing page or a detail page. On listing pages, product detail links are extracted using the selector .ProductList0__productItemContainer a.

  • Proxy Usage:
    The proxy configuration is managed via the proxyConfiguration parameter in the input schema and is applied when launching the crawler.

You can customize and extend this template according to your specific needs or changes in the target site's HTML structure.

Pricing

Pricing model

Rental 

To use this Actor, you have to pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period. You also pay for the Apify platform usage.

Free trial

Price

$45.00