Net-A-Porter Scraper
Pricing
$45.00/month + usage
Net-A-Porter Scraper
Net-A-Porter Scraper is designed to extract product data such as prices, designers, colors, and codes from Net-A-Porter. It efficiently handles pagination and proxies.
0.0 (0)
Pricing
$45.00/month + usage
1
Monthly users
1
Runs succeeded
>99%
Last modified
4 days ago
This Apify Actor scrapes product data from Net-A-Porter.
It navigates through listing pages (with pagination) and then visits each product’s detail page to extract key information such as:
- Designer
- Product Name
- Price
- Color
- Product Code
Features
- Scraping of both listing pages and product detail pages.
- Automatic pagination based on the
pagination_max
parameter. - Limitation on the total number of scraped products via the
products_max
parameter. - Use of proxies (Apify) to reduce the risk of blocking.
Input Parameters
Parameter | Type | Description | Default Value |
---|---|---|---|
startUrls | array | Start URLs for scraping. | ["https://www.net-a-porter.com/fr-fr/shop/chaussures"] |
pagination_max | integer | Maximum number of pages to scrape. | 5 |
products_max | integer | Maximum number of products to scrape. | 50 |
proxyConfiguration | object | Proxy configuration for scraping (using Apify proxies is recommended). | { "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] } |
Output
The Actor generates a JSON dataset containing, for each product, an object similar to:
{ "url": "https://www.net-a-porter.com/fr-fr/shop/product/gianvito-rossi/...", "designer": "GIANVITO ROSSI", "productName": "Flat shoes with pointed toe in suede", "price": "690", "color": "Black", "productCode": "1647597357846081" }
Legal Disclaimer
This project is intended for educational and research purposes only. Please ensure you comply with the Net-A-Porter website’s terms of use and applicable scraping laws.
Additional Explanations
-
Pagination and Limitation:
The routes.js file checks on each listing page if the page number (defined via thepageNumber
parameter in the URL) is less thanpagination_max
and adds the next page. Additionally, before adding a product detail page for processing, it verifies that the total number of scraped products does not exceedproducts_max
. -
Page Differentiation:
The logic in handleRequest determines whether the URL corresponds to a listing page or a detail page. On listing pages, product detail links are extracted using the selector.ProductList0__productItemContainer a
. -
Proxy Usage:
The proxy configuration is managed via theproxyConfiguration
parameter in the input schema and is applied when launching the crawler.
You can customize and extend this template according to your specific needs or changes in the target site's HTML structure.
Pricing
Pricing model
RentalTo use this Actor, you have to pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period. You also pay for the Apify platform usage.
Free trial
Price
$45.00