Ultimate ALDI Scraper

1 day trial then $30.00/month - No credit card required now

eneiromatos/ultimate-aldi-scraper

A web scraping tool for extracting the most relevant data points from products on new.aldi.us. Developed by an expert software developer, this scraper is built to be fast and reliable.

Ultimate ALDI Web Scraper

Ultimate ALDI Web Scraper is a web scraping tool for extracting the most relevant data points from products on new.aldi.us. Developed by an expert software developer and written in TypeScript, it is built to be fast and reliable. You can extract data from the ALDI website (https://new.aldi.us/) using a variety of inputs: category URLs, brand URLs, search keywords, or specific product URLs. Whether you're an eCommerce entrepreneur, a data analyst, or just someone who loves collecting data, this tool helps you get the job done quickly and efficiently. Register today and start scraping!

Usage

The input for the scraper should contain the following fields:

  • productUrls: An array of URLs for specific product pages to scrape.
  • listingUrls: An array of URLs for category pages or brand pages to scrape (pages that contain a paginated listing of products).
  • keywords: An array of search keywords to use when searching the ALDI website.
  • maxPrice: The maximum price for products to scrape.
  • minPrice: The minimum price for products to scrape.
  • startPageNumber: The page number to start scraping from.
  • finalPageNumber: The final page number to scrape.

Setting both minPrice and maxPrice to 0 tells the scraper to collect products from all price ranges.

Setting both startPageNumber and finalPageNumber to 0 tells the scraper to crawl the entire page range.
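
As an illustration of the input described above, here is a minimal TypeScript sketch. The field names come from the list; the field types are inferred from the descriptions, and the helpers buildInput and validateInput are hypothetical and not part of the scraper. The rule that at least one URL or keyword must be supplied is also an assumption.

```typescript
// Field names are taken from the input list; the types are inferred, not confirmed.
interface AldiScraperInput {
  productUrls: string[];
  listingUrls: string[];
  keywords: string[];
  maxPrice: number;
  minPrice: number;
  startPageNumber: number;
  finalPageNumber: number;
}

// Hypothetical helper: fills every unspecified field with its "no restriction" value.
function buildInput(partial: Partial<AldiScraperInput>): AldiScraperInput {
  return {
    productUrls: [],
    listingUrls: [],
    keywords: [],
    maxPrice: 0, // 0 for both prices = all price ranges
    minPrice: 0,
    startPageNumber: 0, // 0 for both page numbers = whole page range
    finalPageNumber: 0,
    ...partial,
  };
}

// Hypothetical sanity check: returns a list of problems, empty if none were found.
function validateInput(input: AldiScraperInput): string[] {
  const problems: string[] = [];
  const sources =
    input.productUrls.length + input.listingUrls.length + input.keywords.length;
  // Assumption: the scraper needs at least one starting point.
  if (sources === 0) {
    problems.push("provide at least one product URL, listing URL, or keyword");
  }
  if (input.minPrice < 0 || input.maxPrice < 0) {
    problems.push("prices must not be negative");
  }
  if (input.maxPrice !== 0 && input.minPrice > input.maxPrice) {
    problems.push("minPrice exceeds maxPrice");
  }
  if (input.finalPageNumber !== 0 && input.startPageNumber > input.finalPageNumber) {
    problems.push("startPageNumber exceeds finalPageNumber");
  }
  return problems;
}

// Search for "coffee" across all pages, keeping products priced up to 10.
const exampleInput = buildInput({ keywords: ["coffee"], maxPrice: 10 });
console.log(JSON.stringify(exampleInput, null, 2));
console.log(validateInput(exampleInput)); // prints []
```

The resulting object can be serialized to JSON and supplied as the run input.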

Output

The output includes the following fields for each product:

  • URL: The URL of the product page.
  • idCodes: An object containing the unique identifier codes of the product, including UPC.
  • seller: An object containing information about the seller and brand of the product, including the brand and brandURL.
  • title: The title of the product.
  • media: An object containing URLs for images and videos of the product, including the main image URL and gallery array of image URLs.
  • pricing: An object containing pricing information for the product, including the fullPrice and currencySymbol.
  • isAvailable: A boolean indicating whether the product is currently available.
  • info: An object containing additional information about the product, including the longDescription and countryOfOrigin.
  • measurements: An object containing information about the product measures, including the baseMeasure and packagesCount.
  • category: An object containing information about the category of the product, including the fullPath and pathParts array of category objects, each with a name and url field.
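
To show how these records might be consumed, the sketch below types a subset of the output fields and lists the available products with their prices. The field names follow the list above, but the value types (for example, fullPrice as a number) are assumptions, the summarizeAvailable helper is hypothetical, and the sample records are invented placeholders, not real scraper output.

```typescript
interface CategoryPart {
  name: string;
  url: string;
}

// Only a subset of the output fields; types are assumed from the descriptions.
interface AldiProduct {
  URL: string;
  title: string;
  isAvailable: boolean;
  seller: { brand: string; brandURL: string };
  pricing: { fullPrice: number; currencySymbol: string };
  category: { fullPath: string; pathParts: CategoryPart[] };
}

// Hypothetical helper: one summary line per product that is currently available.
function summarizeAvailable(products: AldiProduct[]): string[] {
  return products
    .filter((p) => p.isAvailable)
    .map(
      (p) =>
        `${p.title} (${p.seller.brand}): ` +
        `${p.pricing.currencySymbol}${p.pricing.fullPrice.toFixed(2)}`
    );
}

// Invented placeholder records for illustration only.
const sampleProducts: AldiProduct[] = [
  {
    URL: "https://example.com/product/1",
    title: "Example Coffee",
    isAvailable: true,
    seller: { brand: "Example Brand", brandURL: "https://example.com/brand" },
    pricing: { fullPrice: 4.99, currencySymbol: "$" },
    category: {
      fullPath: "Pantry/Coffee",
      pathParts: [
        { name: "Pantry", url: "https://example.com/pantry" },
        { name: "Coffee", url: "https://example.com/pantry/coffee" },
      ],
    },
  },
  {
    URL: "https://example.com/product/2",
    title: "Example Tea",
    isAvailable: false,
    seller: { brand: "Example Brand", brandURL: "https://example.com/brand" },
    pricing: { fullPrice: 3.5, currencySymbol: "$" },
    category: { fullPath: "Pantry/Tea", pathParts: [] },
  },
];

console.log(summarizeAvailable(sampleProducts).join("\n"));
// prints: Example Coffee (Example Brand): $4.99
```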

Note

Please report any errors you find, and let me know if you have suggestions for improving this software.

Developer
Maintained by Community

Actor Metrics

  • 1 monthly user

  • 1 star

  • >99% runs succeeded

  • Created in Jan 2024

  • Modified a year ago

Categories