Amazon Bestsellers Scraper

  • junglee/amazon-bestsellers
  • Users 158
  • Runs 826
  • Created by Junglee

Scrape the Amazon Best Sellers categories and extract details on the top 100 most popular items on Amazon. Download product name, price, URL, and thumbnail image. Best used on the .com, .co.uk, .de, .fr, .es, and .it domains. Download your data in various formats: HTML table, JSON, CSV, Excel, and more.

Our free Amazon Best Sellers Scraper allows you to scrape the 100 top-selling items on Amazon. It extracts data from Amazon Best Sellers pages in structured formats such as JSON, XML, CSV, or Excel. With this Amazon API, you will be able to:

  • extract data from different domains: US, GB, DE, FR, ES and IT.
  • extract all Amazon Best Seller product details: name, price, URL, and thumbnail image.
  • scrape Amazon Best Seller categories and subcategories.
  • scrape other Amazon lists besides Best Sellers: Most Wished, Movers and Shakers, New Releases, and Gift Ideas.

If you would prefer a more general Amazon product or data scraper, you should try Amazon Scraper instead.

How do I scrape Amazon Best Sellers?

For a step-by-step guide on how to scrape Amazon Best Sellers, follow our Amazon Best Sellers Scraper tutorial.

How many results can you scrape with Amazon Best Sellers Scraper?

Amazon Best Sellers Scraper can return up to tens of thousands of results. However, keep in mind that scraping amazon.com involves many variables, so results can fluctuate from case to case. There’s no one-size-fits-all number: the maximum number of results depends on the complexity of the input, location, and other factors. Some of the most frequent cases are:

  • website gives a different number of results depending on the type/value of the input
  • website has an internal limit that no scraper can cross
  • scraper has a limit that we are working on improving

Therefore, while we regularly run Actor tests to keep the benchmarks in check, the results may also fluctuate without our knowing. The best way to know for sure for your particular use case is to do a test run yourself.

Need to find product pairs between Amazon and another online shop?

Use the AI Product Matcher. This AI model allows you to compare items from different web stores, identifying exact matches and comparing real-time data obtained via web scraping. With the AI Product Matcher, you can use scraped product data to monitor product matches across the industry, implement dynamic pricing for your website, replace or complement manual mapping, and obtain realistic estimates against your competition for upcoming promo campaigns.

Most importantly, it is relatively easy to get started with (just follow this guide).

Input parameters

There are two ways you can scrape Amazon Best Sellers: either by Domain or by Amazon URL.

  1. Scraping by Domain will get you data from one of 6 available Amazon domains. You can only pick one per run.
  2. Scraping by Amazon URL will get you all details from a chosen category and its subcategories. You can add as many URLs as you want.

Here's an input example for an Amazon URL:

[Image: Amazon Best Sellers input example]

Tips and tricks

  • By default, this Amazon scraper extracts the 37 top Best Seller subcategories. A deeper level of extraction can be added to allow you to scrape the top-selling items from the first level of the main categories' sub-divisions.

  • The default depth of the crawl is limited to two subcategories, but there is a way around this restriction: start on the main category and scrape two departments, then remove duplicate category URLs from the results and feed them back into the scraper.
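The "remove duplicates and feed them back" step from the tip above could look roughly like the sketch below. It assumes you have already downloaded the dataset items of a finished run as a list of dictionaries with a categoryUrl field (as in the output format shown later on this page). The function name and the example URLs are placeholders made up for illustration, not part of the scraper.

```python
from typing import Iterable, List


def next_round_urls(items: Iterable[dict], already_scraped: Iterable[str] = ()) -> List[str]:
    """Return unique category URLs from a finished run, in first-seen order.

    URLs listed in `already_scraped` (for example the start URLs of the
    previous run) are skipped so they are not crawled twice.
    """
    seen = set(already_scraped)
    unique = []
    for item in items:
        url = item.get("categoryUrl")
        if url and url not in seen:
            seen.add(url)
            unique.append(url)
    return unique


# Placeholder items: only the field this sketch needs is shown.
items = [
    {"categoryUrl": "https://example.com/zgbs/books"},
    {"categoryUrl": "https://example.com/zgbs/books/fiction"},
    {"categoryUrl": "https://example.com/zgbs/books/fiction"},
    {"categoryUrl": "https://example.com/zgbs/books/cooking"},
]

start_urls = next_round_urls(items, already_scraped=["https://example.com/zgbs/books"])
print(start_urls)
```

The resulting list can then be pasted in as the Amazon URLs for the next run. This sketch only removes exact duplicates; URLs that differ in a tracking suffix would still count as distinct.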

Proxy configuration

The proxy configuration (proxyConfiguration) option enables you to set proxies that will be used by the scraper in order to prevent its detection by target websites. You can use both Apify Proxy and custom HTTP or SOCKS5 proxy servers.

The proxy configuration setting offers the following options:

  • Apify Proxy (automatic): The scraper will load all web pages using Apify Proxy in automatic mode. In this mode, the proxy uses all proxy groups that are available to the user, and for each new web page it automatically selects the proxy that hasn't been used in the longest time for the specific hostname, in order to reduce the chance of detection by the website. You can view the list of available proxy groups on the proxy page in the app.

  • Apify Proxy (selected groups): The scraper will load all web pages using Apify Proxy with specific groups of target proxy servers.

  • Custom proxies: The scraper will use a custom list of proxy servers. The proxies must be specified in the scheme://user:password@host:port format. Multiple proxies should be separated by a space or a new line. The URL scheme can be either HTTP or SOCKS5. The user and password may be omitted, but the port must always be present.
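If you maintain a custom proxy list by hand, it can help to check it against the rules above before starting a run. The following is a minimal sketch using only the Python standard library. The function name and the example proxy addresses are made up for illustration, and the scraper itself may validate its input differently.

```python
from typing import List
from urllib.parse import urlsplit

# Schemes allowed for custom proxies according to the description above.
ALLOWED_SCHEMES = {"http", "socks5"}


def parse_custom_proxies(raw: str) -> List[str]:
    """Split a space- or newline-separated proxy list and validate each entry.

    Raises ValueError for an unsupported scheme, a missing host,
    or a missing or invalid port.
    """
    proxies = []
    for entry in raw.split():  # split() handles spaces and new lines alike
        parts = urlsplit(entry)
        if parts.scheme.lower() not in ALLOWED_SCHEMES:
            raise ValueError(f"Unsupported scheme in proxy: {entry!r}")
        # Accessing .port raises ValueError if the port is not a valid number.
        if not parts.hostname or parts.port is None:
            raise ValueError(f"Host and port are required: {entry!r}")
        proxies.append(entry)
    return proxies


raw_list = "http://user:secret@proxy.example.com:8000\nsocks5://10.0.0.1:1080"
print(parse_custom_proxies(raw_list))
```

The user and password are optional here, as in the description, while an entry without a port is rejected.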

Amazon Best Sellers data output

You can download the resulting datasets in various universal formats such as JSON, HTML, CSV or Excel. Each item in the dataset will contain a separate Amazon item following this format:

[
  {
    "category": "Amazon.co.uk Best Sellers: The most popular items in Books",
    "categoryUrl": "https://www.amazon.co.uk/best-sellers-books-Amazon/zgbs/books/ref=zg_bs_nav_0",
    "ID": 0,
    "name": "The Bullet That Missed: (The Thursday Murder Club 3)",
    "price": null,
    "url": "https://www.amazon.co.uk/Bullet-that-Missed-Thursday-Mystery/dp/0241512425/ref=zg_bs_books_sccl_1/261-2733972-0388621?pd_rd_i=0241512425&psc=1",
    "thumbnail": "https://images-eu.ssl-images-amazon.com/images/I/71xfjR3QXyL._AC_UL600_SR600,400_.jpg"
  },
  {
    "category": "Amazon.co.uk Best Sellers: The most popular items in Books",
    "categoryUrl": "https://www.amazon.co.uk/best-sellers-books-Amazon/zgbs/books/ref=zg_bs_nav_0",
    "ID": 1,
    "name": "One: Simple One-Pan Wonders",
    "price": null,
    "url": "https://www.amazon.co.uk/One-One-Pan-Wonders-Jamie-Oliver/dp/0241431107/ref=zg_bs_books_sccl_2/261-2733972-0388621?pd_rd_i=0241431107&psc=1",
    "thumbnail": "https://images-eu.ssl-images-amazon.com/images/I/81CBtopMxOL._AC_UL600_SR600,400_.jpg"
  },
  {
    "category": "Amazon.co.uk Best Sellers: The most popular items in Books",
    "categoryUrl": "https://www.amazon.co.uk/best-sellers-books-Amazon/zgbs/books/ref=zg_bs_nav_0",
    "ID": 2,
    "name": "Verity: The thriller that will capture your heart and blow your mind",
    "price": null,
    "url": "https://www.amazon.co.uk/Verity-thriller-that-capture-heart/dp/1408726602/ref=zg_bs_books_sccl_3/261-2733972-0388621?pd_rd_i=1408726602&psc=1",
    "thumbnail": "https://images-eu.ssl-images-amazon.com/images/I/91e9kVbpfZL._AC_UL600_SR600,400_.jpg"
  }
  ...
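Once downloaded as JSON, the items can be processed with a few lines of Python. The sketch below groups items by category and lists the products whose price is null, as in the example above. The sample records are trimmed copies of two items from that example, and the helper names are made up for illustration.

```python
import json
from collections import defaultdict

# Two records from the example output, trimmed to the fields used here.
SAMPLE = """
[
  {"category": "Amazon.co.uk Best Sellers: The most popular items in Books",
   "ID": 0, "name": "The Bullet That Missed: (The Thursday Murder Club 3)", "price": null},
  {"category": "Amazon.co.uk Best Sellers: The most popular items in Books",
   "ID": 1, "name": "One: Simple One-Pan Wonders", "price": null}
]
"""


def group_by_category(items):
    """Map each category title to the list of items found in it."""
    grouped = defaultdict(list)
    for item in items:
        grouped[item["category"]].append(item)
    return dict(grouped)


def names_missing_price(items):
    """Return the names of items whose price is absent or null."""
    return [item["name"] for item in items if item.get("price") is None]


items = json.loads(SAMPLE)
for category, members in group_by_category(items).items():
    print(category, len(members))
print(names_missing_price(items))
```

Checking for a missing price before doing any arithmetic on it avoids errors on items like the ones shown, where the scraper returned null.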

During the run

  • During the run, the Actor will output messages notifying you of which page is being extracted. When the items are extracted, the Actor will notify you that they are being saved.
  • Due to concurrent extraction of pages, these notifications may not be displayed in order.
  • In the event of an error, the Actor will complete its run immediately, without adding any data to the dataset.

How can you use Amazon Best Sellers data?

If you're web scraping Amazon for retail or market research, the Amazon Best Sellers list can tell you a lot about the top trends in e-commerce. Competing directly against top-selling items across Amazon can be difficult, but it also can be a source of inspiration for new products for e-commerce retailers. Data collected on those top-selling items can help you:

  • stay ahead of the competition
  • improve sales for your business
  • track up-and-coming products
  • adjust your product prices to market levels

Integrations and Amazon Best Sellers Scraper

Last but not least, Amazon Best Sellers Scraper can be connected with almost any cloud service or web app thanks to integrations on the Apify platform. You can integrate with Make, Zapier, Slack, Airbyte, GitHub, Google Sheets, Google Drive, and more. Or you can use webhooks to carry out an action whenever an event occurs, e.g. get a notification whenever Amazon Best Sellers Scraper successfully finishes a run.

Using Amazon Best Sellers Scraper with the Apify API

The Apify API gives you programmatic access to the Apify platform. The API is organized around RESTful HTTP endpoints that enable you to manage, schedule, and run Apify Actors. The API also lets you access any datasets, monitor Actor performance, fetch results, create and update versions, and more.

To access the API using Node.js, use the apify-client NPM package. To access the API using Python, use the apify-client PyPI package.

Check out the Apify API reference docs for full details or click on the API tab for code examples.
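As a rough illustration, running the scraper from Python with the apify-client package might look like the sketch below. Several details are assumptions rather than confirmed facts: the input field name categoryUrls is hypothetical (check the Input tab for the real schema), the useApifyProxy shape of proxyConfiguration and the APIFY_TOKEN environment variable name are assumed, and the shape of the run object returned by call() may differ between client versions.

```python
def run_and_collect(client, run_input, actor_id="junglee/amazon-bestsellers"):
    """Start the Actor, wait for it to finish, and return its dataset items.

    `client` is expected to behave like apify_client.ApifyClient.
    """
    run = client.actor(actor_id).call(run_input=run_input)
    # Assumption: the finished run is a dict exposing "defaultDatasetId".
    dataset = client.dataset(run["defaultDatasetId"])
    return list(dataset.iterate_items())


def main():
    # Imported here so the helper above can be used and tested without the package.
    import os
    from apify_client import ApifyClient

    client = ApifyClient(os.environ["APIFY_TOKEN"])  # assumed env var name
    run_input = {
        # Hypothetical field name; see the Input tab for the actual parameters.
        "categoryUrls": [
            "https://www.amazon.co.uk/best-sellers-books-Amazon/zgbs/books/ref=zg_bs_nav_0"
        ],
        # Assumed shape of the proxyConfiguration option described earlier.
        "proxyConfiguration": {"useApifyProxy": True},
    }
    for item in run_and_collect(client, run_input):
        print(item.get("name"), item.get("price"))
```

Calling main() requires the apify-client package and a valid API token. Keeping the client as a parameter of run_and_collect makes it possible to try the logic with a stand-in object before spending platform credits.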

Resources on how to scrape Amazon Best Sellers data

  1. Platform pricing page with pricing specifications.
  2. Video guide on how to choose the right subscription plan.
  3. Step-by-step guide on how to use Amazon Best Sellers scraper.
  4. Input tab with all the technical parameters of this scraper.
  5. "Is web scraping legal?" on the legal aspects of scraping.

Not your cup of tea? Build your own scraper

Amazon Best Sellers Scraper doesn’t do exactly what you need? You can always build your own! We have various scraper templates in Python, JavaScript, and TypeScript to get you started. Alternatively, you can write it from scratch using our open-source library Crawlee. You can keep the scraper to yourself or make it public by adding it to Apify Store (and find users for it).

Or let us know if you need a custom scraping solution.

Your feedback

We’re always working on improving the performance of our Actors. So if you’ve got any technical feedback for Amazon Best Sellers Scraper or simply found a bug, please create an issue on the Actor’s Issues tab in Apify Console.