Amazon Product Scraper

14-day trial, then $40.00/month. No credit card required now.

junglee/amazon-crawler

Use this Amazon scraper to collect data based on URL and country from the Amazon website. Extract product information without using the Amazon API, including reviews, prices, descriptions, and Amazon Standard Identification Numbers (ASINs). Download data in various structured formats.

You can access the Amazon Product Scraper programmatically from your own applications by using the Apify API. To use the Apify API, you’ll need an Apify account and your API token, which you can find in the Integrations settings in Apify Console.
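As a minimal sketch, a request to the synchronous endpoint listed in the definition below could be assembled as follows. It uses only Python's standard library; `build_run_request` and the placeholder token are illustrative names rather than part of the Apify API, and the request is built but deliberately not sent.

```python
import json
import urllib.parse
import urllib.request

# Values taken from the OpenAPI definition ("servers" and "paths").
API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "/acts/junglee~amazon-crawler/run-sync-get-dataset-items"


def build_run_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request that runs the Actor synchronously.

    Hypothetical helper: the token goes into the `token` query parameter and
    the Actor input is sent as a JSON body, as the definition describes.
    """
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        url=f"{API_BASE}{ACTOR_PATH}?{query}",
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_run_request(
    "YOUR_APIFY_TOKEN",  # placeholder, replace with your own token
    {"categoryOrProductUrls": [{"url": "https://www.amazon.com/dp/B0B3BVWJ6Y"}]},
)
# Sending it with urllib.request.urlopen(request) would start a real Actor run.
```

Keeping the request construction separate from sending makes it easy to inspect the URL and body before spending any platform credits.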

{
  "openapi": "3.0.1",
  "info": {
    "version": "0.0",
    "x-build-id": "a9RJPVm5C5hZLVRnT"
  },
  "servers": [
    {
      "url": "https://api.apify.com/v2"
    }
  ],
  "paths": {
    "/acts/junglee~amazon-crawler/run-sync-get-dataset-items": {
      "post": {
        "operationId": "run-sync-get-dataset-items-junglee-amazon-crawler",
        "x-openai-isConsequential": false,
        "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
        "tags": [
          "Run Actor"
        ],
        "requestBody": {
          "required": true,
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/inputSchema"
              }
            }
          }
        },
        "parameters": [
          {
            "name": "token",
            "in": "query",
            "required": true,
            "schema": {
              "type": "string"
            },
            "description": "Enter your Apify token here"
          }
        ],
        "responses": {
          "200": {
            "description": "OK"
          }
        }
      }
    },
    "/acts/junglee~amazon-crawler/runs": {
      "post": {
        "operationId": "runs-sync-junglee-amazon-crawler",
        "x-openai-isConsequential": false,
        "summary": "Executes an Actor and returns information about the initiated run in response.",
        "tags": [
          "Run Actor"
        ],
        "requestBody": {
          "required": true,
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/inputSchema"
              }
            }
          }
        },
        "parameters": [
          {
            "name": "token",
            "in": "query",
            "required": true,
            "schema": {
              "type": "string"
            },
            "description": "Enter your Apify token here"
          }
        ],
        "responses": {
          "200": {
            "description": "OK",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/runsResponseSchema"
                }
              }
            }
          }
        }
      }
    },
    "/acts/junglee~amazon-crawler/run-sync": {
      "post": {
        "operationId": "run-sync-junglee-amazon-crawler",
        "x-openai-isConsequential": false,
        "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
        "tags": [
          "Run Actor"
        ],
        "requestBody": {
          "required": true,
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/inputSchema"
              }
            }
          }
        },
        "parameters": [
          {
            "name": "token",
            "in": "query",
            "required": true,
            "schema": {
              "type": "string"
            },
            "description": "Enter your Apify token here"
          }
        ],
        "responses": {
          "200": {
            "description": "OK"
          }
        }
      }
    }
  },
  "components": {
    "schemas": {
      "inputSchema": {
        "type": "object",
        "required": [
          "categoryOrProductUrls"
        ],
        "properties": {
          "categoryOrProductUrls": {
            "title": "Enter one or more Amazon category or product URLs",
            "uniqueItems": true,
            "type": "array",
            "description": "Use a link to the Amazon category you want to scrape, e.g. video games - https://www.amazon.com/s?bbn=16225016011&rh=n%3A20972781011, https://www.amazon.com/dp/B0B3BVWJ6Y.\n You can also filter within the category with almost any filter available, i.e. price, manufacturer, rating, etc. \n By using a different URL you can also choose a different Amazon country, e.g. Germany: https://www.amazon.de/. You don't need to set a proxy country to do that.",
            "items": {
              "type": "object",
              "required": [
                "url"
              ],
              "properties": {
                "url": {
                  "type": "string",
                  "title": "URL of a web page",
                  "format": "uri"
                }
              }
            }
          },
          "maxItemsPerStartUrl": {
            "title": "Max results (per start URL)",
            "minimum": 0,
            "type": "integer",
            "description": "Enter the maximum number of results you want to scrape for each start URL. This will let the scraper know when to stop."
          },
          "language": {
            "title": "Language",
            "enum": [
              "cs",
              "da",
              "de",
              "en",
              "es",
              "fr",
              "nl",
              "pl",
              "pt",
              "sv",
              "tr",
              "he",
              "ar",
              "mrta",
              "hi",
              "bn",
              "ta",
              "te",
              "kn",
              "ml",
              "ko",
              "zh_CN",
              "zh_TW",
              "ja"
            ],
            "type": "string",
            "description": "Language to use on Amazon.\n\n**Disclaimer**: each Amazon domain has a different set of supported languages. If you choose a language that is not supported by the domain, the scraper will use the default language of the domain."
          },
          "proxyCountry": {
            "title": "Proxy country",
            "enum": [
              "AUTO_SELECT_PROXY_COUNTRY",
              "AF",
              "AL",
              "DZ",
              "AS",
              "AD",
              "AO",
              "AI",
              "AQ",
              "AG",
              "AR",
              "AM",
              "AW",
              "AU",
              "AT",
              "AZ",
              "BS",
              "BH",
              "BD",
              "BB",
              "BY",
              "BE",
              "BZ",
              "BJ",
              "BM",
              "BT",
              "BO",
              "BQ",
              "BA",
              "BW",
              "BV",
              "BR",
              "IO",
              "VG",
              "BN",
              "BG",
              "BF",
              "BI",
              "KH",
              "CM",
              "CA",
              "CV",
              "KY",
              "CF",
              "TD",
              "CL",
              "CN",
              "CX",
              "CC",
              "CO",
              "KM",
              "CK",
              "CR",
              "HR",
              "CU",
              "CW",
              "CY",
              "CZ",
              "CD",
              "DK",
              "DJ",
              "DM",
              "DO",
              "TL",
              "EC",
              "EG",
              "SV",
              "GQ",
              "ER",
              "EE",
              "ET",
              "FK",
              "FO",
              "FJ",
              "FI",
              "FR",
              "GF",
              "PF",
              "TF",
              "GA",
              "GM",
              "GE",
              "DE",
              "GH",
              "GI",
              "GR",
              "GL",
              "GD",
              "GP",
              "GU",
              "GT",
              "GG",
              "GW",
              "GN",
              "GY",
              "HT",
              "HM",
              "HN",
              "HK",
              "HU",
              "IS",
              "IN",
              "ID",
              "IR",
              "IQ",
              "IE",
              "IM",
              "IL",
              "IT",
              "CI",
              "JM",
              "JP",
              "JE",
              "JO",
              "KZ",
              "KE",
              "KI",
              "XK",
              "KW",
              "KG",
              "LA",
              "LV",
              "LB",
              "LS",
              "LR",
              "LY",
              "LI",
              "LT",
              "LU",
              "MO",
              "MG",
              "MW",
              "MY",
              "MV",
              "ML",
              "MT",
              "MH",
              "MQ",
              "MR",
              "MU",
              "YT",
              "MX",
              "FM",
              "MD",
              "MC",
              "MN",
              "ME",
              "MS",
              "MA",
              "MZ",
              "MM",
              "NA",
              "NR",
              "NP",
              "NL",
              "NC",
              "NZ",
              "NI",
              "NE",
              "NG",
              "NU",
              "NF",
              "KP",
              "MK",
              "MP",
              "NO",
              "OM",
              "PK",
              "PW",
              "PS",
              "PA",
              "PG",
              "PY",
              "PE",
              "PH",
              "PN",
              "PL",
              "PT",
              "PR",
              "QA",
              "CG",
              "RE",
              "RO",
              "RU",
              "RW",
              "BL",
              "SH",
              "KN",
              "LC",
              "MF",
              "PM",
              "VC",
              "WS",
              "SM",
              "ST",
              "SA",
              "SN",
              "RS",
              "SC",
              "SL",
              "SG",
              "SX",
              "SK",
              "SI",
              "SB",
              "SO",
              "ZA",
              "GS",
              "KR",
              "SS",
              "ES",
              "LK",
              "SD",
              "SR",
              "SJ",
              "SZ",
              "SE",
              "CH",
              "SY",
              "TW",
              "TJ",
              "TZ",
              "TH",
              "TG",
              "TK",
              "TO",
              "TT",
              "TN",
              "TR",
              "TM",
              "TC",
              "TV",
              "UM",
              "VI",
              "UG",
              "UA",
              "AE",
              "GB",
              "US",
              "UY",
              "UZ",
              "VU",
              "VA",
              "VE",
              "VN",
              "WF",
              "EH",
              "YE",
              "ZM",
              "ZW",
              "AX"
            ],
            "type": "string",
            "description": "You can set proxy country to **avoid geo-blocking** and **access region-specific content**. Amazon shows you the products that can be shipped to your address based on the proxy you use.\n\n*By default*, the scraper **automatically selects** the **proxy country** based on the used Amazon website domain, ensuring low blocking rates and results relevant to the base domain.",
            "default": "AUTO_SELECT_PROXY_COUNTRY"
          },
          "maxOffers": {
            "title": "Maximum scraped offers",
            "minimum": 0,
            "type": "integer",
            "description": "Enter the maximum number of offers you want to scrape for each product. This will let the scraper know when to stop.\n\nNote that this **will increase the number of requests** and **extend the scraping time**.",
            "default": 0
          },
          "scrapeSellers": {
            "title": "Scrape sellers",
            "type": "boolean",
            "description": "Enable this option to extract further information about sellers of the product. Scrapes the featured offer seller (main seller).\n\nThis option goes hand in hand with the `maxOffers` option by scraping the seller of each offer. If you set `maxOffers` to 0, only the featured offer seller will be scraped.\n\nNote that this **will increase the number of requests** and **extend the scraping time**.",
            "default": false
          },
          "useCaptchaSolver": {
            "title": "Use Captcha solver (warning: see description)",
            "type": "boolean",
            "description": "If enabled, the scraper will automatically solve captchas thrown by Amazon. This will decrease the number of request retries and increase the speed of the scraper.<br><br>***IMPORTANT*** - This option works well only for the `'.com'` Amazon domain, but even for that one Amazon doesn't show a few product fields after solving a captcha (specifically: 'attributes', 'manufacturer attributes', and 'bestseller ranks')",
            "default": false
          },
          "scrapeProductVariantPrices": {
            "title": "Scrape product variant prices",
            "type": "boolean",
            "description": "Enable this option to extract prices of different variations of a product. Useful when you need prices for each variant.\n\nNote that this **will increase the number of requests** and **extend the scraping time**.",
            "default": false
          },
          "scrapeProductDetails": {
            "title": "Scrape product details",
            "type": "boolean",
            "description": "If enabled, the scraper will extract each found product from the category page in detail (this is the default behavior).\n\nIf disabled, the scraper will only extract the quick product information from the category page. Useful for faster and more lightweight searches for products.",
            "default": true
          }
        }
      },
      "runsResponseSchema": {
        "type": "object",
        "properties": {
          "data": {
            "type": "object",
            "properties": {
              "id": {
                "type": "string"
              },
              "actId": {
                "type": "string"
              },
              "userId": {
                "type": "string"
              },
              "startedAt": {
                "type": "string",
                "format": "date-time",
                "example": "2025-01-08T00:00:00.000Z"
              },
              "finishedAt": {
                "type": "string",
                "format": "date-time",
                "example": "2025-01-08T00:00:00.000Z"
              },
              "status": {
                "type": "string",
                "example": "READY"
              },
              "meta": {
                "type": "object",
                "properties": {
                  "origin": {
                    "type": "string",
                    "example": "API"
                  },
                  "userAgent": {
                    "type": "string"
                  }
                }
              },
              "stats": {
                "type": "object",
                "properties": {
                  "inputBodyLen": {
                    "type": "integer",
                    "example": 2000
                  },
                  "rebootCount": {
                    "type": "integer",
                    "example": 0
                  },
                  "restartCount": {
                    "type": "integer",
                    "example": 0
                  },
                  "resurrectCount": {
                    "type": "integer",
                    "example": 0
                  },
                  "computeUnits": {
                    "type": "integer",
                    "example": 0
                  }
                }
              },
              "options": {
                "type": "object",
                "properties": {
                  "build": {
                    "type": "string",
                    "example": "latest"
                  },
                  "timeoutSecs": {
                    "type": "integer",
                    "example": 300
                  },
                  "memoryMbytes": {
                    "type": "integer",
                    "example": 1024
                  },
                  "diskMbytes": {
                    "type": "integer",
                    "example": 2048
                  }
                }
              },
              "buildId": {
                "type": "string"
              },
              "defaultKeyValueStoreId": {
                "type": "string"
              },
              "defaultDatasetId": {
                "type": "string"
              },
              "defaultRequestQueueId": {
                "type": "string"
              },
              "buildNumber": {
                "type": "string",
                "example": "1.0.0"
              },
              "containerUrl": {
                "type": "string"
              },
              "usage": {
                "type": "object",
                "properties": {
                  "ACTOR_COMPUTE_UNITS": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATASET_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATASET_WRITES": {
                    "type": "integer",
                    "example": 0
                  },
                  "KEY_VALUE_STORE_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "KEY_VALUE_STORE_WRITES": {
                    "type": "integer",
                    "example": 1
                  },
                  "KEY_VALUE_STORE_LISTS": {
                    "type": "integer",
                    "example": 0
                  },
                  "REQUEST_QUEUE_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "REQUEST_QUEUE_WRITES": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATA_TRANSFER_INTERNAL_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATA_TRANSFER_EXTERNAL_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "PROXY_SERPS": {
                    "type": "integer",
                    "example": 0
                  }
                }
              },
              "usageTotalUsd": {
                "type": "number",
                "example": 0.00005
              },
              "usageUsd": {
                "type": "object",
                "properties": {
                  "ACTOR_COMPUTE_UNITS": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATASET_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATASET_WRITES": {
                    "type": "integer",
                    "example": 0
                  },
                  "KEY_VALUE_STORE_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "KEY_VALUE_STORE_WRITES": {
                    "type": "number",
                    "example": 0.00005
                  },
                  "KEY_VALUE_STORE_LISTS": {
                    "type": "integer",
                    "example": 0
                  },
                  "REQUEST_QUEUE_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "REQUEST_QUEUE_WRITES": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATA_TRANSFER_INTERNAL_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATA_TRANSFER_EXTERNAL_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "PROXY_SERPS": {
                    "type": "integer",
                    "example": 0
                  }
                }
              }
            }
          }
        }
      }
    }
  }
}
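The `inputSchema` in the definition above can be mirrored by a small client-side check before a run is started. The following is only a sketch: `build_actor_input` is a hypothetical helper, it enforces a subset of the schema (unique URLs, absolute URLs, non-negative integers, boolean flags), and it does not check the `language` or `proxyCountry` enums.

```python
from urllib.parse import urlparse

# Property names copied from inputSchema; the grouping is this sketch's own.
NON_NEGATIVE_INT_OPTIONS = {"maxItemsPerStartUrl", "maxOffers"}
BOOLEAN_OPTIONS = {
    "scrapeSellers",
    "useCaptchaSolver",
    "scrapeProductVariantPrices",
    "scrapeProductDetails",
}
UNCHECKED_OPTIONS = {"language", "proxyCountry"}  # enum values not validated here


def build_actor_input(urls, **options):
    """Return an Actor input dict, raising ValueError on obvious schema violations."""
    if len(set(urls)) != len(urls):
        raise ValueError("categoryOrProductUrls must contain unique items")
    for url in urls:
        parts = urlparse(url)
        if not (parts.scheme and parts.netloc):
            raise ValueError(f"not an absolute URL: {url!r}")
    for name, value in options.items():
        if name in NON_NEGATIVE_INT_OPTIONS:
            # bool is a subclass of int in Python, so exclude it explicitly.
            if isinstance(value, bool) or not isinstance(value, int) or value < 0:
                raise ValueError(f"{name} must be an integer >= 0")
        elif name in BOOLEAN_OPTIONS:
            if not isinstance(value, bool):
                raise ValueError(f"{name} must be a boolean")
        elif name not in UNCHECKED_OPTIONS:
            raise ValueError(f"unknown input property: {name}")
    return {"categoryOrProductUrls": [{"url": u} for u in urls], **options}


payload = build_actor_input(
    ["https://www.amazon.de/"],
    maxItemsPerStartUrl=10,
    scrapeSellers=True,
)
```

Rejecting bad input locally avoids starting a run that would fail on the platform's own schema validation.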

📦 Amazon Product Scraper OpenAPI definition

OpenAPI is a standard for designing and describing RESTful APIs, allowing developers to define API structure, endpoints, and data formats in a machine-readable way. It simplifies API development, integration, and documentation.
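To illustrate what "machine-readable" means in practice: request bodies in this definition point at `#/components/schemas/inputSchema` rather than repeating the schema. A tool can follow such a local reference with a few lines of code. The sketch below assumes a trimmed-down excerpt of the definition, and `resolve_ref` is an illustrative helper, not a library function.

```python
# Trimmed excerpt of the definition; only the parts needed for the lookup.
SPEC_EXCERPT = {
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": ["categoryOrProductUrls"],
            }
        }
    }
}


def resolve_ref(document: dict, ref: str):
    """Follow a local "$ref" such as "#/components/schemas/inputSchema"."""
    if not ref.startswith("#/"):
        raise ValueError("only local references are handled in this sketch")
    node = document
    for part in ref[2:].split("/"):
        # JSON Pointer escaping: "~1" stands for "/", "~0" stands for "~".
        part = part.replace("~1", "/").replace("~0", "~")
        node = node[part]
    return node


input_schema = resolve_ref(SPEC_EXCERPT, "#/components/schemas/inputSchema")
```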

OpenAPI works well with AI agents and GPTs because it standardizes how these systems interact with APIs, which makes integrations more reliable.

Because the specification is machine-readable, AI models such as GPTs can read which endpoints exist and what input each one expects instead of relying on free-form documentation. This speeds up development and reduces integration errors, making OpenAPI a core component for AI applications.
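A rough sketch of how a tool or agent might discover the available operations from such a definition is shown below. The embedded excerpt keeps only the paths and `operationId` values from the definition on this page; `list_operations` is a hypothetical helper.

```python
# Excerpt of the "paths" object; summaries, parameters and schemas are omitted.
SPEC_PATHS_EXCERPT = {
    "paths": {
        "/acts/junglee~amazon-crawler/run-sync-get-dataset-items": {
            "post": {"operationId": "run-sync-get-dataset-items-junglee-amazon-crawler"}
        },
        "/acts/junglee~amazon-crawler/runs": {
            "post": {"operationId": "runs-sync-junglee-amazon-crawler"}
        },
        "/acts/junglee~amazon-crawler/run-sync": {
            "post": {"operationId": "run-sync-junglee-amazon-crawler"}
        },
    }
}

# HTTP methods that OpenAPI 3.0 allows as keys of a path item.
HTTP_METHODS = {"get", "put", "post", "delete", "options", "head", "patch", "trace"}


def list_operations(spec: dict) -> list:
    """Return (METHOD, path, operationId) tuples for every operation in the spec."""
    operations = []
    for path, path_item in spec.get("paths", {}).items():
        for method, operation in path_item.items():
            if method in HTTP_METHODS:
                operations.append((method.upper(), path, operation.get("operationId")))
    return operations


operations = list_operations(SPEC_PATHS_EXCERPT)
```

An agent framework would typically turn each tuple into a callable tool, using the referenced input schema to describe its arguments.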

You can download the OpenAPI definitions for Amazon Product Scraper from the options below:

If you’d like to learn more about how OpenAPI powers GPTs, read our blog post.

Developer
Maintained by Apify

Actor Metrics

  • 294 monthly users
  • 109 bookmarks
  • >99% runs succeeded
  • 2.6 days response time
  • Created in Oct 2018
  • Modified 4 hours ago

Categories