Amazon Products By Category Scraper avatar
Amazon Products By Category Scraper

Pricing

$12.00/month + usage

Go to Store
Amazon Products By Category Scraper

Amazon Products By Category Scraper

Developed by

Pinto Studio

Pinto Studio

Maintained by Community

The Amazon Products By Category Actor is a web scraping tool deployed on the Apify platform that extracts product information from Amazon category pages.

0.0 (0)

Pricing

$12.00/month + usage

0

Total users

2

Monthly users

2

Runs succeeded

>99%

Last modified

2 days ago

Amazon Products By Category Actor

Overview

The Amazon Products By Category Actor is a web scraping tool deployed on the Apify platform that extracts product information from Amazon category pages. This actor allows you to scrape product listings with various sorting options and filtering capabilities.

Features

  • Category-based scraping: Extract products from specific Amazon category URLs
  • Pagination support: Specify starting page number for scraping
  • Flexible sorting: Sort products by featured, price, reviews, or newest arrivals
  • Custom filtering: Apply additional filters to refine product search results
  • Structured output: Returns well-formatted JSON data with comprehensive product information

Input Parameters

Required Parameters

ParameterTypeDescription
categoryUrlstringThe Amazon category URL to scrape products from

Optional Parameters

ParameterTypeDefaultDescription
pageinteger1The page number to start scraping from (minimum: 1)
sortBystring"featured"How to sort the product results
filtersobjectnullAdditional filters to apply to product search

Sort Options

The sortBy parameter accepts the following values:

  • featured - Featured products (default)
  • price-asc - Price: Low to High
  • price-desc - Price: High to Low
  • review-rank - Customer Reviews
  • newest - Newest Arrivals

Input Schema Example

{
"categoryUrl": "https://www.amazon.com/s?i=specialty-aps&bbn=16225007011&rh=n%3A16225007011%2Cn%3A1292110011",
"page": 1,
"sortBy": "featured",
"filters": {
"brand": "Samsung",
"priceRange": "50-200"
}
}

Output Format

The actor returns a JSON object containing:

Main Structure

{
"results": [
{
"asin": "Product ASIN",
"title": "Product title",
"url": "Product URL",
"image": "Product image URL",
"price": "Product price",
"rating": "Customer rating",
"reviews": "Number of reviews",
"delivery": "Delivery information",
"sponsored": boolean,
"prime": boolean,
"extraInfo": "Additional delivery info"
}
],
"pagination": {
"currentPage": 1,
"itemsPerPage": 24,
"totalResults": 7000,
"totalAvailablePages": 292,
"paginationLinks": [...]
},
"relatedSearch": []
}

Product Object Fields

FieldTypeDescription
asinstringAmazon Standard Identification Number
titlestringProduct title/name
urlstringDirect link to the product page
imagestringURL of the product image
pricestringProduct price (formatted with currency)
ratingstringCustomer rating (e.g., "4.6 out of 5 stars")
reviewsstringTotal number of customer reviews
deliverystringStandard delivery information
sponsoredbooleanWhether the product is sponsored
primebooleanWhether the product is Prime eligible
extraInfostringAdditional delivery or product information

Pagination Object Fields

FieldTypeDescription
currentPageintegerCurrent page number
itemsPerPageintegerNumber of items per page
totalResultsintegerTotal number of search results
totalAvailablePagesintegerTotal number of available pages
paginationLinksarrayNavigation links for pagination

Usage Examples

Basic Usage

Scrape the first page of a category with default settings:

{
"categoryUrl": "https://www.amazon.com/s?i=computers&rh=n%3A541966"
}

Advanced Usage

Scrape with custom sorting and pagination:

{
"categoryUrl": "https://www.amazon.com/s?i=electronics&rh=n%3A172282",
"page": 2,
"sortBy": "price-asc",
"filters": {
"minPrice": 25,
"maxPrice": 100,
"brand": "Apple"
}
}

Error Handling

The actor includes comprehensive error handling:

  • Invalid URLs will be caught and logged
  • Network errors are handled gracefully
  • Parsing errors are reported with detailed messages
  • The actor will exit cleanly even if errors occur

Best Practices

  1. Rate Limiting: Be mindful of Amazon's rate limits. Avoid running multiple instances simultaneously on the same category.

  2. URL Format: Ensure the category URL is properly formatted and points to a valid Amazon category page.

  3. Pagination: Start with page 1 and increment gradually if you need multiple pages of results.

  4. Filtering: Use the filters parameter judiciously to avoid overly restrictive searches that return no results.

  5. Data Storage: The actor stores results both in the default dataset and as a named value 'output' for easy retrieval.

Technical Details

  • Platform: Apify
  • Runtime: Node.js
  • Dependencies: Axios for HTTP requests, Apify SDK for platform integration
  • Output Storage: Results are stored in both the dataset and as a key-value store

Limitations

  • Scraping is subject to Amazon's terms of service and rate limiting
  • Some product information may not be available for all items
  • Sponsored and Prime eligibility may vary by region
  • Complex filters may not work with all category types

Support

If you have any questions or encounter any issues, please consult the Apify documentation or reach out to us through one of the following channels: