Walmart Reviews Scraper avatar
Walmart Reviews Scraper

Pricing

$14.99/month + usage

Go to Apify Store
Walmart Reviews Scraper

Walmart Reviews Scraper

Scrapes customer reviews from Walmart product pages, capturing reviewer names, ratings, comments, timestamps, helpful votes, and review URLs. Ideal for sentiment analysis, product research, competitor tracking, and large-scale Walmart review data extraction

Pricing

$14.99/month + usage

Rating

0.0

(0)

Developer

Scrapio

Scrapio

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

Extract customer reviews from Walmart.com products with ease. This Apify Actor allows you to scrape product reviews from individual product pages or search results, supporting bulk processing of multiple products and keywords.

Why Choose Us?

  • Intelligent Proxy Management: Automatically handles proxy fallback (no proxy → datacenter → residential) to ensure successful data extraction even when Walmart blocks requests
  • Flexible Input Options: Supports product URLs, search URLs, or simple keywords - process hundreds of products in a single run
  • Live Data Saving: Reviews are saved immediately as they're extracted, so you never lose data even if the actor is interrupted
  • Human-like Behavior: Built-in delays and rate limiting to mimic natural browsing patterns and reduce detection risk
  • Comprehensive Logging: Detailed progress updates keep you informed throughout the extraction process
  • Robust Error Handling: Automatic retries with exponential backoff ensure maximum data collection success

Key Features

  • Multiple Input Types: Process direct product URLs, search URLs, or keywords
  • Bulk Processing: Extract reviews from multiple products simultaneously
  • Customizable Sorting: Sort reviews by relevancy, date, helpfulness, or rating
  • Smart Proxy Fallback: Automatic proxy switching when requests are blocked
  • Pagination Support: Automatically handles multiple pages of reviews
  • Rate Limiting: Built-in delays to avoid overwhelming target servers
  • Live Saving: Data saved in real-time to prevent loss
  • Skip Empty Products: Option to skip products without reviews for faster processing

Input

The actor accepts the following input parameters:

Example Input

{
"startUrls": [
{
"url": "https://walmart.com/search?q=tshirt"
},
{
"url": "https://www.walmart.com/ip/product-name/123456"
},
"laptop"
],
"sortOrder": "relevancy",
"maxComments": 10,
"maxProductsPerStartUrl": 20,
"skipProductsWithoutReviews": true,
"proxyConfiguration": {
"useApifyProxy": false
}
}

Input Fields

FieldTypeRequiredDescription
startUrlsArray✅ YesList of Walmart product URLs, search URLs, or keywords. Supports bulk input. Examples:
• Product URL: https://www.walmart.com/ip/product-name/123456
• Search URL: https://walmart.com/search?q=tshirt
• Keyword: "laptop" (will be converted to search URL)
sortOrderString❌ NoHow to sort the reviews. Options:
"relevancy" - Most relevant (default)
"submission-desc" - Newest first
"helpful" - Most helpful
"rating-desc" - Highest rating first
"rating-asc" - Lowest rating first
maxCommentsInteger❌ NoMaximum number of reviews to extract per product. Range: 1-1000. Default: 10
maxProductsPerStartUrlInteger❌ NoMaximum number of products to process from each start URL (for search URLs). Set to 0 for unlimited. Range: 0-1000. Default: 20
skipProductsWithoutReviewsBoolean❌ NoIf enabled, products with no reviews will be skipped to speed up processing. Default: true
proxyConfigurationObject❌ NoProxy settings. By default, starts with no proxy. If Walmart blocks requests, automatically falls back to datacenter proxy, then residential proxy with retries. Default: {"useApifyProxy": false}

Proxy Fallback Behavior

The actor implements intelligent proxy fallback:

  1. No Proxy (default): Starts with direct connection if useApifyProxy: false
  2. Datacenter Proxy: If request is blocked (444/403), automatically switches to datacenter proxy (SHADER group)
  3. Residential Proxy: If datacenter proxy fails, falls back to residential proxy (RESIDENTIAL group)
  4. Retry Logic: On residential proxy failure, retries up to 3 times before giving up
  5. Persistence: Once a fallback occurs, continues using the selected proxy for all remaining requests

All proxy events are logged clearly for monitoring.

Output

The actor outputs structured review data to the Apify dataset. Each review contains the following fields:

Example Output

{
"itemType": "Review",
"productUrl": "https://www.walmart.com/ip/product-name/123456",
"rating": 5,
"title": "Great product!",
"text": "I've been using this product for a month and it's amazing. Highly recommend!"
}

Output Fields

FieldTypeDescription
itemTypeStringAlways "Review" to identify the data type
productUrlStringFull URL of the Walmart product page
ratingNumberCustomer rating (typically 1-5 stars)
titleStringReview title/headline
textStringFull review text content

Dataset View

The output dataset includes a default table view with the following columns:

  • Type: Review identifier
  • Product URL: Clickable link to the product page
  • Rating: Star rating (1-5)
  • Review Title: Review headline
  • Review Text: Full review content

🚀 How to Use the Actor (via Apify Console)

  1. Log in to Apify Console and navigate to Actors
  2. Find the walmart-reviews-scraper actor and click on it
  3. Configure inputs:
    • Add your Walmart URLs, search URLs, or keywords in the startUrls field
    • Adjust maxComments to set how many reviews per product
    • Set sortOrder to choose review sorting method
    • Configure proxyConfiguration if needed (optional)
  4. Run the actor by clicking the "Start" button
  5. Monitor logs in real-time to see extraction progress
  6. Access results in the OUTPUT tab once the run completes
  7. Export results to JSON, CSV, Excel, or other formats as needed

Using Keywords

You can input simple keywords instead of full URLs:

{
"startUrls": [
"tshirt",
"laptop",
"headphones"
]
}

The actor will automatically convert keywords to Walmart search URLs.

Using Search URLs

Process multiple products from search results:

{
"startUrls": [
{
"url": "https://walmart.com/search?q=tshirt"
}
],
"maxProductsPerStartUrl": 50
}

Using Direct Product URLs

Extract reviews from specific products:

{
"startUrls": [
{
"url": "https://www.walmart.com/ip/product-name/123456"
}
]
}

Best Use Cases

  • Market Research: Analyze customer sentiment and feedback for competitive products
  • Product Analysis: Collect reviews to understand product strengths and weaknesses
  • Review Monitoring: Track reviews for specific products over time
  • Data Aggregation: Build datasets of customer opinions for analysis
  • Quality Assurance: Monitor product quality through customer feedback
  • Content Creation: Gather authentic customer experiences for content marketing

Frequently Asked Questions

How many reviews can I extract?

The maxComments parameter controls reviews per product (default: 10, max: 1000). For unlimited reviews per product, set maxComments to a high value (e.g., 1000), though this may take longer.

What happens if a product has no reviews?

If skipProductsWithoutReviews is enabled (default), products without reviews are skipped automatically. If disabled, the actor will still process the product but extract 0 reviews.

How does proxy fallback work?

The actor starts with no proxy by default. If Walmart blocks a request (status 444 or 403), it automatically:

  1. Tries datacenter proxy
  2. If that fails, tries residential proxy
  3. Retries residential proxy up to 3 times if needed
  4. Continues with the working proxy for all subsequent requests

Can I process multiple products at once?

Yes! You can provide multiple URLs, keywords, or search terms in the startUrls array. The actor processes them sequentially to avoid overwhelming the server.

How long does extraction take?

Extraction time depends on:

  • Number of products
  • Number of reviews per product
  • Network speed and proxy performance
  • Rate limiting delays (1-5 seconds between requests)

Typically, 10 reviews from 20 products takes approximately 5-10 minutes.

Is the data saved in real-time?

Yes! Reviews are saved to the dataset immediately as they're extracted (live saving). This means if the actor is interrupted, you won't lose already-extracted data.

Can I sort reviews by different criteria?

Yes! Use the sortOrder parameter:

  • "relevancy" - Most relevant reviews first
  • "submission-desc" - Newest reviews first
  • "helpful" - Most helpful reviews first
  • "rating-desc" - Highest rated reviews first
  • "rating-asc" - Lowest rated reviews first

What if I get blocked?

The actor automatically handles blocks through proxy fallback. If you're consistently blocked, try:

  1. Enabling Apify proxy in proxyConfiguration
  2. Reducing maxProductsPerStartUrl to process fewer products
  3. Increasing delays (already built-in, but you can reduce concurrency)

Support and Feedback

If you encounter any issues or have suggestions for improvement, please:

  • Check the actor logs for detailed error messages
  • Review the input configuration to ensure all required fields are set
  • Contact Apify support or open an issue in the actor repository

Cautions

  • Legal Compliance: This actor collects data only from publicly available sources on Walmart.com. Ensure your use case complies with Walmart's Terms of Service and applicable data protection laws.
  • Rate Limiting: The actor includes built-in delays to respect Walmart's servers. Avoid running multiple instances simultaneously with aggressive settings.
  • Data Usage: You are responsible for ensuring compliance with data protection regulations (GDPR, CCPA, etc.) when processing personal information from reviews.
  • Terms of Service: Review and comply with Walmart's Terms of Service and robots.txt before using this actor for commercial purposes.
  • Ethical Scraping: Use this actor responsibly and ethically. Do not use scraped data for spam, harassment, or other malicious purposes.

Note: This actor is designed for legitimate research and data collection purposes. Always respect website terms of service and applicable laws when scraping data.