Walmart Reviews Scraper
Pricing
$14.99/month + usage
Scrapes customer reviews from Walmart product pages, capturing reviewer names, ratings, comments, timestamps, helpful votes, and review URLs. Ideal for sentiment analysis, product research, competitor tracking, and large-scale Walmart review data extraction.
Developer: Scrapio
Extract customer reviews from Walmart.com products with ease. This Apify Actor allows you to scrape product reviews from individual product pages or search results, supporting bulk processing of multiple products and keywords.
Why Choose Us?
- Intelligent Proxy Management: Automatically handles proxy fallback (no proxy → datacenter → residential) to ensure successful data extraction even when Walmart blocks requests
- Flexible Input Options: Supports product URLs, search URLs, or simple keywords - process hundreds of products in a single run
- Live Data Saving: Reviews are saved immediately as they're extracted, so you never lose data even if the actor is interrupted
- Human-like Behavior: Built-in delays and rate limiting to mimic natural browsing patterns and reduce detection risk
- Comprehensive Logging: Detailed progress updates keep you informed throughout the extraction process
- Robust Error Handling: Automatic retries with exponential backoff ensure maximum data collection success
Key Features
- ✅ Multiple Input Types: Process direct product URLs, search URLs, or keywords
- ✅ Bulk Processing: Extract reviews from multiple products in a single run
- ✅ Customizable Sorting: Sort reviews by relevancy, date, helpfulness, or rating
- ✅ Smart Proxy Fallback: Automatic proxy switching when requests are blocked
- ✅ Pagination Support: Automatically handles multiple pages of reviews
- ✅ Rate Limiting: Built-in delays to avoid overwhelming target servers
- ✅ Live Saving: Data saved in real-time to prevent loss
- ✅ Skip Empty Products: Option to skip products without reviews for faster processing
Input
The actor accepts the following input parameters:
Example Input
{"startUrls": [{"url": "https://walmart.com/search?q=tshirt"},{"url": "https://www.walmart.com/ip/product-name/123456"},"laptop"],"sortOrder": "relevancy","maxComments": 10,"maxProductsPerStartUrl": 20,"skipProductsWithoutReviews": true,"proxyConfiguration": {"useApifyProxy": false}}
Input Fields
| Field | Type | Required | Description |
|---|---|---|---|
| startUrls | Array | ✅ Yes | List of Walmart product URLs, search URLs, or keywords. Supports bulk input. Examples: • Product URL: https://www.walmart.com/ip/product-name/123456 • Search URL: https://walmart.com/search?q=tshirt • Keyword: "laptop" (will be converted to a search URL) |
| sortOrder | String | ❌ No | How to sort the reviews. Options: • "relevancy" - Most relevant (default) • "submission-desc" - Newest first • "helpful" - Most helpful • "rating-desc" - Highest rating first • "rating-asc" - Lowest rating first |
| maxComments | Integer | ❌ No | Maximum number of reviews to extract per product. Range: 1-1000. Default: 10 |
| maxProductsPerStartUrl | Integer | ❌ No | Maximum number of products to process from each start URL (for search URLs). Set to 0 for unlimited. Range: 0-1000. Default: 20 |
| skipProductsWithoutReviews | Boolean | ❌ No | If enabled, products with no reviews will be skipped to speed up processing. Default: true |
| proxyConfiguration | Object | ❌ No | Proxy settings. By default, starts with no proxy. If Walmart blocks requests, automatically falls back to datacenter proxy, then residential proxy with retries. Default: {"useApifyProxy": false} |
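The defaults and ranges in the table can be checked before a run is started. The following is a minimal sketch, not part of the actor: the helper name `build_input` is hypothetical, and it only encodes the field names, defaults, and ranges listed above.

```python
import copy
from typing import Any

# Values taken from the Input Fields table above.
SORT_ORDERS = {"relevancy", "submission-desc", "helpful", "rating-desc", "rating-asc"}
DEFAULTS = {
    "sortOrder": "relevancy",
    "maxComments": 10,
    "maxProductsPerStartUrl": 20,
    "skipProductsWithoutReviews": True,
    "proxyConfiguration": {"useApifyProxy": False},
}


def build_input(start_urls: list, **overrides: Any) -> dict:
    """Hypothetical helper: merge overrides onto the documented defaults
    and reject values outside the documented ranges."""
    if not start_urls:
        raise ValueError("startUrls is required and must not be empty")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown input fields: {sorted(unknown)}")

    run_input = {"startUrls": list(start_urls), **copy.deepcopy(DEFAULTS), **overrides}

    if run_input["sortOrder"] not in SORT_ORDERS:
        raise ValueError(f"sortOrder must be one of {sorted(SORT_ORDERS)}")
    if not 1 <= run_input["maxComments"] <= 1000:
        raise ValueError("maxComments must be between 1 and 1000")
    if not 0 <= run_input["maxProductsPerStartUrl"] <= 1000:
        raise ValueError("maxProductsPerStartUrl must be between 0 and 1000")
    return run_input
```

For example, `build_input(["laptop"], maxComments=50)` returns a complete input object with every other field at its documented default.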
Proxy Fallback Behavior
The actor implements intelligent proxy fallback:
- No Proxy (default): Starts with a direct connection if useApifyProxy: false
- Datacenter Proxy: If a request is blocked (444/403), automatically switches to datacenter proxy (SHADER group)
- Residential Proxy: If the datacenter proxy fails, falls back to residential proxy (RESIDENTIAL group)
- Retry Logic: On residential proxy failure, retries up to 3 times before giving up
- Persistence: Once a fallback occurs, continues using the selected proxy for all remaining requests
All proxy events are logged clearly for monitoring.
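The fallback sequence described above can be pictured as a small state machine. This is an illustrative sketch only, not the actor's source code: the `ProxyFallback` class and the `fetch` callback are hypothetical, and it assumes "retries up to 3 times" means three further attempts after the first residential failure.

```python
from typing import Callable, Optional

# Tiers in fallback order; None stands for a direct connection.
TIERS = [None, "SHADER", "RESIDENTIAL"]
BLOCKED_STATUSES = {403, 444}
RESIDENTIAL_RETRIES = 3


class ProxyFallback:
    """Hypothetical model of the no proxy -> datacenter -> residential fallback."""

    def __init__(self, fetch: Callable[[str, Optional[str]], int]) -> None:
        # fetch(url, proxy_group) is an assumed callback returning an HTTP status.
        self.fetch = fetch
        self.tier = 0  # kept between requests, which gives the persistence rule

    def request(self, url: str) -> int:
        while True:
            status = self.fetch(url, TIERS[self.tier])
            if status not in BLOCKED_STATUSES:
                return status
            if self.tier < len(TIERS) - 1:
                self.tier += 1  # escalate to the next proxy tier
                continue
            # Already on the residential tier: retry a fixed number of times.
            for _ in range(RESIDENTIAL_RETRIES):
                status = self.fetch(url, TIERS[self.tier])
                if status not in BLOCKED_STATUSES:
                    return status
            raise RuntimeError(f"blocked on all proxy tiers: {url}")
```

Because `tier` is stored on the object, a request made after a fallback starts directly on the proxy that last worked instead of retrying the direct connection.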
Output
The actor outputs structured review data to the Apify dataset. Each review contains the following fields:
Example Output
{"itemType": "Review","productUrl": "https://www.walmart.com/ip/product-name/123456","rating": 5,"title": "Great product!","text": "I've been using this product for a month and it's amazing. Highly recommend!"}
Output Fields
| Field | Type | Description |
|---|---|---|
| itemType | String | Always "Review" to identify the data type |
| productUrl | String | Full URL of the Walmart product page |
| rating | Number | Customer rating (typically 1-5 stars) |
| title | String | Review title/headline |
| text | String | Full review text content |
Dataset View
The output dataset includes a default table view with the following columns:
- Type: Review identifier
- Product URL: Clickable link to the product page
- Rating: Star rating (1-5)
- Review Title: Review headline
- Review Text: Full review content
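As a small example of working with this output, the sketch below averages the rating per product from a list of exported dataset items. It assumes only the five fields documented above; the function name `average_rating_by_product` is illustrative.

```python
from collections import defaultdict
from statistics import mean


def average_rating_by_product(items: list) -> dict:
    """Group review items by productUrl and average their ratings."""
    ratings = defaultdict(list)
    for item in items:
        if item.get("itemType") != "Review":
            continue  # ignore anything that is not a review record
        url = item.get("productUrl")
        rating = item.get("rating")
        if url and isinstance(rating, (int, float)):
            ratings[url].append(rating)
    return {url: round(mean(values), 2) for url, values in ratings.items()}
```

Items without a numeric rating or a product URL are skipped rather than counted as zero, so missing values do not drag the average down.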
🚀 How to Use the Actor (via Apify Console)
- Log in to Apify Console and navigate to Actors
- Find the walmart-reviews-scraper actor and click on it
- Configure inputs:
  - Add your Walmart URLs, search URLs, or keywords in the startUrls field
  - Adjust maxComments to set how many reviews per product
  - Set sortOrder to choose review sorting method
  - Configure proxyConfiguration if needed (optional)
- Run the actor by clicking the "Start" button
- Monitor logs in real-time to see extraction progress
- Access results in the OUTPUT tab once the run completes
- Export results to JSON, CSV, Excel, or other formats as needed
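The same steps can also be scripted instead of clicked through. The sketch below assumes the official `apify-client` Python package and its `actor(...).call(...)` and `dataset(...).iterate_items()` methods. The actor ID and token in the usage comment are placeholders, because this page does not state the actor's full ID. The client is passed in as an argument so the function can be exercised with a stub.

```python
from typing import Any


def fetch_reviews(client: Any, actor_id: str, run_input: dict) -> list:
    """Start a run, wait for it to finish, and return the dataset items.

    `client` is expected to behave like apify_client.ApifyClient.
    """
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("actor run did not return run details")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage with the real client (requires `pip install apify-client`):
#   from apify_client import ApifyClient
#   client = ApifyClient("<YOUR_APIFY_TOKEN>")          # placeholder token
#   reviews = fetch_reviews(
#       client,
#       "<username>/walmart-reviews-scraper",           # placeholder actor ID
#       {"startUrls": ["laptop"], "maxComments": 10},
#   )
```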
Using Keywords
You can input simple keywords instead of full URLs:
{"startUrls": ["tshirt","laptop","headphones"]}
The actor will automatically convert keywords to Walmart search URLs.
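The conversion is presumably a simple URL-encoding step. The sketch below shows one plausible way to do it, modeled on the search URL format used in the examples on this page; the actor's actual implementation is not documented here, and `to_start_url` is a hypothetical name.

```python
from urllib.parse import quote_plus

# Assumed base, copied from the search URL example in this README.
SEARCH_BASE = "https://walmart.com/search?q="


def to_start_url(entry: str) -> str:
    """Return URLs unchanged; turn a bare keyword into a search URL."""
    entry = entry.strip()
    if entry.startswith(("http://", "https://")):
        return entry
    return SEARCH_BASE + quote_plus(entry)
```

With this sketch, `to_start_url("gaming laptop")` yields `https://walmart.com/search?q=gaming+laptop`, while full product or search URLs pass through untouched.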
Using Search URLs
Process multiple products from search results:
{"startUrls": [{"url": "https://walmart.com/search?q=tshirt"}],"maxProductsPerStartUrl": 50}
Using Direct Product URLs
Extract reviews from specific products:
{"startUrls": [{"url": "https://www.walmart.com/ip/product-name/123456"}]}
Best Use Cases
- Market Research: Analyze customer sentiment and feedback for competitive products
- Product Analysis: Collect reviews to understand product strengths and weaknesses
- Review Monitoring: Track reviews for specific products over time
- Data Aggregation: Build datasets of customer opinions for analysis
- Quality Assurance: Monitor product quality through customer feedback
- Content Creation: Gather authentic customer experiences for content marketing
Frequently Asked Questions
How many reviews can I extract?
The maxComments parameter controls reviews per product (default: 10, max: 1000). There is no unlimited setting; to collect as many reviews as possible per product, set maxComments to the maximum of 1000, though this will take longer.
What happens if a product has no reviews?
If skipProductsWithoutReviews is enabled (default), products without reviews are skipped automatically. If disabled, the actor will still process the product but extract 0 reviews.
How does proxy fallback work?
The actor starts with no proxy by default. If Walmart blocks a request (status 444 or 403), it automatically:
- Tries datacenter proxy
- If that fails, tries residential proxy
- Retries residential proxy up to 3 times if needed
- Continues with the working proxy for all subsequent requests
Can I process multiple products at once?
Yes! You can provide multiple URLs, keywords, or search terms in the startUrls array. The actor processes them sequentially to avoid overwhelming the server.
How long does extraction take?
Extraction time depends on:
- Number of products
- Number of reviews per product
- Network speed and proxy performance
- Rate limiting delays (1-5 seconds between requests)
Typically, extracting 10 reviews from each of 20 products takes approximately 5-10 minutes.
Is the data saved in real-time?
Yes! Reviews are saved to the dataset immediately as they're extracted (live saving). This means if the actor is interrupted, you won't lose already-extracted data.
Can I sort reviews by different criteria?
Yes! Use the sortOrder parameter:
- "relevancy" - Most relevant reviews first
- "submission-desc" - Newest reviews first
- "helpful" - Most helpful reviews first
- "rating-desc" - Highest rated reviews first
- "rating-asc" - Lowest rated reviews first
What if I get blocked?
The actor automatically handles blocks through proxy fallback. If you're consistently blocked, try:
- Enabling Apify proxy in proxyConfiguration
- Reducing maxProductsPerStartUrl to process fewer products
- Increasing delays (already built-in, but you can reduce concurrency)
Support and Feedback
If you encounter any issues or have suggestions for improvement, please:
- Check the actor logs for detailed error messages
- Review the input configuration to ensure all required fields are set
- Contact Apify support or open an issue in the actor repository
Cautions
- Legal Compliance: This actor collects data only from publicly available sources on Walmart.com. Ensure your use case complies with Walmart's Terms of Service and applicable data protection laws.
- Rate Limiting: The actor includes built-in delays to respect Walmart's servers. Avoid running multiple instances simultaneously with aggressive settings.
- Data Usage: You are responsible for ensuring compliance with data protection regulations (GDPR, CCPA, etc.) when processing personal information from reviews.
- Terms of Service: Review and comply with Walmart's Terms of Service and robots.txt before using this actor for commercial purposes.
- Ethical Scraping: Use this actor responsibly and ethically. Do not use scraped data for spam, harassment, or other malicious purposes.
Note: This actor is designed for legitimate research and data collection purposes. Always respect website terms of service and applicable laws when scraping data.