Walmart Scraper — Products, Prices, Ratings & Stock
Pricing
from $3.00 / 1,000 product extracteds
Walmart Scraper — Products, Prices, Ratings & Stock
Pricing
from $3.00 / 1,000 product extracteds
Rating
0.0
(0)
Developer
NexGenData
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
16 hours ago
Last modified
Categories
Share
The fast, reliable Walmart.com product scraper for Apify. Feed it search keywords or product URLs and get back clean, structured JSON — one record per product — with title, price, sale (was) price, rating, review count, brand, seller, availability, image and the Walmart item id. Ready to drop straight into a pricing dashboard, repricer, BI warehouse or quant model.
Walmart is the largest retailer in the United States — ~$640B annual revenue, 4,600+ US stores, and walmart.com is the #2 US e-commerce site after Amazon. It is also one of the richest public price-and-availability surfaces on the web: first-party Walmart inventory and a sprawling third-party marketplace, side by side. This actor turns that surface into a programmatic feed.
What you get per product
| Field | Type | Description |
|---|---|---|
title | string | Product name / title |
price | number | Current selling price |
was_price | number | Original / strikethrough price (only present when the item is on sale) |
currency | string | Currency code (e.g. USD) |
rating | number | Average star rating (0–5) |
reviews_count | int | Number of customer reviews |
walmart_id | string | Walmart US item id (usItemId) — the stable join key |
brand | string | Brand name |
seller | string | Seller — Walmart.com for first-party, or the marketplace seller name |
availability | string | Stock status (IN_STOCK / OUT_OF_STOCK / display string) |
url | string | Canonical Walmart product URL |
image_url | string | Primary product image URL |
category | string | Product category / type (when exposed) |
query | string | The keyword or URL that produced this record |
mode | string | keyword_search or product_url |
data_source | string | Provenance (httpx vs playwright) |
as_of_timestamp | string | UTC ISO-8601 capture time |
Alongside the product rows, the actor emits a per-query summary record (record_type: "query_summary") for each keyword with the product count and avg_price — handy for at-a-glance category price benchmarking without re-aggregating downstream.
Two input modes
Search keywords
Provide a keywords array (e.g. ["coffee maker", "air fryer", "running shoes"]). Each keyword runs a Walmart search across multiple result pages and returns one record per product, up to maxItems. Use this for category sweeps, competitor catalogue discovery, or price benchmarking.
Product URLs
Provide a productUrls array of exact Walmart product pages (e.g. https://www.walmart.com/ip/.../121002347). Each URL returns a single, detailed product record. Use this for watchlists, repricing, and tracking specific SKUs over time.
You can supply both in one run.
How the actor handles Walmart's anti-bot wall
Walmart defends walmart.com with PerimeterX / bot-detection on datacenter IPs. The actor uses a resilient waterfall:
httpx+ realistic browser headers (warmed session) — pulls the homepage first to pick up cookies, then requests the search / product page and parses the embedded__NEXT_DATA__JSON blob (the same hydration state Walmart's own front-end uses). This is the reliable, structured path — not brittle DOM scraping.- Playwright headless Chromium fallback — with stealth countermeasures (
navigator.webdriverpatch, realistic locale / timezone / viewport, plugin spoofing) for when the static HTML is blocked or JS-gated. - Graceful no-result handling — if every strategy is blocked, the actor pushes a single
statusrow explaining the situation (no charge applied) rather than crashing your pipeline.
All paths run behind Apify's RESIDENTIAL proxy pool by default, which is strongly recommended for Walmart.
How buyers actually use this actor
- Repricing / price intelligence — daily
productUrlspull of a SKU watchlist to feed a dynamic-pricing engine;was_priceexposes live markdown depth. - Brand & MAP monitoring —
keyword_searchacross a brand portfolio to catch unauthorized marketplace sellers and minimum-advertised-price violations. - Category research — weekly keyword sweeps with the
query_summaryrows to track average price and assortment depth per category over time. - Quant / consumer-discretionary desks — Walmart price & availability as an alt-data signal joinable by
walmart_id. - Affiliate & comparison sites — keep a Walmart price/stock column fresh next to other retailers.
Related NexGenData e-commerce actors
Pair the Walmart Scraper with the rest of the NexGenData retail-intelligence fleet:
- 🔗 Amazon Product Scraper — the #1 US e-commerce surface; price, BSR, ratings & buy-box.
- 🔗 Google Shopping Scraper — cross-retailer price comparison from a single query.
- 🔗 eBay Sold Comps — real sold-listing comps for resale & valuation.
- 🏢 About NexGenData — the full fleet of production e-commerce, finance and social-data scrapers.
Run Walmart + Amazon + Google Shopping on the same SKU list to build a complete cross-retailer price-and-stock matrix.
Get 20% off your first 3 months on Apify
New to Apify? Use this referral link to sign up and get 20% off for 3 months, then 30% ongoing. Pairs cleanly with any of the e-commerce actors above.
Input parameters
keywords— array of Walmart search terms (keyword_searchmode).productUrls— array of exact Walmart product page URLs (product_urlmode).maxItems— max product records per run (1–1000; default 50).proxyConfiguration— Apify proxy (defaults to RESIDENTIAL — strongly recommended).
Disclaimer
This actor collects only publicly visible Walmart.com product data using the same page endpoints any logged-out walmart.com visitor reaches. It does not bypass logins, paywalls or private data. Buyers are responsible for compliance with Walmart's Terms of Service and applicable data regulations in their jurisdiction.