Walmart Scraper — Products, Prices, Ratings & Stock avatar

Walmart Scraper — Products, Prices, Ratings & Stock

Pricing

from $3.00 / 1,000 product extracteds

Go to Apify Store
Walmart Scraper — Products, Prices, Ratings & Stock

Walmart Scraper — Products, Prices, Ratings & Stock

Pricing

from $3.00 / 1,000 product extracteds

Rating

0.0

(0)

Developer

NexGenData

NexGenData

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

16 hours ago

Last modified

Share

The fast, reliable Walmart.com product scraper for Apify. Feed it search keywords or product URLs and get back clean, structured JSON — one record per product — with title, price, sale (was) price, rating, review count, brand, seller, availability, image and the Walmart item id. Ready to drop straight into a pricing dashboard, repricer, BI warehouse or quant model.

Walmart is the largest retailer in the United States — ~$640B annual revenue, 4,600+ US stores, and walmart.com is the #2 US e-commerce site after Amazon. It is also one of the richest public price-and-availability surfaces on the web: first-party Walmart inventory and a sprawling third-party marketplace, side by side. This actor turns that surface into a programmatic feed.


What you get per product

FieldTypeDescription
titlestringProduct name / title
pricenumberCurrent selling price
was_pricenumberOriginal / strikethrough price (only present when the item is on sale)
currencystringCurrency code (e.g. USD)
ratingnumberAverage star rating (0–5)
reviews_countintNumber of customer reviews
walmart_idstringWalmart US item id (usItemId) — the stable join key
brandstringBrand name
sellerstringSeller — Walmart.com for first-party, or the marketplace seller name
availabilitystringStock status (IN_STOCK / OUT_OF_STOCK / display string)
urlstringCanonical Walmart product URL
image_urlstringPrimary product image URL
categorystringProduct category / type (when exposed)
querystringThe keyword or URL that produced this record
modestringkeyword_search or product_url
data_sourcestringProvenance (httpx vs playwright)
as_of_timestampstringUTC ISO-8601 capture time

Alongside the product rows, the actor emits a per-query summary record (record_type: "query_summary") for each keyword with the product count and avg_price — handy for at-a-glance category price benchmarking without re-aggregating downstream.


Two input modes

Search keywords

Provide a keywords array (e.g. ["coffee maker", "air fryer", "running shoes"]). Each keyword runs a Walmart search across multiple result pages and returns one record per product, up to maxItems. Use this for category sweeps, competitor catalogue discovery, or price benchmarking.

Product URLs

Provide a productUrls array of exact Walmart product pages (e.g. https://www.walmart.com/ip/.../121002347). Each URL returns a single, detailed product record. Use this for watchlists, repricing, and tracking specific SKUs over time.

You can supply both in one run.


How the actor handles Walmart's anti-bot wall

Walmart defends walmart.com with PerimeterX / bot-detection on datacenter IPs. The actor uses a resilient waterfall:

  1. httpx + realistic browser headers (warmed session) — pulls the homepage first to pick up cookies, then requests the search / product page and parses the embedded __NEXT_DATA__ JSON blob (the same hydration state Walmart's own front-end uses). This is the reliable, structured path — not brittle DOM scraping.
  2. Playwright headless Chromium fallback — with stealth countermeasures (navigator.webdriver patch, realistic locale / timezone / viewport, plugin spoofing) for when the static HTML is blocked or JS-gated.
  3. Graceful no-result handling — if every strategy is blocked, the actor pushes a single status row explaining the situation (no charge applied) rather than crashing your pipeline.

All paths run behind Apify's RESIDENTIAL proxy pool by default, which is strongly recommended for Walmart.


How buyers actually use this actor

  • Repricing / price intelligence — daily productUrls pull of a SKU watchlist to feed a dynamic-pricing engine; was_price exposes live markdown depth.
  • Brand & MAP monitoringkeyword_search across a brand portfolio to catch unauthorized marketplace sellers and minimum-advertised-price violations.
  • Category research — weekly keyword sweeps with the query_summary rows to track average price and assortment depth per category over time.
  • Quant / consumer-discretionary desks — Walmart price & availability as an alt-data signal joinable by walmart_id.
  • Affiliate & comparison sites — keep a Walmart price/stock column fresh next to other retailers.

Pair the Walmart Scraper with the rest of the NexGenData retail-intelligence fleet:

Run Walmart + Amazon + Google Shopping on the same SKU list to build a complete cross-retailer price-and-stock matrix.


Get 20% off your first 3 months on Apify

New to Apify? Use this referral link to sign up and get 20% off for 3 months, then 30% ongoing. Pairs cleanly with any of the e-commerce actors above.


Input parameters

  • keywords — array of Walmart search terms (keyword_search mode).
  • productUrls — array of exact Walmart product page URLs (product_url mode).
  • maxItems — max product records per run (1–1000; default 50).
  • proxyConfiguration — Apify proxy (defaults to RESIDENTIAL — strongly recommended).

Disclaimer

This actor collects only publicly visible Walmart.com product data using the same page endpoints any logged-out walmart.com visitor reaches. It does not bypass logins, paywalls or private data. Buyers are responsible for compliance with Walmart's Terms of Service and applicable data regulations in their jurisdiction.