๐ Walmart Data Extractor
Pricing
from $4.99 / 1,000 results
๐ Walmart Data Extractor
๐ Walmart Data Extractor pulls product details, pricing, ratings & availability from Walmart for fast market research. ๐ Automate leads, monitor competitors & track trends with reliable data. ๐ Great for B2B insights & analytics.
Pricing
from $4.99 / 1,000 results
Rating
0.0
(0)
Developer
Scraper Engine
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
4 days ago
Last modified
Categories
Share
Extract rich, structured product data from Walmart.com at scale. Feed it
category pages, search pages, or product (/ip/) URLs โ or just a keyword โ and
get back prices, images, brand, full specifications, ratings, seller info, and
much more. Built for reliability with automatic proxy escalation, anti-bot
browser impersonation, retries, and real-time dataset saving.
โจ Why Choose This Actor?
- ๐ Bulk URLs โ mix category, search and product URLs in a single run.
- ๐ก๏ธ Smart proxy escalation โ starts direct, falls back to datacenter, then residential automatically, and sticks with residential once it has to.
- ๐งฐ Anti-bot by design โ uses
impitbrowser impersonation (real TLS/HTTP fingerprints) instead of heavy headless browsers. - ๐พ Live results โ products stream into the output table as they're scraped, grouped by source section, so a mid-run stop never loses data.
- โญ Reviews & specs โ opt into reviews and get full
idmlspecifications. - ๐งฉ Customizable output โ reshape every record with your own Python hooks.
๐ Key Features
| Feature | Description |
|---|---|
| Category scraping | Auto-paginates browse/category pages |
| Search scraping | Search pages or a raw keyword |
| Product detail | Direct /ip/ URL extraction |
| Reviews | includeReviews / onlyReviews |
| Limits | maxItems (global) and endPage |
| Location | Best-effort zipCode targeting |
| Proxy | direct โ datacenter โ residential (sticky) |
๐ฅ Input
{"startUrls": [{ "url": "https://www.walmart.com/browse/auto-tires/brake-pads/91083_1074765_9038935_4670095_4582920" }],"search": "laptop","maxItems": 10,"endPage": null,"zipCode": "10001","includeReviews": false,"onlyReviews": false,"proxy": { "useApifyProxy": false }}
| Field | Type | Description |
|---|---|---|
startUrls | array | Walmart category / search / product URLs (bulk). Required. |
search | string | Keyword โ converted to a search URL. |
maxItems | integer | Cap on total products. Empty = no limit. |
endPage | integer | Last category/search page to read. |
zipCode | string | US ZIP for localized pricing/availability. |
postalCode | integer | โ ๏ธ Deprecated โ use zipCode. |
includeReviews | boolean | Attach reviews to each product. |
onlyReviews | boolean | Keep only reviews + identifiers. |
extendOutputFunction | string | Python def extendOutputFunction(product) โ dict merged in. |
outputFilterFunction | string | Python def outputFilterFunction(product) โ reshape/drop. |
proxy | object | Proxy config. Default: no proxy (auto-escalates on block). |
๐ค Output
Each product is pushed as one dataset row with the full Walmart product object plus convenience columns for the table view:
{"name": "MAX Advanced Brakes - Brake Kit ...","brand": "Max Advanced Brakes","priceString": "$194.99","price": 194.99,"availabilityStatus": "IN_STOCK","usItemId": "1902495893","productUrl": "https://www.walmart.com/ip/.../1902495893","imageUrl": "https://i5.walmartimages.com/seo/...jpeg","sourceSection": "browse_auto_tires","sourceUrl": "https://www.walmart.com/browse/...","priceInfo": { "currentPrice": { "price": 194.99, "priceString": "$194.99" } },"idml": { "specifications": { }, "longDescription": "..." },"reviews": null}
A structured, per-section summary (mirroring results_by_url) is also written to
the key-value store as OUTPUT.
๐ How to Use (Apify Console)
- Log in at https://console.apify.com โ Actors.
- Open Walmart Data Extractor.
- Paste your Walmart URLs (or a keyword), set
maxItems, and configure proxy. - Click Start.
- Watch products stream into the run log and Output tab in real time.
- Export to JSON / CSV / XLSX when done.
๐ค Use via API
curl -X POST "https://api.apify.com/v2/acts/<ACTOR_ID>/run-sync-get-dataset-items?token=$APIFY_TOKEN" \-H "Content-Type: application/json" \-d '{"startUrls":[{"url":"https://www.walmart.com/search?q=laptop"}],"maxItems":10}'
๐ฏ Best Use Cases
- ๐ฐ Price monitoring & repricing
- ๐ Catalog & assortment analysis
- ๐ Competitor & market research
- ๐ท๏ธ Brand / seller tracking
๐ณ Pricing
This actor uses the pay-per-event model. The primary event is
row_result, charged once per product saved to the dataset. Platform startup is
covered by the synthetic apify-actor-start event. You only pay for the products
you actually receive.
โ FAQ
Which URLs are supported? Category/browse pages, search pages, and product
(/ip/) pages.
Do I need a proxy? No. The actor runs direct by default and only escalates to datacenter then residential proxies if Walmart blocks the request.
Can I limit the run? Yes โ use maxItems for a global cap and endPage to
stop pagination early.
Why are some fields null? Walmart omits fields per product; reviews are only
attached when includeReviews/onlyReviews is enabled.
โ๏ธ Cautions / Legal
- Data is collected only from publicly available Walmart pages.
- You are responsible for compliance with Walmart's ToS and applicable laws (GDPR, CCPA, etc.). Use reasonable rate limits and scrape responsibly.
๐ Support & Feedback
Open an issue on the Actor's Issues tab with your run ID and input, and we'll take a look.