# Walmart Product Scraper (`shahidirfan/walmart-product-scraper`) Actor

Scrape Walmart products at scale. Extract pricing, ratings, availability & product details. Real-time data for competitive intelligence, price monitoring & market research. Fast, reliable, production-ready.

- **URL**: https://apify.com/shahidirfan/walmart-product-scraper.md
- **Developed by:** [Shahid Irfan](https://apify.com/shahidirfan) (community)
- **Categories:** E-commerce, Automation, Developer tools
- **Stats:** 3 total users, 2 monthly users, 100.0% runs succeeded
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
"Actor" is always written with a capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows (PowerShell)
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Walmart Product Listing Scraper

Extract Walmart product listing data from search, category, brand, and similar listing URLs, or scrape by keyword. Collect product metadata, pricing, ratings, seller info, availability, and source pagination context in a clean dataset.

### Features

- **URL and keyword support** - Run from listing URLs or a simple keyword.
- **Multiple listing types** - Works with search, browse/category, brand, and tag-like listing pages.
- **Pagination control** - Limit pages per URL and total records for stable runs.
- **Location-aware options** - Optional ZIP code and store ID inputs.
- **Clean output records** - Empty/null values are removed from dataset items.
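
The "clean output records" behavior can be pictured with a small sketch. The README does not say exactly which values count as empty, so the set used here (`None`, empty strings, empty lists, empty dicts) is an assumption, and `strip_empty` is a hypothetical helper, not part of the Actor.

```python
from typing import Any


def strip_empty(record: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of the record without empty values.

    Assumption: "empty" means None, "", [] or {}. Zero and False are kept,
    because they are meaningful values for fields like price or isSponsored.
    """
    empty_values = (None, "", [], {})
    return {key: value for key, value in record.items() if value not in empty_values}


if __name__ == "__main__":
    raw = {"name": "Example product", "brand": None, "price": 0, "sellerName": ""}
    print(strip_empty(raw))
```

A consequence for consumers is that a missing key and an empty value look the same, so downstream code should read fields defensively.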

### Use Cases

#### Product Research
Track product mixes, prices, and ratings for specific keywords or categories.

#### Competitive Monitoring
Monitor listing visibility, seller presence, and sponsored placements.

#### Merchandising Analysis
Build datasets for assortment and pricing analysis across listing pages.

#### Data Pipelines
Export structured listing data to sheets, BI tools, or internal APIs.

---

### Input Parameters

| Parameter | Type | Required | Default | Description |
|-----------|------|----------|---------|-------------|
| `urls` | String | No | `https://www.walmart.com/search?q=iphone` | One or more Walmart listing URLs (comma/newline separated). |
| `keyword` | String | No | `iphone` | Keyword used when `urls` is not provided. |
| `results_wanted` | Integer | No | `20` | Maximum number of products to save. |
| `max_pages` | Integer | No | `5` | Maximum pages to process per seed URL. |
| `sort` | String | No | `best_match` | Listing sort mode. |
| `zip_code` | String | No | `10001` | Optional ZIP code for localized results. |
| `store_id` | String | No | — | Optional store ID for local context. |
| `proxyConfiguration` | Object | No | Apify Proxy enabled | Proxy settings for reliability. |
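
As a rough illustration, the input could be assembled and checked in Python before a run. This is a minimal sketch: `build_input` is a hypothetical helper, the defaults are copied from the table above, the lower bound of 1 comes from the OpenAPI input schema further down, and `urls` is passed as a list because the input schema section types it as an array.

```python
from typing import Any, Optional


def build_input(
    urls: Optional[list[str]] = None,
    keyword: Optional[str] = None,
    results_wanted: int = 20,
    max_pages: int = 5,
) -> dict[str, Any]:
    """Build an Actor input dict, rejecting limits below the schema minimum of 1."""
    if results_wanted < 1:
        raise ValueError("results_wanted must be at least 1")
    if max_pages < 1:
        raise ValueError("max_pages must be at least 1")

    run_input: dict[str, Any] = {
        "results_wanted": results_wanted,
        "max_pages": max_pages,
    }
    # Only include the optional seeds that were actually provided.
    if urls:
        run_input["urls"] = list(urls)
    if keyword:
        run_input["keyword"] = keyword
    return run_input


if __name__ == "__main__":
    print(build_input(keyword="iphone", max_pages=3))
```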

---

### Output Data

| Field | Type | Description |
|-------|------|-------------|
| `itemId` | String | Walmart US item ID. |
| `productId` | String | Walmart product identifier. |
| `name` | String | Product title. |
| `brand` | String | Brand name. |
| `productUrl` | String | Absolute product URL. |
| `price` | Number | Current product price. |
| `priceDisplay` | String | Formatted price text. |
| `averageRating` | Number | Average product rating. |
| `reviewCount` | Number | Number of product reviews. |
| `sellerName` | String | Seller display name. |
| `availabilityStatus` | String | Availability status code. |
| `imageUrl` | String | Primary image URL. |
| `isSponsored` | Boolean | Sponsored listing flag. |
| `sourceUrl` | String | Listing URL used for extraction. |
| `sourcePage` | Number | Listing page number. |

---

### Usage Examples

#### Keyword Run

```json
{
  "keyword": "iphone",
  "results_wanted": 20,
  "max_pages": 3
}
````

#### Search URL Run

```json
{
  "urls": "https://www.walmart.com/search?q=iphone",
  "results_wanted": 60,
  "max_pages": 4
}
```

#### Multiple URLs

```json
{
  "urls": "https://www.walmart.com/search?q=iphone\nhttps://www.walmart.com/browse/electronics/cell-phones/1105910_7551331_1101612",
  "results_wanted": 100,
  "max_pages": 5
}
```

***

### Sample Output

```json
{
  "itemId": "17687958230",
  "productId": "3EHV9SO2UGR7",
  "name": "Verizon Prepaid Apple iPhone 13, 128GB, Black - Prepaid Smartphone [Locked to Verizon Prepaid]",
  "brand": "Apple",
  "productUrl": "https://www.walmart.com/ip/VP-APPLE-IPHONE-13-128GB-5G-MIDNIGHT-PIB-HANDSET-WALMART-POSA/17687958230?classType=REGULAR",
  "price": 249,
  "priceDisplay": "$249.00",
  "averageRating": 4.3,
  "reviewCount": 82,
  "sellerName": "Walmart.com",
  "availabilityStatus": "IN_STOCK",
  "imageUrl": "https://i5.walmartimages.com/seo/VP-APPLE-IPHONE-13-128GB-5G-MIDNIGHT-PIB-HANDSET-WALMART-POSA_1c1e8882-6647-4f95-b6da-39183898618c.0b325bb079e2cd662542f5c50717c54a.jpeg",
  "isSponsored": false,
  "sourceUrl": "https://www.walmart.com/search?q=iphone&page=1",
  "sourcePage": 1
}
```
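
Records shaped like the sample above can be filtered with plain Python. The sketch below picks the cheapest organic, in-stock listings. `cheapest_in_stock` is a hypothetical helper, and `"IN_STOCK"` is the only status value confirmed by the sample, so any other status is simply treated as not in stock.

```python
from typing import Any


def cheapest_in_stock(items: list[dict[str, Any]], limit: int = 5) -> list[dict[str, Any]]:
    """Return up to `limit` non-sponsored, in-stock items, cheapest first."""
    candidates = [
        item
        for item in items
        if item.get("availabilityStatus") == "IN_STOCK"
        and not item.get("isSponsored", False)
        and "price" in item  # the price field may be absent on some records
    ]
    return sorted(candidates, key=lambda item: item["price"])[:limit]
```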

***

### Tips For Best Results

#### Start With Smaller Limits

- Use `results_wanted: 20` and `max_pages: 3` for validation runs.
- Increase limits after confirming your target URL pattern works.

#### Prefer Clean Listing URLs

- Use direct Walmart listing pages.
- Avoid redirect or tracking-heavy links when possible.

#### Use Proxy For Stability

- Keep Apify Proxy enabled for better reliability.
- Use residential groups for tougher runs.
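
A proxy setting for tougher runs might look like the sketch below. It uses the `useApifyProxy` and `apifyProxyGroups` keys from Apify's standard proxy input format. Whether this Actor honors the `RESIDENTIAL` group is an assumption, and `with_residential_proxy` is a hypothetical helper.

```python
from typing import Any


def with_residential_proxy(run_input: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of the input with Apify Proxy enabled on residential IPs."""
    updated = dict(run_input)  # shallow copy so the caller's dict is untouched
    updated["proxyConfiguration"] = {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],  # assumption: available on your plan
    }
    return updated
```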

***

### Integrations

Connect output data with:

- **Google Sheets** - Quick review and team sharing.
- **Airtable** - Listing catalog workflows.
- **Make** - Automated downstream actions.
- **Zapier** - Event-based integrations.
- **Webhooks** - Direct delivery to your services.

#### Export Formats

- **JSON** - Developer workflows.
- **CSV** - Spreadsheet analysis.
- **Excel** - Reporting.
- **XML** - Legacy integrations.
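
If you already have the items in memory, a CSV can also be produced locally. This is a minimal sketch using only the standard library. `items_to_csv` is a hypothetical helper, and missing fields are written as empty cells because the Actor omits empty values from records.

```python
import csv
import io
from typing import Any


def items_to_csv(items: list[dict[str, Any]], fieldnames: list[str]) -> str:
    """Serialize items to CSV text, leaving cells blank where a field is missing."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=fieldnames,
        restval="",             # value used when a record lacks a column
        extrasaction="ignore",  # drop keys that are not in fieldnames
    )
    writer.writeheader()
    writer.writerows(items)
    return buffer.getvalue()
```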

***

### Frequently Asked Questions

#### Can I run by keyword only?

Yes. Provide `keyword` and the Actor will create the listing URL automatically.
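
How the URL is built internally is not documented here. The sketch below assumes the `search?q=` pattern that appears in the examples in this README, and `keyword_to_search_url` is a hypothetical helper.

```python
from urllib.parse import urlencode


def keyword_to_search_url(keyword: str) -> str:
    """Build a Walmart search URL for a keyword (assumed pattern: /search?q=...)."""
    return "https://www.walmart.com/search?" + urlencode({"q": keyword.strip()})


if __name__ == "__main__":
    print(keyword_to_search_url("iphone"))
```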

#### Can I pass multiple URLs?

Yes. Put multiple URLs into `urls` separated by commas or new lines.
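
If you hold such a string and need a list (for example, because the input schema below types `urls` as an array), a split along these lines would do. `split_urls` is a hypothetical helper. Note that it would also split a URL that itself contains a comma.

```python
import re


def split_urls(raw: str) -> list[str]:
    """Split a comma- or newline-separated string into a clean list of URLs."""
    parts = re.split(r"[,\n]+", raw)
    # strip() also removes stray spaces and carriage returns from Windows line endings
    return [part.strip() for part in parts if part.strip()]
```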

#### Does it handle pagination?

Yes. It paginates per URL up to `max_pages` and stops at `results_wanted`.
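
The interplay of the two limits can be sketched as a pair of nested loops. This illustrates the described behavior and is not the Actor's actual code. `collect` and the `fetch_page` callable are hypothetical, and stopping early on an empty page is an added assumption.

```python
from typing import Any, Callable

Item = dict[str, Any]


def collect(
    seed_urls: list[str],
    fetch_page: Callable[[str, int], list[Item]],
    results_wanted: int,
    max_pages: int,
) -> list[Item]:
    """Walk each seed URL page by page until a limit is reached."""
    results: list[Item] = []
    for url in seed_urls:
        for page in range(1, max_pages + 1):  # per-URL page limit
            items = fetch_page(url, page)
            if not items:  # assumption: an empty page ends this listing
                break
            for item in items:
                results.append(item)
                if len(results) >= results_wanted:  # global record limit
                    return results
    return results
```

With this shape, `max_pages` bounds the work done per seed URL, while `results_wanted` bounds the whole run.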

#### Why are some fields missing for some products?

Different products expose different listing metadata, so only available values are saved.
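
Because a field may simply be absent, it is safer to read records with `dict.get` than with direct indexing. A minimal sketch follows, with `format_listing` as a hypothetical helper and the fallback strings chosen only for illustration.

```python
from typing import Any


def format_listing(item: dict[str, Any]) -> str:
    """Render a one-line summary that tolerates missing fields."""
    name = item.get("name", "(no name)")
    price = item.get("priceDisplay", "price n/a")
    rating = item.get("averageRating")
    rating_text = f"{rating:.1f} stars" if rating is not None else "unrated"
    return f"{name} | {price} | {rating_text}"
```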

#### Is user input always prioritized?

Yes. Runtime input is always used first. Local `INPUT.json` is only used when runtime input is empty.
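
The precedence described above could be expressed roughly as follows. This is a sketch of the stated rule, not the Actor's source. `resolve_input` is a hypothetical helper, and returning an empty dict when neither source exists is an assumption.

```python
import json
from pathlib import Path
from typing import Any, Optional


def resolve_input(runtime_input: Optional[dict[str, Any]], local_path: str = "INPUT.json") -> dict[str, Any]:
    """Prefer runtime input; fall back to a local INPUT.json only when it is empty."""
    if runtime_input:
        return runtime_input
    path = Path(local_path)
    if path.is_file():
        return json.loads(path.read_text(encoding="utf-8"))
    return {}  # assumption: no input available at all
```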

***

### Support

For issues or feature requests, use Apify Console support channels.

#### Resources

- [Apify Documentation](https://docs.apify.com/)
- [Apify API Reference](https://docs.apify.com/api/v2)
- [Scheduling Runs](https://docs.apify.com/schedules)

***

### Legal Notice

This Actor is intended for lawful data collection. Users are responsible for compliance with site terms and applicable laws.

# Actor input Schema

## `urls` (type: `array`):

One or more Walmart listing URLs. Supports search, browse/category, brand, and tag-like listing URLs.

## `keyword` (type: `string`):

Optional keyword search. Used when URLs are not provided.

## `results_wanted` (type: `integer`):

Maximum number of products to save.

## `max_pages` (type: `integer`):

Maximum pages to process per seed URL.

## `proxyConfiguration` (type: `object`):

Use Apify Proxy for better stability.

## Actor input object example

```json
{
  "urls": [
    "https://www.walmart.com/search?q=iphone"
  ],
  "keyword": "iphone",
  "results_wanted": 20,
  "max_pages": 3,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
```

# Actor output Schema

## `overview` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "urls": [
        "https://www.walmart.com/search?q=iphone"
    ],
    "keyword": "iphone",
    "results_wanted": 20,
    "max_pages": 3
};

// Run the Actor and wait for it to finish
const run = await client.actor("shahidirfan/walmart-product-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "urls": ["https://www.walmart.com/search?q=iphone"],
    "keyword": "iphone",
    "results_wanted": 20,
    "max_pages": 3,
}

# Run the Actor and wait for it to finish
run = client.actor("shahidirfan/walmart-product-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "urls": [
    "https://www.walmart.com/search?q=iphone"
  ],
  "keyword": "iphone",
  "results_wanted": 20,
  "max_pages": 3
}' |
apify call shahidirfan/walmart-product-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=shahidirfan/walmart-product-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Walmart Product Scraper",
        "description": "Scrape Walmart products at scale. Extract pricing, ratings, availability & product details. Real-time data for competitive intelligence, price monitoring & market research. Fast, reliable, production-ready.",
        "version": "0.0",
        "x-build-id": "ubkqXzx72l0aUFJzj"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/shahidirfan~walmart-product-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-shahidirfan-walmart-product-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/shahidirfan~walmart-product-scraper/runs": {
            "post": {
                "operationId": "runs-sync-shahidirfan-walmart-product-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/shahidirfan~walmart-product-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-shahidirfan-walmart-product-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "urls": {
                        "title": "URLs",
                        "type": "array",
                        "description": "One or more Walmart listing URLs. Supports search, browse/category, brand, and tag-like listing URLs.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "keyword": {
                        "title": "Keyword",
                        "type": "string",
                        "description": "Optional keyword search. Used when URLs are not provided."
                    },
                    "results_wanted": {
                        "title": "Results wanted",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of products to save.",
                        "default": 20
                    },
                    "max_pages": {
                        "title": "Max pages",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum pages to process per seed URL.",
                        "default": 5
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Use Apify Proxy for better stability.",
                        "default": {
                            "useApifyProxy": false
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
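
For projects that cannot use a client library, the `run-sync-get-dataset-items` path from the specification above can be called directly. The sketch below only builds the request with the standard library, passing the token as a query parameter as the specification describes. `build_sync_request` is a hypothetical helper, and nothing is sent until you pass the result to `urllib.request.urlopen`.

```python
import json
from typing import Any
from urllib.parse import urlencode
from urllib.request import Request

API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "shahidirfan~walmart-product-scraper"


def build_sync_request(token: str, run_input: dict[str, Any]) -> Request:
    """Prepare a POST request that runs the Actor and returns its dataset items."""
    url = (
        f"{API_BASE}/acts/{ACTOR_PATH}/run-sync-get-dataset-items?"
        + urlencode({"token": token})
    )
    body = json.dumps(run_input).encode("utf-8")
    return Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    request = build_sync_request("<YOUR_API_TOKEN>", {"keyword": "iphone", "results_wanted": 20})
    print(request.get_method(), request.full_url)
```

Synchronous runs keep the HTTP connection open until the Actor finishes, so long runs are usually better started through the `/runs` path and polled.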
