Home Depot Product Scraper
Under maintenance
Pricing: from $3.50 / 1,000 product results
Scrape Home Depot products by keyword, URL, or item ID. Returns prices, specs, ratings, reviews, store inventory, images, and breadcrumbs. Pay only $3.50 per 1,000 results.
Developer: Crikit (maintained by Community)
Scrape Home Depot product data at scale. This actor turns Home Depot search keywords, category pages, and product URLs into clean, structured JSON records you can pipe into a spreadsheet, a database, or your own analytics pipeline. It is built on the same Home Depot federation GraphQL gateway the website uses, so the data you get back is the data shoppers see: store-aware pricing, real-time inventory, ratings, reviews, and specifications.
Pricing is simple. You pay $3.50 per 1,000 items (about $0.0035 per result), with no Apify platform usage charges on top. That is roughly half the cost of the older Home Depot scrapers on the Apify Store, and you get a richer record per item.
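As a quick check on that arithmetic, the sketch below estimates the charge for a run from the quoted rate of $3.50 per 1,000 results. The function name is hypothetical, and the code assumes billing is strictly linear per result; whether review records count as billable results is not stated here.

```python
from decimal import Decimal

# Rate quoted on this page: $3.50 per 1,000 product results.
PRICE_PER_1000 = Decimal("3.50")


def estimate_cost(result_count: int) -> Decimal:
    """Estimate the charge in USD, assuming strictly linear per-result billing."""
    if result_count < 0:
        raise ValueError("result_count must be non-negative")
    return PRICE_PER_1000 * result_count / 1000


if __name__ == "__main__":
    # Example run sizes: a small test run, one keyword at 240 items, a large sweep.
    for n in (50, 240, 10_000):
        print(f"{n:>6} results -> ${estimate_cost(n):.4f}")
```

Decimal is used instead of float so that small per-result amounts are not distorted by binary rounding.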
What you can do with this Home Depot scraper
- Track competitor pricing across thousands of SKUs in a category every morning.
- Build a Home Depot product API for an internal tool or a price-comparison site.
- Watch a single SKU for stock changes at a particular store (BOPIS, BOSS, same-day).
- Pull review text and ratings for sentiment analysis on a tool category.
- Export product specifications into a procurement catalog.
- Generate seed data for an e-commerce demo or a machine learning project.
Inputs
The actor accepts any combination of:
- keywords - search terms like "drill" or "french door refrigerator". Each keyword paginates up to your maxItemsPerKeyword setting (Home Depot caps any single keyword at about 576 results, so 240 is a sensible default).
- urls - paste any mix of Home Depot product pages (/p/.../<itemId>), search pages (/s/...), or category pages (/b/.../N-<navParam>). The actor classifies each URL and runs the right operation.
- productIds - if you already have a list of 9-digit Home Depot item IDs, drop them in directly. This is the fastest input shape.
- storeId + zipCode - drive store-aware pricing and delivery dates. Default is store 0121 (Atlanta-Cumberland) with ZIP 30033. Change these to match the market you are tracking.
- enrichPDP (default on) - fetches the full product detail page for every result, so you get UPC, full specs, complete image gallery, fulfillment, and more.
- enrichReviews (off by default) - pulls the latest reviews for every product with photo URLs and verified-purchaser flags.
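The URL classification mentioned for the urls input can be pictured roughly as follows. This is a minimal sketch based only on the three path shapes listed above; the function name, the returned labels, and the regular expressions are assumptions, not the actor's actual code.

```python
import re
from typing import Optional, Tuple
from urllib.parse import unquote, urlparse

# Path shapes taken from the input description; the exact patterns are assumptions.
_PRODUCT_RE = re.compile(r"^/p/(?:.+/)?(\d+)/?$")  # /p/.../<itemId>
_CATEGORY_RE = re.compile(r"^/b/.*N-(\w+)")        # /b/.../N-<navParam>


def classify_url(url: str) -> Tuple[str, Optional[str]]:
    """Return (kind, identifier); kind is product, search, category, or unknown."""
    path = urlparse(url).path
    match = _PRODUCT_RE.match(path)
    if match:
        return "product", match.group(1)
    if path.startswith("/s/"):
        # Everything after /s/ is treated as the (percent-decoded) search term.
        return "search", unquote(path[len("/s/"):]) or None
    match = _CATEGORY_RE.match(path)
    if match:
        return "category", match.group(1)
    return "unknown", None


if __name__ == "__main__":
    # Made-up URLs that only illustrate the three documented shapes.
    for example in (
        "https://www.homedepot.com/p/example-drill/123456789",
        "https://www.homedepot.com/s/french%20door%20refrigerator",
        "https://www.homedepot.com/b/Tools/N-abc123",
    ):
        print(classify_url(example))
```

Dispatching on the path prefix keeps the three cases independent, so an unrecognized URL can be reported rather than silently mapped to the wrong operation.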
What the output looks like
Each dataset record is a flat object with the most important fields lifted to the top level:
- Identifiers: itemId, title, brand, modelNumber, upc, storeSkuNumber
- Pricing: price, originalPrice, dollarOff, percentageOff, promotionDescription, unitOfMeasure
- Ratings and reviews: ratingValue, reviewCount
- Media: imageUrl plus a full allImages array, plus videos and an optional 360 viewer URL
- Taxonomy: breadcrumbs, productDepartment, productSubType
- Inventory: fulfillmentOptions, storeInventory, backordered, anchorStoreStatus
- SEO: seoTitle, seoMetaDescription, canonical URL
Reviews are written as separate records with source: "review" and the same itemId so you can join them back to the product.
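Joining reviews back to products could look something like the sketch below. It relies only on the two documented facts (review records carry source: "review" and share the product's itemId); the function name and the added reviews key are hypothetical, and it assumes product records never have source set to "review".

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, List


def attach_reviews(records: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Group review records under their product using the shared itemId."""
    products: Dict[str, Dict[str, Any]] = {}
    reviews: Dict[str, List[Dict[str, Any]]] = defaultdict(list)
    for record in records:
        item_id = str(record.get("itemId"))
        if record.get("source") == "review":
            reviews[item_id].append(record)
        else:
            products[item_id] = dict(record)  # copy so the input is not mutated
    for item_id, product in products.items():
        # "reviews" is a key added by this sketch, not a field the actor emits.
        product["reviews"] = reviews.get(item_id, [])
    return list(products.values())
```

Reviews whose itemId matches no product record are dropped in this version; keep them in a separate list if you scrape reviews and products in different runs.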
How it works under the hood
The actor uses the Home Depot federation GraphQL gateway (/federation-gateway/graphql) with the same searchModel, productClientOnlyProduct, fulfillment, and reviews operations the website calls. Home Depot is protected by Akamai Bot Manager, so a stealth headless Chromium instance first warms a fresh cookie session against the homepage through a US residential Apify proxy, before any GraphQL request goes out. Cookies refresh automatically on any Akamai block signal.
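The warm-then-refresh behavior described above follows a common pattern, sketched here in generic form. Nothing in this block is the actor's real code: warm_session, send, and is_blocked are hypothetical callables you would supply, and what counts as an Akamai block signal is deliberately left to the is_blocked predicate.

```python
from typing import Callable, TypeVar

Session = TypeVar("Session")
Result = TypeVar("Result")


class BlockedError(RuntimeError):
    """Raised when every attempt was answered with a block signal."""


def call_with_refresh(
    warm_session: Callable[[], Session],
    send: Callable[[Session], Result],
    is_blocked: Callable[[Result], bool],
    max_refreshes: int = 2,
) -> Result:
    """Send a request, re-warming the session whenever a block is detected."""
    session = warm_session()  # e.g. load the homepage to collect cookies
    for attempt in range(max_refreshes + 1):
        result = send(session)
        if not is_blocked(result):
            return result
        if attempt < max_refreshes:
            session = warm_session()  # discard old cookies and start fresh
    raise BlockedError(f"still blocked after {max_refreshes} session refreshes")
```

Capping the number of refreshes keeps a persistently blocked proxy from turning into an endless warm-up loop.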
Best practices
- Always run from US residential proxies. Datacenter IPs are blocked at the perimeter.
- Start with maxItems: 50 for your first run to size the cost before kicking off a large crawl.
- For category sweeps, supply the category URL rather than a keyword: it is more deterministic and easier to resume.
- If you only need price and stock for known SKUs, pass productIds and leave keywords empty. This is the cheapest input mode per result.
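Putting these practices together, a cautious first-run input might look like the sketch below. The field names come from the Inputs section and the maxItems tip above; the value types (string IDs, lists, booleans) and the two product IDs are assumptions and placeholders, so check the actor's input form before relying on them.

```python
import json

# Field names are from this page; value types are assumptions.
first_run_input = {
    "productIds": ["123456789", "987654321"],  # placeholder 9-digit item IDs
    "keywords": [],          # left empty: known SKUs only
    "storeId": "0121",       # documented default store (Atlanta-Cumberland)
    "zipCode": "30033",      # documented default ZIP
    "enrichPDP": True,       # documented default
    "enrichReviews": False,  # documented default
    "maxItems": 50,          # small cap to size the cost of a first run
}

# Print the JSON you would paste into the actor's input editor.
print(json.dumps(first_run_input, indent=2))
```

Store and ZIP are kept as strings so the leading zero in 0121 survives JSON serialization.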
Legal
This actor only collects publicly available product data from Home Depot search and product pages. No account login is required, and no user-credential cookies are used. Use the output in accordance with Home Depot's terms of service. You are responsible for how you store, process, and resell the data.