US Foods Catalog Scraper | Brands, Specs, Images
Pricing
Pay per event
US Foods Catalog Scraper | Brands, Specs, Images
Scrape the US Foods public product catalog for brands, case sizes, serving sizes, features, and images. Export JSON, CSV, Excel, XML for restaurant procurement.
Pricing
Pay per event
Rating
0.0
(0)
Developer
ParseForge
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
7 days ago
Last modified
Categories
Share

๐ฝ๏ธ US Foods Catalog Scraper
๐ Export the US Foods public product catalog in seconds. Collect titles, brands, case sizes, descriptions, features, benefits, and images for foodservice products. No API key, no registration, no manual CSV wrangling.
๐ Last updated: 2026-05-23 ยท ๐ 16 fields per record ยท 500+ products ยท 30+ brands ยท JSON / CSV / Excel / XML export
US Foods is one of the two largest broadline foodservice distributors in the United States, serving more than 250,000 restaurants, healthcare operations, schools, hotels, and independent kitchens. Its public product catalog at usfoods.com/products-we-offer showcases the proprietary brand lineup that distinguishes the company from rivals like Sysco. This Actor walks that public catalog end-to-end and returns one clean JSON record per product, ready for spreadsheets, dashboards, AI pipelines, or vendor benchmarking workflows.
The scraper enumerates every product URL exposed by the official sitemap.xml, then parses each detail page server-side. Output includes the primary image, product title, internal product ID, brand name, brand logo, rich description, case size, serving size, servings per case, an ordered list of features, an ordered list of benefits, optional regulatory disclaimers, and a gallery of all carousel images. Every record is timestamped with scrapedAt so you can run the Actor on a schedule and diff snapshots over time.
| ๐ฏ Target Audience | ๐ก Primary Use Cases |
|---|---|
| Restaurant owners and executive chefs | Build a unified product database for cross-distributor comparison |
| Restaurant group purchasing managers | Track US Foods brand portfolio changes week over week |
| Hospital, school, and hotel food service directors | Populate menu engineering tools with case-size and servings data |
| Foodservice consultants and procurement analysts | Audit vendor product mix and surface new launches |
| Market research and competitive intelligence teams | Map the US Foods proprietary brand catalog by category |
| Data engineers building AI agents for hospitality | Feed clean, structured product data into LLM workflows |
๐ What the US Foods Catalog Scraper does
- ๐ Crawls the full public catalog. Walks the official sitemap to discover every product URL in
usfoods.com/products-we-offer/. - ๐ท๏ธ Captures brand context. Extracts brand name and brand logo image for every record so you can group products by proprietary line.
- ๐ฆ Returns pack and yield specs. Pulls case size, serving size, and servings per case directly from the structured product spec block.
- ๐ Preserves marketing copy verbatim. Saves the long-form description, features list, benefits list, and any regulatory disclaimer text.
- ๐ผ๏ธ Keeps the entire image gallery. Stores the hero image plus every carousel image so you can render product cards downstream.
- ๐ฏ Supports targeted runs. Pass a list of specific product URLs to skip the sitemap and scrape only what you need.
Each record arrives as a flat JSON object with predictable field ordering: image first, identifiers next, then specs, then arrays, then a scrapedAt timestamp and error slot. The shape is stable across runs, which makes downstream parsing trivial in pandas, BigQuery, Airtable, or any spreadsheet tool.
๐ก Why it matters: restaurants on 3-9% profit margins lose real money when food cost benchmarking is manual. A clean US Foods catalog feed is the missing piece for any procurement workflow that already tracks Sysco, Restaurant Depot, or other broadline distributors.
๐ฌ Full Demo
๐ง Coming soon: a 3-minute walkthrough showing how to launch a run, filter results, and export to Google Sheets.
โ๏ธ Input
| Field | Type | Default | Description |
|---|---|---|---|
| productUrls | array of strings | [] | Optional. Specific US Foods product URLs to scrape, e.g. https://www.usfoods.com/products-we-offer/product.1-diced-beef.html. Leave empty to crawl the full public catalog from the sitemap. |
| maxItems | integer | 10 | Hard cap on records returned. Free plans are limited to 10. Paid plans support up to 1,000,000. |
Example: crawl 50 products straight from the sitemap.
{"maxItems": 50}
Example: scrape only a curated short list.
{"productUrls": ["https://www.usfoods.com/products-we-offer/product.1-diced-beef.html","https://www.usfoods.com/products-we-offer/product.100-key-lime-juice.html","https://www.usfoods.com/products-we-offer/product.12-raised-edge-parbaked-pizza-crust.html"],"maxItems": 3}
โ ๏ธ Good to Know: the US Foods public catalog is a marketing showcase of the proprietary brand portfolio. It does not expose customer-specific contract pricing, which lives behind the ordering portal and varies by account. Treat this Actor as a product specification feed, not a real-time price feed.
๐ Output
Every product is returned as one flat JSON object. Fields are ordered for predictable downstream parsing: image first, then identifiers, specs, arrays, and a timestamp.
๐งพ Schema
| Field | Type | Example |
|---|---|---|
๐ผ imageUrl | string | https://www.usfoods.com/content/dam/products/eb/10750947006746_C1CF_2522811.jpg |
๐ title | string | 1" Diced Beef |
๐ url | string | https://www.usfoods.com/products-we-offer/product.1-diced-beef.html |
๐ productId | string | 2522811 |
๐ท๏ธ brand | string | STOCK YARDSยฎ |
๐จ brandImageUrl | string | https://www.usfoods.com/content/dam/.../STOCKYARDS-L-C.svg |
๐ description | string | Elevate your comfort food offerings with Stock Yards Diced Beef... |
๐ฆ caseSize | string | 2/5 LB. |
๐ฝ servingSize | string | null | 100 g |
๐ข servingsPerCase | string | null | 45 |
โจ features | array<string> | null | ["USDA Choice Grade", "NAMP/MBG: 135A", ...] |
๐ benefits | array<string> | null | ["Labor-saving: diced and frozen...", ...] |
โ๏ธ disclaimers | string | null | *Processing aids and potential cross-contact... |
๐ผ imageUrls | array<string> | ["https://www.usfoods.com/content/dam/products/.../...jpg"] |
๐ scrapedAt | string (ISO 8601) | 2026-05-23T12:15:36.682Z |
โ error | string | null | null |
๐ฆ Sample records
โจ Why choose this Actor
| Capability | |
|---|---|
| ๐ | Fast crawl path. Pure HTTP fetch plus Cheerio parsing. No browser, no proxy lock-in, no captcha dance. |
| ๐งฑ | Stable field shape. Every record matches the schema above, so your downstream code never breaks on missing keys. |
| ๐ฆ | Real pack and yield data. Case size, serving size, and servings per case parsed straight from the spec block. |
| ๐ท๏ธ | Brand grouping ready. Brand name and brand logo on every record lets you cluster the proprietary portfolio. |
| ๐ผ๏ธ | Image-first ordering. Hero image is field one, gallery is preserved, so product cards render cleanly. |
| ๐ฏ | URL targeting. Pass a list of specific product URLs for incremental updates or quality-control sweeps. |
| โฑ๏ธ | Timestamped output. scrapedAt on every record makes longitudinal tracking and diffing painless. |
๐ A full sitemap pass returns roughly 500 products in under three minutes on a single Apify run.
๐ How it compares to alternatives
| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| Manual browsing | Free, but hours per week | One product at a time | Whenever you remember | None | Copy-paste hell |
| Official publications | Free, very limited | Marketing PDFs only | Quarterly at best | None | Manual parsing |
| Paid live distributor APIs | High monthly fee | Customer-account scoped | Real-time | Account-level | Vendor onboarding |
| Legacy community dumps | Free | Stale, often years old | Never | None | Out of date |
| โญ US Foods Catalog Scraper (this Actor) | Pay only for what you run | Full public sitemap | On demand or scheduled | URL list, item cap | One click |
The right pick depends on whether you need authoritative, live, public catalog data or a contract-pricing data feed. This Actor covers the first cleanly, no friction.
๐ How to use
- ๐ Sign up for Apify. Create a free account at console.apify.com. No credit card needed.
- ๐ฝ๏ธ Open the Actor. Land on the US Foods Catalog Scraper page on the Apify Store.
- โ๏ธ Set your input. Leave
productUrlsempty for a full crawl, or paste specific product URLs. SetmaxItemsto whatever you need. - โถ๏ธ Hit Start. The Actor enumerates the sitemap and scrapes each page. Free runs cap at 10 records for preview.
- ๐ฅ Export your data. Download as JSON, CSV, Excel, or XML. Connect to Google Sheets, Airtable, Slack, Make, or your own backend.
โฑ๏ธ Total time from sign-up to first dataset: about two minutes.
๐ผ Business use cases
๐ Beyond business use cases
Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.
๐ Automating US Foods Catalog Scraper
Trigger and consume runs from any stack. Apify exposes a REST API plus first-party Node.js and Python SDKs.
- ๐ฉ Node.js SDK: see the JavaScript client docs for
runActor, polling, and dataset streaming. - ๐ Python SDK: the Python client docs cover the same surface with idiomatic Python.
- ๐ API reference: the full Apify REST API lets you wire runs into any HTTP-capable system.
Schedule recurring catalog snapshots with Apify Schedules and feed deltas into your warehouse, BI tool, or alerting workflow. Daily, weekly, or monthly cadences are all easy to set up from the console.
โ Frequently Asked Questions
๐ Integrate with any app
Connect the Actor to whichever tools you already use. Apify Integrations cover the popular automation stack.
- Make - drag-and-drop workflows triggered by completed runs
- Zapier - push records into thousands of Zapier-supported apps
- Slack - post run summaries to a team channel
- Airbyte - stream datasets into Snowflake, BigQuery, or Postgres
- GitHub Actions - kick off runs from CI pipelines
- Google Drive - sync exports to a shared folder
๐ Recommended Actors
- ๐ฅซ Sysco Food Service Scraper - the other broadline giant. Pair with this Actor for cross-distributor comparison.
- ๐ณ CHEF'STORE Wholesale Food Price Scraper - cash-and-carry pricing across all CHEF'STORE locations.
- ๐ฆ Uline Product Catalog Scraper - packaging, foodservice supplies, and operational consumables.
- ๐ Costco Product Scraper - bulk retail comparison for restaurants and independent operators.
- ๐พ Chewy.com Product Scraper - structured product catalogs from large ecommerce surfaces.
๐ก Pro Tip: browse the complete ParseForge collection for more foodservice and ecommerce data tools.
๐ Need Help? Stuck on input config or want a custom field added? Open our contact form and we will get back to you fast.
โ ๏ธ Disclaimer: this is an independent tool built by ParseForge. It is not affiliated with, endorsed by, or sponsored by US Foods Holding Corp. or any of its subsidiaries. Only publicly available data is collected.