US Foods Catalog Scraper | Brands, Specs, Images avatar

US Foods Catalog Scraper | Brands, Specs, Images

Pricing

Pay per event

Go to Apify Store
US Foods Catalog Scraper | Brands, Specs, Images

US Foods Catalog Scraper | Brands, Specs, Images

Scrape the US Foods public product catalog for brands, case sizes, serving sizes, features, and images. Export JSON, CSV, Excel, XML for restaurant procurement.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

7 days ago

Last modified

Share

ParseForge Banner

๐Ÿฝ๏ธ US Foods Catalog Scraper

๐Ÿš€ Export the US Foods public product catalog in seconds. Collect titles, brands, case sizes, descriptions, features, benefits, and images for foodservice products. No API key, no registration, no manual CSV wrangling.

๐Ÿ•’ Last updated: 2026-05-23 ยท ๐Ÿ“Š 16 fields per record ยท 500+ products ยท 30+ brands ยท JSON / CSV / Excel / XML export

US Foods is one of the two largest broadline foodservice distributors in the United States, serving more than 250,000 restaurants, healthcare operations, schools, hotels, and independent kitchens. Its public product catalog at usfoods.com/products-we-offer showcases the proprietary brand lineup that distinguishes the company from rivals like Sysco. This Actor walks that public catalog end-to-end and returns one clean JSON record per product, ready for spreadsheets, dashboards, AI pipelines, or vendor benchmarking workflows.

The scraper enumerates every product URL exposed by the official sitemap.xml, then parses each detail page server-side. Output includes the primary image, product title, internal product ID, brand name, brand logo, rich description, case size, serving size, servings per case, an ordered list of features, an ordered list of benefits, optional regulatory disclaimers, and a gallery of all carousel images. Every record is timestamped with scrapedAt so you can run the Actor on a schedule and diff snapshots over time.

๐ŸŽฏ Target Audience๐Ÿ’ก Primary Use Cases
Restaurant owners and executive chefsBuild a unified product database for cross-distributor comparison
Restaurant group purchasing managersTrack US Foods brand portfolio changes week over week
Hospital, school, and hotel food service directorsPopulate menu engineering tools with case-size and servings data
Foodservice consultants and procurement analystsAudit vendor product mix and surface new launches
Market research and competitive intelligence teamsMap the US Foods proprietary brand catalog by category
Data engineers building AI agents for hospitalityFeed clean, structured product data into LLM workflows

๐Ÿ“‹ What the US Foods Catalog Scraper does

  • ๐Ÿ“‚ Crawls the full public catalog. Walks the official sitemap to discover every product URL in usfoods.com/products-we-offer/.
  • ๐Ÿท๏ธ Captures brand context. Extracts brand name and brand logo image for every record so you can group products by proprietary line.
  • ๐Ÿ“ฆ Returns pack and yield specs. Pulls case size, serving size, and servings per case directly from the structured product spec block.
  • ๐Ÿ“ Preserves marketing copy verbatim. Saves the long-form description, features list, benefits list, and any regulatory disclaimer text.
  • ๐Ÿ–ผ๏ธ Keeps the entire image gallery. Stores the hero image plus every carousel image so you can render product cards downstream.
  • ๐ŸŽฏ Supports targeted runs. Pass a list of specific product URLs to skip the sitemap and scrape only what you need.

Each record arrives as a flat JSON object with predictable field ordering: image first, identifiers next, then specs, then arrays, then a scrapedAt timestamp and error slot. The shape is stable across runs, which makes downstream parsing trivial in pandas, BigQuery, Airtable, or any spreadsheet tool.

๐Ÿ’ก Why it matters: restaurants on 3-9% profit margins lose real money when food cost benchmarking is manual. A clean US Foods catalog feed is the missing piece for any procurement workflow that already tracks Sysco, Restaurant Depot, or other broadline distributors.


๐ŸŽฌ Full Demo

๐Ÿšง Coming soon: a 3-minute walkthrough showing how to launch a run, filter results, and export to Google Sheets.


โš™๏ธ Input

FieldTypeDefaultDescription
productUrlsarray of strings[]Optional. Specific US Foods product URLs to scrape, e.g. https://www.usfoods.com/products-we-offer/product.1-diced-beef.html. Leave empty to crawl the full public catalog from the sitemap.
maxItemsinteger10Hard cap on records returned. Free plans are limited to 10. Paid plans support up to 1,000,000.

Example: crawl 50 products straight from the sitemap.

{
"maxItems": 50
}

Example: scrape only a curated short list.

{
"productUrls": [
"https://www.usfoods.com/products-we-offer/product.1-diced-beef.html",
"https://www.usfoods.com/products-we-offer/product.100-key-lime-juice.html",
"https://www.usfoods.com/products-we-offer/product.12-raised-edge-parbaked-pizza-crust.html"
],
"maxItems": 3
}

โš ๏ธ Good to Know: the US Foods public catalog is a marketing showcase of the proprietary brand portfolio. It does not expose customer-specific contract pricing, which lives behind the ordering portal and varies by account. Treat this Actor as a product specification feed, not a real-time price feed.


๐Ÿ“Š Output

Every product is returned as one flat JSON object. Fields are ordered for predictable downstream parsing: image first, then identifiers, specs, arrays, and a timestamp.

๐Ÿงพ Schema

FieldTypeExample
๐Ÿ–ผ imageUrlstringhttps://www.usfoods.com/content/dam/products/eb/10750947006746_C1CF_2522811.jpg
๐Ÿ“Œ titlestring1" Diced Beef
๐Ÿ”— urlstringhttps://www.usfoods.com/products-we-offer/product.1-diced-beef.html
๐Ÿ†” productIdstring2522811
๐Ÿท๏ธ brandstringSTOCK YARDSยฎ
๐ŸŽจ brandImageUrlstringhttps://www.usfoods.com/content/dam/.../STOCKYARDS-L-C.svg
๐Ÿ“ descriptionstringElevate your comfort food offerings with Stock Yards Diced Beef...
๐Ÿ“ฆ caseSizestring2/5 LB.
๐Ÿฝ servingSizestring | null100 g
๐Ÿ”ข servingsPerCasestring | null45
โœจ featuresarray<string> | null["USDA Choice Grade", "NAMP/MBG: 135A", ...]
๐Ÿ’Ž benefitsarray<string> | null["Labor-saving: diced and frozen...", ...]
โš–๏ธ disclaimersstring | null*Processing aids and potential cross-contact...
๐Ÿ–ผ imageUrlsarray<string>["https://www.usfoods.com/content/dam/products/.../...jpg"]
๐Ÿ•’ scrapedAtstring (ISO 8601)2026-05-23T12:15:36.682Z
โŒ errorstring | nullnull

๐Ÿ“ฆ Sample records


โœจ Why choose this Actor

Capability
๐Ÿš€Fast crawl path. Pure HTTP fetch plus Cheerio parsing. No browser, no proxy lock-in, no captcha dance.
๐ŸงฑStable field shape. Every record matches the schema above, so your downstream code never breaks on missing keys.
๐Ÿ“ฆReal pack and yield data. Case size, serving size, and servings per case parsed straight from the spec block.
๐Ÿท๏ธBrand grouping ready. Brand name and brand logo on every record lets you cluster the proprietary portfolio.
๐Ÿ–ผ๏ธImage-first ordering. Hero image is field one, gallery is preserved, so product cards render cleanly.
๐ŸŽฏURL targeting. Pass a list of specific product URLs for incremental updates or quality-control sweeps.
โฑ๏ธTimestamped output. scrapedAt on every record makes longitudinal tracking and diffing painless.

๐Ÿ“Š A full sitemap pass returns roughly 500 products in under three minutes on a single Apify run.


๐Ÿ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
Manual browsingFree, but hours per weekOne product at a timeWhenever you rememberNoneCopy-paste hell
Official publicationsFree, very limitedMarketing PDFs onlyQuarterly at bestNoneManual parsing
Paid live distributor APIsHigh monthly feeCustomer-account scopedReal-timeAccount-levelVendor onboarding
Legacy community dumpsFreeStale, often years oldNeverNoneOut of date
โญ US Foods Catalog Scraper (this Actor)Pay only for what you runFull public sitemapOn demand or scheduledURL list, item capOne click

The right pick depends on whether you need authoritative, live, public catalog data or a contract-pricing data feed. This Actor covers the first cleanly, no friction.


๐Ÿš€ How to use

  1. ๐Ÿ” Sign up for Apify. Create a free account at console.apify.com. No credit card needed.
  2. ๐Ÿฝ๏ธ Open the Actor. Land on the US Foods Catalog Scraper page on the Apify Store.
  3. โš™๏ธ Set your input. Leave productUrls empty for a full crawl, or paste specific product URLs. Set maxItems to whatever you need.
  4. โ–ถ๏ธ Hit Start. The Actor enumerates the sitemap and scrapes each page. Free runs cap at 10 records for preview.
  5. ๐Ÿ“ฅ Export your data. Download as JSON, CSV, Excel, or XML. Connect to Google Sheets, Airtable, Slack, Make, or your own backend.

โฑ๏ธ Total time from sign-up to first dataset: about two minutes.


๐Ÿ’ผ Business use cases

๐Ÿฝ๏ธ Independent restaurants

  • Build a US Foods reference list to negotiate pricing with reps
  • Track new brand launches in the proprietary portfolio
  • Cross-check case-size and yield specs against menu costing tools
  • Spot product reformulations by diffing descriptions over time

๐Ÿข Group purchasing managers

  • Maintain a unified product catalog across multiple distributors
  • Map US Foods proprietary brands to category benchmarks
  • Feed procurement dashboards with brand and yield context
  • Generate vendor-side reports for finance and operations leads

๐Ÿฅ Institutional foodservice

  • Populate dietitian tools with serving size and pack data
  • Audit broadline distributor coverage for hospital and school menus
  • Spot supply gaps when planning multi-site catering rotations
  • Build internal request portals seeded with real product copy

๐Ÿ“Š Procurement analysts and consultants

  • Map the foodservice brand landscape for clients
  • Generate competitive intelligence reports on broadline product mix
  • Track product launches and discontinuations between snapshots
  • Power AI shopping agents with structured catalog data

๐ŸŒŸ Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

๐ŸŽ“ Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

๐ŸŽจ Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

๐Ÿค Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

๐Ÿงช Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

๐Ÿ”Œ Automating US Foods Catalog Scraper

Trigger and consume runs from any stack. Apify exposes a REST API plus first-party Node.js and Python SDKs.

  • ๐ŸŸฉ Node.js SDK: see the JavaScript client docs for runActor, polling, and dataset streaming.
  • ๐Ÿ Python SDK: the Python client docs cover the same surface with idiomatic Python.
  • ๐Ÿ“š API reference: the full Apify REST API lets you wire runs into any HTTP-capable system.

Schedule recurring catalog snapshots with Apify Schedules and feed deltas into your warehouse, BI tool, or alerting workflow. Daily, weekly, or monthly cadences are all easy to set up from the console.


โ“ Frequently Asked Questions


๐Ÿ”Œ Integrate with any app

Connect the Actor to whichever tools you already use. Apify Integrations cover the popular automation stack.

  • Make - drag-and-drop workflows triggered by completed runs
  • Zapier - push records into thousands of Zapier-supported apps
  • Slack - post run summaries to a team channel
  • Airbyte - stream datasets into Snowflake, BigQuery, or Postgres
  • GitHub Actions - kick off runs from CI pipelines
  • Google Drive - sync exports to a shared folder

๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for more foodservice and ecommerce data tools.


๐Ÿ†˜ Need Help? Stuck on input config or want a custom field added? Open our contact form and we will get back to you fast.


โš ๏ธ Disclaimer: this is an independent tool built by ParseForge. It is not affiliated with, endorsed by, or sponsored by US Foods Holding Corp. or any of its subsidiaries. Only publicly available data is collected.