Pricing

Pay per event

NYT Cooking Recipe Scraper

Enumerate all ~25K public NYT Cooking recipes from the official sitemap and extract structured recipe data (ingredients, instructions, nutrition, ratings) from schema.org Recipe JSON-LD.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

Actor stats

Bookmarked

Total users

Monthly active users

17 days ago

Last modified

What it collects

Every record contains the following fields:

Field	Type	Description
`recipe_id`	string	Unique NYT Cooking recipe identifier
`url`	string	Canonical recipe URL
`name`	string	Recipe title
`author`	string	NYT Cooking contributor byline
`description`	string	Recipe description / headnote
`recipe_yield`	string	Serving size (e.g. "4 servings")
`total_time`	string	Total cooking time (e.g. "1 hr 30 min")
`prep_time`	string	Preparation time
`cook_time`	string	Active cooking time
`recipe_category`	string	Meal category (e.g. "Dinner, Main Course")
`recipe_cuisine`	string	Cuisine style (e.g. "Mediterranean Inspired")
`recipe_ingredient`	array	List of ingredient strings with quantities
`recipe_instructions`	array	Step-by-step instructions
`nutrition`	string	JSON-serialized nutrition facts (calories, fat, carbs, protein, sodium, etc.) from schema.org NutritionInformation. `null` for recipes without nutrition data.
`aggregate_rating`	number	Average user rating (1–5 scale)
`rating_count`	integer	Number of user ratings
`keywords`	array	Tags and keywords (ingredient highlights, technique, difficulty, etc.)
`image_urls`	array	Full-resolution image URLs
`date_published`	string	ISO 8601 publication date

Discovery

By default the actor walks the official NYT Cooking sitemap index (https://www.nytimes.com/sitemaps/new/cooking.xml.gz), which contains monthly sub-sitemaps covering the full recipe inventory. Only /recipes/ paths are collected — article and guide pages are excluded.

Inputs

Input	Type	Default	Description
`maxItems`	integer	10	Maximum number of recipes to collect. Set to 0 for no limit (full catalog run).
`startUrls`	array	—	Optional list of specific NYT Cooking recipe URLs to scrape directly, bypassing sitemap discovery. Useful for targeted single-recipe or small-batch runs.

Data source

All data is extracted from the schema.org/Recipe JSON-LD markup that NYT Cooking embeds in every public recipe page for SEO purposes. Recipe content — including ingredients, instructions, and metadata — is publicly available. The NYT Cooking paywall only gates account-specific features (recipe box, personal notes, collections) and does not restrict access to recipe markup.

Usage notes

For a full catalog run (~25K recipes), use maxItems: 0 and allow sufficient run time.
Nutrition data (nutrition field) is present on most recipes but absent on some recently published ones; the field is null in those cases.
The sitemap updates frequently (new recipes appear within hours of publication). Re-running with maxItems: 0 against the latest sub-sitemaps will catch additions.

Further reading: ISBN Database Access and Other Open Reference Data in Bulk

NYT Cooking Scraper

harvest/nyt-cooking-scraper

Scrapes recipe data from a New York Times Cooking recipe page. It extracts key details such as the recipe name, ingredients, instructions, cooking time, servings, and nutrition facts.

Harvest Data

Recipe Scraper — Extract Recipes from 100+ Cooking Websites

studio-amba/recipe-scraper

Scrape recipes with ingredients, instructions, nutrition, ratings, and cooking times from popular recipe websites. Supports allrecipes.com, bbcgoodfood.com, and any site with Schema.org Recipe markup.

Studio Amba

Recipe & Cooking Data Extractor

oneary/recipe-scraper

Extract recipes, ingredients, cooking instructions, and nutritional information from top recipe websites and food blogs.

Luan M.

BBC Good Food Recipe Scraper

jungle_synthesizer/bbcgoodfood-recipe-scraper

Enumerate and scrape the full BBC Good Food recipe catalogue (~15K+ recipes) from sitemap discovery. Extracts structured recipe data including ingredients, instructions, UK nutrition panels, skill level, dietary tags, ratings, and schema.org/Recipe JSON-LD fields.

BowTiedRaccoon

Recipe JSON-LD Bulk Harvester

jungle_synthesizer/recipe-jsonld-bulk-harvester

Harvest structured recipe data from any food blog. URL mode: scrape a provided list. Domain mode: auto-discover the sitemap, filter Recipe pages, and crawl them. Extracts name, author, parsed ingredients, instructions, nutrition, and ratings from schema.org/Recipe JSON-LD and hRecipe microformat.

BowTiedRaccoon

Recipe Scraper (Universal / schema.org)

crawlerbros/recipe-scraper

Scrape any schema.org-compliant recipe site like Epicurious, BBC Good Food, Tasty, NYT Cooking, Serious Eats, Food Network, plus thousands of food blogs. Extracts ingredients, instructions, nutrition, ratings, prep/cook time, yield, author, and images via JSON-LD parsing.

Crawler Bros

Recipe API

vivid_astronaut/recipe

BRAINIALL Team

Food.com Recipe Scraper

lulzasaur/foodcom-scraper

Scrape recipes from Food.com. Extract ingredients, instructions, nutrition, ratings, prep/cook times, and photos. Provide recipe URLs or browse by category for rich structured recipe data.

lulz bot

Rakuten Recipe Scraper

shiokoshi356/rakuten-recipe-scraper

Scrape Rakuten Recipe data via official API. Get recipe categories, top-ranked recipes with ingredients, instructions, and images.

Shiokoshi356

Recipe Data Scraper - Extract from 500+ Cooking Websites

vulnv/recipe-scraper

Powerful recipe scraper that extracts ingredients, instructions, nutrition facts, and cooking metadata from 500+ popular cooking websites including AllRecipes, Food Network, BBC Good Food, Epicurious, and more. Perfect for food apps, meal planning, nutrition analysis, and culinary research.