Pricing

Pay per event

Recipe JSON-LD Bulk Harvester

Harvest structured recipe data from any food blog. URL mode: scrape a provided list. Domain mode: auto-discover the sitemap, filter Recipe pages, and crawl them. Extracts name, author, parsed ingredients, instructions, nutrition, and ratings from schema.org/Recipe JSON-LD and hRecipe microformat.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

Actor stats

Bookmarked

Total users

Monthly active users

17 days ago

Last modified

What it does

URL mode: Provide a list of recipe page URLs (or a text file of URLs) and scrape each one.
Domain mode: Provide one or more domain names and the actor fetches robots.txt, discovers the site's sitemap(s), filters pages that look like recipes, and crawls them up to your maxItems limit.

Data is extracted from schema.org/Recipe JSON-LD (the near-universal standard used by virtually every food blog for Google rich results) with an hRecipe microformat fallback for legacy sites.

What you get

Each result record contains:

Field	Description
`name`	Recipe title
`author`	Author name
`description`	Recipe summary
`recipe_category`	Category (e.g. Dessert, Main Course)
`recipe_cuisine`	Cuisine type (e.g. Italian, Mexican)
`prep_time`	Preparation time (ISO 8601, e.g. PT15M)
`cook_time`	Cook time (ISO 8601)
`total_time`	Total time
`recipe_yield`	Servings (e.g. "4 servings")
`recipe_ingredient`	Raw ingredient strings from the page
`recipe_ingredient_parsed`	Structured ingredients — each parsed to "quantity unit item, prep"
`recipe_instructions`	Step-by-step instructions (one per array item)
`nutrition`	Nutrition facts as JSON string (calories, fat, protein, carbs, etc.)
`aggregate_rating`	Star rating (number)
`rating_count`	Number of ratings
`keywords`	Recipe tags/keywords
`image_urls`	Recipe photo URLs
`video_url`	Recipe video URL if present
`date_published`	Publication date (ISO 8601)
`source_domain`	Domain scraped
`url`	Full page URL
`schema_type`	Extraction method: `recipe-jsonld`, `hrecipe-microformat`, or `none`
`extraction_warnings`	Non-fatal issues (missing fields, parse errors)

Structured ingredient parser

The recipe_ingredient_parsed field is the headline feature — it breaks each raw ingredient string into structured components:

"2 cups all-purpose flour, sifted"  ->  "2 cups all-purpose flour, sifted"
"1/2 tsp kosher salt"               ->  "0.5 tsp kosher salt"
"1 large egg, at room temperature"  ->  "1 egg, at room temperature"

Handles Unicode fractions, mixed fractions ("1 1/2"), and common unit abbreviations.

Input

URL mode

{
  "urls": [
    "https://www.allrecipes.com/recipe/10813/best-chocolate-chip-cookies/",
    "https://www.simplyrecipes.com/best-easy-roast-chicken-recipe-5207046"
  ],
  "maxItems": 100
}

You can also use requestsFromUrl to point to a plain-text file with one URL per line.

Domain mode

{
  "domains": [
    "www.seriouseats.com",
    "www.kingarthurbaking.com"
  ],
  "maxItems": 500
}

The actor fetches robots.txt from each domain, discovers listed sitemaps (or falls back to /sitemap.xml), traverses sitemap indexes, and filters URLs that look like recipe pages.

Input fields

Field	Type	Description
`urls`	array	Recipe page URLs to scrape (URL mode)
`domains`	array	Domains to auto-discover and crawl (domain mode)
`maxItems`	integer	Maximum results to return (0 = unlimited)
`requestsFromUrl`	string	URL of a text file with one recipe URL per line

Provide either urls (+ optional requestsFromUrl) or domains — not both.

How it works

URL mode — The actor resolves the URL list, crawls each page, and extracts recipe data directly.

Domain mode — For each domain:

Fetch robots.txt to discover sitemap URLs
Fall back to /sitemap.xml if robots.txt lists none
Walk sitemap indexes to find leaf sitemaps
Filter URLs by recipe-path heuristics (path contains /recipe/, slug has 3+ hyphen-separated words, etc.)
Crawl each filtered URL and extract recipe data

Supported sites

Works on any food blog or cooking site that emits schema.org/Recipe JSON-LD — which covers the vast majority of food sites since Google requires it for recipe rich results. This includes:

Recipe-plugin-powered WordPress sites (Tasty Recipes, WP Recipe Maker, Recipe Card Blocks, etc.)
Major food media (Allrecipes, Simply Recipes, Serious Eats, Food Network, BBC Good Food, etc.)
Independent food bloggers
Any site using hRecipe microformat (legacy support)

Pricing

Billed per recipe record saved. The default pricing profile charges a small fee per record plus a run start fee.

Notes

Rate limiting: The actor respects per-domain rate limiting — sites that throttle will be retried with backoff automatically.
Paywalled pages: Pages that return 403 or require login will be skipped with a warning in extraction_warnings.
Missing schema: Pages where no Recipe schema is found produce a stub record with schema_type: "none" and a warning.

Further reading: ISBN Database Access and Other Open Reference Data in Bulk

Universal Recipe Data Scraper

automation-lab/universal-recipe-data-scraper

Extract normalized ingredients, instructions, nutrition, ratings, images, and metadata from public recipe URLs using Schema.org Recipe JSON-LD.

Stas Persiianenko

Recipe API

vivid_astronaut/recipe

BRAINIALL Team

BBC Good Food Recipe Scraper

jungle_synthesizer/bbcgoodfood-recipe-scraper

Enumerate and scrape the full BBC Good Food recipe catalogue (~15K+ recipes) from sitemap discovery. Extracts structured recipe data including ingredients, instructions, UK nutrition panels, skill level, dietary tags, ratings, and schema.org/Recipe JSON-LD fields.

BowTiedRaccoon

Food.com Recipe Scraper

lulzasaur/foodcom-scraper

Scrape recipes from Food.com. Extract ingredients, instructions, nutrition, ratings, prep/cook times, and photos. Provide recipe URLs or browse by category for rich structured recipe data.

lulz bot

NYT Cooking Recipe Scraper

jungle_synthesizer/nyt-cooking-recipe-scraper

Enumerate all ~25K public NYT Cooking recipes from the official sitemap and extract structured recipe data (ingredients, instructions, nutrition, ratings) from schema.org Recipe JSON-LD.

BowTiedRaccoon

Food Recipe Scraper API

shahidirfan/Food-Recipe-Scraper

Extract complete recipe data with the Food Recipe Scraper. This lightweight actor pulls ingredients, step-by-step instructions, and images from popular recipe sites. Perfect for building your culinary database or app. Start scraping recipes today!

Shahid Irfan

5.0

Rakuten Recipe Scraper

shiokoshi356/rakuten-recipe-scraper

Scrape Rakuten Recipe data via official API. Get recipe categories, top-ranked recipes with ingredients, instructions, and images.

Shiokoshi356

AllRecipes Scraper

fingolfin/allrecipies-scraper

n Apify Actor that scrapes recipe information from AllRecipes.com, including recipe search and detailed recipe data with ingredients, directions, nutrition facts, and reviews.

Mate Papava

Recipe Scraper — Extract Recipes from 100+ Cooking Websites

studio-amba/recipe-scraper

Scrape recipes with ingredients, instructions, nutrition, ratings, and cooking times from popular recipe websites. Supports allrecipes.com, bbcgoodfood.com, and any site with Schema.org Recipe markup.

Studio Amba

Recipe Scraper (Universal / schema.org)

crawlerbros/recipe-scraper

Scrape any schema.org-compliant recipe site like Epicurious, BBC Good Food, Tasty, NYT Cooking, Serious Eats, Food Network, plus thousands of food blogs. Extracts ingredients, instructions, nutrition, ratings, prep/cook time, yield, author, and images via JSON-LD parsing.