NCSU Extension Gardener Plant Toolbox Scraper avatar

NCSU Extension Gardener Plant Toolbox Scraper

Pricing

Pay per event

Go to Apify Store
NCSU Extension Gardener Plant Toolbox Scraper

NCSU Extension Gardener Plant Toolbox Scraper

Scrape the NC State Extension Gardener Plant Toolbox (~6,500 plants). Extracts 50+ structured fields per plant: scientific name, common names, USDA zones, light/soil/water needs, bloom traits, leaf/fruit data, landscape uses, wildlife attraction, poison severity, edibility, propagation, and images.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

NC State Extension Gardener Plant Toolbox Scraper

Scrapes plant records from the NC State Extension Gardener Plant Toolbox. Returns 50+ structured fields per plant — scientific name, common names, USDA hardiness zones, light and soil requirements, bloom traits, toxicity data, wildlife attraction, and gallery images — across the full ~6,500-plant database.


NC State Plant Toolbox Scraper Features

  • Extracts 50+ fields per plant record, including numerical height and width ranges in decimal feet
  • Collects taxonomic data: scientific name, genus, species, family, cultivar, phonetic spelling
  • Returns USDA hardiness zone ranges, light requirements, soil drainage, texture, and pH preferences
  • Captures bloom color, bloom season, flower description, and gardener value ratings
  • Extracts toxicity information — poisonous parts, severity, symptoms — plus edibility notes
  • Returns wildlife attraction lists (pollinators, birds, butterflies) and pest resistance data
  • Collects landscape use categories, landscape themes, and recommended propagation strategies
  • Discovers all plant URLs from the sitemap — no search queries, no pagination to manage
  • No proxies required. The site is clean SSR with no bot detection.

What Can You Do With NC State Plant Toolbox Data?

  • Horticulture researchers — build searchable plant databases with structured trait data for academic or extension projects
  • Landscape architects — filter plants by hardiness zone, light requirements, and mature size to generate site-appropriate plant palettes
  • App developers — power plant identification, garden planning, or companion planting tools with authoritative USDA-aligned data
  • Educators — export plant records for curriculum materials, field guides, or interactive botanical learning tools
  • Nursery operators — feed plant data into inventory systems or customer-facing product descriptions
  • Toxicology and safety teams — extract poison severity and symptom data for plant safety databases used in pediatric or veterinary contexts

How NC State Plant Toolbox Scraper Works

  1. Fetches the site's XML sitemap and filters to plant detail pages — roughly 6,500 URLs total
  2. Crawls each plant page using a Cheerio-based HTML crawler
  3. Parses the dt/dd key-value trait table on each page into a structured field map
  4. Writes one record per plant with all extracted fields to the dataset

Input

{
"maxItems": 100
}
FieldTypeDefaultDescription
maxItemsintegerMaximum number of plant records to scrape. Leave blank to scrape all (~6,500 plants). Prefill: 10.

NC State Plant Toolbox Scraper Output Fields

{
"id_plant": "hydrangea-paniculata",
"scientific_name": "Hydrangea paniculata",
"common_names": ["Panicle Hydrangea", "Pee Gee Hydrangea"],
"family": "Hydrangeaceae",
"genus": "Hydrangea",
"species": "paniculata",
"cultivar": null,
"phonetic_spelling": "hy-DRAN-jee-uh pan-ick-yoo-LAY-tuh",
"description": "Panicle hydrangea is a large deciduous shrub...",
"plant_type": ["Shrub"],
"habit_form": ["Arching", "Multi-stemmed"],
"native_to": ["China", "Japan"],
"usda_hardiness_zones": "3a-8b",
"height_min_ft": 8.0,
"height_max_ft": 15.0,
"width_min_ft": 6.0,
"width_max_ft": 12.0,
"growth_rate": "Rapid",
"maintenance": "Low",
"light": ["Full sun (6 or more hours of direct sunlight a day)", "Partial shade"],
"soil_drainage": ["Moist"],
"soil_texture": ["Clay", "Loam", "Sand"],
"soil_ph": ["Acid", "Neutral"],
"water_use": "Medium",
"bloom_color": ["White"],
"bloom_time": ["Summer", "Fall"],
"bloom_description": "The flowers appear in large conical panicles...",
"flower_value_to_gardener": ["Attractive Flowers", "Cut Flowers", "Dried Flowers"],
"leaf_color": ["Green"],
"leaf_type": "Broadleaf",
"leaf_arrangement": "Opposite",
"fall_color": ["Red/Burgundy"],
"fruit_type": "Capsule",
"fruit_value_to_gardener": [],
"stem_bark": "Bark color: Gray; Bark description: Peeling, cinnamon-brown",
"landscape_uses": ["Foundation", "Hedge", "Specimen"],
"landscape_themes": ["Cottage Garden", "Pollinator Garden"],
"attracts": ["Bees", "Butterflies"],
"resistant_to": ["Deer"],
"problems": [],
"poisonous_parts": [],
"poison_severity": null,
"poison_symptoms": null,
"edibility": null,
"propagation": ["Stem tip cuttings"],
"cultivars_varieties": ["Limelight", "Pinky Winky", "Quick Fire", "Vanilla Strawberry"],
"tags": ["NC", "Extension", "shrub"],
"image_urls": ["https://s3.amazonaws.com/..."],
"toolbox_url": "https://plants.ces.ncsu.edu/plants/hydrangea-paniculata/"
}
FieldTypeDescription
id_plantstringNCSU toolbox slug used as primary key
scientific_namestringFull scientific name
common_namesarrayList of common names
familystringPlant family
genusstringGenus
speciesstringSpecies epithet
cultivarstringCultivar name, if this is a cultivar page
phonetic_spellingstringPhonetic pronunciation of the scientific name
descriptionstringFull descriptive text for the plant
plant_typearrayPlant types (e.g. Perennial, Shrub, Tree)
habit_formarrayGrowth habit descriptors (e.g. Arching, Erect)
native_toarrayCountry or region of origin
usda_hardiness_zonesstringUSDA Plant Hardiness Zone range (e.g. 3a-8b)
height_min_ftnumberMinimum height in decimal feet
height_max_ftnumberMaximum height in decimal feet
width_min_ftnumberMinimum width in decimal feet
width_max_ftnumberMaximum width in decimal feet
growth_ratestringGrowth rate descriptor
maintenancestringMaintenance level (Low, Medium, High)
lightarrayLight requirements
soil_drainagearraySoil drainage preferences
soil_texturearraySoil texture preferences
soil_pharraySoil pH preferences
water_usestringWater use or moisture requirement descriptor
bloom_colorarrayFlower/bloom colors
bloom_timearrayBloom seasons or months
bloom_descriptionstringDescriptive text about the flowers
flower_value_to_gardenerarrayFlower values (e.g. Attractive Flowers, Attracts Pollinators)
leaf_colorarrayLeaf colors
leaf_typestringLeaf type descriptor
leaf_arrangementstringLeaf arrangement on stem
fall_colorarrayDeciduous fall leaf colors
fruit_typestringFruit type descriptor
fruit_value_to_gardenerarrayFruit display, harvest time, or color descriptors
stem_barkstringCombined stem and bark description
landscape_usesarrayLandscape location uses
landscape_themesarrayLandscape themes
attractsarrayWildlife this plant attracts
resistant_toarrayConditions or threats this plant resists
problemsarrayKnown problems (e.g. Poisonous to Humans)
poisonous_partsarrayPoisonous plant parts
poison_severitystringToxicity severity rating
poison_symptomsstringPoisoning symptom description
edibilitystringEdibility notes
propagationarrayRecommended propagation strategies
cultivars_varietiesarrayKnown cultivar/variety names for this species
tagsarrayHashtag labels from the toolbox
image_urlsarrayPlant gallery image URLs
toolbox_urlstringCanonical URL of the plant toolbox page

🔍 FAQ

How do I scrape the NC State plant database?

NC State Extension Gardener Plant Toolbox Scraper handles discovery automatically. Set maxItems to limit the run, or leave it blank to scrape all ~6,500 plants. No API key, no login.

What data can I get from the NC State Plant Toolbox?

NC State Extension Gardener Plant Toolbox Scraper returns 50+ fields per plant: taxonomy, USDA zones, dimensional data in decimal feet, bloom traits, soil and light requirements, toxicity ratings, wildlife attraction data, landscape uses, and gallery images. It's the full record, not a summary.

How much does NC State Extension Gardener Plant Toolbox Scraper cost to run?

Pricing is per data record. A full scrape of ~6,500 plants runs well within a reasonable budget at the default coefficient. Scraping a subset — say 500 plants for a regional project — costs proportionally less.

Does NC State Extension Gardener Plant Toolbox Scraper need proxies?

No. The NC State Plant Toolbox is a clean server-rendered site with no bot detection. The scraper hits it at a measured pace without residential proxy overhead.

Can I filter to specific plant types or zones?

NC State Extension Gardener Plant Toolbox Scraper does not filter during scraping — it collects all plants and returns the full field set. Filter after export using plant_type, usda_hardiness_zones, or light fields in your downstream tooling.


Need More Features?

Need custom filters, additional fields, or a different plant database? File an issue or get in touch.

Why Use NC State Extension Gardener Plant Toolbox Scraper?

  • Authoritative source — NC State Extension is a land-grant university program; data reflects peer-reviewed horticultural standards, not user submissions
  • Depth over breadth — 50+ structured fields per record, including numerical dimensions and granular toxicity data, means less cleanup and more usable output from the first run
  • No friction — no proxies, no authentication, no API quota to manage