Superclean Product Names

Pricing

from $0.70 / 1,000 results

Clean messy product names from e-commerce exports and web scraping. AI removes promo text, prices, SKUs, and formatting noise. Three styles: Clean (core name only), Full (with attributes), Searchable (lowercase for dedup). Works with Shopify, Amazon, and any catalog data.

Developer

Superlative

Maintained by Community

Clean messy product names from e-commerce exports, web scraping, and catalog data.

What does Superclean Product Names do?

This Actor uses AI to clean and normalize product names from Shopify exports, Amazon listings, web scraping, and other catalog data.

  • Removes promotional noise — "BESTSELLER! Product FREE SHIPPING" becomes "Product"
  • Strips prices and discounts — "$19.99 SALE 50% OFF" is removed
  • Handles separators — "Product | Category | Store" extracts just "Product"
  • Fixes encoding issues — "Men’s T-Shirt" becomes "Men's T-Shirt"
  • Removes SKU codes — "T-Shirt XYZ-BLU-L-2024" becomes "T-Shirt"
  • Decodes HTML entities — "&amp;" becomes "&"

Why clean product names?

Your product data comes from many sources with different formats:

  • "BESTSELLER! Nike Air Max 90 | Men's | Size 10 - $129.99"
  • "nike air max 90 mens white"
  • "Clothing > Men > Shoes > Nike Air Max 90"
  • "NIKE AIR MAX 90 - Free Shipping - Amazon.com"

Clean data means better:

  • Catalog consistency — Standardized names across your product database
  • Deduplication — Match products across different sources
  • Search quality — Clean names for search indexes
  • CRM enrichment — Display-ready product names

How to use Superclean Product Names

  1. Paste your product names into the input field
  2. Select your output style (Clean, Full, or Searchable)
  3. Click Start and download your cleaned results

Input example

{
  "items": [
    "BESTSELLER! Nike Air Max 90 | Men's | Size 10 - $129.99",
    "organic coffee beans 12oz medium roast FREE SHIPPING",
    "T-Shirt XYZ-BLU-L-2024 (Blue, Large)",
    "Apple iPhone 15 Pro Max 256GB - Apple Store",
    "Clothing > Men > T-Shirts > Blue Cotton Tee"
  ],
  "style": "clean"
}
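
If you assemble this input in code rather than pasting it, a small helper can drop blank entries and catch a mistyped style before the run starts. The sketch below is only an illustration: `build_input` is a hypothetical helper, and while "clean" appears in the example above, the lowercase spellings "full" and "searchable" are assumed.

```python
from typing import Iterable

# "clean" is taken from the input example; the other two spellings are assumed.
STYLES = {"clean", "full", "searchable"}


def build_input(names: Iterable[str], style: str = "clean") -> dict:
    """Build the Actor input, dropping blank entries and validating the style."""
    if style not in STYLES:
        raise ValueError(f"unknown style: {style!r}")
    # Trim surrounding whitespace and skip empty strings.
    items = [name.strip() for name in names if name and name.strip()]
    return {"items": items, "style": style}


payload = build_input(
    [
        "  Apple iPhone 15 Pro Max 256GB - Apple Store ",
        "",
        "Clothing > Men > T-Shirts > Blue Cotton Tee",
    ],
    style="clean",
)
print(payload)
```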

Output example

[
  {
    "id": 1,
    "input": "BESTSELLER! Nike Air Max 90 | Men's | Size 10 - $129.99",
    "output": "Nike Air Max 90",
    "confidence": 0.95
  },
  {
    "id": 2,
    "input": "organic coffee beans 12oz medium roast FREE SHIPPING",
    "output": "Organic Coffee Beans Medium Roast",
    "confidence": 0.92
  },
  {
    "id": 3,
    "input": "T-Shirt XYZ-BLU-L-2024 (Blue, Large)",
    "output": "T-Shirt",
    "confidence": 0.88
  },
  {
    "id": 4,
    "input": "Apple iPhone 15 Pro Max 256GB - Apple Store",
    "output": "Apple iPhone 15 Pro Max",
    "confidence": 0.95
  },
  {
    "id": 5,
    "input": "Clothing > Men > T-Shirts > Blue Cotton Tee",
    "output": "Blue Cotton Tee",
    "confidence": 0.90
  }
]

Output styles

Style       Best for           Before                                           After
Clean       Catalogs, CRM      BESTSELLER! Nike Air Max 90 | Size 10 - $129.99  Nike Air Max 90
Full        Detailed listings  nike air max 90 mens white                       Nike Air Max 90 Men's White
Searchable  Deduplication      Nike Air Max 90 (Men's Size 10)                  nike air max 90

Clean (default)

Core product name only. Removes all promotional text, prices, variants, and noise.

  • "BESTSELLER! Nike Air Max 90 | Men's | Size 10" → "Nike Air Max 90"
  • "Coffee Beans 12oz FREE SHIPPING $9.99" → "Coffee Beans"
  • "T-Shirt XYZ-BLU-L-2024 (Blue)" → "T-Shirt"

Full

Properly formatted with attributes preserved. Fixes casing and encoding.

  • "nike air max 90 mens white" → "Nike Air Max 90 Men's White"
  • "ORGANIC COFFEE BEANS 12OZ" → "Organic Coffee Beans 12oz"
  • "iphone 15 pro max 256gb" → "iPhone 15 Pro Max 256GB"

Searchable

Lowercase, cleaned for matching and deduplication.

  • "Nike Air Max 90 (Men's Size 10)" → "nike air max 90"
  • "ORGANIC COFFEE BEANS 12oz" → "organic coffee beans"
  • "Apple iPhone 15 Pro Max" → "apple iphone 15 pro max"

Use cases

E-commerce catalog cleanup

Standardize product names across your Shopify, WooCommerce, or BigCommerce store. Clean up messy imports from suppliers.

Web scraping data cleanup

Clean product names extracted from competitor sites, marketplaces, or price comparison tools.

Product matching and deduplication

Use the Searchable style to create clean keys for matching the same product across different sources.
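
One way to use those keys is to group the original names by their cleaned output and look at the groups with more than one member. The rows below are hand-written samples in the documented output shape, not real Actor results, and `group_by_key` is a hypothetical helper.

```python
from collections import defaultdict


def group_by_key(rows):
    """Group original names by their cleaned (searchable) output."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["output"]].append(row["input"])
    return dict(groups)


# Hand-written sample rows, as if returned by a run with the searchable style.
rows = [
    {"id": 1, "input": "Nike Air Max 90 (Men's Size 10)",
     "output": "nike air max 90", "confidence": 0.95},
    {"id": 2, "input": "NIKE AIR MAX 90 - Free Shipping - Amazon.com",
     "output": "nike air max 90", "confidence": 0.90},
    {"id": 3, "input": "Apple iPhone 15 Pro Max",
     "output": "apple iphone 15 pro max", "confidence": 0.95},
]

# Keep only keys shared by two or more source names.
duplicates = {key: names for key, names in group_by_key(rows).items() if len(names) > 1}
print(duplicates)
```

Exact key equality is the simplest matching rule; names that differ after cleaning would need fuzzy matching on top, which this sketch does not attempt.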

CRM and enrichment

Clean product names for display in HubSpot, Salesforce, or your data enrichment pipelines.

What gets cleaned

Promotional text removed

  • BESTSELLER, SALE, NEW, HOT, LIMITED, EXCLUSIVE
  • FREE SHIPPING, FAST DELIVERY, IN STOCK
  • BUY NOW, SHOP NOW, ADD TO CART

Prices removed

  • $19.99, €29.99, £15.00
  • 50% OFF, SAVE $10

Formatting fixed

  • ALL CAPS → Title Case
  • Encoding issues (’ → ')
  • HTML entities (&amp; → &)

Noise stripped

  • SKU codes (XYZ-BLU-L-2024)
  • Store names (- Amazon.com)
  • Category breadcrumbs (Clothing > Men >)
  • Variant parentheticals ((Size L, Blue))

Brand capitalization

The Actor correctly preserves brand-specific capitalization:

  • Apple — iPhone, iPad, MacBook, AirPods
  • Sony — PlayStation
  • Microsoft — Xbox
  • Tech terms — WiFi, Bluetooth, HDMI, USB, LED, 4K

Integrations

Apify API

Call the Actor directly through the Apify API for programmatic access.
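
A minimal sketch of such a call with the `apify-client` Python package (installed with `pip install apify-client`) might look like this. The Actor ID shown is a guess, so copy the real one from the Actor's page; the `APIFY_TOKEN` environment variable name and the helpers `clean_names` and `to_lookup` are also assumptions made for illustration.

```python
import os

# Hypothetical Actor ID: copy the real one from the Actor's page in Apify Store.
ACTOR_ID = "superlative/superclean-product-names"


def clean_names(names, style="clean", token=None):
    """Run the Actor once and return its dataset rows as a list of dicts."""
    # Imported here so the rest of the module works without the dependency.
    from apify_client import ApifyClient

    client = ApifyClient(token or os.environ["APIFY_TOKEN"])
    run = client.actor(ACTOR_ID).call(
        run_input={"items": list(names), "style": style}
    )
    if run is None:
        raise RuntimeError("Actor run did not return a result")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


def to_lookup(rows):
    """Map each original name to its cleaned name."""
    return {row["input"]: row["output"] for row in rows}
```

Calling `to_lookup(clean_names(my_names))` would then give a dictionary from raw to cleaned names, assuming the dataset rows have the `input` and `output` fields shown in the output example.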

Clay

Use the Apify integration in Clay to clean product names as part of your enrichment workflow.

Make / Zapier / n8n

Connect via the Apify app to automate product name cleaning in your workflows.

Pricing

Items    Cost
1,000    $0.70
10,000   $7.00
100,000  $70.00

AI model costs

This Actor uses the Apify OpenRouter Actor to access AI models. AI token costs are billed to your Apify account at OpenRouter rates. Typical LLM costs are ~$0.05 per 1,000 names using the default openrouter/auto model.
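
As a rough budgeting aid, the two figures can be combined into one estimate. The sketch below assumes the $0.70 per 1,000 rate scales linearly, as in the table, and treats the ~$0.05 per 1,000 LLM figure as a typical value rather than a guarantee; `estimate_cost` is a hypothetical helper.

```python
ACTOR_RATE_PER_1000 = 0.70  # per-result price from the pricing table
LLM_RATE_PER_1000 = 0.05    # "typical" LLM cost; actual token costs vary


def estimate_cost(n_names: int) -> dict:
    """Estimate Actor and LLM costs in USD for a given number of names."""
    actor = n_names / 1000 * ACTOR_RATE_PER_1000
    llm = n_names / 1000 * LLM_RATE_PER_1000
    return {
        "actor": round(actor, 2),
        "llm": round(llm, 2),
        "total": round(actor + llm, 2),
    }


print(estimate_cost(25_000))
```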

Tips for best results

  • Batch your requests — Process names in bulk for efficiency
  • Check confidence scores — Items with confidence < 0.7 may need manual review
  • Choose the right style — Use "Clean" for catalogs, "Full" for detailed listings, "Searchable" for matching
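
For batching, a small generator can split a long list of names into chunks so that each run receives many names at once. The chunk size is up to you; this page does not state an input limit, so the value used below is purely illustrative and `batched` is a hypothetical helper.

```python
from itertools import islice


def batched(names, size):
    """Yield consecutive lists of at most `size` names."""
    if size < 1:
        raise ValueError("size must be at least 1")
    iterator = iter(names)
    # islice pulls up to `size` items each time; an empty list ends the loop.
    while chunk := list(islice(iterator, size)):
        yield chunk


names = ["name-a", "name-b", "name-c", "name-d", "name-e"]
for chunk in batched(names, 2):
    print(chunk)
```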

Confidence scores

Each result includes a confidence score from 0 to 1:

  • 0.9+ — High confidence, no review needed
  • 0.7-0.9 — Moderate confidence, spot check recommended
  • < 0.7 — Low confidence, manual review suggested

Non-English names automatically receive a confidence of 0 and should be reviewed manually.
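
These thresholds translate directly into a small triage step over the results. In the sketch below, the bucket names and the `triage` helper are hypothetical, and a score of exactly 0.9 is treated as high confidence, following the "0.9+" wording above.

```python
def triage(row: dict) -> str:
    """Assign a result row to a review bucket based on its confidence."""
    confidence = row["confidence"]
    if confidence >= 0.9:
        return "accept"
    if confidence >= 0.7:
        return "spot_check"
    return "manual_review"  # includes non-English names scored 0


# Hand-written sample rows in the documented output shape.
rows = [
    {"id": 1, "input": "Apple iPhone 15 Pro Max 256GB - Apple Store",
     "output": "Apple iPhone 15 Pro Max", "confidence": 0.95},
    {"id": 2, "input": "T-Shirt XYZ-BLU-L-2024 (Blue, Large)",
     "output": "T-Shirt", "confidence": 0.88},
    {"id": 3, "input": "Chaussures de course homme",
     "output": "Chaussures de course homme", "confidence": 0},
]

buckets = {row["id"]: triage(row) for row in rows}
print(buckets)
```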


More from Superlative

Built by Superlative — Clean data in. Better emails out.