ai-data-cleaner-classifier

Pricing

from $0.01 / 1,000 results

Clean, normalize, deduplicate, and classify JSON, CSV, or Apify datasets using rules or OpenAI models. Built for automation pipelines, data preparation, and AI workflows. Supports dataset chaining, cost controls, and safe fallbacks.

Developer

King Shepherd

Maintained by Community

AI Data Cleaner & Classifier

Clean, normalize, deduplicate, and classify structured data using rules or AI.

This Actor helps you turn messy JSON, CSV files, or Apify datasets into clean, structured, and usable data for automation pipelines, analytics, CRMs, and AI workflows.


✅ What this Actor does

This Actor processes structured records. It can:

  • Normalize common fields (email, phone, name, company, URL)
  • Deduplicate records safely (hash-based)
  • Classify records using:
    • Rule-based logic
    • OpenAI models (optional)
  • Enrich data with:
    • Suggested tags
    • Industry (when detectable)
    • Confidence scores
  • Output clean, structured JSON to an Apify dataset

It is designed for automation and repeat usage, not one-off demos.
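
To make the normalization and hash-based deduplication steps listed above more concrete, here is a minimal Python sketch. It is not the Actor's actual implementation: the function names (`normalize_record`, `record_hash`, `deduplicate`), the specific normalization rules, and the choice of SHA-256 over a canonical JSON form are all assumptions made for illustration.

```python
import hashlib
import json
import re


def normalize_record(record):
    """Return a copy of the record with common fields normalized.

    The rules here are illustrative assumptions, not the Actor's own.
    """
    out = dict(record)
    if isinstance(out.get("email"), str):
        # Emails: trim whitespace and lowercase.
        out["email"] = out["email"].strip().lower()
    if isinstance(out.get("phone"), str):
        # Phones: keep digits only, preserving a leading "+".
        phone = out["phone"].strip()
        digits = re.sub(r"\D", "", phone)
        out["phone"] = "+" + digits if phone.startswith("+") else digits
    for key in ("name", "company"):
        if isinstance(out.get(key), str):
            # Names: trim and collapse repeated whitespace.
            out[key] = " ".join(out[key].split())
    return out


def record_hash(record):
    """Hash a canonical JSON form so key order does not affect the result."""
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def deduplicate(records):
    """Normalize each record, then keep the first record seen per hash."""
    seen = set()
    unique = []
    for record in records:
        clean = normalize_record(record)
        digest = record_hash(clean)
        if digest not in seen:
            seen.add(digest)
            unique.append(clean)
    return unique


rows = [
    {"email": " Test@Example.com ", "company": "Acme  Inc"},
    {"company": "Acme Inc", "email": "test@example.com"},
]
cleaned = deduplicate(rows)
```

Normalizing before hashing is the key design choice in this sketch: two records that differ only in casing, spacing, or key order collapse to the same hash, so `cleaned` holds a single record.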


📥 Supported input sources (exactly one required)

You must provide one and only one of the following input sources:

1️⃣ Inline JSON data

{
  "data": [
    { "email": "test@example.com", "company": "Acme Inc" }
  ]
}
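
Because exactly one input source is allowed, a caller may want to check an input object before starting a run. The sketch below shows one way to do that in Python. Only the `data` key comes from this document; `csvUrl` and `datasetId` are hypothetical placeholder names for the other sources, and `pick_input_source` is an illustrative helper, not part of the Actor.

```python
# Only "data" is documented here; the other two key names are hypothetical.
SOURCE_KEYS = ("data", "csvUrl", "datasetId")


def pick_input_source(actor_input):
    """Return the single source key present in the input, or raise ValueError."""
    provided = [key for key in SOURCE_KEYS if actor_input.get(key) is not None]
    if len(provided) != 1:
        raise ValueError(
            f"Expected exactly one input source, got {len(provided)}: {provided}"
        )
    return provided[0]


source = pick_input_source(
    {"data": [{"email": "test@example.com", "company": "Acme Inc"}]}
)
```

An input with no source, or with two sources at once, raises `ValueError` instead of being silently accepted.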