Actor 2 Dataset Cleaner and Formatter avatar
Actor 2 Dataset Cleaner and Formatter

Pricing

from $0.01 / 1,000 results

Go to Apify Store
Actor 2 Dataset Cleaner and Formatter

Actor 2 Dataset Cleaner and Formatter

Automatically cleans and formats datasets by trimming whitespace, fixing capitalization, and removing duplicates. Supports both Apify datasets and uploaded JSON/CSV files.

Pricing

from $0.01 / 1,000 results

Rating

0.0

(0)

Developer

Leoncio Jr Coronado

Leoncio Jr Coronado

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

1

Monthly active users

12 days ago

Last modified

Share

Dataset Cleaner & Formatter

A lightweight and efficient Actor that automatically cleans and formats datasets. It trims whitespace, fixes capitalization, removes duplicates, and normalizes common fields. Works with both Apify Datasets and uploaded JSON/CSV files.

🚀 Features

Trim leading/trailing whitespace

Fix inconsistent capitalization

Remove duplicate records

Normalize key naming

Supports CSV and JSON

Works with Apify Datasets or external file URLs

Very fast (no browser)

{ "source_type": "upload", "file_url": "https://example.com/data.csv" }

source_type

"upload" — Use a CSV/JSON file from a URL

"dataset" — Use an existing Apify Dataset

file_url

Direct link to CSV or JSON file

Required when source_type = "upload"

📤 Output

The cleaned dataset will include:

trimmed whitespace

normalized capitalization

removed duplicates

consistent structure

Output is stored in the default Dataset, ready for export to CSV, JSON, Excel, or API integration.

import requests

run = requests.post( "https://api.apify.com/v2/acts/motivational_nickel~dataset-cleaner-and-formatter/runs?token=YOUR_API_TOKEN", json={ "source_type": "upload", "file_url": "https://example.com/leads.csv" } ).json()

print(run["data"])

💡 Use Cases

Lead list cleaning

CRM data preprocessing

CSV-to-clean-CSV transformations

JSON structure normalization

Bulk dataset cleanup

Analytics and machine learning preparation

Automation pipelines

💸 Pricing

Pay-per-usage. Designed to support small and large datasets efficiently.

✔ Notes

Works without a browser → very fast and stable

Ideal as a preprocessing step before exporting or merging datasets

Handles both CSV and JSON formats automatically