Actor 2 Dataset Cleaner and Formatter
Pricing
from $0.01 / 1,000 results
Actor 2 Dataset Cleaner and Formatter
Automatically cleans and formats datasets by trimming whitespace, fixing capitalization, and removing duplicates. Supports both Apify datasets and uploaded JSON/CSV files.
Pricing
from $0.01 / 1,000 results
Rating
0.0
(0)
Developer

Leoncio Jr Coronado
Actor stats
0
Bookmarked
3
Total users
1
Monthly active users
12 days ago
Last modified
Categories
Share
Dataset Cleaner & Formatter
A lightweight and efficient Actor that automatically cleans and formats datasets. It trims whitespace, fixes capitalization, removes duplicates, and normalizes common fields. Works with both Apify Datasets and uploaded JSON/CSV files.
🚀 Features
Trim leading/trailing whitespace
Fix inconsistent capitalization
Remove duplicate records
Normalize key naming
Supports CSV and JSON
Works with Apify Datasets or external file URLs
Very fast (no browser)
{ "source_type": "upload", "file_url": "https://example.com/data.csv" }
source_type
"upload" — Use a CSV/JSON file from a URL
"dataset" — Use an existing Apify Dataset
file_url
Direct link to CSV or JSON file
Required when source_type = "upload"
📤 Output
The cleaned dataset will include:
trimmed whitespace
normalized capitalization
removed duplicates
consistent structure
Output is stored in the default Dataset, ready for export to CSV, JSON, Excel, or API integration.
import requests
run = requests.post( "https://api.apify.com/v2/acts/motivational_nickel~dataset-cleaner-and-formatter/runs?token=YOUR_API_TOKEN", json={ "source_type": "upload", "file_url": "https://example.com/leads.csv" } ).json()
print(run["data"])
💡 Use Cases
Lead list cleaning
CRM data preprocessing
CSV-to-clean-CSV transformations
JSON structure normalization
Bulk dataset cleanup
Analytics and machine learning preparation
Automation pipelines
💸 Pricing
Pay-per-usage. Designed to support small and large datasets efficiently.
✔ Notes
Works without a browser → very fast and stable
Ideal as a preprocessing step before exporting or merging datasets
Handles both CSV and JSON formats automatically
