Easy Data Processor: Merge, Clean, and Transform Your Data

Deprecated

This Actor is unavailable because the developer has decided to deprecate it.

dainty_screw/easy-data-processor-merge-clean-and-transform-your-data

Meet the Ultimate Data Processor, a human-friendly tool that simplifies your data tasks. With this Apify Actor, you can merge datasets, remove duplicates, and transform data quickly and effortlessly, all in one go. Say goodbye to complex processes and hello to streamlined data management.
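As a quick illustration, a minimal input for this Actor might look like the sketch below (the dataset IDs are placeholders; each parameter is documented in the sections that follow):

// Minimal input sketch: merge two datasets and keep only items whose
// combination of "url" and "title" is unique.
const input = {
    datasetIds: ["dataset-id-1", "dataset-id-2"],
    fields: ["url", "title"],
};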

Dataset IDs

datasetIds (array, optional)

Datasets that should be deduplicated and merged

Fields for deduplication

fields (array, optional)

Fields whose combination should be unique for the item to be considered unique. If none are provided, the actor does not perform deduplication.

What to output

output (enum, optional)

What will be pushed to the dataset from this actor

Value options:

"unique-items": string"duplicate-items": string"nothing": string

Default value of this property is "unique-items"

Mode

mode (enum, optional)

How the loading and deduplication process will work.

Value options:

"dedup-after-load": string"dedup-as-loading": string

Default value of this property is "dedup-after-load"
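For example, to push only the duplicate items and deduplicate while the datasets are still loading, the input could combine the two options above (a sketch, not the only valid combination):

const input = {
    datasetIds: ["dataset-id-1", "dataset-id-2"],
    fields: ["url"],
    output: "duplicate-items",
    mode: "dedup-as-loading",
};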

Output dataset ID or name (optional)

outputDatasetId (string, optional)

Optionally, push the results into a dataset of your choice. If you provide a dataset name that doesn't exist, a new named dataset will be created.

Limit fields to load

fieldsToLoad (array, optional)

You can choose to load only specific fields. Useful to speed up loading and reduce memory needs.
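A sketch combining the two options above: load only the fields needed for deduplication and the output, and push the result into a named dataset ("my-merged-data" is a placeholder name):

const input = {
    datasetIds: ["dataset-id-1", "dataset-id-2"],
    fields: ["url"],
    fieldsToLoad: ["url", "title", "price"],
    outputDatasetId: "my-merged-data",
};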

Pre dedup transform function

preDedupTransformFunction (string, optional)

Function to transform items before deduplication is applied. For 'dedup-after-load' mode this is done for all items at once. For 'dedup-as-loading' this is applied to each batch separately.
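A sketch of such a function, assuming it receives the loaded items as an array and returns the (possibly modified) array; in the Actor input it is passed as a string:

// Normalize the "url" field before deduplication so that trivial
// differences (trailing slash, uppercase characters) don't hide duplicates.
(items) => items.map((item) => ({
    ...item,
    url: (item.url || "").trim().toLowerCase().replace(/\/$/, ""),
}))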

Post dedup transform function

postDedupTransformFunction (string, optional)

Function to transform items after deduplication is applied. For 'dedup-after-load' mode this is done for all items at once. For 'dedup-as-loading' this is applied to each batch separately.
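For instance, a post-dedup function could strip fields that were only needed during deduplication before the items are pushed (same assumed signature as above):

// Keep only the fields we actually want in the output dataset.
(items) => items.map(({ url, title, price }) => ({ url, title, price }))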

Actor or Task ID (or name)

actorOrTaskId (string, optional)

Use Actor or Task ID (e.g. nwua9Gu5YrADL7ZDj) or full name (e.g. apify/instagram-scraper).

Only runs newer than

onlyRunsNewerThan (string, optional)

Use a date format of either YYYY-MM-DD or with time YYYY-MM-DDTHH:mm:ss.

Only runs older than

onlyRunsOlderThan (string, optional)

Use a date format of either YYYY-MM-DD or with time YYYY-MM-DDTHH:mm:ss.
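Instead of listing dataset IDs, the input can point at an Actor or Task and restrict which of its runs are taken into account; a sketch (the dates are placeholders):

const input = {
    actorOrTaskId: "apify/instagram-scraper",
    onlyRunsNewerThan: "2023-01-01",
    onlyRunsOlderThan: "2023-06-30T23:59:59",
    fields: ["url"],
};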

Where to output

outputTo (enum, optional)

Output either to a single dataset or split the data into key-value store records sized by the upload batch size. Key-value store upload is much faster, but the data ends up in many files.

Value options:

"dataset": string"key-value-store": string

Default value of this property is "dataset"
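A sketch of pushing the result into the key-value store instead of a dataset (faster upload, but split across many records):

const input = {
    datasetIds: ["dataset-id-1"],
    fields: ["url"],
    outputTo: "key-value-store",
};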

Parallel loads

parallelLoads (integer, optional)

Datasets can be loaded in parallel batches to speed things up if needed.

Default value of this property is 10

Parallel pushes

parallelPushes (integer, optional)

Deduped data can be pushed in parallel batches to speed things up if needed. If you want the data to be in the exact same order, you need to set this to 1.

Default value of this property is 5

Upload batch size

uploadBatchSize (integer, optional)

How many items to upload in one pushData call. Useful to avoid overloading the Apify API. Only relevant for dataset output.

Default value of this property is 500

Download batch size

batchSizeLoad (integer, optional)

How many items it will load in a single batch.

Default value of this property is 50000
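A sketch tuning the performance-related options above for a large merge; the exact numbers are illustrative and depend on item size and available memory:

const input = {
    datasetIds: ["dataset-id-1", "dataset-id-2"],
    fields: ["url"],
    parallelLoads: 20,      // load more batches in parallel
    parallelPushes: 1,      // keep the output order deterministic
    uploadBatchSize: 1000,  // larger pushData calls
    batchSizeLoad: 25000,   // smaller download batches to save memory
};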

Offset (how many items to skip from start)

offset (integer, optional)

By default, no items are skipped, which is the same as setting the offset to 0. For multiple datasets, the offset applies to the combined (summed) item count, which is rarely useful.

Limit (how many items to load)

limit (integer, optional)

By default, the number of loaded items is not limited.
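For example, to process only a slice of a single dataset (a sketch; with multiple datasets the offset applies to the combined item count):

const input = {
    datasetIds: ["dataset-id-1"],
    fields: ["url"],
    offset: 10000, // skip the first 10,000 items
    limit: 5000,   // load at most 5,000 items after the offset
};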

Verbose log

verboseLog (boolean, optional)

Good for smaller runs. Large runs might run out of log space.

Default value of this property is false

Null fields are unique

nullAsUnique (boolean, optional)

If enabled, items with null (or missing) dedup fields are always treated as unique.

Default value of this property is false

Dataset IDs for just deduping

datasetIdsOfFilterItems (array, optional)

The items from these datasets are used only as a deduplication filter for the main datasets. They are loaded first; the items of the main datasets are then checked for uniqueness against them and pushed.
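A typical use is incremental deduplication: filter a new scrape against datasets you have already processed so that only previously unseen items are pushed. A sketch with placeholder IDs:

const input = {
    datasetIds: ["new-scrape-dataset-id"],
    datasetIdsOfFilterItems: ["already-processed-dataset-id"],
    fields: ["url"],
    output: "unique-items",
};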

Custom input data

customInputData (object, optional)

You can pass custom data as a JSON object; it will be accessible in the transform functions as part of the second parameter object.
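A sketch of how the custom data could be combined with a transform function; the exact shape of the second parameter beyond customInputData is an assumption based on the description above:

const input = {
    datasetIds: ["dataset-id-1"],
    fields: ["url"],
    customInputData: { minPrice: 100 },
    preDedupTransformFunction: `(items, { customInputData }) =>
        items.filter((item) => item.price >= customInputData.minPrice)`,
};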

Developer
Maintained by Community