Easy Data Processor: Merge, Clean, and Transform Your Data
View all Actors
This Actor is unavailable because the developer has decided to deprecate it. Would you like to try a similar Actor instead?
See alternative ActorsEasy Data Processor: Merge, Clean, and Transform Your Data
dainty_screw/easy-data-processor-merge-clean-and-transform-your-data
Meet the Ultimate Data Processor, a human-friendly tool that simplifies your data tasks. With this Apify actor, you can merge datasets, remove duplicates, and transform data quickly and effortlessly, all in one go. Say goodbye to complex processes and hello to streamlined data management
2024-09-10
Features
- Add
persistedSharedObject
to transform functions input object. This object is shared for all transform function calls and persist over Actor migrations. This is useful mainly for thededup-as-loading
mode where the transform functions are called multiple times and only process a chunk of the data. - Add
nullAsUnique
to input. If set to true, thenull
and missing values are considered unique and not deduplicated.
2024-07-03
Features
- Enable merging all datasets for runs of an Actor or Task with
actorOrTaskId
,onlyRunsNewerThan
,onlyRunsOlderThan
input parameters.
2023-07-13
Features
- Add
customInputData
object to input for easy passing of custom values intopreDedupTransformFunction
andpostDedupTransformFunction
. It is part of the 2nd parameter object.
2021-01-24
Featues
- Added
fieldsToLoad
to input to increase speed and reducem meory if you don't need full items in output - Added
limit
andoffset
to input to be able to process only slices of dataset - Removed
uploadSleepMs
as the platform can now handle much higher load of upload
2021-01-14
Features
outputDatasetId
can now also use dataset name. If dataset with that name doesn't exist, a new dataset is created.
2020-07-10
Fixes:
dedup-as-loading
mode now works correctly with actor migrations. This means that this actor can finally be used for huge datasets with lower memory!
Features:
fields
are now optional which means the actor does not need to perform deduplication
Previous updates
Previous updates were not tracked, see GitHub commits if you need to find past changes or ask in Issues or Discord.
Developer
Maintained by Community
Categories