Apify Dataset Deduplication by URL
Created by Stas Persiianenko
Actor
Dataset Deduplicator
Remove duplicate Apify dataset items by URL and keep the first occurrence while preserving clean unique records in the output dataset.
Dataset Deduplicator (automation-lab/dataset-dedup)
Unique key
Input
Dataset IDs: a8HvXCQBgz2jolQa4
Dedup Fields (required): url
Keep Occurrence: first
Trim Whitespace: true
Max Items: 10000
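The input above can be read as a small deduplication routine. The following is a minimal Python sketch of that behavior, not the Actor's actual implementation: the function name `dedup_items`, its parameter names, the `"last"` option, and the reading of Max Items as a cap on how many items are read are all assumptions made for illustration.

```python
import json
from typing import Any, Iterable


def dedup_items(
    items: Iterable[dict[str, Any]],
    dedup_fields: tuple[str, ...] = ("url",),
    keep: str = "first",            # "last" is an assumed alternative
    trim_whitespace: bool = True,
    max_items: int = 10000,         # assumed: cap on items read, not on output
) -> list[dict[str, Any]]:
    """Hypothetical sketch: keep one item per unique combination of dedup fields."""

    def key_of(item: dict[str, Any]) -> str:
        parts = []
        for field in dedup_fields:
            value = item.get(field)
            if trim_whitespace and isinstance(value, str):
                value = value.strip()
            parts.append(value)
        # Serialize so that list or object values can also serve as keys.
        return json.dumps(parts, sort_keys=True, default=str)

    kept: dict[str, dict[str, Any]] = {}
    for index, item in enumerate(items):
        if index >= max_items:
            break
        key = key_of(item)
        if keep == "first":
            kept.setdefault(key, item)   # ignore later duplicates
        else:
            kept[key] = item             # overwrite with the later duplicate
    # Dicts preserve insertion order, so output follows first-seen key order.
    return list(kept.values())


sample = [
    {"url": "https://example.com/a ", "n": 1},
    {"url": "https://example.com/a", "n": 2},
    {"url": "https://example.com/b", "n": 3},
]
print([item["n"] for item in dedup_items(sample)])  # prints [1, 3]
```

With Trim Whitespace enabled, the first two sample items share a key even though one URL has a trailing space, so only the first is kept.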
Output fields
Unique key
01 Sign up on Apify
Create your Apify account to access the Dataset Deduplicator.
02 Start the run
The Actor starts automatically, using the input shown above.
03 Receive the output
Monitor the progress in real time. You will be notified as soon as your dataset is complete and ready for review.
04 Integrate into your workflow
The final output is delivered in JSON, CSV, or Excel format, ready to be plugged into your workflow.

