Merge, Dedup & Transform Datasets
lukaskrivka/dedup-datasets
The ultimate dataset processor. Extremely fast merging, deduplication, and transformation, all in a single run.
The code examples below show how to run the Actor and get its results. To run the code, you need an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token, which you can find under Settings > Integrations in Apify Console.
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "preDedupTransformFunction": """async (items, { Apify }) => {
    return items;
}""",
    "postDedupTransformFunction": """async (items, { Apify }) => {
    return items;
}""",
    "customInputData": {},
}

# Run the Actor and wait for it to finish
run = client.actor("lukaskrivka/dedup-datasets").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start
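In the example above, both `preDedupTransformFunction` and `postDedupTransformFunction` are identity functions that return the items unchanged. To illustrate what the two hooks are for, here is a minimal local sketch of a merge, pre-dedup transform, dedup, post-dedup transform pipeline in plain Python. It does not call Apify and is not the Actor's implementation. The function `merge_dedup_transform`, its parameters, and the sample fields `url` and `price` are all hypothetical names chosen for illustration.

```python
from typing import Any, Callable, Hashable, Iterable

Item = dict[str, Any]
Transform = Callable[[list[Item]], list[Item]]


def merge_dedup_transform(
    datasets: Iterable[list[Item]],
    dedup_fields: list[str],
    pre_dedup: Transform = lambda items: items,
    post_dedup: Transform = lambda items: items,
) -> list[Item]:
    """Hypothetical local model of a merge/dedup/transform pipeline."""
    # 1. Merge: concatenate all datasets in the order given.
    merged = [item for dataset in datasets for item in dataset]

    # 2. Pre-dedup transform: runs on the merged items before duplicates are dropped.
    merged = pre_dedup(merged)

    # 3. Dedup: keep the first item seen for each combination of field values.
    #    repr() makes the key hashable even when a field holds a list or dict.
    seen: set[Hashable] = set()
    unique: list[Item] = []
    for item in merged:
        key = tuple(repr(item.get(field)) for field in dedup_fields)
        if key not in seen:
            seen.add(key)
            unique.append(item)

    # 4. Post-dedup transform: runs on the unique items only.
    return post_dedup(unique)


# Sample data (made up for this sketch).
first = [
    {"url": "https://example.com/a", "price": 10},
    {"url": "https://example.com/b", "price": None},
]
second = [
    {"url": "https://example.com/a", "price": 12},
    {"url": "https://example.com/c", "price": 7},
]

result = merge_dedup_transform(
    [first, second],
    dedup_fields=["url"],
    # Drop items without a price before deduplicating.
    pre_dedup=lambda items: [i for i in items if i["price"] is not None],
    # Sort the unique items by price afterwards.
    post_dedup=lambda items: sorted(items, key=lambda i: i["price"]),
)
print(result)
```

In this sketch the pre-dedup step is the place to filter or normalize items so that duplicates are compared on clean values, while the post-dedup step works on the smaller, already unique set.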
Developer
Maintained by Apify
Actor metrics
- 367 monthly users
- 42 stars
- 84.2% runs succeeded
- 2.8 days response time
- Created in Apr 2020
- Modified 23 days ago
Categories