Merge, Dedup & Transform Datasets avatar
Merge, Dedup & Transform Datasets
Try for free

No credit card required

View all Actors
Merge, Dedup & Transform Datasets

Merge, Dedup & Transform Datasets

lukaskrivka/dedup-datasets
Try for free

No credit card required

The ultimate dataset processor. Extremely fast merging, deduplications & transformations all in a single run.

User avatar

Select contact details

Closed

nico_v opened this issue
a month ago

Hello,

Given the (very) large amount of column the contact details output for this run (1,070 columns in total), I'd like only the first 2 Contact Details of each sub-category (e.g. emails/0 + emails/1 ; Facebooks/0 + Facebooks/1, ...).

How exactly can I make this happen please? I've tried in the JSON input, using the Fileds to Load as e.g. 'contactDetails/facebooks/0',, but it wouldn't output any of these.

Happy to hear from you on that, thanks.

User avatar

Hello,

I can do a small script for you to do this in this actor. Another option would be to join all emails to one column separated by ;, e.g. instead of emails/0 = john@apify.com and emails/1 = peter@apify.com, it would be emails = john@apify.com; peter@apify.com. And it could join any number (you can decide a limit). Would you prefer that or your original idea?

User avatar

nico_v

a month ago

Hello Lukáš,

Thank you for the suggestions. It would be great if you could indeed have contactDetails 0 AND 1 in one column for each contactDetails type (e.g. Facebook 0 + Facebook 1 into one column, separated by a comma).

How should I proceed?

Also, I would need this not for this run in partuclar, but anytime (as I plan many different datasets merges).

Thanks for your help !

User avatar

Hello,

I created a task "Merge Google Maps - Trim to 2 contacts" for you. There is just a simple transform function that cleans the contacts. See the run below. https://console.apify.com/actors/tasks/WyOhdrUQvylK2G1f6/runs/wONYnadbFx2JeWNY3#output

I also added placeId to deduplication fields so you get rid of duplicate places, it goes down to 31053 places now.

Developer
Maintained by Apify
Actor metrics
  • 250 monthly users
  • 96.6% runs succeeded
  • 0.78 days response time
  • Created in Apr 2020
  • Modified 14 days ago