Actor picture

Crawler Results Dedup Xls

petr_cermak/crawler-results-dedup-xls

This act downloads crawler execution results, deduplicates them and sends them to a specified e-mail. It can be used from a crawler finish webhook, the data attribute must be a stringified JSON. Example input: { "_id": "EXECUTION_ID" "data": "{ \"email\": \"YOUR_E_MAIL_ADDRESS\", \"title\": \"E_MAIL_TITLE\" }" }

To run the code examples, you need to have an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
const Apify = require('apify');

// Set API token
Apify.client.setOptions({ token: '<YOUR_API_TOKEN>' });

// Prepare actor input
const input = {};

// Run the actor
const run = await Apify.call('petr_cermak/crawler-results-dedup-xls', input);

// Print actor output (if any)
console.log('Output');
console.dir(run.output);

// Fetch and print actor results from the run's dataset (if any)
console.log('Results from dataset');
const dataset = await Apify.openDataset(run.defaultDatasetId);
await dataset.forEach(async (item, index) => {
    console.log(JSON.stringify(item));
});
  • Modified
  • Last run
  • Used4042 times
  • Used by3 users