Actor picture

Example Sitemap Cheerio

jancurn/example-sitemap-cheerio

An example actor that first downloads a sitemap in XML format and the crawls each page from the sitemap using the fast CheerioCrawler from Apify SDK.

To run the code examples, you need to have an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
const Apify = require('apify');

// Set API token
Apify.client.setOptions({ token: '<YOUR_API_TOKEN>' });

// Prepare actor input
const input = {};

// Run the actor
const run = await Apify.call('jancurn/example-sitemap-cheerio', input);

// Print actor output (if any)
console.log('Output');
console.dir(run.output);

// Fetch and print actor results from the run's dataset (if any)
console.log('Results from dataset');
const dataset = await Apify.openDataset(run.defaultDatasetId, { forceCloud: true });
await dataset.forEach(async (item, index) => {
    console.log(JSON.stringify(item));
});
  • Updated
  • Last run
  • Used23 times
  • Used by10 users
Categories