Cainiao Tracking Actor avatar

Cainiao Tracking Actor

Try for free

No credit card required

View all Actors
Cainiao Tracking Actor

Cainiao Tracking Actor

nguyennk91/cainiao-tracking-actor
Try for free

No credit card required

Dockerfile

1# This is a template for a Dockerfile used to run acts in Actor system.
2# The base image name below is set during the act build, based on user settings.
3# IMPORTANT: The base image must set a correct working directory, such as /usr/src/app or /home/user
4FROM apify/actor-node-chrome:v0.21.10
5
6# Second, copy just package.json and package-lock.json since it should be
7# the only file that affects "npm install" in the next step, to speed up the build
8COPY package*.json ./
9
10# Install NPM packages, skip optional and development dependencies to
11# keep the image small. Avoid logging too much and print the dependency
12# tree for debugging
13RUN npm --quiet set progress=false \
14 && npm install --only=prod --no-optional \
15 && echo "Installed NPM packages:" \
16 && (npm list --all || true) \
17 && echo "Node.js version:" \
18 && node --version \
19 && echo "NPM version:" \
20 && npm --version
21
22# Copy source code to container
23# Do this in the last step, to have fast build if only the source code changed
24COPY --chown=myuser:myuser . ./
25
26# NOTE: The CMD is already defined by the base image.
27# Uncomment this for local node inspector debugging:
28# CMD [ "node", "--inspect=0.0.0.0:9229", "main.js" ]

package.json

1{
2    "name": "apify-project",
3    "version": "0.0.1",
4    "description": "",
5    "author": "It's not you it's me",
6    "license": "ISC",
7    "dependencies": {
8        "apify": "0.21.10",
9        "request-promise": "latest"
10    },
11    "scripts": {
12        "start": "node main.js"
13    }
14}

main.js

1const Apify = require('apify');
2const request = require('request-promise');
3
4Apify.main(async () => {
5    // Get input of your actor
6    const input = await Apify.getInput();
7    console.log('Tracking Code:');
8    console.dir(input.trackingCode);
9
10    const requestList = new Apify.RequestList({
11        sources: [{ url: 'https://global.cainiao.com/detail.htm?mailNoList='+input.trackingCode }],
12    });
13    await requestList.initialize();
14
15    const crawler = new Apify.PuppeteerCrawler({
16        requestList,
17        handlePageFunction: async ({ page, request }) => {
18            // This function is called to extract data from a single web page
19            // 'page' is an instance of Puppeteer.Page with page.goto(request.url) already called
20            // 'request' is an instance of Request class with information about the page to load
21            await page
22            .waitForSelector('#waybill_list_val_box');
23            const element = await page.$("#waybill_list_val_box");
24            const trackingRaw = await page.evaluate(element => element.textContent, element);
25            console.log('Tracking Raw:');
26            console.dir(trackingRaw);
27            
28            const parsedTracking = JSON.parse(trackingRaw);
29        
30            console.log('Tracking Detail:');
31            console.dir(parsedTracking);
32            // Save the data to dataset.
33            await Apify.setValue('OUTPUT', parsedTracking);
34        },
35        handleFailedRequestFunction: async ({ request }) => {
36            // This function is called when the crawling of a request failed too many times
37            await Apify.pushData({
38                url: request.url,
39                succeeded: false,
40                errors: request.errorMessages,
41            });
42        },
43    });
44
45    await crawler.run();
46});
Developer
Maintained by Community
Actor metrics
  • 1 monthly user
  • 1 star
  • 50.0% runs succeeded
  • Created in Jan 2020
  • Modified about 2 years ago
Categories