Cheerio Scraper

Pricing

Pay per usage

apify/cheerio-scraper

Developed by

Apify

Maintained by Apify

Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using Node.js code. Supports both recursive crawling and lists of URLs. This Actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.

4.7 (11)

131

Monthly users: 612

Runs succeeded: >99%

Last modified: 5 months ago

How to return response within the POST request.

Closed
mr_apify opened this issue 4 months ago

I want the curl request to return the scraped data directly in the terminal in a single request, but it only returns the run's settings.

curl "https://api.apify.com/v2/acts/apify~cheerio-scraper/runs?token=" \
  -X POST \
  -H 'Content-Type: application/json' \
  -d '{ "startUrls": [ { "url": "https://example.com/" }, { "url": "https://tonytong.mystrikingly.com/" } ], "linkSelector": "a[href]", "pageFunction": "async function pageFunction(context) { const { $, request, log } = context; const pageTitle = $(\"title\").first().text(); const pageContent = $(\"body\").html(); log.info(\"Page scraped\", { url: request.url, pageTitle }); return { url: request.url, pageTitle, content: pageContent }; }", "proxyConfiguration": { "useApifyProxy": true } }'

Can you check and let me know how to change this query?

mr_apify

4 months ago

I found it: I need to use run-sync-get-dataset-items instead of runs in the URL.
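
For anyone landing here later, this is a rough sketch of the same call from Python, pointed at the run-sync-get-dataset-items endpoint so the scraped items come back in the response body rather than the run's metadata. The input fields mirror the curl command in the question. The APIFY_TOKEN environment variable and the build_request helper are names made up for this example, not part of the Apify API. Building the body with json.dumps also takes care of escaping the double quotes inside pageFunction.

```python
import json
import os
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
ACTOR_ID = "apify~cheerio-scraper"

# Same page function as in the question, kept as a plain string so that
# json.dumps() escapes the inner double quotes when the body is serialized.
PAGE_FUNCTION = """async function pageFunction(context) {
    const { $, request, log } = context;
    const pageTitle = $("title").first().text();
    const pageContent = $("body").html();
    log.info("Page scraped", { url: request.url, pageTitle });
    return { url: request.url, pageTitle, content: pageContent };
}"""


def build_request(token, start_urls):
    """Build (but do not send) the synchronous run request.

    Hypothetical helper for this sketch; it only assembles the URL and body.
    """
    query = urllib.parse.urlencode({"token": token})
    url = f"{API_BASE}/acts/{ACTOR_ID}/run-sync-get-dataset-items?{query}"
    actor_input = {
        "startUrls": [{"url": u} for u in start_urls],
        "linkSelector": "a[href]",
        "pageFunction": PAGE_FUNCTION,
        "proxyConfiguration": {"useApifyProxy": True},
    }
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # APIFY_TOKEN is an assumed variable name; supply your own API token.
    token = os.environ.get("APIFY_TOKEN")
    if token:
        req = build_request(token, ["https://example.com/"])
        # The call blocks until the run finishes, then returns the items.
        with urllib.request.urlopen(req) as resp:
            items = json.load(resp)
        print(json.dumps(items, indent=2))
    else:
        print("Set APIFY_TOKEN to send the request.")
```

Because the synchronous endpoint waits for the run to finish, a long crawl may hit its time limit. In that case, starting the run with the original runs endpoint and reading the dataset afterwards is probably the safer route.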

Pricing

Pricing model

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage.