Cheerio Scraper avatar
Cheerio Scraper

Pricing

Pay per usage

Go to Store
Cheerio Scraper

Cheerio Scraper

apify/cheerio-scraper

Developed by

Apify

Maintained by Apify

Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using a Node.js code. Supports both recursive crawling and lists of URLs. This actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.

4.7 (11)

Pricing

Pay per usage

131

Monthly users

611

Runs succeeded

>99%

Last modified

5 months ago

MA

Is there a way to run the request entirely using query param and no body?

Closed
mr_apify opened this issue
4 months ago

For such a request:

curl "https://api.apify.com/v2/acts/apify~cheerio-scraper/run-sync-get-dataset-items?token=" -X POST -H 'Content-Type: application/json' -d '{ "startUrls": [ { "url": "https://example.com/" }, { "url": "https://tonytong.mystrikingly.com/" } ], "linkSelector": "a[href]", "pageFunction": "async function pageFunction(context) { const { $, request, log } = context; const pageTitle = $("title").first().text(); const pageContent = $("body").html(); log.info("Page scraped", { url: request.url, pageTitle }); return { url: request.url, pageTitle, content: pageContent }; }", "proxyConfiguration": { "useApifyProxy": true }, "waitFor": true }'

Can I replace the body payload with the query param? As I'm trying to link this to a llm tool and its seems to have an issue handling a body.

jindrich.bar avatar

Hello mr_apify,

Thank you for your question! It is not possible to replace the body payload with query parameters for the run-sync-get-dataset-items endpoint. As per the API documentation, the input must be provided in the request body.

If you need a workaround, you can create a Saved Task with prefilled input and use the actor-task-run-sync-get-dataset-items endpoint, as described here:
https://docs.apify.com/api/v2/actor-task-run-sync-get-dataset-items-get.

However, if you want to parameterize your Actor runs dynamically, input cannot be passed via query parameters.

Since this addresses the issue, I’ll be closing this ticket. If you have further questions, feel free to reach out.

Pricing

Pricing model

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage.