Cheerio Scraper avatar
Cheerio Scraper

Pricing

Pay per usage

Go to Store
Cheerio Scraper

Cheerio Scraper

Developed by

Apify

Apify

Maintained by Apify

Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using a Node.js code. Supports both recursive crawling and lists of URLs. This actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.

4.7 (10)

Pricing

Pay per usage

173

Total users

8.9K

Monthly users

896

Runs succeeded

>99%

Issues response

12 days

Last modified

2 months ago

ND

can you scrape this website? https://therealdeal.com/miami/

Closed

andruchiii opened this issue
23 days ago

I would like to get the article link, article publish date and article title. Is it possible?

jindrich.bar avatar

Hi, thanks for your question!

Yes, it's possible to scrape data from https://therealdeal.com/miami/. I tested it using the Cheerio Scraper by extracting the __NEXT_DATA__ element, which contains the article metadata in a structured format.

The Page function should look something like this:

async function pageFunction(context) {
const { $ } = context;
const nextDataScript = $('#__NEXT_DATA__').html();
const nextData = JSON.parse(nextDataScript);
return [
...nextData.props.pageProps.data.editorialPickPosts,
...nextData.props.pageProps.data.posts.nodes
];
}

You can check out my test run here: https://console.apify.com/view/runs/IuBxlWpfxWfPZpItN . Feel free to copy my Actor input and customize it to fit your use case.

I'll close this issue now, but feel free to ask additional questions if you have any. Cheers!