
Cheerio Scraper
No credit card required

Cheerio Scraper
No credit card required
Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using a Node.js code. Supports both recursive crawling and lists of URLs. This actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.

Tried migrating from Web Crawler but `cheerio().map()` doesn't seem to work the same
I got most of the original logic working, but the 2 fields I generate by mapping over elements and returning a property of that element doesn't seem to be working. I can't tell if the mapping is broken or if the element doesn't actually contain a property in this worker.
The fields that are showing up empty are details
and categories
, whereas details was a large block of HTML, categories
was actually a couple of strings and should be trivial

Figured it out. Man the cheerio documentation is kind of a headache. Thanks for pointing this idea out to me, it works great!
Actor Metrics
599 monthly users
-
120 bookmarks
>99% runs succeeded
36 days response time
Created in Apr 2019
Modified 4 months ago