Cheerio Scraper
Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using Node.js code. Supports both recursive crawling and lists of URLs. This Actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript to render their content.
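As a rough illustration of the "extracts data using Node.js code" step, here is a minimal page function sketch. It assumes the Actor passes a context object containing `$` (Cheerio loaded with the fetched HTML) and `request` (the request that produced the page); the selectors and the returned field names are illustrative only.

```javascript
// Page function sketch for Cheerio Scraper.
// Assumption: the Actor calls this once per fetched page and passes a
// context object containing `$` and `request`.
async function pageFunction(context) {
    const { $, request } = context;

    // No browser is involved: `$` only sees the HTML returned by the raw
    // HTTP request, so content rendered by client-side JavaScript is absent.
    const title = $('title').text().trim();
    const heading = $('h1').first().text().trim();

    // Illustrative record shape; the field names are not prescribed.
    return { url: request.url, title, heading };
}
```

Because the function only reads from `$`, it can be exercised locally with any HTML string loaded into Cheerio before pasting it into the Actor's input.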
I got most of the original logic working, but the two fields I generate by mapping over elements and returning a property of each element don't seem to be working. I can't tell whether the mapping is broken or whether the elements don't actually contain that property in this worker.
The fields that are showing up empty are details and categories. Whereas details was a large block of HTML, categories was actually a couple of strings and should be trivial.
Figured it out. Man, the Cheerio documentation is kind of a headache. Thanks for pointing this idea out to me, it works great!
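The thread does not record what the actual fix was. One common cause of empty mapped fields in Cheerio, offered here only as a possibility, is treating `.map()` like `Array.prototype.map`: Cheerio passes `(index, element)` to the callback, the element is a bare node rather than a Cheerio object, and the result must be converted with `.get()`. The sketch below uses hypothetical selectors (`.category`, `.details`) and a hypothetical helper name.

```javascript
// Hypothetical selectors: `.category` and `.details` are placeholders, not
// taken from the site discussed in the thread.
function extractFields($) {
    // Cheerio's .map() passes (index, element), jQuery-style, which is the
    // reverse of Array.prototype.map's (element, index).
    // `el` is a bare node, so wrap it with $(el) before calling .text().
    // .map() returns a Cheerio object; .get() converts it to a plain array.
    const categories = $('.category')
        .map((index, el) => $(el).text().trim())
        .get();

    // .html() returns the inner HTML of the first matched element, or null
    // when nothing matched.
    const details = $('.details').html();

    return { categories, details };
}
```

Inside a page function this could be called as `extractFields(context.$)`. Reading a DOM-style property such as `el.innerText` on the bare node would yield `undefined`, which would match the empty fields described above.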
Actor Metrics
442 monthly users
93 stars
>99% runs succeeded
28 days response time
Created in Apr 2019
Modified 2 months ago