Content Checker

jakubbalada/content-checker

Monitor a website or web page for content changes. Automatically saves before and after screenshots and sends an email notification when content changes are detected.
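
The change detection described above can be pictured as comparing the text extracted on one run with the text stored from the previous run. The following is only a rough sketch of that idea, not the Actor's actual implementation; the function names `fingerprint` and `has_changed` are hypothetical, and ignoring whitespace differences is an assumption made for the example.

```python
import hashlib


def fingerprint(text: str) -> str:
    """Return a SHA-256 digest of the text with runs of whitespace collapsed."""
    normalized = " ".join(text.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def has_changed(previous_text: str, current_text: str) -> bool:
    """Return True when the monitored content differs between two runs."""
    return fingerprint(previous_text) != fingerprint(current_text)
```

Storing only the digest between runs would be enough to detect a change, although keeping the full text is what makes a before/after comparison possible.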

Can this scraper monitor that a new blog post has been published?

Closed

expansive_wire opened this issue
a month ago

Hi, I'm getting failures when I try to run the Actor on this company's blog page. I'd like to monitor the page and be alerted every time a new blog post is published (and then scrape the new post).

Can you help me understand?

2024-07-26T04:13:01.746Z ACTOR: Pulling Docker image of build l8oQqJT6pAaxvYUZq from repository.
2024-07-26T04:13:20.248Z ACTOR: Creating Docker container.
2024-07-26T04:13:21.060Z ACTOR: Starting Docker container.
2024-07-26T04:13:21.586Z Starting X virtual framebuffer using: Xvfb :99 -ac -screen 0 1920x1080x24+32 -nolisten tcp
2024-07-26T04:13:21.589Z Executing main command
2024-07-26T04:13:22.483Z INFO System info {"apifyVersion":"3.1.15","apifyClientVersion":"2.8.4","crawleeVersion":"3.7.3","osType":"Linux","nodeVersion":"v18.19.0"}
2024-07-26T04:13:23.214Z INFO PuppeteerCrawler: Starting the crawler.
2024-07-26T04:13:26.736Z INFO Page loaded with title: Pure Storage Blogs | Digitally Transform With Data | Pure Storage on URL: https://blog.purestorage.com/
2024-07-26T04:13:26.737Z INFO Sleeping 5s ...
2024-07-26T04:13:31.758Z INFO Saving screenshot...
2024-07-26T04:13:32.526Z WARN Failed to extract the content, either the content selector is wrong or page layout changed. Check the full screenshot.
2024-07-26T04:13:32.704Z INFO PuppeteerCrawler: All requests from the queue have been processed, the crawler will shut down.
2024-07-26T04:13:33.169Z INFO PuppeteerCrawler: Final request statistics: {"requestsFinished":1,"requestsFailed":0,"retryHistogram":[1],"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":9278,"requestsFinishedPerMinute":6,"requestsFailedPerMinute":0,"requestTotalDurationMillis":9278,"requestsTotal":1,"crawlerRuntimeMillis":10102}
2024-07-26T04:13:33.170Z INFO PuppeteerCrawler: Finished! Total 1 requests: 1 succeeded, 0 failed. {"terminal":true}
2024-07-26T04:13:33.397Z
2024-07-26T04:13:33.398Z node:internal/process/esm_loader:40
2024-07-26T04:13:33.398Z internalBinding('errors').triggerUncaughtException(
2024-07-26T04:13:33.399Z ^
2024-07-26T04:13:33.399Z Cannot get screenshot (screenshot selector is probably wrong).
2024-07-26T04:13:33.400Z Made screenshot of the full page instead:
2024-07-26T04:13:33.400Z https://api.apify.com/v2/key-value-stores/yqPcShFFuoFWD51BH/records/fullpageScreenshot.png
2024-07-26T04:13:33.403Z (Use node --trace-uncaught ... to show where the exception was thrown)
2024-07-26T04:13:33.403Z
2024-07-26T04:13:33.403Z Node.js v18.19.0

paja

Hi,

thanks for reaching out. We'll look into it and let you know what can be done.

lukaskrivka

Hello,

The selector you chose, "article", is not the correct one because no such element exists on the page. You can use, for example, ".wp-block-query-is-layout-flow ul" instead.
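
One way to catch this kind of problem before a run is to check whether a selector matches anything in the page's HTML. The sketch below uses only Python's standard library and supports just two selector shapes: a bare tag such as "article", and a tag nested under a class, such as ".wp-block-query-is-layout-flow ul". The helper name `count_matches` and the sample markup in the usage note are hypothetical, and this is not how the Actor itself evaluates selectors.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they are not pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class _MatchCounter(HTMLParser):
    """Counts <tag> elements, optionally only those under an ancestor class."""

    def __init__(self, tag, ancestor_class=None):
        super().__init__()
        self.tag = tag
        self.ancestor_class = ancestor_class
        self.stack = []  # (tag name, carries ancestor_class?) per open element
        self.matches = 0

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            inside = any(flag for _, flag in self.stack)
            if self.ancestor_class is None or inside:
                self.matches += 1
        if tag not in VOID_TAGS:
            classes = (dict(attrs).get("class") or "").split()
            self.stack.append((tag, self.ancestor_class in classes))

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, tolerating unclosed ones.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break


def count_matches(html, tag, ancestor_class=None):
    """Return how many <tag> elements match, optionally under ancestor_class."""
    counter = _MatchCounter(tag, ancestor_class)
    counter.feed(html)
    counter.close()
    return counter.matches
```

For a hypothetical snippet such as `<div class="wp-block-query-is-layout-flow"><ul><li>Post</li></ul></div>`, `count_matches(html, "article")` returns 0, while `count_matches(html, "ul", "wp-block-query-is-layout-flow")` returns 1. A count of zero suggests the selector would fail in the same way as in the log above.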

Developer
Maintained by Apify
Actor metrics
  • 161 monthly users
  • 34 stars
  • 60.9% runs succeeded
  • 5.4 days response time
  • Created in May 2018
  • Modified 3 months ago