Puppeteer Scraper

Pricing

Pay per usage

apify/puppeteer-scraper

Developed by

Apify

Maintained by Apify

Crawls websites with headless Chrome and the Puppeteer library using provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and lists of URLs. Supports logging in to websites.

5.0 (5)

116

Monthly users: 605

Runs succeeded: >99%

Response time: 30 days

Last modified: 10 months ago

How can I exclude the start URL from the request queue list?

Closed
pizicai36 opened this issue
2 years ago

I set a request queue name, and now all URLs are saved in the request queue list. How can I exclude the start URL from the request queue list?

Andrey_Bykov

Hey there! I don't quite understand what you are trying to achieve. Could you please elaborate?

pizicai36

2 years ago

When I set a request queue name, all URLs are saved in the request queue list. When I run a new task, it shows that the URL has already been processed, so I can't get the new URLs and new data. All the new URLs (detail page URLs) are on the start page, so how can I exclude the start URL from the request queue list?

Andrey_Bykov

I think for your use case, you can just leave the request queue name empty. This way, each run will use the default request queue, which will be empty at the beginning of each run.
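
To illustrate the behavior described above, here is a minimal sketch of request queue deduplication. It is a toy in-memory model, not the Apify or Puppeteer Scraper API: the `ToyRequestQueue` class, the `run` function, and the example URL are all hypothetical and exist only to show why a queue that is reused across runs skips the start URL, while a fresh queue per run does not.

```typescript
// Toy in-memory model of request queue deduplication.
// NOT the Apify API: ToyRequestQueue and run() are hypothetical names.
class ToyRequestQueue {
  private handled = new Set<string>();
  private pending: string[] = [];

  // Returns true if the URL was newly enqueued, false if it is already known.
  add(url: string): boolean {
    if (this.handled.has(url) || this.pending.includes(url)) return false;
    this.pending.push(url);
    return true;
  }

  // Takes the next pending URL (if any) and marks it as handled.
  next(): string | undefined {
    const url = this.pending.shift();
    if (url !== undefined) this.handled.add(url);
    return url;
  }
}

// One "run": enqueue the start URL, then process everything that is pending.
function run(queue: ToyRequestQueue, startUrl: string): string[] {
  queue.add(startUrl);
  const processed: string[] = [];
  let url: string | undefined;
  while ((url = queue.next()) !== undefined) processed.push(url);
  return processed;
}

const START_URL = "https://example.com/list"; // hypothetical start page

// A named queue is modeled as one object shared by both runs.
const named = new ToyRequestQueue();
const namedFirst = run(named, START_URL);
const namedSecond = run(named, START_URL); // start URL is already handled

// The default queue is modeled as a fresh object for every run.
const defaultFirst = run(new ToyRequestQueue(), START_URL);
const defaultSecond = run(new ToyRequestQueue(), START_URL);

console.log({ namedFirst, namedSecond, defaultFirst, defaultSecond });
```

In this model, the second run on the shared queue processes nothing because the start URL is already marked as handled, which matches the "already processed" symptom reported above. With a fresh queue per run, the start URL is processed every time.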

pizicai36

2 years ago

Now I leave the request queue name empty, but when I run it again, it shows an error: "All requests from the queue have been processed, the crawler will shut down". I can confirm there are new URLs on the start page.

adamek

I see your latest runs are getting some results, did you find the problem yourself?

pizicai36

2 years ago

It's OK now, thanks.

Pricing

Pricing model

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage.