Perplexity.AI Actor avatar

Perplexity.AI Actor

Under maintenance
Try for free

3 days trial then $30.00/month - No credit card required now

View all Actors
This Actor is under maintenance.

This Actor may be unreliable while under maintenance. Would you like to try a similar Actor instead?

See alternative Actors
Perplexity.AI Actor

Perplexity.AI Actor

jons/perplexity-actor
Try for free

3 days trial then $30.00/month - No credit card required now

Use the Perplexity.ai Scraper to extract information with AI. For example: "Find hotels in Prague that offer free breakfast and have a nightly rate under 1000 CZK." Export the results into a structured dataset.

DN

Fails very often and takes quite awhile 2-4 minutes each run.

Open

demonstrative_nomad opened this issue
a month ago

2024-08-22T07:36:35.988Z INFO PuppeteerCrawler:Statistics: PuppeteerCrawler request statistics: {"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":null,"requestsFinishedPerMinute":0,"requestsFailedPerMinute":0,"requestTotalDurationMillis":0,"requestsTotal":0,"crawlerRuntimeMillis":60690,"retryHistogram":[]} 2024-08-22T07:36:36.052Z INFO PuppeteerCrawler:AutoscaledPool: state {"currentConcurrency":1,"desiredConcurrency":1,"systemStatus":{"isSystemIdle":true,"memInfo":{"isOverloaded":false,"limitRatio":0.2,"actualRatio":0},"eventLoopInfo":{"isOverloaded":false,"limitRatio":0.6,"actualRatio":0},"cpuInfo":{"isOverloaded":false,"limitRatio":0.4,"actualRatio":0.173},"clientInfo":{"isOverloaded":false,"limitRatio":0.3,"actualRatio":0}}} 2024-08-22T07:36:39.733Z WARN PuppeteerCrawler: Reclaiming failed request back to the list or queue. Timeout Error: waiting for function failed: timeout of 20000ms exceeded. 2024-08-22T07:36:39.735Z {"id":"ZotIN5GPCzaMgM8","url":"https://www.perplexity.ai/search?q=Evalute%20the%20outcome%20of%20the%20following%20statement%3A%20Will%20the%20SCOTUS%20affirmative%20action%20ruling%20result%20in%20a%20universal%20ban%3F%2C%20made%20in%202023-07-10","retryCount":1} 2024-08-22T07:37:33.514Z WARN PuppeteerCrawler: Reclaiming failed request back to the list or queue. Timeout Error: waiting for function failed: timeout of 20000ms exceeded. 2024-08-22T07:37:33.517Z {"id":"ZotIN5GPCzaMgM8","url":"https://www.perplexity.ai/search?q=Evalute%20the%20outcome%20of%20the%20following%20statement%3A%20Will%20the%20SCOTUS%20affirmative%20action%20ruling%20result%20in%20a%20universal%20ban%3F%2C%20made%20in%202023-07-10","retryCount":2} 2024-08-22T07:37:35.988Z INFO PuppeteerCrawler:Statistics: PuppeteerCrawler request statistics: {"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":null,"requestsFinishedPerMinute":0,"requestsFailedPerMinute":0,"requestTotalDurationMillis":0,"requestsTotal":0,"crawlerRuntimeMillis":120690,"retryHistogram":[]} 2024-08-22T07:37:36.057Z INFO PuppeteerCrawler:AutoscaledPool: state {"currentConcurrency":1,"desiredConcurrency":1,"systemStatus":{"isSystemIdle":true,"memInfo":{"isOverloaded":false,"limitRatio":0.2,"actualRatio":0},"eventLoopInfo":{"isOverloaded":false,"limitRatio":0.6,"actualRatio":0.039},"cpuInfo":{"isOverloaded":false,"limitRatio":0.4,"actualRatio":0},"clientInfo":{"isOverloaded":false,"limitRatio":0.3,"actualRatio":0}}} 2024-08-22T07:37:57.555Z WARN Puppeteer Utils: An error occurred during the script injection! {"error":{"name":"TypeError","message":"Cannot read properties of undefined (reading 'createElement')","stack":"TypeError: Cannot read properties of undefined (reading 'createElement')\n at assert (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :978:19)\n at evaluate (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :2694:23)\n at evaluate (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :2722:3)\n at evaluate (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :32:2)\n at evaluate (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :36:3)\n at #evaluate (/home/myuser/node_modules... [line-too-long] 2024-08-22T07:38:26.449Z WARN PuppeteerCrawler: Reclaiming failed request back to the list or queue. Navigation timeout of 30000 ms exceeded 2024-08-22T07:38:26.452Z at CrawlerSetup._requestHandler (file:///home/myuser/dist/internals/crawler_setup.js:549:20) {"id":"ZotIN5GPCzaMgM8","url":"https://www.perplexity.ai/search?q=Evalute%20the%20outcome%20of%20the%20following%20statement%3A%20Will%20the%20SCOTUS%20affirmative%20action%20ruling%20result%20in%20a%20universal%20ban%3F%2C%20made%20in%202023-07-10","retryCount":3} 2024-08-22T07:38:36.044Z INFO PuppeteerCrawler:Statistics: PuppeteerCrawler request statistics: {"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":null,"requestsFinishedPerMinute":0,"requestsFailedPerMinute":0,"requestTotalDurationMillis":0,"requestsTotal":0,"crawlerRuntimeMillis":180746,"retryHistogram":[]} 2024-08-22T07:38:36.145Z INFO PuppeteerCrawler:AutoscaledPool: state {"currentConcurrency":1,"desiredConcurrency":1,"systemStatus":{"isSystemIdle":true,"memInfo":{"isOverloaded":false,"limitRatio":0.2,"actualRatio":0},"eventLoopInfo":{"isOverloaded":false,"limitRatio":0.6,"actualRatio":0.057},"cpuInfo":{"isOverloaded":false,"limitRatio":0.4,"actualRatio":0.137},"clientInfo":{"isOverloaded":false,"limitRatio":0.3,"actualRatio":0}}} 2024-08-22T07:39:22.581Z ERROR PuppeteerCrawler: Request failed and reached maximum retries. Error: Timeout Error: waiting for function failed: timeout of 20000ms exceeded. 2024-08-22T07:39:22.584Z at Context._waitForFunction (:227:31) 2024-08-22T07:39:22.586Z at async Object.pageFunction (:64:5) 2024-08-22T07:39:22.588Z at async pptr:evaluate;CrawlerSetup._requestHandler%20(file%3A%2F%2F%2Fhome%2Fmyuser%2Fdist%2Finternals%2Fcrawler_setup.js%3A596%3A35):6:45 {"id":"ZotIN5GPCzaMgM8","url":"https://www.perplexity.ai/search?q=Evalute%20the%20outcome%20of%20the%20following%20statement%3A%20Will%20the%20SCOTUS%20affirmative%20action%20ruling%20result%20in%20a%20universal%20ban%3F%2C%20made%20in%202023-07-10","method":"GET","uniqueKey":"GET():https://www.perplexity.ai/search?q=Evalute+the+outcome+of+the+following+statement%3A+Will+the+SCOTUS+affirmative+action+ruling+result+in+a+universal+ban%3F%2C+made+in+2023-07-10"} 2024-08-22T07:39:22.590Z ERROR Request https://www.perplexity.ai/search?q=Evalute%20the%20outcome%20of%20the%20following%20statement%3A%20Will%20the%20SCOTUS%20affirmative%20action%20ruling%20result%20in%20a%20universal%20ban%3F%2C%20made%20in%202023-07-10 failed and will not be retried anymore. Marking as failed. 2024-08-22T07:39:22.592Z Last Error Message: Error: Timeout Error: waiting for function failed: timeout of 20000ms exceeded.

DN

demonstrative_nomad

a month ago

retrying the same query by running the actor again works, but hoping for a better solution thats more reliable and ideally faster too. as cost incur for failing is not low as well.

jons avatar

Jon (jons)

a month ago

Thanks for report the issue! Seem they change their site, just now I fixed it. Please try again with latest version 0.0.24.

DN

demonstrative_nomad

a month ago

Thanks it works better now, runs are now ~1 min+. Most of my initial runs work but at the end there's some weird behavior where it claims multiple results are being returned but viewing it only shows 1. there are a few failures near the end as well not sure why,

Here's an example of one of the fails below 2024-08-22T18:24:06.523Z ACTOR: Pulling Docker image of build HMr6kFCNLcnClmFPF from repository. 2024-08-22T18:24:08.559Z ACTOR: Creating Docker container. 2024-08-22T18:24:12.155Z ACTOR: Starting Docker container. 2024-08-22T18:24:13.535Z Starting X virtual framebuffer using: Xvfb :99 -ac -screen 0 1920x1080x24+32 -nolisten tcp 2024-08-22T18:24:13.541Z Executing main command 2024-08-22T18:24:14.534Z INFO System info {"apifyVersion":"3.2.4","apifyClientVersion":"2.9.4","crawleeVersion":"3.11.1","osType":"Linux","nodeVersion":"v18.20.4"} 2024-08-22T18:24:18.459Z INFO Configuring Web Scraper. 2024-08-22T18:24:42.187Z WARN 2024-08-22T18:24:42.188Z ***************************************************************** 2024-08-22T18:24:42.190Z * Web Scraper is running in DEVELOPMENT MODE! * 2024-08-22T18:24:42.192Z * Concurrency is limited, sessionPool is not available, * 2024-08-22T18:24:42.193Z * timeouts are increased and debugger is enabled. * 2024-08-22T18:24:42.195Z * If you want full control and performance switch * 2024-08-22T18:24:42.196Z * Run type to PRODUCTION! * 2024-08-22T18:24:42.197Z ***************************************************************** 2024-08-22T18:24:42.198Z 2024-08-22T18:24:42.199Z INFO Configuration completed. Starting the scrape. 2024-08-22T18:24:42.561Z INFO PuppeteerCrawler: Starting the crawler. 2024-08-22T18:24:43.564Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:24:53.562Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:24:59.587Z devtools-server starting. 2024-08-22T18:24:59.589Z devtools-server listening on port: 4321 2024-08-22T18:25:03.607Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:25:13.591Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:25:23.598Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:25:33.603Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:25:42.562Z INFO PuppeteerCrawler:Statistics: PuppeteerCrawler request statistics: {"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":null,"requestsFinishedPerMinute":0,"requestsFailedPerMinute":0,"requestTotalDurationMillis":0,"requestsTotal":0,"crawlerRuntimeMillis":62662,"retryHistogram":[]} 2024-08-22T18:25:43.605Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:25:43.699Z INFO PuppeteerCrawler:AutoscaledPool: state {"currentConcurrency":1,"desiredConcurrency":1,"systemStatus":{"isSystemIdle":true,"memInfo":{"isOverloaded":false,"limitRatio":0.2,"actualRatio":0},"eventLoopInfo":{"isOverloaded":false,"limitRatio":0.6,"actualRatio":0.079},"cpuInfo":{"isOverloaded":false,"limitRatio":0.4,"actualRatio":0.368},"clientInfo":{"isOverloaded":false,"limitRatio":0.3,"actualRatio":0}}} 2024-08-22T18:25:53.488Z WARN Puppeteer Utils: An error occurred during the script injection! {"error":{"name":"TypeError","message":"Cannot read properties of undefined (reading 'createElement')","stack":"TypeError: Cannot read properties of undefined (reading 'createElement')\n at assert (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :978:19)\n at evaluate (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :2694:23)\n at evaluate (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :2722:3)\n at evaluate (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :32:2)\n at evaluate (evaluate at /home/myuser/node_modules/@crawlee/puppeteer/internals/utils/puppeteer_utils.js:83:14, :36:3)\n at #evaluate (/home/myuser/node_modules... [line-too-long] 2024-08-22T18:25:53.617Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:26:03.610Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:26:23.609Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:26:23.915Z INFO PuppeteerCrawler: All requests from the queue have been processed, the crawler will shut down. 2024-08-22T18:26:33.325Z INFO PuppeteerCrawler: Final request statistics: {"requestsFinished":1,"requestsFailed":0,"retryHistogram":[1],"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":83168,"requestsFinishedPerMinute":1,"requestsFailedPerMinute":0,"requestTotalDurationMillis":83168,"requestsTotal":1,"crawlerRuntimeMillis":113425} 2024-08-22T18:26:33.326Z INFO PuppeteerCrawler: Finished! Total 1 requests: 1 succeeded, 0 failed. {"terminal":true} 2024-08-22T18:26:33.609Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:26:34.326Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:26:34.328Z INFO Web Scraper finished.

2024-08-22T18:21:56.533Z ACTOR: Pulling Docker image of build HMr6kFCNLcnClmFPF from repository. 2024-08-22T18:22:16.275Z ACTOR: Creating Docker container. 2024-08-22T18:22:23.205Z ACTOR: Starting Docker container. 2024-08-22T18:22:24.626Z Starting X virtual framebuffer using: Xvfb :99 -ac -screen 0 1920x1080x24+32 -nolisten tcp 2024-08-22T18:22:24.627Z Executing main command 2024-08-22T18:22:26.556Z INFO System info {"apifyVersion":"3.2.4","apifyClientVersion":"2.9.4","crawleeVersion":"3.11.1","osType":"Linux","nodeVersion":"v18.20.4"} 2024-08-22T18:22:27.326Z INFO Configuring Web Scraper. 2024-08-22T18:22:39.090Z WARN 2024-08-22T18:22:39.092Z ***************************************************************** 2024-08-22T18:22:39.093Z * Web Scraper is running in DEVELOPMENT MODE! * 2024-08-22T18:22:39.095Z * Concurrency is limited, sessionPool is not available, * 2024-08-22T18:22:39.096Z * timeouts are increased and debugger is enabled. * 2024-08-22T18:22:39.097Z * If you want full control and performance switch * 2024-08-22T18:22:39.098Z * Run type to PRODUCTION! * 2024-08-22T18:22:39.099Z ***************************************************************** 2024-08-22T18:22:39.102Z 2024-08-22T18:22:39.103Z INFO Configuration completed. Starting the scrape. 2024-08-22T18:22:39.225Z INFO PuppeteerCrawler: Starting the crawler. 2024-08-22T18:22:41.047Z devtools-server starting. 2024-08-22T18:22:41.049Z devtools-server listening on port: 4321 2024-08-22T18:22:50.226Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:23:10.226Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:23:20.226Z WARN PuppeteerCrawler: Setting status message timed out after 1s 2024-08-22T18:23:29.890Z INFO PuppeteerCrawler: All requests from the queue have been processed, the crawler will shut down. 2024-08-22T18:23:31.008Z INFO PuppeteerCrawler: Final request statistics: {"requestsFinished":1,"requestsFailed":0,"retryHistogram":[1],"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":48477,"requestsFinishedPerMinute":1,"requestsFailedPerMinute":0,"requestTotalDurationMillis":48477,"requestsTotal":1,"crawlerRuntimeMillis":54274} 2024-08-22T18:23:31.009Z INFO PuppeteerCrawler: Finished! Total 1 requests: 1 succeeded, 0 failed. {"terminal":true} 2024-08-22T18:23:31.733Z INFO Web Scraper finished.

jons avatar

Jon (jons)

24 days ago

Hi! Could you please give the inputs that make the tasks failed, so that I can check the issue easily.

jons avatar

Jon (jons)

24 days ago

I just do some test and it work fine. Some tips:

  1. Add multiple search terms into one task: this will run faster then do one by one. Remember to increase timeout if need
  2. Always using residential proxy
DN

demonstrative_nomad

23 days ago

I'll will try multiple search terms and see how it goes. Thanks!

Developer
Maintained by Community
Actor metrics
  • 6 monthly users
  • 4 stars
  • 76.5% runs succeeded
  • 13 days response time
  • Created in Aug 2024
  • Modified 27 days ago
Categories