Tripadvisor Scraper avatar
Tripadvisor Scraper
Try for free

Pay $3.00 for 1,000 results

View all Actors
Tripadvisor Scraper

Tripadvisor Scraper

maxcopell/tripadvisor
Try for free

Pay $3.00 for 1,000 results

This unofficial Tripadvisor API is a data extraction tool able to get data on hotels, restaurants, things to do, vacation rentals, attractions, tours, and public trips. Get pricing, contact details, amenities, awards, ratings, and more. Download your data in Excel, JSON, CSV, and other formats.

Do you want to learn more about this Actor?

Get a demo
LL

Failed to crawl tripadvisor restaurant

Closed

allangan85 opened this issue
3 months ago

hi it keep saidl 403 failed as per see in the log.

2024-05-28T07:29:37.003Z ACTOR: Pulling Docker image of build Az1gxcFUgtybwOIRe from repository. 2024-05-28T07:29:37.110Z ACTOR: Creating Docker container. 2024-05-28T07:29:37.528Z ACTOR: Starting Docker container. 2024-05-28T07:29:41.222Z INFO System info {"apifyVersion":"3.1.15","apifyClientVersion":"2.8.4","crawleeVersion":"3.7.2","osType":"Linux","nodeVersion":"v18.20.3"} 2024-05-28T07:29:41.927Z INFO Input validation OK 2024-05-28T07:29:42.002Z INFO Created 1 start request 2024-05-28T07:29:42.218Z INFO CustomRequestsCheerioCrawler: Starting the crawler. 2024-05-28T07:29:43.289Z INFO CustomRequestsCheerioCrawler: Found location 'Singapore' for location 'japanese food singapore' {} 2024-05-28T07:29:46.582Z WARN CustomRequestsCheerioCrawler: Reclaiming failed request back to the list or queue. Blocked. Received status code 403. Retrying... 2024-05-28T07:29:46.585Z at sendCustomRequest (file:///usr/src/app/dist/hooks/send-custom-request.js:34:15) {"id":"8xN8bk4jR7nvnWj","url":"https://www.tripadvisor.com/Tourism-gundefined","retryCount":1} 2024-05-28T07:29:55.010Z WARN CustomRequestsCheerioCrawler: Reclaiming failed request back to the list or queue. Blocked. Received status code 403. Retrying... 2024-05-28T07:29:55.013Z at sendCustomRequest (file:///usr/src/app/dist/hooks/send-custom-request.js:34:15) {"id":"8xN8bk4jR7nvnWj","url":"https://www.tripadvisor.com/Tourism-gundefined","retryCount":2} 2024-05-28T07:30:02.066Z WARN CustomRequestsCheerioCrawler: Reclaiming failed request back to the list or queue. Blocked. Received status code 403. Retrying... 2024-05-28T07:30:02.068Z at sendCustomRequest (file:///usr/src/app/dist/hooks/send-custom-request.js:34:15) {"id":"8xN8bk4jR7nvnWj","url":"https://www.tripadvisor.com/Tourism-gundefined","retryCount":3} 2024-05-28T07:30:08.973Z INFO CustomRequestsCheerioCrawler: Opened tourism location search page {"url":"https://www.tripadvisor.com/Tourism-gundefined"} 2024-05-28T07:30:09.265Z WARN No relevant location types found for the tourism location search page. {"url":"https://www.tripadvisor.com/Tourism-gundefined","inputQueryOrUrl":"japanese food singapore","foundNonRelevantUrls":{"hotelsUrl":null,"restaurantsUrl":null,"attractionsUrl":null,"vacationRentalsUrl":null}} 2024-05-28T07:30:09.337Z INFO CustomRequestsCheerioCrawler: All requests from the queue have been processed, the crawler will shut down. 2024-05-28T07:30:09.669Z INFO CustomRequestsCheerioCrawler: Final request statistics: {"requestsFinished":2,"requestsFailed":0,"retryHistogram":[1,null,null,1],"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":2555,"requestsFinishedPerMinute":4,"requestsFailedPerMinute":0,"requestTotalDurationMillis":5109,"requestsTotal":2,"crawlerRuntimeMillis":27637} 2024-05-28T07:30:09.672Z INFO CustomRequestsCheerioCrawler: Finished! Total 2 requests: 2 succeeded, 0 failed. {"terminal":true}

lukas.prusa avatar

Hi Allan, thanks a lot for opening this issue!

I'm happy to inform you, that we've just updated the scraper with the fix ;)

There was a bug in the location search query crawling logic for less common locations.

Try it out now and let me know how it works, thanks and happy scraping!

Developer
Maintained by Apify
Actor metrics
  • 285 monthly users
  • 51 stars
  • 96.7% runs succeeded
  • 1.3 days response time
  • Created in Nov 2019
  • Modified about 3 hours ago
Categories