Tripadvisor Scraper avatar
Tripadvisor Scraper
Try for free

Pay $3.00 for 1,000 results

View all Actors
Tripadvisor Scraper

Tripadvisor Scraper

maxcopell/tripadvisor
Try for free

Pay $3.00 for 1,000 results

This unofficial Tripadvisor API is a data extraction tool able to get data on hotels, restaurants, things to do, vacation rentals, attractions, tours, and public trips. Get pricing, contact details, amenities, awards, ratings, and more. Download your data in Excel, JSON, CSV, and other formats.

Do you want to learn more about this Actor?

Get a demo
CS

Cannot read properties of null

Closed

charmed_shield opened this issue
2 months ago

When running a crawler I see: 2024-07-03T13:58:26.341Z INFO Input validation OK 2024-07-03T13:58:26.396Z INFO Created 1 start request 2024-07-03T13:58:26.698Z INFO CustomRequestsCheerioCrawler: Starting the crawler. 2024-07-03T13:58:26.930Z WARN CustomRequestsCheerioCrawler: Reclaiming failed request back to the list or queue. TypeError: Cannot read properties of null (reading 'metadata') 2024-07-03T13:58:26.931Z at parseRestaurantSearch (file:///home/myuser/dist/parsers/listing-pages/restaurant-search-parser.js:12:31) 2024-07-03T13:58:26.932Z at restaurantSearchRoute (file:///home/myuser/dist/routes/web-listings/restaurant-search-route.js:17:24) 2024-07-03T13:58:26.933Z at func (/home/myuser/node_modules/@crawlee/core/router.js:172:44) 2024-07-03T13:58:26.935Z at CustomRequestsCheerioCrawler.requestHandler (file:///home/myuser/dist/main.js:21:19) 2024-07-03T13:58:26.936Z at process.processTicksAndRejections (node:internal/process/task_queues:95:5) 2024-07-03T13:58:26.937Z at async wrap (/home/myuser/node_modules/@apify/timeout/index.js:52:21) {"id":"AUGNbpBAwkT4qk7","url":"https://www.tripadvisor.com/data/graphql/ids","retryCount":1} And then it dies.

Input is: { "includeAiReviewsSummary": false, "includeAttractions": false, "includeHotels": false, "includeNearbyResults": false, "includePriceOffers": false, "includeRestaurants": true, "includeTags": false, "includeVacationRentals": false, "maxItemsPerQuery": 10000, "startUrls": [ { "url": "https://www.tripadvisor.com/Restaurants-g274684-Czech_Republic.html" } ], "checkInDate": "", "checkOutDate": "", "language": "en", "currency": "USD" }

lhotanok avatar

Hello, thanks for reporting this! We are investigating the issue and we'll let you know when we have any updates.

lhotanok avatar

Hi, it seems there's a change on Tripadvisor's side that causes our scraper failing on top-level categories of restaurant search such as your example: https://www.tripadvisor.com/Restaurants-g274684-Czech_Republic.html. We are working on the fix to support those top-level restaurant categories again.

In the meantime, you can use URLs of subcategories such as Restaurants in Prague and Restaurants in Brno as start URLs instead of the top-level category Restaurants in Czech Republic. Subcategories work correctly thanks to using a different page layout and API calls compared to the top-level categories. See the example run: https://console.apify.com/view/runs/pG3xc7LqO3vupBOmG

lukas.prusa avatar

Hi, I'm happy to inform you that we've just updated the scraper with the fix :)

Try it out and let me know how it works, thanks!

Also, we have some issues currently with users not being able to comment/reopen on closed issues, so I will keep this open for now and feel free to close this if everything has been resolved :) Thanks and happy scraping!

Developer
Maintained by Apify
Actor metrics
  • 284 monthly users
  • 51 stars
  • 96.7% runs succeeded
  • 1.3 days response time
  • Created in Nov 2019
  • Modified about 2 hours ago
Categories