Tripadvisor Reviews Scraper
Pay $2.00 for 1,000 reviews
Tripadvisor Reviews Scraper
Pay $2.00 for 1,000 reviews
Get and download reviews for chosen places on Tripadvisor. Extract the review text, URL, rating, date of travel, published date, basic reviewer info, owner's response, helpful votes, images, review language, place details. Download reviews in XML, JSON, CSV.
Do you want to learn more about this Actor?
Get a demoThe Integration pipeline seems to be not working, the crawler runs for the first actor but fails while trying to run the second one.
Hi Sojern, thanks a lot for reporting this!
Unfortunately, there has been an update on TripAdvisor overnight, that caused the scraper to stop working. The issue is now fixed ;)
Please try rerunning this Actor now and let me know how it works, thanks and happy scraping!
Dear Lucas,
I am writing to follow up on the scraper we discussed. After reviewing it, I believe the process involves calling the API for the maxcopell/tripadvisor integration, which then triggers the maxcopell/tripadvisor-reviews integration.
Once the scraping run is complete, I am unsure which dataset is the most appropriate to utilize. While maxcopell/tripadvisor appears to lack some data fields, maxcopell/tripadvisor-reviews seems to incorporate data from both integrations.
Additionally, I noticed that the user value is consistently null within maxcopell/tripadvisor-reviews. Would you be able to investigate this issue as well?
Thank you for your time and assistance.
Yes, please use the maxcopell/tripadvisor-reviews
output, as that is being run from the integration and merging the data into a single output. The maxcopell/tripadvisor
dataset is basically just an intermediary layer to pass the place data further.
I forgot to mention it, but the integration has its own input, that is not copied over from the Actor. Please review the input for the integration, you most likely have scrapeReviewerInfo
set to false
. https://console.apify.com/actors/dbEyMBriog95Fv8CW/integrations
I hope this helps, thanks!
Hi lucas,
I have checked the actor, scrape reviewer info is set to True, so I guess there is some issue. Can you please help me with this?
Thanks.
Please check the reviews scraper integration input. The input of the Actor can be set different to the integration, so you probably have scrapeReviewerInfo
set incorrectly there.
Alternatively, you can set the integration as an Actor task, but that's not needed here, it would just make things more clear. Actor tasks are just a way of easily storing multiple Actor inputs.
Okay can you show a test run from your end. So that I can follow the same.
Yes, here is an example:
- https://console.apify.com/view/runs/IFRZ4wJ5r5PyOfNI5 - The starting Actor
- https://console.apify.com/view/runs/5iMWb7z4HgfKCPaSb - Reviews scraper integration
Notice the input for the reviews scraper, with scrapeReviewerInfo
set to true
Hi Lukas,
I did try to run the actor in my end for this venue https://www.tripadvisor.com/Hotel_Review-g33252-d82586-Reviews-Montrose_at_Beverly_Hills-West_Hollywood_California.html, below are the attached screenshots and links for the same, If you take a look at the screenshots i have set the ScrapeReviewerInfo to true, but still it returns null for the user. Can you please take a look at it.
- The starting actor https://console.apify.com/organization/v9hqAJiffZGKLkDhd/actors/Hvp4YfFGyLM635Q2F
- Reviews Scraper Integration
Thanks.
Thanks for sharing the runs. The input for the reviews scraper clearly states, that scrapeReviewerInfo
was set to false
. You can review the input for the run yourself:
The input for the Actor inside an integration is not linked with any standalone Actor input. Please update the input for the integration itself, not the Actor.
Hey Lukas,
I really appreciate it. I guess I figured it out right now. Thanks a lot!
You're welcome! I'm glad to hear you figured it out ;)
Hey Lukas,
I'm facing some challenges with the integration, I run the main actor and how do I retrieve the dataset items from the second actor after the run is complete, Because I have tried using the last dataset item endpoint but unfortunately it brings the results of the last run but not the current run. Can you please help me with that?
Hi, if I understand correctly you are starting it via API, right? This gets quite complicated because that requires the access to the dataset of the integration run, but to start it with the first one. Unfortunately, the current API doesn't contain any information about the ran integration, not even the ID... it's kinda stupid.
Assuming the latest run under your profile will be the reviews scraper, you can simply use this endpoint to get the last run and it's dataset ID: https://docs.apify.com/api/v2#/reference/actor-runs/run-collection/get-user-runs-list
Afterward, you can access the data with the dataset ID: https://docs.apify.com/api/v2#/reference/datasets/dataset/get-items
It's not the prettiest solution, but it should get the job done. If you are using either the JS or Python API clients for Apify, it shouldn't take more than a few lines of code ;)
I hope this helps, thanks and happy scraping!
- 366 monthly users
- 41 stars
- 99.4% runs succeeded
- 2.4 days response time
- Created in Jan 2023
- Modified 3 days ago