Tripadvisor Reviews Scraper

Pay $2.00 for 1,000 reviews

maxcopell/tripadvisor-reviews
Get and download reviews for chosen places on Tripadvisor. Extract the review text, URL, rating, date of travel, published date, basic reviewer info, owner's response, helpful votes, images, review language, place details. Download reviews in XML, JSON, CSV.

Integration Issue

Closed

tasty_providence opened this issue
6 months ago

The integration pipeline does not seem to be working. The crawler runs for the first Actor but fails when it tries to run the second one.

lukas.prusa

Hi Sojern, thanks a lot for reporting this!

Unfortunately, an overnight update on Tripadvisor caused the scraper to stop working. The issue is now fixed ;)

Please try rerunning this Actor now and let me know how it works, thanks and happy scraping!

tasty_providence-owner

6 months ago

Dear Lucas,

I am writing to follow up on the scraper we discussed. After reviewing it, I believe the process involves calling the API for the maxcopell/tripadvisor integration, which then triggers the maxcopell/tripadvisor-reviews integration.

Once the scraping run is complete, I am unsure which dataset is the most appropriate to utilize. While maxcopell/tripadvisor appears to lack some data fields, maxcopell/tripadvisor-reviews seems to incorporate data from both integrations.

Additionally, I noticed that the user value is consistently null within maxcopell/tripadvisor-reviews. Would you be able to investigate this issue as well?

Thank you for your time and assistance.

lukas.prusa

Yes, please use the maxcopell/tripadvisor-reviews output, as that Actor is run from the integration and merges the data into a single output. The maxcopell/tripadvisor dataset is basically just an intermediary layer that passes the place data further.

I forgot to mention it, but the integration has its own input, which is not copied over from the Actor. Please review the input for the integration; you most likely have scrapeReviewerInfo set to false there. https://console.apify.com/actors/dbEyMBriog95Fv8CW/integrations

I hope this helps, thanks!

tasty_providence-owner

6 months ago

Hi lucas,

I have checked the Actor, and scrapeReviewerInfo is set to true, so I guess there is some other issue. Can you please help me with this?

Thanks.

lukas.prusa

Please check the reviews scraper integration input. The Actor's own input can differ from the integration's input, so you probably have scrapeReviewerInfo set incorrectly there.

Alternatively, you can set up the integration as an Actor task, but that's not needed here; it would just make things clearer. Actor tasks are just a way of easily storing multiple Actor inputs.

tasty_providence-owner

6 months ago

Okay, can you show a test run from your end so that I can follow the same steps?

lukas.prusa

Yes, here is an example:

Notice the input for the reviews scraper, with scrapeReviewerInfo set to true.

tasty_providence-owner

6 months ago

Hi Lukas,

I did try to run the Actor on my end for this venue: https://www.tripadvisor.com/Hotel_Review-g33252-d82586-Reviews-Montrose_at_Beverly_Hills-West_Hollywood_California.html. Below are the attached screenshots and links. As the screenshots show, I have set scrapeReviewerInfo to true, but it still returns null for the user. Can you please take a look?

links: https://console.apify.com/organization/v9hqAJiffZGKLkDhd/actors/dbEyMBriog95Fv8CW/runs/FfCQcRvlNMauumYsH

Thanks.

lukas.prusa

Thanks for sharing the runs. The input for the reviews scraper run clearly shows that scrapeReviewerInfo was set to false. You can review the input for the run yourself:

The input for the Actor inside an integration is not linked with any standalone Actor input. Please update the input for the integration itself, not the Actor.
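
For readers who want to check this without the console, here is a minimal sketch (not part of the original reply) that reads the input a given run actually received. It assumes the Apify API v2 stores a run's input under the INPUT key of the run's default key-value store; the helper names (`fetch_json`, `get_run_input`, `reviewer_info_enabled`) and the placeholder run ID and token are hypothetical.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def fetch_json(url, token):
    """GET a URL from the Apify API and decode the JSON body."""
    request = urllib.request.Request(
        url, headers={"Authorization": f"Bearer {token}"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def get_run_input(run_id, token, fetch=fetch_json):
    """Return the input that the given Actor run was started with."""
    # Run details come wrapped in a top-level "data" object.
    run = fetch(f"{API_BASE}/actor-runs/{run_id}", token)["data"]
    store_id = run["defaultKeyValueStoreId"]
    # Assumption: the run input lives under the INPUT key of that store.
    return fetch(f"{API_BASE}/key-value-stores/{store_id}/records/INPUT", token)


def reviewer_info_enabled(run_input):
    """True only if scrapeReviewerInfo is explicitly set to true."""
    return run_input.get("scrapeReviewerInfo") is True


if __name__ == "__main__":
    # Hypothetical placeholders: use the ID of the reviews scraper run that
    # the integration started, not the run of the first Actor.
    run_input = get_run_input("<REVIEWS_RUN_ID>", "<APIFY_TOKEN>")
    print("scrapeReviewerInfo enabled:", reviewer_info_enabled(run_input))
```

If this prints False for the run started by the integration, the setting needs to be changed in the integration's input, as described above.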

tasty_providence-owner

6 months ago

Hey Lukas,

I really appreciate it. I guess I figured it out right now. Thanks a lot!

lukas.prusa

You're welcome! I'm glad to hear you figured it out ;)

tasty_providence-owner

6 months ago

Hey Lukas,

I'm facing some challenges with the integration. I run the main Actor, but how do I retrieve the dataset items from the second Actor after the run is complete? I have tried using the last run dataset items endpoint, but it returns the results of the previous run, not the current one. Can you please help me with that?

lukas.prusa

Hi, if I understand correctly, you are starting it via the API, right? This gets quite complicated, because you need access to the dataset of the integration run but you only start the first Actor. Unfortunately, the current API doesn't return any information about the integration run that was triggered, not even its ID... it's kinda stupid.

Assuming the latest run under your profile is the reviews scraper, you can simply use this endpoint to get the last run and its dataset ID: https://docs.apify.com/api/v2#/reference/actor-runs/run-collection/get-user-runs-list

Afterward, you can access the data with the dataset ID: https://docs.apify.com/api/v2#/reference/datasets/dataset/get-items

It's not the prettiest solution, but it should get the job done. If you are using either the JS or Python API clients for Apify, it shouldn't take more than a few lines of code ;)
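
The two steps above can be sketched roughly as follows, using only the Python standard library and the raw API v2 endpoints linked in this reply. This is an illustration under assumptions, not the maintainer's code: the helper names are hypothetical, and filtering by `actId` and by the first run's `startedAt` timestamp is an added guard against picking up a previous run.

```python
import json
import urllib.parse
import urllib.request
from datetime import datetime

API_BASE = "https://api.apify.com/v2"


def fetch_json(url, token):
    """GET a URL from the Apify API and decode the JSON body."""
    request = urllib.request.Request(
        url, headers={"Authorization": f"Bearer {token}"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def parse_time(value):
    """Parse an ISO 8601 timestamp such as 2024-05-01T10:00:00.000Z."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def pick_latest_run(runs, actor_id, started_after):
    """Pick the newest run of actor_id that started at or after started_after."""
    threshold = parse_time(started_after)
    candidates = [
        run for run in runs
        if run["actId"] == actor_id and parse_time(run["startedAt"]) >= threshold
    ]
    return max(candidates, key=lambda run: parse_time(run["startedAt"]), default=None)


def get_reviews_for_current_run(token, reviews_actor_id, started_after, fetch=fetch_json):
    """Return dataset items of the matching reviews run, or None if not ready."""
    # Step 1: list the user's most recent runs, newest first.
    query = urllib.parse.urlencode({"desc": 1, "limit": 20})
    runs = fetch(f"{API_BASE}/actor-runs?{query}", token)["data"]["items"]
    run = pick_latest_run(runs, reviews_actor_id, started_after)
    if run is None or run["status"] != "SUCCEEDED":
        return None  # not started or not finished yet; the caller can retry
    # Step 2: read the items from that run's default dataset.
    dataset_id = run["defaultDatasetId"]
    return fetch(f"{API_BASE}/datasets/{dataset_id}/items?format=json", token)
```

Passing the `startedAt` value of the first Actor's run as `started_after` should keep older reviews runs out of the result. Because the integration run starts only after the first Actor finishes, the function may return None for a while, so the caller would need to poll until the reviews run has succeeded.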

I hope this helps, thanks and happy scraping!

Developer
Maintained by Apify
Actor metrics
  • 366 monthly users
  • 41 stars
  • 99.4% runs succeeded
  • 2.4 days response time
  • Created in Jan 2023
  • Modified 3 days ago