Tripadvisor Reviews Scraper

Extract Tripadvisor reviews, ratings & guest feedback at scale. Perfect for market research, competitor benchmarking, reputation monitoring & travel industry intelligence. Scrape thousands of reviews instantly.

Pricing: Pay per usage
Rating: 5.0 (1)
Developer: Shahid Irfan (maintained by Community)
Actor stats: 0 bookmarked, 3 total users, 1 monthly active user
Last modified: 3 days ago

Extract TripAdvisor hotel user reviews into a clean dataset for research, analytics, and travel intelligence workflows. The actor is API-based: it retrieves reviews page by page, removes null values, and filters out duplicates.

Features

  • TripAdvisor review extraction — Collect review records from a hotel page URL.
  • Duplicate-safe output — Automatically removes duplicate review items.
  • Null-free dataset — Skips empty or null fields from every output item.
  • Pagination support — Retrieves reviews page-by-page using API offset and limit.
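
The null cleanup and duplicate filtering listed above could look roughly like the following. This is a minimal sketch, not the actor's actual code: the helper names are hypothetical, and using review_id as the deduplication key is an assumption based on the output fields documented on this page.

```python
from typing import Any, Iterable, Iterator


def drop_empty_fields(item: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of item without None, empty-string, or empty-container values."""
    # Zero is kept on purpose: helpful_votes == 0 is real data, not a missing value.
    return {
        key: value
        for key, value in item.items()
        if value is not None and value != "" and value != [] and value != {}
    }


def unique_reviews(items: Iterable[dict[str, Any]]) -> Iterator[dict[str, Any]]:
    """Yield cleaned items, skipping any whose review_id was already seen."""
    seen: set[str] = set()
    for item in items:
        cleaned = drop_empty_fields(item)
        review_id = cleaned.get("review_id")
        if review_id is None:
            # Assumption: an item without an ID cannot be deduplicated, so keep it.
            yield cleaned
            continue
        if review_id in seen:
            continue
        seen.add(review_id)
        yield cleaned
```

Keeping the first occurrence of each review_id is a design choice of this sketch; the page does not say which copy the actor keeps.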

Use Cases

Reputation Monitoring

Track rating changes and review trends for a hotel over time. Build recurring snapshots to compare service perception month by month.

Competitive Benchmarking

Compare ranking, review volume, and guest sentiment between competing hotels in the same city.

Travel Market Research

Create structured datasets of guest feedback to identify frequent complaints, high-performing amenities, and seasonal patterns.

BI and Dashboards

Export review data to analytics tools and visualize rating distribution, helpful-vote trends, and review cadence.


Input Parameters

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| startUrl | String | Yes | Sheraton Istanbul sample URL | TripAdvisor hotel review page URL. |
| results_wanted | Integer | No | 20 | Maximum number of reviews to collect. |
| max_pages | Integer | No | 5 | Safety cap for pagination operations. |
| proxyConfiguration | Object | No | Residential proxy preset | Proxy settings for reliability and anti-blocking resilience. |
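
To illustrate how results_wanted and max_pages might interact, the sketch below plans the (offset, limit) pairs for a run and stops at whichever cap is reached first. The function name and the page_size value are hypothetical; the page does not document the actual page size the actor requests.

```python
def page_offsets(
    results_wanted: int = 20,
    max_pages: int = 5,
    page_size: int = 10,  # assumption: the real page size is not documented
) -> list[tuple[int, int]]:
    """Return (offset, limit) pairs, stopping at whichever cap is hit first."""
    if page_size < 1:
        raise ValueError("page_size must be at least 1")
    requests: list[tuple[int, int]] = []
    offset = 0
    while offset < results_wanted and len(requests) < max_pages:
        # The last page may be shorter so the total never exceeds results_wanted.
        limit = min(page_size, results_wanted - offset)
        requests.append((offset, limit))
        offset += limit
    return requests
```

Under these assumptions, asking for 100 results with max_pages left at 5 would stop after five pages, which is why the extended example raises max_pages along with results_wanted.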

Environment variable required:

  • TRIPADVISOR_API_KEY — TripAdvisor Content API key used by the actor.
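
A fail-fast check for this variable might look like the sketch below. The function name and error message are hypothetical; only the variable name TRIPADVISOR_API_KEY comes from this page.

```python
import os
from typing import Mapping, Optional


def require_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return TRIPADVISOR_API_KEY, or raise a clear error if it is unset or blank."""
    # Accepting a mapping makes the check easy to test without touching os.environ.
    source = os.environ if env is None else env
    key = source.get("TRIPADVISOR_API_KEY", "").strip()
    if not key:
        raise RuntimeError("TRIPADVISOR_API_KEY is not set in the runtime environment")
    return key
```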

Output Data

Each dataset item contains only non-empty fields. The actor outputs review records.

| Field | Type | Description |
| --- | --- | --- |
| item_type | String | Record type: review. |
| review_id | String | Unique review identifier. |
| location_id | String | TripAdvisor location identifier. |
| location_name | String | Hotel name. |
| rating | Number | Review rating or overall location rating. |
| title | String | Review headline. |
| review_text | String | Full review content. |
| published_date | String | Review publication date. |
| stay_date | String | Stay/trip date when available. |
| trip_type | String | Trip category (for example family, business). |
| helpful_votes | Number | Helpful vote count. |
| user_name | String | Reviewer display name. |
| user_location | String | Reviewer location. |
| source_url | String | Source TripAdvisor URL. |

Usage Examples

Basic Run

{
  "startUrl": "https://www.tripadvisor.com/Hotel_Review-g293974-d14930175-Reviews-Sheraton_Istanbul_City_Center-Istanbul.html",
  "results_wanted": 20
}

Extended Pagination

{
  "startUrl": "https://www.tripadvisor.com/Hotel_Review-g293974-d14930175-Reviews-Sheraton_Istanbul_City_Center-Istanbul.html",
  "results_wanted": 100,
  "max_pages": 20,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
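
When inputs like the two above are generated from a script, a small builder can keep them consistent. The sketch below is an illustration only: build_run_input is a hypothetical helper, and the defaults simply mirror the Input Parameters section.

```python
import json
from typing import Any


def build_run_input(
    start_url: str,
    results_wanted: int = 20,
    max_pages: int = 5,
    residential_proxy: bool = False,
) -> dict[str, Any]:
    """Build an input object using the parameter names documented on this page."""
    if results_wanted < 1 or max_pages < 1:
        raise ValueError("results_wanted and max_pages must be positive integers")
    run_input: dict[str, Any] = {
        "startUrl": start_url,
        "results_wanted": results_wanted,
        "max_pages": max_pages,
    }
    if residential_proxy:
        # Same proxy object as in the extended pagination example.
        run_input["proxyConfiguration"] = {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        }
    return run_input


if __name__ == "__main__":
    url = (
        "https://www.tripadvisor.com/Hotel_Review-g293974-d14930175-Reviews-"
        "Sheraton_Istanbul_City_Center-Istanbul.html"
    )
    print(json.dumps(build_run_input(url, 100, 20, residential_proxy=True), indent=2))
```

The resulting object can be pasted into the actor's JSON input, or passed as the run input when starting the actor through an API client.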

Sample Output

{
  "item_type": "review",
  "review_id": "123456789",
  "location_id": "14930175",
  "location_name": "Sheraton Istanbul City Center",
  "rating": 5,
  "title": "Excellent stay in Istanbul",
  "review_text": "Great service, clean rooms, and very convenient location.",
  "published_date": "2026-03-12",
  "stay_date": "2026-03",
  "trip_type": "Couples",
  "helpful_votes": 3,
  "user_name": "TravelExplorer",
  "user_location": "London, United Kingdom",
  "source_url": "https://www.tripadvisor.com/Hotel_Review-g293974-d14930175-Reviews-Sheraton_Istanbul_City_Center-Istanbul.html"
}
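
For the reputation monitoring and dashboard use cases, items shaped like the sample above can be summarized with a few lines of Python. This is a sketch under two assumptions: published_date is an ISO-style YYYY-MM-DD string, as in the sample, and any field may be absent because empty fields are omitted from the output. The function name is hypothetical.

```python
from collections import Counter
from statistics import mean
from typing import Any


def summarize_reviews(items: list[dict[str, Any]]) -> dict[str, Any]:
    """Compute the rating distribution and the average rating per publication month."""
    ratings = [item["rating"] for item in items if "rating" in item]
    by_month: dict[str, list[float]] = {}
    for item in items:
        date = item.get("published_date")
        if date and "rating" in item:
            # Assumption: the first seven characters are the YYYY-MM month key.
            by_month.setdefault(date[:7], []).append(item["rating"])
    return {
        "count": len(items),
        "rating_distribution": dict(Counter(ratings)),
        "average_rating": mean(ratings) if ratings else None,
        "monthly_average": {month: mean(vals) for month, vals in sorted(by_month.items())},
    }
```

Items without a rating still count toward the total but are left out of the averages, so a sparse record does not skew the result.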

Tips For Best Results

Start With 20 Results

Set results_wanted to 20 for quick validation runs. Small runs finish faster and are easier to check.

Prefer Residential Proxies

Residential proxy routing improves reliability for protected pages and reduces blocked runs.

Use Stable Hotel URLs

Use canonical TripAdvisor hotel URLs containing the -d<locationId>- segment to ensure reliable ID extraction.
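
A URL can be checked for that segment before a run with a short regular expression. The helper below is a hypothetical sketch, not the actor's own parser; it only assumes the -d<locationId>- pattern described above.

```python
import re
from typing import Optional

# Matches the numeric ID in a "-d<locationId>-" segment of a hotel URL.
_LOCATION_ID = re.compile(r"-d(\d+)-")


def extract_location_id(url: str) -> Optional[str]:
    """Return the location ID as a string, or None if the segment is missing."""
    match = _LOCATION_ID.search(url)
    return match.group(1) if match else None
```

Applied to the Sheraton Istanbul sample URL used on this page, it returns "14930175", which matches the location_id in the sample output.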


Integrations

  • Google Sheets — Share hotel review datasets with non-technical teams.
  • Airtable — Build searchable review repositories.
  • Make — Automate review ingestion pipelines.
  • Zapier — Trigger notifications on fresh run data.
  • Webhooks — Push new records into custom backend services.

Export Formats

  • JSON — Best for APIs and downstream processing.
  • CSV — Best for spreadsheet analysis.
  • Excel — Best for reports and stakeholder sharing.
  • XML — Best for legacy integrations.

Frequently Asked Questions

Why does the run fail with a missing API key error?

This actor uses the TripAdvisor reviews API directly and requires TRIPADVISOR_API_KEY in the runtime environment.

Does this actor remove duplicate reviews?

Yes. Duplicate entries are filtered before writing to the dataset.

Are null values included in output?

No. Empty and null fields are removed automatically from every saved item.

Can I scrape multiple hotels?

This actor is optimized for one hotel per run. Schedule multiple runs for multiple hotels.

How do I reduce blocked runs?

Enable proxy routing with residential IPs and keep result limits reasonable.


Support

For issues or feature requests, use the Apify Console issue/support channels for this actor.


This actor is intended for legitimate data collection and analytics workflows. You are responsible for compliance with applicable laws, platform terms, and internal usage policies.