GetYourGuide Review Scraper

Pricing: from $3.00 / 1,000 results

The GetYourGuide Review Scraper is an Apify Actor that collects structured review data from GetYourGuide product pages, for use cases such as customer sentiment analysis, competitor feedback monitoring, and travel content enrichment.

Rating: 5.0 (1)

Developer: Louvre LLC (Maintained by Community)

Actor stats: 0 bookmarked · 58 total users · 15 monthly active users · issues response in 16 days · last modified 10 days ago

🧳🌍 GetYourGuide Review Scraper

📌 Introduction

GetYourGuide Review Scraper is an Apify Actor that collects structured review data from GetYourGuide product pages. It extracts ratings, review text, author information, photos, and other metadata, which makes it suitable for sentiment analysis, competitor feedback monitoring, travel content enrichment, and research workflows.


🎯 What is this Review Scraper?

This Actor automatically scrapes customer reviews from a specified GetYourGuide product URL and outputs them as structured JSON. It supports flexible pagination, reliable de-duplication, and produces data that's ready for analysis or integration into your systems.


🔍 Features

  • ⭐ Comprehensive Review Data: Extracts ratings, review text, author name & location, photos, and helpful votes.
  • 🔁 Flexible Pagination: Supports custom starting page and page size.
  • 🔀 Multiple Sorting Options: Reviews can be processed in different orders (date, rating, etc.).
  • 🚀 High Performance: Efficiently handles scraping of large volumes of reviews.
  • 🧹 Reliable Output: Includes duplicate detection and data validation.
  • 🔌 Easy Integration: Simple input parameters and structured output.

🧰 Input Parameters

| Parameter | Type | Required | Description |
|---|---|---|---|
| product_url | String | ⚠️ | The GetYourGuide product page URL to scrape reviews from. Either product_url or product_urls must be provided. |
| product_urls | Array[String] | ⚠️ | Array of GetYourGuide product URLs for batch processing (1-200 URLs). Either product_url or product_urls must be provided. |
| page_size | Number | ❌ | Maximum number of reviews per page (default: 100, range: 1-1000). |
| page | Number | ❌ | Start page number; will skip (page - 1) * page_size reviews from the beginning (default: 1). |
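The constraints in the table can be checked on the client side before starting a run. The following is a minimal sketch, not the Actor's own validation code: the helper names validate_input and reviews_skipped are hypothetical, and it assumes that supplying both product_url and product_urls is acceptable, which the table does not state.

```python
from typing import Any

MAX_URLS = 200        # documented upper bound for product_urls
MAX_PAGE_SIZE = 1000  # documented upper bound for page_size


def validate_input(actor_input: dict[str, Any]) -> dict[str, Any]:
    """Check the documented constraints and fill in the documented defaults."""
    url = actor_input.get("product_url")
    urls = actor_input.get("product_urls")
    if not url and not urls:
        raise ValueError("Either product_url or product_urls must be provided")
    if urls is not None and not 1 <= len(urls) <= MAX_URLS:
        raise ValueError(f"product_urls must contain 1-{MAX_URLS} URLs")

    page_size = actor_input.get("page_size", 100)  # documented default
    page = actor_input.get("page", 1)              # documented default
    if not 1 <= page_size <= MAX_PAGE_SIZE:
        raise ValueError(f"page_size must be in the range 1-{MAX_PAGE_SIZE}")
    if page < 1:
        raise ValueError("page must be 1 or greater")
    return {**actor_input, "page_size": page_size, "page": page}


def reviews_skipped(page: int, page_size: int) -> int:
    """Number of reviews skipped from the beginning: (page - 1) * page_size."""
    return (page - 1) * page_size
```

For example, page 2 with a page_size of 200 skips the first 200 reviews, and page 1 skips none.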

📥 Example Input

Single URL

{
    "product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879",
    "page_size": 100,
    "page": 1
}

Batch Processing (Multiple URLs)

{
    "product_urls": [
        "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879",
        "https://www.getyourguide.com/paris-l16/eiffel-tower-skip-the-line-ticket-summit-by-lift-t56967"
    ],
    "page_size": 100,
    "page": 1
}

This configuration will scrape up to 100 reviews from the first page of the specified GetYourGuide product(s).


🗃️ Output Data

Each review returned by this Actor includes the following structured fields:

| Field | Type | Description |
|---|---|---|
| id | String | Unique review identifier. |
| review_id | String | Original GetYourGuide review ID. |
| product_id | String | GetYourGuide product identifier. |
| product_url | String | Product URL used for scraping this review. |
| rating | Number | Review rating (1–5 stars). |
| title | String | Review title. |
| content | String | Full review text. |
| author_name | String | Reviewer's name. |
| user_identifier_value | String | Author user identifier. |
| review_date | String | Date when the review was published (ISO 8601 format). |
| helpful_votes | Number | Number of helpful votes. |
| photos | Array | Array of photo URLs (if any). |
| owner_response | String | Business owner response text (if any). |
| owner_response_author | String | Response author name (if any). |
| owner_response_date | String | Response publication date (ISO 8601 format, if any). |
| source | String | Data source identifier (always "gyg"). |
| language | String | Review language code. |
| overall_star_rating | Number | Overall product star rating (e.g. 4.5 out of 5). -1 if unavailable. |
| overall_review_count | Number | Total number of reviews for the product. -1 if unavailable. |
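When consuming the dataset, it can help to convert the -1 sentinels and the ISO 8601 date strings into native values before analysis. The sketch below is one way to do that under stated assumptions: normalize_review and parse_iso are hypothetical helper names, and the exact ISO 8601 variant the Actor emits (for example, whether timestamps end in "Z") is not documented here, so the parser handles that case defensively.

```python
from datetime import datetime
from typing import Any, Optional


def parse_iso(value: Optional[str]) -> Optional[datetime]:
    """Parse an ISO 8601 string; return None for missing or empty values."""
    if not value:
        return None
    # Older Python versions reject a trailing "Z", so map it to an explicit offset.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def normalize_review(raw: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of a review item with sentinels and dates converted."""
    review = dict(raw)
    # -1 is documented as "unavailable" for these two product-level fields.
    for key in ("overall_star_rating", "overall_review_count"):
        if review.get(key) == -1:
            review[key] = None
    review["review_date"] = parse_iso(review.get("review_date"))
    review["owner_response_date"] = parse_iso(review.get("owner_response_date"))
    review["photos"] = review.get("photos") or []
    return review


# Made-up item for illustration only; not real GetYourGuide data.
sample = {
    "review_id": "example-1",
    "rating": 5,
    "review_date": "2024-05-01T10:30:00Z",
    "overall_star_rating": -1,
    "overall_review_count": 1200,
}
normalized = normalize_review(sample)
```

Replacing -1 with None keeps the sentinel out of averages and sums in downstream analysis.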

🧠 How It Works

  1. Input Validation: Validates the provided GetYourGuide product URL.
  2. Review Extraction: Scrapes reviews from the product page using advanced web scraping logic.
  3. Data Processing: Cleans and processes the extracted review data.
  4. Duplicate Detection: Identifies and handles duplicates.
  5. Data Storage: Saves the structured data to the Apify dataset for retrieval.
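The duplicate detection step is not described in detail, so the following is only an illustration of the general technique, not the Actor's implementation. It assumes that a review is uniquely identified by the pair (product_id, review_id) from the output fields; dedupe_reviews is a hypothetical helper that could also be applied when merging the results of several runs.

```python
from typing import Any, Iterable, Iterator


def dedupe_reviews(reviews: Iterable[dict[str, Any]]) -> Iterator[dict[str, Any]]:
    """Yield each review once, keeping the first occurrence of every key."""
    seen: set[tuple[str, str]] = set()
    for review in reviews:
        # Assumed identity key; the Actor may use a different one.
        key = (review.get("product_id", ""), review.get("review_id", ""))
        if key in seen:
            continue
        seen.add(key)
        yield review


# Made-up items for illustration only.
items = [
    {"product_id": "p1", "review_id": "r1", "rating": 5},
    {"product_id": "p1", "review_id": "r2", "rating": 4},
    {"product_id": "p1", "review_id": "r1", "rating": 5},  # repeat of the first item
]
unique_items = list(dedupe_reviews(items))
```

Keeping the first occurrence preserves the order in which reviews were scraped.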

⚙️ Usage Examples

📦 Basic Usage (Single URL)

{
    "product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879"
}

📈 With Pagination

{
    "product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879",
    "page_size": 200,
    "page": 2
}

🔄 Batch Processing (Multiple URLs)

{
    "product_urls": [
        "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879",
        "https://www.getyourguide.com/paris-l16/eiffel-tower-skip-the-line-ticket-summit-by-lift-t56967",
        "https://www.getyourguide.com/rome-l33/colosseum-skip-the-line-tickets-with-access-to-arena-t65680"
    ],
    "page_size": 100,
    "page": 1
}
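Because product_urls accepts at most 200 URLs, a longer list has to be split across several runs. The sketch below shows one way to build those inputs. build_batch_inputs is a hypothetical helper, and dropping repeated URLs before splitting is a design choice of this example, not documented Actor behavior.

```python
from typing import Any


def build_batch_inputs(
    urls: list[str],
    page_size: int = 100,
    page: int = 1,
    max_urls: int = 200,  # documented limit for product_urls
) -> list[dict[str, Any]]:
    """Split a URL list into Actor inputs of at most max_urls URLs each."""
    unique = list(dict.fromkeys(urls))  # drop repeated URLs, keep original order
    return [
        {
            "product_urls": unique[start:start + max_urls],
            "page_size": page_size,
            "page": page,
        }
        for start in range(0, len(unique), max_urls)
    ]


# Placeholder URLs for illustration only; they do not point at real products.
example_urls = [f"https://www.getyourguide.com/example-l1/tour-t{i}" for i in range(450)]
batches = build_batch_inputs(example_urls)
```

With 450 distinct URLs this yields three inputs holding 200, 200, and 50 URLs, each of which can be submitted as a separate run.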

💰 Pricing & Limits

This Actor costs $3.00 per 1,000 results with a free trial available.
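As a rough budgeting aid, the per-result price can be turned into an estimate for a planned run. This sketch assumes a flat rate of $3.00 per 1,000 results with no tiering and no free-trial credit, which may not match actual billing; estimate_cost is a hypothetical helper.

```python
from decimal import Decimal


def estimate_cost(results: int, price_per_1000: Decimal = Decimal("3.00")) -> Decimal:
    """Estimated cost in USD for a given number of results, rounded to cents."""
    return (Decimal(results) * price_per_1000 / 1000).quantize(Decimal("0.01"))
```

Under that assumption, 2,500 results come to 2,500 × 3.00 / 1,000 = $7.50.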


🚀 Deploying to Apify

To push this Actor to the Apify platform, use the Apify CLI:

Prerequisites

  1. Install Apify CLI (if not already installed):
$ npm install -g apify-cli
  2. Log in to Apify:
$ apify login

Push Command

Navigate to the actor directory and push:

cd gyg-review-scraper
apify push

This will:

  • Build the Actor code
  • Upload it to your Apify account
  • Make it available in the Apify platform

Push and Set as Latest Version

To push and automatically set the version as latest:

$ apify push -b latest --force

This command will:

  • Build and push the Actor
  • Tag the resulting build as latest (-b sets the build tag)
  • Make it the build used by default when running the Actor
  • Push even if the local files are older than those on the platform (--force)

Alternative: Push with Specific Actor ID

If you want to push to a specific Actor ID:

$ apify push <YOUR_ACTOR_ID>

Or push with Actor ID and set as latest:

$ apify push <YOUR_ACTOR_ID> -b latest --force

Build and Push Separately

You can also build and push separately:

# Build the Actor
npm run build
# Push to Apify
apify push

Modifying the Version

To change the Actor version before pushing:

  1. Edit the version in .actor/actor.json:
{
    "version": "1.3"
}

Note: Apify requires version format to be MAJOR.MINOR (e.g., 1.3), where MAJOR and MINOR are numbers from 0 to 99. Three-part versions like 1.3.1 are not supported.

  2. Push with the new version:
$ apify push

Or push with a specific version directly:

$ apify push --version 1.3

Note: The version in actor.json will be used by default. The --version flag expects a MAJOR.MINOR number; to tag the pushed build as latest, use -b latest instead.

📡 API / CLI Integration

You can run this Actor using the Apify API or CLI. Example with curl:

curl "https://api.apify.com/v2/acts/louvre~gyg-review-scraper/runs?token=<YOUR_API_TOKEN>" \
    -X POST \
    -H "Content-Type: application/json" \
    -d @input.json

Replace <YOUR_API_TOKEN> with your Apify token and input.json with your JSON input containing the fields above.
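The same request can be issued from Python using only the standard library. This is a minimal sketch that mirrors the curl call: the endpoint and Actor ID are taken from that example, while build_run_request and start_run are hypothetical helper names. Building the request is kept separate from sending it so the request can be inspected without network access.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
ACTOR_ID = "louvre~gyg-review-scraper"  # taken from the curl example


def build_run_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request that starts an Actor run."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        url=f"{API_BASE}/acts/{ACTOR_ID}/runs?{query}",
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def start_run(token: str, actor_input: dict) -> dict:
    """Send the request and return the parsed JSON response body."""
    request = build_run_request(token, actor_input)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


if __name__ == "__main__":
    # Inspect the request without sending it; "<YOUR_API_TOKEN>" is a placeholder.
    req = build_run_request("<YOUR_API_TOKEN>", {"product_url": "<PRODUCT_URL>"})
    print(req.get_method(), req.full_url)
```

Calling start_run requires a valid token and performs a real network request, so it will start a billable run.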



🏷️ Keywords

review API · sentiment analysis · JSON export · CSV download · travel data · customer reviews · web scraping · review scraper · Apify Actor


💡 Found this useful? Bookmark it for future reference.