GetYourGuide Review Scraper

Pricing

from $3.00 / 1,000 results

The GetYourGuide Review Scraper is a powerful and easy-to-use actor designed to collect structured review data from GetYourGuide product pages, whether you're analyzing customer sentiment, monitoring competitor feedback, or enriching your travel content.

Rating: 5.0 (1)

Developer: Louvre LLC

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 45
  • Monthly active users: 23
  • Issues response: 12 days
  • Last modified: 15 days ago

🧳🌍 GetYourGuide Review Scraper

📌 Introduction

GetYourGuide Review Scraper is a powerful and easy-to-use Apify Actor that collects structured review data from GetYourGuide product pages. It allows you to extract ratings, review text, author information, photos, and other metadata, which makes it well suited to sentiment analysis, competitor feedback monitoring, travel content enrichment, and research workflows.


🎯 What is this Review Scraper?

This Actor automatically scrapes customer reviews from a specified GetYourGuide product URL and outputs them as structured JSON. It supports flexible pagination, reliable de-duplication, and produces data that's ready for analysis or integration into your systems.


🔍 Features

  • ⭐ Comprehensive Review Data: Extracts ratings, review text, author name and location, photos, and helpful votes.
  • 🔁 Flexible Pagination: Supports a custom starting page and page size.
  • 🔀 Multiple Sorting Options: Reviews can be processed in different orders (date, rating, etc.).
  • 🚀 High Performance: Efficiently handles scraping of large volumes of reviews.
  • 🧹 Reliable Output: Includes duplicate detection and data validation.
  • 🔌 Easy Integration: Simple input parameters and structured output.

🧰 Input Parameters

| Parameter | Type | Required | Description |
|---|---|---|---|
| product_url | String | ✅ | The GetYourGuide product page URL to scrape reviews from. |
| page_size | Number | ❌ | Maximum number of reviews per page (default: 100). |
| page | Number | ❌ | Start page number; skips (page - 1) * page_size reviews from the beginning (default: 1). |

📥 Example Input

{
  "product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879",
  "page_size": 100,
  "page": 1
}

This configuration will scrape up to 100 reviews from the first page of the specified GetYourGuide product.
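
The skip rule from the parameter table can be checked with a little arithmetic. The sketch below is illustrative only: `review_window` is a hypothetical helper, not part of the Actor, and it assumes the Actor applies `(page - 1) * page_size` exactly as described.

```python
def review_window(page: int = 1, page_size: int = 100) -> tuple[int, int]:
    """Return (skipped, last): how many reviews are skipped, and the
    1-based position of the last review the requested page can hold."""
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be positive integers")
    skipped = (page - 1) * page_size  # rule stated in the parameter table
    return skipped, skipped + page_size


print(review_window(1, 100))  # (0, 100)
print(review_window(2, 200))  # (200, 400)
```

With the defaults nothing is skipped; with `page_size` 200 and `page` 2, the first 200 reviews are skipped.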


🗃️ Output Data

Each review returned by this Actor includes the following structured fields:

| Field | Type | Description |
|---|---|---|
| id | String | Unique review identifier. |
| review_id | String | Original GetYourGuide review ID. |
| product_id | String | GetYourGuide product identifier. |
| product_url | String | Product URL used for scraping this review. |
| rating | Number | Review rating (1–5 stars). |
| title | String | Review title. |
| content | String | Full review text. |
| author_name | String | Reviewer's name. |
| user_identifier_value | String | Author user identifier. |
| review_date | String | Date when the review was published (ISO 8601 format). |
| helpful_votes | Number | Number of helpful votes. |
| photos | Array | Array of photo URLs (if any). |
| owner_response | String | Business owner response text (if any). |
| owner_response_author | String | Response author name (if any). |
| owner_response_date | String | Response publication date (ISO 8601 format, if any). |
| source | String | Data source identifier (always "gyg"). |
| language | String | Review language code. |
| overall_star_rating | Number | Overall product star rating (e.g. 4.5 out of 5). -1 if unavailable. |
| overall_review_count | Number | Total number of reviews for the product. -1 if unavailable. |
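
When consuming these records, the `-1` sentinel is easy to mistake for a real value. The following is a minimal sketch of one way to handle it in downstream code; the helper names and the sample records are made up for illustration and use only a few of the documented fields.

```python
from statistics import mean
from typing import Optional


def overall_rating(review: dict) -> Optional[float]:
    """Return the product's overall star rating, or None when the
    record carries the -1 sentinel (or the field is absent)."""
    value = review.get("overall_star_rating", -1)
    return None if value == -1 else float(value)


def average_rating(reviews: list[dict]) -> Optional[float]:
    """Average the per-review `rating` field, ignoring records without one."""
    ratings = [r["rating"] for r in reviews if isinstance(r.get("rating"), (int, float))]
    return mean(ratings) if ratings else None


# Made-up sample records, not real Actor output.
sample = [
    {"review_id": "a1", "rating": 5, "overall_star_rating": 4.5},
    {"review_id": "b2", "rating": 4, "overall_star_rating": -1},
]
print(average_rating(sample))
print(overall_rating(sample[1]))
```

Mapping the sentinel to `None` keeps it out of any averages or charts computed later.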

🧠 How It Works

  1. Input Validation: Validates the provided GetYourGuide product URL.
  2. Review Extraction: Scrapes reviews from the product page using advanced web scraping logic.
  3. Data Processing: Cleans and processes the extracted review data.
  4. Duplicate Detection: Identifies and handles duplicates.
  5. Data Storage: Saves the structured data to the Apify dataset for retrieval.
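
The Actor's own duplicate-detection logic is not documented here. If you merge results from several runs yourself, a similar effect can be approximated client-side. The sketch below assumes `review_id` is a stable key, and `dedupe_reviews` is a hypothetical helper rather than anything the Actor exposes.

```python
def dedupe_reviews(reviews: list[dict]) -> list[dict]:
    """Keep the first occurrence of each review_id, preserving order.
    Records without a review_id are kept as-is, since they cannot be compared."""
    seen: set[str] = set()
    unique: list[dict] = []
    for review in reviews:
        key = review.get("review_id")
        if key is not None:
            if key in seen:
                continue  # already kept an earlier copy of this review
            seen.add(key)
        unique.append(review)
    return unique


# Made-up records standing in for two overlapping runs.
merged = [{"review_id": "a1"}, {"review_id": "b2"}, {"review_id": "a1"}]
print(len(dedupe_reviews(merged)))
```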

⚙️ Usage Examples

📦 Basic Usage

{
  "product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879"
}

📈 Advanced Usage (Pagination)

{
  "product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879",
  "page_size": 200,
  "page": 2
}
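
To walk through every page of a product, one input per page is needed. The sketch below generates them from a known review total, for example the `overall_review_count` reported by an earlier run. `page_inputs` is a hypothetical helper, and the number of reviews that can actually be retrieved may differ from that count.

```python
import math


def page_inputs(product_url: str, total_reviews: int, page_size: int = 100) -> list[dict]:
    """Build one Actor input per page needed to cover `total_reviews`."""
    if total_reviews < 0:
        # Covers the -1 "unavailable" sentinel used by overall_review_count.
        raise ValueError("total_reviews is unknown or negative")
    if page_size < 1:
        raise ValueError("page_size must be a positive integer")
    pages = math.ceil(total_reviews / page_size)
    return [
        {"product_url": product_url, "page_size": page_size, "page": page}
        for page in range(1, pages + 1)
    ]


url = "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879"
for run_input in page_inputs(url, total_reviews=450, page_size=200):
    print(run_input["page"], run_input["page_size"])
```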

💰 Pricing & Limits

This Actor costs $1.00 per 1,000 results with a free trial available.


🚀 Deploying to Apify

To push this Actor to the Apify platform, use the Apify CLI:

Prerequisites

  1. Install the Apify CLI (if not already installed):
npm install -g apify-cli
  2. Log in to Apify:
apify login

Push Command

Navigate to the actor directory and push:

cd gyg-review-scraper
apify push

This will:

  • Build the Actor code
  • Upload it to your Apify account
  • Make it available in the Apify platform

Push and Set as Latest Version

To push and automatically set the version as latest:

apify push -b latest --force

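This command will:

  • Build and push the Actor
  • Tag the resulting build as latest (-b latest)
  • Push even if the files on the platform are newer than your local copy (--force)
  • Make that build the one used by default when running the Actor, assuming runs use the latest build tag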

Alternative: Push with Specific Actor ID

To push to a specific Actor, pass its ID as an argument:

apify push <YOUR_ACTOR_ID>

Or push with an Actor ID and set as latest:

apify push <YOUR_ACTOR_ID> -b latest --force

Build and Push Separately

You can also build and push separately:

# Build the Actor
npm run build
# Push to Apify
apify push

Modifying the Version

To change the Actor version before pushing:

  1. Edit the version in .actor/actor.json:
{
  "version": "1.3"
}

Note: Apify requires version format to be MAJOR.MINOR (e.g., 1.3), where MAJOR and MINOR are numbers from 0 to 99. Three-part versions like 1.3.1 are not supported.

  2. Push with the new version:
apify push

Or push with a specific version directly:

apify push --version 1.3

Note: The version in actor.json will be used by default. If you specify --version latest, it will set the pushed version as latest regardless of the version in actor.json.
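
The MAJOR.MINOR rule noted above can be checked locally before pushing. This is a minimal sketch based only on the format described in this section (two numbers from 0 to 99). How Apify treats leading zeros such as `01.3` is not stated, so accepting them here is an assumption, and `is_valid_actor_version` is a hypothetical helper.

```python
import re

# One or two ASCII digits, a dot, then one or two ASCII digits.
_VERSION_RE = re.compile(r"[0-9]{1,2}\.[0-9]{1,2}")


def is_valid_actor_version(version: str) -> bool:
    """Return True if `version` looks like MAJOR.MINOR with both parts in 0-99."""
    return _VERSION_RE.fullmatch(version) is not None


for candidate in ("1.3", "0.99", "1.3.1", "100.0"):
    print(candidate, is_valid_actor_version(candidate))
```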

📡 API / CLI Integration

You can run this Actor using the Apify API or CLI. Example with curl:

curl "https://api.apify.com/v2/acts/louvre~gyg-review-scraper/runs?token=<YOUR_API_TOKEN>" \
-X POST \
-H "Content-Type: application/json" \
-d @input.json

Replace <YOUR_API_TOKEN> with your Apify token and input.json with your JSON input containing the fields above.
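
The same request can be prepared from Python using only the standard library. The sketch below mirrors the curl example (same endpoint, token passed as a query parameter) and only builds the request, so it can be inspected without starting a run. `build_run_request` is a hypothetical helper, not part of any Apify SDK.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
ACTOR_ID = "louvre~gyg-review-scraper"  # taken from the curl example above


def build_run_request(token: str, run_input: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request that starts an Actor run."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        url=f"{API_BASE}/acts/{ACTOR_ID}/runs?{query}",
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_run_request(
    "<YOUR_API_TOKEN>",
    {"product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879"},
)
print(request.get_method(), request.full_url)
# To actually start a run, pass the request to urllib.request.urlopen(request).
```

Keeping request construction separate from sending makes it easy to log or test the payload before spending any credits.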