GetYourGuide Review Scraper
Pricing
from $3.00 / 1,000 results
GetYourGuide Review Scraper
The GetYourGuide Review Scraper is a powerful and easy-to-use actor designed to collect structured review data from GetYourGuide product pages. Whether youβre analyzing customer sentiment, monitoring competitor feedback, or enriching your travel content.
Pricing
from $3.00 / 1,000 results
Rating
5.0
(1)
Developer
Louvre LLC
Actor stats
0
Bookmarked
45
Total users
23
Monthly active users
12 days
Issues response
15 days ago
Last modified
Categories
Share
π§³π GetYourGuide Review Scraper
π Introduction
GetYourGuide Review Scraper is a powerful and easy-to-use Apify Actor that collects structured review data from GetYourGuide product pages. It allows you to extract ratings, review text, author information, photos, and other metadata β ideal for sentiment analysis, competitor feedback monitoring, travel content enrichment, and research workflows.
π― What is this Review Scraper?
This Actor automatically scrapes customer reviews from a specified GetYourGuide product URL and outputs them as structured JSON. It supports flexible pagination, reliable de-duplication, and produces data that's ready for analysis or integration into your systems.
π Features
- β Comprehensive Review Data β Extracts ratings, review text, author name & location, photos, and helpful votes.
- π Flexible Pagination β Supports custom starting page and page size.
- π Multiple Sorting Options β Reviews can be processed in different orders (date, rating, etc.).
- π High Performance β Efficiently handles scraping of large volumes of reviews.
- π§Ή Reliable Output β Includes duplicate detection and data validation.
- π Easy Integration β Simple input parameters and structured output.
π§° Input Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
product_url | String | β | The GetYourGuide product page URL to scrape reviews from. |
page_size | Number | β | Maximum number of reviews per page (default: 100). |
page | Number | β | Start page number; will skip (page β 1) * page_size reviews from the beginning (default: 1). |
π₯ Example Input
{"product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879","page_size": 100,"page": 1}
This configuration will scrape up to 100 reviews from the first page of the specified GetYourGuide product.
ποΈ Output Data
Each review returned by this Actor includes the following structured fields:
| Field | Type | Description |
|---|---|---|
id | String | Unique review identifier. |
review_id | String | Original GetYourGuide review ID. |
product_id | String | GetYourGuide product identifier. |
product_url | String | Product URL used for scraping this review. |
rating | Number | Review rating (1β5 stars). |
title | String | Review title. |
content | String | Full review text. |
author_name | String | Reviewer's name. |
user_identifier_value | String | Author user identifier. |
review_date | String | Date when the review was published (ISO 8601 format). |
helpful_votes | Number | Number of helpful votes. |
photos | Array | Array of photo URLs (if any). |
owner_response | String | Business owner response text (if any). |
owner_response_author | String | Response author name (if any). |
owner_response_date | String | Response publication date (ISO 8601 format, if any). |
source | String | Data source identifier (always "gyg"). |
language | String | Review language code. |
overall_star_rating | Number | Overall product star rating (e.g. 4.5 out of 5). -1 if unavailable. |
overall_review_count | Number | Total number of reviews for the product. -1 if unavailable. |
π§ How It Works
- Input Validation β Validates the provided GetYourGuide product URL.
- Review Extraction β Scrapes reviews from the product page using advanced web scraping logic.
- Data Processing β Cleans and processes the extracted review data.
- Duplicate Detection β Identifies and handles duplicates.
- Data Storage β Saves the structured data to the Apify dataset for retrieval.
βοΈ Usage Examples
π¦ Basic Usage
{"product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879"}
π Advanced Usage (Pagination)
{"product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879","page_size": 200,"page": 2}
π° Pricing & Limits
This Actor costs $1.00 per 1,000 results with a free trial available.
π Deploying to Apify
To push this Actor to the Apify platform, use the Apify CLI:
Prerequisites
- Install Apify CLI (if not already installed):
$npm install -g apify-cli
- Login to Apify:
$apify login
Push Command
Navigate to the actor directory and push:
cd gyg-review-scraperapify push
This will:
- Build the Actor code
- Upload it to your Apify account
- Make it available in the Apify platform
Push and Set as Latest Version
To push and automatically set the version as latest:
$apify push -b latest --force
This command will:
- Build and push the Actor
- Set the pushed version as the
latestversion - Make it the default version when running the Actor
Alternative: Push with Specific Actor ID
If you want to push to a specific Actor ID:
$apify push --actor-id <YOUR_ACTOR_ID>
Or push with Actor ID and set as latest:
$apify push --actor-id <YOUR_ACTOR_ID> -b latest --force
Build and Push Separately
You can also build and push separately:
# Build the Actornpm run build# Push to Apifyapify push
Modifying the Version
To change the Actor version before pushing:
- Edit the version in
.actor/actor.json:
{"version": "1.3"}
Note: Apify requires version format to be MAJOR.MINOR (e.g., 1.3), where MAJOR and MINOR are numbers from 0 to 99. Three-part versions like 1.3.1 are not supported.
- Push with the new version:
$apify push
Or push with a specific version directly:
$apify push --version 1.3
Note: The version in actor.json will be used by default. If you specify --version latest, it will set the pushed version as latest regardless of the version in actor.json.
π‘ API / CLI Integration
You can run this Actor using the Apify API or CLI. Example with curl:
curl "https://api.apify.com/v2/acts/louvre~gyg-review-scraper/runs?token=<YOUR_API_TOKEN>" \-X POST \-H "Content-Type: application/json" \-d @input.json
Replace <YOUR_API_TOKEN> with your Apify token and input.json with your JSON input containing the fields above.
π Related Links
- π Actor page: https://apify.com/louvre/gyg-review-scraper
- π Input schema: https://apify.com/louvre/gyg-review-scraper/input-schema
