Truspilot Reviews Scraper avatar
Truspilot Reviews Scraper

Pricing

$10.00/month + usage

Go to Apify Store
Truspilot Reviews Scraper

Truspilot Reviews Scraper

Easily filter and extract thousands of reviews with data such as title, description, score, reviewer, country, company response, and much more from companies on Trustpilot.com and download them to multiple file formats.

Pricing

$10.00/month + usage

Rating

0.0

(0)

Developer

unli

unli

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

0

Monthly active users

a month ago

Last modified

Share

Trustpilot Reviews Scraper

This Apify actor scrapes reviews from Trustpilot company pages with automatic pagination support. It extracts comprehensive data from each review including ratings, review text, author information, dates, and company details.

Features

  • ✅ Automatic pagination through all review pages
  • ✅ Extracts comprehensive review data from JSON-LD structured data
  • ✅ Respects rate limits with built-in delays
  • ✅ Handles errors gracefully
  • ✅ Configurable page limits and start page
  • ✅ Detailed logging and statistics

Input

The actor accepts the following input parameters:

FieldTypeRequiredDescription
trustpilotUrlStringYesThe URL of the Trustpilot company page (e.g., https://www.trustpilot.com/review/www.amazon.com)
maxPagesIntegerNoMaximum number of pages to scrape. Leave empty to scrape all pages.
startPageIntegerNoPage number to start scraping from (default: 1)

Example Input

{
"trustpilotUrl": "https://www.trustpilot.com/review/www.amazon.com",
"maxPages": 10,
"startPage": 1
}

Output

The actor outputs a dataset where each item represents a single review with the following structure:

{
"reviewId": "123abc...",
"author": {
"name": "John Doe",
"profileUrl": "https://www.trustpilot.com/users/..."
},
"rating": 5,
"title": "Great service!",
"text": "I had an amazing experience...",
"date": "2024-01-15T10:30:00Z",
"language": "en",
"company": {
"name": "Amazon",
"url": "https://www.amazon.com",
"aggregateRating": {
"rating": 4.5,
"reviewCount": 150000,
"bestRating": 5,
"worstRating": 1
}
},
"source": "trustpilot",
"pageUrl": "https://www.trustpilot.com/review/www.amazon.com?page=1",
"extractedAt": "2024-01-20T15:45:00.000Z"
}

Output Fields

  • reviewId: Unique identifier for the review
  • author.name: Name of the review author
  • author.profileUrl: Link to the author's Trustpilot profile
  • rating: Star rating (1-5)
  • title: Review headline/title
  • text: Full review text
  • date: Date when the review was published (ISO format)
  • language: Language code of the review
  • company.name: Name of the company being reviewed
  • company.url: Company's website URL
  • company.aggregateRating: Overall rating statistics for the company
  • source: Always "trustpilot"
  • pageUrl: URL of the page where the review was found
  • extractedAt: Timestamp of when the data was extracted

How It Works

  1. The actor starts by fetching the first page (or the specified startPage)
  2. It extracts all reviews from the JSON-LD structured data on the page
  3. Reviews are saved to the dataset
  4. If more reviews are found, it moves to the next page
  5. Process continues until:
    • No more reviews are found
    • The maxPages limit is reached
    • An error occurs (after the first page)

Usage Tips

Scraping All Reviews

To scrape all available reviews, simply provide the Trustpilot URL:

{
"trustpilotUrl": "https://www.trustpilot.com/review/www.example.com"
}

Limiting Pages

To scrape only the first 5 pages:

{
"trustpilotUrl": "https://www.trustpilot.com/review/www.example.com",
"maxPages": 5
}

Starting from a Specific Page

To resume scraping from page 10:

{
"trustpilotUrl": "https://www.trustpilot.com/review/www.example.com",
"startPage": 10
}

Rate Limiting

The actor includes a 1-2 second delay between page requests to be respectful to Trustpilot's servers. This helps prevent your requests from being blocked.

Error Handling

  • If the first page fails to load, the actor will throw an error
  • If subsequent pages fail, the actor will log the error and stop pagination gracefully
  • All successfully scraped reviews are saved before any error occurs

Limitations

  • The actor extracts data from publicly available JSON-LD structured data
  • Only reviews visible on the public pages are scraped
  • The actor respects Trustpilot's robots.txt and rate limits

Example Use Cases

  1. Competitor Analysis: Monitor competitor reviews and ratings
  2. Brand Monitoring: Track your company's reputation over time
  3. Market Research: Analyze customer sentiment across multiple companies
  4. Data Analysis: Export reviews for sentiment analysis and NLP projects

Support

If you encounter any issues or have questions, please open an issue on the actor's GitHub repository or contact the developer.

This actor is provided for educational and research purposes. Please respect Trustpilot's Terms of Service and use this tool responsibly. Ensure you have the right to scrape and use the data according to applicable laws and regulations.