Big rewrite. Now pulls data from __NEXT_DATA__ instead of scraping HTML, which gives way more fields and is less likely to break.
New fields added:
reviewId -- Trustpilot's internal review ID
reviewUrl -- direct link to the review
publishedDate / experienceDate / updatedDate -- three separate dates instead of one
authorCountry -- ISO country code (GB, US, DE, etc.)
authorReviewCount -- how many reviews that person has written
authorImageUrl -- profile picture URL
verificationLevel -- not just true/false, now shows the actual level
source -- Organic vs Invitation (huge for sentiment analysis)
language -- review language code
likesCount -- upvotes on the review
Business info improvements:
- Now extracts
websiteUrl, phone, email, address when available
- Gets data from
__NEXT_DATA__ first, falls back to JSON-LD
Under the hood:
- 3-tier extraction:
__NEXT_DATA__ > JSON-LD > HTML parsing
- 25 fields per review (up from 14)
- Field names cleaned up to match industry standard (e.g.
title instead of reviewTitle)
First release. Scrapes reviews, ratings, author info, and company replies. Supports star filtering, sorting, and pagination.