Review Aggregator & Sentiment Dataset Builder avatar

Review Aggregator & Sentiment Dataset Builder

Pricing

Pay per usage

Go to Apify Store
Review Aggregator & Sentiment Dataset Builder

Review Aggregator & Sentiment Dataset Builder

Scrape product reviews from public pages and build AI-ready sentiment datasets. Extract themes, pros/cons, complaint categories, and ratings.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Prince Raj

Prince Raj

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Aggregate public reviews and build AI-ready sentiment datasets. Built for product teams, local businesses, and agencies who need to analyze customer feedback at scale.

What It Extracts

  • Review data: Text, rating, date, author display name, review URL
  • Sentiment classification: Positive, negative, or neutral
  • Keyword themes: Top themes and topics from review content
  • Platform detection: Auto-detect review platform (Trustpilot, G2, Yelp, Google)
  • Review statistics: Average rating, sentiment distribution, review count

Why It's Better

Combines review extraction with sentiment analysis and theme extraction. Produces structured JSON output ready for AI training, dashboards, or research. Supports multiple review platforms.

Input

FieldTypeDefaultDescription
reviewPageUrlsstring[]requiredURLs of review pages to scrape
maxReviewsinteger100Maximum reviews to extract (1-1000)
platformstring"auto"Platform: auto, trustpilot, g2, yelp, google
includeKeywordThemesbooleantrueExtract keyword themes from reviews

Output Example

{
"sourceUrl": "https://www.trustpilot.com/review/example.com",
"platform": "trustpilot",
"totalReviews": 245,
"averageRating": 4.2,
"reviews": [
{
"text": "Great product, really helped our team...",
"rating": 5,
"date": "2025-01-10",
"author": "John D.",
"sentiment": "positive"
}
],
"sentimentDistribution": {
"positive": 180,
"neutral": 40,
"negative": 25
},
"keywordThemes": [
{ "theme": "customer support", "count": 45 },
{ "theme": "ease of use", "count": 38 },
{ "theme": "pricing", "count": 32 }
],
"scrapedAt": "2025-01-15T10:30:00Z"
}

Use Cases

  • Product research: Analyze customer sentiment across review platforms
  • Competitive analysis: Compare review sentiment with competitors
  • Customer feedback analysis: Identify common themes and pain points
  • AI training data: Build sentiment datasets for ML models
  • Reputation monitoring: Track review sentiment over time

PPE Pricing

EventDescriptionSuggested Price
review-extractedOne review extracted$0.002
review-analyzedReview with sentiment analysis$0.003

Limitations

  • Only extracts publicly visible reviews
  • Sentiment classification is keyword-based (not ML-powered)
  • Some platforms may block automated scraping
  • Author names may be anonymized on some platforms
  • Review dates may not always be available

Legal/Ethical Use

This actor only extracts publicly visible reviews. It does not access private data, bypass authentication, or store unnecessary personal information. Users are responsible for compliance with platform terms of service and applicable data protection laws.

Local Run

cd actors/review-sentiment-dataset-builder
apify run --input-file .actor/sample_input.json

Deploy

cd actors/review-sentiment-dataset-builder
apify push

FAQ

Q: Which review platforms are supported? A: Trustpilot, G2, Yelp, and Google Reviews. Auto-detection works for most standard review page formats.

Q: How accurate is sentiment analysis? A: It uses keyword-based classification, which works well for clearly positive/negative reviews. Nuanced sentiment may need manual review.

Q: Can I extract reviews from multiple pages? A: Yes, provide multiple URLs. Each URL is processed independently.

Tags

review scraper, sentiment analysis, customer feedback, review aggregator, Trustpilot scraper, G2 reviews, Yelp reviews, product reviews, AI training data