Noon Reviews Scraper avatar

Noon Reviews Scraper

Pricing

Pay per usage

Go to Apify Store
Noon Reviews Scraper

Noon Reviews Scraper

Harvest Noon product reviews and ratings in bulk. Extract customer feedback, detailed insights & seller performance data. Perfect for market research, competitive analysis, pricing optimization & consumer behavior intelligence.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Scrape comprehensive customer reviews and ratings from Noon.com products. Perfect for market research, competitor analysis, and sentiment evaluation.

Features

  • Extract Comprehensive Details: Gathers reviewer name, exact star rating, absolute date, review text, and verifying purchase status.
  • Richer Metadata: Automatically extracts helpful vote counts, multi-language translated reviews, user-uploaded review images, and the exact product variant (Color, Storage, etc.) the customer purchased.
  • Handling Anti-Bot Measures: Bypasses basic bot protections automatically, ensuring reliable data extraction.
  • Versatile Input: Accepts either a direct productId or a direct URL to the reviews page. Can also configure regional storefronts and sorting methods.
  • Limit Controls: Configure the exact maximum number of reviews (results_wanted) you wish to extract.

Use Cases

  • E-commerce Analytics: Understand customer satisfaction and pain points for various products and exact variants.
  • Competitor Analysis: Scrape reviews of competing products to identify market advantages.
  • Sentiment Analysis: Feed extracted review text (or their English translations) into AI models to gauge overall customer feeling.
  • Product Research: Identify common defects or highly praised features for R&D purposes.

Input Parameters

The scraper accepts the following parameters via JSON:

FieldTypeDescription
productIdStringThe Noon product ID (e.g. N70105592V).
startUrlStringAlternative to productId. The full Noon product URL.
results_wantedIntegerThe maximum number of reviews to extract (Default: 20).
sortFilterStringHow to sort the reviews (helpful, newest, highest_rating, lowest_rating).
localeStringThe regional storefront language code (e.g., en-ae, ar-sa).
proxyConfigurationObjectApify proxy settings. Residential proxies are heavily recommended.

Output Data

Data is stored in the Apify dataset in JSON format containing the following fields:

FieldTypeDescription
productIdStringThe overall ID of the targeted product.
variantSkuStringThe specific variant SKU that the user purchased and reviewed.
titleStringThe title or headline of the review.
authorStringThe display name of the reviewer.
ratingNumberThe star rating given out of 5.
reviewTextStringThe descriptive body of the review.
titleTranslationStringA machine English translation of the review title (if applicable).
reviewTextTranslationStringA machine English translation of the review body (if applicable).
dateStringWhen the review was last updated.
createdAtStringThe exact timestamp of when the review was created.
helpfulCountNumberThe number of times this review was voted as helpful.
verifiedPurchaseBooleanTrue if the purchase was verified by Noon.
imageUrlsArrayA list of URLs pointing to images uploaded by the reviewer.
variantArrayCharacteristics of the purchased variant (e.g., Color, Memory).

Usage Examples

Here is an example of the INPUT.json configuration:

{
"productId": "N70105592V",
"results_wanted": 20,
"sortFilter": "helpful",
"locale": "en-ae",
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

Sample Output

{
"productId": "N70105592V",
"variantSku": "N70105592V",
"author": "Ahmed K.",
"title": "Excellent quality for the price",
"rating": 5,
"reviewText": "I genuinely loved the design and the battery life is amazing.",
"titleTranslation": "Excellent quality for the price",
"reviewTextTranslation": "I genuinely loved the design and the battery life is amazing.",
"date": "2023-10-24T12:00:00.000Z",
"createdAt": "2023-10-23T10:00:00.000Z",
"helpfulCount": 12,
"verifiedPurchase": true,
"imageUrls": [
"https://f.nooncdn.com/reviews/image1.jpg"
],
"variant": [
{ "name": "Color", "value": "Titanium Black" },
{ "name": "Memory", "value": "256GB" }
]
}

Tips

  • Proxies: Noon.com actively blocks datacenter IP addresses. Please ensure you are utilizing Residential Proxies within your proxyConfiguration.
  • Targeting: Provide only the productId if you want a reliable fallback.

FAQ

Does this scraper run in the background? Yes, it operates fully in the background automatically paginating through the review sections.

Can I scrape thousands of reviews at once? Yes, but you will be reliant on your proxy connection remaining intact. Setting a generous timeout and maximum retry limit might be beneficial.

Data collected by this Scraper is publicly accessible. You are responsible for ensuring that your use of the Scraped Data complies with all applicable local and international laws, including data privacy requirements.