Developed by HappiTap

Maintained by Community

Pricing

Pay per event

Last modified

2 days ago

Meesho Product Scraper

An Apify actor that extracts product details from Meesho.com product pages, category listings, and search results with comprehensive structured data including pricing, discounts, ratings, seller information, and more.

What It Does

For each product it finds on a Meesho page, the scraper extracts the following fields:

| Field | Description |
|---|---|
| title | Product name/title |
| productId | Meesho product identifier |
| price | Current product price (₹) |
| originalPrice | Original price before discount (₹) |
| discount | Discount percentage or amount |
| rating | Average user rating (e.g., 4.5) |
| reviewCount | Total number of user reviews |
| availability | In stock / out of stock status |
| seller | Sold by (seller name) |
| category | Main category or breadcrumb |
| productUrl | Direct product URL on Meesho |
| imageUrl | Main product image URL |
| images | Array of all product images |
| description | Product description |
| specifications | Product specifications and attributes |
| variants | Available product variants (size, color, etc.) |
| source | Source website (meesho.com) |

Supported Page Types

  • Product Pages: Individual product detail pages (e.g., /p/12345)
  • Category Listings: Category browsing pages (e.g., /pl/3j3)
  • Search Results: Search query results (e.g., /search?q=dresses)
  • Collection Pages: Curated product collections

Use Cases

  • Indian E-commerce Research: Extract detailed product information from India's leading social commerce platform
  • Price Monitoring: Track product prices, discounts, and availability across categories
  • Fashion & Lifestyle Analysis: Monitor trending products in fashion, home, and lifestyle categories
  • Competitor Analysis: Monitor competitor products and pricing strategies on Meesho
  • Inventory Tracking: Check product availability and seller information
  • Social Commerce Data Collection: Gather comprehensive product catalogs from Meesho's marketplace

Input

The actor accepts the following input format:

{
    "startUrls": [
        { "url": "https://www.meesho.com/search?q=dresses" }
    ],
    "maxItems": 50
}

Input Parameters

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| startUrls | Array | Yes | - | Array of objects with a url property pointing to Meesho pages |
| maxItems | Number | No | 50 | Maximum number of products to extract per page |
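
The parameter rules above can be checked before a run starts. The following is a minimal sketch, not the actor's actual validation code: `normalizeInput` and `DEFAULT_MAX_ITEMS` are hypothetical names, and the restriction to `meesho.com` hostnames is an assumption based on the parameter description.

```javascript
// Hypothetical helper: validates actor input and applies the documented default.
const DEFAULT_MAX_ITEMS = 50; // default taken from the parameter table

function normalizeInput(input) {
    const { startUrls, maxItems = DEFAULT_MAX_ITEMS } = input ?? {};

    if (!Array.isArray(startUrls) || startUrls.length === 0) {
        throw new Error('startUrls must be a non-empty array of { url } objects');
    }

    const urls = startUrls.map((entry, index) => {
        let parsed;
        try {
            parsed = new URL(entry?.url);
        } catch {
            throw new Error(`startUrls[${index}].url is not a valid URL`);
        }
        // Assumption: only meesho.com and its subdomains are accepted.
        const host = parsed.hostname;
        if (host !== 'meesho.com' && !host.endsWith('.meesho.com')) {
            throw new Error(`startUrls[${index}].url is not a Meesho URL`);
        }
        return parsed.href;
    });

    if (!Number.isInteger(maxItems) || maxItems < 1) {
        throw new Error('maxItems must be a positive integer');
    }

    return { urls, maxItems };
}

// maxItems is omitted here, so the default of 50 is applied.
const normalized = normalizeInput({
    startUrls: [{ url: 'https://www.meesho.com/search?q=dresses' }],
});
console.log(normalized.maxItems); // 50
```

Failing early on a malformed or non-Meesho URL avoids spending a browser session on a page the scraper cannot handle.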

Supported Meesho URLs

The scraper works with various Meesho page types:

Product Pages:

  • https://www.meesho.com/product-name/p/12345
  • https://www.meesho.com/another-product/p/67890

Category Listings:

  • https://www.meesho.com/dresses-women/pl/3j3
  • https://www.meesho.com/sarees/pl/1k8
  • https://www.meesho.com/kurtis/pl/2a7

Search Results:

  • https://www.meesho.com/search?q=dresses
  • https://www.meesho.com/search?q=ethnic+wear

Output

The actor outputs structured data for each product found:

{
    "title": "Women's Cotton Kurti with Palazzo Set",
    "productId": "12345",
    "price": "₹599",
    "originalPrice": "₹1299",
    "discount": "54% off",
    "rating": 4.2,
    "reviewCount": 1250,
    "availability": "In Stock",
    "seller": "Fashion Store",
    "category": "Women > Ethnic Wear > Kurtis",
    "productUrl": "https://www.meesho.com/womens-cotton-kurti/p/12345",
    "imageUrl": "https://images.meesho.com/images/products/12345/1_512.jpg",
    "images": [
        "https://images.meesho.com/images/products/12345/1_512.jpg",
        "https://images.meesho.com/images/products/12345/2_512.jpg"
    ],
    "description": "Comfortable cotton kurti with palazzo set perfect for daily wear",
    "specifications": {
        "Material": "Cotton",
        "Sleeve": "3/4 Sleeve",
        "Pattern": "Printed"
    },
    "variants": ["S", "M", "L", "XL"],
    "source": "meesho.com",
    "scrapedAt": "2024-01-01T00:00:00.000Z"
}
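
In the sample above, `price` and `originalPrice` are strings that include the currency symbol, so they need converting before any arithmetic. The sketch below shows one way a consumer of the dataset might do that and recompute the discount. `parsePrice` and `discountPercent` are hypothetical helpers and not part of the actor. The accepted formats (₹, Rs, INR, comma thousands separators) are an assumption based on the formats named in this README.

```javascript
// Hypothetical consumer-side helpers for working with scraped items.

// Converts strings such as "₹599", "Rs. 1,299" or "INR 599.00" to a number.
// Returns null when no numeric value can be read.
function parsePrice(text) {
    const cleaned = String(text).replace(/₹|Rs\.?|INR|,|\s/gi, '');
    const value = Number.parseFloat(cleaned);
    return Number.isFinite(value) ? value : null;
}

// Recomputes the discount as a whole percentage of the original price.
function discountPercent(item) {
    const price = parsePrice(item.price);
    const original = parsePrice(item.originalPrice);
    if (price === null || original === null || original <= 0) {
        return null;
    }
    return Math.round((1 - price / original) * 100);
}

const sample = { price: '₹599', originalPrice: '₹1299' };
console.log(parsePrice(sample.price)); // 599
console.log(discountPercent(sample)); // 54
```

For the sample item, 1 - 599/1299 ≈ 0.539, which rounds to 54 and agrees with the `"54% off"` value in the `discount` field.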

Example Usage

Extract Women's Dresses Category

{
    "startUrls": [
        { "url": "https://www.meesho.com/dresses-women/pl/3j3" }
    ],
    "maxItems": 50
}

Extract from Product Page

{
    "startUrls": [
        { "url": "https://www.meesho.com/womens-cotton-kurti/p/12345" }
    ],
    "maxItems": 1
}

Multiple Sources

{
    "startUrls": [
        { "url": "https://www.meesho.com/dresses-women/pl/3j3" },
        { "url": "https://www.meesho.com/sarees/pl/1k8" }
    ],
    "maxItems": 25
}

How It Works

  1. Page Type Detection: Automatically detects whether the URL is a product page, category listing, or search results
  2. Appropriate Handler: Routes to the correct scraping handler based on page type
  3. Data Extraction: Uses specialized selectors for each page type to extract product information
  4. Comprehensive Fields: Extracts all required fields including pricing, discounts, availability and seller information
  5. Data Validation: Ensures only products with valid titles and prices are included
  6. Structured Output: Returns clean, structured data ready for analysis
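
Steps 1 and 2 can be illustrated with a small URL classifier. This is a sketch under stated assumptions, not the code in src/routes.js: `detectPageType` and `route` are hypothetical names, and the path patterns are inferred only from the example URLs in this README (`/p/<id>`, `/pl/<id>`, `/search`). No URL pattern is given for collection pages, so they fall through to `'unknown'` here.

```javascript
// Hypothetical page type detection based on the URL path alone.
function detectPageType(rawUrl) {
    const { pathname } = new URL(rawUrl);

    if (/\/p\/[^/]+\/?$/.test(pathname)) return 'product'; // e.g. /name/p/12345
    if (/\/pl\/[^/]+\/?$/.test(pathname)) return 'category'; // e.g. /name/pl/3j3
    if (pathname === '/search' || pathname.startsWith('/search/')) return 'search';
    return 'unknown';
}

// Hypothetical router: picks a handler by page type and calls it with the URL.
function route(rawUrl, handlers) {
    const type = detectPageType(rawUrl);
    const handler = handlers[type];
    if (!handler) {
        throw new Error(`No handler for page type "${type}": ${rawUrl}`);
    }
    return handler(rawUrl);
}

console.log(detectPageType('https://www.meesho.com/product-name/p/12345')); // product
console.log(detectPageType('https://www.meesho.com/dresses-women/pl/3j3')); // category
console.log(detectPageType('https://www.meesho.com/search?q=dresses')); // search
```

Classifying on the path before loading the page lets each handler use selectors written for a single layout.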

Features

  • Multi-Page Support: Handles product pages, category listings, and search results
  • Robust Extraction: 40+ fallback selectors to handle Meesho's dynamic page structure
  • Indian E-commerce Focus: Specialized for Meesho's social commerce platform
  • Enhanced Price Extraction: Supports Indian currency formats (₹, Rs, INR) with discount tracking
  • Stealth Mode: Uses Puppeteer with anti-bot evasion techniques
  • Proxy Support: Built-in proxy rotation for reliable scraping
  • Error Handling: Graceful error handling with detailed logging
  • Data Validation: Ensures data quality with validation checks
  • Comprehensive Product Data: Extracts images, variants, specifications, and descriptions
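
The fallback-selector idea from the Robust Extraction item can be sketched as a loop that tries candidates in order and keeps the first non-empty result. Everything named here is illustrative: `extractWithFallback` is a hypothetical helper, the selectors are invented and not Meesho's real markup, and the lookup is kept synchronous and injected as a function so the example runs without a browser. Inside a page the lookup would be a DOM query such as `document.querySelector(selector)?.textContent`.

```javascript
// Hypothetical helper: returns the first non-empty text found, plus the
// selector that produced it, or null when every candidate fails.
function extractWithFallback(selectors, queryText) {
    for (const selector of selectors) {
        const text = queryText(selector);
        if (typeof text === 'string' && text.trim() !== '') {
            return { value: text.trim(), selector };
        }
    }
    return null;
}

// Stand-in for a rendered page: maps selectors to their text content.
const fakePage = new Map([
    ['h1', '  Women\'s Cotton Kurti with Palazzo Set  '],
    ['[data-testid="price"]', ''],
    ['.price', '₹599'],
]);
const lookup = (selector) => fakePage.get(selector) ?? null;

// Illustrative selector lists, ordered from most to least specific.
const title = extractWithFallback(['[data-testid="title"]', 'h1'], lookup);
const price = extractWithFallback(['[data-testid="price"]', '.price'], lookup);

console.log(title.value); // Women's Cotton Kurti with Palazzo Set
console.log(price.selector); // .price
```

Returning the matching selector alongside the value makes it easier to log which fallback was used when the page structure changes.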

Installation

  1. Clone this repository
  2. Install dependencies: npm install
  3. Run the actor: npm start

Development

  • npm start - Run the actor
  • npm run format - Format code with Prettier
  • npm run lint - Run ESLint
  • npm run lint:fix - Fix ESLint issues

Architecture

  • src/main.js - Main entry point and input validation
  • src/routes.js - Request routing and page type detection
  • src/handlers/amazonProductPage.js - Individual product page scraping logic
  • src/handlers/amazonSearchResults.js - Search results and category page scraping logic
  • src/puppeteerLauncher.js - Puppeteer browser configuration with stealth mode

Notes

  • The scraper is designed to be respectful of Meesho's servers and includes appropriate delays
  • Results may vary when Meesho changes its page structure
  • The scraper automatically handles different Meesho page layouts and product formats
  • All extracted data is timestamped for tracking purposes
  • Product pages return single items, while search/category pages return multiple products