Olx Product Scraper avatar

Olx Product Scraper

Pricing

$4.99/month + usage

Go to Apify Store
Olx Product Scraper

Olx Product Scraper

This Apify actor scrapes classified ads from OLX India, extracting detailed listings for motorcycles, cars, electronics, and real estate by visiting each product’s detail page and collecting structured data.

Pricing

$4.99/month + usage

Rating

0.0

(0)

Developer

ScrapeAI

ScrapeAI

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

7 days ago

Last modified

Share

🏍️ OLX Product Listings Scraper

This Apify actor scrapes classified ads data from OLX India, one of India's largest platforms for buying and selling used goods. Scrape motorcycles, cars, electronics, real estate, and other categories to collect comprehensive listing information including title, description, price, images, location, seller details, and product parameters. Perfect for market research, price comparison, and classified ads analytics.

βœ… Use Cases

  • πŸ“Š Conduct market research on used goods and classified ads
  • πŸ’° Monitor pricing trends across different categories
  • 🏷️ Track product availability and seller patterns
  • πŸ“ˆ Build price comparison and market analysis datasets
  • 🎯 Analyze geographic distribution of listings
  • πŸ” Research specific product categories or brands

πŸ“₯ Input Configuration

You can customize the actor using the following input fields:

{
"startUrls": [
{
"url": "https://www.olx.in/kerala_g2001160/motorcycles_c81"
}
],
"maxItems": 50,
"maxPages": 5
}

🧾 Fields Explained

FieldTypeDescription
startUrlsarrayArray of OLX category or search URLs to scrape (e.g., motorcycles, cars, electronics)
maxItemsnumberMaximum number of listings to scrape (0 = unlimited, default: 50)
maxPagesnumberMaximum number of pages to scrape per start URL (1-100, default: 5)

πŸ”— Example Start URLs

CategoryURL
Motorcycles (Kerala)https://www.olx.in/kerala_g2001160/motorcycles_c81
Cars (Mumbai)https://www.olx.in/mumbai_g4058877/cars_c84
Electronicshttps://www.olx.in/electronics-appliances_g5
Real Estatehttps://www.olx.in/properties_g5

πŸ“€ Output

The actor returns a dataset containing structured listing records. Each record includes comprehensive details extracted from the product detail page.

🧩 Sample Output

{
"id": "1733271487",
"product_url": "https://www.olx.in/item/bullet-350cc-excellent-condition-exchange-available-iid-1733271487",
"title": "Bullet 350cc excellent condition / exchange available",
"description": "Restored With all new parts, showroom condition",
"price": "β‚Ή 1,40,000", "price_raw": 140000,
"category": "Motorcycles",
"location": "Fort Kochi, Kerala",
"posted_date": "2 days ago",
"seller_name": "Ramesh Kumar",
"seller_type": "Individual",
"mobile_number": "+91XXXXXXXXXX",
"images": [
"https://apollo-singapore.akamaized.net:443/v1/files/48z6jxi9i76z2-IN/image",
"https://apollo-singapore.akamaized.net:443/v1/files/5dtb7z3tivqt-IN/image"
],
"mainImage": "https://apollo-singapore.akamaized.net:443/v1/files/48z6jxi9i76z2-IN/image",
"status": "active",
"parameters": [
{
"key_name": "Brand",
"value": "Royal Enfield"
},
{
"key_name": "Model",
"value": "Bullet"
},
{
"key_name": "Year",
"value": "1991"
},
{
"key_name": "KM driven",
"value": "2,000 km"
}
]
}

πŸ“‹ Output Fields

FieldTypeDescription
idstringUnique listing identifier
product_urlstringDirect listing page URL
titlestringListing title
descriptionstringDetailed listing description
pricestringFormatted price with currency symbol
price_rawnumberNumeric price value
categorystringProduct category
locationstringListing location
posted_datestringDate when listing was posted
seller_namestringName of the seller
seller_typestringType of seller (Individual, Dealer, etc.)
mobile_numberstringSeller's mobile number (if available)
imagesarrayArray of all product image URLs
mainImagestringPrimary product image URL
statusstringListing status (active, sold, etc.)
parametersarrayProduct-specific parameters (brand, model, year, etc.)

πŸ” Data Source

  • βœ“ Visits each product detail page individually
  • βœ“ Extracts data from visible page content (HTML selectors only)
  • βœ“ No API calls - pure browser automation
  • βœ“ Supports multiple categories (motorcycles, cars, electronics, etc.)
  • βœ“ Comprehensive listing information extraction
  • βœ“ Extracts seller information and contact details
  • βœ“ Captures all product images
  • βœ“ Geographic and temporal data included

πŸ”’ Proxy Configuration

This actor uses Playwright with stealth settings. For production use, configure proxy:

{
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"apifyProxyCountry": "IN"
}

πŸš€ How to Use

  1. Open the actor in Apify Console
  2. Click "Try actor" or create a new task
  3. Enter your start URLs (e.g., "https://www.olx.in/kerala_g2001160/motorcycles_c81")
  4. Configure maxItems and maxPages settings
  5. Run the actor
  6. Download results in JSON, CSV, or Excel format

βš™οΈ Advanced Input Example

{
"startUrls": [
{
"url": "https://www.olx.in/kerala_g2001160/motorcycles_c81"
},
{
"url": "https://www.olx.in/mumbai_g4058877/cars_c84"
}
],
"maxItems": 100,
"maxPages": 10
}

πŸ› οΈ Tech Stack

  • Apify SDK β€” actor orchestration and data handling
  • Playwright β€” browser automation with stealth capabilities
  • Node.js 18+ β€” runtime environment

πŸ“Š Data Processing Features

  • βœ“ Browser-based scraping to extract listing data
  • βœ“ Stealth mode with anti-detection measures
  • βœ“ Automatic pagination handling
  • βœ“ Structured data parsing and normalization
  • βœ“ Comprehensive logging and error handling
  • βœ“ Support for multiple categories and locations

πŸ”§ Local Development

Installation

$npm install

Running the Actor

$npm start

Validation

Validate all generated datasets against the schema:

$node validate-datasets.js

Code Quality

Format code:

$npm run format

Check linting:

$npm run lint

🀝 Support

For issues, questions, or feature requests, please refer to the Apify documentation or contact support.