Olx Product Scraper
Pricing
$4.99/month + usage
This Apify actor scrapes classified ads from OLX India, extracting detailed listings for motorcycles, cars, electronics, and real estate by visiting each product's detail page and collecting structured data.
Developer

ScrapeAI
OLX Product Listings Scraper
This Apify actor scrapes classified ads data from OLX India, one of India's largest platforms for buying and selling used goods. Scrape motorcycles, cars, electronics, real estate, and other categories to collect comprehensive listing information including title, description, price, images, location, seller details, and product parameters. Perfect for market research, price comparison, and classified ads analytics.
Use Cases
- Conduct market research on used goods and classified ads
- Monitor pricing trends across different categories
- Track product availability and seller patterns
- Build price comparison and market analysis datasets
- Analyze geographic distribution of listings
- Research specific product categories or brands
Input Configuration
You can customize the actor using the following input fields:
{
  "startUrls": [
    { "url": "https://www.olx.in/kerala_g2001160/motorcycles_c81" }
  ],
  "maxItems": 50,
  "maxPages": 5
}
Fields Explained
| Field | Type | Description |
|---|---|---|
| startUrls | array | Array of OLX category or search URLs to scrape (e.g., motorcycles, cars, electronics) |
| maxItems | number | Maximum number of listings to scrape (0 = unlimited, default: 50) |
| maxPages | number | Maximum number of pages to scrape per start URL (1-100, default: 5) |
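The defaults and limits in the table can be illustrated with a small normalization step. This is only a sketch of how such input might be handled: `normalizeInput` is a hypothetical helper, not a function from the actor's code, and rejecting negative `maxItems` values is an assumption.

```javascript
// Hypothetical helper: not part of the actor's published code.
// Applies the defaults and limits listed in the Fields Explained table.
function normalizeInput(input = {}) {
  // Accept both { url } objects and plain strings, keeping only OLX India URLs.
  const startUrls = (input.startUrls ?? [])
    .map((entry) => (typeof entry === 'string' ? entry : entry?.url))
    .filter((url) => typeof url === 'string' && url.startsWith('https://www.olx.in/'));

  if (startUrls.length === 0) {
    throw new Error('startUrls must contain at least one https://www.olx.in/ URL');
  }

  // maxItems: 0 means unlimited, default is 50.
  const rawMaxItems = Number.isInteger(input.maxItems) ? input.maxItems : 50;
  if (rawMaxItems < 0) {
    throw new RangeError('maxItems must be 0 (unlimited) or a positive integer');
  }
  const maxItems = rawMaxItems === 0 ? Infinity : rawMaxItems;

  // maxPages: clamp to the documented 1-100 range, default is 5.
  const rawMaxPages = Number.isInteger(input.maxPages) ? input.maxPages : 5;
  const maxPages = Math.min(100, Math.max(1, rawMaxPages));

  return { startUrls, maxItems, maxPages };
}

const normalized = normalizeInput({
  startUrls: [{ url: 'https://www.olx.in/kerala_g2001160/motorcycles_c81' }],
  maxItems: 0,
  maxPages: 250,
});
console.log(normalized);
```

Here `maxItems: 0` is mapped to `Infinity` and the out-of-range `maxPages` is clamped to 100 rather than rejected; the actor itself may treat invalid values differently.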
Example Start URLs
| Category | URL |
|---|---|
| Motorcycles (Kerala) | https://www.olx.in/kerala_g2001160/motorcycles_c81 |
| Cars (Mumbai) | https://www.olx.in/mumbai_g4058877/cars_c84 |
| Electronics | https://www.olx.in/electronics-appliances_g5 |
| Real Estate | https://www.olx.in/properties_g5 |
Output
The actor returns a dataset containing structured listing records. Each record includes comprehensive details extracted from the product detail page.
Sample Output
{
  "id": "1733271487",
  "product_url": "https://www.olx.in/item/bullet-350cc-excellent-condition-exchange-available-iid-1733271487",
  "title": "Bullet 350cc excellent condition / exchange available",
  "description": "Restored With all new parts, showroom condition",
  "price": "₹ 1,40,000",
  "price_raw": 140000,
  "category": "Motorcycles",
  "location": "Fort Kochi, Kerala",
  "posted_date": "2 days ago",
  "seller_name": "Ramesh Kumar",
  "seller_type": "Individual",
  "mobile_number": "+91XXXXXXXXXX",
  "images": [
    "https://apollo-singapore.akamaized.net:443/v1/files/48z6jxi9i76z2-IN/image",
    "https://apollo-singapore.akamaized.net:443/v1/files/5dtb7z3tivqt-IN/image"
  ],
  "mainImage": "https://apollo-singapore.akamaized.net:443/v1/files/48z6jxi9i76z2-IN/image",
  "status": "active",
  "parameters": [
    { "key_name": "Brand", "value": "Royal Enfield" },
    { "key_name": "Model", "value": "Bullet" },
    { "key_name": "Year", "value": "1991" },
    { "key_name": "KM driven", "value": "2,000 km" }
  ]
}
Output Fields
| Field | Type | Description |
|---|---|---|
| id | string | Unique listing identifier |
| product_url | string | Direct listing page URL |
| title | string | Listing title |
| description | string | Detailed listing description |
| price | string | Formatted price with currency symbol |
| price_raw | number | Numeric price value |
| category | string | Product category |
| location | string | Listing location |
| posted_date | string | Date when listing was posted |
| seller_name | string | Name of the seller |
| seller_type | string | Type of seller (Individual, Dealer, etc.) |
| mobile_number | string | Seller's mobile number (if available) |
| images | array | Array of all product image URLs |
| mainImage | string | Primary product image URL |
| status | string | Listing status (active, sold, etc.) |
| parameters | array | Product-specific parameters (brand, model, year, etc.) |
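When working with downloaded records, the `parameters` array and the formatted `price` string are often easier to use after a little reshaping. The sketch below shows one way to do that; `parametersToObject` and `parsePrice` are hypothetical helpers written for this example, and `parsePrice` assumes whole-rupee prices with no decimal part.

```javascript
// Hypothetical post-processing helpers for records shaped like the sample output.

// Turns [{ key_name, value }, ...] into a plain lookup object.
function parametersToObject(parameters = []) {
  return Object.fromEntries(parameters.map(({ key_name, value }) => [key_name, value]));
}

// Parses a formatted price string such as "₹ 1,40,000" into a number.
// Assumes whole-rupee amounts; returns null when the string has no digits.
function parsePrice(price) {
  const digits = String(price ?? '').replace(/[^\d]/g, '');
  return digits === '' ? null : Number(digits);
}

// A trimmed-down record using values from the sample output.
const record = {
  price: '₹ 1,40,000',
  price_raw: 140000,
  parameters: [
    { key_name: 'Brand', value: 'Royal Enfield' },
    { key_name: 'Model', value: 'Bullet' },
    { key_name: 'Year', value: '1991' },
    { key_name: 'KM driven', value: '2,000 km' },
  ],
};

const specs = parametersToObject(record.parameters);
console.log(specs.Brand, specs.Year);
console.log(parsePrice(record.price) === record.price_raw);
```

Since `price_raw` is already numeric, `parsePrice` is mainly useful as a cross-check or when only the formatted string is at hand.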
Data Source
- Visits each product detail page individually
- Extracts data from visible page content (HTML selectors only)
- No API calls; pure browser automation
- Supports multiple categories (motorcycles, cars, electronics, etc.)
- Comprehensive listing information extraction
- Extracts seller information and contact details
- Captures all product images
- Geographic and temporal data included
Proxy Configuration
This actor uses Playwright with stealth settings. For production use, configure a proxy:
{
  "useApifyProxy": true,
  "apifyProxyGroups": ["RESIDENTIAL"],
  "apifyProxyCountry": "IN"
}
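As a rough illustration of how these three proxy fields could be interpreted in code, the sketch below maps them onto a smaller options object. `toProxyOptions` is a hypothetical helper, and the `groups` and `countryCode` names are assumed to match the options accepted by the Apify SDK's proxy configuration; check the SDK reference before relying on them.

```javascript
// Hypothetical helper: maps the proxy input shown above onto SDK-style options.
// The `groups` / `countryCode` option names are an assumption, not taken from this README.
function toProxyOptions(proxyInput = {}) {
  if (!proxyInput.useApifyProxy) {
    return null; // no Apify Proxy requested
  }
  const options = {};
  if (Array.isArray(proxyInput.apifyProxyGroups) && proxyInput.apifyProxyGroups.length > 0) {
    options.groups = [...proxyInput.apifyProxyGroups];
  }
  if (typeof proxyInput.apifyProxyCountry === 'string' && proxyInput.apifyProxyCountry !== '') {
    // Country codes are normalized to upper case, e.g. "in" -> "IN".
    options.countryCode = proxyInput.apifyProxyCountry.toUpperCase();
  }
  return options;
}

const proxyOptions = toProxyOptions({
  useApifyProxy: true,
  apifyProxyGroups: ['RESIDENTIAL'],
  apifyProxyCountry: 'IN',
});
console.log(proxyOptions);
```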
How to Use
- Open the actor in Apify Console
- Click "Try actor" or create a new task
- Enter your start URLs (e.g., "https://www.olx.in/kerala_g2001160/motorcycles_c81")
- Configure maxItems and maxPages settings
- Run the actor
- Download results in JSON, CSV, or Excel format
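The same run can also be started programmatically instead of through the Console. The sketch below builds a request for the Apify API's synchronous "run and get dataset items" endpoint and sends it with the global `fetch` available in Node.js 18+. The `ACTOR_ID` value is a placeholder, since this README does not state the actor's real ID, and `buildRunRequest` and `runActor` are hypothetical names.

```javascript
// Sketch: start the actor over HTTP and wait for its dataset items.
// ACTOR_ID is a placeholder; this README does not state the real ID.
const ACTOR_ID = 'username~olx-product-scraper'; // hypothetical

// Builds the URL and fetch options without sending anything.
function buildRunRequest(actorId, token, input) {
  const url = new URL(
    `https://api.apify.com/v2/acts/${encodeURIComponent(actorId)}/run-sync-get-dataset-items`,
  );
  url.searchParams.set('format', 'json');
  return {
    url: url.toString(),
    options: {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Bearer ${token}`,
      },
      body: JSON.stringify(input),
    },
  };
}

// Sends the request and resolves with the scraped records.
async function runActor(token, input) {
  const { url, options } = buildRunRequest(ACTOR_ID, token, input);
  const response = await fetch(url, options); // global fetch, Node.js 18+
  if (!response.ok) {
    throw new Error(`Apify API responded with HTTP ${response.status}`);
  }
  return response.json();
}

// Build (but do not send) a request for the basic input from this README.
const request = buildRunRequest(ACTOR_ID, 'MY_APIFY_TOKEN', {
  startUrls: [{ url: 'https://www.olx.in/kerala_g2001160/motorcycles_c81' }],
  maxItems: 50,
  maxPages: 5,
});
console.log(request.url);
```

To execute a real run, call `runActor(process.env.APIFY_TOKEN, input)` with a valid API token and the actor's actual ID. Synchronous runs have a time limit on the API side, so long scrapes may be better started asynchronously.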
Advanced Input Example
{
  "startUrls": [
    { "url": "https://www.olx.in/kerala_g2001160/motorcycles_c81" },
    { "url": "https://www.olx.in/mumbai_g4058877/cars_c84" }
  ],
  "maxItems": 100,
  "maxPages": 10
}
Tech Stack
- Apify SDK: actor orchestration and data handling
- Playwright: browser automation with stealth capabilities
- Node.js 18+: runtime environment
Data Processing Features
- Browser-based scraping to extract listing data
- Stealth mode with anti-detection measures
- Automatic pagination handling
- Structured data parsing and normalization
- Comprehensive logging and error handling
- Support for multiple categories and locations
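The README does not describe how pagination is implemented. One plausible approach, sketched below, is to derive one URL per page from each start URL up to `maxPages`. The `page` query parameter is an assumption about how OLX addresses listing pages, and `buildPageUrls` is a hypothetical helper; the actor may instead click a "load more" control in the browser.

```javascript
// Assumption: listing pages are addressed with a `page` query parameter.
// The actor's real pagination mechanism is not documented in this README.
function buildPageUrls(startUrl, maxPages) {
  const urls = [];
  for (let page = 1; page <= maxPages; page += 1) {
    const url = new URL(startUrl);
    if (page > 1) {
      // The first page is the start URL itself; later pages get ?page=N.
      url.searchParams.set('page', String(page));
    }
    urls.push(url.toString());
  }
  return urls;
}

const pages = buildPageUrls('https://www.olx.in/kerala_g2001160/motorcycles_c81', 3);
console.log(pages);
```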
Local Development
Installation
$ npm install
Running the Actor
$ npm start
Validation
Validate all generated datasets against the schema:
$ node validate-datasets.js
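The contents of `validate-datasets.js` are not shown in this README. As a rough idea of what a schema check might involve, the sketch below compares a record against the types listed in the Output Fields table. `FIELD_TYPES` and `validateRecord` are hypothetical, and treating `mobile_number` as optional is an assumption based on its "if available" note.

```javascript
// Hypothetical schema derived from the Output Fields table; the real
// validate-datasets.js may check different or additional rules.
const FIELD_TYPES = {
  id: 'string',
  product_url: 'string',
  title: 'string',
  description: 'string',
  price: 'string',
  price_raw: 'number',
  category: 'string',
  location: 'string',
  posted_date: 'string',
  seller_name: 'string',
  seller_type: 'string',
  mobile_number: 'string',
  images: 'array',
  mainImage: 'string',
  status: 'string',
  parameters: 'array',
};

// Returns a list of human-readable problems; an empty list means the record passed.
function validateRecord(record) {
  const errors = [];
  for (const [field, expected] of Object.entries(FIELD_TYPES)) {
    const value = record[field];
    if (value === undefined || value === null) {
      // mobile_number is documented as "if available", so it may be absent.
      if (field !== 'mobile_number') errors.push(`${field}: missing`);
      continue;
    }
    const actual = Array.isArray(value) ? 'array' : typeof value;
    if (actual !== expected) {
      errors.push(`${field}: expected ${expected}, got ${actual}`);
    }
  }
  return errors;
}

// A record with a wrongly typed price_raw and most fields missing.
const problems = validateRecord({ id: '1733271487', price_raw: '140000', images: [] });
console.log(problems);
```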
Code Quality
Format code:
$ npm run format
Check linting:
$ npm run lint
Support
For issues, questions, or feature requests, please refer to the Apify documentation or contact support.