Tokopedia Scraper — Indonesia #1 Marketplace Product Search
Pricing
from $0.60 / 1,000 results
Scrape Tokopedia product search results by keyword or URL. Extract product name, price, rating, review count, discount, shop info, category breadcrumb, and images. Pure HTTP GraphQL API — fast, low-cost, no browser overhead. Perfect for market research, competitor monitoring, and lead generation.
Developer
Haketa
Maintained by Community
Extract product search results from Tokopedia, Indonesia's largest e-commerce platform with over 100 million monthly active users. This scraper pulls product listings — name, price, rating, review count, discount, shop details, category breadcrumb, and images — directly from Tokopedia's data source using a fast, lightweight HTTP-only approach with no browser overhead.
Why This Tokopedia Scraper?
Most Tokopedia scrapers rely on heavy headless browsers (Puppeteer/Playwright) to render JavaScript pages, driving up cost and runtime. This scraper communicates directly with the public product data API, delivering results 5–10× faster and at a fraction of the cost of browser-based alternatives. A single search for 1,000 products typically completes in about 20–30 seconds and costs roughly $0.03–0.08 in platform usage.
Key Advantages
- Pure HTTP — no browser: Communicates directly with the GraphQL data source. No rendering, no JavaScript, no captchas.
- Fast & cheap: ~1–2 seconds per page of 60 results. Browser scrapers need 8–15 seconds per page.
- Anti-bot bypass built-in: Uses mobile-device header emulation to pass through bot detection without residential proxies.
- Deduplication: Never outputs duplicate products — tracks seen IDs across all pages.
- Multi-keyword: Run multiple searches in a single run. Great for category-wide market research.
- Flexible sorting: Relevance, newest, top sales, price low-to-high, price high-to-low, highest rating.
What Data Does It Extract?
Every product result includes the following fields:
| Field | Description |
|---|---|
| id | Tokopedia product ID |
| name | Product title |
| url | Product page URL on tokopedia.com |
| price | Display price (e.g. "Rp370.000") |
| priceRaw | Numeric price in IDR (e.g. 370000) |
| originalPrice | Pre-discount price if discounted |
| originalPriceRaw | Numeric pre-discount price |
| discountPercentage | Discount rate (0–100) |
| priceRange | Price range string if multi-variant |
| rating | Numeric rating score |
| ratingAverage | Average star rating |
| countReview | Number of reviews |
| wishlist | Whether on wishlist |
| imageUrl | Main product image |
| videoUrl | Product video if available |
| categoryId | Tokopedia category ID |
| categoryName | Category name |
| categoryBreadcrumb | Full category path |
| shopId | Seller shop ID |
| shopName | Seller shop name |
| shopUrl | Seller shop URL |
| shopCity | Seller location city |
| shopIsOfficial | Whether an official brand store |
| shopIsPowerBadge | Power seller badge status |
| searchKeyword | The keyword used for this result |
| searchUrl | The search URL that produced this result |
| scrapedAt | ISO timestamp of extraction |
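The relationship between the display and numeric price fields can be illustrated with a short Python sketch. It assumes display prices follow the "Rp370.000" pattern shown in the table, with dots as thousands separators and no decimal part. The helper names `parse_rupiah` and `discount_percentage` are hypothetical and are not part of the scraper, and the discount formula is one plausible reading of the discountPercentage field rather than Tokopedia's confirmed calculation.

```python
import re
from typing import Optional


def parse_rupiah(display_price: str) -> Optional[int]:
    """Turn a display price such as "Rp370.000" into an integer amount in IDR.

    Assumption: dots are thousands separators and there is no decimal part.
    Returns None for an empty string (e.g. no pre-discount price).
    """
    digits = re.sub(r"\D", "", display_price)
    return int(digits) if digits else None


def discount_percentage(price_raw: int, original_price_raw: Optional[int]) -> int:
    """Hypothetical cross-check: discount as a whole-number percentage (0-100)."""
    if not original_price_raw or original_price_raw <= price_raw:
        return 0
    return round((original_price_raw - price_raw) / original_price_raw * 100)


print(parse_rupiah("Rp370.000"))
print(discount_percentage(75_000, 100_000))
```

A check like this can help flag records where the display string and the numeric field disagree after export.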
Use Cases
E-commerce Market Research
Track product availability, pricing trends, and category composition across Indonesia's largest marketplace. Run weekly searches for your product category to monitor new entrants and price movements.
Competitive Intelligence
Search for competitor brand names and products. See how they price, how many reviews they have, whether they run discounts, and which shops carry their products.
Lead Generation for Sellers & Brands
Find shops selling in your category. Extract seller names, locations, and shop URLs. Identify official stores and power sellers for partnership or distribution opportunities.
Dropshipping & Product Sourcing
Discover trending products and bestsellers. Sort by top sales or newest to find winning products for your own store. Use category breadcrumbs to explore related niches.
Price Monitoring
Track specific products or categories over time. Combine with scheduled runs (Apify Schedules) to build price history datasets. Set maxItemsPerSearch to 500+ for comprehensive coverage.
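As a rough illustration of building a price history, the following Python sketch compares two exported snapshots and reports products whose priceRaw changed. It assumes each snapshot is a list of result objects with the id, name, and priceRaw fields described above; the function name `price_changes` and the `changePct` key are hypothetical.

```python
def price_changes(previous: list, current: list) -> list:
    """Return one record per product whose numeric price differs between runs."""
    old_prices = {item["id"]: item["priceRaw"] for item in previous}
    changes = []
    for item in current:
        before = old_prices.get(item["id"])
        after = item["priceRaw"]
        # Skip products that are new, have no usable price, or did not change.
        if not before or after is None or before == after:
            continue
        changes.append({
            "id": item["id"],
            "name": item["name"],
            "before": before,
            "after": after,
            "changePct": round((after - before) / before * 100, 1),
        })
    return changes


last_week = [{"id": "1", "name": "Case A", "priceRaw": 100_000}]
this_week = [
    {"id": "1", "name": "Case A", "priceRaw": 90_000},
    {"id": "2", "name": "Case B", "priceRaw": 50_000},
]
for change in price_changes(last_week, this_week):
    print(change)
```

Matching on id rather than name keeps the comparison stable when sellers edit product titles.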
Academic & Data Science Research
Study the e-commerce ecosystem of Indonesia, the world's fourth most populous country. Extract structured data for economic analysis, consumer behavior studies, or marketplace dynamics research.
Input Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| searchKeywords | string array | No* | Product keywords to search. Each runs independently. Example: ["iphone 15", "sepatu nike"] |
| startUrls | string array | No* | Direct Tokopedia search or category URLs. Example: ["https://www.tokopedia.com/search?q=laptop"] |
| maxItemsPerSearch | integer | No | Max products per keyword/URL. Default 100. Set 0 for unlimited. |
| maxTotalItems | integer | No | Global cap across all searches. Default 0 (unlimited). |
| rowsPerPage | integer | No | Results per API request (1–60). Default 60 for efficiency. |
| sortOrder | select | No | Sort order: 23=Relevance, 5=Newest, 8=Top Sales, 11=Price Low-High, 10=Price High-Low, 9=Rating |
| proxyConfiguration | proxy | No | Proxy settings. Datacenter proxy recommended for scale. |
* At least one of searchKeywords or startUrls is required.
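The rules in the table can be checked before starting a run. The Python sketch below applies the documented defaults and enforces the "at least one of searchKeywords or startUrls" requirement. The function `normalize_input` is a hypothetical pre-flight helper, not the scraper's own validation, and whether sortOrder is passed as a number or a string is an assumption.

```python
DEFAULTS = {
    "maxItemsPerSearch": 100,  # 0 means unlimited
    "maxTotalItems": 0,        # 0 means unlimited
    "rowsPerPage": 60,
    "sortOrder": 23,           # relevance
}
VALID_SORT_ORDERS = {23, 5, 8, 11, 10, 9}


def normalize_input(actor_input: dict) -> dict:
    """Fill in documented defaults and reject inputs the table rules out."""
    keywords = actor_input.get("searchKeywords") or []
    start_urls = actor_input.get("startUrls") or []
    if not keywords and not start_urls:
        raise ValueError("Provide at least one of searchKeywords or startUrls")

    merged = {**DEFAULTS, **actor_input}
    merged["searchKeywords"] = list(keywords)
    merged["startUrls"] = list(start_urls)

    if not 1 <= merged["rowsPerPage"] <= 60:
        raise ValueError("rowsPerPage must be between 1 and 60")
    if int(merged["sortOrder"]) not in VALID_SORT_ORDERS:
        raise ValueError("Unknown sortOrder value")
    return merged


run_input = normalize_input({"searchKeywords": ["iphone 15", "sepatu nike"], "sortOrder": 8})
print(run_input)
```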
Sort Order Reference
| Value | Meaning |
|---|---|
| 23 | Relevance (default) |
| 5 | Newest first |
| 8 | Top sales |
| 11 | Price: Low → High |
| 10 | Price: High → Low |
| 9 | Highest rating |
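The sort values can be kept behind readable names when preparing startUrls. The Python sketch below builds a search URL in the same shape as the searchUrl field in the sample output (`q`, `ob`, `source`). That the `ob` query parameter carries the sort value is inferred from that sample, and the names `SORT_ORDERS` and `build_search_url` are hypothetical.

```python
from urllib.parse import urlencode

# Readable names for the documented sortOrder values.
SORT_ORDERS = {
    "relevance": 23,
    "newest": 5,
    "top_sales": 8,
    "price_low_high": 11,
    "price_high_low": 10,
    "rating": 9,
}


def build_search_url(keyword: str, sort: str = "relevance") -> str:
    """Build a Tokopedia search URL; spaces in the keyword are encoded as '+'."""
    params = {"q": keyword, "ob": SORT_ORDERS[sort], "source": "search"}
    return "https://www.tokopedia.com/search?" + urlencode(params)


print(build_search_url("iphone 15"))
print(build_search_url("sepatu nike", sort="top_sales"))
```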
Output Format
Results are stored in the Apify Dataset as JSON objects. Each object represents one unique product from the search results.
{
    "id": "13252523998",
    "name": "Lifesafer Anti Heat Iphone 15 Promax Aluminium Case Aromatherapy - Silver",
    "url": "https://www.tokopedia.com/wijayakusumaolshop/lifesafer-anti-heat-iphone-15-promax-aluminium-case-aromatherapy-silver",
    "price": "Rp370.000",
    "priceRaw": 370000,
    "originalPrice": "",
    "originalPriceRaw": null,
    "discountPercentage": 0,
    "priceRange": "",
    "rating": 5,
    "ratingAverage": 5.0,
    "countReview": 12,
    "wishlist": false,
    "imageUrl": "https://images.tokopedia.net/img/cache/700-square/VqbWKr/2024/12/...jpg",
    "videoUrl": "",
    "categoryId": 65,
    "categoryName": "Soft Case Handphone",
    "categoryBreadcrumb": "handphone-tablet/aksesoris-handphone/soft-case-handphone",
    "shopId": "11279607",
    "shopName": "wijayakusumaolshop",
    "shopUrl": "https://www.tokopedia.com/wijayakusumaolshop",
    "shopCity": "Jakarta Barat",
    "shopIsOfficial": false,
    "shopIsPowerBadge": false,
    "gaKey": "handphone-tablet/aksesoris-handphone/soft-case-handphone",
    "sourceEngine": "",
    "searchKeyword": "iphone 15",
    "searchUrl": "https://www.tokopedia.com/search?q=iphone+15&ob=23&source=search",
    "scrapedAt": "2026-06-26T10:30:00.000Z"
}
Performance & Pricing
| Metric | Typical Value |
|---|---|
| Speed | ~1–2 seconds per page (60 results) |
| 100 results | ~3–5 seconds |
| 1,000 results | ~20–30 seconds |
| Memory usage | ~200–400 MB |
| Cost per 1,000 results | ~$0.03–0.08 (platform usage) |
Datacenter proxies are included in the default configuration and work well for most use cases. Residential proxies are available but rarely needed — this scraper's mobile header emulation bypasses the bot detection that typically requires residential IPs.
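The figures in the table above follow from simple arithmetic on the page size. The Python sketch below estimates the number of requests and the request time for a given result count, assuming 60 results per page and 1–2 seconds per page as stated. The function name `estimate_run` is hypothetical, and the estimate covers request time only, so startup and storage overhead are not included.

```python
import math


def estimate_run(total_results: int, rows_per_page: int = 60,
                 seconds_per_page: tuple = (1.0, 2.0)) -> tuple:
    """Return (pages, fastest_seconds, slowest_seconds) for a search."""
    pages = math.ceil(total_results / rows_per_page)
    fastest, slowest = seconds_per_page
    return pages, pages * fastest, pages * slowest


pages, fastest, slowest = estimate_run(1000)
print(f"{pages} requests, roughly {fastest:.0f}-{slowest:.0f} s of request time")
```

For 1,000 results this gives 17 requests, which is in line with the ~20–30 second figure in the table.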
How It Works — Technical Overview
The scraper sends HTTP POST requests directly to Tokopedia's product search endpoint with iOS mobile-device headers. This approach:
- Bypasses bot detection — Tokopedia's desktop site uses browser fingerprinting that blocks automated requests. The mobile API path, when called with proper iOS device headers (User-Agent, X-Device, Bd-Device-Id, etc.), responds normally.
- Returns structured JSON — No HTML parsing needed. The response is clean, typed GraphQL JSON.
- Paginates efficiently — Each request returns up to 60 products. The scraper follows page numbers until reaching the limit or exhausting results.
- Deduplicates across pages — Product IDs are tracked; duplicates from ranking shifts between pages are silently dropped.
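The pagination and deduplication steps above can be sketched in Python as follows. This is a minimal illustration, not the scraper's actual code: `fetch_page` is a hypothetical callable standing in for the HTTP request to the search endpoint, and it is assumed to return a list of product dicts with an `id` key, or an empty list when results are exhausted.

```python
from typing import Callable, Iterator


def collect_products(fetch_page: Callable[[int], list],
                     max_items: int = 100) -> Iterator[dict]:
    """Yield unique products page by page until a limit or the end of results.

    max_items=0 means unlimited, mirroring maxItemsPerSearch.
    """
    seen_ids = set()
    emitted = 0
    page = 1
    while True:
        products = fetch_page(page)
        if not products:  # no more results
            return
        for product in products:
            if product["id"] in seen_ids:
                continue  # duplicate caused by ranking shifts between pages
            seen_ids.add(product["id"])
            yield product
            emitted += 1
            if max_items and emitted >= max_items:
                return
        page += 1


# Stand-in data: product "b" appears on both pages.
fake_pages = {1: [{"id": "a"}, {"id": "b"}], 2: [{"id": "b"}, {"id": "c"}]}
unique = list(collect_products(lambda n: fake_pages.get(n, []), max_items=0))
print([p["id"] for p in unique])
```

Injecting the page fetcher keeps the loop testable without network access.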
Tips for Best Results
- Use specific keywords: "iphone 15 pro max case" gives more targeted results than "case". Tokopedia's search is keyword-based; precise queries yield better data.
- Start small, then scale: Test with maxItemsPerSearch: 60 first to verify data quality for your keyword. Then increase to 500+ for full extraction.
- Combine with sorting: Sort by "top sales" (sortOrder: 8) to find bestsellers. Sort by "newest" (sortOrder: 5) to discover recently listed products.
- Run multiple keywords: One run can search 10, 20, or more keywords. Ideal for category-wide research: just list all your target keywords in searchKeywords.
- Schedule for monitoring: Use Apify Schedules to run daily or weekly searches. Track new products, price changes, and rating shifts over time.
- Export to your stack: Apify integrations let you push results to Google Sheets, Airtable, Notion, SQL databases, S3, or any REST API webhook.
Limitations
- Public search results only — This scraper extracts what Tokopedia shows in public search. Seller-private data, order volumes, and exact inventory counts are not available.
- Search API fields — The public API does not expose the "sold count" field (typically shown as "Terjual 100+"). We extract all available structured fields; some display-only labels are not present in the API response.
- Rate limits — Tokopedia may throttle excessive requests. The scraper includes exponential backoff and retry logic. For very large extractions (>5,000 results), spread across multiple runs or add delays.
- Indonesian marketplace — Tokopedia primarily serves Indonesia. Products are listed in Indonesian (Bahasa Indonesia). Non-Indonesian keywords may return limited results.
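The rate-limits item above mentions exponential backoff and retry logic. A generic version of that pattern might look like the Python sketch below. The scraper's real retry parameters are not documented, so the attempt count, delays, and jitter here are assumptions, and `with_backoff` is a hypothetical helper.

```python
import random
import time
from typing import Callable


def with_backoff(request: Callable[[], object], max_attempts: int = 5,
                 base_delay: float = 1.0, max_delay: float = 30.0,
                 sleep: Callable[[float], None] = time.sleep) -> object:
    """Call request(), retrying on failure with exponentially growing delays."""
    for attempt in range(max_attempts):
        try:
            return request()
        except Exception:  # in practice, catch only throttling/network errors
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Add up to 50% random jitter so parallel runs do not retry in lockstep.
            sleep(delay + random.uniform(0, delay / 2))


# Stand-in request that fails twice before succeeding.
calls = {"count": 0}

def flaky_request() -> str:
    calls["count"] += 1
    if calls["count"] < 3:
        raise RuntimeError("throttled")
    return "ok"

recorded_delays = []
result = with_backoff(flaky_request, sleep=recorded_delays.append)
print(result, len(recorded_delays))
```

Passing `sleep` as a parameter lets the example run instantly; production code would leave the default `time.sleep`.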
Legal & Responsible Use
This scraper extracts publicly available product information from Tokopedia. It does not access private accounts, bypass authentication, or scrape personal data.
To use this scraper responsibly:
- Comply with Tokopedia's Terms of Service
- Respect rate limits — don't overwhelm their servers
- Use extracted data in compliance with applicable laws
- This tool is for legitimate market research, competitive analysis, and academic use
Made for serious e-commerce data users — researchers, analysts, brand owners, and developers who need reliable Tokopedia data at scale without the cost and complexity of browser-based scraping.