Tokopedia Scraper
Pricing
from $5.00 / 1,000 product details
Scrape Tokopedia product detail, category listings, and shop catalogs by URL — price, rating, reviews, stock, shop, image. HTTP-only via Chrome-TLS impersonation, no login. MCP-ready.
Developer
Khadin Akbar
Maintained by Community
Scrape Tokopedia — Indonesia's largest marketplace — for product detail, category listings, and shop catalogs. Paste any Tokopedia product, category, or shop URL and get clean, structured JSON: price, rating, reviews, stock, seller, image, and more. HTTP-only (no browser), no login, no cookies. Built for analysts, sellers, and AI agents.
What it does
Give the actor one or more Tokopedia URLs. It auto-detects the page type and returns one flat record per product:
| URL you paste | Mode | What you get |
|---|---|---|
| `https://www.tokopedia.com/{shop}/{product-slug}` | Product detail | Name, price, slash/original price, discount, rating score, total ratings, per-star breakdown, description, image, shop |
| `https://www.tokopedia.com/p/handphone-tablet/handphone` | Category listing | Up to ~80 product cards per page (auto-paginated): name, price, rating, reviews, stock, badges, image, shop |
| `https://www.tokopedia.com/{shop}` or `.../{shop}/product` | Shop catalog | The shop's products (auto-paginated): name, price, rating, reviews, image |
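The URL-to-mode mapping in the table can be sketched as a small classifier. This is only an illustration of the routing rules as the table describes them, not the actor's actual code: the function name `classify_tokopedia_url` and the returned labels are hypothetical.

```python
from urllib.parse import urlparse


def classify_tokopedia_url(url: str) -> str:
    """Return 'product', 'category', 'shop', or 'unsupported'.

    Hypothetical helper: the labels and rules mirror the table above,
    not the actor's internal implementation.
    """
    parsed = urlparse(url)
    if parsed.netloc.lower() not in ("tokopedia.com", "www.tokopedia.com"):
        return "unsupported"

    # Path segments without empty strings, e.g. ['p', 'handphone-tablet', 'handphone'].
    segments = [s for s in parsed.path.split("/") if s]
    if not segments or segments[0] == "search":
        return "unsupported"  # keyword search is not scrapable over plain HTTP
    if segments[0] == "p":
        return "category" if len(segments) > 1 else "unsupported"
    if len(segments) == 1:
        return "shop"  # /{shop}
    if len(segments) == 2:
        # /{shop}/product is the shop catalog; any other slug is a product page.
        return "shop" if segments[1] == "product" else "product"
    return "unsupported"
```

For example, `classify_tokopedia_url("https://www.tokopedia.com/cuan-gadgetid/product")` yields `"shop"` under these assumed rules.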
When to use it
- Price & competitor monitoring — track competitor SKUs, prices, and stock on Tokopedia.
- Catalog research — pull an entire category or a competitor's shop catalog.
- Market research — rating, review counts, and badges across a category.
- AI agents — a single URL in, structured JSON out. MCP-ready.
Not for free-text keyword search. Tokopedia loads `/search?q=...` results from a bot-protected internal API that this HTTP-only actor cannot reach. Browse by category (`/p/...`) or shop instead; those pages are server-rendered and fully scrapable. Any `/search?q=` URL you pass is skipped with a clear message.
Output sample

    {
      "type": "listing",
      "productId": "102798088938",
      "name": "OPPO RENO 15 5G (8/256 GB, 12/256 GB) Garansi Resmi",
      "url": "https://www.tokopedia.com/cellsius/oppo-reno-15-5g-8-256-gb-12-256-gb-garansi-resmi-1733884686684424021",
      "price": "Rp9.299.000",
      "priceInt": 9299000,
      "rating": 5,
      "reviewCount": 1,
      "stock": 30,
      "categoryBreadcrumbs": "Handphone & Tablet/Handphone/Android OS",
      "badges": ["Mall"],
      "imageUrl": "https://images.tokopedia.net/...",
      "shopName": "cellsius",
      "shopUrl": "https://www.tokopedia.com/cellsius",
      "sourceUrl": "https://www.tokopedia.com/p/handphone-tablet/handphone",
      "scrapedAt": "2026-06-21T00:00:00.000Z"
    }

Product-detail records add `slashPrice`, `discountPercentage`, `ratingScore`, `ratingCount`, `ratingBreakdown`, and `description`.
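The sample shows both a display string (`price`) and an integer (`priceInt`). If you need to normalize a Rupiah string yourself, for instance when comparing against prices from another source, a minimal sketch follows. It assumes the dot is a thousands separator and that there is no fractional part, as in the sample record; `parse_rupiah` is a hypothetical helper, not part of the actor.

```python
import re


def parse_rupiah(price: str) -> int:
    """Convert a display price such as 'Rp9.299.000' to an integer number of Rupiah.

    Assumption: dots are thousands separators and the string carries no decimals.
    """
    digits = re.sub(r"\D", "", price)  # drop 'Rp', dots, and whitespace
    if not digits:
        raise ValueError(f"no digits found in price string: {price!r}")
    return int(digits)


# Check against the two fields from the sample record above.
sample = {"price": "Rp9.299.000", "priceInt": 9299000}
assert parse_rupiah(sample["price"]) == sample["priceInt"]
```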
Pricing
This actor is pay-per-event. You are billed only for what you get:
| Event | Price | When |
|---|---|---|
| Actor start | $0.00005 | Once per run (scaled by memory) |
| Listing product | $0.003 | Each product from a category or shop page |
| Product detail | $0.005 | Each product from a product-detail (PDP) URL |
A 100-product category pull costs about $0.30. The actor enforces your maxResults cap and stops charging once it's reached. Pay-Per-Usage (compute + proxy) is also available for very large jobs.
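The arithmetic behind that estimate can be checked with a few lines. The sketch below uses the event prices from the table and treats the actor-start fee as a flat amount, which is a simplification because the table notes that it scales with memory. The function name `estimate_cost_usd` is illustrative.

```python
ACTOR_START_USD = 0.00005  # once per run; assumed flat here (it scales with memory)
LISTING_USD = 0.003        # per product from a category or shop page
DETAIL_USD = 0.005         # per product from a product-detail URL


def estimate_cost_usd(listing_products: int, detail_products: int = 0) -> float:
    """Estimate the pay-per-event charge for one run, in US dollars."""
    total = (
        ACTOR_START_USD
        + listing_products * LISTING_USD
        + detail_products * DETAIL_USD
    )
    return round(total, 5)


print(estimate_cost_usd(100))      # 100 listing products
print(estimate_cost_usd(0, 1000))  # 1,000 product details
```

Under these assumptions, 100 listing products come to roughly $0.30 and 1,000 product details to roughly $5.00, matching the figures quoted on this page.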
Input
| Field | Type | Default | Description |
|---|---|---|---|
| `startUrls` | array of strings | (required) | Tokopedia product, category (`/p/...`), or shop URLs |
| `maxResults` | integer | 100 | Hard cap on products returned across all URLs |
| `maxPagesPerList` | integer | 5 | Pagination depth per category/shop URL (~80 products/page) |
| `proxyConfiguration` | object | Apify datacenter | Proxy; default is sufficient |
Example input

    {
      "startUrls": [
        "https://www.tokopedia.com/p/handphone-tablet/handphone",
        "https://www.tokopedia.com/cuan-gadgetid/product"
      ],
      "maxResults": 200,
      "maxPagesPerList": 5
    }

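One way to send this input programmatically is the official `apify-client` package for Python. The sketch below is a hedged example, not a documented integration: `build_run_input` and `run_actor` are hypothetical helpers, and the actor ID must be copied from the actor's store page because this document does not state it.

```python
def build_run_input(start_urls, max_results=100, max_pages_per_list=5):
    """Assemble the input object using the field names from the Input table."""
    if not start_urls:
        raise ValueError("startUrls is required and must not be empty")
    return {
        "startUrls": list(start_urls),
        "maxResults": max_results,
        "maxPagesPerList": max_pages_per_list,
    }


def run_actor(token: str, actor_id: str, run_input: dict) -> list:
    """Start the actor, wait for it to finish, and return the dataset items.

    actor_id is a placeholder you must supply; requires `pip install apify-client`.
    """
    from apify_client import ApifyClient  # imported lazily so the helper above works without it

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("actor run did not return run details")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


run_input = build_run_input(
    [
        "https://www.tokopedia.com/p/handphone-tablet/handphone",
        "https://www.tokopedia.com/cuan-gadgetid/product",
    ],
    max_results=200,
)
```

Calling `run_actor(token, actor_id, run_input)` with your own API token and the actor's ID would then return one dictionary per product.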
How it works
Tokopedia protects its edge with TLS/HTTP-2 fingerprinting that blocks ordinary HTTP clients. This actor uses impit Chrome-TLS impersonation to pass that block from a plain datacenter proxy — no headless browser, no residential IPs required — then parses the product data Tokopedia embeds in each server-rendered page (its Apollo window.__cache). That keeps runs fast and cheap while staying reliable.
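To illustrate the parsing step only, here is a minimal sketch of pulling an embedded JSON object out of server-rendered HTML. It assumes the page contains a script assignment of the form `window.__cache = {...}` with a plain JSON object literal on the right-hand side; the real markup may differ, and the fetching step (TLS impersonation through a proxy) is omitted. `extract_embedded_cache` is a hypothetical name.

```python
import json
import re

# Assumption: the cache is assigned as a JSON object literal in an inline script.
_CACHE_ASSIGNMENT = re.compile(r"window\.__cache\s*=\s*")


def extract_embedded_cache(html: str):
    """Return the decoded object assigned to window.__cache, or None if absent."""
    match = _CACHE_ASSIGNMENT.search(html)
    if match is None:
        return None
    try:
        # raw_decode parses one JSON value starting at the given index and
        # ignores whatever follows it (the trailing ';' and closing tag).
        value, _end = json.JSONDecoder().raw_decode(html, match.end())
    except json.JSONDecodeError:
        return None
    return value


# Synthetic HTML for demonstration; not real Tokopedia markup.
html = '<script>window.__cache = {"Product:1": {"name": "demo", "price": 1000}};</script>'
cache = extract_embedded_cache(html)
```

Using `raw_decode` avoids writing a regex that has to balance nested braces, which tends to break on product descriptions that contain `}` characters.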
The actor is resilient by design: it auto-paginates listings, de-duplicates products across pages so you are never charged twice for the same item, retries on transient blocks with a fresh session, fails honestly (a fully blocked run is reported as failed, not a silent empty success), and prints an up-front cost cap.
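The de-duplication and result cap described above can be sketched as a generator. This is an illustrative approximation that assumes `productId` is the de-duplication key (it is the only stable identifier in the output sample); the function name `collect_unique` is hypothetical.

```python
from typing import Iterable, Iterator


def collect_unique(pages: Iterable[list], max_results: int) -> Iterator[dict]:
    """Yield each product at most once, stopping when max_results is reached."""
    seen = set()
    for page in pages:
        for product in page:
            if len(seen) >= max_results:
                return  # cap reached: nothing further is emitted (or charged)
            product_id = product.get("productId")
            if product_id is None or product_id in seen:
                continue  # skip records without an ID and repeats across pages
            seen.add(product_id)
            yield product


# Product "2" appears on both pages and is emitted only once.
pages = [
    [{"productId": "1"}, {"productId": "2"}],
    [{"productId": "2"}, {"productId": "3"}, {"productId": "4"}],
]
unique_ids = [p["productId"] for p in collect_unique(pages, max_results=3)]
```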
Use with AI agents (MCP)
This actor is MCP-ready and exposed through the Apify MCP server as apify--tokopedia-scraper. Agents pass a Tokopedia URL and receive flat, token-efficient JSON. Input descriptions are written for routing ("use this when…"), and errors return actionable guidance.
FAQ
Can it search by keyword? No — Tokopedia serves /search?q= from a bot-protected API. Use category (/p/...) or shop URLs, which are server-rendered.
Do I need an Indonesia proxy? No. The default Apify datacenter proxy works; the Chrome-TLS impersonation does the heavy lifting.
Does it need login or cookies? No. It only reads public, server-rendered pages.
What if a product was removed? Dead/removed URLs are skipped with a warning; the run still succeeds for the rest.
How many products per listing page? About 80. Increase maxPagesPerList to go deeper.
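As a rough guide for choosing `maxPagesPerList`, the page count for a target number of products can be estimated as follows. The figure of 80 products per page is the approximate value quoted above, so treat the result as an estimate; `pages_needed` is a hypothetical helper.

```python
import math

PRODUCTS_PER_PAGE = 80  # approximate, per the FAQ answer above


def pages_needed(target_products: int, per_page: int = PRODUCTS_PER_PAGE) -> int:
    """Estimate how many listing pages are required to reach target_products."""
    if target_products <= 0:
        return 0
    return math.ceil(target_products / per_page)


print(pages_needed(200))  # pages to cover a maxResults of 200 from one listing
print(pages_needed(400))  # what the default depth of 5 pages covers
```

With the default `maxPagesPerList` of 5, a single category or shop URL therefore tops out at about 400 products, so raise it if one listing needs to supply more than that.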
Legal & ethical use
This actor collects only publicly available data from Tokopedia pages. You are responsible for complying with Tokopedia's Terms of Service, applicable laws (including Indonesian data-protection law), and for not collecting personal data without a lawful basis. Use the data for legitimate purposes such as price monitoring and market research. This actor is not affiliated with, endorsed by, or sponsored by Tokopedia.
Support
Found a bug or want a new field? Open an issue on the actor's Issues tab.