Tokopedia Scraper

Pricing

from $5.00 / 1,000 product details

Scrape Tokopedia product detail, category listings, and shop catalogs by URL — price, rating, reviews, stock, shop, image. HTTP-only via Chrome-TLS impersonation, no login. MCP-ready.

Developer

Khadin Akbar

Maintained by Community

Scrape Tokopedia — Indonesia's largest marketplace — for product detail, category listings, and shop catalogs. Paste any Tokopedia product, category, or shop URL and get clean, structured JSON: price, rating, reviews, stock, seller, image, and more. HTTP-only (no browser), no login, no cookies. Built for analysts, sellers, and AI agents.

What it does

Give the actor one or more Tokopedia URLs. It auto-detects the page type and returns one flat record per product:

| URL you paste | Mode | What you get |
| --- | --- | --- |
| https://www.tokopedia.com/{shop}/{product-slug} | Product detail | Name, price, slash/original price, discount, rating score, total ratings, per-star breakdown, description, image, shop |
| https://www.tokopedia.com/p/handphone-tablet/handphone | Category listing | Up to ~80 product cards per page (auto-paginated): name, price, rating, reviews, stock, badges, image, shop |
| https://www.tokopedia.com/{shop} or .../{shop}/product | Shop catalog | The shop's products (auto-paginated): name, price, rating, reviews, image |
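
As an illustration of the auto-detection described above, here is a minimal Python sketch that maps a URL to one of the three modes. It is not the actor's own code: the function name `classify_tokopedia_url` and the path rules are assumptions inferred only from the URL patterns in the table, and real Tokopedia paths outside those patterns (help pages, promotions, and so on) would need extra handling.

```python
from urllib.parse import urlparse


def classify_tokopedia_url(url: str) -> str:
    """Return 'product', 'category', 'shop', or 'unsupported'.

    Hypothetical helper: the rules below only mirror the URL shapes
    listed in the table, not the actor's real detection logic.
    """
    parsed = urlparse(url)
    if parsed.netloc.lower() not in ("www.tokopedia.com", "tokopedia.com"):
        return "unsupported"

    segments = [s for s in parsed.path.split("/") if s]
    if not segments or segments[0] == "search":
        # Keyword search (/search?q=...) is not supported.
        return "unsupported"
    if segments[0] == "p":
        # /p/<category>/... is a category listing.
        return "category" if len(segments) > 1 else "unsupported"
    if len(segments) == 1:
        # /{shop}
        return "shop"
    if len(segments) == 2 and segments[1] == "product":
        # /{shop}/product
        return "shop"
    if len(segments) == 2:
        # /{shop}/{product-slug}
        return "product"
    return "unsupported"
```

Under these assumptions, `https://www.tokopedia.com/p/handphone-tablet/handphone` is classified as a category listing and `https://www.tokopedia.com/cuan-gadgetid/product` as a shop catalog.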

When to use it

  • Price & competitor monitoring — track competitor SKUs, prices, and stock on Tokopedia.
  • Catalog research — pull an entire category or a competitor's shop catalog.
  • Market research — rating, review counts, and badges across a category.
  • AI agents — a single URL in, structured JSON out. MCP-ready.

Not for free-text keyword search. Tokopedia loads /search?q=... results from a bot-protected internal API that this HTTP-only actor cannot reach. Browse by category (/p/...) or shop instead; those pages are server-rendered and fully scrapable. A /search?q= URL in the input is skipped with a clear message.

Output sample

{
  "type": "listing",
  "productId": "102798088938",
  "name": "OPPO RENO 15 5G (8/256 GB, 12/256 GB) Garansi Resmi",
  "url": "https://www.tokopedia.com/cellsius/oppo-reno-15-5g-8-256-gb-12-256-gb-garansi-resmi-1733884686684424021",
  "price": "Rp9.299.000",
  "priceInt": 9299000,
  "rating": 5,
  "reviewCount": 1,
  "stock": 30,
  "categoryBreadcrumbs": "Handphone & Tablet/Handphone/Android OS",
  "badges": ["Mall"],
  "imageUrl": "https://images.tokopedia.net/...",
  "shopName": "cellsius",
  "shopUrl": "https://www.tokopedia.com/cellsius",
  "sourceUrl": "https://www.tokopedia.com/p/handphone-tablet/handphone",
  "scrapedAt": "2026-06-21T00:00:00.000Z"
}

Product-detail records add slashPrice, discountPercentage, ratingScore, ratingCount, ratingBreakdown, and description.
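
The sample carries the price twice: as the display string `price` and as the integer `priceInt`. The following Python sketch shows how the two relate, assuming prices are whole rupiah with `.` as the thousands separator, as in the sample. The helper names `parse_rupiah` and `format_rupiah` are hypothetical and not part of the actor.

```python
def parse_rupiah(price: str) -> int:
    """Convert a display price such as 'Rp9.299.000' to an integer.

    Assumes whole-rupiah amounts with '.' as the thousands separator;
    anything else raises ValueError instead of being guessed at.
    """
    text = price.strip()
    if text.lower().startswith("rp"):
        text = text[2:]
    text = text.replace(".", "").strip()
    if not (text.isascii() and text.isdigit()):
        raise ValueError(f"unexpected price format: {price!r}")
    return int(text)


def format_rupiah(amount: int) -> str:
    """Format an integer amount back into the 'Rp9.299.000' style."""
    return "Rp" + f"{amount:,}".replace(",", ".")
```

In practice you would normally read `priceInt` directly; a parser like this is only useful as a consistency check or when working with price strings from another source.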

Pricing

This actor is pay-per-event. You are billed only for what you get:

| Event | Price | When |
| --- | --- | --- |
| Actor start | $0.00005 | Once per run (scaled by memory) |
| Listing product | $0.003 | Each product from a category or shop page |
| Product detail | $0.005 | Each product from a product-detail (PDP) URL |

A 100-product category pull costs about $0.30 (100 × $0.003, plus the one-time start event). The actor enforces your maxResults cap and stops charging once it's reached. Pay-Per-Usage (compute + proxy) is also available for very large jobs.
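
To budget a run, the event prices above can be combined into a simple estimate. This Python sketch assumes the start event is charged once at the listed price (the table notes it scales with memory, so the real figure may differ slightly); the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Event prices copied from the pricing table, in USD.
ACTOR_START = Decimal("0.00005")
LISTING_PRODUCT = Decimal("0.003")
PRODUCT_DETAIL = Decimal("0.005")


def estimate_cost(listing_products: int = 0, detail_products: int = 0) -> Decimal:
    """Estimate the pay-per-event cost of one run in USD.

    Assumes a single start event at the base price; memory scaling
    of the start event is not modelled.
    """
    return (
        ACTOR_START
        + listing_products * LISTING_PRODUCT
        + detail_products * PRODUCT_DETAIL
    )


print(estimate_cost(listing_products=100))   # 100 listing products
print(estimate_cost(detail_products=1000))   # 1,000 product-detail records
```

Decimal is used instead of float so that small per-event prices add up without rounding noise.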

Input

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| startUrls | array of strings | none (required) | Tokopedia product, category (/p/...), or shop URLs |
| maxResults | integer | 100 | Hard cap on products returned across all URLs |
| maxPagesPerList | integer | 5 | Pagination depth per category/shop URL (~80 products/page) |
| proxyConfiguration | object | Apify datacenter | Proxy; default is sufficient |

Example input

{
  "startUrls": [
    "https://www.tokopedia.com/p/handphone-tablet/handphone",
    "https://www.tokopedia.com/cuan-gadgetid/product"
  ],
  "maxResults": 200,
  "maxPagesPerList": 5
}
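
If you build the input programmatically, a small helper can apply the documented defaults and show how maxResults and maxPagesPerList interact. The Python sketch below is illustrative only: `build_run_input` and `max_listing_products` are hypothetical names, and the figure of 80 products per page is the approximate value quoted in this document, not a guarantee.

```python
import json

PRODUCTS_PER_PAGE = 80  # approximate figure quoted in this document


def build_run_input(start_urls, max_results=100, max_pages_per_list=5) -> dict:
    """Build an input object using the field names and defaults from the table."""
    if not start_urls:
        raise ValueError("startUrls is required")
    if max_results < 1 or max_pages_per_list < 1:
        raise ValueError("maxResults and maxPagesPerList must be positive")
    return {
        "startUrls": list(start_urls),
        "maxResults": max_results,
        "maxPagesPerList": max_pages_per_list,
    }


def max_listing_products(run_input: dict) -> int:
    """Rough upper bound on returned products if every URL is a listing.

    The result is whichever limit is reached first: the maxResults cap
    or the pagination depth across all URLs.
    """
    reachable = (
        len(run_input["startUrls"])
        * run_input["maxPagesPerList"]
        * PRODUCTS_PER_PAGE
    )
    return min(run_input["maxResults"], reachable)


run_input = build_run_input(
    [
        "https://www.tokopedia.com/p/handphone-tablet/handphone",
        "https://www.tokopedia.com/cuan-gadgetid/product",
    ],
    max_results=200,
)
print(json.dumps(run_input, indent=2))
```

With the example values, pagination could reach roughly 800 products, so the maxResults cap of 200 is the binding limit.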

How it works

Tokopedia protects its edge with TLS/HTTP-2 fingerprinting that blocks ordinary HTTP clients. This actor uses impit's Chrome-TLS impersonation to get past that block from a plain datacenter proxy, with no headless browser and no residential IPs required. It then parses the product data Tokopedia embeds in each server-rendered page (its Apollo window.__cache). That keeps runs fast and cheap while staying reliable.
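
The parsing step can be pictured with a short Python sketch that pulls an embedded JSON object out of already-fetched HTML. This is an assumption-heavy illustration, not the actor's parser: it assumes the page assigns a plain JSON object literal to `window.__cache`, which this document does not confirm, and the function name `extract_embedded_cache` is hypothetical. Fetching the page through the TLS-impersonating client is out of scope here.

```python
import json


def extract_embedded_cache(html: str, marker: str = "window.__cache") -> dict:
    """Return the JSON object that follows `marker` in an HTML string.

    Assumes the object is written as a plain JSON literal right after
    the marker; a page that wraps it differently would need other handling.
    """
    start = html.find(marker)
    if start == -1:
        raise ValueError(f"{marker} not found")
    brace = html.find("{", start)
    if brace == -1:
        raise ValueError("no JSON object after marker")
    # raw_decode parses one JSON value starting at `brace` and ignores
    # whatever follows it (for example ';</script>').
    obj, _end = json.JSONDecoder().raw_decode(html, brace)
    return obj


sample_html = (
    '<html><script>window.__cache='
    '{"Product:1": {"name": "A", "price": 1000}};</script></html>'
)
cache = extract_embedded_cache(sample_html)
```

Using `raw_decode` avoids fragile regular expressions for finding the closing brace, since the JSON decoder itself determines where the object ends.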

The actor is resilient by design:

  • It auto-paginates listings.
  • It de-duplicates products across pages, so you are never charged twice for the same item.
  • It retries on transient blocks with a fresh session.
  • It fails honestly: a fully blocked run is reported as failed, not as a silent empty success.
  • It prints the run's cost cap up front.
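
The interaction between cross-page de-duplication and the maxResults cap can be sketched in a few lines of Python. This is a hedged illustration of the idea rather than the actor's implementation: the function name `dedupe_and_cap` is hypothetical, and it assumes `productId` is the de-duplication key.

```python
from typing import Iterable, Iterator


def dedupe_and_cap(pages: Iterable[list[dict]], max_results: int) -> Iterator[dict]:
    """Yield each product once, stopping after max_results products.

    Assumes `productId` uniquely identifies a product across pages.
    """
    seen: set[str] = set()
    emitted = 0
    for page in pages:
        for product in page:
            if emitted >= max_results:
                return  # cap reached: nothing further is emitted
            product_id = product["productId"]
            if product_id in seen:
                continue  # already emitted from an earlier page
            seen.add(product_id)
            emitted += 1
            yield product


pages = [
    [{"productId": "1"}, {"productId": "2"}],
    [{"productId": "2"}, {"productId": "3"}, {"productId": "4"}],
]
unique_ids = [p["productId"] for p in dedupe_and_cap(pages, max_results=3)]
```

Because the function is a generator over pages, a caller that fetches pages lazily would stop requesting new pages as soon as the cap is reached.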

Use with AI agents (MCP)

This actor is MCP-ready and exposed through the Apify MCP server as apify--tokopedia-scraper. Agents pass a Tokopedia URL and receive flat, token-efficient JSON. Input descriptions are written for routing ("use this when…"), and errors return actionable guidance.

FAQ

Can it search by keyword? No — Tokopedia serves /search?q= from a bot-protected API. Use category (/p/...) or shop URLs, which are server-rendered.

Do I need an Indonesia proxy? No. The default Apify datacenter proxy works; the Chrome-TLS impersonation does the heavy lifting.

Does it need login or cookies? No. It only reads public, server-rendered pages.

What if a product was removed? Dead/removed URLs are skipped with a warning; the run still succeeds for the rest.

How many products per listing page? About 80. Increase maxPagesPerList to go deeper.

This actor collects only publicly available data from Tokopedia pages. You are responsible for complying with Tokopedia's Terms of Service, applicable laws (including Indonesian data-protection law), and for not collecting personal data without a lawful basis. Use the data for legitimate purposes such as price monitoring and market research. This actor is not affiliated with, endorsed by, or sponsored by Tokopedia.

Support

Found a bug or want a new field? Open an issue on the actor's Issues tab.