Taobao Product Scraper avatar

Taobao Product Scraper

Pricing

$20.00/month + usage

Go to Apify Store
Taobao Product Scraper

Taobao Product Scraper

Extract detailed product data from Taobao, including title, price, reviews, specifications, images, options, and categories. Perfect for market research, e-commerce insights, and data-driven decision-making

Pricing

$20.00/month + usage

Rating

0.0

(0)

Developer

Pizani

Pizani

Maintained by Community

Actor stats

2

Bookmarked

84

Total users

1

Monthly active users

3 days ago

Last modified

Categories

Share

πŸ›’ Taobao Product Scraper

This Actor extracts product data from Taobao using Playwright and returns a ready-to-use JSON.


✨ What it does

  • Accepts a Taobao product_url.
  • Extracts title, price, images, and variants.
  • Collects store data (title, ID, and link).
  • Returns everything as JSON in the Key-Value Store (OUTPUT).

πŸ”§ Input

Example Actor input:

{
"product_url": "https://item.taobao.com/item.htm?id=624164131692"
}

🧾 Example Output

{
"sellerInfo": {
"shopTitle": "Neverislang.",
"shopID": "1822521250",
"shopLink": "https://shop1822521250.taobao.com"
},
"productInfo": {
"urlproduct": "https://item.taobao.com/item.htm?id=624164131692",
"title": "Authentic kaws puzzle high transparent acrylic display ...",
"price": "95.00",
"imgList": [
"https://img.alicdn.com/imgextra/i1/1822521250/O1CN01GCWTQ11L6Xh9zvtAQ_!!1822521250.jpg",
"... more images ..."
],
"atributtes": {
"Brand": "Neverislang.",
"Material": "Acrylic",
"Style": "simple and modern"
},
"options": [
{
"name": "Other sizes",
"imgUrl": "https://img.alicdn.com/imgextra/i1/1822521250/O1CN01GCWTQ11L6Xh9zvtAQ_!!1822521250.jpg",
"price": "95.00"
}
]
}
}

🧩 Field Reference

  • sellerInfo: Store information.
  • sellerInfo.shopTitle: Store name.
  • sellerInfo.shopID: Store identifier on Taobao.
  • sellerInfo.shopLink: Store URL.
  • productInfo: Product information.
  • productInfo.urlproduct: Product URL.
  • productInfo.title: Product title.
  • productInfo.price: Base price Yuan.
  • productInfo.imgList: List of image URLs.
  • productInfo.atributtes: Map of product attributes (key/value).
  • productInfo.options: List of variants.
  • productInfo.options[].name: Variant name/label.
  • productInfo.options[].imgUrl: Variant image URL.
  • productInfo.options[].price: Variant price Yuan.

πŸ› οΈ How to use via API

  1. Set your credentials: add API_TOKEN and ACTOR_ID.
  2. Provide the product: update PRODUCT_URL.
  3. Run the script: it starts the Actor and saves the JSON locally.
import os
import time
import json
import requests
API_TOKEN = "YOUR_APIFY_API_KEY"
ACTOR_ID = "pizani~taobao-product-scraper"
PRODUCT_URL = "https://item.taobao.com/item.htm?id=624164131692"
output_dir = "./scraped_results"
os.makedirs(output_dir, exist_ok=True)
output_file = os.path.join(output_dir, "product_data.json")
run_resp = requests.post(
f"https://api.apify.com/v2/acts/{ACTOR_ID}/runs?token={API_TOKEN}",
json={"product_url": PRODUCT_URL}
)
run_data = run_resp.json()
run_id = run_data.get("data", {}).get("id")
if not run_id:
raise Exception(f"Failed to start Actor: {run_data}")
status_url = f"https://api.apify.com/v2/actor-runs/{run_id}?token={API_TOKEN}"
while True:
status_resp = requests.get(status_url).json()
status = status_resp.get("data", {}).get("status")
if status in {"SUCCEEDED", "FAILED", "ABORTED", "TIMED-OUT"}:
break
time.sleep(5)
item_url = (
"https://api.apify.com/v2/key-value-stores/"
f"{status_resp['data']['defaultKeyValueStoreId']}"
"/records/OUTPUT?disableRedirect=true&token="
f"{API_TOKEN}"
)
result = requests.get(item_url).json()
with open(output_file, "w", encoding="utf-8") as f:
json.dump(result, f, ensure_ascii=False, indent=4)

ℹ️ Notes

  • Fields can vary depending on the product and site availability.