Alibaba Scraper
Pricing
from $5.00 / 1,000 results
Developer
ScraperX
Alibaba Scraper
Alibaba Scraper collects structured product listings from public Alibaba trade/search pages. It reads the embedded window.__page__data_sse*._offer_list JSON to extract titles, prices, supplier info, images, and more. It accepts bulk URLs and streams results to a live dataset, which suits marketers, data analysts, and researchers who need cataloging, price checks, and supplier research at scale.
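As a rough illustration of the embedded-JSON approach described above, the sketch below pulls every value assigned to window.__page__data_sse*._offer_list out of an HTML string. It is not the actor's actual code: the assignment form matched by the regular expression, the function name extract_offer_lists, and the sample payload are all assumptions made for the example, and the real page markup may differ.

```python
import json
import re

# Assumed assignment form: window.__page__data_sse<suffix>._offer_list = {...}
# The real page markup may differ; adjust the pattern after inspecting a page.
_ASSIGN = re.compile(r"window\.__page__data_sse\w*\._offer_list\s*=\s*")


def extract_offer_lists(html: str) -> list:
    """Return every JSON value assigned to window.__page__data_sse*._offer_list."""
    decoder = json.JSONDecoder()
    found = []
    for match in _ASSIGN.finditer(html):
        try:
            # raw_decode parses one JSON value starting at the given index
            # and ignores whatever script text follows it.
            value, _end = decoder.raw_decode(html, match.end())
        except json.JSONDecodeError:
            continue  # not valid JSON at this position; skip it
        found.append(value)
    return found


# Made-up sample; the inner "offers" key is hypothetical.
sample = (
    "<script>window.__page__data_sse10._offer_list = "
    '{"offers": [{"title": "Jacket", "price": "$12.50 - $18.90"}]};</script>'
)
offer_lists = extract_offer_lists(sample)
```

Using json.JSONDecoder.raw_decode avoids having to find the closing brace with a regular expression, which tends to break on nested objects.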
What data / output can you get?
This actor outputs clean, structured product rows to your dataset as each page completes. The fields come directly from Alibaba’s offer list JSON and include product, pricing, and supplier metadata.
| Data type | Description | Example value |
|---|---|---|
| title | Product title shown in results | "Men’s Waterproof Hiking Jacket" |
| productUrl | Direct link to the product page | "https://www.alibaba.com/product-detail/1600981574830.html" |
| price | Displayed price or price range | "$12.50 - $18.90" |
| companyName | Supplier/company name | "Guangzhou Outdoor Gear Co., Ltd." |
| countryCode | Supplier country code | "CN" |
| mainImage | Main result image URL | "https://s.alicdn.com/abc/main.jpg" |
| multiImage | Array of additional image URLs | ["https://s.alicdn.com/abc/1.jpg","https://s.alicdn.com/abc/2.jpg"] |
| reviewScore | Average rating score | "4.8" |
| reviewCount | Number of reviews | "125" |
| soldOrder | Orders sold count | "560" |
| badges | Seller or listing badges | ["Trade Assurance","Verified"] |
| loopSellingPoints | Selling point highlights | ["Waterproof","Breathable"] |
You can export results to JSON, CSV, or Excel directly from the Dataset tab in Apify.
Key features
Extract Alibaba product data without manual copy-paste. Results stream into your dataset as each page completes, using browser-like HTTP requests and bounded concurrency.
| Feature | Description |
|---|---|
| 🚀 Bulk URL processing | Add many Alibaba search or category URLs and process them in one run. |
| 📈 Max Items control | Set how many result pages to open per URL via maxItems (1–5000). Legacy maxPages is supported. |
| 📡 Live dataset streaming | Products are pushed to the dataset as soon as each page finishes loading. |
| 🧭 Browser-like requests | Uses curl_cffi with Chrome impersonation for stable, browser-like HTTP behavior. |
| 🛡 Optional Apify Proxy | Enable proxyConfiguration.useApifyProxy for a managed fallback when needed. |
| 🧩 Robust JSON parsing | Reads the embedded window.__page__data_sse*._offer_list JSON for structured, consistent results. |
| ⚙️ Sensible concurrency | Runs multiple page loads (up to 8 in parallel) for balanced speed and stability. |
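The pagination and concurrency behavior listed in the table can be sketched roughly as follows. This is an illustration under stated assumptions, not the actor's implementation: it assumes result pages are selected with a page query parameter (as in the example URLs in this README), the helper names page_urls, fetch_with_chrome, and fetch_all are hypothetical, and the "chrome" impersonation alias is assumed to be available in the installed curl_cffi version.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

MAX_WORKERS = 8  # mirrors the "up to 8 in parallel" figure in the table


def page_urls(search_url: str, pages: int) -> list[str]:
    """Return search_url rewritten for page=1..pages (assumes a `page` parameter)."""
    parts = urlsplit(search_url)
    # Drop any existing page parameter so it is not duplicated.
    query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True) if k != "page"]
    return [
        urlunsplit(parts._replace(query=urlencode(query + [("page", str(n))])))
        for n in range(1, pages + 1)
    ]


def fetch_with_chrome(url: str) -> str:
    """Fetch one page with curl_cffi's Chrome impersonation (third-party package)."""
    from curl_cffi import requests  # lazy import keeps the other helpers stdlib-only

    return requests.get(url, impersonate="chrome", timeout=30).text


def fetch_all(urls, fetch=fetch_with_chrome, workers=MAX_WORKERS):
    """Fetch urls concurrently and return the bodies in input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(fetch, urls))
```

Passing the fetch function as a parameter makes it easy to swap in a stub for testing or a proxy-aware fetcher later.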
How to use Alibaba Scraper - step by step
- Sign in to Apify and open the Alibaba Scraper actor.
- Paste one or more Alibaba search or category URLs into the Page URLs (bulk) field (string list). You can add plain strings or objects with a url key.
- Set Max Items to control how many result pages to open per URL (default 10, up to 5000). Start small (e.g., 1–3) to preview.
- Optionally open Proxy configuration and enable useApifyProxy if your network conditions require a managed proxy fallback.
- Click Start. The run log will show progress; items are pushed to the dataset live as each page completes.
- Monitor the Dataset tab to see products populate in real time.
- Export your results in JSON, CSV, or Excel for analysis or downstream use.
Pro Tip: To automate at scale, trigger this actor programmatically (for example via the Apify API) and feed the dataset into your analytics stack for ongoing price monitoring or catalog updates.
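One way to trigger a run from code is the Apify API's synchronous "run actor and get dataset items" endpoint. The sketch below uses only the Python standard library. The actor ID is a placeholder (this README does not state it; Apify uses the form username~actor-name in API paths), the token is your own, and the helper names build_run_request and run_actor are hypothetical.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build a POST request that runs the actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(actor_id: str, token: str, run_input: dict, timeout: float = 300):
    """Send the request and return the parsed list of dataset items."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Example input matching the parameters documented in this README.
example_input = {
    "urls": ["https://www.alibaba.com/trade/search?keywords=jacket&page=1"],
    "maxItems": 1,
    "proxyConfiguration": {"useApifyProxy": False},
}
# items = run_actor("<username>~<actor-name>", "<APIFY_TOKEN>", example_input)
```

The synchronous endpoint is best suited to short runs; for large jobs, starting the run asynchronously and reading the dataset afterwards is likely the safer pattern.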
Use cases
| Use case name | Description |
|---|---|
| Supplier discovery for sourcing | Aggregate suppliers across multiple Alibaba searches to shortlist vendors faster with companyName, countryCode, and badges. |
| Price monitoring for category trends | Track price ranges (price) across pages to spot shifts, compare listings, and inform purchasing decisions. |
| Product catalog building for e‑commerce | Collect titles, images, and product links (title, mainImage, productUrl) to seed catalogs and product research. |
| Market research & benchmarking | Analyze reviewScore, reviewCount, and soldOrder to benchmark demand and quality across similar products. |
| Bulk data ingestion for analytics | Use bulk URLs and live dataset streaming to build datasets for dashboards and BI without manual scraping. |
| Data enrichment for internal tools | Combine productUrl and supplierHref with your systems for enrichment or lead qualification workflows. |
| Academic or non‑profit research | Export structured Alibaba product data for studies on market availability, pricing, and supply chains. |
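As a small illustration of the supplier discovery use case, the sketch below counts listings per supplier from exported dataset items, optionally filtering by countryCode and badges. The function name shortlist_suppliers and the sample rows are made up for the example; only the field names come from the output format documented in this README.

```python
from collections import Counter


def shortlist_suppliers(items, required_badge=None, country=None):
    """Return (companyName, listing_count) pairs, most listings first."""
    counts = Counter()
    for item in items:
        if country and item.get("countryCode") != country:
            continue  # supplier is outside the requested country
        if required_badge and required_badge not in item.get("badges", []):
            continue  # listing lacks the requested badge
        name = item.get("companyName")
        if name:
            counts[name] += 1
    return counts.most_common()


# Made-up rows shaped like the dataset items.
rows = [
    {"companyName": "A Co.", "countryCode": "CN", "badges": ["Verified"]},
    {"companyName": "A Co.", "countryCode": "CN", "badges": ["Verified", "Trade Assurance"]},
    {"companyName": "B Ltd.", "countryCode": "VN", "badges": []},
]
shortlist = shortlist_suppliers(rows, required_badge="Verified", country="CN")
```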
Why choose Alibaba Scraper?
Built for precision, automation, and reliability — this Alibaba catalog scraper focuses on structured output and smooth bulk runs.
- ✅ Accurate, structured output from Alibaba’s embedded _offer_list JSON
- 🌍 Works on public pages without login, ideal for global research
- ⚡ Scales across many URLs with live result streaming to your dataset
- 💻 Developer-friendly: predictable fields for easy pipelines and data joins
- 🔒 Ethical-by-design: targets public listing data only
- 💰 Cost visibility: pay-per-result via charged event row_result
- 🔗 Easy exports: download JSON, CSV, or Excel from the dataset
In short, it is a purpose-built Alibaba product scraper for consistent data extraction and automation, and a more stable alternative to browser extensions.
Is it legal / ethical to use Alibaba Scraper?
Yes — when used responsibly. This actor collects data from publicly visible Alibaba listing pages only and does not access private or authenticated content.
Guidelines for compliant use:
- Scrape public information only and avoid personal data.
- Respect platform terms and applicable laws (e.g., GDPR, CCPA).
- Use proxy settings responsibly if required by your environment.
- Validate your use case with your legal team for edge cases.
Input parameters & output format
Example JSON input
{"urls": [{ "url": "https://www.alibaba.com/trade/search?keywords=jacket&page=1" },"https://www.alibaba.com/trade/search?keywords=backpack&page=1"],"maxItems": 3,"proxyConfiguration": { "useApifyProxy": false }}
- Plain strings in urls are supported, as well as objects with a url property.
- maxItems controls how many result pages per URL are fetched (alias: maxPages).
Parameters
| Field | Required | Description |
|---|---|---|
| urls | Yes | List of Alibaba URLs to process. Accepts plain strings or objects like { "url": "https://..." }. One URL per line. |
| maxItems | No | How many result pages to open per URL. Default 10, min 1, max 5000. Legacy alias maxPages is also accepted. |
| proxyConfiguration | No | Optional Apify proxy settings. Set useApifyProxy to true to enable managed proxy fallback if needed. |
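The parameter rules in the table can be expressed as a small normalization step. The sketch below is one possible reading, not the actor's code: the function name normalize_input is hypothetical, and clamping out-of-range maxItems values (rather than rejecting them) is an assumption.

```python
def normalize_input(raw: dict) -> dict:
    """Normalize actor input: flatten urls, resolve maxItems/maxPages, read the proxy flag."""
    urls = []
    for entry in raw.get("urls", []):
        # Entries may be plain strings or objects like {"url": "https://..."}.
        url = entry.get("url") if isinstance(entry, dict) else entry
        if isinstance(url, str) and url.strip():
            urls.append(url.strip())
    if not urls:
        raise ValueError("urls is required and must contain at least one URL")

    # maxItems wins over the legacy maxPages alias; default is 10.
    pages = raw.get("maxItems", raw.get("maxPages", 10))
    pages = max(1, min(5000, int(pages)))  # assumption: clamp to the 1-5000 range

    proxy = bool((raw.get("proxyConfiguration") or {}).get("useApifyProxy", False))
    return {"urls": urls, "maxItems": pages, "useApifyProxy": proxy}


normalized = normalize_input({
    "urls": [
        {"url": "https://www.alibaba.com/trade/search?keywords=jacket&page=1"},
        "https://www.alibaba.com/trade/search?keywords=backpack&page=1",
    ],
    "maxItems": 3,
    "proxyConfiguration": {"useApifyProxy": False},
})
```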
Example JSON output
[{"badges": ["Trade Assurance", "Verified"],"certifications": ["CE"],"chatToken": "","companyId": "1234567890","companyLogo": "https://s.alicdn.com/logo/comp.png","companyName": "Guangzhou Outdoor Gear Co., Ltd.","contactSupplier": "https://www.alibaba.com/contact/supplier/abc","countryCode": "CN","customGroup": "","displayStarLevel": "5","eurl": "https://www.alibaba.com/abc/eurl","goldSupplierYears": "6","id": "offer_1600981574830","isShowAd": false,"loopSellingPoints": ["Waterproof", "Breathable"],"lyb": false,"mainImage": "https://s.alicdn.com/images/main.jpg","moq": "2 pieces","moqV2": "2","multiImage": ["https://s.alicdn.com/images/1.jpg","https://s.alicdn.com/images/2.jpg"],"pcLoopSellingPoints": ["In stock", "Fast dispatch"],"price": "$12.50 - $18.90","productId": "1600981574830","productScore": "92","productUrl": "https://www.alibaba.com/product-detail/1600981574830.html","reviewCount": "125","reviewScore": "4.8","shippingScore": "A","showAddToCart": false,"showCrown": false,"soldOrder": "560","supplierHomeHref": "https://guangzhou-gear.en.alibaba.com","supplierHref": "https://www.alibaba.com/supplier/abc","supplierService": "On-time delivery","supplierServiceScore": "A","title": "Men’s Waterproof Hiking Jacket","tmlid": "","trackInfo": ""}]
Each dataset item corresponds to a single product result. Fields may be empty when not present on the page; arrays default to [] and booleans to false.
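Because numeric fields such as price, reviewCount, and soldOrder arrive as strings in the example above, downstream analysis usually needs a conversion step. The sketch below shows one way to do it. The helper names price_range and to_int are hypothetical, and the parsing assumes prices use "." as the decimal separator and "," as a thousands separator, which may not hold for every listing or currency.

```python
import re

# A run of digits with optional thousands separators and decimal part.
_NUMBER = re.compile(r"\d[\d,]*(?:\.\d+)?")


def price_range(price):
    """Return (low, high) parsed from a price string, or None if no number is found."""
    numbers = [float(n.replace(",", "")) for n in _NUMBER.findall(price or "")]
    if not numbers:
        return None
    return min(numbers), max(numbers)


def to_int(value, default=None):
    """Convert a string count such as "560" to int, returning default when empty or invalid."""
    try:
        return int(str(value).replace(",", ""))
    except ValueError:
        return default


item = {"price": "$12.50 - $18.90", "reviewCount": "125", "soldOrder": "560"}
low, high = price_range(item["price"])
reviews = to_int(item["reviewCount"])
```

A single displayed price yields the same value for both ends of the range, so callers can treat ranges and single prices uniformly.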
FAQ
Do I need to log in or add cookies to scrape?
No. The actor fetches public Alibaba listing pages without login. It uses browser-like HTTP requests with Chrome impersonation to read the embedded offer list JSON.
How much does it cost to run?
This actor uses pay-per-event pricing: a small start fee per run plus $0.005 per result (charged event name row_result), which works out to $5.00 per 1,000 results. See the run details on Apify for your exact totals.
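A quick way to budget a run is to multiply the expected number of results by the per-result price. The sketch below does this with Decimal to avoid floating-point rounding. The start fee amount is not stated in this README, so it is a parameter that defaults to zero, and the function name estimate_cost is hypothetical.

```python
from decimal import Decimal

PRICE_PER_RESULT = Decimal("0.005")  # USD per result, as stated above


def estimate_cost(results: int, start_fee: Decimal = Decimal("0")) -> Decimal:
    """Estimate run cost in USD; start_fee is unknown here and must be supplied."""
    return start_fee + PRICE_PER_RESULT * results


cost_for_thousand = estimate_cost(1000)
```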
How many pages can I scrape per URL?
You can set maxItems from 1 up to 5000 pages per URL. Start low to validate output, then scale as needed. A legacy alias maxPages is also supported.
Can I run it on many URLs at once?
Yes. Paste multiple Alibaba search or category URLs in the urls input. The actor will process them with sensible parallelism and push results live to the dataset.
What fields does it extract?
It captures product and supplier metadata from Alibaba’s _offer_list JSON, including title, productUrl, price, companyName, countryCode, mainImage, multiImage, reviewScore, reviewCount, soldOrder, badges, and more as shown in the output example.
Can I use it from Python?
Yes. You can trigger this Apify actor via the Apify API from Python or any language, then download the dataset as JSON, CSV, or Excel for downstream processing.
Is this an Alibaba scraper Chrome extension?
No. This is a cloud-based Apify actor. It runs on Apify's servers using HTTP requests rather than in your browser, streams results to a dataset, and avoids the instability of manual browser extensions.
Can I use proxies?
Yes. Proxies are optional. Set proxyConfiguration.useApifyProxy to true to enable Apify’s managed proxy fallback when needed for more stable access.
Final thoughts
Alibaba Scraper is built for fast, structured Alibaba product data extraction at scale. With bulk URL input, maxItems control up to 5000 pages per URL, and live dataset streaming, it’s ideal for marketers, researchers, and analysts who need reliable cataloging, price tracking, and supplier discovery. Developers can integrate runs and exports into pipelines programmatically for automation. Start extracting cleaner, structured product data — and turn Alibaba’s public listings into actionable insights.