# Global Sources Supplier Scraper (`xtracto/globalsources-supplier-scraper`) Actor

Search Global Sources by keyword and extract verified supplier and product listings — company name, product title, price, MOQ, location, image, and profile URL. Requires a residential proxy.

- **URL**: https://apify.com/xtracto/globalsources-supplier-scraper.md
- **Developed by:** [Farhan Febrian Nauval](https://apify.com/xtracto) (community)
- **Categories:** E-commerce
- **Stats:** 2 total users, 1 monthly users, 0.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $10.00 / 1,000 results

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Global Sources Supplier Scraper

Search **Global Sources** by product keyword and pull verified B2B supplier and product listings — company name, product title, price, minimum order quantity (MOQ), location, image, and profile URL — as clean, structured rows.

### Why use this actor

- **Sourcing-grade supplier data** — company name, location, and the products they sell, ready for lead lists and supplier shortlists.
- **Product tiles with price and MOQ** — see the asking price and minimum order quantity per product, not just a name.
- **Keyword in, structured rows out** — give it product keywords; get back PRODUCT and SUPPLIER records you can sort and filter.
- **Loads each page fully like a real visitor** — so you get the complete listing, not a stripped-down preview.
- **Stable JSON output** — the same field names every run, ready for spreadsheets, databases, or pipelines.
- **No account, no API key** — works straight from public search pages.

### How it works

1. You provide one or more product keywords.
2. For each keyword, the actor opens the Global Sources supplier/product search and loads the page fully like a real visitor.
3. Each product tile becomes a `PRODUCT` record and each supplier becomes a `SUPPLIER` record, with name, price/MOQ where shown, location, image, and URL.
4. Everything lands in your dataset, exportable as JSON, CSV, or Excel.

You don't need to manage any browsers or scrapers.

> **Residential proxy required.** Global Sources only serves its pages to a residential visitor. You **must** run this actor with the Apify Proxy **RESIDENTIAL** group (it is prefilled in the input). Because each page is loaded fully like a real visitor, this actor is more compute-intensive than a simple HTML scraper.

### Input

```json
{
    "queries": [
        "wireless earbuds",
        "solar panel"
    ],
    "maxItems": 50,
    "maxRequestRetries": 3,
    "proxyConfiguration": { "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] }
}
````

| Field | Type | Description |
| --- | --- | --- |
| `queries` | array | **Required.** Product keywords to search for (one per entry). Each keyword returns its own supplier and product rows. |
| `maxItems` | integer | Maximum supplier/product rows to keep per keyword (`1`–`500`). Default `50`. |
| `maxRequestRetries` | integer | How many times to reload a page that hasn't finished loading before moving on (`1`–`10`). Default `3`. |
| `proxyConfiguration` | object | **Required — use the RESIDENTIAL group.** Global Sources only serves its pages to a residential visitor; without a residential proxy each keyword returns a block notice instead of data. |

### Output

Per product (`recordType: "PRODUCT"`):

```json
{
    "recordType": "PRODUCT",
    "_input": "wireless earbuds",
    "productTitle": "TWS Wireless Earbuds Bluetooth 5.3 Noise Cancelling",
    "price": "US$3.50",
    "currency": "USD",
    "moq": "100 pieces",
    "company": "Shenzhen Audio Technology Co., Ltd.",
    "location": "Shenzhen, CN",
    "image": "https://www.globalsources.com/.../earbuds.jpg",
    "url": "https://www.globalsources.com/product/1234567.html",
    "sourceUrl": "https://www.globalsources.com/searchList/products?query=wireless%20earbuds",
    "scrapedAt": "2026-06-10T18:06:28Z"
}
```

Per supplier (`recordType: "SUPPLIER"`):

```json
{
    "recordType": "SUPPLIER",
    "_input": "wireless earbuds",
    "company": "Guangzhou Sound Industrial Ltd.",
    "productTitle": null,
    "price": null,
    "currency": null,
    "moq": null,
    "location": "Guangzhou, CN",
    "image": "https://www.globalsources.com/.../logo.jpg",
    "url": "https://www.globalsources.com/supplier/9990001.html",
    "sourceUrl": "https://www.globalsources.com/searchList/suppliers?query=wireless%20earbuds",
    "scrapedAt": "2026-06-10T18:06:28Z"
}
```

| Field | Type | Description |
| --- | --- | --- |
| `recordType` | string | `"PRODUCT"` for a product tile, `"SUPPLIER"` for a company. |
| `_input` | string | The keyword that produced this record. |
| `productTitle` | string | Product name (PRODUCT records). |
| `company` | string | Supplier / manufacturer company name. |
| `price` | string | Asking price as shown, when available. |
| `currency` | string | ISO currency of the price, when available. |
| `moq` | string | Minimum order quantity, when shown. |
| `location` | string | Supplier location (city / country) when shown. |
| `image` | string | Primary image or logo URL. |
| `url` | string | Product or supplier profile URL. |
| `sourceUrl` | string | The search page the record came from. |
| `scrapedAt` | string | ISO 8601 timestamp of collection. |

If a keyword's page cannot be loaded (for example when run without a residential proxy), the actor emits a clearly-marked diagnostic record (`recordType: "DIAGNOSTIC"`) with an `error` and `message` explaining why, instead of silently skipping — so you always know what happened.

### Other Global Sources Scrapers

| Actor | Description |
| --- | --- |
| [Global Sources Supplier Scraper](https://apify.com/globalsources-supplier-scraper) | Supplier and product listings by keyword: company, price, MOQ, location. |

# Actor input Schema

## `queries` (type: `array`):

Product keywords to search Global Sources for (one per array entry). Each keyword returns its own set of supplier and product listings.

## `maxItems` (type: `integer`):

Maximum number of supplier/product rows to keep per keyword. Default 50.

## `maxRequestRetries` (type: `integer`):

How many times to reload a page that has not finished loading before moving on. Default 3.

## `proxyConfiguration` (type: `object`):

REQUIRED. Global Sources will only load its pages for a residential visitor, so you MUST use the Apify Proxy RESIDENTIAL group (prefilled). Without a residential proxy every keyword returns a block diagnostic instead of data.

## Actor input object example

```json
{
  "queries": [
    "wireless earbuds"
  ],
  "maxItems": 50,
  "maxRequestRetries": 3,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "queries": [
        "wireless earbuds"
    ],
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ]
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("xtracto/globalsources-supplier-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "queries": ["wireless earbuds"],
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}

# Run the Actor and wait for it to finish
run = client.actor("xtracto/globalsources-supplier-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "queries": [
    "wireless earbuds"
  ],
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}' |
apify call xtracto/globalsources-supplier-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=xtracto/globalsources-supplier-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Global Sources Supplier Scraper",
        "description": "Search Global Sources by keyword and extract verified supplier and product listings — company name, product title, price, MOQ, location, image, and profile URL. Requires a residential proxy.",
        "version": "1.0",
        "x-build-id": "x4JHsKQevE1xCXM6b"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/xtracto~globalsources-supplier-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-xtracto-globalsources-supplier-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/xtracto~globalsources-supplier-scraper/runs": {
            "post": {
                "operationId": "runs-sync-xtracto-globalsources-supplier-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/xtracto~globalsources-supplier-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-xtracto-globalsources-supplier-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "queries"
                ],
                "properties": {
                    "queries": {
                        "title": "Search keywords",
                        "type": "array",
                        "description": "Product keywords to search Global Sources for (one per array entry). Each keyword returns its own set of supplier and product listings.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxItems": {
                        "title": "Max items per keyword",
                        "minimum": 1,
                        "maximum": 500,
                        "type": "integer",
                        "description": "Maximum number of supplier/product rows to keep per keyword. Default 50.",
                        "default": 50
                    },
                    "maxRequestRetries": {
                        "title": "Max retries",
                        "minimum": 1,
                        "maximum": 10,
                        "type": "integer",
                        "description": "How many times to reload a page that has not finished loading before moving on. Default 3.",
                        "default": 3
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "REQUIRED. Global Sources will only load its pages for a residential visitor, so you MUST use the Apify Proxy RESIDENTIAL group (prefilled). Without a residential proxy every keyword returns a block diagnostic instead of data."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
