# ImportYeti Trade Data Scraper (`lulzasaur/importyeti-scraper`) Actor

Scrape US import/export trade data from ImportYeti. Search companies by name or product, get supplier profiles with shipment history, HS codes, and trade volumes.

- **URL**: https://apify.com/lulzasaur/importyeti-scraper.md
- **Developed by:** [lulz bot](https://apify.com/lulzasaur) (community)
- **Categories:** E-commerce
- **Stats:** 3 total users, 2 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## ImportYeti Scraper

Scrape US import/export trade data from ImportYeti. Search for companies, view detailed company profiles with supplier and buyer relationships, and extract shipment records including HS codes, ports, carriers, and product descriptions.

### Features

- **Company search** -- Find importers and exporters by keyword, product, or HS code
- **Company profiles** -- Get detailed profiles including address, website, total shipments, top products, suppliers, and buyers
- **Shipment records** -- Extract individual shipment data with HS codes, weights, ports, and bill of lading numbers
- **Supplier/buyer mapping** -- Discover trade relationships between companies
- **Country filtering** -- Filter results by supplier country of origin
- **HS code filtering** -- Target specific product categories by Harmonized System code
- **JSON API + HTML fallback** -- Uses ImportYeti's internal API when available, falls back to HTML scraping
- **Pagination support** -- Automatically follows paginated search results and shipment pages

### Input Parameters

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| `mode` | string | `"search"` | Mode: `search`, `company`, or `shipments` |
| `searchQuery` | string | `""` | Search term (company name, product keyword, or general term) |
| `companyName` | string | `""` | Company name or URL slug for `company`/`shipments` mode |
| `hsCode` | string | `""` | Filter by HS code for specific product categories |
| `country` | string | `""` | Filter by supplier country (e.g., "China", "Vietnam") |
| `limit` | integer | `50` | Maximum number of results (1-500) |
| `maxPages` | integer | `5` | Maximum pages to scrape (1-50) |
| `proxyConfiguration` | object | -- | Proxy settings (recommended to avoid rate limits) |

### Example Inputs

**Search for electronics importers:**
```json
{
  "mode": "search",
  "searchQuery": "electronics",
  "country": "China",
  "limit": 50
}
````

**Get a company profile:**

```json
{
  "mode": "company",
  "companyName": "nike"
}
```

**Get shipment records for a company:**

```json
{
  "mode": "shipments",
  "companyName": "apple-inc",
  "hsCode": "8471",
  "limit": 100
}
```

### Output Fields

#### Search Results

| Field | Type | Description |
|-------|------|-------------|
| `companyName` | string | Company name |
| `slug` | string | URL slug on ImportYeti |
| `address` | string | Company address (if available) |
| `totalShipments` | number | Total shipment count |
| `country` | string | Country of origin |
| `url` | string | ImportYeti profile URL |

#### Company Profile

| Field | Type | Description |
|-------|------|-------------|
| `companyName` | string | Company name |
| `address` | string | Company address |
| `website` | string | Company website |
| `phone` | string | Phone number |
| `country` | string | Country |
| `totalShipments` | number | Total sea shipments recorded |
| `alsoKnownAs` | array | Alternate company names |
| `topProducts` | array | Top imported/exported products with HS codes |
| `hsCodes` | array | All associated HS codes |
| `suppliers` | array | Supplier names, countries, and shipment counts |
| `buyers` | array | Buyer/customer names and shipment counts |
| `recentShipments` | array | Recent shipment records |
| `carriers` | array | Shipping carriers used |
| `ports` | array | Ports used |
| `countriesOfOrigin` | array | Countries goods are sourced from |

#### Shipment Records

| Field | Type | Description |
|-------|------|-------------|
| `shipmentDate` | string | Date of shipment |
| `supplierName` | string | Supplier/shipper name |
| `buyerName` | string | Buyer/consignee name |
| `productDescription` | string | Product description |
| `hsCode` | string | HS tariff code |
| `weight` | string | Shipment weight |
| `quantity` | string | Quantity shipped |
| `portOfLading` | string | Origin port |
| `portOfUnlading` | string | Destination port |
| `billOfLading` | string | Bill of lading number |
| `vesselName` | string | Vessel/ship name |
| `country` | string | Country of origin |

### Use Cases

- **Supplier discovery** -- Find overseas suppliers for any product category
- **Competitive intelligence** -- See who your competitors are importing from
- **Supply chain mapping** -- Map the full supplier and buyer network for any company
- **Trade compliance** -- Verify supplier relationships and shipment volumes
- **Sourcing research** -- Compare suppliers by shipment frequency, HS codes, and country of origin
- **Market analysis** -- Analyze import trends by product category or country

### Data Source

Data is sourced from [ImportYeti](https://www.importyeti.com/), which aggregates US customs import records and bill of lading data.

***

### Run on Apify

This scraper runs on the [Apify platform](https://apify.com/?fpr=lulzasaur) -- a full-stack web scraping and automation cloud. Sign up for a free account to get started with 30-day trial of all features.

[Try Apify free ->](https://apify.com/?fpr=lulzasaur)

# Actor input Schema

## `mode` (type: `string`):

Scraping mode: 'search' finds companies by keyword, 'company' gets a full company profile, 'shipments' gets shipment records.

## `searchQuery` (type: `string`):

Search term for finding companies or shipments (company name, product keyword, or general term).

## `companyName` (type: `string`):

Company name or URL slug for 'company' or 'shipments' mode (e.g. 'nike', 'apple-inc'). The slug from the ImportYeti URL.

## `hsCode` (type: `string`):

Filter by Harmonized System (HS) code to find specific product categories.

## `country` (type: `string`):

Filter results by supplier country (e.g. 'China', 'Vietnam', 'India').

## `limit` (type: `integer`):

Maximum number of results to return.

## `maxPages` (type: `integer`):

Maximum number of pages to scrape for search/shipments modes.

## `proxyConfiguration` (type: `object`):

Proxy to use for requests. Recommended for avoiding rate limits.

## Actor input object example

```json
{
  "mode": "search",
  "searchQuery": "",
  "companyName": "",
  "hsCode": "",
  "country": "",
  "limit": 50,
  "maxPages": 5
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {};

// Run the Actor and wait for it to finish
const run = await client.actor("lulzasaur/importyeti-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {}

# Run the Actor and wait for it to finish
run = client.actor("lulzasaur/importyeti-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{}' |
apify call lulzasaur/importyeti-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=lulzasaur/importyeti-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "ImportYeti Trade Data Scraper",
        "description": "Scrape US import/export trade data from ImportYeti. Search companies by name or product, get supplier profiles with shipment history, HS codes, and trade volumes.",
        "version": "1.0",
        "x-build-id": "qACG59gwyhMYgxoy8"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/lulzasaur~importyeti-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-lulzasaur-importyeti-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/lulzasaur~importyeti-scraper/runs": {
            "post": {
                "operationId": "runs-sync-lulzasaur-importyeti-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/lulzasaur~importyeti-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-lulzasaur-importyeti-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "mode": {
                        "title": "Mode",
                        "enum": [
                            "search",
                            "company",
                            "shipments"
                        ],
                        "type": "string",
                        "description": "Scraping mode: 'search' finds companies by keyword, 'company' gets a full company profile, 'shipments' gets shipment records.",
                        "default": "search"
                    },
                    "searchQuery": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Search term for finding companies or shipments (company name, product keyword, or general term).",
                        "default": ""
                    },
                    "companyName": {
                        "title": "Company Name / Slug",
                        "type": "string",
                        "description": "Company name or URL slug for 'company' or 'shipments' mode (e.g. 'nike', 'apple-inc'). The slug from the ImportYeti URL.",
                        "default": ""
                    },
                    "hsCode": {
                        "title": "HS Code",
                        "type": "string",
                        "description": "Filter by Harmonized System (HS) code to find specific product categories.",
                        "default": ""
                    },
                    "country": {
                        "title": "Country Filter",
                        "type": "string",
                        "description": "Filter results by supplier country (e.g. 'China', 'Vietnam', 'India').",
                        "default": ""
                    },
                    "limit": {
                        "title": "Result Limit",
                        "minimum": 1,
                        "maximum": 500,
                        "type": "integer",
                        "description": "Maximum number of results to return.",
                        "default": 50
                    },
                    "maxPages": {
                        "title": "Max Pages",
                        "minimum": 1,
                        "maximum": 50,
                        "type": "integer",
                        "description": "Maximum number of pages to scrape for search/shipments modes.",
                        "default": 5
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Proxy to use for requests. Recommended for avoiding rate limits."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
