# Baidu Images Scraper - Low-cost💲🔥🇨🇳🖼️ (`delectable_incubator/baidu-images-scraper---low-cost`) Actor

Scrape Baidu Images search results easily 🇨🇳🖼️ with a powerful image scraper.

Extract image URLs, thumbnails, titles, dimensions, sizes, and metadata for any keyword.

Ideal for visual research, AI training datasets, trend discovery, and Baidu Images SERP tracking with structured data 📊🚀

- **URL**: https://apify.com/delectable\_incubator/baidu-images-scraper---low-cost.md
- **Developed by:** [Prime Scrape](https://apify.com/delectable_incubator) (community)
- **Categories:** Social media, Videos, SEO tools
- **Stats:** 2 total users, 1 monthly user, 100.0% runs succeeded
- **User rating**: No ratings yet

## Pricing

from $0.00005 / actor start

This Actor is paid per event and usage: you are charged a fixed price for specific events plus Apify platform usage.
Because this Actor supports Apify Store discounts, the price decreases on higher subscription plans.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action that can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server that can be used as a website, an API, or an MCP server.
"Actor" is always written with a capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

<p align="center"> <img src="https://i.ibb.co/jkNS73wX/readme.png" alt="Baidu Images Scraper - PrimeScrape" width="100%"> </p>

---

### Baidu Images Scraper 🌐🇨🇳🖼️🔎 百度图片爬虫

The Baidu Images Scraper is a fast, scalable, and reliable Apify Actor designed to extract structured image search results directly from Baidu Images.

It is built for China market intelligence, visual trend tracking, AI dataset creation, brand monitoring, and competitive image SERP analysis.

---

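### Baidu Images Scraper 🧠📊🚀

This tool automatically scrapes Baidu Images search results for batches of keywords and returns clean, structured image metadata, ready for analysis, automated processing, or dataset building.

It is well suited to researchers, marketers, data engineers, and AI teams who work with visual data.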

---

### 🎯 What This Scraper Does

Simply provide a list of keywords and a limit per keyword — the scraper handles everything automatically.

✅ Scrapes Baidu Images search results at scale

✅ Supports bulk keyword input (multi-search mode)

✅ Extracts structured image metadata

✅ Handles pagination automatically

✅ Stops at defined limits per keyword

✅ Moves seamlessly between keywords

✅ Clean, structured, analysis-ready output
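The behaviour listed above (paging, stopping at the per-keyword limit, then moving on to the next keyword) can be pictured with a small sketch. This is not the Actor's actual implementation: `collect_results`, `PageFetcher`, and `fake_fetch_page` are hypothetical names, and the page fetcher is a stand-in that returns fake data instead of calling Baidu.

```python
from typing import Callable, Iterator, List

# Hypothetical signature: (keyword, page_index) -> result dicts for that page.
PageFetcher = Callable[[str, int], List[dict]]


def collect_results(keywords: List[str], limit: int, fetch_page: PageFetcher) -> Iterator[dict]:
    """Yield up to `limit` results per keyword, paging until the limit or an empty page."""
    for keyword in keywords:
        position = 0
        page = 0
        while position < limit:
            batch = fetch_page(keyword, page)
            if not batch:  # no more results for this keyword
                break
            for item in batch:
                if position >= limit:
                    break
                position += 1
                yield {"keyword": keyword, "position": position, **item}
            page += 1


def fake_fetch_page(keyword: str, page: int) -> List[dict]:
    """Stand-in fetcher: three pages of two fake results each, then nothing."""
    if page >= 3:
        return []
    return [{"title": f"{keyword}-{page}-{i}"} for i in range(2)]


results = list(collect_results(["a", "b"], limit=5, fetch_page=fake_fetch_page))
```

Passing the fetcher in as a parameter keeps the limit and keyword logic testable without any network access.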

---

### 📊 Data Extracted

🧾 Image Fields

| Field               | Description                      |
| ------------------- | -------------------------------- |
| 🔎 `keyword`        | Search keyword used              |
| 🔢 `position`       | Ranking position in Baidu Images |
| 🖼 `title`          | Image title (if available)       |
| 🔗 `imageUrl`       | Direct image URL                 |
| 🖼️ `thumbnailUrl`  | Thumbnail preview URL            |
| 🌐 `sourceUrl`      | Original source page URL         |
| 📏 `width`          | Image width (pixels)             |
| 📐 `height`         | Image height (pixels)            |
| 💾 `fileSize`       | Image file size (if available)   |
| ⏱ `searchTimestamp` | Timestamp of extraction          |


---

### 🛠 How to Use

1️⃣ Configure Input

Provide one or multiple keywords:

```json
{
  "keywords": [
    "人工智能",
    "科技",
    "中国城市"
  ],
  "MAX_ITEMS_PER_KEYWORD": 50
}
```

2️⃣ Run the Actor

For each keyword, the Actor performs a Baidu Images search, extracts structured image results with their metadata, and stops automatically once the limit is reached.

3️⃣ Export the Dataset

Download your results in multiple formats:

✅ JSON

✅ CSV

✅ Excel

✅ XML

✅ HTML
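For scripted exports, the same formats can be requested from the Apify dataset items endpoint through its `format` query parameter. The helper below is a minimal sketch that only builds the download URL. `export_url` and `EXPORT_FORMATS` are illustrative names, and passing the token as a query parameter is one option (an `Authorization` header also works).

```python
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"

# Formats listed above, mapped to values of the `format` query parameter.
EXPORT_FORMATS = {"JSON": "json", "CSV": "csv", "Excel": "xlsx", "XML": "xml", "HTML": "html"}


def export_url(dataset_id: str, fmt: str, token: str = "") -> str:
    """Build the download URL for a dataset in one of the formats above."""
    params = {"format": EXPORT_FORMATS[fmt]}
    if token:  # only needed for datasets that are not publicly readable
        params["token"] = token
    return f"{API_BASE}/datasets/{dataset_id}/items?{urlencode(params)}"


# "abc123" is a placeholder; use the run's defaultDatasetId in practice.
print(export_url("abc123", "CSV"))
```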


---

### ⚙️ Input Configuration

#### 📥 Input Example

```json
{
  "keywords": ["人工智能"],
  "MAX_ITEMS_PER_KEYWORD": 50
}
```



#### Input Fields

| Field                   | Type    | Description                           |
| ----------------------- | ------- | ------------------------------------- |
| `keywords`              | array   | List of search keywords               |
| `MAX_ITEMS_PER_KEYWORD` | integer | Maximum images to extract per keyword |
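
Checking the input locally before starting a run can catch simple mistakes. The sketch below validates the two fields from the table. The stricter rules (a non-empty keyword list, a positive limit) are assumptions made for illustration and are not stated by the Actor's schema, and `validate_input` is a hypothetical helper.

```python
from typing import Any, Dict, List


def validate_input(run_input: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the input looks usable."""
    problems = []

    keywords = run_input.get("keywords")
    if not isinstance(keywords, list) or not keywords:
        problems.append("keywords must be a non-empty array")
    elif not all(isinstance(k, str) and k.strip() for k in keywords):
        problems.append("every keyword must be a non-empty string")

    limit = run_input.get("MAX_ITEMS_PER_KEYWORD")
    # bool is a subclass of int in Python, so exclude it explicitly.
    if not isinstance(limit, int) or isinstance(limit, bool) or limit < 1:
        problems.append("MAX_ITEMS_PER_KEYWORD must be a positive integer")

    return problems


print(validate_input({"keywords": ["人工智能"], "MAX_ITEMS_PER_KEYWORD": 50}))
```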


---


### 📤 Output Example

```json
{
  "keyword": "人工智能",
  "position": 1,
  "title": "人工智能概念图",
  "imageUrl": "https://example.com/image.jpg",
  "thumbnailUrl": "https://example.com/thumb.jpg",
  "sourceUrl": "https://example.com/article",
  "width": 1200,
  "height": 800,
  "fileSize": "245 KB",
  "searchTimestamp": "2026-02-13T14:32:21.000Z"
}
```

---

### 📊 Output explanation

| Field             | Description                     |
| ----------------- | ------------------------------- |
| `keyword`         | Search term used                |
| `position`        | Ranking in Baidu Images results |
| `title`           | Image title                     |
| `imageUrl`        | Direct image file URL           |
| `thumbnailUrl`    | Preview image URL               |
| `sourceUrl`       | Original webpage source         |
| `width`           | Image width                     |
| `height`          | Image height                    |
| `fileSize`        | File size (if available)        |
| `searchTimestamp` | Time of scraping                |
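
For downstream analysis it can help to convert each item into a typed record. The sketch below does that for a subset of the fields. `ImageResult`, `to_image_result`, and `parse_file_size` are hypothetical helpers, and the size parser assumes the `"<number> <unit>"` shape seen in the output example with 1 KB taken as 1024 bytes, which the Actor does not document.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional

# Assumed unit multipliers (binary units); the Actor does not specify them.
_UNITS = {"B": 1, "KB": 1024, "MB": 1024 ** 2, "GB": 1024 ** 3}


def parse_file_size(text: Optional[str]) -> Optional[int]:
    """Convert a string such as "245 KB" to bytes; return None if it cannot be parsed."""
    if not text:
        return None
    parts = text.split()
    if len(parts) != 2 or parts[1].upper() not in _UNITS:
        return None
    try:
        return int(float(parts[0]) * _UNITS[parts[1].upper()])
    except ValueError:
        return None


@dataclass
class ImageResult:
    keyword: str
    position: int
    image_url: str
    width: Optional[int]
    height: Optional[int]
    file_size_bytes: Optional[int]
    scraped_at: datetime


def to_image_result(item: dict) -> ImageResult:
    """Map one dataset item to an ImageResult."""
    return ImageResult(
        keyword=item["keyword"],
        position=item["position"],
        image_url=item["imageUrl"],
        width=item.get("width"),
        height=item.get("height"),
        file_size_bytes=parse_file_size(item.get("fileSize")),
        # Swap the trailing "Z" so fromisoformat also works on Python < 3.11.
        scraped_at=datetime.fromisoformat(item["searchTimestamp"].replace("Z", "+00:00")),
    )
```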


---


### 🌍 Why Use This Scraper? 

📈 China Visual SEO Intelligence — track image rankings on Baidu

🕵️ Competitor Visual Monitoring — analyze brand imagery in China

🏷 Brand Tracking — monitor logos and visual identity

🎨 Creative Research — discover trending visuals

🤖 AI Dataset Building — structured image training data

🔄 Automation Ready — schedule recurring scraping runs

📊 SERP Intelligence — analyze visual search behavior

---

### ⚠️ Disclaimer

This tool is an independent solution and is not affiliated with, endorsed by, or sponsored by Baidu or any of its subsidiaries or partners.

---

### 💸 Pricing

This scraper uses a **pay-per-event pricing model**.

You only pay for **successful runs**.

💳 **Price:** $9.98 / 1000 results
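
As a rough worked example, the result price can be turned into an upper-bound estimate for a run. The sketch assumes one actor-start event per run at the "from" price shown on the Store page, and it excludes Apify platform usage, which is billed separately. `estimate_cost_usd` is a hypothetical helper.

```python
PRICE_PER_1000_RESULTS_USD = 9.98  # from the price line above
ACTOR_START_USD = 0.00005          # "from" price per actor start; may differ by plan


def estimate_cost_usd(num_keywords: int, max_items_per_keyword: int) -> float:
    """Upper-bound event charges for one run, excluding platform usage."""
    max_results = num_keywords * max_items_per_keyword
    return ACTOR_START_USD + max_results * PRICE_PER_1000_RESULTS_USD / 1000


# 3 keywords x 50 images = at most 150 results, i.e. 150 * 9.98 / 1000 = 1.497 USD plus the start event.
print(round(estimate_cost_usd(3, 50), 5))
```

The actual charge can be lower, since a keyword may return fewer results than the limit.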

---

### Related Actors 

If you're interested in business scraping solutions, explore more tools:

(Coming soon)

---

### 📬 Support

⭐⭐⭐⭐⭐ Leave a 5-star rating if you like this tool

---

### 🌍 PrimeScrape

Built for scalable web data extraction & automation

Contact for custom scraping solutions or enterprise requests via Apify or by email.

# Actor input Schema

## `keywords` (type: `array`):

List of keywords or topics to search for images. Each keyword will be scraped separately.

## `MAX_ITEMS_PER_KEYWORD` (type: `integer`):

Maximum number of images to extract per keyword.


## Actor input object example

```json
{
  "keywords": [
    "网络",
    "Web"
  ],
  "MAX_ITEMS_PER_KEYWORD": 20
}
````

# Actor output Schema

## `overview` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "keywords": [
        "网络",
        "Web"
    ],
    "MAX_ITEMS_PER_KEYWORD": 20
};

// Run the Actor and wait for it to finish
const run = await client.actor("delectable_incubator/baidu-images-scraper---low-cost").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "keywords": [
        "网络",
        "Web",
    ],
    "MAX_ITEMS_PER_KEYWORD": 20,
}

# Run the Actor and wait for it to finish
run = client.actor("delectable_incubator/baidu-images-scraper---low-cost").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```
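
Once the items are fetched, a common next step for dataset building is to drop repeated images and group results by keyword. The helper below is a minimal sketch of that post-processing. `dedupe_and_group` and its size thresholds are illustrative and not part of the Apify client.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def dedupe_and_group(items: Iterable[dict], min_width: int = 0, min_height: int = 0) -> Dict[str, List[dict]]:
    """Drop repeated imageUrl values and undersized images, then group by keyword."""
    seen = set()
    grouped = defaultdict(list)
    for item in items:
        url = item.get("imageUrl")
        if not url or url in seen:
            continue
        # Missing dimensions are treated as 0, so they fail any positive threshold.
        if (item.get("width") or 0) < min_width or (item.get("height") or 0) < min_height:
            continue
        seen.add(url)
        grouped[item["keyword"]].append(item)
    return dict(grouped)


# Example: grouped = dedupe_and_group(client.dataset(run["defaultDatasetId"]).iterate_items(), min_width=512)
```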

## CLI example

```bash
echo '{
  "keywords": [
    "网络",
    "Web"
  ],
  "MAX_ITEMS_PER_KEYWORD": 20
}' |
apify call delectable_incubator/baidu-images-scraper---low-cost --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=delectable_incubator/baidu-images-scraper---low-cost",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Baidu Images Scraper - Low-cost💲🔥🇨🇳🖼️",
        "description": "Scrape Baidu Images search results easily 🇨🇳🖼️ with a powerful image scraper. \n\nExtract image URLs, thumbnails, titles, dimensions, sizes, and metadata for any keyword. \n\nIdeal for visual research, AI training datasets, trend discovery, and Baidu Images SERP tracking with structured data 📊🚀",
        "version": "0.0",
        "x-build-id": "akMi5Fyx21c0EYubO"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/delectable_incubator~baidu-images-scraper---low-cost/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-delectable_incubator-baidu-images-scraper---low-cost",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/delectable_incubator~baidu-images-scraper---low-cost/runs": {
            "post": {
                "operationId": "runs-sync-delectable_incubator-baidu-images-scraper---low-cost",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/delectable_incubator~baidu-images-scraper---low-cost/run-sync": {
            "post": {
                "operationId": "run-sync-delectable_incubator-baidu-images-scraper---low-cost",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "keywords",
                    "MAX_ITEMS_PER_KEYWORD"
                ],
                "properties": {
                    "keywords": {
                        "title": "Keywords or topics 🔍🖼️ | 关键词或主题 🔍🖼️",
                        "type": "array",
                        "description": "List of keywords or topics to search for images. Each keyword will be scraped separately.\n\n用于搜索图片的关键词或主题列表。每个关键词将分别进行抓取。",
                        "items": {
                            "type": "string"
                        },
                        "default": [
                            "网络",
                            "Web"
                        ]
                    },
                    "MAX_ITEMS_PER_KEYWORD": {
                        "title": "Max images per keyword | 每个关键词的最大图片数",
                        "type": "integer",
                        "description": "Maximum number of images to extract per keyword.\n\n每个关键词要提取的图片最大数量。",
                        "default": 20
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
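
For projects without an Apify client, the `run-sync-get-dataset-items` path from the specification above can be called directly over HTTP. The following is a minimal sketch using only the Python standard library. `build_run_request` and `run_and_fetch` are hypothetical helper names, and the 300-second timeout is an arbitrary choice.

```python
import json
import urllib.request
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "delectable_incubator~baidu-images-scraper---low-cost"


def build_run_request(run_input: dict, token: str) -> urllib.request.Request:
    """Prepare the POST request described by the OpenAPI specification."""
    query = urlencode({"token": token})
    url = f"{API_BASE}/acts/{ACTOR_PATH}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_and_fetch(run_input: dict, token: str, timeout: float = 300.0) -> list:
    """Send the request and decode the returned dataset items (performs a network call)."""
    request = build_run_request(run_input, token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example (requires a valid token and network access):
# items = run_and_fetch({"keywords": ["Web"], "MAX_ITEMS_PER_KEYWORD": 20}, "<YOUR_API_TOKEN>")
```

Because the call is synchronous, long runs may exceed the HTTP timeout. In that case the `/runs` path, which returns immediately with run information, is likely the better fit.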
