# Baidu Videos Scraper - Low-cost 低成本💲🎥📺🇨🇳 (`delectable_incubator/baidu-videos-scraper---low-cost-di-cheng-ben`) Actor

Scrape Baidu Video search results easily 🇨🇳🎥 with a powerful video SERP scraper.

Extract video URLs, titles, thumbnails, durations, sources, and publication dates for any keyword.

Ideal for video trend analysis, content research, and Baidu SERP tracking with structured datasets 📊🚀

- **URL**: https://apify.com/delectable\_incubator/baidu-videos-scraper---low-cost-di-cheng-ben.md
- **Developed by:** [Prime Scrape](https://apify.com/delectable_incubator) (community)
- **Categories:** Videos, SEO tools, Automation
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $0.00005 / actor start

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

<p align="center"> <img src="https://i.ibb.co/jkNS73wX/readme.png" alt="Baidu All-in-One Scraper - PrimeScrape" width="100%"> </p>

---

### Baidu All-in-One Scraper 🇨🇳🚀🔎🌐

The Baidu All-in-One Scraper is a fast, scalable, and powerful Apify Actor designed to extract structured data from multiple Baidu verticals in a single unified workflow.

It supports Web Search, Images, Videos, News, Notes, and Library results, making it a complete solution for China market research, SEO intelligence, brand monitoring, competitor analysis, and AI dataset generation.

---

### 百度全能爬虫 🧠📊🚀

该工具可通过批量关键词自动抓取百度多个搜索垂直领域的数据，包括网页、图片、视频、新闻、笔记和文库。

非常适合市场研究、SEO分析、品牌监测、竞品分析以及 AI 数据集构建使用。

---

### 🎯 What This Scraper Does

Simply provide keywords and select a Baidu vertical — the scraper handles everything automatically.

✅ Supports multiple Baidu verticals (Search, Images, Videos, News, Notes, Library)

✅ Bulk keyword processing in a single run

✅ Extracts clean structured SERP data

✅ Handles pagination and scrolling automatically

✅ Stops at defined limits per keyword

✅ Seamlessly processes multiple keywords

✅ Fully automation-ready for scheduling workflows

✅ Optimized for Baidu SERP structure

---

### 📊 Data Extracted

Depending on the selected searchType, the scraper extracts:

| Field             | Description                    |
| ----------------- | ------------------------------ |
| `keyword`         | Search keyword used            |
| `position`        | Ranking position in results    |
| `title`           | Result title                   |
| `url`             | Direct result URL              |
| `displayedUrl`    | Visible display URL            |
| `snippet`         | Description / summary          |
| `domain`          | Source domain                  |
| `thumbnail`       | Thumbnail image (if available) |
| `imageUrl`        | Direct image URL (Images mode) |
| `source`          | Publisher / platform           |
| `author`          | Author name (if available)     |
| `publishedDate`   | Publication date (News/Notes)  |
| `duration`        | Video duration (Videos mode)   |
| `score`           | Relevance / rating score       |
| `pageCount`       | Document pages (Library mode)  |
| `viewCount`       | Views (if available)           |
| `likes`           | Engagement metrics             |
| `searchTimestamp` | Time of extraction             |


---

### 🛠 How to Use

1️⃣ Select a Baidu search type

2️⃣ Enter one or multiple keywords

3️⃣ Set maximum results per keyword

4️⃣ Run the Actor

5️⃣ Export data in JSON, CSV, Excel, XML, or HTML


---

### ⚙️ Input Configuration

#### 📥 Input Example

````

{
"keywords": \["人工智能", "科技", "中国市场"],
"MAX\_ITEMS\_PER\_KEYWORD": 50
}

```



#### Input Fields

| Field                   | Type    | Description                                      |
| ----------------------- | ------- | ------------------------------------------------ |
| `searchType`            | string  | Baidu vertical (search, images, videos, news...) |
| `keywords`              | array   | List of keywords                                 |
| `MAX_ITEMS_PER_KEYWORD` | integer | Maximum results per keyword                      |

---


### 📤 Output Example

```

{
"keyword": "人工智能",
"position": 1,
"title": "人工智能发展趋势分析",
"url": "https://example.com/article",
"displayedUrl": "example.com/article",
"snippet": "本文介绍了人工智能的发展现状...",
"domain": "example.com",
"publishedDate": "2026-02-13",
"searchTimestamp": "2026-02-13T14:32:21.000Z"
}

````

---

### 📊 Output explanation

| Field             | Description        |
| ----------------- | ------------------ |
| `keyword`         | Search term used   |
| `position`        | Ranking position   |
| `title`           | Result title       |
| `url`             | Result URL         |
| `snippet`         | Description text   |
| `domain`          | Source domain      |
| `thumbnail`       | Preview image      |
| `imageUrl`        | Image URL (Images) |
| `source`          | Publisher          |
| `author`          | Author name        |
| `publishedDate`   | Publication date   |
| `duration`        | Video duration     |
| `score`           | Relevance score    |
| `pageCount`       | Document pages     |
| `viewCount`       | Views              |
| `likes`           | Engagement metric  |
| `searchTimestamp` | Extraction time    |

---


### 🌍 Why Use This Scraper? 

📈 China SEO Research — track rankings across Baidu verticals

🕵️ Competitor Monitoring — analyze visibility across formats

🏷 Brand Monitoring — track brand mentions in China ecosystem

🎨 Content Discovery — identify trends across media types

🤖 AI Dataset Creation — build structured multi-source datasets

🔄 Automation Ready — schedule recurring scraping runs

📊 SERP Intelligence — understand Baidu ranking behavior

---

### ⚠️ Disclaimer

This tool is an independent solution and is not affiliated with, endorsed by, or sponsored by Baidu or any of its subsidiaries or partners.

---

### 💸 Pricing

This scraper runs on a **pay per events subscription model**.

You only pay for **successful runs**.

💳 **Price:** $9.98 / 1000 results

---

### Related Actors 

If you're interested in SEO scraping solutions, explore more tools:

(Coming soon)

---

### 📬 Support

⭐⭐⭐⭐⭐ Leave a 5-star rating if you like this tool

---

### 🌍 PrimeScrape

Built for scalable web data extraction & automation

Contact for custom scraping solutions or enterprise requests via Apify or by email.

# Actor input Schema

## `keywords` (type: `array`):

List of keywords or topics to search for videos. Each keyword will be scraped separately.

用于搜索视频的关键词或主题列表。每个关键词将分别进行抓取。
## `MAX_ITEMS_PER_KEYWORD` (type: `integer`):

Maximum number of videos to extract per keyword.

每个关键词要提取的视频最大数量。

## Actor input object example

```json
{
  "keywords": [
    "Web",
    "低成本"
  ],
  "MAX_ITEMS_PER_KEYWORD": 30
}
````

# Actor output Schema

## `overview` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "keywords": [
        "Web",
        "低成本"
    ],
    "MAX_ITEMS_PER_KEYWORD": 30
};

// Run the Actor and wait for it to finish
const run = await client.actor("delectable_incubator/baidu-videos-scraper---low-cost-di-cheng-ben").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "keywords": [
        "Web",
        "低成本",
    ],
    "MAX_ITEMS_PER_KEYWORD": 30,
}

# Run the Actor and wait for it to finish
run = client.actor("delectable_incubator/baidu-videos-scraper---low-cost-di-cheng-ben").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "keywords": [
    "Web",
    "低成本"
  ],
  "MAX_ITEMS_PER_KEYWORD": 30
}' |
apify call delectable_incubator/baidu-videos-scraper---low-cost-di-cheng-ben --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=delectable_incubator/baidu-videos-scraper---low-cost-di-cheng-ben",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Baidu Videos Scraper - Low-cost 低成本💲🎥📺🇨🇳",
        "description": "Scrape Baidu Video search results easily 🇨🇳🎥 with a powerful video SERP scraper. \n\nExtract video URLs, titles, thumbnails, durations, sources, and publication dates for any keyword. \n\nIdeal for video trend analysis, content research, and Baidu SERP tracking with structured datasets 📊🚀",
        "version": "0.0",
        "x-build-id": "tpKlYkmBrIfzsXMYy"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/delectable_incubator~baidu-videos-scraper---low-cost-di-cheng-ben/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-delectable_incubator-baidu-videos-scraper---low-cost-di-cheng-ben",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/delectable_incubator~baidu-videos-scraper---low-cost-di-cheng-ben/runs": {
            "post": {
                "operationId": "runs-sync-delectable_incubator-baidu-videos-scraper---low-cost-di-cheng-ben",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/delectable_incubator~baidu-videos-scraper---low-cost-di-cheng-ben/run-sync": {
            "post": {
                "operationId": "run-sync-delectable_incubator-baidu-videos-scraper---low-cost-di-cheng-ben",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "keywords",
                    "MAX_ITEMS_PER_KEYWORD"
                ],
                "properties": {
                    "keywords": {
                        "title": "Keywords or topics 🔍🎥 | 关键词或主题 🔍🎥",
                        "type": "array",
                        "description": "List of keywords or topics to search for videos. Each keyword will be scraped separately.\n\n用于搜索视频的关键词或主题列表。每个关键词将分别进行抓取。",
                        "items": {
                            "type": "string"
                        },
                        "default": [
                            "Web",
                            "低成本"
                        ]
                    },
                    "MAX_ITEMS_PER_KEYWORD": {
                        "title": "Max videos per keyword | 每个关键词的最大视频数",
                        "type": "integer",
                        "description": "Maximum number of videos to extract per keyword.\n\n每个关键词要提取的视频最大数量。",
                        "default": 30
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
