Pricing

from $1.20 / 1,000 results

Go to Apify Store

Baidu Search Scraper 百度搜索结果采集器 | SERP & 关键词排名抓取

Try for free

Fast Baidu SERP scraper / 百度搜索结果采集器。批量采集百度搜索结果(标题、真实链接、摘要、日期),支持 site:、文件类型、时间范围、精确短语等筛选。适用于百度SEO关键词排名监控、竞品分析与市场数据抓取

Pricing

from $1.20 / 1,000 results

Rating

0.0

(0)

Developer

Lofomachines

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

Baidu Search Scraper | Extract Baidu Search Results

Baidu Search Scraper is a powerful web scraping tool designed to extract search engine results pages (SERPs) from Baidu (China's leading search engine). This scraper is built to reliably bypass captchas, anti-scraping systems, and bot detection mechanisms.

Whether you need to collect search results for SEO monitoring, brand protection, sentiment analysis, academic research, or market intelligence, this Baidu Scraper handles the complexities of pagination.

🚀 Key Features

Blazing Fast, Browserless Engine: Runs entirely over plain HTTP (no headless browser), fetching result pages concurrently. This makes runs dramatically faster and cheaper, with a tiny memory footprint.
Concurrent Multi-Query Search: Input multiple search terms (one per line) and process several of them in parallel within a single run.
Real Destination URL Resolution: Baidu search results use encrypted redirect links (baidu.com/link?url=...). This scraper automatically follows redirects to extract and output the real destination URLs.
Advanced Baidu Search Operators: Fully supports filters such as:
- Domain filtering (site:domain.com) - supports multiple domains combined with OR.
- Recency/Time range (last 24 hours, week, month, year).
- File type filtering (PDF, Word, Excel, PPT, RTF).
- Language Script Selection (Simplified or Traditional Chinese).
- Search in page titles only (intitle:) and Exact phrase matching ("phrase").

🎯 Use Cases

Chinese Market SEO Monitoring: Track organic rankings, indexation status, and SERP visibility for keywords on Baidu.
Brand Protection & Infringement Tracking: Search for unauthorized sellers, trademark violations, or fake brand representations on Chinese web properties.
Competitor Intelligence: Analyze competitor landing pages, display domains, and search snippets rank for specific terms.
Academic & Sentiment Analysis Research: Extract historical data, news snippets, and online discussions relevant to Chinese culture, business, or politics.

🛠️ How to Use

Configure Queries: Enter one or more keywords/queries in the Queries / Search Terms input box (one per line).
Define Max Results: Set the maximum number of results you want to retrieve per query (e.g., 100).
Apply Filters: (Optional) Restrict results by site/domain, publication date (time range), language script, or file type.
Enable URL Resolution: Keep Resolve real URLs checked to follow redirects and get the actual target URLs instead of raw Baidu redirect links.
Configure Proxy: The Apify Proxy is enabled by default for reliability. For heavy usage, residential proxies are recommended to avoid IP bans.
Run the Actor: Click the Run button. The scraper will collect the data and store it in your default dataset.

📥 Input Configuration

Here is a list of the available input parameters:

Field Name	Type	Description	Default
`queries`	`array`	List of search queries to run sequentially (one per line).	`["claude anthropic"]`
`maxResults`	`integer`	Max results to collect for each query.	`100`
`timeRange`	`string`	Filter results by date: `any`, `day`, `week`, `month`, or `year`.	`"any"`
`sites`	`array`	Limit search to specific domains (e.g. `wikipedia.org`).	`[]`
`filetype`	`string`	Limit results to specific file types: `pdf`, `doc`, `xls`, `ppt`, `rtf`.	`"any"`
`language`	`string`	Chinese script: `any`, `simplified`, or `traditional`.	`"any"`
`exactPhrase`	`string`	Require results to contain this exact phrase.	`""`
`excludeWords`	`array`	Exclude results containing these words.	`[]`
`titleOnly`	`boolean`	Restrict search matches to page titles only.	`false`
`resolveRealUrls`	`boolean`	Follow Baidu redirect links to get the real target URL.	`true`
`pageConcurrency`	`integer`	How many Baidu result pages to fetch in parallel per query (1–20).	`5`
`queryConcurrency`	`integer`	How many queries to process in parallel (1–10).	`3`
`proxyConfiguration`	`object`	Proxy settings. Apify Proxy is enabled by default for reliability.	`{ "useApifyProxy": true }`

📤 Output Format

Each scraped search result item is stored as an object in the Apify dataset. The scraper outputs the following fields:

Field	Type	Description
`query`	`string`	The search query term.
`position`	`integer`	1-based ranking position of the result for this query.
`page`	`integer`	The page number on Baidu where the result was found.
`title`	`string`	The title of the search result page.
`url`	`string`	The resolved, final destination URL (e.g., `https://example.com/page`).
`baiduUrl`	`string`	The original Baidu redirect URL.
`displayUrl`	`string`	The display domain name shown on Baidu.
`snippet`	`string`	Description snippet text matching your search terms.
`date`	`string`	Publication date of the page, normalized to `YYYY-MM-DD` (relative dates like `13小时前` / `3天前` are converted to an absolute date). `null` when Baidu shows no date.

Results are written to the dataset incrementally in batches of 10 as they are scraped, so data is available while the run is still in progress.

Output JSON Example

{
  "query": "apple",
  "position": 1,
  "page": 1,
  "title": "Apple (中国大陆) - 官方网站",
  "url": "https://www.apple.com.cn/",
  "baiduUrl": "http://www.baidu.com/link?url=6lHipUPotM6NN3efDPvd4gZk1ZSQhtVwsIBdG3DGtmFUBe5LzfEdru89qaxDmtNy",
  "displayUrl": "www.apple.com.cn/",
  "snippet": "探索Apple 的创新世界,选购各式 iPhone、iPad、Apple Watch 和 Mac,浏览各类配件、娱乐产品,并获得相关产品的专家服务支持。",
  "date": "2026-05-15"
}

💡 Troubleshooting & Performance Tips

Maximum Speed: Setting resolveRealUrls to false skips the per-result redirect-resolution step entirely. If you only need domain names or the raw Baidu redirect links, turn this off for the fastest possible runs.
Tuning Throughput: Increase pageConcurrency and queryConcurrency to fetch more in parallel. If you start seeing Baidu's anti-bot page, lower them and make sure the Apify Proxy is enabled.
Reliability: The Apify Proxy is enabled by default and a fresh proxy session is rotated in on every retry, so a single blocked IP is automatically swapped out.

❓ FAQ

Q: Can I scrape thousands of keywords?
A: Yes! You can input a large list of keywords in the queries field.

Q: Why are some destination URLs identical to the Baidu redirect URLs?
A: If the target website is offline, slow to respond, or blocks redirect resolution requests, the scraper falls back to the original Baidu redirect link to ensure you do not lose data.

百度搜索结果采集器 | 百度 SERP 数据抓取与关键词排名监控工具

关键词: 百度搜索采集器、百度SERP API、百度爬虫、百度搜索结果抓取、百度数据采集、百度关键词排名监控、百度SEO工具、百度搜索结果导出Excel、批量采集百度、百度排名查询。

百度搜索结果采集器(Baidu Search Scraper) 是一款高速、稳定的百度搜索引擎结果页(SERP)数据抓取工具。它能够批量采集百度搜索结果,自动提取每条结果的标题、真实网址、显示域名、描述摘要与发布日期,并支持百度的全部高级搜索指令。无论您是做百度SEO关键词排名监控、竞品分析、品牌口碑监测,还是学术与市场研究,本工具都能稳定、高效地为您获取结构化数据。

本采集器采用纯 HTTP 无头浏览器(browserless)架构,并发抓取多个结果页,因此速度极快、成本极低、内存占用极小——抓取约 50 条结果通常只需几秒钟。

🚀 核心功能

极速无浏览器引擎:完全基于 HTTP 请求并发抓取,无需启动浏览器,速度快、费用低、资源占用小。
多关键词并行采集:一次运行可批量处理多个搜索词(每行一个),并行执行。
真实网址解析:百度搜索结果使用加密跳转链接(baidu.com/link?url=...),本工具自动解析还原为真实目标网址。
日期智能归一化:将"13小时前""3天前"等相对时间统一转换为 YYYY-MM-DD 标准日期格式。
分批写入数据集:结果每 10 条实时写入数据集,运行过程中即可查看已采集数据。
强大的百度搜索指令支持:
- 站点过滤(site:domain.com)——支持多个域名 OR 组合;
- 时间范围(24小时、一周、一月、一年内);
- 文件类型(PDF、Word、Excel、PPT、RTF);
- 中文简繁体筛选;
- 仅标题搜索(intitle:)与精确短语匹配("短语");
- 排除关键词(-词)。

🎯 应用场景

百度SEO与关键词排名监控:追踪关键词在百度的自然排名、收录情况与 SERP 展现。
品牌保护与侵权监测:发现未授权销售、商标侵权或仿冒品牌信息。
竞品情报分析:分析竞争对手的落地页、展示域名与搜索摘要。
舆情与学术研究:批量采集新闻摘要、网络讨论等中文语料数据。

🛠️ 使用方法

填写搜索词:在"Queries / 搜索词"输入框中输入一个或多个关键词(每行一个)。
设置采集数量:设定每个关键词需要抓取的结果数量(maxResults)。
应用筛选条件(可选):按站点、时间范围、语言或文件类型过滤。
解析真实网址:保持勾选 resolveRealUrls,以获取真实目标网址而非百度跳转链接。
代理设置:默认已启用 Apify 代理以保证稳定性;大批量采集建议使用住宅代理。
运行 Actor:点击 Run 即可,数据将分批保存到数据集中,可导出为 JSON、CSV、Excel 等格式。

📥 输入参数

参数	类型	说明	默认值
`queries`	`array`	要采集的搜索词列表(每行一个)。	`["apple"]`
`maxResults`	`integer`	每个关键词采集的最大结果数。	`10`
`timeRange`	`string`	时间范围:`any`、`day`、`week`、`month`、`year`。	`"any"`
`sites`	`array`	限定采集的域名(`site:`)。	`[]`
`filetype`	`string`	限定文件类型:`pdf`、`doc`、`xls`、`ppt`、`rtf`。	`"any"`
`language`	`string`	中文简繁体:`any`、`simplified`、`traditional`。	`"any"`
`exactPhrase`	`string`	精确短语匹配。	`""`
`excludeWords`	`array`	排除包含这些词的结果。	`[]`
`titleOnly`	`boolean`	仅在标题中搜索(`intitle:`)。	`false`
`resolveRealUrls`	`boolean`	解析百度跳转链接为真实网址。	`true`
`pageConcurrency`	`integer`	每个关键词并行抓取的结果页数量(1–20)。	`5`
`queryConcurrency`	`integer`	并行处理的关键词数量(1–10)。	`3`
`proxyConfiguration`	`object`	代理设置,默认启用 Apify 代理。	`{ "useApifyProxy": true }`

📤 输出字段

字段	类型	说明
`query`	`string`	搜索词。
`position`	`integer`	该结果在此关键词中的排名位置(从 1 开始)。
`page`	`integer`	结果所在的百度页码。
`title`	`string`	搜索结果标题。
`url`	`string`	解析后的真实目标网址。
`baiduUrl`	`string`	百度原始跳转链接。
`displayUrl`	`string`	百度展示的域名/网址。
`snippet`	`string`	结果描述摘要。
`date`	`string`	发布日期,已归一化为 `YYYY-MM-DD`(无日期时为 `null`)。

❓ 常见问题

问:可以批量采集成千上万个关键词吗? 答:可以。在 queries 字段中输入大量关键词即可批量采集。

问:为什么个别结果的目标网址仍是百度跳转链接? 答:当目标网站离线、响应缓慢或拒绝跳转解析请求时,工具会回退保留原始百度跳转链接,以确保数据不丢失。

问:相对日期(如"3天前")会如何处理? 答:工具会自动将其换算为标准的 YYYY-MM-DD 绝对日期。

Baidu Search Scraper - 便宜 Cheap 🌐🇨🇳🔎

scrapestorm/baidu-search-scraper---bian-yi-cheap

🔍 Easily Collect Baidu Search Results 🇨🇳 Extract organic search results from Baidu for any keyword, including result URLs, titles, snippets, displayed links, domains & more 🌐📊 Perfect for China SEO research, competitor analysis, brand monitoring, market intelligence & Baidu SERP tracking 🚀✨

Storm_Scraper

Baidu Videos Scraper - Low-cost 低成本💲🎥📺🇨🇳

delectable_incubator/baidu-videos-scraper---low-cost-di-cheng-ben

Scrape Baidu Video search results easily 🇨🇳🎥 with a powerful video SERP scraper. Extract video URLs, titles, thumbnails, durations, sources, and publication dates for any keyword. Ideal for video trend analysis, content research, and Baidu SERP tracking with structured datasets 📊🚀

Prime Scrape

Baidu Videos Scraper - 便宜 Cheap 🇨🇳🔎📺

scrapestorm/baidu-videos-scraper---bian-yi-cheap

🔍 Easily Collect Baidu Video Search Results 🇨🇳🎥 Extract video search results from Baidu for any keyword, including video URLs, titles, thumbnails, duration, sources, publication dates & more 🌐📊 Perfect for China video trend monitoring, content research & Baidu Video SERP tracking 🚀✨

Storm_Scraper

Baidu Images Scraper - 便宜 Cheap 🇨🇳🔎🖼️ 百度图片爬虫

scrapestorm/baidu-images-scraper---bian-yi-cheap-bai-du-tu-pian-pa-chong

🔍 Easily Collect Baidu Images Search Results 🇨🇳🖼️ Extract image search results from Baidu Images for any keyword, including image URLs, thumbnails, titles, dimensions, sizes & more 🌐📊 Perfect for China visual research, AI dataset collection, trend discovery & Baidu Images SERP tracking 🚀✨

Storm_Scraper

Baidu Images Scraper - Low-cost💲🔥🇨🇳🖼️

delectable_incubator/baidu-images-scraper---low-cost

Scrape Baidu Images search results easily 🇨🇳🖼️ with a powerful image scraper. Extract image URLs, thumbnails, titles, dimensions, sizes, and metadata for any keyword. Ideal for visual research, AI training datasets, trend discovery, and Baidu Images SERP tracking with structured data 📊🚀

Prime Scrape

Baidu Notes Scraper - 便宜 Cheap 🇨🇳🔎📺

scrapestorm/baidu-notes-scraper---bian-yi-cheap

📝 Easily Collect Baidu Notes Search 🇨🇳🔎 Extract note search results from Baidu for any keyword, including note URLs, titles, thumbnails, authors, publication dates & more 🌐📊 Perfect for China content trend monitoring, brand research, competitor analysis & Baidu Notes SERP tracking 🚀✨

Storm_Scraper

Baidu News Scraper - 便宜 Cheap 📰🇨🇳🔎

scrapestorm/baidu-news-scraper---bian-yi-cheap

🔍 Easily Collect Baidu News Search Results 🇨🇳📰 Extract news search results from Baidu News for any keyword, including article URLs, titles, snippets, sources, publication dates, thumbnails & more 🌐📊 Perfect for China media monitoring, brand reputation tracking & Baidu News SERP tracking 🚀✨

Storm_Scraper

Baidu Notes Scraper - Low-cost 低成本💲🔥🇨🇳📺

delectable_incubator/baidu-notes-scraper---low-cost-di-cheng-ben

Scrape Baidu Notes search results easily 🇨🇳📝 with a powerful content scraper. Extract note URLs, titles, thumbnails, authors, and publication dates for any keyword. Ideal for content trend monitoring, brand research, competitor analysis, and Baidu SERP tracking with structured datasets 📊🚀

Prime Scrape

Baidu Scraper All-in-One - 便宜 Cheap 🇨🇳🔎🌐

scrapestorm/baidu-scraper-all-in-one---bian-yi-cheap

🔍 Easily Collect Baidu Search Results Across All Verticals 🇨🇳🌐 Extract structured search results from Baidu — including Web, Images, Videos, News, Notes and Library — for any keyword Perfect for China market research, SEO, media tracking, competitor analysis & multi-vertical SERP monitoring 🚀

Storm_Scraper