๐ Baidu Search Scraper
Pricing
from $3.99 / 1,000 results
๐ Baidu Search Scraper
Scrape Baidu search results at scale. Extract organic listings, answer boxes, related videos, related searches, and top searches. Supports bulk queries, proxy fallback, date filters, and device/language options for SEO and market research.
Pricing
from $3.99 / 1,000 results
Rating
0.0
(0)
Developer
Scrapio
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
a day ago
Last modified
Categories
Share
Baidu Search Scraper
Scrape Baidu search results at scale with intelligent proxy fallback. Extract organic results, answer boxes, related videos, people also search for, related searches, and top searches.
Why Choose Us?
- Intelligent proxy fallback: Starts with no proxy; automatically falls back to datacenter then residential if Baidu blocks requests
- Bulk support: Process multiple search queries in a single run
- Comprehensive extraction: Organic results, answer boxes, videos, related searches, and more
- Robust error handling: Retries and fallbacks keep your runs successful
Key Features
- No proxy by default โ Saves cost when Baidu allows direct access
- Automatic fallback โ Datacenter โ Residential with 3 retries on block
- Stick with residential โ Once fallback occurs, uses residential for all remaining requests
- Detailed logging โ Clear proxy events and progress in Apify logs
- Bulk URLs or terms โ Support for Baidu search URLs or plain search terms
Input
| Field | Type | Required | Description |
|---|---|---|---|
| urls | array | Yes | Baidu search URLs (e.g. https://www.baidu.com/s?wd=python) or plain search terms |
| proxyConfiguration | object | No | Proxy settings. Default: no proxy. Falls back automatically on block |
| maxPagination | integer | No | Max pages per query (0-10). Default: 3 |
| numResults | integer | No | Results per page (1-50). Default: 10 |
| startPage | integer | No | Starting page. Default: 1 |
| timePeriod | object | No | Optional date filter: {startDate, endDate} (YYYY-MM-DD) |
Example Input
{"urls": [{ "url": "https://www.baidu.com/s?wd=python" },{ "url": "https://www.baidu.com/s?wd=Javascript" }],"proxyConfiguration": { "useApifyProxy": false },"maxPagination": 3,"numResults": 10}
Output
{"summary": {"total_queries": 2,"queries": ["python", "Javascript"],"total_organic_results": 200,"total_answer_boxes": 168,"total_related_videos": 0,"total_people_also_search_for": 180,"total_related_searches": 1,"total_top_searches": 1675},"results_by_query": {"python": {"query": "python","organic_results": [...],"answer_box": [...],"related_videos": [...],"people_also_search_for": [...],"related_searches": [...],"top_searches": [...]}}}
How to Use the Actor (via Apify Console)
- Log in at https://console.apify.com and go to Actors
- Find baidu-search-scraper and click it
- Configure inputs (URLs/terms, proxy toggle, max pages, etc.)
- Run the actor
- Monitor logs in real time
- Access results in the OUTPUT tab
- Export results to JSON or CSV
Best Use Cases
- SEO research and competitor analysis
- Market research and trend monitoring
- Content discovery and topic research
- Academic research on search behavior
Frequently Asked Questions
Q: Does it work without a proxy?
A: Yes. By default it uses no proxy. If Baidu blocks, it automatically falls back to datacenter then residential proxies.
Q: Can I use my own proxy?
A: Yes. Configure proxy in the input. The fallback still applies if your proxy gets blocked.
Q: What if residential proxy fails?
A: It retries up to 3 times, then logs the error and continues with remaining queries.
Support and Feedback
Apify Store | Apify Documentation
Cautions
- Data is collected only from publicly available sources
- No data from private accounts or password-protected content
- End users are responsible for legal compliance (privacy, data protection, etc.)