๐Ÿ” Baidu Search Scraper avatar

๐Ÿ” Baidu Search Scraper

Pricing

from $3.99 / 1,000 results

Go to Apify Store
๐Ÿ” Baidu Search Scraper

๐Ÿ” Baidu Search Scraper

Scrape Baidu search results at scale. Extract organic listings, answer boxes, related videos, related searches, and top searches. Supports bulk queries, proxy fallback, date filters, and device/language options for SEO and market research.

Pricing

from $3.99 / 1,000 results

Rating

0.0

(0)

Developer

Scrapio

Scrapio

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Baidu Search Scraper

Scrape Baidu search results at scale with intelligent proxy fallback. Extract organic results, answer boxes, related videos, people also search for, related searches, and top searches.

Why Choose Us?

  • Intelligent proxy fallback: Starts with no proxy; automatically falls back to datacenter then residential if Baidu blocks requests
  • Bulk support: Process multiple search queries in a single run
  • Comprehensive extraction: Organic results, answer boxes, videos, related searches, and more
  • Robust error handling: Retries and fallbacks keep your runs successful

Key Features

  • No proxy by default โ€“ Saves cost when Baidu allows direct access
  • Automatic fallback โ€“ Datacenter โ†’ Residential with 3 retries on block
  • Stick with residential โ€“ Once fallback occurs, uses residential for all remaining requests
  • Detailed logging โ€“ Clear proxy events and progress in Apify logs
  • Bulk URLs or terms โ€“ Support for Baidu search URLs or plain search terms

Input

FieldTypeRequiredDescription
urlsarrayYesBaidu search URLs (e.g. https://www.baidu.com/s?wd=python) or plain search terms
proxyConfigurationobjectNoProxy settings. Default: no proxy. Falls back automatically on block
maxPaginationintegerNoMax pages per query (0-10). Default: 3
numResultsintegerNoResults per page (1-50). Default: 10
startPageintegerNoStarting page. Default: 1
timePeriodobjectNoOptional date filter: {startDate, endDate} (YYYY-MM-DD)

Example Input

{
"urls": [
{ "url": "https://www.baidu.com/s?wd=python" },
{ "url": "https://www.baidu.com/s?wd=Javascript" }
],
"proxyConfiguration": { "useApifyProxy": false },
"maxPagination": 3,
"numResults": 10
}

Output

{
"summary": {
"total_queries": 2,
"queries": ["python", "Javascript"],
"total_organic_results": 200,
"total_answer_boxes": 168,
"total_related_videos": 0,
"total_people_also_search_for": 180,
"total_related_searches": 1,
"total_top_searches": 1675
},
"results_by_query": {
"python": {
"query": "python",
"organic_results": [...],
"answer_box": [...],
"related_videos": [...],
"people_also_search_for": [...],
"related_searches": [...],
"top_searches": [...]
}
}
}

How to Use the Actor (via Apify Console)

  1. Log in at https://console.apify.com and go to Actors
  2. Find baidu-search-scraper and click it
  3. Configure inputs (URLs/terms, proxy toggle, max pages, etc.)
  4. Run the actor
  5. Monitor logs in real time
  6. Access results in the OUTPUT tab
  7. Export results to JSON or CSV

Best Use Cases

  • SEO research and competitor analysis
  • Market research and trend monitoring
  • Content discovery and topic research
  • Academic research on search behavior

Frequently Asked Questions

Q: Does it work without a proxy?
A: Yes. By default it uses no proxy. If Baidu blocks, it automatically falls back to datacenter then residential proxies.

Q: Can I use my own proxy?
A: Yes. Configure proxy in the input. The fallback still applies if your proxy gets blocked.

Q: What if residential proxy fails?
A: It retries up to 3 times, then logs the error and continues with remaining queries.

Support and Feedback

Apify Store | Apify Documentation

Cautions

  • Data is collected only from publicly available sources
  • No data from private accounts or password-protected content
  • End users are responsible for legal compliance (privacy, data protection, etc.)