Zhihu Q&A Tracker - China Hot List & Knowledge Mining

Pricing

from $100.00 / 1,000 Zhihu Q&A records

Scrape Zhihu (知乎), China's Quora: the daily hot list plus keyword Q&A search. Each record has the question, top-answer excerpt, voteup count, view count and category. For China social listening, consumer research and brand monitoring. No CN account needed.

Rating: 0.0 (0)

Developer: NexGenData (maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 2 days ago

🇨🇳 Zhihu Q&A Tracker — Hot List & Knowledge-Mining for China Research

Pull Zhihu (知乎) — China's Quora — as structured data: the daily hot list (热榜) plus keyword-driven Q&A search, each record carrying the question, top-answer excerpt, voteup count, view count and category.

Zhihu is where mainland China goes for long-form, expert-leaning answers — the highest-signal Chinese surface for why people think something, not just that a topic is trending. This actor turns Zhihu's hot list and content search into clean JSON for China social-listening, consumer research and brand-monitoring work, without you needing a Zhihu account, a Chinese phone number or a CN VPN.

Built for market researchers, consumer-insight teams, brand monitors and offshore China-watchers who want the Zhihu signal as rows in a dataset, not a screenshot.


What you can do with it

  • Consumer & market research — mine high-voteup answers for genuine expert opinion on a product, category or brand in China.
  • Brand monitoring — run keyword search on your brand / competitors and read the top-answer excerpt and voteup count to gauge sentiment and depth of discussion.
  • Trend tracking / social listening — capture the daily hot list (mode: "hot") on a schedule to see what questions China is asking right now.
  • Offshore China-watching — read long-form Chinese knowledge content surfaced and categorised, filterable by voteup quality threshold.

What you get per record

Field                    Type           Description
question_id              string         Zhihu question ID
question_title           string         The question (verbatim Chinese)
question_url             string         Canonical zhihu.com/question/{id} URL
category                 string         Canonical category classifier: tech / finance / business / education / career / lifestyle / health / entertainment / science / politics
answer_count             int | null     Number of answers on the question, when surfaced
view_count               int | null     Question view / heat metric, when surfaced
top_answer_excerpt       string | null  Up to 500 chars of the highest-voted answer (HTML stripped)
top_answer_author        string | null  Handle of the top answer's author (匿名用户 if anonymous)
top_answer_voteup_count  int | null     Voteup count of the top answer, Zhihu's strongest quality signal
is_hot                   bool           True if the record appeared on Zhihu's hot list
created_at               string | null  Question creation time, when surfaced
data_source              string         Provenance: exact probe path used (e.g. api.zhihu.com/topstory/hot-list, zhihu.com/hot (initialData), zhihu.com/search)

Most metric fields are nullable on purpose — Zhihu's anti-bot wall means not every path exposes every field on every run.
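
Because so many fields can be null, downstream code is safer when it treats them as optional from the start. The following is a minimal Python sketch, assuming records arrive as plain dicts shaped like the table above. The `ZhihuRecord` class and the `parse_record` helper are hypothetical names used for illustration; they are not part of the actor.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class ZhihuRecord:
    # Fields the table lists without "| null".
    question_id: str
    question_title: str
    question_url: str
    category: str
    is_hot: bool
    data_source: str
    # Nullable fields: None when the probe path did not expose them.
    answer_count: Optional[int] = None
    view_count: Optional[int] = None
    top_answer_excerpt: Optional[str] = None
    top_answer_author: Optional[str] = None
    top_answer_voteup_count: Optional[int] = None
    created_at: Optional[str] = None


def parse_record(raw: dict[str, Any]) -> ZhihuRecord:
    """Build a ZhihuRecord from a raw dict.

    Unknown keys are ignored and missing nullable fields default to None.
    A missing non-nullable field raises TypeError.
    """
    known = ZhihuRecord.__dataclass_fields__
    return ZhihuRecord(**{k: v for k, v in raw.items() if k in known})
```

With this shape, a check such as `record.view_count is None` distinguishes "not surfaced on this run" from a genuine zero.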


Input

Parameter           Type     Default                          Description
mode                string   both                             hot = daily hot list only; search = keyword search only; both = hot list, then top up with keyword search
keywords            array    ["人工智能","新能源汽车","投资"]  Search terms (Chinese gives native-quality hits). Ignored when mode: "hot"
categories          array    ["tech","finance","business"]    Restrict output to these category slugs. Empty = no filter
limit               integer  30                               Max Q&A records (1–500)
min_voteup          integer  0                                Drop records whose top answer has fewer than N voteups (0 = keep all)
include_hot_only    boolean  false                            When true, keep only records that appeared on the hot list
proxyConfiguration  object   RESIDENTIAL                      Apify proxy. RESIDENTIAL strongly recommended; Zhihu blocks datacenter IPs

Sample input

{
  "mode": "both",
  "keywords": ["人工智能", "新能源汽车", "投资"],
  "categories": ["tech", "finance", "business"],
  "limit": 30,
  "min_voteup": 0,
  "include_hot_only": false,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
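
If you generate this input programmatically, it can help to validate it before starting a run. Below is a small Python sketch that builds the same structure and rejects values outside the documented ranges. The `build_run_input` function is a hypothetical helper, not something the actor ships, and the actor may apply its own validation on top.

```python
from typing import Any, Optional

VALID_MODES = {"hot", "search", "both"}
# Defaults copied from the parameter table above.
DEFAULT_KEYWORDS = ["人工智能", "新能源汽车", "投资"]
DEFAULT_CATEGORIES = ["tech", "finance", "business"]


def build_run_input(
    mode: str = "both",
    keywords: Optional[list[str]] = None,
    categories: Optional[list[str]] = None,
    limit: int = 30,
    min_voteup: int = 0,
    include_hot_only: bool = False,
) -> dict[str, Any]:
    """Return an input dict shaped like the sample, rejecting out-of-range values."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {sorted(VALID_MODES)}, got {mode!r}")
    if not 1 <= limit <= 500:
        raise ValueError("limit must be between 1 and 500")
    if min_voteup < 0:
        raise ValueError("min_voteup must be >= 0")
    return {
        "mode": mode,
        "keywords": list(DEFAULT_KEYWORDS if keywords is None else keywords),
        # An explicitly empty list means "no category filter".
        "categories": list(DEFAULT_CATEGORIES if categories is None else categories),
        "limit": limit,
        "min_voteup": min_voteup,
        "include_hot_only": include_hot_only,
        "proxyConfiguration": {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        },
    }
```

The resulting dict can be serialised with `json.dumps(..., ensure_ascii=False)` so the Chinese keywords stay readable when pasted into the actor's input editor.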

Sample output (truncated, schema-accurate)

[
  {
    "question_id": "650000001",
    "question_title": "如何看待新能源汽车 2026 年的价格战?",
    "question_url": "https://www.zhihu.com/question/650000001",
    "category": "finance",
    "answer_count": 412,
    "view_count": 3800000,
    "top_answer_excerpt": "价格战的本质是产能过剩下的份额博弈……(节选)",
    "top_answer_author": "汽车行业分析师",
    "top_answer_voteup_count": 5821,
    "is_hot": true,
    "created_at": "2026-04-02T09:15:00Z",
    "data_source": "api.zhihu.com/topstory/hot-list"
  }
]

How it gets the data

Zhihu is mainland-hosted and applies aggressive anti-bot detection (403 / interstitial captcha / empty payloads to datacenter IPs). The actor uses a probe waterfall so one blocked path does not kill the run:

  1. Mobile API hot list: api.zhihu.com/topstory/hot-list.
  2. Mobile API search: api.zhihu.com/search_v3 for keyword queries.
  3. Public hot-list pages: zhihu.com/hot and zhihu.com/billboard, parsing the embedded initialData blob.
  4. Public content search: zhihu.com/search?type=content.
  5. Question-detail enrichment: zhihu.com/question/{id} to fill in answer count, top-answer excerpt, author and voteup.
  6. Maintenance-stub fallback: a single status: "maintenance" row if every path is blocked, so pipelines never crash.

All paths run behind Apify's RESIDENTIAL proxy pool by default.
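
The general idea of a probe waterfall can be sketched in a few lines of Python. This is a simplified first-success fallback written for illustration, not the actor's actual code: the `run_waterfall` function and the probe callables are hypothetical, and the real control flow (including how the enrichment step and the "both" mode top-up are sequenced) is not shown in this listing.

```python
from typing import Callable, Optional

Record = dict
# A probe returns a list of records, or None / an empty list when its path is blocked.
Probe = Callable[[], Optional[list[Record]]]


def run_waterfall(probes: list[tuple[str, Probe]]) -> list[Record]:
    """Try each (source, probe) pair in order and return the first non-empty result."""
    for source, probe in probes:
        try:
            records = probe()
        except Exception:
            # Treat any failure (403, captcha page, parse error) as "path blocked".
            continue
        if records:
            # Tag each record with the path that produced it.
            return [{**record, "data_source": source} for record in records]
    # Every path failed: emit a single stub row instead of raising.
    return [{"status": "maintenance"}]
```

Returning a stub row rather than raising keeps the output shape predictable for downstream consumers, at the cost of requiring them to check for that row explicitly.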


FAQ

Can I scrape Zhihu without an account or Chinese phone number? Yes. The actor runs server-side behind Apify proxies and returns JSON — no Zhihu login, phone number or VPN on your side.

What is Zhihu? Zhihu (知乎) is China's largest long-form Q&A / knowledge community — the closest mainland equivalent to Quora, but with far deeper expert participation.

How do I get only the trending hot list? Set mode: "hot" (or include_hot_only: true) to keep only hot-list questions and drop keyword-search noise.

How do I filter for high-quality answers? Use min_voteup — 100+ usually means substantive expert content, 1000+ usually means a viral / canonical answer.
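
As a rough illustration of those thresholds, the Python sketch below buckets a record's top-answer voteup count into tiers. The tier names and the `voteup_tier` function are hypothetical labels chosen for this example; only the 100 and 1000 cut-offs come from the guidance above, and they are rules of thumb rather than guarantees.

```python
from typing import Optional


def voteup_tier(voteup: Optional[int]) -> str:
    """Map top_answer_voteup_count to a coarse quality tier."""
    if voteup is None:
        return "unknown"      # field was not surfaced on this run
    if voteup >= 1000:
        return "viral"        # usually a viral / canonical answer
    if voteup >= 100:
        return "substantive"  # usually substantive expert content
    return "low"
```

Keeping "unknown" separate from "low" avoids discarding records whose voteup count simply was not exposed.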

How fresh is the data? Each run captures the hot list / search live and the record's data_source shows the exact path used. The actor has no internal scheduler — schedule it in Apify for a continuous feed.


Pair Zhihu with the rest of the NexGenData Chinese-social fleet for full cross-platform coverage:

  • Weibo Hot Search Tracker — China's #1 social-trending barometer (微博热搜榜): the fast real-time counterpart to Zhihu's long-form signal.
  • RedNote (Xiaohongshu) Scraper — trending posts, feeds and notes from Xiaohongshu (小红书 / RedNote) for beauty/fashion/lifestyle consumer intent.
  • Bilibili Video Search — keyword search across Bilibili (B站), the long-form video, gaming and education hub of mainland China.
  • Douyin Trending Tracker — the live Douyin (抖音 / Chinese TikTok) hot list for short-video trend signal.
  • Kuaishou Trending Tracker — trending short-video signal from Kuaishou (快手), skewing lower-tier-city audiences.
  • Douban Tracker — China movie/TV ratings and hot lists from Douban (豆瓣), the cultural-taste signal.
  • China Trends Tracker — cross-platform Chinese trend roll-up (Weibo / Baidu / Toutiao / Douyin).
  • Chinese Social Signals MCP — MCP server that plugs the whole Chinese-social fleet directly into Claude, ChatGPT and Cursor.

Notes & limits

  • Residential proxy strongly recommended. Datacenter IPs are blocked / captcha-gated; the default proxy group is RESIDENTIAL.
  • Maintenance fallback is by design. A status: "maintenance" row means the feed was temporarily blocked — retry shortly.
  • Excerpt is capped at 500 characters of the highest-voted answer, HTML stripped.
  • Pay-per-event billing. You pay per delivered Q&A record.
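
Since a maintenance row signals a temporary block, a consumer may want to detect it and try again later. The Python sketch below assumes the stub row is a dict with `"status": "maintenance"`, as described above; `fetch_with_retry` and the `fetch` callable (whatever starts a run and returns its dataset items) are hypothetical and would need to be wired to your own client code.

```python
import time
from typing import Callable

MAINTENANCE_STATUS = "maintenance"


def is_maintenance(items: list[dict]) -> bool:
    """True if the dataset contains the maintenance stub row."""
    return any(item.get("status") == MAINTENANCE_STATUS for item in items)


def fetch_with_retry(
    fetch: Callable[[], list[dict]],
    attempts: int = 3,
    delay_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> list[dict]:
    """Call fetch() until it returns a result without the stub, or attempts run out."""
    for attempt in range(attempts):
        items = fetch()
        if not is_maintenance(items):
            return items
        if attempt < attempts - 1:
            sleep(delay_s)  # wait before starting the next run
    return []  # still blocked after all attempts
```

The `sleep` parameter is injectable so the retry logic can be tested without real delays; the default delay is an arbitrary choice, not a documented recommendation.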