❓ Zhihu Search Scraper

Pricing

$3.00 / 1,000 results

Extract Zhihu search data — title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

Developer

Jackie Chen

Maintained by Community

Last modified: a day ago

zhihu-search-scraper

Search Zhihu (知乎), China's largest Q&A and knowledge-sharing platform, by keyword. The Actor returns clean, structured records for each result: title, excerpt, full HTML content, upvote, comment and favorite counts, the question it belongs to, and author info. Choose what to search: answers, articles and questions (general); videos; columns; or users.

Unofficial. This Actor is not affiliated with, authorized, or endorsed by Zhihu (智者天下 / Zhihu Inc.). It is an independent tool that retrieves publicly available data via a third-party API. Use it in compliance with Zhihu's terms and all applicable laws; you are responsible for how you use the retrieved data.

What it does

  • Keyword search — give one or more keywords; the Actor paginates Zhihu search results for each until it hits maxItems or the end of results.
  • Search types — pick one:
    • general (default): answers, articles and questions.
    • video: Zhihu videos (zvideo).
    • column: Zhihu columns (专栏).
    • user: Zhihu users (people).
  • Promo / ad / "related query" / gaokao cards are filtered out automatically; only real content records are returned. Search-highlight <em> markup is stripped.
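The behaviour listed above (paginate each keyword, drop non-content cards, strip <em> highlights, stop at maxItems) can be sketched roughly as follows. This is an illustrative sketch, not the Actor's source: the page shape (`cards`, `is_end`), the card `type` values and the `fake_fetch_page` stand-in are all assumptions made for the example.

```python
import re
from typing import Callable

# Hypothetical card types treated as non-content; the Actor's real list is not documented.
NON_CONTENT_TYPES = {"promo", "ad", "related_query", "gaokao"}

EM_TAG = re.compile(r"</?em>")


def strip_highlight(text: str) -> str:
    """Remove search-highlight <em>/</em> tags, keeping the inner text."""
    return EM_TAG.sub("", text)


def search_all(keywords: list[str],
               fetch_page: Callable[[str, int], dict],
               max_items: int) -> list[dict]:
    """Paginate each keyword in turn until max_items records are collected
    in total or every keyword has reached the end of its results."""
    results: list[dict] = []
    for keyword in keywords:
        offset = 0
        while len(results) < max_items:
            page = fetch_page(keyword, offset)
            cards = page.get("cards", [])
            for card in cards:
                if card.get("type") in NON_CONTENT_TYPES:
                    continue  # skip promo / ad / related-query / gaokao cards
                record = {**card, "title": strip_highlight(card.get("title", ""))}
                results.append(record)
                if len(results) >= max_items:
                    return results
            if page.get("is_end") or not cards:
                break  # end of results for this keyword
            offset += len(cards)
    return results


def fake_fetch_page(keyword: str, offset: int) -> dict:
    """In-memory stand-in for the upstream search API (assumed shape)."""
    data = [
        {"type": "answer", "title": f"<em>{keyword}</em> one"},
        {"type": "ad", "title": "buy now"},
        {"type": "article", "title": f"{keyword} two"},
        {"type": "answer", "title": f"{keyword} three"},
    ]
    return {"cards": data[offset:offset + 2], "is_end": offset + 2 >= len(data)}
```

Calling `search_all(["AI"], fake_fetch_page, 50)` walks both fake pages and returns the three content cards with the highlight markup removed; a smaller `max_items` stops the loop early, even mid-page.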

Input

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| searchKeywords | string[] | ["人工智能"] | Keywords to search. Each is paginated independently. Chinese keywords return the most results. |
| searchType | enum | general | general / video / column / user. |
| maxItems | integer | 50 | Max total records across all keywords. |
| includeContent | boolean | true | Include the full HTML body of answers/articles. Disable to keep items small. |

Example input

{
  "searchKeywords": ["人工智能", "心理学"],
  "searchType": "general",
  "maxItems": 100,
  "includeContent": true
}
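If the input is assembled programmatically, a small helper can check it against the field table before a run is started. The sketch below is one way to do that; the helper name `build_run_input` and its validation rules (rejecting empty keyword lists and a maxItems below 1) are assumptions for illustration, not documented Actor behaviour.

```python
import json
from typing import Any

# Allowed values taken from the searchType row of the input table.
SEARCH_TYPES = ("general", "video", "column", "user")


def build_run_input(search_keywords: list[str],
                    search_type: str = "general",
                    max_items: int = 50,
                    include_content: bool = True) -> dict[str, Any]:
    """Build an input object using the documented field names and defaults."""
    keywords = [k.strip() for k in search_keywords if k.strip()]
    if not keywords:
        raise ValueError("searchKeywords needs at least one non-empty keyword")
    if search_type not in SEARCH_TYPES:
        raise ValueError(f"searchType must be one of {SEARCH_TYPES}")
    if max_items < 1:  # assumed lower bound; the listing does not state one
        raise ValueError("maxItems must be a positive integer")
    return {
        "searchKeywords": keywords,
        "searchType": search_type,
        "maxItems": max_items,
        "includeContent": include_content,
    }


run_input = build_run_input(["人工智能", "心理学"], max_items=100)
# ensure_ascii=False keeps the Chinese keywords readable in the serialized JSON.
run_input_json = json.dumps(run_input, ensure_ascii=False, indent=2)
```

The resulting `run_input` matches the example input above and can be pasted into the Actor's input editor or passed to whichever API client is used to start the run.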

Output

One dataset item per record:

{
  "id": "2044932700230914484",
  "kind": "answer",
  "title": "如何评价知乎上以赵泠为首的人工智能唱衰党?",
  "excerpt": "大家一定要对这个专业去魅 ...",
  "url": "https://www.zhihu.com/answer/2044932700230914484",
  "authorName": "赵泠",
  "authorUrlToken": "MarryMea",
  "authorHeadline": "衣沾不足惜,但使愿无违。",
  "voteupCount": 2159,
  "commentCount": 126,
  "favoritesCount": 0,
  "questionTitle": "如何评价知乎上以赵泠为首的人工智能唱衰党?",
  "questionId": "2042974223212213176",
  "createdTime": 1780329948,
  "updatedTime": 1780330499,
  "content": "<p>...full answer HTML...</p>",
  "source": "general:人工智能"
}
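When post-processing dataset items, a few small helpers are often enough. The sketch below assumes, based only on the example record, that createdTime and updatedTime are Unix timestamps in seconds and that source has the form "searchType:keyword"; neither is stated explicitly in the listing. It uses only the Python standard library.

```python
from datetime import datetime, timezone
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML fragment, ignoring all tags."""

    def __init__(self) -> None:
        super().__init__()
        self.parts: list[str] = []

    def handle_data(self, data: str) -> None:
        self.parts.append(data)


def html_to_text(html: str) -> str:
    """Reduce the `content` HTML to plain text."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts).strip()


def to_utc(epoch_seconds: int) -> datetime:
    """Convert createdTime / updatedTime, assumed to be Unix seconds, to UTC."""
    return datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)


def parse_source(source: str) -> tuple[str, str]:
    """Split `source` into (searchType, keyword), assuming 'type:keyword'."""
    search_type, _, keyword = source.partition(":")
    return search_type, keyword
```

For the record above, `parse_source(item["source"])` would give the search type and the keyword that produced the item, which is handy for grouping results when several keywords are searched in one run.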

For user results the item carries authorName, authorUrlToken, authorHeadline, followerCount, answerCount and articlesCount; for column results it carries title, articlesCount and followerCount.

Notes

  • Data is sourced live. The Zhihu upstream is occasionally flaky and can emit a transient error even for a good keyword; the Actor retries with exponential backoff, so a run that briefly errors usually still returns results.
  • Record IDs are de-duplicated within a run.
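The two notes above describe common patterns: retrying a flaky call with exponential backoff, and de-duplicating records by id within a run. A minimal sketch of both follows. The `TransientError` class, the retry count and the base delay are assumptions chosen for illustration; the Actor's actual retry parameters are not documented.

```python
import time
from typing import Callable, Iterable, Iterator, TypeVar

T = TypeVar("T")


class TransientError(Exception):
    """Hypothetical marker for a temporary upstream failure."""


def with_backoff(fn: Callable[[], T],
                 retries: int = 4,
                 base_delay: float = 1.0,
                 sleep: Callable[[float], None] = time.sleep) -> T:
    """Call fn, retrying on TransientError with delays of
    base_delay, 2 * base_delay, 4 * base_delay, and so on."""
    if retries < 0:
        raise ValueError("retries must be >= 0")
    for attempt in range(retries + 1):
        try:
            return fn()
        except TransientError:
            if attempt == retries:
                raise  # out of retries: surface the error to the caller
            sleep(base_delay * 2 ** attempt)
    raise AssertionError("unreachable")


def dedupe_by_id(records: Iterable[dict]) -> Iterator[dict]:
    """Yield each record once, keyed on its `id`, preserving first-seen order."""
    seen: set[str] = set()
    for record in records:
        if record["id"] in seen:
            continue
        seen.add(record["id"])
        yield record
```

Passing `sleep` as a parameter keeps the helper easy to test: a test can supply a function that records the delays instead of actually waiting.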