Pricing

from $2.00 / 1,000 search results

Zhihu Scraper — Q&A, Answers, Articles, Columns

Zhihu scraper — extract long-form Mandarin Q&A, expert answers, articles & column posts. Keyword search, question answer threads, article detail, column article list. China market research, LLM training data, competitive intel. Four operations, one clean dataset per run. No API key.

Pricing

from $2.00 / 1,000 search results

Rating

0.0

(0)

Developer

SIÁN OÜ

Actor stats

Bookmarked

106

Total users

Monthly active users

8.3 hours

Issues response

an hour ago

Last modified

[2026-06-20]

🧯 Graceful gateway-error handling. Temporary upstream error pages are now retried and routed cleanly instead of surfacing as confusing parsing failures.

[2026-05-15]

🎉 Zhihu Scraper — Launch!

Keyword Search Across Zhihu — search answers, questions, articles, and people in one call; mixed-type results dispatched to the right ID/URL schema per row
Full Question Answer Threads — pull every answer for a Zhihu question with complete HTML body (not snippets), vote counts, comment counts, and author profile
Article Detail with Full HTML — single-call extraction of any Zhihu article or column post: full HTML content, topic tags, parent column reference, and author credentials
Column (Zhuanlan) Article Lists — paginate the complete catalog of any Zhihu column by slug, ~10 articles per page
Author + Badge Data on Every Row — Zhihu blue/gold badge verification, follower counts, lifetime upvote tallies, headline bios baked into every dataset row — ready for KOL discovery workflows
18–19-Digit ID Precision — IDs preserved as strings so 64-bit Zhihu identifiers never get silently truncated by JavaScript bigint limits
HTTPS URL Normalization — all Zhihu CDN URLs (*.zhimg.com) upgraded to HTTPS automatically
Resilient Pagination — built-in retry on transient upstream errors with mandarin-aware error translation (问题不存在, 专栏不存在 → plain English)
Pay-Per-Result Pricing — Search $0.004/row, Question Answer $0.005/row, Article Detail $0.040/row, Column Article $0.004/row. Volume discounts auto-applied at SILVER/GOLD/PLATINUM/DIAMOND tiers

💎 User Benefits

Gold-standard LLM training data — full HTML answer bodies plus author credentials and vote signals make this the cleanest source of curated Mandarin Q&A for SFT and RAG fine-tuning
Long-form depth competitors don't ship — most Zhihu scrapers return excerpts only; we return the entire answer and article body with formatting and images intact
One operation, one clean dataset — no chaining, no manual cursor management, no proxy setup
No account, no API key, no Zhihu login — paste a keyword or ID, click Run, get clean JSON

🎯 Use Cases

Wei (ML Engineer, Beijing) pulls 100K+ Mandarin answer threads per month to build domain-balanced Chinese instruction-tuning datasets
Lin (Insights Lead, Shanghai agency) keyword-tracks branded questions weekly to surface unfiltered consumer sentiment in real Chinese consumer language
Anya (PM, B2B SaaS) monitors competitor mentions in Q&A threads to catch "X vs. Y" comparison content before it goes viral
Marcus (B2B Marketing Lead) shortlists Zhihu KOLs by topic + badge verification + follower count for sponsored long-form content campaigns
Chen (Data Scientist, hedge fund) runs daily keyword searches across industry watchlists to spot emerging questions for alpha-generation pipelines
Dr. Park (Stanford computational linguist) builds Mandarin discourse corpora and trains discourse-level classifiers on Chinese internet argumentation patterns

❓ Zhihu Question Answers Scraper

ethereal_wool/zhihu-question-answers-scraper

Extract Zhihu question answers data — title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

Jackie Chen

Zhihu Scraper — Hot List, Q&A & Profiles

blackfalcondata/zhihu-scraper

Scrape zhihu.com — trending hot-list questions (热榜), full Q&A answers with text and engagement counts, and author profiles as structured data. No login or API key required. Incremental mode flags new and changed records for monitoring and AI pipelines.

Black Falcon Data

Zhihu Q&A Tracker - China Hot List & Knowledge Mining

nexgendata/zhihu-qa-tracker

Scrape Zhihu (知乎), China's Quora: the daily hot list plus keyword Q&A search. Each record has the question, top-answer excerpt, voteup count, view count and category. For China social listening, consumer research and brand monitoring. No CN account needed.

NexGenData

❓ Zhihu Search Scraper

ethereal_wool/zhihu-search-scraper

Extract Zhihu search data — title, author, engagement, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

Jackie Chen

❓ Zhihu User Content Scraper

ethereal_wool/zhihu-user-content-scraper

Extract Zhihu user content data — title, and more. Scrape by keyword, URL or ID. Export to JSON, CSV & Excel, use the API, schedule runs and integrate. No code required.

Jackie Chen

Quora Scraper

sian.agency/quora-scraper

Scrape Quora questions and answers into clean datasets — question text, answer & follower counts, topics, and top answers (author, credential, upvotes, views, text). Fast overview or full per-question detail. Engagement scoring built in. No account or API key needed.

SIÁN OÜ

CSV Combiner | 💾 Merge CSV Files with Custom Column Order

amr-mando/csv-combiner

Combine up to three CSV files into one. Columns are matched by header name, so data stays under the right column even when the files order their columns differently. You choose the output column order.

Mando

Grounded Q&A: Structured Answers with Citations

aitoolbreakdown/atb-grounded-qa

Answers a natural-language question using ONLY the URLs you provide. Returns structured JSON with per-claim citations and confidence. No hallucinated sources.

AI Tool Breakdown

Reddit Answers Scraper

lexis-solutions/reddit-answers-scraper

Unlock structured AI-powered Q&A from Reddit Answers—extract organized answers, source subreddits, related posts, and suggested topics. Perfect for market research, content creation, SEO strategy, and knowledge base building. Fast, reliable, and fully customizable.

Lexis Solutions

5.0

CSV Data Profiler — Column Types, Stats and Quality Report

eliai/csv-profiler

Profile any CSV via API. Input: a CSV URL or pasted text. Output: JSON per column with detected data type, null and unique counts, min/max/mean for numeric columns, top values for categorical columns, plus data-quality warnings. Cheap flat pay-per-file pricing.

Anthony Snider

Zhihu Scraper — Q&A, Answers, Articles, Columns

Changelog

[2026-06-20]