Pricing

from $3.00 / 1,000 article-scrapeds

Csdn Article Detail Scraper

Scrape full content of CSDN blog articles by URL — title, body text, HTML content, publish date, view count, and tags. Optionally translate the content to English, Indonesian, or any language. No login required.

Pricing

from $3.00 / 1,000 article-scrapeds

Rating

0.0

(0)

Developer

Romy

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

What does CSDN Article Detail Scraper do?

This Actor fetches the full content of CSDN blog articles from a list of URLs. Each article is parsed server-side — no JavaScript rendering needed. Supports optional auto-translation powered by Google Translate (no API key required).

Use cases

Content archiving — save full article text for offline storage or analysis
NLP / AI training data — collect Chinese technical articles as a dataset
Knowledge extraction — parse structured content from CSDN posts
International research — read Chinese technical content in your language via built-in translation
Pipeline integration — use alongside CSDN Article Search Scraper to go from keyword → full article content → translated

How to use

Step 1: Configure input

{
    "urls": [
        "https://blog.csdn.net/m0_58523831/article/details/120851261"
    ],
    "translateTo": "en"
}

Field	Type	Required	Description
`urls`	string[]	Yes	List of CSDN article URLs to scrape
`translateTo`	string	No	Target language code for translation. Leave empty to skip. Examples: `en`, `id`, `ja`, `ko`, `de`, `fr`

Step 2: Run and download results

Click Start and download results as JSON, CSV, or Excel from the Output tab.

Output

One object per URL:

{
  "url": "https://blog.csdn.net/m0_58523831/article/details/120851261",
  "title": "python从入门到精通——完整教程【转载】",
  "titleTranslated": "Python from beginner to proficient - complete tutorial [Reprinted]",
  "contentText": "文章目录\n一、pycharm下载安装...",
  "contentTextTranslated": "Article directory\n1. Download and install pycharm...",
  "contentHtml": "<div id=\"content_views\">...</div>",
  "publishedAt": "2021-10-19 09:50:05",
  "viewCount": "123456",
  "tags": ["Python", "入门"],
  "isVip": false,
  "contentLength": 23399,
  "translateTo": "en"
}

titleTranslated and contentTextTranslated are only present when translateTo is set.

Pricing

Event	Price
Actor start	$0.05
Per 1,000 articles (no translation)	$3.00
Per 1,000 articles (with translation)	$6.00

Examples (with translation):

100 articles → $0.05 + $0.60 = $0.65
1,000 articles → $0.05 + $6.00 = $6.05
5,000 articles → $0.05 + $30.00 = $30.05

FAQ

Does this require login or cookies? No. CSDN serves full article HTML server-side without JavaScript. Plain HTTP requests are sufficient.

Does translation require an API key? No. Translation uses Google Translate via an unofficial free endpoint. No API key or account needed.

Which languages are supported for translation? Any language supported by Google Translate. Common codes: en (English), id (Indonesian), ja (Japanese), ko (Korean), de (German), fr (French), es (Spanish).

What does isVip mean? Articles with isVip: true are behind CSDN's VIP paywall. The scraper will still return a partial preview (~500 chars) but not the full content.

What is contentHtml? The raw HTML of the article body (the #content_views element). Useful if you need to preserve formatting, code blocks, or images.

Smart Article Extractor

parseforge/article-extractor

Extract clean article content from any news, blog, or publisher site! Pull full body text, author, publish date, word count, language, reading time, images, and metadata at scale. Ideal for content research, media monitoring, SEO audits, and AI training. Start extracting articles in minutes!

ParseForge

Article Content Extractor

codingfrontend/article-content-extractor

Extract clean article content, metadata and structured information from any web page. Returns title, description, author, publish date, plain content, word count, images, and more.

Coding Frontned

Article / Content Extractor

chuckling_hemp/article-extractor

Extract the main readable content from any article or blog URL: title, author, published date, full body text, word count, lead image, and site name. Uses JSON-LD, Open Graph meta, and DOM fallbacks — fast and reliable, no login or browser needed for public pages.

Matt Cook

Article Content Extractor 📄

easyapi/article-content-extractor

Extract clean article content, metadata and structured information from any web page. Supports multiple URLs and returns well-formatted JSON with title, description, content, author, publish date and more. 🔍📄

EasyApi

145

5.0

Substack Post Content Fetcher

seemuapps/substack-post-content

Fetch the full HTML content of any public Substack post by URL. Body text, title, subtitle, tags, engagement stats, and author details.

Andrew

News Article Scraper — Newsroom & Press Release Extractor

scrapepilot/company-ok

Scrape full article content from any newsroom, press release page, or blog. Get title, author, publish date, summary, SEO keywords, word count, and full body text. Auto-discovers article links. Checkpoint resume. $5 per 1,000 articles

Scrape Pilot

AI Blog Dataset Creator

datapilot/ai-blog-dataset-creator

Smart Article Scraper Actor extracts structured article data from URLs using, and Newspaper3k. It collects title, author, publish date, tags, full content, language, and word count. Supports proxy usage, JavaScript-rendered pages, and outputs clean JSON datasets.