CASC SpaceChina Corporate News Scraper

Pricing

Pay per event

Scrapes press releases and corporate news from SpaceChina.com — the public news portal of CASC (China Aerospace Science and Technology Corporation). Extracts articles from the 集团要闻, 媒体聚焦, and 专题报道 subchannels with full body text, publish date, and subsidiary mentions.

Rating: 0.0 (0)

Developer: BowTiedRaccoon (maintained by Community)

Actor stats: 0 bookmarked, 2 total users, 1 monthly active user

Last modified: 22 days ago

Scrapes press releases and corporate news from SpaceChina.com — the public news portal of CASC (China Aerospace Science and Technology Corporation / 中国航天科技集团有限公司). Extracts structured articles from three news subchannels with full body text, publication date, image URLs, PDF attachments, and automatic CASC subsidiary detection.

What it collects

| Field | Description |
| --- | --- |
| article_id | Unique numeric ID from the article URL (e.g. 4632103) |
| subchannel | Source subchannel: 集团要闻, 媒体聚焦, or 专题报道 |
| title_zh | Article title in Chinese |
| title_en | English title (always null in this release; planned enhancement via the english.spacechina.com mirror) |
| body_html | Full article body HTML |
| body_text | Full article body as plain text |
| publish_date | Publication date (ISO 8601, e.g. 2026-06-11) |
| source_url | Canonical article URL |
| mentioned_subsidiaries | CASC academy/subsidiary names detected in body (一院 through 八院, CALT, CAST, SAST) |
| images | Absolute URLs of embedded article images |
| attachments | Absolute URLs of PDF attachments (e.g. annual social-responsibility reports) |
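
For downstream code, the fields above can be mirrored in a typed record. The following is a minimal sketch, not part of the actor itself: the `Article` class and `parse_article` helper are hypothetical names, and coercing `article_id` to a string and treating missing list fields as empty are assumptions about how a consumer might want to normalize the data.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Any, Optional


@dataclass
class Article:
    """One dataset item, mirroring the documented output fields."""
    article_id: str                # numeric ID from the URL, kept as a string (assumption)
    subchannel: str                # 集团要闻, 媒体聚焦, or 专题报道
    title_zh: str
    title_en: Optional[str]        # documented as always null in this release
    body_html: str
    body_text: str
    publish_date: date             # parsed from the ISO 8601 date string
    source_url: str
    mentioned_subsidiaries: list[str] = field(default_factory=list)
    images: list[str] = field(default_factory=list)
    attachments: list[str] = field(default_factory=list)


def parse_article(item: dict[str, Any]) -> Article:
    """Convert a raw dataset item (a dict) into an Article record."""
    return Article(
        article_id=str(item["article_id"]),
        subchannel=item["subchannel"],
        title_zh=item["title_zh"],
        title_en=item.get("title_en"),
        body_html=item["body_html"],
        body_text=item["body_text"],
        publish_date=date.fromisoformat(item["publish_date"]),
        source_url=item["source_url"],
        # Missing or null list fields are treated as empty lists (assumption).
        mentioned_subsidiaries=list(item.get("mentioned_subsidiaries") or []),
        images=list(item.get("images") or []),
        attachments=list(item.get("attachments") or []),
    )
```

Parsing `publish_date` into a `date` object makes range filters and sorting straightforward without string comparisons.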

Subchannels covered

| Key | Chinese | English |
| --- | --- | --- |
| jtyw | 集团要闻 | Group News: primary launch and operations press releases |
| mjjj | 媒体聚焦 | Media Focus: external press coverage aggregated |
| ztbd | 专题报道 | Special Reports: themed coverage (missions, events, policy) |

The actor crawls all pages within each selected subchannel, following the site's paginated listing structure automatically.

Input

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| maxItems | integer | Yes | Maximum number of articles to scrape. Set to a high value (or remove the cap) for a full historical crawl (~3,000+ articles across all subchannels). |
| subchannels | array | Yes | Which subchannels to include. Accepts any combination of jtyw, mjjj, ztbd. Default: all three. |

Example input

{
  "maxItems": 100,
  "subchannels": ["jtyw"]
}
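
When the input is generated programmatically, it can help to validate it before starting a run. The sketch below is one possible approach: `build_run_input` is a hypothetical helper, and the rule that `maxItems` must be a positive integer is an assumption rather than documented actor behavior. Only the parameter names and subchannel keys come from the table above.

```python
import json

# Subchannel keys and their Chinese names, as listed in this document.
VALID_SUBCHANNELS = {"jtyw": "集团要闻", "mjjj": "媒体聚焦", "ztbd": "专题报道"}


def build_run_input(max_items, subchannels=None):
    """Return an input dict for the actor, rejecting obviously invalid values."""
    # Assumption: maxItems should be a positive integer (bool is excluded explicitly).
    if isinstance(max_items, bool) or not isinstance(max_items, int) or max_items < 1:
        raise ValueError("maxItems must be a positive integer")

    # Fall back to all three subchannels, matching the documented default.
    keys = list(subchannels) if subchannels is not None else list(VALID_SUBCHANNELS)
    if not keys:
        raise ValueError("subchannels must not be empty")

    unknown = [key for key in keys if key not in VALID_SUBCHANNELS]
    if unknown:
        raise ValueError(f"unknown subchannel keys: {unknown}")

    return {"maxItems": max_items, "subchannels": keys}


if __name__ == "__main__":
    print(json.dumps(build_run_input(100, ["jtyw"]), indent=2))
```

The returned dict can be serialized to JSON and pasted into the actor's input editor, or passed as the run input through whichever Apify client is in use.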

Use cases

  • Defense and aerospace intelligence — Track every CASC press release mentioning specific launch vehicles, academies, or programs.
  • ESG / sanctions screening — Identify CASC subsidiaries (一院 through 八院) named in corporate announcements for military-civil fusion exposure mapping.
  • Trade compliance — Monitor export-control-relevant announcements (new satellite programs, foreign partnerships, dual-use technology disclosures).
  • Annual reports — The 专题报道 channel carries annual social-responsibility reports back to 2013 as PDF attachments.
  • Research and journalism — Build a full-text searchable archive of CASC's public-facing communications.
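
As an illustration of the annual-reports use case, the sketch below groups PDF attachment URLs from the 专题报道 subchannel by publication year. It assumes the dataset has already been downloaded as a list of dicts with the documented field names; `special_report_pdfs_by_year` is a hypothetical helper, and not every attachment in that subchannel is necessarily an annual report.

```python
from collections import defaultdict


def special_report_pdfs_by_year(items):
    """Map publication year (as a string) to attachment URLs from 专题报道 items."""
    by_year = defaultdict(list)
    for item in items:
        # The subchannel field carries the Chinese subchannel name.
        if item.get("subchannel") != "专题报道":
            continue
        year = item["publish_date"][:4]  # ISO 8601 dates start with the year
        for url in item.get("attachments") or []:
            by_year[year].append(url)
    return dict(by_year)
```

Filtering the result further, for example by keywords in `title_zh`, would be needed to isolate the social-responsibility reports specifically.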

Notes

  • Chinese-language content: All articles are in Simplified Chinese. The body_text field is suitable for NLP pipelines and translation workflows.
  • English mirror: The english.spacechina.com mirror exists but has minimal content. title_en is always null in this release.
  • Subsidiary detection: The mentioned_subsidiaries field uses pattern-matching on the body text for the eight CASC academies and their common abbreviations. It is heuristic and may miss references using full official names.
  • Historical depth: The site retains articles back to at least 2013 across all subchannels, representing the full accessible archive of CASC's public news.
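
The subsidiary-detection note above describes heuristic pattern matching. A minimal sketch of that kind of heuristic follows. It is not the actor's actual implementation: the regular expression and the `detect_subsidiaries` name are assumptions chosen for illustration, covering only the short academy names (一院 through 八院) and the three abbreviations named in this document.

```python
import re

# Short academy names 一院..八院, plus the Latin abbreviations CALT, CAST, SAST.
# The lookarounds stop "CAST" from matching inside words such as "BROADCAST".
SUBSIDIARY_PATTERN = re.compile(
    r"[一二三四五六七八]院|(?<![A-Za-z])(?:CALT|CAST|SAST)(?![A-Za-z])"
)


def detect_subsidiaries(body_text):
    """Return matched names in order of first appearance, without duplicates."""
    matches = (m.group(0) for m in SUBSIDIARY_PATTERN.finditer(body_text))
    return list(dict.fromkeys(matches))


if __name__ == "__main__":
    sample = "航天科技集团一院（CALT）与八院联合研制，五院参与。"
    print(detect_subsidiaries(sample))
    # A full official name contains none of the short forms, so nothing is found:
    print(detect_subsidiaries("中国运载火箭技术研究院"))
```

The second call shows the limitation mentioned in the note: a full official name yields no match. A pattern like this can also over-match, since longer numerals such as 十一院 contain the substring 一院, so results are best treated as candidates for review.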