Japan Kanpou Official Gazette Scraper (官報)

Extract notices from Kanpou (官報), Japan's official gazette. Crawls 号外 (special), 本紙 (main), and procurement editions. Structured parsing for bankruptcy (破産), civil rehabilitation, and dissolution notices extracts debtor, court, case number, trustee, and claim deadline. Japan's equivalent of the Bundesanzeiger.

Pricing

Pay per event

Developer

BowTiedRaccoon

Maintained by Community

Last modified

2 days ago

Japan Kanpou Official Gazette Scraper

Scrape notices from Kanpou (官報), Japan's official government gazette. Crawls 号外 (special), 本紙 (main), and procurement editions across the 90-day rolling archive. Structured parsing for bankruptcy (破産), civil rehabilitation (民事再生), dissolution (解散), and liquidation (清算) notices — extracts debtor, court, case number, trustee, and claim deadline.


Kanpou Scraper Features

  • Crawls all four Kanpou editions: 本紙 (main), 号外 (special), 政府調達 (procurement), 特別号外 (special extra)
  • Returns issue date, issue number, notice category, and direct PDF URL for every notice
  • Structured field extraction for bankruptcy and civil-rehab notices: debtor, address, 法人番号 (hojin bangou), court, case number, trustee, claim deadline
  • Filter by notice category — bankruptcy, civil rehab, dissolution, liquidation, auction, statutory orders, or all
  • Returns both Gregorian and wareki (令和) date formats
  • No login, no proxy — kanpou.npb.go.jp serves traffic publicly
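
As an illustration of the dual-calendar feature, here is a minimal sketch of how a Gregorian date maps to a 令和 (Reiwa) wareki string. The function name `to_wareki` is hypothetical and this is not the actor's own code; it assumes Reiwa-era dates only and writes the first era year as 元年, which may or may not match what the actor emits.

```python
from datetime import date

REIWA_START = date(2019, 5, 1)  # first day of the 令和 era


def to_wareki(d: date) -> str:
    """Format a Gregorian date as a 令和 (Reiwa) wareki string."""
    if d < REIWA_START:
        raise ValueError("only Reiwa-era dates are handled in this sketch")
    era_year = d.year - 2018  # 2019 is Reiwa 1
    # Assumption: year 1 is written 元年, as is conventional in Japanese documents.
    year_label = "元" if era_year == 1 else str(era_year)
    return f"令和{year_label}年{d.month}月{d.day}日"


print(to_wareki(date(2026, 5, 8)))  # 令和8年5月8日
```

The printed value matches the `issue_date_wareki` format shown in the sample output record.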

Who Uses Kanpou Data?

  • Credit risk teams — Detect new corporate bankruptcies and civil-rehab filings against Japanese counterparties in near-real time
  • M&A and distressed-asset desks — Track dissolution and liquidation notices for acquisition opportunities
  • Compliance and KYC vendors — Enrich Japanese entity records with bankruptcy / sanctioned-counterparty flags
  • Legal and IP firms — Monitor trustee appointments and claim-filing deadlines on bankruptcy cases of interest
  • Financial researchers — Build longitudinal datasets of Japanese corporate failures by sector and region

How the Kanpou Scraper Works

  1. Configure scope — Pick a date range (within the free 90-day rolling archive), edition types, and notice categories.
  2. Index crawl — The scraper walks the issue index for each date in range, collecting notice metadata from the section listings.
  3. PDF extraction (optional) — With extractPdfText enabled, each notice PDF is downloaded and parsed for structured fields. About 3x slower than metadata-only mode, and required for bankruptcy/civil-rehab field extraction.
  4. Export — One record per notice in your Apify dataset, with both raw and structured fields populated.
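
The scope step above can be sketched as a small helper that clamps a requested date range to the archive window before building the run input. This is a sketch under assumptions: `build_run_input` and `ARCHIVE_DAYS` are hypothetical names, and treating the archive as exactly "today minus 90 days" is a guess at how the rolling window is measured.

```python
from datetime import date, timedelta

ARCHIVE_DAYS = 90  # assumed window size, from the "90-day rolling archive" wording


def build_run_input(date_from, date_to, today,
                    issue_types=("gougai",),
                    categories=("hasan", "minji_saisei"),
                    extract_pdf_text=True, max_items=500):
    """Clamp the requested range to the archive window and build the input dict."""
    earliest = today - timedelta(days=ARCHIVE_DAYS)
    start = max(date_from, earliest)
    end = min(date_to, today)
    if start > end:
        raise ValueError("requested range lies entirely outside the archive window")
    return {
        "dateFrom": start.isoformat(),
        "dateTo": end.isoformat(),
        "issueTypes": list(issue_types),
        "noticeCategories": list(categories),
        "extractPdfText": extract_pdf_text,
        "maxItems": max_items,
    }


run_input = build_run_input(date(2026, 1, 1), date(2026, 5, 11), today=date(2026, 5, 11))
print(run_input["dateFrom"], run_input["dateTo"])  # 2026-02-10 2026-05-11
```

Clamping on the client side avoids asking for dates that cannot be served from the free archive.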

Input

Recent bankruptcy and civil-rehab notices

{
  "dateFrom": "2026-04-01",
  "dateTo": "2026-05-11",
  "issueTypes": ["gougai"],
  "noticeCategories": ["hasan", "minji_saisei"],
  "extractPdfText": true,
  "maxItems": 500
}

All notice categories, metadata only

{
  "dateFrom": "2026-05-01",
  "dateTo": "2026-05-11",
  "issueTypes": ["honshi", "gougai"],
  "noticeCategories": ["all"],
  "extractPdfText": false,
  "maxItems": 0
}

Dissolutions and liquidations only

{
  "issueTypes": ["gougai"],
  "noticeCategories": ["kaisan", "seisan"],
  "extractPdfText": true,
  "maxItems": 200
}

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| dateFrom | string | | Start of issue-date range (YYYY-MM-DD). Empty = 90-day archive start. |
| dateTo | string | | End of issue-date range (YYYY-MM-DD). Empty = today. |
| issueTypes | array | ["gougai"] | Editions: honshi, gougai, seifu_chotatsu, tokubetsu_gougai. |
| noticeCategories | array | bankruptcy / civil rehab / dissolution / liquidation | hasan, minji_saisei, kaisan, seisan, koubai, kokuji, seirei, shourei, or all. |
| extractPdfText | boolean | true | Download and parse each PDF. Required for structured bankruptcy fields. |
| maxItems | integer | 10 | Cap on records. 0 = unlimited. |
| proxyConfiguration | object | none | Proxy settings. Off by default. |
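
For readers who prefer to start runs from code, the following is a hedged sketch using the `apify-client` Python package. `ACTOR_ID` is a hypothetical placeholder (the actor's real ID is not given here), `fetch_notices` is an illustrative helper name, and the token is assumed to live in an `APIFY_TOKEN` environment variable.

```python
import os

ACTOR_ID = "<username>/<actor-name>"  # hypothetical placeholder, replace with the real actor ID

RUN_INPUT = {
    "issueTypes": ["gougai"],
    "noticeCategories": ["hasan", "minji_saisei"],
    "extractPdfText": True,
    "maxItems": 50,
}


def fetch_notices(actor_id, run_input, token):
    """Start the actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so this module loads even without the package installed
    # (pip install apify-client).
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("actor run did not return a result")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage (needs network access and a valid token):
#   notices = fetch_notices(ACTOR_ID, RUN_INPUT, os.environ["APIFY_TOKEN"])
#   for n in notices:
#       print(n.get("issue_date"), n.get("title"))
```

Omitting `dateFrom` and `dateTo` relies on the defaults in the table above (archive start through today).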

Kanpou Scraper Output Fields

{
  "issue_date": "2026-05-08",
  "issue_date_wareki": "令和8年5月8日",
  "issue_type": "号外",
  "issue_number": "第104号",
  "notice_id": "13",
  "notice_category": "公告",
  "notice_subcategory": "裁判所",
  "title": "破産手続開始決定",
  "body_text": "債務者 株式会社サンプル工業 (東京都港区赤坂1-2-3) について破産手続開始決定...",
  "entity_name": "株式会社サンプル工業",
  "entity_address": "東京都港区赤坂1-2-3",
  "entity_corporate_number": "8010001012345",
  "court_name": "東京地方裁判所",
  "case_number": "令和8年(フ)第1234号",
  "trustee_name": "山田 太郎",
  "trustee_contact": "東京都千代田区霞が関3-2-1 03-1234-5678",
  "claim_deadline": "2026-07-15",
  "pdf_url": "https://kanpou.npb.go.jp/20260508/20260508g00104/20260508g001040013f.html",
  "source_url": "https://kanpou.npb.go.jp/20260508/20260508g00104/"
}

| Field | Type | Description |
| --- | --- | --- |
| issue_date | string | Kanpou issue date (Gregorian YYYY-MM-DD) |
| issue_date_wareki | string | Issue date in wareki format (e.g. 令和8年5月11日) |
| issue_type | string | Edition: 本紙, 号外, 政府調達, 特別号外 |
| issue_number | string | Kanpou issue number (e.g. 第104号) |
| notice_id | string | Unique notice ID within the issue (page number) |
| notice_category | string | Notice category heading (e.g. 公告, 省令, 告示) |
| notice_subcategory | string | Subcategory (e.g. 裁判所, 会社その他) |
| title | string | Notice title text |
| body_text | string | Extracted notice body text |
| entity_name | string | Debtor / entity name for bankruptcy and dissolution notices |
| entity_address | string | Debtor address |
| entity_corporate_number | string | 13-digit 法人番号 (hojin bangou) when present |
| court_name | string | Court name for judicial notices (e.g. 東京地方裁判所) |
| case_number | string | Case number 事件番号 (e.g. 令和8年(フ)第1234号) |
| trustee_name | string | Bankruptcy trustee or liquidator (破産管財人 / 清算人) |
| trustee_contact | string | Trustee contact address or phone |
| claim_deadline | string | Claim filing deadline (YYYY-MM-DD) |
| pdf_url | string | Direct PDF URL for the notice |
| source_url | string | Kanpou issue index URL |
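
Downstream code often needs the `case_number` string split into parts. The sketch below is one possible approach, not part of the actor: `parse_case_number` and `CaseNumber` are hypothetical names, only the 令和 and 平成 eras are handled, and both ASCII and full-width parentheses are accepted on the assumption that either may appear in the data.

```python
import re
from typing import NamedTuple, Optional

# Offset to add to an era year to get the Gregorian year.
ERA_OFFSETS = {"令和": 2018, "平成": 1988}

CASE_RE = re.compile(r"(令和|平成)(元|\d+)年[（(](.+?)[）)]第(\d+)号")


class CaseNumber(NamedTuple):
    era: str
    era_year: int
    gregorian_year: int
    code: str     # case-type marker between the parentheses, e.g. フ
    serial: int


def parse_case_number(text: str) -> Optional[CaseNumber]:
    """Return the parsed case number, or None if the text does not match."""
    m = CASE_RE.search(text)
    if m is None:
        return None
    era, year_raw, code, serial = m.groups()
    era_year = 1 if year_raw == "元" else int(year_raw)
    return CaseNumber(era, era_year, ERA_OFFSETS[era] + era_year, code, int(serial))


print(parse_case_number("令和8年(フ)第1234号"))
```

Returning `None` on a miss keeps the helper safe to call on records from categories that carry no case number.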

FAQ

How do I scrape Japan's official gazette?

Kanpou Scraper handles it. Set a date range within the 90-day rolling archive, pick edition types and notice categories, and run. With extractPdfText enabled, you get structured fields (debtor, court, trustee, deadline) for bankruptcy and civil-rehab notices.

How much does this actor cost to run?

Kanpou Scraper uses pay-per-event pricing on the default_2603_basic profile at a 1.0x coefficient. No proxy fees. A 500-notice run with PDF extraction enabled costs cents in platform fees.

Can I get historical Kanpou notices beyond 90 days?

Kanpou Scraper covers the free 90-day rolling archive at kanpou.npb.go.jp. Older notices are paywalled by the National Printing Bureau and are not scraped by this actor.

What categories include structured field extraction?

Kanpou Scraper extracts structured debtor/court/trustee/deadline fields for bankruptcy (破産) and civil rehabilitation (民事再生) notices. Dissolution and liquidation notices populate entity name, address, and 法人番号. Other categories return title and body text only.
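
Because only some categories carry structured fields, consumer code should tolerate missing values. A minimal sketch, assuming records are plain dicts shaped like the sample output and that an unpopulated `claim_deadline` is absent, null, or an empty string (the exact representation is not documented here); `upcoming_deadlines` is a hypothetical helper name.

```python
from datetime import date


def upcoming_deadlines(records, today, within_days=30):
    """List (days_left, entity_name, case_number) for deadlines falling in the window."""
    hits = []
    for rec in records:
        raw = rec.get("claim_deadline")
        if not raw:
            continue  # category without structured extraction, or field not found
        try:
            deadline = date.fromisoformat(raw)
        except ValueError:
            continue  # skip values that are not YYYY-MM-DD
        days_left = (deadline - today).days
        if 0 <= days_left <= within_days:
            hits.append((days_left, rec.get("entity_name"), rec.get("case_number")))
    return sorted(hits, key=lambda h: h[0])


sample = [
    {"entity_name": "株式会社サンプル工業", "case_number": "令和8年(フ)第1234号",
     "claim_deadline": "2026-07-15"},
    {"title": "告示", "body_text": "..."},  # no structured fields
]
print(upcoming_deadlines(sample, today=date(2026, 6, 20)))
```

With the sample above, the first record is reported with 25 days remaining and the second is skipped.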

Does this actor need proxies?

Kanpou Scraper runs proxy-free. kanpou.npb.go.jp is a Japanese government site that serves traffic publicly without rate-limiting at the volumes this actor produces.


Need More Features?

Need historical-archive access beyond 90 days, kanji-to-romaji normalization, or webhook delivery on new bankruptcy filings? Open an issue or get in touch.

Why Use Kanpou Scraper?

  • Structured judicial fields — Bankruptcy and civil-rehab notices have debtor, court, case number, trustee, and claim-deadline parsed into separate columns. Most gazette scrapers stop at title and PDF link.
  • 法人番号 when present — The 13-digit hojin bangou is extracted directly from notice text when included, which makes joining to Japanese corporate registries trivial.
  • Both calendars — Returns issue date in both Gregorian and wareki (令和) formats, so downstream tools and humans can both read it.
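
Before joining on 法人番号, it can help to sanity-check the extracted value, since text pulled from PDFs may contain misread digits. The sketch below applies the check-digit rule as published for corporate numbers, to the best of my understanding: the leading digit equals 9 minus the weighted sum of the other 12 digits modulo 9, with weights alternating 1 and 2 from the rightmost digit. The function names are hypothetical and the actor itself is not known to perform this check.

```python
def corporate_number_check_digit(base12: str) -> int:
    """Compute the leading check digit for a 12-digit base number."""
    if len(base12) != 12 or not (base12.isascii() and base12.isdigit()):
        raise ValueError("base number must be exactly 12 ASCII digits")
    total = 0
    # n counts digit positions from the rightmost digit, starting at 1.
    for n, ch in enumerate(reversed(base12), start=1):
        weight = 1 if n % 2 == 1 else 2
        total += int(ch) * weight
    return 9 - (total % 9)


def is_valid_corporate_number(number: str) -> bool:
    """True if the value is 13 ASCII digits and its first digit matches the check digit."""
    if len(number) != 13 or not (number.isascii() and number.isdigit()):
        return False
    return int(number[0]) == corporate_number_check_digit(number[1:])


print(is_valid_corporate_number("7000012050002"))  # True
print(is_valid_corporate_number("8010001012345"))  # False
```

The second value is the illustrative number from the sample output record; it does not satisfy the check, which is unsurprising for placeholder data.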