Japan Kanpou Official Gazette Scraper (官報)
Pricing
Pay per event
Japan Kanpou Official Gazette Scraper (官報)
Extract notices from Kanpou (官報), Japan official gazette. Crawls 号外 (special), 本紙 (main), and procurement editions. Structured parsing for bankruptcy (破産), civil rehabilitation, dissolution — extracts debtor, court, case number, trustee, and claim deadline. Japan equivalent of the Bundesanzeiger.
Pricing
Pay per event
Rating
0.0
(0)
Developer
BowTiedRaccoon
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Share
Japan Kanpou Official Gazette Scraper
Scrape notices from Kanpou (官報), Japan's official government gazette. Crawls 号外 (special), 本紙 (main), and procurement editions across the 90-day rolling archive. Structured parsing for bankruptcy (破産), civil rehabilitation (民事再生), dissolution (解散), and liquidation (清算) notices — extracts debtor, court, case number, trustee, and claim deadline.
Kanpou Scraper Features
- Crawls all four Kanpou editions: 本紙 (main), 号外 (special), 政府調達 (procurement), 特別号外
- Returns date range, issue number, notice category, and direct PDF URL for every notice
- Structured field extraction for bankruptcy and civil-rehab notices: debtor, address, 法人番号 (hojin bangou), court, case number, trustee, claim deadline
- Filter by notice category — bankruptcy, civil rehab, dissolution, liquidation, auction, statutory orders, or all
- Returns both Gregorian and wareki (令和) date formats
- No login, no proxy — kanpou.npb.go.jp serves traffic publicly
Who Uses Kanpou Data?
- Credit risk teams — Detect new corporate bankruptcies and civil-rehab filings against Japanese counterparties in near-real time
- M&A and distressed-asset desks — Track dissolution and liquidation notices for acquisition opportunities
- Compliance and KYC vendors — Enrich Japanese entity records with bankruptcy / sanctioned-counterparty flags
- Legal and IP firms — Monitor trustee appointments and claim-filing deadlines on bankruptcy cases of interest
- Financial researchers — Build longitudinal datasets of Japanese corporate failures by sector and region
How the Kanpou Scraper Works
- Configure scope — Pick a date range (within the free 90-day rolling archive), edition types, and notice categories.
- Index crawl — The scraper walks the issue index for each date in range, collecting notice metadata from the section listings.
- PDF extraction (optional) — With
extractPdfTextenabled, each notice PDF is downloaded and parsed for structured fields. About 3x slower than metadata-only mode, and required for bankruptcy/civil-rehab field extraction. - Export — One record per notice in your Apify dataset, with both raw and structured fields populated.
Input
Recent bankruptcy and civil-rehab notices
{"dateFrom": "2026-04-01","dateTo": "2026-05-11","issueTypes": ["gougai"],"noticeCategories": ["hasan", "minji_saisei"],"extractPdfText": true,"maxItems": 500}
All notice categories, metadata only
{"dateFrom": "2026-05-01","dateTo": "2026-05-11","issueTypes": ["honshi", "gougai"],"noticeCategories": ["all"],"extractPdfText": false,"maxItems": 0}
Dissolutions and liquidations only
{"issueTypes": ["gougai"],"noticeCategories": ["kaisan", "seisan"],"extractPdfText": true,"maxItems": 200}
| Field | Type | Default | Description |
|---|---|---|---|
| dateFrom | string | — | Start of issue-date range (YYYY-MM-DD). Empty = 90-day archive start. |
| dateTo | string | — | End of issue-date range (YYYY-MM-DD). Empty = today. |
| issueTypes | array | ["gougai"] | Editions: honshi, gougai, seifu_chotatsu, tokubetsu_gougai. |
| noticeCategories | array | bankruptcy / civil rehab / dissolution / liquidation | hasan, minji_saisei, kaisan, seisan, koubai, kokuji, seirei, shourei, or all. |
| extractPdfText | boolean | true | Download and parse each PDF. Required for structured bankruptcy fields. |
| maxItems | integer | 10 | Cap on records. 0 = unlimited. |
| proxyConfiguration | object | none | Proxy settings. Off by default. |
Kanpou Scraper Output Fields
{"issue_date": "2026-05-08","issue_date_wareki": "令和8年5月8日","issue_type": "号外","issue_number": "第104号","notice_id": "13","notice_category": "公告","notice_subcategory": "裁判所","title": "破産手続開始決定","body_text": "債務者 株式会社サンプル工業 (東京都港区赤坂1-2-3) について破産手続開始決定...","entity_name": "株式会社サンプル工業","entity_address": "東京都港区赤坂1-2-3","entity_corporate_number": "8010001012345","court_name": "東京地方裁判所","case_number": "令和8年(フ)第1234号","trustee_name": "山田 太郎","trustee_contact": "東京都千代田区霞が関3-2-1 03-1234-5678","claim_deadline": "2026-07-15","pdf_url": "https://kanpou.npb.go.jp/20260508/20260508g00104/20260508g001040013f.html","source_url": "https://kanpou.npb.go.jp/20260508/20260508g00104/"}
| Field | Type | Description |
|---|---|---|
| issue_date | string | Kanpou issue date (Gregorian YYYY-MM-DD) |
| issue_date_wareki | string | Issue date in wareki format (e.g. 令和8年5月11日) |
| issue_type | string | Edition: 本紙, 号外, 政府調達, 特別号外 |
| issue_number | string | Kanpou issue number (e.g. 第104号) |
| notice_id | string | Unique notice ID within the issue (page number) |
| notice_category | string | Notice category heading (e.g. 公告, 省令, 告示) |
| notice_subcategory | string | Subcategory (e.g. 裁判所, 会社その他) |
| title | string | Notice title text |
| body_text | string | Extracted notice body text |
| entity_name | string | Debtor / entity name for bankruptcy and dissolution notices |
| entity_address | string | Debtor address |
| entity_corporate_number | string | 13-digit 法人番号 (hojin bangou) when present |
| court_name | string | Court name for judicial notices (e.g. 東京地方裁判所) |
| case_number | string | Case number 事件番号 (e.g. 令和8年(フ)第1234号) |
| trustee_name | string | Bankruptcy trustee or liquidator (破産管財人 / 清算人) |
| trustee_contact | string | Trustee contact address or phone |
| claim_deadline | string | Claim filing deadline (YYYY-MM-DD) |
| pdf_url | string | Direct PDF URL for the notice |
| source_url | string | Kanpou issue index URL |
FAQ
How do I scrape Japan's official gazette?
Kanpou Scraper handles it. Set a date range within the 90-day rolling archive, pick edition types and notice categories, and run. With extractPdfText enabled, you get structured fields (debtor, court, trustee, deadline) for bankruptcy and civil-rehab notices.
How much does this actor cost to run?
Kanpou Scraper uses pay-per-event pricing on the default_2603_basic profile at a 1.0x coefficient. No proxy fees. A 500-notice run with PDF extraction enabled costs cents in platform fees.
Can I get historical Kanpou notices beyond 90 days?
Kanpou Scraper covers the free 90-day rolling archive at kanpou.npb.go.jp. Older notices are paywalled by the National Printing Bureau and are not scraped by this actor.
What categories include structured field extraction?
Kanpou Scraper extracts structured debtor/court/trustee/deadline fields for bankruptcy (破産) and civil rehabilitation (民事再生) notices. Dissolution and liquidation notices populate entity name, address, and 法人番号. Other categories return title and body text only.
Does this actor need proxies?
Kanpou Scraper runs proxy-free. kanpou.npb.go.jp is a Japanese government site that serves traffic publicly without rate-limiting at the volumes this actor produces.
Need More Features?
Need historical-archive access beyond 90 days, kanji-to-romaji normalization, or webhook delivery on new bankruptcy filings? Open an issue or get in touch.
Why Use Kanpou Scraper?
- Structured judicial fields — Bankruptcy and civil-rehab notices have debtor, court, case number, trustee, and claim-deadline parsed into separate columns. Most gazette scrapers stop at title and PDF link.
- 法人番号 when present — The 13-digit hojin bangou is extracted directly from notice text when included, which makes joining to Japanese corporate registries trivial.
- Both calendars — Returns issue date in both Gregorian and wareki (令和) formats, so downstream tools and humans can both read it.