Shukatsu Kaigi Company Review Scraper
Pricing
Pay per event
Shukatsu Kaigi Company Review Scraper
Scrapes Japanese new-grad (shinsotsu) company profiles from Shukatsu-Kaigi: ratings, reviews (口コミ), and selection-experience reports (就活速報).
Pricing
Pay per event
Rating
0.0
(0)
Developer
BowTiedRaccoon
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
4 days ago
Last modified
Categories
Share
Extracts new-grad (shinsotsu) company intelligence from Shukatsu-Kaigi (就活会議) — Japan's leading university-recruiting review platform. Returns structured company profiles including ratings, review counts, and selection-experience report metadata.
What you get
Each record represents one company and includes:
- Company profile — name (Japanese), industry, headquarters prefecture, company ID
- Aggregate rating — overall score (0–5 scale) from student and employee reviews
- Review counts — total number of 口コミ (reviews) and 就活速報 (selection experience reports)
- Review summaries — respondent type, graduation year, and public text excerpt for each inline review
- Selection report metadata — graduation year, faculty (文系/理系), gender, outcome stage (内定/interview-stage), and posted date for each visible selection report
- Entry sheet count — number of ES examples on file (bodies are login-gated; count is public)
- Source URL — canonical company page link
Inputs
| Field | Type | Description |
|---|---|---|
maxItems | integer | Maximum number of companies to collect. Default: 10 |
startUrls | array | Optional list of specific company URLs (e.g. https://syukatsu-kaigi.jp/companies/108402). When provided, the scraper skips the search-listing discovery and crawls exactly those companies. |
Discovery
When no startUrls are provided, the scraper paginates through /companies/search?page=N. Each page returns ~100 company links. The full corpus contains approximately 144,000 companies.
Notes
- Review and selection-report full bodies require a registered account. The scraper extracts public text excerpts (≤200 characters) and metadata fields available to anonymous visitors.
- Entry sheet bodies are always login-gated; only the count is extracted.
- The scraper respects the site's robots.txt:
/posted_contents,/rewards,/mypage, and query-parameter-filtered paths are not crawled. - Rate: conservative 500ms delay between requests, 5 concurrent workers.
Example output
{"company_id": "108402","company_name": "三菱UFJフィナンシャル・グループ","industry": "サービス(その他)","headquarters": "東京都","overall_rating": 4,"review_count": 45,"selection_report_count": 0,"entry_sheet_count": 0,"reviews": "[]","selection_reports": "[{\"graduation_year\":2026,\"faculty\":\"理系\",\"gender\":\"男性\",\"outcome_stage\":\"内定した学生\",\"posted_at\":\"2025年5月19日\"}]","source_url": "https://syukatsu-kaigi.jp/companies/108402","scrapedAt": "2026-06-03T02:45:00.000Z"}
Use cases
- Competitive intelligence for Japanese HR-tech and university career platforms
- Research into new-grad hiring patterns by industry and company
- Dataset enrichment for recruiting agencies serving the JP shinsotsu market
- Benchmarking company selection-process difficulty and interview-experience trends