LINE OpenChat Scraper
Pricing
Pay per event
LINE OpenChat Scraper
Scrape public room metadata from LINE OpenChat — Japan's largest public chat platform, also popular in Taiwan and Thailand. Extracts room name, description, member count, hashtags, region, and more from the public OpenChat directory. No LINE account required. Supports JP, TW, and TH markets.
Pricing
Pay per event
Rating
0.0
(0)
Developer
BowTiedRaccoon
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Scrape public room metadata from LINE OpenChat — Japan's largest public chat platform, also widely used in Taiwan and Thailand. Extracts room name, description, member count, hashtags, region, and icon URL from the public OpenChat directory. No LINE account required.
What It Does
LINE OpenChat hosts millions of public chat rooms covering entertainment, gaming, K-pop fandoms, local communities, news, language exchange, and more. This actor crawls the OpenChat public directory via sitemaps and extracts structured room metadata.
Coverage:
- Japan (
jp) — largest collection, millions of rooms - Taiwan (
tw) - Thailand (
th)
Use Cases
- JP/SEA market research — discover active communities around topics, brands, or products in Japanese and Southeast Asian markets
- Fandom & entertainment analytics — track member counts and activity for K-pop, anime, manga, and gaming communities
- Language community discovery — find language exchange rooms by keyword and region
- Regional trend monitoring — identify trending topics in JP/TW/TH via active public rooms
Output
Each record represents one LINE OpenChat room:
| Field | Type | Description |
|---|---|---|
room_id | string | Unique room hash identifier |
room_url | string | Canonical URL to the room page |
room_name | string | Room display name |
room_description | string | Full room description text |
room_member_count | integer | Current number of members |
room_tags | string | Hashtags from description (comma-separated) |
room_region | string | Region: jp, tw, or th |
room_language | string | Language code: ja, zh, or th |
room_icon_url | string | Room profile image URL (LINE CDN) |
source_url | string | URL scraped to produce this record |
Input
| Parameter | Type | Default | Description |
|---|---|---|---|
maxItems | integer | 100 | Maximum number of rooms to scrape |
region | string | jp | Region: jp, tw, th, or all |
How It Works
- Sitemap discovery — walks the OpenChat sitemap index (
/sitemap_index.xml) to find room URLs for the selected region(s). Each regional sitemap contains thousands of room URLs. - Detail extraction — fetches each room page (server-rendered Nuxt HTML — no JavaScript execution needed) and extracts structured metadata from the DOM.
- Hashtag parsing — extracts
#tagpatterns from room descriptions and returns them as a comma-separated string.
No Cloudflare bypass or proxy required. The site returns full HTML to standard HTTP requests. Rooms that have been deleted or made private are automatically skipped.
Rate Limits
LINE OpenChat does not impose aggressive rate limiting. The actor runs conservatively (5 concurrent requests) to be a good citizen.
Notes
- Deleted or private rooms in the sitemap return HTTP 200 status but render a 404-style page with no content. These are detected and skipped automatically (expect 5–15% skip rate in the JP sitemap).
- Member counts reflect the room's public membership at scrape time.
- Room URLs include the region prefix (
/jp/,/tw/,/th/) which determines the language code in the output.