LINE OpenChat Scraper avatar

LINE OpenChat Scraper

Pricing

Pay per event

Go to Apify Store
LINE OpenChat Scraper

LINE OpenChat Scraper

Scrape public room metadata from LINE OpenChat — Japan's largest public chat platform, also popular in Taiwan and Thailand. Extracts room name, description, member count, hashtags, region, and more from the public OpenChat directory. No LINE account required. Supports JP, TW, and TH markets.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Categories

Share

Scrape public room metadata from LINE OpenChat — Japan's largest public chat platform, also widely used in Taiwan and Thailand. Extracts room name, description, member count, hashtags, region, and icon URL from the public OpenChat directory. No LINE account required.

What It Does

LINE OpenChat hosts millions of public chat rooms covering entertainment, gaming, K-pop fandoms, local communities, news, language exchange, and more. This actor crawls the OpenChat public directory via sitemaps and extracts structured room metadata.

Coverage:

  • Japan (jp) — largest collection, millions of rooms
  • Taiwan (tw)
  • Thailand (th)

Use Cases

  • JP/SEA market research — discover active communities around topics, brands, or products in Japanese and Southeast Asian markets
  • Fandom & entertainment analytics — track member counts and activity for K-pop, anime, manga, and gaming communities
  • Language community discovery — find language exchange rooms by keyword and region
  • Regional trend monitoring — identify trending topics in JP/TW/TH via active public rooms

Output

Each record represents one LINE OpenChat room:

FieldTypeDescription
room_idstringUnique room hash identifier
room_urlstringCanonical URL to the room page
room_namestringRoom display name
room_descriptionstringFull room description text
room_member_countintegerCurrent number of members
room_tagsstringHashtags from description (comma-separated)
room_regionstringRegion: jp, tw, or th
room_languagestringLanguage code: ja, zh, or th
room_icon_urlstringRoom profile image URL (LINE CDN)
source_urlstringURL scraped to produce this record

Input

ParameterTypeDefaultDescription
maxItemsinteger100Maximum number of rooms to scrape
regionstringjpRegion: jp, tw, th, or all

How It Works

  1. Sitemap discovery — walks the OpenChat sitemap index (/sitemap_index.xml) to find room URLs for the selected region(s). Each regional sitemap contains thousands of room URLs.
  2. Detail extraction — fetches each room page (server-rendered Nuxt HTML — no JavaScript execution needed) and extracts structured metadata from the DOM.
  3. Hashtag parsing — extracts #tag patterns from room descriptions and returns them as a comma-separated string.

No Cloudflare bypass or proxy required. The site returns full HTML to standard HTTP requests. Rooms that have been deleted or made private are automatically skipped.

Rate Limits

LINE OpenChat does not impose aggressive rate limiting. The actor runs conservatively (5 concurrent requests) to be a good citizen.

Notes

  • Deleted or private rooms in the sitemap return HTTP 200 status but render a 404-style page with no content. These are detected and skipped automatically (expect 5–15% skip rate in the JP sitemap).
  • Member counts reflect the room's public membership at scrape time.
  • Room URLs include the region prefix (/jp/, /tw/, /th/) which determines the language code in the output.