Japan Real Estate Listings Scraper avatar

Japan Real Estate Listings Scraper

Pricing

Pay per usage

Go to Apify Store
Japan Real Estate Listings Scraper

Japan Real Estate Listings Scraper

Scrapes Japanese real estate listings from HOME'S (homes.co.jp). Extracts property metadata: price, area, address, layout, built year, station access, and more. Uses JSON-LD structured data when available for maximum reliability.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

K S

K S

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Categories

Share

A robust Apify Actor for scraping Japanese real estate listing metadata from HOME'S (homes.co.jp), one of Japan's largest property listing platforms operated by LIFULL Co., Ltd.

Key Features

  • JSON-LD First Strategy: Extracts structured data from <script type="application/ld+json"> tags containing RealEstateListing schema, ensuring high accuracy and resilience against UI changes.
  • DOM Fallback: When JSON-LD is unavailable, falls back to DOM-based extraction from property card elements.
  • Japanese Price Parsing: Handles all Japanese price formats including 万円 (10,000 JPY), (100,000,000 JPY), and mixed notation like 1億2,000万円.
  • Comprehensive Data: Extracts price, area (m²), address, layout (間取り), built year, station access, floor details, building structure, management fees, and more.
  • Rate-Limited & Polite: Built-in rate limiting (max 30 requests/minute, max concurrency 2) to respect the target site.
  • Schema Validation: Every record is validated against a Zod schema before being pushed to the dataset.
  • Deduplication: Automatic removal of duplicate listings by property ID.

Data Source

This actor scrapes from HOME'S (homes.co.jp), which publishes property listing metadata as structured data (JSON-LD with RealEstateListing schema). Only publicly available metadata is collected — no copyrighted content (photos, descriptions) is scraped.

Input Parameters

ParameterTypeDefaultDescription
areastring東京都Target prefecture (e.g. 東京都, 大阪府, 神奈川県)
propertyTypeenummansionProperty type: mansion (condo), kodate (house), tochi (land)
priceMininteger-Minimum price in 万円
priceMaxinteger-Maximum price in 万円
maxResultsinteger500Maximum number of listings to collect
sourceenumhomesData source (currently HOME'S only)

Output Fields

FieldTypeDescription
idstringUnique identifier (e.g. homes-1234567)
pricenumberPrice in 万円 (10,000 JPY units)
areanumberFloor area in m²
addressstringProperty address
layoutstringRoom layout (e.g. 2LDK, 3LDK)
builtYearnumberYear built (Western calendar)
sourcestringData source (homes)
sourceUrlstringURL to original listing
stationstring?Nearest station name
walkMinutesnumber?Walking minutes to station
floornumber?Floor number
totalFloorsnumber?Total floors in building
structurestring?Building structure (RC, SRC, etc.)
managementFeenumber?Monthly management fee (JPY)
repairReservenumber?Monthly repair reserve fund (JPY)
balconyAreanumber?Balcony area in m²
directionstring?Direction facing (南, 東南, etc.)
totalUnitsnumber?Total units in building
agencyNamestring?Real estate agency name

Usage Example

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_TOKEN' });
const run = await client.actor('your-username/jp-realestate').call({
area: '東京都',
propertyType: 'mansion',
priceMax: 5000,
maxResults: 100,
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(`Found ${items.length} listings`);
  • This actor collects only publicly available property listing metadata (prices, areas, addresses).
  • No copyrighted content (photos, detailed descriptions) is collected.
  • Respects robots.txt directives and implements polite rate limiting.
  • No personally identifiable information (PII) is collected.
  • Compliant with Japan's Copyright Act Article 30-4 (TDM exception).

Pricing

Approximately $4 per 1,000 results based on Apify platform compute units.

Support

For questions or feature requests, please open an issue on the actor's GitHub repository.