Housing Property Scraper avatar

Housing Property Scraper

Pricing

$4.49/month + usage

Go to Apify Store
Housing Property Scraper

Housing Property Scraper

Housing Property Scraper is a web scraping tool that extracts property listings from housing websites. It collects key details like title, price, location, amenities, images, and contact info. Ideal for real estate analysis, market research, and generating structured datasets for further processing.

Pricing

$4.49/month + usage

Rating

5.0

(2)

Developer

ScrapeAI

ScrapeAI

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Housing Search Scraper 🏘️

A powerful web scraper built to extract property listings from Housing.com search result pages. It collects detailed information such as property title, price, location, configuration (BHK), area, amenities, and listing URLs. Ideal for real estate data analysis, market research, and lead generation.

Features

  • Collects property data by intercepting API responses from Housing.com.
  • Handles infinite scroll and dropdown interactions automatically.
  • Supports configurable search URLs and per-search limits.
  • Includes single entry points (src/main.js) demonstrating different variable naming schemes with identical logic.

Getting Started

  1. Install dependencies

    $npm install
  2. Configure input

    • Edit INPUT.json or provide actor input through the Apify platform.
    • Example INPUT.json:
      {
      "searchUrls": [
      "https://www.housing.com/in/buy/search?..."
      ],
      "headless": true
      }
  3. Run locally

    $node src/main.js
  4. Docker / Actor

    • The Dockerfile in the repository can be used to build an image.
    • The .actor/actor.json file defines the actor specification for deployment on Apify.

File Overview

  • src/main.js – primary script with standard variable names.
  • src/scraper.js – shared scraping logic used by the entry point.

Input & Output Schemas

The actor uses JSON Schema definitions to validate inputs and dataset records:

  • Input schema: .actor/input_schema.json – specifies required searchUrls array and optional maxCrawlPerSearch integer.
  • Dataset schema: .actor/dataset_schema.json – very detailed schema describing the structure of scraped property objects.
    Certain fields such as hasAutoVideo and coverImage.videoUrl are optional, so that older records without video metadata still validate.

These files are referenced in .actor/actor.json and are used by the Apify platform when running the actor.

Logs & Storage

  • API requests/responses are saved under storage/logs.
  • Collected dataset entries are stored via Apify's dataset storage (schema defined in .actor/dataset_schema.json).

License

This project is provided as-is. Feel free to adapt and extend for your own use.