Pricing

$1.00 / 1,000 results

Real Estate Listing Extractor

Extract structured data from a SINGLE public real-estate listing page: address, price, beds, baths, area, property type, sale/rent, year built, agent, images, geo. schema.org JSON-LD -> OpenGraph -> heuristics. Pure code, SSRF-guarded, cost-safe (no proxy/headless/AI). Single-page, not bulk.

Developer

Ahmed Moussa

Last modified

a month ago

Real Estate Listing Extractor (single page)

Turn a single public real-estate listing page URL into a clean, structured JSON record — deterministically, with no AI, no proxy, and no headless browser.

What it does

Given one listing-page URL (or a small bounded batch), the actor fetches each page once and extracts structured listing data, preferring embedded schema.org markup and falling back to OpenGraph tags and conservative heuristics. It is built on the same SSRF-guarded fetch core as the other OMEGA single-page extractors, with a deterministic real-estate parser on top. It is pure code: every field is computed, never guessed by a language model.

Input

Field	Type	Description
`url`	string	A single public real-estate listing page URL (include `https://`).
`urls`	array	Optional bounded list of extra listing URLs (max 50 per run).

Example input:

{
  "url": "https://www.example-realty.com/listing/123-maple-st"
}
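
The constraints in the input table can be checked before a run is started. The sketch below is a hypothetical client-side helper (`build_input` is not part of the actor). It assumes the only rules are the ones stated above: every URL carries an explicit scheme and `urls` holds at most 50 entries. Accepting `http://` as well as `https://` is an assumption.

```python
from typing import List, Optional
from urllib.parse import urlparse

MAX_EXTRA_URLS = 50  # "max 50 per run", from the input table


def _check_url(url: str) -> str:
    """Return the URL unchanged if it has an http(s) scheme and a host."""
    parts = urlparse(url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"not an absolute http(s) URL: {url!r}")
    return url


def build_input(url: str, urls: Optional[List[str]] = None) -> dict:
    """Build the actor input dict, enforcing the documented limits."""
    extra = list(urls or [])
    if len(extra) > MAX_EXTRA_URLS:
        raise ValueError(f"urls has {len(extra)} entries; the limit is {MAX_EXTRA_URLS}")
    payload = {"url": _check_url(url)}
    if extra:
        payload["urls"] = [_check_url(u) for u in extra]
    return payload
```

Rejecting a scheme-less URL locally avoids spending a run on an input that cannot be fetched.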

Output

One dataset item per URL:

{
  "url": "https://www.example-realty.com/listing/123-maple-st",
  "status": "completed",
  "address": "123 Maple St, Austin, TX, 78701, US",
  "price": "675000",
  "currency": "USD",
  "beds": "4",
  "baths": "2.5",
  "area_sqft": "2400",
  "property_type": "singlefamilyresidence",
  "listing_type": "sale",
  "year_built": "1998",
  "lot_size": "6997",
  "agent": "Acme Realty",
  "images": ["https://.../a.jpg", "https://.../b.jpg"],
  "description": "Charming family home with garden.",
  "geo": { "lat": "30.2672", "lng": "-97.7431" },
  "raw_prices": ["675000"],
  "method": "jsonld_realestate",
  "parse_confidence": "high",
  "extracted_at": "2026-06-24T10:00:00+00:00",
  "error": null
}

status is one of completed, failed, blocked, or empty. Any field that the page does not declare is returned as null (or []) — never invented.
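
In the example item the numeric fields arrive as strings, so a consumer that wants typed columns has to convert them. The following is a minimal sketch of such a conversion, assuming the field names shown in the example. `to_row` and `_to_float` are hypothetical helpers, and skipping every item whose status is not `completed` is a design choice of this sketch, not documented actor behaviour.

```python
from typing import Any, Optional


def _to_float(value: Any) -> Optional[float]:
    """Convert a string such as "2.5" to a float; keep null as None."""
    if value is None:
        return None
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def to_row(item: dict) -> Optional[dict]:
    """Flatten one dataset item into a typed row, or None if it did not complete."""
    if item.get("status") != "completed":
        return None
    geo = item.get("geo") or {}
    return {
        "url": item.get("url"),
        "address": item.get("address"),
        "price": _to_float(item.get("price")),
        "currency": item.get("currency"),
        "beds": _to_float(item.get("beds")),
        "baths": _to_float(item.get("baths")),
        "area_sqft": _to_float(item.get("area_sqft")),
        "lat": _to_float(geo.get("lat")),
        "lng": _to_float(geo.get("lng")),
        "image_count": len(item.get("images") or []),
    }
```

Fields the page did not declare stay `None` in the row, which keeps the "never invented" guarantee intact downstream.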

Use cases

Normalise a listing URL into a row for a CRM, spreadsheet, or database.
Pull price / beds / baths / area for a comparables (comps) sheet.
Monitor a single listing's price and status over time.
Enrich an internal dataset of listing URLs with structured fields.

How it works

Extraction precedence (most reliable first); the layer used is reported in method:

1. schema.org JSON-LD: RealEstateListing, Residence (House, Apartment, SingleFamilyResidence, …), Accommodation/Place, and Product/Offer (price, currency, address, bedrooms, bathrooms, floor/lot size, year built, geo, images, broker/seller, sale-vs-rent via businessFunction).
2. OpenGraph / product meta: og:title, og:image, product:price:amount, product:price:currency, location meta.
3. Meta / heuristics: <title>/<h1> plus a conservative, currency-marked price detector (never infers a price from a bare number).
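
The precedence above can be illustrated for a single field. The sketch below extracts only the price, trying JSON-LD first, then the `product:price:*` meta tags, then a currency-marked pattern in the visible text. It is not the actor's parser: `extract_price`, the `method` labels, the regular expression, and the symbol-to-currency table are all assumptions made for illustration, and only the Python standard library is used.

```python
import json
import re
from html.parser import HTMLParser


class _Collector(HTMLParser):
    """Collect JSON-LD script bodies, <meta property=...> values, and visible text."""

    def __init__(self):
        super().__init__()
        self.jsonld, self.meta, self.text = [], {}, []
        self._in_ld = self._skip = False

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "script" and (a.get("type") or "").lower() == "application/ld+json":
            self._in_ld = True
            self.jsonld.append("")
        elif tag in ("script", "style"):
            self._skip = True  # ignore code and CSS when collecting text
        elif tag == "meta" and a.get("property") and a.get("content"):
            self.meta.setdefault(a["property"], a["content"])

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            self._in_ld = self._skip = False

    def handle_data(self, data):
        if self._in_ld:
            self.jsonld[-1] += data
        elif not self._skip:
            self.text.append(data)


def _walk(node):
    """Yield every dict nested anywhere inside a decoded JSON-LD value."""
    if isinstance(node, dict):
        yield node
        for value in node.values():
            yield from _walk(value)
    elif isinstance(node, list):
        for value in node:
            yield from _walk(value)


# A currency symbol is required, so a bare number never counts as a price.
_PRICE_RE = re.compile(r"([$£€])\s?(\d[\d,]*(?:\.\d+)?)")
_SYMBOLS = {"$": "USD", "£": "GBP", "€": "EUR"}  # simplification: "$" is not always USD


def extract_price(html: str) -> dict:
    """Return price, currency, and the layer that produced them."""
    page = _Collector()
    page.feed(html)
    # Layer 1: schema.org JSON-LD (any nested node carrying a "price" key).
    for raw in page.jsonld:
        try:
            doc = json.loads(raw)
        except ValueError:
            continue  # malformed block: skip it rather than guess
        for node in _walk(doc):
            if node.get("price") is not None:
                return {"price": str(node["price"]),
                        "currency": node.get("priceCurrency"), "method": "jsonld"}
    # Layer 2: OpenGraph / product meta tags.
    amount = page.meta.get("product:price:amount")
    if amount:
        return {"price": amount,
                "currency": page.meta.get("product:price:currency"), "method": "opengraph"}
    # Layer 3: currency-marked price in the visible text.
    match = _PRICE_RE.search(" ".join(page.text))
    if match:
        return {"price": match.group(2).replace(",", ""),
                "currency": _SYMBOLS[match.group(1)], "method": "heuristic"}
    return {"price": None, "currency": None, "method": None}
```

Returning as soon as a higher layer matches is what makes the reported `method` meaningful: a lower layer is consulted only when every layer above it found nothing.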

Areas declared in square metres (unitCode MTK) are converted to square feet. parse_confidence (high/medium/low/none) is computed in code, not by a model, and reflects which layer matched and how many core fields were found.
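
A short sketch of both ideas follows. The conversion uses the standard factor of about 10.7639 square feet per square metre. The confidence rule is purely illustrative: the set of core fields, the layer names, and the thresholds are invented here, since the actor's actual rule is not documented in this README. Treating any unit other than MTK as already being square feet is also an assumption.

```python
from typing import Optional

SQFT_PER_SQM = 10.7639  # 1 square metre is about 10.7639 square feet

# Assumed set of "core" fields; the real list is not documented.
CORE_FIELDS = ("address", "price", "beds", "baths", "area_sqft")


def area_to_sqft(value: float, unit_code: Optional[str]) -> int:
    """Convert an area to whole square feet; only MTK (square metres) is converted."""
    if unit_code == "MTK":
        return round(value * SQFT_PER_SQM)
    return round(value)  # assumption: anything else is already in square feet


def parse_confidence(layer: Optional[str], item: dict) -> str:
    """Hypothetical scoring from the matched layer and the count of core fields."""
    found = sum(item.get(field) is not None for field in CORE_FIELDS)
    if layer is None or found == 0:
        return "none"
    if layer == "jsonld" and found >= 3:
        return "high"
    if layer in ("jsonld", "opengraph") and found >= 2:
        return "medium"
    return "low"
```

For example, a lot declared as 650 with unitCode MTK converts to 6997 square feet under this factor.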

Cost-safety

No proxy, no headless browser, no LLM, no paid API. One bounded HTTP GET per URL (hard caps: 5s connect / 10s read / 2 MB / 3 redirects).
No idle cost and no running cost beyond Apify compute, so there is nothing to subsidise.
SSRF-guarded and fail-closed: private/loopback/reserved IPs are blocked, with per-redirect re-validation, and a domain blocklist for bot-walled portals.
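
The guard described in the last point can be sketched with the standard library. This is an illustration under stated assumptions, not the actor's fetch core: `is_blocked_ip`, `resolve_host`, `is_url_allowed`, and the `blocked_domains` parameter are hypothetical names. To get per-redirect re-validation, a caller would run the same check again on every redirect target before following it.

```python
import ipaddress
import socket
from typing import List
from urllib.parse import urlparse


def is_blocked_ip(ip: str) -> bool:
    """True for addresses that must never be fetched (private, loopback, reserved, ...)."""
    addr = ipaddress.ip_address(ip)
    return (addr.is_private or addr.is_loopback or addr.is_reserved
            or addr.is_link_local or addr.is_multicast or addr.is_unspecified)


def resolve_host(host: str) -> List[str]:
    """Return every IP the host maps to; IP literals are returned without a DNS lookup."""
    try:
        return [str(ipaddress.ip_address(host))]
    except ValueError:
        infos = socket.getaddrinfo(host, None, proto=socket.IPPROTO_TCP)
        return sorted({info[4][0] for info in infos})


def is_url_allowed(url: str, blocked_domains: frozenset = frozenset()) -> bool:
    """Fail closed: a parse error, DNS failure, or any blocked address rejects the URL."""
    try:
        parts = urlparse(url)
        host = parts.hostname
        if parts.scheme not in ("http", "https") or not host:
            return False
        # Domain blocklist: match the domain itself and any subdomain.
        if any(host == d or host.endswith("." + d) for d in blocked_domains):
            return False
        ips = resolve_host(host)
        # Every resolved address must be public, not just the first one.
        return bool(ips) and not any(is_blocked_ip(ip) for ip in ips)
    except (ValueError, OSError):
        return False
```

A check of this kind only helps if the connection is then made to one of the validated addresses; validating a hostname and letting the HTTP client resolve it again leaves a window for DNS rebinding.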

Limitations (honest)

This is single-page extraction, not bulk portal/MLS scraping. It fetches the page you give it and never follows links or paginates.
It only reads server-rendered markup. Pages that render entirely in the browser (heavy client-side JS) will expose little to a plain GET and return a low-confidence result.
Many large portals (Zillow, Realtor.com, Redfin, Rightmove, Zoopla, …) block bots and/or forbid scraping in their ToS — these are on a blocklist and return status: "blocked". Point the actor at a brokerage's or publisher's own listing page that exposes schema.org markup for best results.
Fields are only as good as the page's structured data. Missing data is returned as null; the actor never fabricates a value.
