Booking Hotel Scraper avatar

Booking Hotel Scraper

Pricing

$5.00/month + usage

Go to Apify Store
Booking Hotel Scraper

Booking Hotel Scraper

It extracts hotel name, location, price per night, star rating, guest rating, review count, amenities, room types, images, and property description by navigating search results and visiting individual hotel detail pages. Suitable for price monitoring, hotel dataset creation, and market analysis.

Pricing

$5.00/month + usage

Rating

5.0

(4)

Developer

ScrapeAI

ScrapeAI

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

6 days ago

Last modified

Share

🏨 Booking.com Hotel Scraper

A powerful Apify Actor that scrapes hotel listings from Booking.com search results. It performs HTML content tag-based data extraction in two stages:

  1. Search Page — extracts property cards and queues detail URLs
  2. Detail Page — extracts full description, amenities, and review data

✅ Use Cases

  • 🔍 Scrape hotels in any city for a given date range and guest count
  • 📊 Compare hotel prices, star ratings, and review scores
  • 🏖️ Extract full amenity lists and property descriptions
  • 📋 Build travel comparison datasets

📥 Input Configuration

{
"startUrls": [
{
"url": "https://www.booking.com/searchresults.html?ss=Chennai&checkin=2026-02-18&checkout=2026-02-24&group_adults=2&no_rooms=1&group_children=0"
}
],
"ss": "Chennai",
"checkin": "2026-02-18",
"checkout": "2026-02-24",
"group_adults": 2,
"no_rooms": 1,
"group_children": 0,
"maxItems": 50,
"scrollTimeout": 40,
"proxy": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
},
"debugMode": false
}

🧾 Fields Explained

FieldTypeDescription
startUrlsarrayBooking.com search result URLs. If provided, individual parameters below are ignored.
ssstringDestination city (e.g. "Chennai"). Used when no startUrls are provided.
checkinstringCheck-in date in YYYY-MM-DD format
checkoutstringCheck-out date in YYYY-MM-DD format
group_adultsintegerNumber of adult guests
no_roomsintegerNumber of rooms required
group_childrenintegerNumber of child guests
maxItemsintegerMaximum number of hotel detail pages to scrape
scrollTimeoutintegerSeconds to scroll the search page to load more results
proxyobjectProxy configuration
debugModebooleanSave debug screenshots and verbose logs

🔗 URL Structure

The scraper supports direct Booking.com search URLs:

https://www.booking.com/searchresults.html?ss=Chennai&checkin=2026-02-18&checkout=2026-02-24&group_adults=2&no_rooms=1&group_children=0
ParameterDescription
ssDestination / city name
checkinCheck-in date
checkoutCheck-out date
group_adultsNumber of adults
no_roomsNumber of rooms
group_childrenNumber of children

📤 Output

Each record saved to the dataset contains:

{
"dataType": "hotel",
"hotel_name": "The Leela Palace Chennai",
"location": "MRC Nagar, Chennai",
"price_per_night": "₹ 110,860",
"rating": 5,
"review_score": 9.1,
"review_label": "Rated superb",
"review_count": "2,265 reviews",
"description": "Nestled along the Coromandel Coast...",
"amenities": [
"Free Wi-Fi",
"Swimming Pool",
"Spa",
"Fitness Center",
"Restaurant"
],
"thumbnail": "https://cf.bstatic.com/...",
"hotel_url": "https://www.booking.com/hotel/in/leela-palace-chennai.html",
"scrapedAt": "2026-02-19T10:00:00.000Z"
}

📋 Output Schema

FieldTypeDescription
dataTypestringAlways "hotel"
hotel_namestringHotel / property name
locationstringAddress or area
price_per_nightstringDisplayed nightly price with currency
ratingintegerStar rating (1–5), counted from SVG icons
review_scorenumberGuest review score (e.g. 9.1)
review_labelstringTextual rating (e.g. "Rated superb")
review_countstringNumber of reviews (e.g. "2,265 reviews")
descriptionstringFull property description
amenitiesarrayList of facilities
thumbnailstringThumbnail image URL
hotel_urlstringBooking.com hotel detail page URL
scrapedAtstringISO 8601 scrape timestamp

🔒 Proxy Configuration

{
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}

Residential proxies are recommended to avoid Booking.com's anti-bot measures.


🛠️ Tech Stack

  • 🧩 Apify SDK — actor and dataset handling
  • 🕷️ Crawlee — crawling and request queue management
  • 🌐 Puppeteer — headless browser automation
  • ⚙️ Node.js — fast, scalable backend