Hostelworld Hotels Scraper
Pricing
Pay per usage
Extract hostel listings from Hostelworld globally. Get prices, ratings, amenities, reviews, locations & booking links. Ideal for travel price comparison, competitive intelligence, booking aggregation & accommodation market research. Real-time, production-ready output.
Developer
Shahid Irfan
Maintained by Community
Extract complete accommodation listings from Hostelworld city pages and search URLs. Collect hotels, hostels, and mixed-property results with pricing, ratings, facilities, rooms, images, and location data in a structured dataset. Useful for travel research, accommodation comparison, pricing analysis, and hospitality market monitoring.
Features
- Clean and messy URL support: Accepts standard city pages plus search-style URLs that include query parameters and city IDs
- Hotels and hostels together: Works with /hotels/, /hostels/, and mixed listing pages
- Rich property coverage: Captures property details, pricing layers, review scores, facilities, rooms, promotions, and image links
- Automatic pagination: Keeps fetching pages until your result limit or page cap is reached
- Duplicate protection: Prevents repeated properties from being added to the dataset
- Smart internal defaults: Reuses dates and guest counts from messy URLs when present, otherwise falls back to internal defaults
- Failure diagnostics: Detects broken listing responses and logs clear recovery signals instead of failing silently
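The pagination and duplicate-protection behavior described above can be pictured with a short Python sketch. This is not the actor's source code: `collect_properties` and `fetch_page` are hypothetical names, and the page fetcher is injected so the example runs without network access. It assumes properties are deduplicated by `property_id`.

```python
from typing import Callable, Iterable


def collect_properties(
    fetch_page: Callable[[int], Iterable[dict]],
    results_wanted: int = 20,
    max_pages: int = 5,
) -> list[dict]:
    """Collect unique properties until the result limit, the page cap, or an empty page."""
    seen_ids: set[str] = set()
    collected: list[dict] = []
    for page in range(1, max_pages + 1):
        items = list(fetch_page(page))
        if not items:
            break  # the source ran out of pages
        for item in items:
            property_id = str(item["property_id"])
            if property_id in seen_ids:
                continue  # duplicate protection: skip properties already stored
            seen_ids.add(property_id)
            collected.append(item)
            if len(collected) >= results_wanted:
                return collected  # result limit reached
    return collected


# Usage with a fake in-memory source (hypothetical data).
fake_pages = {
    1: [{"property_id": "1"}, {"property_id": "2"}],
    2: [{"property_id": "2"}, {"property_id": "3"}],
}
result = collect_properties(lambda page: fake_pages.get(page, []), results_wanted=3)
print([item["property_id"] for item in result])
```

Property `2` appears on both fake pages but is stored only once, and the loop stops as soon as three unique properties have been collected.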
Use Cases
Travel Planning
Compare accommodation options across cities before booking. Review price ranges, guest ratings, district information, and room availability in one dataset.
Market Intelligence
Track pricing and property positioning across destinations. Identify popular neighborhoods, promoted listings, and differences between hostels and hotels.
Hospitality Benchmarking
Analyze review breakdowns such as cleanliness, staff, facilities, and value for money. Use the data to compare property performance across markets.
Content and Comparison Sites
Build city guides, destination roundups, and accommodation comparison pages with structured listing data and direct property links.
Research and Analytics
Create datasets for tourism research, hospitality dashboards, or competitor monitoring with repeatable city-level collection.
Input Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| startUrl | String | Yes | - | Hostelworld city or search URL. Works with clean /hostels/ and /hotels/ links plus messy search URLs such as /pwa/s?...&id=13. |
| results_wanted | Integer | No | 20 | Maximum number of properties to collect. |
| max_pages | Integer | No | 5 | Safety cap on the number of listing pages to fetch. |
| proxyConfiguration | Object | No | {"useApifyProxy": false} | Optional Apify Proxy configuration. |
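If you build the input programmatically, a small helper can apply the defaults from the table before a run is started. The sketch below is only an illustration: `normalize_input` is a hypothetical name, and the hostname check and the positive-integer check are assumptions, not documented actor behavior.

```python
from urllib.parse import urlparse

# Defaults copied from the Input Parameters table.
DEFAULTS = {
    "results_wanted": 20,
    "max_pages": 5,
    "proxyConfiguration": {"useApifyProxy": False},
}


def normalize_input(raw: dict) -> dict:
    """Return the input with table defaults applied; raise ValueError on bad values."""
    start_url = raw.get("startUrl")
    if not start_url:
        raise ValueError("startUrl is required")
    host = urlparse(start_url).hostname or ""
    # Assumption: only hostelworld.com and its subdomains are valid sources.
    if host != "hostelworld.com" and not host.endswith(".hostelworld.com"):
        raise ValueError(f"not a Hostelworld URL: {start_url}")
    merged = {**DEFAULTS, **raw}
    for key in ("results_wanted", "max_pages"):
        value = merged[key]
        # Assumption: both limits must be positive integers (bool is excluded).
        if isinstance(value, bool) or not isinstance(value, int) or value < 1:
            raise ValueError(f"{key} must be a positive integer")
    return merged


config = normalize_input(
    {"startUrl": "https://www.hostelworld.com/hostels/europe/spain/barcelona/"}
)
print(config["results_wanted"], config["max_pages"])
```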
Output Data
Each dataset item can contain:
| Field | Type | Description |
|---|---|---|
| property_id | String | Hostelworld property ID |
| property_name | String | Property name |
| property_type | String | Property type such as hostel or hotel |
| star_rating | Number | Official star rating |
| address | String | Street address |
| district | String | Primary district or neighborhood |
| districts | Array | Additional district names when available |
| description | String | Property overview text |
| images | Array | Property image URLs |
| image_count | Number | Number of collected image URLs |
| city | String | City name |
| country | String | Country name |
| latitude | Number | Latitude |
| longitude | Number | Longitude |
| distance_km | Number | Distance from city center in kilometers |
| rating_overall | Number | Overall guest rating |
| total_ratings | Number | Number of guest ratings |
| rating_security | Number | Security rating |
| rating_location | Number | Location rating |
| rating_staff | Number | Staff rating |
| rating_atmosphere | Number | Atmosphere rating |
| rating_cleanliness | Number | Cleanliness rating |
| rating_facilities | Number | Facilities rating |
| rating_value_for_money | Number | Value for money rating |
| price_from | String | Lowest displayed price per night |
| price_currency | String | Currency used for pricing |
| lowest_dorm_price | String | Lowest dorm price per night |
| lowest_private_price | String | Lowest private room price per night |
| average_price | String | Average lowest nightly price |
| average_price_original | String | Original average nightly price before discount |
| average_dorm_price | String | Average dorm price |
| average_private_price | String | Average private room price |
| free_cancellation | Boolean | Whether free cancellation is available |
| free_cancellation_until | String | Free cancellation deadline when available |
| is_promoted | Boolean | Whether the property is promoted |
| is_featured | Boolean | Whether the property is featured |
| is_new | Boolean | Whether the property is newly listed |
| very_popular | Boolean | Whether the property is marked as very popular |
| hostelworld_recommends | Boolean | Whether the property is recommended |
| facilities | Array | Facility names |
| facilities_count | Number | Number of facilities collected |
| rooms | Array | Room summaries with type, capacity, ensuite flag, and price |
| room_types_count | Number | Number of room entries collected |
| promotions | Array | Promotion summaries |
| discount_percent | Number | Discount percentage when available |
| property_url | String | Direct property link |
| search_url | String | Canonical city search URL used for the run |
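Because the price fields are typed as String, they need converting before any arithmetic. The Python sketch below shows one way to do that with `Decimal`. The helper name `price_of` is hypothetical, and the sketch assumes prices are plain decimal strings such as "33.41", as in the sample output.

```python
from decimal import Decimal, InvalidOperation
from typing import Optional


def price_of(item: dict, field: str) -> Optional[Decimal]:
    """Return a price field as Decimal, or None when it is missing or unparsable."""
    raw = item.get(field)
    if raw is None:
        return None
    try:
        return Decimal(str(raw))
    except InvalidOperation:
        return None


# Values taken from the sample output in this README.
item = {
    "price_from": "33.41",
    "lowest_private_price": "217.34",
    "price_currency": "USD",
}
spread = price_of(item, "lowest_private_price") - price_of(item, "price_from")
print(spread, item["price_currency"])
```

`Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.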
Usage Examples
Standard Hostel City Page
{
  "startUrl": "https://www.hostelworld.com/hostels/europe/spain/barcelona/",
  "results_wanted": 20,
  "max_pages": 5
}
Hotel City Page
{
  "startUrl": "https://www.hostelworld.com/hotels/europe/spain/barcelona/",
  "results_wanted": 50,
  "max_pages": 5
}
Messy Search URL
{
  "startUrl": "https://www.hostelworld.com/pwa/s?q=New%20York,%20USA&country=USA&city=New%20York&type=city&id=13&from=2026-06-16&to=2026-06-19&guests=2&page=1",
  "results_wanted": 20,
  "max_pages": 3
}
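To see which stay settings a messy search URL carries, you can inspect its query string yourself. The Python sketch below reads the `id`, `from`, `to`, and `guests` parameters visible in the example URL above. The function name `stay_settings` is hypothetical, and the actor's own parsing may differ.

```python
from datetime import date
from typing import Optional
from urllib.parse import parse_qs, urlparse


def stay_settings(url: str) -> dict:
    """Extract city ID, dates, and guest count from a search URL, when present."""
    query = parse_qs(urlparse(url).query)

    def first(name: str) -> Optional[str]:
        values = query.get(name)
        return values[0] if values else None

    check_in = date.fromisoformat(first("from")) if first("from") else None
    check_out = date.fromisoformat(first("to")) if first("to") else None
    guests = int(first("guests")) if first("guests") else None
    nights = (check_out - check_in).days if check_in and check_out else None
    return {
        "city": first("city"),
        "city_id": first("id"),
        "check_in": check_in,
        "check_out": check_out,
        "guests": guests,
        "nights": nights,
    }


messy_url = (
    "https://www.hostelworld.com/pwa/s?q=New%20York,%20USA&country=USA"
    "&city=New%20York&type=city&id=13&from=2026-06-16&to=2026-06-19&guests=2&page=1"
)
settings = stay_settings(messy_url)
print(settings["city"], settings["city_id"], settings["nights"], settings["guests"])
```

A clean city URL has no query string, so every value comes back as `None`, which is the case where the actor falls back to its internal defaults.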
Sample Output
{
  "property_id": "61557",
  "property_name": "St Christopher's Inn Barcelona",
  "property_type": "HOSTEL",
  "address": "Carrer de Bergara, 3",
  "district": "Las Ramblas",
  "city": "Barcelona",
  "country": "Spain",
  "latitude": 41.3861073,
  "longitude": 2.16762,
  "distance_km": 0.22,
  "rating_overall": 85,
  "total_ratings": 15633,
  "rating_security": 90,
  "rating_location": 96,
  "rating_staff": 85,
  "rating_atmosphere": 82,
  "rating_cleanliness": 82,
  "rating_facilities": 80,
  "rating_value_for_money": 81,
  "price_from": "33.41",
  "price_currency": "USD",
  "lowest_dorm_price": "33.41",
  "lowest_private_price": "217.34",
  "free_cancellation": false,
  "facilities_count": 18,
  "room_types_count": 6,
  "property_url": "https://www.hostelworld.com/hostels/p/61557/st-christophers-inn-barcelona/",
  "search_url": "https://www.hostelworld.com/hostels/europe/spain/barcelona/"
}
Tips for Best Results
Use Real City or Search URLs
- Copy URLs directly from Hostelworld city pages or search pages
- Both clean city links and query-based search URLs are supported
- If the URL already contains dates or guests, those values are reused automatically
Control Collection Size
- Start with results_wanted: 20 for quick checks
- Increase the result limit for larger cities with many pages
- Use max_pages as a hard safety cap when testing new markets
Price Comparisons
- Use the same source URL structure across comparison runs
- Different travel dates embedded in the source URL can change availability and rates significantly
- Compare similar stay setups when building market benchmarks
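One way to compare two runs is to match properties by `property_id` and subtract their prices. The Python sketch below does this for `price_from`. The function name `price_changes` and the second run's data are hypothetical, and properties whose `price_currency` differs between runs are skipped because their prices are not directly comparable.

```python
from decimal import Decimal


def price_changes(
    run_a: list[dict], run_b: list[dict], field: str = "price_from"
) -> dict[str, Decimal]:
    """Return run_b price minus run_a price for each property found in both runs."""
    baseline = {
        item["property_id"]: (item.get("price_currency"), Decimal(item[field]))
        for item in run_a
        if item.get("property_id") and item.get(field)
    }
    changes: dict[str, Decimal] = {}
    for item in run_b:
        entry = baseline.get(item.get("property_id"))
        if entry is None or not item.get(field):
            continue  # property missing from the first run, or no price this time
        currency, old_price = entry
        if currency != item.get("price_currency"):
            continue  # different currencies are not comparable
        changes[item["property_id"]] = Decimal(item[field]) - old_price
    return changes


# First run uses the sample output price; the second run is made-up data.
run_a = [{"property_id": "61557", "price_from": "33.41", "price_currency": "USD"}]
run_b = [
    {"property_id": "61557", "price_from": "35.00", "price_currency": "USD"},
    {"property_id": "999", "price_from": "20.00", "price_currency": "USD"},
]
changes = price_changes(run_a, run_b)
print(changes)
```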
Integrations
Connect your dataset with:
- Google Sheets — Review pricing and ratings in spreadsheets
- Airtable — Build searchable accommodation databases
- Make — Trigger downstream travel workflows
- Zapier — Send new run results into business automations
- Webhooks — Push fresh listing data to your own systems
- Slack — Notify teams when collection finishes
Export Formats
- JSON — For applications and automation
- CSV — For spreadsheets and flat-file analysis
- Excel — For business reporting
- XML — For system integrations
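If you download the JSON export and want to build your own flat file, keep in mind that items can have different sets of fields and that some fields are arrays. The Python sketch below is one possible local conversion, not the platform's exporter: `items_to_csv` is a hypothetical helper that leaves missing fields blank and joins array values with "; ".

```python
import csv
import io


def items_to_csv(items: list[dict]) -> str:
    """Serialize dataset items to CSV, tolerating fields missing on some items."""
    # Build the header from every field seen, in first-seen order.
    fieldnames: list[str] = []
    for item in items:
        for key in item:
            if key not in fieldnames:
                fieldnames.append(key)

    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=fieldnames, restval="", lineterminator="\n"
    )
    writer.writeheader()
    for item in items:
        # Flatten arrays such as facilities or images into a single cell.
        row = {
            key: "; ".join(map(str, value)) if isinstance(value, list) else value
            for key, value in item.items()
        }
        writer.writerow(row)
    return buffer.getvalue()


# Made-up items with different field sets.
items = [
    {"property_id": "1", "facilities": ["Wifi", "Bar"]},
    {"property_id": "2", "star_rating": 3},
]
csv_text = items_to_csv(items)
print(csv_text)
```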
Frequently Asked Questions
Does it work with both hotels and hostels?
Yes. The actor accepts Hostelworld hotel pages, hostel pages, and mixed city listing URLs.
Can it handle messy search URLs?
Yes. Search-style URLs with query parameters, dates, guest counts, and direct city IDs are supported.
Will it collect all properties in a city?
It keeps collecting until it reaches results_wanted, the source runs out of pages, or max_pages is reached.
Why do some fields appear only on some properties?
Hostelworld does not expose every field for every listing. Missing fields are omitted instead of filled with empty placeholder values.
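Since missing fields are omitted rather than set to empty values, downstream code should not assume a key exists. The Python sketch below, with the hypothetical helper `field_coverage`, reports the share of items that contain each field, which helps decide which columns are reliable enough to analyze.

```python
from collections import Counter


def field_coverage(items: list[dict]) -> dict[str, float]:
    """Return, for each field, the fraction of items that include it."""
    total = len(items)
    if total == 0:
        return {}
    counts = Counter(key for item in items for key in item)
    return {field: count / total for field, count in counts.items()}


# Made-up items: only the first one carries a star rating.
items = [
    {"property_id": "1", "star_rating": 3},
    {"property_id": "2"},
]
coverage = field_coverage(items)
print(coverage)

# Reading an optional field safely: .get() returns None instead of raising KeyError.
print(items[1].get("star_rating"))
```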
Can I override the dates embedded in a URL?
No. The actor exposes no date or guest override inputs. It derives stay settings from the source URL when available and otherwise uses internal defaults. To run with different dates, change them in the source URL itself.
What happens if the source changes?
The actor retries listing requests with alternate browser-like headers, logs response-shape mismatches clearly, and probes the source URL for recovery signals before failing.
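The general pattern of retrying with alternate headers and checking the response shape can be sketched in Python as follows. This is an illustration of the idea only, not the actor's code: the header profiles, the `fetch_listing` function, and the `properties` key used for the shape check are all assumptions, and the fetch function is injected so the example runs offline.

```python
from typing import Callable

# Hypothetical browser-like header profiles, tried in order.
HEADER_PROFILES = [
    {"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64)", "Accept": "application/json"},
    {"User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7)", "Accept": "*/*"},
]


def fetch_listing(fetch: Callable[[dict], object], required_key: str = "properties") -> dict:
    """Try each header profile until a response with the expected shape arrives."""
    problems: list[str] = []
    for index, headers in enumerate(HEADER_PROFILES):
        try:
            payload = fetch(headers)
        except Exception as exc:  # network or HTTP failure for this profile
            problems.append(f"profile {index}: request failed ({exc})")
            continue
        if isinstance(payload, dict) and required_key in payload:
            return payload
        # Record the mismatch clearly instead of failing silently.
        problems.append(f"profile {index}: response-shape mismatch")
    raise RuntimeError("listing fetch failed: " + "; ".join(problems))


# Fake fetcher: the first profile gets an unexpected payload, the second succeeds.
def fake_fetch(headers: dict) -> dict:
    if headers is HEADER_PROFILES[0]:
        return {"error": "blocked"}
    return {"properties": []}


print(fetch_listing(fake_fetch))
```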
Support
For issues or feature requests, contact support through the Apify Console.
Legal Notice
This actor is designed for legitimate data collection purposes. Users are responsible for ensuring compliance with website terms of service and applicable laws. Use data responsibly and respect rate limits.