Hostelworld Hotels Scraper avatar

Hostelworld Hotels Scraper

Pricing

Pay per usage

Go to Apify Store
Hostelworld Hotels Scraper

Hostelworld Hotels Scraper

Extract hostel listings from Hostelworld globally. Get prices, ratings, amenities, reviews, locations & booking links. Ideal for travel price comparison, competitive intelligence, booking aggregation & accommodation market research. Real-time, production-ready output.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

Extract complete accommodation listings from Hostelworld city pages and search URLs. Collect hotels, hostels, and mixed-property results with pricing, ratings, facilities, rooms, images, and location data in a structured dataset. Useful for travel research, accommodation comparison, pricing analysis, and hospitality market monitoring.

Features

  • Clean and messy URL support — Accepts standard city pages plus search-style URLs that include query parameters and city IDs
  • Hotels and hostels together — Works with /hotels/, /hostels/, and mixed listing pages
  • Rich property coverage — Captures property details, pricing layers, review scores, facilities, rooms, promotions, and image links
  • Automatic pagination — Keeps fetching pages until your result limit or page cap is reached
  • Duplicate protection — Prevents repeated properties from being added to the dataset
  • Smart internal defaults — Reuses dates and guest counts from messy URLs when present, otherwise falls back internally
  • Failure diagnostics — Detects broken listing responses and logs clear recovery signals instead of failing silently

Use Cases

Travel Planning

Compare accommodation options across cities before booking. Review price ranges, guest ratings, district information, and room availability in one dataset.

Market Intelligence

Track pricing and property positioning across destinations. Identify popular neighborhoods, promoted listings, and differences between hostels and hotels.

Hospitality Benchmarking

Analyze review breakdowns such as cleanliness, staff, facilities, and value for money. Use the data to compare property performance across markets.

Content and Comparison Sites

Build city guides, destination roundups, and accommodation comparison pages with structured listing data and direct property links.

Research and Analytics

Create datasets for tourism research, hospitality dashboards, or competitor monitoring with repeatable city-level collection.


Input Parameters

ParameterTypeRequiredDefaultDescription
startUrlStringYesHostelworld city or search URL. Works with clean /hostels/ and /hotels/ links plus messy search URLs such as /pwa/s?...&id=13.
results_wantedIntegerNo20Maximum number of properties to collect.
max_pagesIntegerNo5Safety cap on the number of listing pages to fetch.
proxyConfigurationObjectNo{"useApifyProxy": false}Optional Apify Proxy configuration.

Output Data

Each dataset item can contain:

FieldTypeDescription
property_idStringHostelworld property ID
property_nameStringProperty name
property_typeStringProperty type such as hostel or hotel
star_ratingNumberOfficial star rating
addressStringStreet address
districtStringPrimary district or neighborhood
districtsArrayAdditional district names when available
descriptionStringProperty overview text
imagesArrayProperty image URLs
image_countNumberNumber of collected image URLs
cityStringCity name
countryStringCountry name
latitudeNumberLatitude
longitudeNumberLongitude
distance_kmNumberDistance from city center in kilometers
rating_overallNumberOverall guest rating
total_ratingsNumberNumber of guest ratings
rating_securityNumberSecurity rating
rating_locationNumberLocation rating
rating_staffNumberStaff rating
rating_atmosphereNumberAtmosphere rating
rating_cleanlinessNumberCleanliness rating
rating_facilitiesNumberFacilities rating
rating_value_for_moneyNumberValue for money rating
price_fromStringLowest displayed price per night
price_currencyStringCurrency used for pricing
lowest_dorm_priceStringLowest dorm price per night
lowest_private_priceStringLowest private room price per night
average_priceStringAverage lowest nightly price
average_price_originalStringOriginal average nightly price before discount
average_dorm_priceStringAverage dorm price
average_private_priceStringAverage private room price
free_cancellationBooleanWhether free cancellation is available
free_cancellation_untilStringFree cancellation deadline when available
is_promotedBooleanWhether the property is promoted
is_featuredBooleanWhether the property is featured
is_newBooleanWhether the property is newly listed
very_popularBooleanWhether the property is marked as very popular
hostelworld_recommendsBooleanWhether the property is recommended
facilitiesArrayFacility names
facilities_countNumberNumber of facilities collected
roomsArrayRoom summaries with type, capacity, ensuite flag, and price
room_types_countNumberNumber of room entries collected
promotionsArrayPromotion summaries
discount_percentNumberDiscount percentage when available
property_urlStringDirect property link
search_urlStringCanonical city search URL used for the run

Usage Examples

Standard Hostel City Page

{
"startUrl": "https://www.hostelworld.com/hostels/europe/spain/barcelona/",
"results_wanted": 20,
"max_pages": 5
}

Hotel City Page

{
"startUrl": "https://www.hostelworld.com/hotels/europe/spain/barcelona/",
"results_wanted": 50,
"max_pages": 5
}

Messy Search URL

{
"startUrl": "https://www.hostelworld.com/pwa/s?q=New%20York,%20USA&country=USA&city=New%20York&type=city&id=13&from=2026-06-16&to=2026-06-19&guests=2&page=1",
"results_wanted": 20,
"max_pages": 3
}

Sample Output

{
"property_id": "61557",
"property_name": "St Christopher's Inn Barcelona",
"property_type": "HOSTEL",
"address": "Carrer de Bergara, 3",
"district": "Las Ramblas",
"city": "Barcelona",
"country": "Spain",
"latitude": 41.3861073,
"longitude": 2.16762,
"distance_km": 0.22,
"rating_overall": 85,
"total_ratings": 15633,
"rating_security": 90,
"rating_location": 96,
"rating_staff": 85,
"rating_atmosphere": 82,
"rating_cleanliness": 82,
"rating_facilities": 80,
"rating_value_for_money": 81,
"price_from": "33.41",
"price_currency": "USD",
"lowest_dorm_price": "33.41",
"lowest_private_price": "217.34",
"free_cancellation": false,
"facilities_count": 18,
"room_types_count": 6,
"property_url": "https://www.hostelworld.com/hostels/p/61557/st-christophers-inn-barcelona/",
"search_url": "https://www.hostelworld.com/hostels/europe/spain/barcelona/"
}

Tips for Best Results

Use Real City or Search URLs

  • Copy URLs directly from Hostelworld city pages or search pages
  • Both clean city links and query-based search URLs are supported
  • If the URL already contains dates or guests, those values are reused automatically unless you override them

Control Collection Size

  • Start with results_wanted: 20 for quick checks
  • Increase the result limit for larger cities with many pages
  • Use max_pages as a hard safety cap when testing new markets

Price Comparisons

  • Keep the same type of source URL structure across comparison runs
  • Different travel dates embedded in the source URL can change availability and rates significantly
  • Compare similar stay setups when building market benchmarks

Integrations

Connect your dataset with:

  • Google Sheets — Review pricing and ratings in spreadsheets
  • Airtable — Build searchable accommodation databases
  • Make — Trigger downstream travel workflows
  • Zapier — Send new run results into business automations
  • Webhooks — Push fresh listing data to your own systems
  • Slack — Notify teams when collection finishes

Export Formats

  • JSON — For applications and automation
  • CSV — For spreadsheets and flat-file analysis
  • Excel — For business reporting
  • XML — For system integrations

Frequently Asked Questions

Does it work with both hotels and hostels?

Yes. The actor accepts Hostelworld hotel pages, hostel pages, and mixed city listing URLs.

Can it handle messy search URLs?

Yes. Search-style URLs with query parameters, dates, guest counts, and direct city IDs are supported.

Will it collect all properties in a city?

It keeps collecting until it reaches results_wanted, the source runs out of pages, or max_pages is reached.

Why do some fields appear only on some properties?

Hostelworld does not expose every field for every listing. Missing fields are omitted instead of filled with empty placeholder values.

Can I override the dates embedded in a URL?

No exposed override is required. The actor derives stay settings from the source URL when available and otherwise uses internal defaults.

What happens if the source changes?

The actor retries listing requests with alternate browser-like headers, logs response-shape mismatches clearly, and probes the source URL for recovery signals before failing.


Support

For issues or feature requests, contact support through the Apify Console.

Resources


This actor is designed for legitimate data collection purposes. Users are responsible for ensuring compliance with website terms of service and applicable laws. Use data responsibly and respect rate limits.