Geizhals Scraper avatar
Geizhals Scraper

Pricing

Pay per usage

Go to Apify Store
Geizhals Scraper

Geizhals Scraper

Extract price comparisons, product specifications, and merchant offers from Geizhals, the leading price comparison platform in the DACH region. This lightweight actor is optimized for speed. For consistent results and to prevent blocking, using residential proxies is highly recommended

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

8 days ago

Last modified

Share

Geizhals Price Comparison Scraper

Powerful and efficient data extraction tool for Geizhals.eu, Europe's leading price comparison platform. Scrape product listings, detailed specifications, merchant offers, ratings, and pricing information across thousands of electronics, home appliances, and consumer products.

What You Can Extract

Extract comprehensive product data from Geizhals.eu including:

  • Product Information: Names, brands, model numbers, descriptions
  • Pricing Data: Current prices, price ranges, currency information
  • Merchant Offers: Multiple vendor prices for each product
  • Product Specifications: Technical details and features
  • Ratings & Reviews: User ratings, review counts, and quality scores
  • Images: Product images and thumbnails
  • Availability: Stock status and shipping information

Perfect for price monitoring, market research, competitive analysis, product catalog building, and e-commerce intelligence.

Key Features

  • Multi-Domain Support - Scrape from Geizhals.eu, Geizhals.de, or Geizhals.at
  • Smart Pagination - Automatically navigates through multiple result pages
  • Flexible Filtering - Filter by category, price range, and search queries
  • Detail Extraction - Optional deep scraping of individual product pages
  • Structured Output - Clean, consistent JSON format for easy integration
  • Proxy Support - Built-in proxy rotation to prevent blocking
  • Fast Performance - HTTP-based scraping using CheerioCrawler (no browsers required)
  • Duplicate Prevention - Automatic URL deduplication

Input Configuration

Configure the scraper with these parameters to customize your data extraction:

Basic Settings

ParameterTypeDescriptionRequired
categoryStringGeizhals category code (e.g., hvent for ventilators, nb for notebooks). Find codes in Geizhals URLs.No
searchQueryStringSearch keyword to filter products within category (e.g., "Dyson", "Samsung")No
startUrlStringDirect Geizhals URL to start scraping from. Overrides category/search parameters.No
countryStringTarget domain: eu (Europe), de (Germany), or at (Austria)No (default: eu)

Filtering Options

ParameterTypeDescription
minPriceNumberMinimum product price in EUR
maxPriceNumberMaximum product price in EUR
results_wantedIntegerMaximum number of products to extract (default: 100)
max_pagesIntegerLimit on listing pages to visit (default: 20)

Extraction Settings

ParameterTypeDescription
collectDetailsBooleanExtract full product specifications and offers from detail pages (default: true)
proxyConfigurationObjectProxy settings for reliable scraping (recommended: residential proxies)

Input Examples

Example 1: Scrape Ventilators Category

{
"category": "hvent",
"results_wanted": 50,
"max_pages": 5,
"collectDetails": true,
"country": "eu"
}

Example 2: Search for Specific Brand

{
"category": "nb",
"searchQuery": "ThinkPad",
"minPrice": 500,
"maxPrice": 1500,
"results_wanted": 100,
"country": "de"
}

Example 3: Scrape from Direct URL

{
"startUrl": "https://geizhals.eu/?cat=hvent&xf=9810_Dyson",
"collectDetails": true,
"results_wanted": 30
}

Output Data

The scraper provides structured JSON data with comprehensive product information.

Output Structure

FieldTypeDescription
nameStringProduct name/title
brandStringProduct brand/manufacturer
descriptionStringProduct description
priceNumberCurrent price (lowest available)
currencyStringPrice currency (typically EUR)
ratingNumberAverage user rating (1-5 scale)
review_countNumberTotal number of reviews
imageStringProduct image URL
skuStringProduct SKU/model number
specificationsObjectTechnical specifications (when collectDetails: true)
offersArrayList of merchant offers with prices (when collectDetails: true)
offers_countNumberTotal number of available offers
urlStringProduct detail page URL
scraped_fromStringSource type: listing or detail

Example Output

{
"name": "Dyson Cool AM07 Tower Fan",
"brand": "Dyson",
"description": "Tower fan with Air Multiplier technology",
"price": 275.00,
"currency": "EUR",
"rating": 4.5,
"review_count": 28,
"image": "https://geizhals.eu/p/123456.jpg",
"sku": "AM07",
"specifications": {
"Type": "Tower Fan",
"Power": "56W",
"Height": "100cm",
"Features": "Remote control, Sleep timer"
},
"offers": [
{
"merchant": "Amazon.de",
"price": 275.00,
"currency": "EUR"
},
{
"merchant": "MediaMarkt",
"price": 289.99,
"currency": "EUR"
}
],
"offers_count": 12,
"url": "https://geizhals.eu/dyson-cool-am07-tower-fan-a123456.html",
"scraped_from": "detail"
}

How to Use

Using Apify Console

  1. Navigate to the Actor page in Apify Console
  2. Configure input parameters in the Input tab
  3. Click "Start" to begin scraping
  4. Monitor progress in the Log tab
  5. Download results from the Dataset tab

Using Apify API

const ApifyClient = require('apify-client');
const client = new ApifyClient({
token: 'YOUR_APIFY_TOKEN',
});
const input = {
category: 'hvent',
results_wanted: 50,
collectDetails: true,
};
const run = await client.actor('YOUR_ACTOR_ID').call(input);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Using Apify CLI

$apify call YOUR_ACTOR_ID --input='{"category":"hvent","results_wanted":50}'

Common Use Cases

Price Monitoring

Track product prices across multiple merchants to identify the best deals and price trends over time.

Market Research

Analyze product availability, pricing strategies, and competitive positioning across different categories.

Product Catalog Building

Build comprehensive product catalogs with specifications, images, and pricing for e-commerce platforms.

Competitive Intelligence

Monitor competitor product offerings, pricing, and merchant partnerships.

Price Comparison Tools

Power your own price comparison service with fresh, accurate data from Geizhals.

Performance & Costs

MetricDetails
Speed~50-100 products per minute (with detail scraping)
~200-300 products per minute (listing only)
Resource UsageLow memory footprint, HTTP-based (no browser)
Recommended ProxyResidential proxies for optimal reliability
Compute Units~0.01 CU per 100 products (approximate)

Best Practices

  • Use Residential Proxies - Geizhals may rate-limit datacenter IPs. Residential proxies ensure reliable scraping.
  • Set Reasonable Limits - Use results_wanted and max_pages to avoid unnecessarily long runs.
  • Enable Detail Scraping Selectively - Set collectDetails: false for faster scraping if you only need basic product information.
  • Category Codes - Find category codes by browsing Geizhals.eu and extracting the cat parameter from URLs.
  • Price Filtering - Use minPrice and maxPrice to focus on specific price ranges.
  • Schedule Regular Runs - Set up scheduled runs to monitor price changes and product availability.

Troubleshooting

No Results Returned

  • Verify the category code is correct
  • Check if the search query matches existing products
  • Ensure price filters aren't too restrictive
  • Try increasing max_pages

Rate Limiting / Blocking

  • Enable residential proxies in proxyConfiguration
  • Reduce maxConcurrency if making custom modifications
  • Add delays between requests if needed

Missing Product Details

  • Ensure collectDetails is set to true
  • Some products may have limited information available
  • Check if the product URL is accessible

Data Freshness

Data is scraped in real-time during each Actor run, ensuring you receive the most current pricing and availability information directly from Geizhals.

This Actor is designed for legitimate use cases such as market research, price monitoring, and data analysis. Please ensure your use complies with:

  • Geizhals.eu Terms of Service
  • Applicable data protection regulations (GDPR, etc.)
  • Robots.txt guidelines
  • Fair use principles (reasonable request rates, proxy usage)

Respect rate limits and use proxies to avoid overloading the target server.

Support & Feedback

Have questions or need assistance? Contact support through the Apify platform or leave feedback on the Actor page.

Version History

  • v1.0.0 - Initial release with full Geizhals scraping capabilities

Built with ❤️ using Apify SDK and Crawlee