Debijenkorf.nl Scraper
2 hours trial then $10.00/month - No credit card required now
Debijenkorf.nl Scraper
2 hours trial then $10.00/month - No credit card required now
Gain a competitive edge with the DeBijenkorf.nl Scraper, your go-to tool for extracting detailed product data from the Netherlands' premier luxury retailer. Stay ahead with insights on pricing, stock, and product details, all tailored to streamline your e-commerce or market research efforst.
Overview
The DeBijenkorf.nl Scraper is designed to extract comprehensive product data from the luxury retailer De Bijenkorf's website. This tool is ideal for e-commerce analysts, brand managers, and market researchers who need up-to-date information on products, pricing, stock availability, and other product-specific attributes.
De Bijenkorf is a leading luxury department store in the Netherlands, offering a wide range of products, including fashion, beauty, homeware, and more. This scraper allows you to easily collect structured data from the site, streamlining your data collection efforts.
Features
- Scrape detailed product information, including names, descriptions, pricing, and availability.
- Extract metadata, such as categories, brand details, and related product recommendations.
- High customization capabilities with start URLs and concurrency settings.
- Built-in proxy support for smooth and anonymous scraping.
- Support for various types of URLs, including product listing pages, category pages, and filtered product URLs.
How to Use
- Set Up: Ensure you have an Apify account and access to the Apify platform.
- Provide Start Input: Enter the desired start URLs for the categories or products you wish to scrape.
- Adjust Scraper Settings: Modify settings such as
maxConcurrency
,minConcurrency
, andmaxRequestRetries
to match your requirements. - Configure Proxy Settings: (Optional) Configure proxy settings to avoid rate limiting or IP blocks.
- Run the Scraper: Execute the scraper on the Apify platform. The output will be available in various formats, including JSON, CSV, or Excel.
- Export Results: Download the output data in your preferred format.
Input Configuration
Here is an example of how to set up the input for the DeBijenkorf.nl Scraper:
1{ 2 "startUrls": [ 3 { 4 "url": "https://www.debijenkorf.nl/herenmode/polo-s-t-shirts/t-shirts" 5 } 6 ], 7 "maxItems": 100, 8 "maxConcurrency": 10, 9 "minConcurrency": 1, 10 "maxRequestRetries": 100, 11 "proxy": { 12 "useApifyProxy": true, 13 "apifyProxyGroups": ["RESIDENTIAL"] 14 } 15}
Input Fields Explanation
- Start URLs (startUrls): List of URLs where the scraper will start collecting data. These can include category pages like "https://www.debijenkorf.nl/herenmode/polo-s-t-shirts/t-shirts", product pages, or even filter-based URLs such as "https://www.debijenkorf.nl/product-lister-page.html?fh_location=%2F%2Fcatalog01%2Fnl_NL%2Fcategories%3C%7Bcatalog01_80%7D". This flexibility allows targeted and comprehensive data collection across De Bijenkorf's website.
- Max Items (maxItems): Maximum number of items to scrape during a session. Default is
100
. - Max Concurrency (maxConcurrency): Maximum number of parallel requests. Default is
10
. - Min Concurrency (minConcurrency): Minimum number of parallel requests. Default is
1
. - Max Request Retries (maxRequestRetries): Number of retries for failed requests. Default is
100
. - Proxy Configuration (proxy): Specifies proxy servers for anonymous scraping. Default uses Apify’s residential proxies.
Output Structure
The scraper produces a list of products with each item containing fields such as:
Product Example:
1{ 2 "originalUrl": "www.debijenkorf.nl/escentric-molecules-molecule-discovery-set-eau-de-toilette-6408090148-640809014800000", 3 "catalogUrl": "https://ceres-catalog.debijenkorf.nl/catalog/product/show?productCode=6408090148&productVariantCode=640809014800000&cached=false&locale=nl_NL&api-version=2.41", 4 "productCode": "6408090148", 5 "productVariantCode": "640809014800000", 6 "archived": false, 7 "code": "6408090148", 8 "url": "//www.debijenkorf.nl/escentric-molecules-molecule-discovery-set-eau-de-toilette-6408090148", 9 "defaultVariantCode": "640809014800000", 10 "name": "Molecule Discovery Set Eau de Toilette", 11 "displayName": "Escentric Molecules Molecule Discovery Set Eau de Toilette", 12 "description": "<p>De Molecule Discovery Set is samengesteld om je nieuwsgierigheid aan te wakkeren en te voeden. Ontworpen om te ontdekken. Met deze set kun je alle vijf onze Molecule-geuren onderzoeken voordat je bepaalt welk parfum jij het lekkerst vindt. De geuren zitten in miniflacons van 8,5 ml, verkleind om de radiale wereld van Molecules te verkennen en al je parfumbehoeften onderweg te vervullen.</p>", 13 "brand": { 14 "name": "Escentric Molecules", 15 "relativeUrl": "/escentric-molecules", 16 "query": "fh_location=//catalog01/nl_NL/brand>{escentric_molecules}" 17 }, 18 "subBrand": { 19 "name": "TOM FORD BEAUTY", 20 "relativeUrl": "/product-lister-page.html?fh_location=//catalog01/nl_NL/brand%3E%7Btom_ford%7D/producttype%3E%7Btom_ford_beauty%7D", 21 "query": "fh_location=//catalog01/nl_NL/brand>{tom_ford}/producttype>{tom_ford_beauty}" 22 }, 23 "variantProducts": [ 24 { 25 "archived": false, 26 "code": "208009000500000", 27 "url": "//www.debijenkorf.nl/tom-ford-soleil-neige-eau-de-parfum-2080090005-208009000500000", 28 "sellingPrice": { 29 "currencyCode": "EUR", 30 "value": 160, 31 "type": "SALES" 32 }, 33 "unitPrice": { 34 "unit": "ml", 35 "quantity": 100, 36 "value": 533.33 37 }, 38 "size": "30 ml", 39 "availability": { 40 "stock": 1, 41 "buyableFromDate": 1601845200000, 42 "available": true 43 }, 44 "deliveryTime": "NEXT_DAY", 45 "images": [ 46 { 47 "type": "FRONT", 48 "url": "//cdn-1.debijenkorf.nl/default/tom-ford-soleil-neige-eau-de-parfum/?reference=020/800/0208009000500000_pro_flt_frt_01_1108_1528_6189982.jpg" 49 } 50 ], 51 "selectionImage": { 52 "type": "DETAIL", 53 "url": "//cdn-1.debijenkorf.nl/default/tom-ford-soleil-neige-eau-de-parfum/?reference=020/800/0208009000500000_pro_flt_det_01_1108_1528_6189980.jpg" 54 }, 55 "signings": { 56 "merchandise": [ 57 { 58 "key": "signings/bestseller", 59 "text": "Bestseller" 60 } 61 ], 62 "discount": [] 63 }, 64 "trackingMetadata": { 65 "color": null, 66 "size": "30 ml" 67 } 68 } 69 ], 70 "currentVariantProduct": { 71 "archived": false, 72 "code": "208009000500000", 73 "url": "//www.debijenkorf.nl/tom-ford-soleil-neige-eau-de-parfum-2080090005-208009000500000", 74 "sellingPrice": { 75 "currencyCode": "EUR", 76 "value": 160, 77 "type": "SALES" 78 }, 79 "unitPrice": { 80 "unit": "ml", 81 "quantity": 100, 82 "value": 533.33 83 }, 84 "size": "30 ml", 85 "availability": { 86 "stock": 1, 87 "buyableFromDate": 1601845200000, 88 "available": true 89 }, 90 "deliveryTime": "NEXT_DAY", 91 "images": [ 92 { 93 "type": "FRONT", 94 "url": "//cdn-1.debijenkorf.nl/default/tom-ford-soleil-neige-eau-de-parfum/?reference=020/800/0208009000500000_pro_flt_frt_01_1108_1528_6189982.jpg" 95 }, 96 { 97 "type": "DETAIL", 98 "url": "//cdn-1.debijenkorf.nl/default/tom-ford-soleil-neige-eau-de-parfum/?reference=020/800/0208009000500000_pro_flt_det_01_1108_1528_6189980.jpg" 99 }, 100 { 101 "type": "DETAIL", 102 "url": "//cdn-1.debijenkorf.nl/default/tom-ford-soleil-neige-eau-de-parfum/?reference=020/800/0208009000500000_pro_flt_det_02_1108_1528_6189981.jpg" 103 } 104 ], 105 "selectionImage": { 106 "type": "DETAIL", 107 "url": "//cdn-1.debijenkorf.nl/default/tom-ford-soleil-neige-eau-de-parfum/?reference=020/800/0208009000500000_pro_flt_det_01_1108_1528_6189980.jpg" 108 }, 109 "signings": { 110 "merchandise": [ 111 { 112 "key": "signings/bestseller", 113 "text": "Bestseller" 114 } 115 ], 116 "discount": [] 117 }, 118 "trackingMetadata": { 119 "color": null, 120 "size": "30 ml" 121 } 122 }, 123 "groupedAttributes": [ 124 { 125 "id": "composition", 126 "attributes": [ 127 { 128 "label": "Geurtype", 129 "value": "Houtachtig", 130 "id": "A.182", 131 "visible": true 132 }, 133 { 134 "label": "Ingrediënten", 135 "value": "ALCOHOL DENAT., PARFUM (FRAGRANCE), AQUA (WATER), LINALOOL, COUMARIN, LIMONENE, CITRONELLOL, EUGENOL, CINNAMAL, CITRAL", 136 "id": "A.1501", 137 "visible": true 138 }, 139 { 140 "label": "Inhoud", 141 "value": "30 ml", 142 "id": "A.96", 143 "visible": true 144 }, 145 { 146 "label": "Type parfum", 147 "value": "Eau de Parfum, Unisex geuren", 148 "id": "A.1083", 149 "visible": true 150 }, 151 { 152 "label": "Unisex geur", 153 "value": "Dit is een unisex geur en kan door iedereen gedragen worden", 154 "id": "A.1960", 155 "visible": true 156 }, 157 { 158 "label": "Verpakking", 159 "value": "Spray", 160 "id": "A.1059", 161 "visible": true 162 } 163 ] 164 } 165 ], 166 "fitMessage": null, 167 "categories": [ 168 { 169 "id": "3320", 170 "name": "Geuren", 171 "url": "FROM FREDHOPPER" 172 }... 173 ], 174 "categoryPath": [ 175 { 176 "id": "org_web_shop", 177 "name": "Main catalog for web shops", 178 "type": "Category", 179 "displayName": null, 180 "description": "Main catalog for web shops", 181 "online": "1" 182 }... 183 ], 184 "trackingMetadata": { 185 "name": "Penhaligon's Halfeti Eau de Parfum - travel size", 186 "path": "/Shop/Dames/Cosmetica/Geuren" 187 }, 188 "colorCount": 0, 189 "defaultColor": null, 190 "shipping": { 191 "type": "STANDARD", 192 "supplier": null 193 }, 194 "promotionName": "kafka", 195 "similarItems": [ 196 { 197 "id": "brand", 198 "name": "Penhaligon's", 199 "query": "fh_location=//catalog01/nl_NL/brand>{penhaligon_s}", 200 "url": "/penhaligon-s", 201 "relativeUrl": "/penhaligon-s", 202 "noFollow": false 203 }... 204 ], 205 "gift": false, 206 "designer": false, 207 "sustainable": false, 208 "defaultRelatedProducts": "bundle", 209 "relatedProducts": { 210 "bundle": { 211 "endpoint": "//ceres-catalog.debijenkorf.nl:443/catalog/product/list?productCodes=8517090000,8517090003,8517090008,8517090011,8517090012,8517090038&includeOnlyAvailable=true&includeVariants=false&apiVersion=2.41&locale=nl_NL", 212 "codes": [ 213 "8517090000", 214 "8517090003", 215 "8517090008", 216 "8517090011", 217 "8517090012", 218 "8517090038" 219 ], 220 "title": null, 221 "ruleName": null 222 }, 223 "crossSell": { 224 "endpoint": "//ceres-catalog.debijenkorf.nl:443/catalog/product/list?productCodes=3469090051,3469090074,4731090073,3469090181,6063090015,3469090180,2769090021,2450090205,2769090022,6116090371,2665090157,8535090022,2665090158,8535090023,8535090024,4081090093&includeOnlyAvailable=true&includeVariants=false&apiVersion=2.41&locale=nl_NL", 225 "codes": [ 226 "3469090051", 227 ], 228 "title": "Anderen bekeken ook", 229 "ruleName": "Catlevel3 - fallback - accessoires" 230 } 231 }, 232 "createdOn": 1734337912170, 233 "recommendationRanking": 0, 234 "alternativeUrls": { 235 "nl_NL": "//www.debijenkorf.nl/penhaligon-s-halfeti-eau-de-parfum-travel-size-8517090013-851709001306090", 236 "nl_BE": "//www.debijenkorf.be/penhaligon-s-halfeti-eau-de-parfum-travel-size-8517090013-851709001306090" 237 }, 238 "videos": [], 239 "returnInstructions": null, 240 "apiAttributes": { 241 "archived": { 242 "label": "archived", 243 "value": "-1", 244 "id": "archived", 245 "visible": false 246 }, 247 "A.1280": { 248 "label": "Navigation_Tracking_Path", 249 "value": "/Dames/Cosmetica/Geuren", 250 "id": "A.1280", 251 "visible": false 252 }, 253 "name_id": { 254 "label": "", 255 "value": "Penhaligon's Halfeti Eau de Parfum - travel size", 256 "id": "name_id", 257 "visible": false 258 }, 259 "ES_Other": { 260 "label": "ES_Other", 261 "value": "8517090000,8517090003,8517090008,8517090011,8517090012,8517090038", 262 "id": "ES_Other", 263 "visible": false 264 } 265 }, 266 "supplierModel": "WHOLESALE", 267}
Output Fields Explanation
Product Fields:
-
Original URL (
originalUrl
): The URL of the product page. -
Catalog URL (
catalogUrl
): The API catalog URL for detailed product information. -
Product Code (
productCode
): The unique identifier for the product. -
Product Variant Code (
productVariantCode
): The variant code for the specific product version. -
Archived (
archived
): Indicates whether the product is archived. -
Code (
code
): The product's internal code. -
URL (
url
): The relative URL of the product. -
Default Variant Code (
defaultVariantCode
): The default variant code for the product. -
Name (
name
): The name of the product. -
Display Name (
displayName
): The display name of the product. -
Description (
description
): Detailed description of the product. -
Brand (
brand
): Information about the product's brand, including name and related URLs. -
SubBrand (
subBrand
): Information about the sub-brand, including name and related URLs. -
Variant Products (``** like t****)**: Details about product variants such as size, price, and availability.
- Archived (
archived
): Indicates if the variant is archived. - Code (
code
): Unique identifier for the product variant. - URL (
url
): URL of the variant. - Selling Price (
sellingPrice
): Price details, including currency and value. - Unit Price (
unitPrice
): Price per unit of measurement. - Size (
size
): Size of the product variant. - Availability (
availability
): Stock availability and buyable date. - Delivery Time (
deliveryTime
): Estimated delivery time. - Images (
images
): URLs of product images. - Selection Image (
selectionImage
): Highlighted image for the variant. - Signings (
signings
): Merchandise-related tags such as "Bestseller". - Tracking Metadata (
trackingMetadata
): Metadata for tracking color and size.
- Archived (
-
Current Variant Product (
currentVariantProduct
): Detailed information about the currently selected product variant, including price, availability, and images.- Archived (
archived
): Boolean indicating whether this specific variant is archived. - Code (
code
): Unique identifier for the current product variant. - URL (
url
): The relative URL pointing to the specific variant's page on the De Bijenkorf website. - Selling Price (
sellingPrice
):- Currency Code (
currencyCode
): The currency of the selling price (e.g., "EUR"). - Value (
value
): The actual price of the product variant. - Type (
type
): Indicates the price type, such as "SALES" or "REGULAR".
- Currency Code (
- Unit Price (
unitPrice
):- Unit (
unit
): The measurement unit for the price (e.g., "ml"). - Quantity (
quantity
): The amount associated with the unit price. - Value (
value
): The calculated price per unit.
- Unit (
- Overridden Prices (
overriddenPrices
): A list of historical or promotional prices, if any. - Size (
size
): Size specification of the product variant (e.g., "30 ml"). - Color (
color
): The color associated with the variant, if applicable. - Minimum Order Quantity (
minOrderQuantity
): The minimum quantity required for purchase. - Availability (
availability
):- Stock (
stock
): Number of units available in stock. - Buyable From Date (
buyableFromDate
): The timestamp indicating when the variant becomes purchasable. - Available (
available
): Boolean indicating if the variant is currently available. - Available Future (
availableFuture
): Boolean indicating if the variant will be available in the future.
- Stock (
- Delivery Time (
deliveryTime
): Estimated time for the product to be delivered (e.g., "NEXT_DAY"). - Images (
images
): A collection of image objects, each containing:- Type (
type
): The context of the image (e.g., "FRONT", "DETAIL"). - URL (
url
): The URL of the image. - Position (
position
): Order or importance of the image.
- Type (
- Image Collections (
imageCollections
): Grouped images for specific purposes, if any. - Signings (
signings
):- Merchandise (
merchandise
): Tags or labels associated with the variant, such as "Bestseller". - Discount (
discount
): List of any applied discounts.
- Merchandise (
- Selection Image (
selectionImage
): The primary image used for selection purposes. - Color Swatch Image (
colorSwatchImage
): The image representing the color swatch, if available. - EAN (
ean
): European Article Number for the variant. - Department (
department
): ID of the department to which this variant belongs. - Department Name (
departmentName
): Name of the department. - Related Products (
relatedProducts
):- Model Also Wears (
modelAlsoWears
): Suggests related items often worn or purchased together, including:- Endpoint (
endpoint
): API endpoint for related items. - Codes (
codes
): Product codes of the related items.
- Endpoint (
- Model Also Wears (
- Tracking Metadata (
trackingMetadata
):- Color (
color
): Metadata for the color attribute. - Size (
size
): Metadata for the size attribute.
- Color (
- Lowest Price (
lowestPrice
): Historical lowest price for the variant, if tracked. - Lowest Price Flag (
lowestPriceFlag
): Indicates whether this is the lowest price observed. - Current (
current
): Boolean indicating if this variant is the current primary variant. - Bundle Set (
bundleSet
): Boolean indicating if this variant is part of a bundle. - Retail Set (
retailSet
): Boolean indicating if this variant is part of a retail set.
- Archived (
-
Grouped Attributes (groupedAttributes): A collection of grouped characteristics or features of the product, each identified by a unique
id
. These attributes provide detailed information about the product's composition, specifications, and fit. Below is an explanation of the fields:- Composition Group (
id: composition
): This group includes attributes related to the product's composition and features: - Label (
label
): A descriptive title for the attribute (e.g., "Geurtype"). - Value (
value
): The specific value or detail of the attribute (e.g., "Houtachtig" for the scent type). - ID (
id
): A unique identifier for the attribute (e.g., "A.182"). - Visible (
visible
): A boolean indicating if the attribute is displayed on the product page. - Specifications Group (
id: specifications
): This group is intended for additional technical specifications. It is empty in the provided example but can include details such as material composition, durability, or care instructions. - Fit Group (
id: fit
): This group covers attributes related to the fit or usability of the product. It is also empty in the example but can include size recommendations, fit descriptions, or usage tips.
- Composition Group (
-
Fit Message (fitMessage): A message providing specific fit-related advice or notes about the product. It is typically null unless additional guidance is available.
-
Categories (categories): A list of categories to which the product belongs, each containing:
- ID (id): The unique identifier for the category.
- Name (name): The name of the category (e.g., "Geuren").
- URL (url): The source or reference URL for the category.
-
Category Path (categoryPath): A hierarchical representation of the product’s location within the website’s catalog, each entry containing:
- ID (id): The unique identifier for the category level.
- Name (name): The name of the category level.
- Type (type): The type of the category (e.g., "Category").
- Display Name (displayName): A user-friendly display name for the category (can be null if not defined).
- Description (description): A brief description of the category (can be empty).
- Online (online): Indicates if the category is available online (
"1"
for available).
-
Tracking Metadata (trackingMetadata): Metadata for tracking and analytics, including:
- Name (name): The name of the product for tracking purposes.
- Path (path): The hierarchical path of the product within the website structure.
-
Color Count (colorCount): An integer representing the number of color options available for the product.
-
Default Color (defaultColor): The default or primary color associated with the product. Can be null if no color is specified.
-
Shipping (shipping): Information about the product's shipping options, including:
- Type (type): The type of shipping available (e.g., "STANDARD").
- Supplier (supplier): The supplier responsible for shipping (can be null).
-
Promotion Name (promotionName): The name of the promotional campaign or offer associated with the product.
-
Similar Items (similarItems): A list of related or recommended items, each containing:
- ID (id): The identifier for the similar item.
- Name (name): The name of the similar item.
- Query (query): The query string used to retrieve the similar item.
- URL (url): The full URL to the similar item’s page.
- Relative URL (relativeUrl): A shorter relative path to the similar item’s page.
- No Follow (noFollow): A boolean indicating if search engines should follow the link.
-
Display Properties (
displayProperties
):- Detail Page Variation (
detailPageVariation
): Specifies the type or variation of the detail page, such as "COSMETICS," to tailor the display format. - Show Quantity (
showQuantity
): Boolean that indicates whether the quantity selection is displayed on the product page. - Current Variant Selected (
currentVariantSelected
): Boolean that indicates if the current variant is selected as default or highlighted.
- Detail Page Variation (
-
Gift (
gift
): Indicates whether the product can be marked as a gift (Boolean). -
Designer (
designer
): Boolean specifying if the product is from a designer brand. -
Sustainable (
sustainable
): Boolean indicating if the product has sustainable or eco-friendly attributes. -
Default Related Products (
defaultRelatedProducts
): Specifies the type of related products (e.g., "bundle"). -
Related Products (
relatedProducts
):- Bundle (
bundle
): Contains details about products grouped in bundles, such as:- Endpoint (
endpoint
): API endpoint for fetching bundle details. - Codes (
codes
): Array of product codes included in the bundle. - Title (
title
): Optional title for the bundle. - Rule Name (
ruleName
): Rule associated with the bundle's selection.
- Endpoint (
- Cross Sell (
crossSell
): Contains cross-sell recommendations, such as:- Endpoint (
endpoint
): API endpoint for fetching cross-sell products. - Codes (
codes
): Array of product codes for cross-sell recommendations. - Title (
title
): Optional title for cross-sell recommendations. - Rule Name (
ruleName
): Rule defining the cross-sell logic.
- Endpoint (
- Bundle (
-
Created On (
createdOn
): Timestamp of when the product entry was created. -
Recommendation Ranking (
recommendationRanking
): Numeric rank used to prioritize recommendations. -
Alternative URLs (
alternativeUrls
): Contains alternative URLs for different regions or versions, such as:nl_NL
: URL for the Netherlands.nl_BE
: URL for Belgium.
-
Videos (
videos
): An array for video content related to the product, if any. -
Return Instructions (
returnInstructions
): Guidelines for returning the product, if provided. -
API Attributes (
apiAttributes
):- Archived (
archived
): Metadata indicating if the product is archived. - Navigation Tracking Path (
A.1280
): Path information for tracking navigation within the site. - Name ID (
name_id
): Internal identifier for the product's name. - ES Other (
ES_Other
): Metadata containing related product codes.
- Archived (
-
Supplier Model (
supplierModel
): Describes the supplier type (e.g., "WHOLESALE").
Explore More Scrapers
If you found this Zillow Cheerio Scraper useful, be sure to check out our other powerful scrapers and actors at memo23's Apify profile. We offer a wide range of tools to enhance your web scraping and automation needs across various platforms and use cases.
Support
- For issues or feature requests, please use the Issues section of this actor.
- If you need customization or have questions, feel free to contact the author:
- Author's website: https://muhamed-didovic.github.io/
- Email: muhamed.didovic@gmail.com
Additional Services
- Request customization or whole dataset: muhamed.didovic@gmail.com
- If you need anything else scraped, or this actor customized, email: muhamed.didovic@gmail.com
- For API services of this scraper (no Apify fee, just usage fee for the API), contact: muhamed.didovic@gmail.com
Actor Metrics
3 monthly users
-
0 No stars yet
>99% runs succeeded
Created in Dec 2024
Modified 10 days ago