Carrefour
This Actor is unavailable because the developer has decided to deprecate it. Would you like to try a similar Actor instead?
See alternative ActorsCarrefour
International Carrefour scraper by region, categories and language
Carrefour International Scraper
An E-commerce products scraper from genral goods.
Introduction
The Carrefour International is a global scraping tool that collects all products data depending on selected category and region.
Please check below on how to properly input correct values.
Inputs
Some inputs work only for specific regions, read below to understand better.
Region
By selecting the region you're basically selecting the website from which the products will be scraped, This is a straight forward input where you can select one region per run.
I will be adding new regions in the future, if you require a specific region, please contact me on this platform or via the email below.
Language
Some websites provide products and data in different language such as belgium, please make sure to select the target language before continuing on any of the below inputs.
Category
Selecting the category depends on which region you used because the values are different between websites.
Below, you will find a guide on how to extract and input the correct category value from each website
France
In the carrefour France website, click on the Menu button on the top left corner you will find Tous les categories
section, simply right click the category you want from there and copy the link of it and paste it into the categories input (seel image below)
Belgium
For Belgium and since it has the language option.
- First, make sure you have selected the desired language.
- Second, right click the category name and copy the url and paste it into input.
Spain
For Spain website, it only displays the second child of each category's products, So you are forced to be more precise about what you are scraping but you can add as much values as you need. Check screenshot below
Italy
For Italy website, you will have to go to the seach page: carrefour.it search
There you will find all available categories, what you want to do from there is :
- Right click the targeted category on the left
- click inspect
- copy the
value
from theinput
item
check video:
Brazil
For Brazil website, on the top left corner of the page click the Todos Departamentos
dropdown
Thdo the following :
- Right click the targeted category
- click inspect
- copy the
title
from the abovea
tag
check video:
Saudi Arabia, UAE, Qatar, Kuwait
For the above four regions you simply go into the main category page and copy/paste the url, example url:
https://www.carrefourkuwait.com/mafkwt/en/c/FKWT1660000
Language and Proxy setup
In the table below you will find each region's possible language choices and best proxy setup:
Region | Languages | Proxy |
---|---|---|
France | None | Residential |
Belgium | french,dutch | Residential |
Spain | None | spanish IP |
Italy | None | None |
Brazil | None | None |
Saudi Arabia | arabic, english | None |
UAE | arabic, english | None |
Qatar | arabic, english | None |
Kuwait | arabic, english | None |
OUTPUT
The output can be one of two options:
Either raw
, which will scrape all data found on the website.
Or structured
which will give a well structured dataset containing the necessary information
Here is an example of a structured data:
1{ 2 "website_name": "www.carrefour.fr", 3 "competence_date": "2024-08-24 20:30:26.401663", 4 "country_code": "FR", 5 "currency_code": "EUR", 6 "product_code": "product-3616957652659", 7 "product_std_code": "3616957652659", 8 "std_type": "EAN", 9 "brand": "PARIS 2024", 10 "product_title": "T-shirt manches courtes homme bleu L Jeux Olympiques PARIS 2024", 11 "category1": "Mode et Bagagerie", 12 "category2": "Homme", 13 "category3": "T-shirts et Polos", 14 "full_price": 5.99, 15 "price": 4.19, 16 "promotion_type": "promo", 17 "promotion_end_date": "2024-09-30T23:59:00+0200", 18 "package_desc": "100% coton. Du S au XXXL", 19 "unit_type": "", 20 "additional_tags": [], 21 "itemurl": "https://www.carrefour.fr/p/t-shirt-manches-courtes-homme-bleu-l-jeux-olympiques-paris-2024-3616957652659?t=35460", 22 "imageurl": [ 23 "https://media.carrefour.fr/medias/5c12719c2ca53ab28bf5eee97972a441/p_FORMAT/3616957652659-0.jpg" 24 ] 25}
Get in Touch
For custom scraper development or any inquiries, please feel free to contact me:
- Website: Bytecone
- Email: oussamamechri@bytecone.com
- LinkedIn: https://www.linkedin.com/in/oussama-mechri-64a3191a5/
- GitHub: https://github.com/oussamadz/
Authors
- oussamamechri@protonmail.com