Ichard Scraper

Pricing

$15.00/month + usage

Extract comprehensive product data from Ichard.fr e-commerce website. Scrapes sitemap XML to discover all product URLs, then extracts detailed information including titles, prices, references, descriptions, categories, brands, compatible vehicle models, availability status, and product images.

Rating

0.0

(0)

Developer

Corentin Robert

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: a month ago

Ichard Scraper - Extract Product Data from Ichard.fr

Extract comprehensive product data from Ichard.fr e-commerce website. This Actor scrapes the Ichard website to provide you with a complete database of automotive parts and accessories including prices, descriptions, references, compatibility information, and availability status.

What does Ichard Scraper do?

The Ichard Scraper extracts comprehensive data from the Ichard e-commerce website, covering all products available in their catalog. The Actor automatically scrapes the sitemap XML to discover all product URLs, then extracts detailed product information including titles, prices, references, descriptions, categories, brands, compatible vehicle models, availability status, and product images. Simply provide the sitemap URL (or use the default) and the Actor will extract all product data with optimized parallel processing.

What can this Ichard Scraper do?

🚀 Key Features

  • Complete Product Coverage: Discovers every product URL from the sitemap XML automatically, so no manual URL input is required
  • Comprehensive Data Extraction: Extracts prices, references, descriptions, categories, brands, compatible vehicle models, availability status, and product images
  • Reliable Extraction: Retries failed requests automatically (up to 2 retries) and continues with the remaining products
  • Optimized Performance: Processes multiple products in parallel, with configurable concurrency
  • Structured Output: Clean, normalized data that can be exported as JSON, CSV, Excel, or HTML

🎯 Platform Advantages

This Actor runs on the Apify platform, so it also benefits from:

  • Monitoring & Logs: Real-time execution monitoring with detailed logs to track scraping progress
  • API Access: Access your data programmatically via Apify API for seamless integration
  • Scheduling: Set up automated runs on a schedule to keep your product database up-to-date
  • Integrations: Connect to Make.com, Zapier, Google Sheets, and more for automated workflows
  • Proxy Rotation: Automatic proxy management for reliable scraping without IP blocking
  • Scalability: Handle large-scale scraping with cloud infrastructure that scales automatically
  • Data Storage: Secure dataset storage with multiple export formats (JSON, CSV, Excel, HTML)

What data can Ichard Scraper extract?

The Actor extracts comprehensive data from Ichard products. Here's what you can extract:

| Data Category | Fields Extracted | Description |
|---|---|---|
| Product Information | url, title, price, priceTaxExcl, priceWithoutReduction, reference, description, descriptionFull, category, categoryName, productCategory, brand, models, availability, availabilityMessage, imageUrl, imageUrls, quantity, idProduct, taxRate, taxName, metaTitle, metaDescription, lastModified | Complete product details and information |

Detailed Field Description

Product Information

  • url: Full URL of the product page on Ichard.fr
  • title: Product title/name (e.g., "Bouchon essence chromé")
  • price: Product price including tax (TTC) as a numeric string (e.g., "26.00")
  • priceTaxExcl: Product price excluding tax (HT) as a numeric string (e.g., "21.67")
  • priceWithoutReduction: Original price before any discount
  • reference: Product reference/SKU code (e.g., "5605")
  • description: Short product description (first paragraph)
  • descriptionFull: Complete product description with all paragraphs
  • category: Product category extracted from breadcrumb (e.g., "4CV")
  • categoryName: Product category name from product data (e.g., "4CV")
  • productCategory: Product category from features (e.g., "Accessoires - Carrosserie")
  • brand: Product brand/manufacturer (e.g., "Renault")
  • models: Compatible vehicle models as comma-separated list (e.g., "4CV, Dauphine, Floride - Caravelle, Renault 8, Renault 10")
  • availability: Stock availability status code (e.g., "available")
  • availabilityMessage: Human-readable availability message (e.g., "En stock")
  • imageUrl: Full URL of the main product image
  • imageUrls: Comma-separated list of all product image URLs
  • quantity: Available stock quantity
  • idProduct: Internal product ID
  • taxRate: Tax rate percentage (e.g., "20" for 20%)
  • taxName: Tax name (e.g., "TVA FR 20%")
  • metaTitle: SEO meta title
  • metaDescription: SEO meta description
  • lastModified: Last modification date from sitemap (ISO format)

How to scrape Ichard.fr products?

Step-by-Step Tutorial

  1. Configure Input: Click on the Input tab and optionally adjust:
    • maxConcurrency: Number of parallel requests (default: 10)
    • requestTimeout: Request timeout in milliseconds (default: 30000)
    • sitemapUrl: Sitemap URL (default: https://ichard.fr/1_fr_0_sitemap.xml)
  2. Run the Actor: Click Start to begin scraping
  3. Monitor Progress: Watch the logs to see real-time progress and statistics
  4. Download Results: Once complete, download your data from the Dataset tab in JSON, CSV, Excel, or HTML format

The Actor automatically:

  • Fetches the sitemap XML from Ichard.fr
  • Filters product URLs (format: /category/id-title.html)
  • Scrapes each product page in parallel
  • Extracts all product information
  • Saves results to the dataset
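
The first two steps above (fetching the sitemap and keeping only product URLs) can be sketched in Python as follows. This is a minimal illustration, not the Actor's actual code: the function name and the regular expression are assumptions derived from the documented /category/id-title.html format, and the Actor's real filter may differ.

```python
import re
import xml.etree.ElementTree as ET

# Standard namespace used by sitemap XML files (sitemaps.org protocol).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Assumed pattern for /category/id-title.html; not taken from the Actor's source.
PRODUCT_URL_RE = re.compile(r"^https?://[^/]+/[^/]+/\d+-[^/]+\.html$")


def product_urls_from_sitemap(xml_text: str) -> list:
    """Return {'url', 'lastModified'} dicts for URLs that look like product pages."""
    root = ET.fromstring(xml_text)
    entries = []
    for url_node in root.findall("sm:url", SITEMAP_NS):
        loc = url_node.findtext("sm:loc", default="", namespaces=SITEMAP_NS).strip()
        lastmod = url_node.findtext("sm:lastmod", default="", namespaces=SITEMAP_NS).strip()
        if PRODUCT_URL_RE.match(loc):
            entries.append({"url": loc, "lastModified": lastmod})
    return entries
```

Pass the function the text of the sitemap file; category pages and other non-product URLs are dropped because they do not match the pattern.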

How much will it cost to scrape Ichard.fr?

Scraping Ichard.fr products is priced based on Compute Units (CUs) consumed during the Actor run. The cost depends on:

  • Number of products: Each product page requires one request
  • Concurrency level: Higher concurrency processes more products simultaneously but uses more resources
  • Request timeouts: Longer timeouts may consume more CUs if pages load slowly

As a rough guide by Apify plan:

  • Free plan: Test with a small number of products
  • Starter plan: Scrape hundreds of products efficiently
  • Professional plan: Handle thousands of products with optimal performance

The Actor is optimized to minimize CU consumption by using efficient Cheerio parsing (no browser overhead) and configurable concurrency. Most runs will complete in a few minutes depending on the total number of products in the catalog.

Input

Ichard Scraper has the following input options. Click on the Input tab for more information:

  • maxConcurrency (optional): Maximum number of concurrent requests. Default: 10. Increase for faster scraping, decrease to reduce server load.
  • requestTimeout (optional): Request timeout in milliseconds. Default: 30000 (30 seconds).
  • sitemapUrl (optional): URL of the sitemap XML. Default: https://ichard.fr/1_fr_0_sitemap.xml.
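
As an illustration, the input options above can be assembled in Python before being passed to a run. The helper name `build_run_input` is hypothetical and not part of the Actor or the Apify API; only the three option names and their defaults come from this page.

```python
import json

# Defaults as documented in the Input section.
DEFAULT_INPUT = {
    "maxConcurrency": 10,
    "requestTimeout": 30000,  # milliseconds
    "sitemapUrl": "https://ichard.fr/1_fr_0_sitemap.xml",
}


def build_run_input(**overrides) -> dict:
    """Merge overrides into the documented defaults, rejecting unknown option names."""
    unknown = set(overrides) - set(DEFAULT_INPUT)
    if unknown:
        raise ValueError(f"Unknown input option(s): {sorted(unknown)}")
    return {**DEFAULT_INPUT, **overrides}


if __name__ == "__main__":
    # Lower the concurrency to reduce load on the target server.
    print(json.dumps(build_run_input(maxConcurrency=5), indent=2))
```

The resulting dictionary has the same shape as the JSON you would enter in the Input tab.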

Output

You can download the dataset extracted by Ichard Scraper in various formats such as JSON, HTML, CSV, or Excel.

Output Example

{
  "url": "https://ichard.fr/4cv/13-bouchon-essence-chrome-renault-4cv-dauphine-r8-r10-floride-caravelle.html",
  "title": "Bouchon essence chromé",
  "price": "26.00",
  "priceTaxExcl": "21.67",
  "priceWithoutReduction": "26.004",
  "reference": "5605",
  "description": "Bouchon de réservoir d'essence chromé pour Renault 4CV - Dauphine R8, R10 - Floride et Caravelle",
  "descriptionFull": "Bouchon de réservoir d'essence chromé pour Renault 4CV - Dauphine R8, R10 - Floride et Caravelle",
  "category": "4CV",
  "categoryName": "4CV",
  "productCategory": "Accessoires - Carrosserie",
  "brand": "Renault",
  "models": "4CV, Dauphine, Floride - Caravelle, Renault 8, Renault 10",
  "availability": "available",
  "availabilityMessage": "En stock",
  "imageUrl": "https://ichard.fr/981-large_default/bouchon-essence-chrome-renault-4cv-dauphine-r8-r10-floride-caravelle.jpg",
  "imageUrls": "https://ichard.fr/981-large_default/bouchon-essence-chrome-renault-4cv-dauphine-r8-r10-floride-caravelle.jpg",
  "quantity": "14",
  "idProduct": "13",
  "taxRate": "20",
  "taxName": "TVA FR 20%",
  "metaTitle": "Bouchon essence chromé Renault 4CV Dauphine r8 r10 Floride Caravelle - Ichard",
  "metaDescription": "",
  "lastModified": "2023-08-22T11:27:22+02:00"
}
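
Because prices, quantities, and lists arrive as strings in the example above, a small post-processing step can make a record easier to analyze. The sketch below is one possible approach; the `normalize_record` helper is hypothetical, and it assumes that individual model names and image URLs do not themselves contain commas.

```python
from decimal import Decimal


def normalize_record(record: dict) -> dict:
    """Convert string fields from one dataset item into typed Python values."""

    def to_decimal(value):
        # Empty or missing values become None instead of raising.
        return Decimal(value) if value not in (None, "") else None

    def split_list(value):
        # Comma-separated string -> list of trimmed, non-empty parts.
        return [part.strip() for part in (value or "").split(",") if part.strip()]

    return {
        **record,
        "price": to_decimal(record.get("price")),
        "priceTaxExcl": to_decimal(record.get("priceTaxExcl")),
        "quantity": int(record["quantity"]) if record.get("quantity") else None,
        "models": split_list(record.get("models")),
        "imageUrls": split_list(record.get("imageUrls")),
    }
```

Decimal is used rather than float so that prices such as "21.67" keep their exact value.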

Tips for Best Results

Performance Optimization

  • Adjust Concurrency: Increase maxConcurrency (up to 50) for faster scraping if the website can handle it
  • Timeout Settings: Use the default timeout (30000 ms) in most cases. Increase it only if you see timeout errors in the logs
  • Monitor Progress: Check logs regularly to ensure smooth operation

Data Quality

  • Complete Extraction: The Actor extracts all available fields. Some products may have missing fields if not available on the page
  • Price Format: Prices are extracted as numeric strings (e.g., "26.00") without currency symbols, for easy analysis
  • Model Compatibility: Compatible vehicle models are extracted and deduplicated automatically

Our scrapers are ethical and do not extract any private user data, such as email addresses, personal information, or payment details. They only extract publicly available product information that is displayed on the website. We therefore believe that our scrapers, when used for ethical purposes by Apify users, are safe.

However, you should be aware that:

  • Product data is publicly available information
  • The scraper respects robots.txt and rate limits
  • You should comply with Ichard.fr's terms of service
  • Personal data protection regulations (GDPR, etc.) may apply depending on your use case

If you're unsure whether your use case is legitimate, consult your lawyers. You can also read our blog post on the legality of web scraping.

FAQ

How many products can I scrape?

The Actor can scrape all products available in the Ichard.fr sitemap. The exact number depends on the current catalog size. The Actor will automatically discover and process all product URLs from the sitemap.

What if a product page fails to load?

The Actor includes an automatic retry mechanism (up to 2 retries) for failed requests. If a page still fails after the retries, the Actor logs the error and continues processing the other products.
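
The retry behavior described above can be pictured with a short Python sketch. It is an illustration of the general pattern only, not the Actor's implementation: `fetch_with_retries`, its parameters, and the `fetch` callable are all hypothetical.

```python
import time


def fetch_with_retries(fetch, url, max_retries=2, delay_seconds=0.0):
    """Call fetch(url); on failure retry up to max_retries times, then return None."""
    last_error = None
    for attempt in range(max_retries + 1):  # one initial attempt plus the retries
        try:
            return fetch(url)
        except Exception as error:
            last_error = error
            if attempt < max_retries:
                time.sleep(delay_seconds)
    # Log and move on so one bad page does not stop the whole run.
    print(f"Giving up on {url}: {last_error}")
    return None
```

Returning None instead of raising lets the caller skip the failed product and carry on with the rest of the list.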

Can I scrape specific categories only?

Currently, the Actor scrapes all products from the sitemap. You can filter the results after extraction by category using the dataset filters or by processing the exported data.

How often should I run this Actor?

It depends on your needs. For price monitoring, you might want to run it daily or weekly. For one-time catalog extraction, a single run is sufficient.

Does the Actor support incremental updates?

Not directly: each run scrapes the full sitemap. However, every record includes the lastModified field from the sitemap, so you can filter the results on this field to keep only products added or updated since your previous run.
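
A minimal Python sketch of that filtering step follows. The `modified_since` helper is hypothetical, and the cutoff is assumed to be an ISO 8601 timestamp with a UTC offset, like the lastModified values in the output example.

```python
from datetime import datetime


def modified_since(records, cutoff_iso):
    """Keep records whose lastModified is at or after the cutoff timestamp."""
    # Both values must carry a UTC offset; mixing naive and aware datetimes raises TypeError.
    cutoff = datetime.fromisoformat(cutoff_iso)
    recent = []
    for record in records:
        stamp = record.get("lastModified")
        if not stamp:
            continue  # skip records without a sitemap date
        if datetime.fromisoformat(stamp) >= cutoff:
            recent.append(record)
    return recent
```

For example, passing the timestamp of your previous run as the cutoff returns only the products whose sitemap entry changed since then.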

Can I get support or request features?

Yes! If you encounter issues or have feature requests, please use the Issues tab on the Actor page. We're open to feedback and continuously improving the Actor based on user needs.

Need a custom solution?

If you need a customized version of this Actor for specific requirements, feel free to contact us through the Actor page. We can create tailored solutions based on your needs.

Technical Details

Extraction Process

  1. Sitemap Scraping: The Actor first scrapes the sitemap XML to extract all product URLs
  2. URL Filtering: Automatically filters URLs to keep only product pages (format: /category/id-title.html)
  3. Product Page Scraping: For each product, the Actor visits the product page to extract detailed information
  4. Data Extraction: Uses Cheerio for efficient HTML parsing and extracts data from structured HTML elements and JSON-LD
  5. Parallel Processing: Processes multiple products simultaneously with configurable concurrency for optimal performance
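
To make step 4 more concrete, here is a sketch of pulling JSON-LD blocks out of a product page. The Actor itself uses Cheerio (JavaScript); this example swaps in Python's standard-library HTML parser purely for illustration, and the class and function names are hypothetical. Which JSON-LD properties Ichard.fr actually exposes is not documented here, so treat any field names you read from the result as assumptions to verify.

```python
import json
from html.parser import HTMLParser


class JsonLdParser(HTMLParser):
    """Collect the parsed contents of <script type="application/ld+json"> tags."""

    def __init__(self):
        super().__init__()
        self._in_json_ld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_json_ld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_json_ld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_ld:
            self._in_json_ld = False
            try:
                self.items.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks instead of aborting the page


def extract_json_ld(html: str) -> list:
    """Return every JSON-LD object found in the given HTML document."""
    parser = JsonLdParser()
    parser.feed(html)
    return parser.items
```

Values missing from JSON-LD would then be read from the regular HTML elements, which is the kind of fallback listed under Error Handling.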

Error Handling

  • Automatic retry mechanism for failed requests (up to 2 retries)
  • Timeout handling for slow-loading pages
  • Graceful error handling that continues processing other products
  • Fallback extraction methods for missing data fields

Limitations

  • The scraper depends on the structure of the Ichard website. If the website structure changes significantly, the Actor may need updates
  • Some products may not have all information fields available (e.g., missing description or models)
  • Rate limiting may apply if scraping too aggressively - use default concurrency settings for best results

Resources


Need help? Check the Issues tab for common problems and solutions, or contact support through the Actor page.