dm.de Products Scraper avatar

dm.de Products Scraper

Pricing

Pay per usage

Go to Apify Store
dm.de Products Scraper

dm.de Products Scraper

Scrape dm.de products, prices, descriptions & ratings at scale. Extract across categories, filters & pagination. Perfect for price monitoring, competitor analysis & market research. High-speed extraction with reliability built-in. Real-time data for beauty, health & wellness retailers.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

dm.de Product Scraper

Extract product listings from dm.de with reliable pagination and structured dataset output. Collect key product attributes such as identifiers, brand, title, pricing, ratings, images, and product URLs from category and search pages at scale.

Features

  • Multiple URL support — Works with dm.de category URLs, dm.de search URLs, and direct product listing URLs
  • Pagination handling — Automatically collects products across multiple pages
  • Clean output — Excludes empty and null values from dataset items
  • Deduplicated records — Prevents duplicate product entries across pages and inputs
  • Configurable run size — Control result count, pages, and page size

Use Cases

Category Monitoring

Track product assortment changes in specific dm.de categories over time.

Price Intelligence

Collect pricing snapshots for competitive analysis and reporting workflows.

Product Catalog Research

Build structured product datasets for downstream analytics and enrichment.


Input Parameters

ParameterTypeRequiredDefaultDescription
urlsArrayYes`[
"https://www.dm.de/haare/haarfarben"
]`One or more source URLs to scrape
results_wantedIntegerNo20Maximum number of products to store
max_pagesIntegerNo20Maximum pages to fetch per URL
proxyConfigurationObjectNo{ "useApifyProxy": false }Optional proxy settings

Output Data

Each dataset item can include:

FieldTypeDescription
sourceUrlStringInput URL that produced the record
pageNumberSource page number
danNumberdm internal product identifier
gtinNumberGlobal product identifier
brandNameStringProduct brand
titleStringProduct title
productUrlStringProduct detail URL
imageUrlsArrayProduct image URLs
priceStringDisplayed price
netPriceStringNet price when available
unitInfoArrayUnit/price info strings
ratingValueNumberRating average
ratingCountNumberNumber of ratings
categoriesArrayProduct categories

Usage Examples

Category URL

{
"urls": ["https://www.dm.de/haare/haarfarben"],
"results_wanted": 50,
"max_pages": 5
}

Search URL

{
"urls": ["https://www.dm.de/search?query=haarfarbe"],
"results_wanted": 60,
"max_pages": 4
}

Multiple URLs

{
"urls": [
"https://www.dm.de/haare/haarfarben",
"https://www.dm.de/search?query=haarfarbe"
],
"results_wanted": 120,
"max_pages": 6
}

Sample Output

{
"sourceUrl": "https://www.dm.de/haare/haarfarben",
"page": 0,
"dan": 1620805,
"gtin": 30178120,
"brandName": "L'ORÉAL PARiS PRÉFÉRENCE",
"title": "Haarkur Farbglanz Pflegebalsam, 54 ml",
"productUrl": "https://www.dm.de/p/d/1620805/l-oreal-paris-preference-haarkur-farbglanz-pflegebalsam",
"imageUrls": [
"https://products.dm-static.com/images/f_auto,q_auto,c_fit,h_320,w_320/..."
],
"price": "1,95 €",
"ratingValue": 4.9028,
"ratingCount": 72,
"categories": ["Dauerhafte Haarfarben"]
}

Tips for Best Results

Use Stable Source URLs

  • Prefer canonical category URLs for repeatable runs.
  • Use direct search URLs when you want keyword-driven output.

Tune Collection Limits

  • Start with results_wanted: 20 for quick validation.
  • Increase max_pages only when you need deeper coverage.

Proxy Usage

  • Enable proxies for large or frequent runs.
  • Keep defaults for small development runs.

Integrations

Connect your dataset with:

  • Google Sheets — Reporting and ad-hoc analysis
  • Airtable — Structured catalog workflows
  • Webhooks — Push run data to internal services
  • Make — No-code automations
  • Zapier — Trigger actions in business tools

Export Formats

  • JSON — API and app integrations
  • CSV — Spreadsheet workflows
  • Excel — Business reporting

Frequently Asked Questions

Does this scraper support multiple URLs in one run?

Yes. Provide one or more URLs in the urls array.

Will it stop automatically?

Yes. It stops when results_wanted is reached or page limits are hit.

Are empty values included in output?

No. Empty and null values are removed before records are stored.

Can I use it for dm.de search pages?

Yes. Search URLs are supported.

Can I control pagination depth?

Yes. Use max_pages and pageSize.


Support

For issues or feature requests, use the Apify Console issue channels for this actor.

Resources


This actor is designed for legitimate data collection use cases. You are responsible for complying with applicable laws and website terms when using extracted data.