strube.de scraper avatar

strube.de scraper

Pricing

from $1.00 / 1,000 results

Go to Apify Store
strube.de scraper

strube.de scraper

Our strube.de scraper effortlessly gathers URLs from all pages and extracts detailed information from each product page

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

youssef farhan

youssef farhan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

Strube.de Noten-Scraper — Vollständige Produktdaten aus dem Strube-Webshop extrahieren

Scrape Strube Verlag product data automatically — sheet music, books, audio samples, and tiered pricing in one structured dataset. Built for music publishers, educators, and data teams who need complete catalog data from strube.de without manual copy-paste. The only Apify actor purpose-built for Strube Verlag's dynamic product structure, including bulk discount tiers and media file URLs.

What you get

Listing Info

  • Product URL
  • Title and subtitle
  • Categories (Kategorien) — multi-level
  • Article number (Artikelnummer)
  • Instrument / edition variant (Instrument / Ausgabe)
  • Product description (Beschreibung)
  • Composer / editor (Komponist)
  • Weight (Gewicht)

Pricing

  • Price per quantity tier (Price 1–N / Quantity 1–N)
  • Bulk discount tiers fetched dynamically via background requests

Media

  • PDF table-of-contents URLs (Inhaltsverzeichnisse)
  • Sheet music sample PDFs (Notenbeispiele)
  • MP3 audio preview URLs (Klangbeispiele)
  • Product image URLs (Image Links)

Sample output

{
"Product URL": "https://www.strube.de/produkt/posaunen-choralbuch-ausg-bayern-thueringen/",
"Title": "Posaunen-Choralbuch. Ausg. Bayern/Thüringen",
"Subtitle": "Hrsg. von Barbara Barsch, Erhard Frieß und Karl-Heinz Saretzki",
"Kategorien": "Bläsermusik, Bläserchor allgemein",
"Komponist": "Evangelischer Posaunendienst",
"Beschreibung": "Dieses Standardwerk bietet zu allen Melodien jeweils eine Intonation und einen vierstimmigen Begleitsatz.",
"Inhaltsverzeichnisse (one or more PDF-URL)": "",
"Notenbeispiele (one or more PDF-URLs)": "",
"Klangbeispiele (one or more MP3-URLs)": "https://www.strube.de/wp-content/uploads/2024/08/T2099.mp3",
"Image Links": "https://www.strube.de/wp-content/uploads/2019/08/B2099.GIF",
"Artikelnummer": "2099",
"Gewicht": "0.722",
"Instrument / Ausgabe": "Standard",
"Price 1": "29.50",
"Quantity 1": 1,
"Price 2": "27.00",
"Quantity 2": 5
}

Use cases

  • Music publishers auditing competitor catalogs and tracking Strube pricing changes
  • Sheet music retailers building product feeds or syncing Strube inventory to their own store
  • Music educators and choir directors compiling bulk lists of repertoire with sample links for review
  • Library acquisition teams exporting full Strube catalog metadata for procurement workflows
  • Music data aggregators indexing Noten and audio samples for search or recommendation engines
  • Research institutions analyzing the German sheet music market by instrumentation and category

Pricing

EventPrice
Per product scraped$0.001

Real example: Scrape 1000 Strube products ≈ $1.

✔ First results are free — test the output before committing to a full run.

How it works

  • Input: Provide one or more start URLs — category pages, search results, or individual product URLs from strube.de
  • Discovery: Automatically paginates through category listings to find every product
  • Deep parsing: Extracts all metadata fields including dynamic bulk-pricing tiers fetched via background requests — not just what's visible in the HTML
  • Variation handling: Each product variant (instrumentation, edition) becomes its own row in the dataset
  • Output: Data lands in your Apify Dataset, exportable as JSON, CSV, or Excel; supports scheduled runs and webhooks

Why this scraper

Most general-purpose scrapers miss three things on strube.de: the bulk discount price tiers (loaded via async requests), the multiple PDF/MP3 media URLs per product, and per-variation rows for multi-edition items. This actor handles all three natively.

  • More fields: Captures tiered pricing, all media asset URLs, weight, and subtitle — fields generic scrapers skip
  • Variation-aware: One product with 4 editions = 4 clean rows, not 1 partial record
  • No setup friction: Proxy rotation and anti-bot handling are built in — paste a URL and run

Input example

{
"startUrls": [
{ "url": "https://www.strube.de/produkt-kategorie/blaeserchor/" }
],
"maxItems": 500
}

FAQ

Does it handle Strube's dynamic bulk pricing? Yes. The actor fires the same background requests the browser uses to load quantity-tier prices and attaches them to each product row.

What output formats are available? JSON, CSV, Excel, and XML — all exportable directly from your Apify Dataset with one click.

Can I scrape a specific product category or a single product page? Yes. Pass any category URL, paginated search URL, or direct product URL as a start URL.

Does it require proxy setup? No. Residential proxy rotation is handled automatically inside the actor.

Can I schedule it to run daily? Yes. Use Apify's built-in scheduler to run on any cron schedule and receive fresh data automatically.

Can I connect this to my own pipeline via API? Yes. Call the actor via the Apify REST API and stream results directly into your database or application.

What if I need additional fields not listed here? Contact via automationbyexperts.com — custom field extraction is available.

Use via API or MCP

Call this actor programmatically via the Apify API using actor ID fayoussef/strube-de-scraper. It is also available as an MCP server for AI agents (Claude, ChatGPT, Cursor):