Audible Audiobook Catalog Scraper avatar

Audible Audiobook Catalog Scraper

Pricing

Pay per event

Go to Apify Store
Audible Audiobook Catalog Scraper

Audible Audiobook Catalog Scraper

Scrape the Audible audiobook catalog — narrators, series position, Whispersync flag, pricing, runtime, and ratings. 20+ fields per title. Seeded from Audible's official product sitemap; no search keyword required.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Categories

Share

Extract detailed audiobook data from the Audible catalog including narrators, series position, Whispersync availability, pricing, ratings, and more. Scrapes directly from Audible's published product sitemap — no search keyword required, no login needed.

What it scrapes

Each audiobook record includes 20+ fields:

FieldDescription
asinAmazon Standard Identification Number
titleAudiobook title
subtitleSubtitle when present
authorsAuthor name(s)
narratorsNarrator name(s) — the high-value field for agency lead-gen
seriesSeries name
series_positionPosition in series (e.g. "Book 1")
runtime_minutesTotal listening time in minutes
release_dateRelease date (ISO 8601)
publisherPublisher name
languageLanguage
categoriesAudible browse categories
rating_averageAverage star rating (0–5)
rating_countNumber of ratings
regular_pricePrice in USD (à la carte)
sale_priceSale price when on promotion
is_audible_originalTrue for Audible Original productions
is_whispersyncTrue if Whispersync for Voice is available
summaryFull book description
cover_urlCover image URL
sample_audio_urlDirect URL to the audio sample
product_urlCanonical Audible product page URL

How to use

Scrape the full catalog (or a slice)

Set Max Items to control how many titles to extract. Leave it low (e.g. 100) for a targeted sample, or set 0 for the full ~900k+ title catalog.

{
"maxItems": 1000
}

Scrape specific titles by URL

Paste Audible product page URLs directly. The sitemap walk is skipped and only those titles are scraped.

{
"maxItems": 10,
"startUrls": [
{"url": "https://www.audible.com/pd/Dune-Audiobook/B002V1OF70"},
{"url": "https://www.audible.com/pd/Project-Hail-Mary-Audiobook/B08G9PRS1K"}
]
}

Use cases

  • Narrator agency lead-gen — extract every narrator + audiobook + publisher for outreach
  • Price tracking — monitor regular and sale prices across the catalog
  • Recommendation engines — series position, runtime, categories for similarity matching
  • Market research — publisher distribution, rating benchmarks, Whispersync availability
  • Content audits — completeness checks across a publisher's Audible catalog

Discovery method

URLs are sourced from Audible's official product sitemap index (productDetail_sitemap_index.xml) which covers the full US catalog. Child sitemaps are gzip-compressed XML files, each containing thousands of product URLs. No API key or Audible account required.

Notes

  • The actor targets the US Audible store. International editions may show different pricing.
  • sale_price is null for most titles — Audible rarely surfaces sale pricing in page metadata.
  • The sample_audio_url is constructed from the product SKU and links to a .aax format sample.
  • Full catalog crawls are large. Start with maxItems: 1000 to evaluate data quality before scaling.