Amazon Book Scraper — Books Data & Metadata Extractor avatar

Amazon Book Scraper — Books Data & Metadata Extractor

Pricing

$8.99/month + usage

Go to Apify Store
Amazon Book Scraper — Books Data & Metadata Extractor

Amazon Book Scraper — Books Data & Metadata Extractor

Scrape Amazon books data from any keyword, URL, or ASIN list. Get full book metadata — title, author, rating, reviews, price, publisher, pages, language, and cover image. Supports 7 Amazon marketplaces. No login. $8.99/month. 2-hour free trial.

Pricing

$8.99/month + usage

Rating

0.0

(0)

Developer

Scrape Pilot

Scrape Pilot

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

2

Monthly active users

4 days ago

Last modified

Share

📚 Amazon Book Scraper — Books Data & Metadata Extractor

The most complete Amazon Book Scraper on Apify. Extract full Amazon books data from any keyword search, direct book URL, or bulk ASIN list — title, author, rating, reviews, price, description, publisher, publication date, page count, language, availability status, and high-resolution cover image. Supports 7 Amazon marketplaces. No login. No API key. Instant structured output.


📌 Table of Contents


🔍 What Is This Actor?

Amazon Book Scraper is a production-ready Apify actor that extracts complete Amazon books data and Amazon book metadata from any keyword search, direct Amazon book URL, or bulk list of ASINs — across 7 Amazon marketplaces.

Provide a search keyword like "machine learning" or "Stephen King", paste a direct Amazon book URL, or supply a list of ASINs — and receive back a clean, structured dataset for every book found: title, author, star rating, review count, price, full description, publisher, publication date, page count, language, availability status, and cover image URL.

This Amazon book scraper handles keyword search with sort options, pagination across multiple result pages, direct detail page extraction with automatic fallback URL formats, and partial record recovery when full data is unavailable — making it the most reliable Amazon books data tool on Apify.


🚀 Why Use This Amazon Book Scraper?

FeatureThis ActorManual ResearchAmazon APIOther Scrapers
Keyword search → bulk books data✅ Paginated❌ Slow⚠️ Limited⚠️
Direct URL + ASIN bulk input✅ Both modes⚠️
Full Amazon book metadata✅ 15 fields⚠️ Partial⚠️
Publisher, pub date, pages, language⚠️
Price & availability status✅ Manual⚠️⚠️
High-resolution cover image⚠️⚠️
7 Amazon marketplaces⚠️
Sort by bestseller, new, reviews✅ Built-in
No login or API keyN/A❌ Required
Export to CSV / Excel✅ Via Apify

Bottom line: This Amazon book scraper is the only actor that combines keyword search with sort options, direct ASIN lookup, multi-marketplace support, and a complete 15-field Amazon book metadata record — all in one tool with no credentials needed.


🌍 Supported Marketplaces

CodeMarketplaceDomain
usUnited Statesamazon.com
ukUnited Kingdomamazon.co.uk
deGermanyamazon.de
inIndiaamazon.in
caCanadaamazon.ca
auAustraliaamazon.com.au
jpJapanamazon.co.jp

Simply set the country input to the marketplace code. The actor automatically targets the correct Amazon domain and currency for that market.


🎯 Use Cases

📊 Book Market Research & Publishing Intelligence

  • Scrape Amazon books data for any genre or topic to analyze pricing trends, rating distributions, and review volumes
  • Identify bestselling books in a category with full metadata for competitive publishing research
  • Track new releases, publication dates, and publisher activity across Amazon marketplaces

🛒 Price Monitoring & Comparison

  • Monitor Amazon book prices across multiple marketplaces (US, UK, DE, IN) for arbitrage or pricing strategy
  • Track price changes on a watchlist of ASINs by scheduling regular scraper runs
  • Compare paperback, hardcover, and Kindle pricing using the same ASIN list

🤖 AI & Recommendation Systems

  • Build book recommendation datasets by scraping Amazon book metadata — title, author, genre, description, ratings
  • Collect training data for NLP models using book descriptions, titles, and category tags
  • Extract cover images and metadata for visual book recommendation interfaces

🏪 E-Commerce & Affiliate Integrations

  • Populate a book directory, affiliate site, or comparison platform with structured Amazon books data
  • Automate product catalog updates by re-scraping ASINs on a schedule
  • Build structured book listings with prices, ratings, and descriptions for content sites

🎓 Academic & Library Research

  • Collect publication metadata (publisher, date, pages, language) for academic bibliographic research
  • Build structured datasets of books in specific fields for literature review automation
  • Study Amazon review and rating patterns across genres for consumer behavior research

📰 Content Creation & Journalism

  • Research book topics, authors, and publication histories at scale for editorial content
  • Gather structured book metadata for book review sites, newsletters, or media platforms
  • Track Amazon bestseller rankings and new releases for publishing industry reporting

⚙️ Input Parameters

{
"keyword": "machine learning",
"url": "",
"urls": [],
"country": "us",
"sort_by": "bestseller",
"max_results": 20,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}
ParameterTypeDefaultDescription
keywordstring""Search keyword — any book title, author name, topic, or ISBN (e.g. "python programming", "Stephen King")
urlstring""Single Amazon book URL or ASIN — processed as a direct detail page lookup
urlsarray or string[]Multiple Amazon book URLs or bare ASINs for bulk extraction — newline-separated string also accepted
countrystring"us"Target Amazon marketplace — "us", "uk", "de", "in", "ca", "au", "jp"
sort_bystring"relevance"Search result sort order — "relevance", "bestseller", "new", "avg_review"
max_resultsinteger20Maximum books to return across all input modes
proxyConfigurationobjectResidentialApify proxy config — residential proxy strongly recommended for Amazon

Tip: You can combine keyword and urls in the same run. Keyword search results are processed first, then direct URLs and ASINs are scraped individually. All results are merged into a single output dataset.


📋 Output Fields

Every record from this Amazon book scraper includes complete Amazon book metadata:

FieldTypeDescriptionExample
urlstringFull Amazon book page URL"https://www.amazon.com/dp/B08XY..."
asinstringAmazon Standard Identification Number"B08XY12345"
titlestringFull book title (max 500 chars)"Deep Learning with Python"
authorstringAuthor name(s)"François Chollet"
ratingfloatAverage star rating4.6
reviews_countintegerTotal customer review count2841
pricestringListed price with currency symbol"$39.99"
descriptionstringFull book description (max 2000 chars)"The definitive guide to..."
imagestringHigh-resolution cover image URL"https://images-na.ssl-images-amazon.com/..."
publisherstringPublisher name"Manning Publications"
pub_datestringPublication date"October 14, 2021"
pages_countintegerTotal page count504
languagestringBook language"English"
statusstringAvailability status"available", "out_of_stock", "unavailable"
scraped_atstringExtraction timestamp (ISO 8601 UTC)"2024-03-15T10:30:00Z"

📦 Example Input & Output

Input — keyword search:

{
"keyword": "deep learning",
"country": "us",
"sort_by": "bestseller",
"max_results": 3
}

Output (one record):

{
"url": "https://www.amazon.com/dp/B08XY12345/",
"asin": "B08XY12345",
"title": "Deep Learning with Python, Second Edition",
"author": "François Chollet",
"rating": 4.6,
"reviews_count": 2841,
"price": "$39.99",
"description": "The definitive guide to deep learning using Python and Keras. Revised and updated to cover the latest deep learning techniques...",
"image": "https://images-na.ssl-images-amazon.com/images/I/81abc123.jpg",
"publisher": "Manning Publications",
"pub_date": "October 14, 2021",
"pages_count": 504,
"language": "English",
"status": "available",
"attempt": 1,
"scraped_at": "2024-03-15T10:30:00Z"
}

💰 Pricing & Free Trial

PlanPriceIncludes
Free Trial$02 hours full access — no credit card required
Monthly$8.99 / monthUnlimited runs, all input modes, all 7 marketplaces

Everything included in every plan:

  • ✅ Keyword search with pagination and sort options
  • ✅ Direct URL and bulk ASIN extraction
  • ✅ Complete Amazon book metadata — 15 fields per book
  • ✅ 7 Amazon marketplaces (US, UK, DE, IN, CA, AU, JP)
  • ✅ Sort by relevance, bestseller, newest, or average review
  • ✅ Availability status per book
  • ✅ High-resolution cover image URL
  • ✅ JSON + CSV + Excel export from Apify dataset
  • ✅ Scheduled runs for automated price and metadata monitoring

Start your 2-hour free trial now — no credit card needed. Click Try for free at the top of this page.


⚡ Performance & Limits

ModeCountEstimated Time
Single book URL or ASIN1~8–20 seconds
Keyword search20 books~3–6 minutes
Bulk ASINs20 books~4–8 minutes
Keyword search (paginated)50 books~12–20 minutes
  • Results are pushed to the Apify dataset in real time as each book is processed
  • Partial records are saved for books where full detail page extraction is blocked
  • Per-URL retry logic with automatic alternate URL format fallback
  • Residential proxy strongly recommended for reliable Amazon access at any volume

❓ FAQ

Q: Can I scrape books from multiple Amazon marketplaces in one run? A: Each run targets one marketplace via the country input. For multi-marketplace scraping, run the actor multiple times with different country values, or use Apify's task scheduling to run them in parallel.

Q: Can I input bare ASINs without a full URL? A: Yes. The urls field accepts bare 10-character ASINs (e.g. B08XY12345) directly — the actor automatically builds the correct Amazon URL for the selected marketplace.

Q: What does status: "partial_search_only" mean? A: When a book detail page cannot be fetched (blocked or slow), the actor saves a partial record using the data available from the search results page — title, author, rating, reviews, price, and image — rather than losing the record entirely.

Q: Why is price null for some books? A: Some Amazon book listings do not display a public price — this is common for books sold exclusively through third-party sellers, pre-order titles, or marketplace-only listings. The field returns null when no price is present on the page.

Q: Can I sort search results by bestseller or newest release? A: Yes. Use the sort_by parameter: "bestseller" for Amazon bestseller rank order, "new" for newest releases first, "avg_review" for highest rated, or "relevance" for default search relevance.

Q: Is residential proxy required? A: Amazon actively blocks datacenter IP addresses. A residential proxy makes requests appear to come from regular home internet connections, which significantly improves reliability — especially for keyword searches and high-volume ASIN lookups. It is strongly recommended.

Q: Can I export results to Excel or CSV? A: Yes. All results are pushed to the Apify dataset, which can be exported to JSON, CSV, Excel, and more directly from the Apify Console after each run completes.

Q: What happens if Amazon blocks a specific book page? A: The actor automatically retries with alternate URL formats, rotates browser fingerprints, and applies backoff delays. If all attempts fail, a partial record with available data is saved and the run continues with the remaining books.


📜 Changelog

v2.0.0 (Current)

  • ✅ Three input modes: keyword search, direct URL, and bulk ASIN list
  • ✅ Full Amazon book metadata — 15 fields per record
  • ✅ Publisher, publication date, page count, and language extraction
  • ✅ Availability status detection per book
  • ✅ High-resolution cover image URL
  • ✅ 7 Amazon marketplace support (US, UK, DE, IN, CA, AU, JP)
  • ✅ Sort options: relevance, bestseller, newest, average review
  • ✅ Automatic pagination across multiple search result pages
  • ✅ Alternate URL format fallback for 404 and blocked pages
  • ✅ Partial record recovery from search results when detail page fails
  • ✅ Proxy rotation support for high-volume runs
  • ✅ Real-time dataset push as each book is processed

v1.0.0

  • Initial release with basic keyword search and core field extraction

🏷️ Tags

amazon book scraper amazon books data amazon book metadata amazon scraper book data extractor amazon asin scraper book price tracker amazon search scraper book metadata amazon product scraper book research tool amazon bestseller scraper


This actor accesses publicly visible Amazon book listing pages in the same way a regular user browses the Amazon website.

Please note:

  • Use extracted Amazon books data only for lawful purposes — research, price monitoring, content creation, affiliate marketing, and academic use are common legitimate applications
  • Do not use this Amazon book scraper to systematically copy Amazon's catalog for redistribution or to build a competing retail platform
  • Respect Amazon's Terms of Service — do not use this tool at volumes designed to overload or disrupt Amazon's infrastructure
  • Book descriptions, cover images, and metadata are Amazon's intellectual property — always credit the source appropriately in your application
  • The actor developer is not responsible for how extracted Amazon book metadata is used

🤝 Support & Feedback

  • Bug report? Contact us via the Apify actor page
  • Feature request? Post in the Apify Community forum
  • Loving it? Please leave a ⭐ review — it helps other users find this actor!

Built with ❤️ on Apify
The most complete Amazon Book Scraper — full metadata, 7 marketplaces, keyword search & bulk ASIN

💰 $8.99/month · 🆓 2-hour free trial · No credit card required