Goodreads Scraper
Pricing
Pay per event
Goodreads Scraper
Extract book data from Goodreads: titles, authors, ratings, reviews, genres, ISBN, pages, format, publication date, awards, and more. Accepts book or author URLs as input.
Pricing
Pay per event
Rating
0.0
(0)
Developer
BowTiedRaccoon
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
4 days ago
Last modified
Categories
Share
Extract book data from Goodreads. Titles, authors, ratings, reviews, genres, ISBN — in clean JSON, without writing a single line of code.
Point it at a book URL or author page. It does the rest.
What you get
18 fields per book record. Not estimated fields — actual verified output from the live site.
| Field | Type | Description |
|---|---|---|
url | string | Canonical Goodreads book URL |
goodreads_id | string | Numeric Goodreads book ID |
title | string | Book title |
authors | string | Author name(s), comma-separated |
rating | number | Average star rating (e.g. 4.31) |
rating_count | integer | Total number of ratings |
review_count | integer | Total number of text reviews |
isbn | string | ISBN-13 |
genres | string | Genres, comma-separated |
description | string | Full book synopsis |
pages | integer | Page count |
book_format | string | Format (Hardcover, Paperback, etc.) |
language | string | Primary language |
published_date | string | First publication date |
awards | string | Notable awards, comma-separated |
cover_image | string | Cover image URL |
series | string | Series name and number, if applicable |
scraped_at | string | ISO timestamp of extraction |
How to use it
1. Add book or author URLs
Go to any Goodreads book page and copy the URL:
https://www.goodreads.com/book/show/375802.Of_Mice_and_Men
Or use an author page to collect all the books on that author's profile:
https://www.goodreads.com/author/show/589.Orson_Scott_Card
2. Set an optional limit
Leave Max Items blank to scrape everything. Set it to a specific number to cap the run.
3. Run and download
Results land in the dataset. Export as JSON, CSV, or JSONL.
Data source notes
Data comes from structured metadata embedded in each book page — not fragile screen scraping. The scraper reads schema.org/Book JSON-LD for core fields (title, ISBN, rating, format, language, awards) and supplements with targeted CSS selectors for descriptions and genres.
No proxy required. No Cloudflare bypass required. The site serves full HTML to a standard Chrome fingerprint.
Input configuration
| Parameter | Type | Required | Description |
|---|---|---|---|
startUrls | array | Yes | Goodreads book or author page URLs to scrape |
maxItems | integer | No | Maximum number of books to return. Leave blank for no cap. |
Use cases
- Book research — collect metadata on reading lists, award winners, genre catalogues
- Ratings analysis — benchmark a book's reception against competitors or comparable titles
- Publishing intelligence — track how an author's catalog performs across formats and editions
- Recommendation systems — seed training data with structured book metadata, genres, and ratings
- Personal use — export your reading list's data for use in spreadsheets or Notion
Pricing
Pay-per-event. Each book record counts as one billable event. The start fee covers infrastructure.
Built by OrbTop. Questions or issues? Use the feedback fields in the input form.