Goodreads Scraper avatar

Goodreads Scraper

Pricing

Pay per event

Go to Apify Store
Goodreads Scraper

Goodreads Scraper

Extract book data from Goodreads: titles, authors, ratings, reviews, genres, ISBN, pages, format, publication date, awards, and more. Accepts book or author URLs as input.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

Extract book data from Goodreads. Titles, authors, ratings, reviews, genres, ISBN — in clean JSON, without writing a single line of code.

Point it at a book URL or author page. It does the rest.

What you get

18 fields per book record. Not estimated fields — actual verified output from the live site.

FieldTypeDescription
urlstringCanonical Goodreads book URL
goodreads_idstringNumeric Goodreads book ID
titlestringBook title
authorsstringAuthor name(s), comma-separated
ratingnumberAverage star rating (e.g. 4.31)
rating_countintegerTotal number of ratings
review_countintegerTotal number of text reviews
isbnstringISBN-13
genresstringGenres, comma-separated
descriptionstringFull book synopsis
pagesintegerPage count
book_formatstringFormat (Hardcover, Paperback, etc.)
languagestringPrimary language
published_datestringFirst publication date
awardsstringNotable awards, comma-separated
cover_imagestringCover image URL
seriesstringSeries name and number, if applicable
scraped_atstringISO timestamp of extraction

How to use it

1. Add book or author URLs

Go to any Goodreads book page and copy the URL:

https://www.goodreads.com/book/show/375802.Of_Mice_and_Men

Or use an author page to collect all the books on that author's profile:

https://www.goodreads.com/author/show/589.Orson_Scott_Card

2. Set an optional limit

Leave Max Items blank to scrape everything. Set it to a specific number to cap the run.

3. Run and download

Results land in the dataset. Export as JSON, CSV, or JSONL.

Data source notes

Data comes from structured metadata embedded in each book page — not fragile screen scraping. The scraper reads schema.org/Book JSON-LD for core fields (title, ISBN, rating, format, language, awards) and supplements with targeted CSS selectors for descriptions and genres.

No proxy required. No Cloudflare bypass required. The site serves full HTML to a standard Chrome fingerprint.

Input configuration

ParameterTypeRequiredDescription
startUrlsarrayYesGoodreads book or author page URLs to scrape
maxItemsintegerNoMaximum number of books to return. Leave blank for no cap.

Use cases

  • Book research — collect metadata on reading lists, award winners, genre catalogues
  • Ratings analysis — benchmark a book's reception against competitors or comparable titles
  • Publishing intelligence — track how an author's catalog performs across formats and editions
  • Recommendation systems — seed training data with structured book metadata, genres, and ratings
  • Personal use — export your reading list's data for use in spreadsheets or Notion

Pricing

Pay-per-event. Each book record counts as one billable event. The start fee covers infrastructure.


Built by OrbTop. Questions or issues? Use the feedback fields in the input form.