Goodreads Scraper

  • epctex/goodreads-scraper
  • Modified
  • Users 199
  • Runs 14.7k
  • Created by Author's avatarepctex

Scrape goodreads.com for data on millions of books. Crawl book details for images, ISBN, author, description, title, buy links, number of reviews, page number, language, and all other details. You can specify search terms, filters, and much more.

Free trial for 3 days

Then $15.00/month

No credit card required now

Goodreads Scraper

Free trial for 3 days

Then $15.00/month

To run the code examples, you need to have an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token. For a more detailed explanation, please read about running Actors via the API in Apify Docs.

# Set API token
API_TOKEN=<YOUR_API_TOKEN>

# Prepare Actor input
cat > input.json <<'EOF'
{
  "search": "harry potter",
  "startUrls": [
    "https://www.goodreads.com/search?q=game+of+thrones&qid=",
    "https://www.goodreads.com/book/show/59576065-acceptance",
    "https://www.goodreads.com/list/show/1362.Best_History_Books_",
    "https://www.goodreads.com/shelf/show/fiction",
    "https://www.goodreads.com/genres/business",
    "https://www.goodreads.com/author/list/1221698.Neil_Gaiman"
  ],
  "maxItems": 20,
  "endPage": 1,
  "extendOutputFunction": "($) => { return {} }",
  "customMapFunction": "(object) => { return {...object} }",
  "proxy": {
    "useApifyProxy": true
  }
}
EOF

# Run the Actor
curl "https://api.apify.com/v2/acts/epctex~goodreads-scraper/runs?token=$API_TOKEN" \
  -X POST \
  -d @input.json \
  -H 'Content-Type: application/json'