
Goodreads Scraper
- epctex/goodreads-scraper
- Users 199
- Runs 14.7k
- Created by epctex
Scrape goodreads.com for data on millions of books. Crawl book details such as images, ISBN, author, description, title, buy links, number of reviews, page count, and language. You can specify search terms, start URLs, filters, and limits on the number of items and pages.
To run the code examples, you need to have an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token. For a more detailed explanation, please read about running Actors via the API in Apify Docs.
from apify_client import ApifyClient

# Initialize the ApifyClient with your API token
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "search": "harry potter",
    "startUrls": [
        "https://www.goodreads.com/search?q=game+of+thrones&qid=",
        "https://www.goodreads.com/book/show/59576065-acceptance",
        "https://www.goodreads.com/list/show/1362.Best_History_Books_",
        "https://www.goodreads.com/shelf/show/fiction",
        "https://www.goodreads.com/genres/business",
        "https://www.goodreads.com/author/list/1221698.Neil_Gaiman",
    ],
    "maxItems": 20,
    "endPage": 1,
    "extendOutputFunction": "($) => { return {} }",
    "customMapFunction": "(object) => { return {...object} }",
    "proxy": {"useApifyProxy": True},
}

# Run the Actor and wait for it to finish
run = client.actor("epctex/goodreads-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)
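
The example above hard-codes the token and the input. As a minimal sketch, the two helpers below read the token from an environment variable and build the input dict from arguments. The helper names `load_token` and `build_run_input` and the variable name `APIFY_TOKEN` are hypothetical, chosen for illustration. Whether the Actor accepts an input that omits `search` or `startUrls` is an assumption not confirmed by this page.

```python
import os

# Pass-through function strings copied from the example input above.
DEFAULT_EXTEND_OUTPUT = "($) => { return {} }"
DEFAULT_CUSTOM_MAP = "(object) => { return {...object} }"


def load_token(env_var="APIFY_TOKEN"):
    """Read the API token from the environment (variable name is an assumption)."""
    token = os.environ.get(env_var)
    if not token:
        raise RuntimeError(f"Set the {env_var} environment variable to your Apify API token")
    return token


def build_run_input(search=None, start_urls=None, max_items=20, end_page=1):
    """Build an input dict that uses the same keys as the example above."""
    run_input = {
        "maxItems": max_items,
        "endPage": end_page,
        "extendOutputFunction": DEFAULT_EXTEND_OUTPUT,
        "customMapFunction": DEFAULT_CUSTOM_MAP,
        "proxy": {"useApifyProxy": True},
    }
    # Assumption: "search" and "startUrls" may each be left out when unused.
    if search:
        run_input["search"] = search
    if start_urls:
        run_input["startUrls"] = list(start_urls)
    return run_input


# Usage with the client from the example above (requires network and a valid token):
#   client = ApifyClient(load_token())
#   run = client.actor("epctex/goodreads-scraper").call(
#       run_input=build_run_input(search="harry potter", max_items=5))
#   items = list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

Keeping the token out of the source file avoids committing it by accident, and building the input in one function makes it easier to run the same Actor with different search terms or start URLs.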