Project Gutenberg Books Scraper

Pricing

$2.00 / 1,000 books scraped

Scrape public-domain books from Project Gutenberg via the Gutendex API. Filter by topic, author, language, or search query. Each result includes title, authors, languages, copyright, download_count, formats (EPUB, MOBI, TXT, HTML), subjects, and bookshelves. You pay per book returned.

Developer

Gio

Maintained by Community

Use cases

  • Build a public-domain ebook library
  • Mine classic literature for ML training
  • Discover books by topic, language, or popularity

Input

  • search — title/author search
  • topic — bookshelf/subject substring (fiction, history, science…)
  • languages — comma-separated two-letter language codes (for example en,fr)
  • sort — popular (by download count), ascending or descending (by Project Gutenberg ID)
  • maxItems — number of books to return
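
As a rough illustration of how these inputs could translate into a Gutendex request, here is a minimal Python sketch. It assumes the actor passes search, topic, languages, and sort straight through as Gutendex query parameters of the same name and applies maxItems on the client side; that mapping is an assumption, and build_query_url is a hypothetical helper, not part of the actor.

```python
from urllib.parse import urlencode

GUTENDEX_URL = "https://gutendex.com/books"

def build_query_url(actor_input: dict) -> str:
    """Build a Gutendex URL from actor-style input (hypothetical helper).

    Assumes each filter maps to a Gutendex query parameter of the same name.
    maxItems is not sent to the API; it would cap the results client-side.
    """
    params = {}
    for key in ("search", "topic", "languages", "sort"):
        value = actor_input.get(key)
        if value:  # skip filters that are missing or empty
            params[key] = value
    query = urlencode(params)
    return f"{GUTENDEX_URL}?{query}" if query else GUTENDEX_URL

# Example input using the fields listed above (values are illustrative).
example_input = {
    "search": "dickens",
    "topic": "fiction",
    "languages": "en,fr",
    "sort": "popular",
    "maxItems": 50,
}
print(build_query_url(example_input))
```

Leaving a filter out of the input simply drops it from the query string, so an empty input requests the unfiltered book list.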

Output

Each item: { id, title, authors, translators, subjects, bookshelves, languages, copyright, download_count, formats, summaries, url, scrapedAt }

The formats field contains direct download URLs for EPUB, MOBI, TXT, and HTML.
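
To show how a consumer might use the formats field, the sketch below picks one download URL per item by preference. It assumes formats keeps the Gutendex shape, a mapping from MIME type to URL, where keys can carry a suffix such as "; charset=utf-8". The helper name, the preference order, and the sample item (with placeholder example.org URLs) are all hypothetical.

```python
from typing import Optional

# Preference order is an illustrative choice, not something the actor defines.
PREFERRED_TYPES = (
    "application/epub+zip",            # EPUB
    "application/x-mobipocket-ebook",  # MOBI
    "text/plain",                      # TXT
    "text/html",                       # HTML
)

def pick_download_url(formats: dict, preferred=PREFERRED_TYPES) -> Optional[str]:
    """Return the URL of the first preferred MIME type present, else None."""
    for wanted in preferred:
        for mime_type, url in formats.items():
            # Prefix match so "text/plain; charset=utf-8" counts as text/plain.
            if mime_type.startswith(wanted):
                return url
    return None

# Hypothetical item with placeholder URLs, shaped like the output above.
sample_item = {
    "id": 1,
    "title": "Sample Book",
    "formats": {
        "text/html": "https://example.org/books/1.html",
        "text/plain; charset=utf-8": "https://example.org/books/1.txt",
    },
}
print(pick_download_url(sample_item["formats"]))
```

Returning None when nothing matches lets the caller decide whether to skip the book or fall back to another format.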

Pricing

$0.002 per book returned ($2.00 per 1,000 books).

How it works

The actor queries the public Gutendex API (gutendex.com/books). The data is open: no authentication is required and there are no anti-bot measures.
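
For readers curious what such a fetch loop might look like, here is a minimal sketch rather than the actor's actual code. It relies on the Gutendex response shape, a JSON object with a results list and a next URL that is null on the last page. The names iter_books and fetch_json are hypothetical, and the fetch function is injectable so the loop can be exercised without network access.

```python
import json
from typing import Callable, Iterator
from urllib.request import urlopen

def fetch_json(url: str) -> dict:
    """Download one page of results and decode it as JSON."""
    with urlopen(url, timeout=30) as response:
        return json.load(response)

def iter_books(start_url: str, max_items: int,
               fetch: Callable[[str], dict] = fetch_json) -> Iterator[dict]:
    """Yield up to max_items books, following each page's "next" link."""
    url = start_url
    yielded = 0
    while url and yielded < max_items:
        page = fetch(url)
        for book in page.get("results", []):
            if yielded >= max_items:
                return  # stop mid-page once the cap is reached
            yield book
            yielded += 1
        url = page.get("next")  # None on the last page ends the loop

# Usage (performs real HTTP requests, so it is left as a comment):
# for book in iter_books("https://gutendex.com/books?languages=en", max_items=5):
#     print(book["id"], book["title"])
```

Because the generator stops as soon as the cap is reached, no further pages are requested after the last needed book.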