Pricing

from $10.00 / 1,000 results

Project Gutenberg Scraper

Scrape Project Gutenberg (gutenberg.org). Search 70K+ free public domain ebooks. Extract titles, authors, subjects, download formats (EPUB, Kindle, TXT, HTML), and full metadata.

Pricing

from $10.00 / 1,000 results

Rating

0.0

(0)

Developer

lulz bot

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

Features

Search by title/author: Find books by any keyword
Filter by topic: Browse by subject like "science fiction", "philosophy", "children"
Filter by language: English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese
Full metadata: Authors with birth/death years, subjects, bookshelves, download counts
Download links: Direct URLs for EPUB, HTML, plain text, Kindle, and cover images
Pagination: Automatically follows paginated results up to your limit

Output Fields

Field	Description
`id`	Gutenberg book ID
`title`	Book title
`authors`	Array of authors with name, birthYear, deathYear
`subjects`	Array of Library of Congress subjects
`bookshelves`	Array of Gutenberg bookshelves
`languages`	Array of language codes (e.g. "en", "fr")
`downloadCount`	Total download count
`formats`	Object with epub, html, txt, kindle, coverImage URLs
`copyright`	Boolean copyright status
`mediaType`	Media type (usually "Text")
`scrapedAt`	ISO timestamp

Input Options

Search Query: Search by title or author name
Topic: Filter by subject/bookshelf
Language: Filter by language
Max Results: Limit number of books (default 50, max 5000)

Use Cases

Digital library building: Bulk download public domain books
Literary research: Analyze authors, subjects, and popularity trends
NLP/AI training: Gather text corpora by language or topic
Education: Find free reading materials by subject area
Data journalism: Analyze most popular public domain works

Example Output

{
  "id": 1342,
  "title": "Pride and Prejudice",
  "authors": [{"name": "Austen, Jane", "birthYear": 1775, "deathYear": 1817}],
  "subjects": ["Courtship -- Fiction", "England -- Fiction", "Sisters -- Fiction"],
  "bookshelves": ["Best Books Ever Listings"],
  "languages": ["en"],
  "downloadCount": 75892,
  "formats": {
    "epub": "https://www.gutenberg.org/ebooks/1342.epub3.images",
    "html": "https://www.gutenberg.org/files/1342/1342-h/1342-h.htm",
    "txt": "https://www.gutenberg.org/files/1342/1342-0.txt"
  },
  "copyright": false,
  "scrapedAt": "2026-04-26T12:00:00.000Z"
}

Run on Apify

This scraper runs on the Apify platform -- a full-stack web scraping and automation cloud. Sign up for a free account to get started with 30-day trial of all features.

Try Apify free ->

Gutenberg Books Scraper

fortuitous_pirate/gutenberg-books-scraper

Scrape book metadata from Project Gutenberg: 70,000+ free public domain ebooks. Search by title, author, topic, or language. Returns authors, subjects, formats, and download links.

Fortuitous Pirate

Project Gutenberg Scraper

crawlerbros/project-gutenberg-scraper

Search and download Project Gutenberg's 75,000+ free ebooks. Filter by keyword, topic, language, author era, copyright status, and available format (EPUB, Kindle, PDF, plain text).

Crawler Bros

Project Gutenberg Books Scraper | 70K+ Free eBooks

parseforge/gutendex-project-gutenberg-books-scraper

Export 70,000+ public-domain books from Project Gutenberg via the Gutendex API. Search by keyword, language, topic, or author lifespan, or fetch by book ID. Pull titles, authors, subjects, languages, download links, and full-text formats. Download as CSV, Excel, JSON, or XML.

ParseForge

Project Gutenberg Top Books Scraper

rambunctious_fingerprint/project-gutenberg-scraper

Casey Marsh

Project Gutenberg Books Scraper

parseforge/project-gutenberg-books-scraper

Search 75,000+ free public-domain books from Project Gutenberg. Returns title, author with birth/death years, cover image, plain-text and EPUB download URLs, Kindle and HTML formats, subjects, bookshelves, language, copyright status, summaries and download counts. Filter by author or language.

ParseForge

Gutendex Books Scraper - Gutenberg Metadata

benthepythondev/gutendex-books-scraper

Search Project Gutenberg books and export ID, title, authors, subjects, languages, copyright status, downloads, formats and links.

Ben

Project Gutenberg Books Scraper

gio21/gutenberg-books-scraper

Scrape public-domain books from Project Gutenberg via the Gutendex API. Filter by topic, author, language, search query. Returns title, authors, languages, copyright, download_count, formats (EPUB, MOBI, TXT, HTML), subjects, bookshelves. Pay per book returned.

Gio

Gutenberg Scraper

velvety_bedbug/gutenberg-scraper

Scrape free public domain books from Project Gutenberg via the Gutendex API. Search by title/author, filter by topic, language, or author birth/death year. Returns book metadata and download URLs for text, HTML, and EPUB formats. 78,000+ books. Free, no auth required.

Peters Bugs

Project Gutenberg Ebook Scraper (Gutendex)

jungle_synthesizer/gutenberg-gutendex-public-domain-ebook-scraper

Scrape the full Project Gutenberg catalog via the Gutendex JSON API. Filter by search, language, subject, author era, and download count. Returns EPUB, Kindle, plain-text, and HTML download URLs — built for AI training corpora, NLP datasets, and TTS pipelines.

BowTiedRaccoon

Project Gutenberg Research Scraper

happyfhantum/project-gutenberg-research-scraper

Exhaustively searches Project Gutenberg's 70,000+ free ebooks using multi-page pagination and smart filtering. Perfect for academic research, finding complete author works, or discovering books on specialized topics. Gets all results, not just the first page.