Pricing

from $19.00 / 1,000 result items

Project Gutenberg Books Scraper | 70K+ Free eBooks

Export 70,000+ public-domain books from Project Gutenberg via the Gutendex API. Search by keyword, language, topic, or author lifespan, or fetch by book ID. Pull titles, authors, subjects, languages, download links, and full-text formats. Download as CSV, Excel, JSON, or XML.

Pricing

from $19.00 / 1,000 result items

Rating

0.0

(0)

Developer

ParseForge

Actor stats

Bookmarked

Total users

Monthly active users

4 days ago

Last modified

📚 Project Gutenberg (Gutendex) Scraper

🚀 Export 70,000+ public-domain books with metadata and full-text download links in seconds.

This Apify Actor extracts structured data from Project Gutenberg (Gutendex), returning clean JSON / CSV / Excel / XML datasets ready for analytics, integrations, or research workflows. Built by ParseForge for reliability and freshness.

🎯 Target Audience	💡 Primary Use Cases
Data analysts, engineers, researchers	Analytics pipelines, BI dashboards, datasets
SaaS, fintech, marketing, ops teams	Lead gen, enrichment, monitoring
Hobbyists, journalists, indie devs	Side projects, content, exploration

📋 What the Project Gutenberg (Gutendex) Scraper does

Queries the public Project Gutenberg (Gutendex) API / feed and structures the response
Returns one record per item with 10 normalized fields
Supports filters configurable from the input schema
Outputs to CSV, Excel, JSON, XML via Apify dataset
Auto-limits to 10 items on the free plan; up to 1,000,000 on paid

💡 Why it matters: clean, ready-to-query data without manual scraping, parsing, or babysitting an API client.

📊 Data fields

Each record includes: allFormats, authors, bookId, bookshelves, copyright, coverImageUrl, downloadCount, editors, epubUrl, htmlUrl, kindleUrl, languages, mediaType, plainTextUrl, readOnlineUrl, scrapedAt, subjects, summary, title, translators, url. These field names come straight from the actor's dataset schema, so what you see here is what lands in your dataset.

🚀 How to use

Create a free Apify account with $5 credit
Open this Actor and click Try for free
Configure the input (maxItems and any filters)
Click Start
Download the dataset as CSV / Excel / JSON / XML

🔗 Recommended Actors

Actor	Description
Wikipedia On This Day Scraper	Daily Wikipedia historical events
Public Holidays Scraper	Holidays for 100+ countries
SPDX Software Licenses Scraper	Open-source license metadata
ISO Country Codes Scraper	IBAN + ISO country codes

💡 Pro Tip: browse the complete ParseForge collection.

⚠️ Disclaimer: independent tool, not affiliated with Project Gutenberg (Gutendex). Only publicly available data collected.

🆘 Need Help?

If you hit a bug, have questions about setup, or need a scraper we haven't built yet, open our contact form or write to parseforge@protonmail.com. We also take on paid custom data projects.

For faster answers, join our Discord. It's the best place to get support and suggest new actors.

Gutenberg Books Scraper

fortuitous_pirate/gutenberg-books-scraper

Scrape book metadata from Project Gutenberg: 70,000+ free public domain ebooks. Search by title, author, topic, or language. Returns authors, subjects, formats, and download links.

Fortuitous Pirate

Gutendex Books Scraper - Gutenberg Metadata

benthepythondev/gutendex-books-scraper

Search Project Gutenberg books and export ID, title, authors, subjects, languages, copyright status, downloads, formats and links.

Ben

Gutenberg Scraper

velvety_bedbug/gutenberg-scraper

Scrape free public domain books from Project Gutenberg via the Gutendex API. Search by title/author, filter by topic, language, or author birth/death year. Returns book metadata and download URLs for text, HTML, and EPUB formats. 78,000+ books. Free, no auth required.

Peters Bugs

Project Gutenberg Scraper

lulzasaur/gutenberg-scraper

Scrape Project Gutenberg (gutenberg.org). Search 70K+ free public domain ebooks. Extract titles, authors, subjects, download formats (EPUB, Kindle, TXT, HTML), and full metadata.

lulz bot

Project Gutenberg Books Scraper

gio21/gutenberg-books-scraper

Scrape public-domain books from Project Gutenberg via the Gutendex API. Filter by topic, author, language, search query. Returns title, authors, languages, copyright, download_count, formats (EPUB, MOBI, TXT, HTML), subjects, bookshelves. Pay per book returned.

Gio

Project Gutenberg Top Books Scraper

rambunctious_fingerprint/project-gutenberg-scraper

Casey Marsh

Project Gutenberg Books Scraper

parseforge/project-gutenberg-books-scraper

Search 75,000+ free public-domain books from Project Gutenberg. Returns title, author with birth/death years, cover image, plain-text and EPUB download URLs, Kindle and HTML formats, subjects, bookshelves, language, copyright status, summaries and download counts. Filter by author or language.

ParseForge

Project Gutenberg Scraper

crawlerbros/project-gutenberg-scraper

Search and download Project Gutenberg's 75,000+ free ebooks. Filter by keyword, topic, language, author era, copyright status, and available format (EPUB, Kindle, PDF, plain text).

Crawler Bros

Project Gutenberg Ebook Scraper (Gutendex)

jungle_synthesizer/gutenberg-gutendex-public-domain-ebook-scraper

Scrape the full Project Gutenberg catalog via the Gutendex JSON API. Filter by search, language, subject, author era, and download count. Returns EPUB, Kindle, plain-text, and HTML download URLs — built for AI training corpora, NLP datasets, and TTS pipelines.

BowTiedRaccoon

Project Gutenberg Research Scraper

happyfhantum/project-gutenberg-research-scraper

Exhaustively searches Project Gutenberg's 70,000+ free ebooks using multi-page pagination and smart filtering. Perfect for academic research, finding complete author works, or discovering books on specialized topics. Gets all results, not just the first page.