This script helps you scrape a list of book URLs from the Projekt Gutenberg website, filter out unwanted URLs, and download the corresponding ePub files using the epub2go service.
A friend of mine complained that Projekt Gutenberg hides the ePub versions of the books they digitized behind a paywall in their store. He wanted all the books in ePub format, and I decided to make that happen, since the books are already readily available as HTML. After some research, I stumbled upon the epub2go service, which makes it easy to convert the books from HTML to ePub without any local dependencies or computation.
This script automates the process of downloading books from Projekt Gutenberg, converting them to ePub format using the epub2go service, and storing the converted files on your local machine*.
(*This is currently quite ugly since it just dumps them all into the script's working directory)
Scrapes book URLs from Projekt Gutenberg
Filters out unwanted URLs (those that aren't books)
Downloads converted ePub files using the epub2go service
Adds a delay between requests to avoid overloading the service (a sketch of this loop follows)
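A minimal sketch of that loop, assuming Selenium with headless Chrome for the scraping and requests for the downloads. The start URL, the link filter, and especially the epub2go endpoint are placeholder assumptions for illustration, not the service's documented API:

import time
from pathlib import Path

import requests
from selenium import webdriver
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.common.by import By

CHROMEDRIVER_PATH = "/usr/local/bin/chromedriver"  # placeholder -- use your own path
START_URL = "https://www.projekt-gutenberg.org/info/texte/allworka.html"  # assumed index page
EPUB2GO_ENDPOINT = "https://www.epub2go.eu/convert"  # hypothetical endpoint
DELAY_SECONDS = 5  # pause between conversions to be gentle with the service

def scrape_book_urls() -> list[str]:
    """Render the index with headless Chrome and collect candidate links."""
    options = webdriver.ChromeOptions()
    options.add_argument("--headless=new")
    driver = webdriver.Chrome(service=Service(CHROMEDRIVER_PATH), options=options)
    try:
        driver.get(START_URL)
        hrefs = [a.get_attribute("href") for a in driver.find_elements(By.TAG_NAME, "a")]
    finally:
        driver.quit()
    # Crude stand-in for the real filter: keep only links that look like book pages.
    return [h for h in hrefs if h and "projekt-gutenberg.org" in h]

def download_epub(book_url: str) -> None:
    """Ask the (assumed) epub2go endpoint to convert a book page and save the result."""
    response = requests.get(EPUB2GO_ENDPOINT, params={"url": book_url}, timeout=60)
    response.raise_for_status()
    filename = book_url.rstrip("/").split("/")[-1] or "book"
    Path(f"{filename}.epub").write_bytes(response.content)  # dumped into the working directory

if __name__ == "__main__":
    for url in scrape_book_urls():
        download_epub(url)
        time.sleep(DELAY_SECONDS)

Writing the files straight into the working directory mirrors the current behaviour noted above; the directory-structure idea further down would replace that last write.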
Follow these steps to set up and run the script:
Download the latest ChromeDriver for Selenium that matches your installed Chrome/Chromium version. Place the binary in your desired location and update the path in the code (see the smoke test after these steps).
Download and unpack the latest Google Chrome or Chromium browser for headless execution of client-side JavaScript.
Install the required Python dependencies using pip:
pip install -r requirements.txt
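Once the driver, browser, and dependencies are in place, a quick smoke test helps confirm that Selenium can actually drive the browser before the full scrape runs; the ChromeDriver path below is a placeholder:

from selenium import webdriver
from selenium.webdriver.chrome.service import Service

options = webdriver.ChromeOptions()
options.add_argument("--headless=new")  # run without a visible browser window
driver = webdriver.Chrome(service=Service("/usr/local/bin/chromedriver"), options=options)
driver.get("https://www.projekt-gutenberg.org/")
print(driver.title)  # prints the page title if the setup works
driver.quit()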
Configurable delay between downloads and conversions
Parallelization of downloads to increase download speed (with a sane limit to ensure we're not overloading epub2go)
Scrape the full author names and book titles in advance, then create a directory structure based on books/author/book_title and place the ePub files in there (a sketch of this and the parallelization idea follows)
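These planned items fit together; a hedged sketch of what they could look like, reusing the assumed epub2go endpoint from above and book dicts with url/author/title fields scraped in advance:

import time
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path

import requests

MAX_WORKERS = 3    # sane cap so epub2go isn't overloaded
DELAY_SECONDS = 2  # configurable pause per worker between conversions
EPUB2GO_ENDPOINT = "https://www.epub2go.eu/convert"  # hypothetical, as above

def fetch_and_store(book: dict) -> None:
    """Download one book and file it as books/<author>/<book_title>.epub."""
    # Author and title come from the advance metadata scrape; real code
    # should sanitize them before using them as path components.
    author_dir = Path("books") / book["author"]
    author_dir.mkdir(parents=True, exist_ok=True)
    response = requests.get(EPUB2GO_ENDPOINT, params={"url": book["url"]}, timeout=60)
    response.raise_for_status()
    (author_dir / f"{book['title']}.epub").write_bytes(response.content)
    time.sleep(DELAY_SECONDS)

def download_all(books: list[dict]) -> None:
    # A bounded pool parallelizes downloads while keeping the request rate polite.
    with ThreadPoolExecutor(max_workers=MAX_WORKERS) as pool:
        pool.map(fetch_and_store, books)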