Most downloaded public domain books

Please note that datasets, machine-learning models, weights, topologies, research papers and other content, including open source software, (collectively referred to as Content) provided and/or suggested by Peltarion for use in the Platform and otherwise, may be subject to separate third party terms of use or license terms. You are solely responsible for complying with the applicable terms. Peltarion makes no representations or warranties about Content. You expressly relieve us from any and all liability, loss or risk arising (directly or indirectly) from Your use of any third party content.

These datasets have been built on downloaded and preprocessed eBooks from Project Gutenberg; a body of over 60,000 public domain eBooks.

Every eBook is released under The Full Project Gutenberg License. For use of the Project Gutenberg website, their Terms of Use apply.

In order to download many books, this Gutenberg mirror was used. It is a package that contains a variety of scripts to make working with the Project Gutenberg body of public domain texts easier.

This dataset is used in the tutorial Writing style tutor.

License for mirror

Copyright 2014 Clemens Wolff

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at:

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Download dataset

Peltarion has gathered 2 ready-made datasets from the books in Project Gutenberg:

Was this page helpful?
Yes No