Project Gutenberg: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Peter J. King
(tidied (links, spacing, cats, etc.))
mNo edit summary
 
(16 intermediate revisions by 7 users not shown)
Line 1: Line 1:
{{subpages}}
{{dambigbox|Project Gutenberg|Gutenberg}}
<!-- DO NOT DELETE DURING BIG SPEEDY DELETE -->
<!-- DO NOT DELETE DURING BIG SPEEDY DELETE -->
'''Project Gutenberg''' (often abbreviated '''PG''') is a volunteer effort to digitize, archive, and distribute cultural works. Founded in 1971, it is the oldest [[digital library]]. Most of its items are the full texts of [[public domain]] [[book]]s. The project tries to make the items in its collection as free as possible, in long-lasting, open formats that can be used on almost any computer.
'''Project Gutenberg''' is an online library of over 60,000 free eBooks<ref><span class="newtab">[https://www.gutenberg.org/ Welcome to Project Gutenberg]</span>, last access 5/19/2021</ref>.  It started as a volunteer effort to digitize, archive, and distribute cultural works, including the original text and any available translations. Founded in 1971, it is the oldest free [[digital library]]. Most of its items are the full texts of [[public domain]] [[book]]s. The project tries to make the items in its collection as free as possible, in long-lasting, open formats that can be used on almost any computer.  Scanned books are available in a variety of formats, including plain text, PDF, ePub and Kindle formats.  The library includes some of the world’s great literature, with focus on older works for which U.S. copyright has expired. 
 
Use of the site is entirely free and does not even required a logon account to be created.


==History==
==History==
Line 13: Line 18:
Hart later came to an arrangement with [[Carnegie Mellon University]], which agreed to administer Project Gutenberg's finances. As the volume of e-texts increased, volunteers began to take over the project's day-to-day operations that Hart had run.  In 1999 [[Walnut Creek CDROM]] released a two-CD set of the PG texts as of August 1999, titled ''Project Gutenberg: A Library containing over 1,600 Electronic Texts from the Project Gutenberg at Carnegie Mellon University''.
Hart later came to an arrangement with [[Carnegie Mellon University]], which agreed to administer Project Gutenberg's finances. As the volume of e-texts increased, volunteers began to take over the project's day-to-day operations that Hart had run.  In 1999 [[Walnut Creek CDROM]] released a two-CD set of the PG texts as of August 1999, titled ''Project Gutenberg: A Library containing over 1,600 Electronic Texts from the Project Gutenberg at Carnegie Mellon University''.


In 2000, a [[non-profit organization|non-profit corporation]], the Project Gutenberg Literary Archive Foundation, Inc. was chartered in [[Mississippi]] to handle the project's legal needs. Donations to it are [[tax deduction|tax-deductible]].  Long-time Project Gutenberg volunteer [[Gregory Newby]] became the foundation's first [[chief executive officer|CEO]].
In 2000, a [[non-profit organization|non-profit corporation]], the Project Gutenberg Literary Archive Foundation, Inc. was chartered in [[Mississippi (U.S. state)|Mississippi]] to handle the project's legal needs. Donations to it are [[tax deduction|tax-deductible]].  Long-time Project Gutenberg volunteer [[Gregory Newby]] became the foundation's first [[chief executive officer|CEO]].


Also in 2000, [[Charles Franks]] founded [[Distributed Proofreaders]], which allowed the proofreading of scanned texts to be distributed among many volunteers over the Internet. This effort greatly increased the number and variety of texts being added to Project Gutenberg, as well as making it easier for new volunteers to start contributing.
Also in 2000, [[Charles Franks]] founded [[Distributed Proofreaders]], which allowed the proofreading of scanned texts to be distributed among many volunteers over the Internet. This effort greatly increased the number and variety of texts being added to Project Gutenberg, as well as making it easier for new volunteers to start contributing.
Line 24: Line 29:


==Scope of collection==
==Scope of collection==
In August 2006 Project Gutenberg claimed to have over 19,000 items in its collection, with an average of over fifty new eBooks being added each week. <ref>According to
In August 2006 Project Gutenberg claimed to have over 19,000 items in its collection, with an average of over fifty new eBooks being added each week.  
[http://www.gutenberg.org/dirs/GUTINDEX-2006.txt gutindex-2006], there were 1,653 new Project Gutenberg numbers posted in the first 33 weeks of 2006. This averages out to 50.09 per week. This does not include additions to affiliated projects.</ref>


These are primarily works of [[literature]] from the [[western culture|Western cultural tradition]]. In addition to literature such as novels, poetry, short stories, and drama, Project Gutenberg also has [[cookbook]]s, [[reference work]]s and issues of periodicals. The Project Gutenberg collection also has a few non-text items such as audio files and music notation files.
Those were primarily works of [[literature]] from the [[western culture|Western cultural tradition]]. In addition to literature such as novels, poetry, short stories, and drama, Project Gutenberg also has [[cookbook]]s, [[reference work]]s and issues of periodicals. The Project Gutenberg collection also has a few non-text items such as audio files and music notation files.


Most releases are in English, but there are also significant numbers in many other langauges. In August 2006 the non-English languages most represented were (in order): [[French language|French]], [[German language|German]], [[Finnish language|Finnish]], [[Dutch language|Dutch]], and [[Spanish language|Spanish]].
Most releases are in English, but there are also significant numbers in many other languages. In August 2006 the non-English languages most represented were (in order): [[French language|French]], [[German language|German]], [[Finnish language|Finnish]], [[Dutch language|Dutch]], and [[Spanish language|Spanish]].


Whenever possible, Gutenberg releases are available in [[Binary and text files|plain text]], mainly using [[US-ASCII]] [[character encoding]] but frequently extended to [[ISO-8859-1]]. Other formats may be released as well, when submitted by volunteers, with the most common being [[HTML]]. Formats which are not easily editable, such as [[Portable Document Format|PDF]], are generally not considered to fit in with the goals of Project Gutenberg, although a few have been added to the collection. For years, there has been discussion of using some type of [[XML]], although progress on that has been slow.
Whenever possible, Gutenberg releases are available in [[Binary and text files|plain text]], mainly using [[US-ASCII]] [[character encoding]] but frequently extended to [[ISO-8859-1]]. Other formats may be released as well, when submitted by volunteers, with the most common being [[HTML]]. Formats which are not easily editable, such as [[Portable Document Format|PDF]], are generally not considered to fit in with the goals of Project Gutenberg, although a few have been added to the collection. For years, there has been discussion of using some type of [[XML]], although progress on that has been slow.


==Ideals==
==Ideals==
Michael Hart said in 2004: "The mission of Project Gutenberg is simple: 'To encourage the creation and distribution of eBooks.'" <ref>"The Project Gutenberg Mission Statement", updated [[October 23]] [[2004]] [http://www.gutenberg.org/wiki/Gutenberg:Project_Gutenberg_Mission_Statement_by_Michael_Hart]</ref>
Michael Hart said in 2004: "The mission of Project Gutenberg is simple: 'To encourage the creation and distribution of eBooks.'" <ref><span class="newtab">[https://www.gutenberg.org/about/background/mission_statement.html The Project Gutenberg Mission Statement from 2004]</span>, last access 5/19/2021</ref>


A slogan of the project is: "break down the bars of ignorance and illiteracy", because its volunteers aim to continue spreading public [[literacy]] and appreciation for the literary heritage just as [[public library|public libraries]] began to do in the early twentieth century.
A slogan of the project is: "break down the bars of ignorance and illiteracy", because its volunteers aim to continue spreading public [[literacy]] and appreciation for the literary heritage just as [[public library|public libraries]] began to do in the early twentieth century.
Line 55: Line 59:
==Criticism==  
==Criticism==  
Project Gutenberg has been criticized for lack of scholarly rigor in its e-texts: for example, in inadequate detailing of editions used and in the omission of original published prefaces and critical apparatus. A marked improvement in preserving such text can be seen by comparing earlier texts with newer ones; most new e-texts preserve edition information and prefaces.
Project Gutenberg has been criticized for lack of scholarly rigor in its e-texts: for example, in inadequate detailing of editions used and in the omission of original published prefaces and critical apparatus. A marked improvement in preserving such text can be seen by comparing earlier texts with newer ones; most new e-texts preserve edition information and prefaces.
==Affiliated projects==
All affiliated projects are independent organizations which share the same ideals, and have been given permission to use the ''Project Gutenberg'' trademark. They often have a particular national, or linguistic focus.
*[[Project Gutenberg Australia]] hosts many texts which are public domain according to [[Australian copyright law]], but still under copyright (or of uncertain status) in the United States, with a focus on Australian writers and books about Australia.
*[http://www.gutenberg.nl PG-EU] is a sister project which operates under the copyright law of the [[European Union]].  One of its aims is to include as many languages as possible into Project Gutenberg.  It operates in [[Unicode]] to ensure that all alphabets can be represented easily and correctly.
*[[Project Gutenberg of the Philippines]] [http://www.gutenberg.ph] "aims to make as many books available to as many people as possible, with a special focus on the Philippines and Philippine languages".
*[[Project Gutenberg Europe]] [http://pge.rastko.net] is a project run by [[Project Rastko]] in Serbia-Montenegro. It aims at being a Project Gutenberg for all of Europe, and has started to post its first projects in 2005. It is running the [[Distributed Proofreaders]] software to quickly produce etexts.
*[[Project Gutenberg Luxembourg]] [http://www.gutenberg.lu] publishes mostly, but not exclusively, books that are written in [[Luxembourgish language|Luxembourgish]].
*[[Project Gutenberg Consortia Center]] [http://www.gutenberg.us] is an affiliate specializing in collections of collections.  These do not have the editorial oversight or consistent formatting of the main Project Gutenberg.  Thematic collections, as well as numerous languages, are featured.
*[[Projekti Lönnrot]] [http://www.lonnrot.net] is a project started by Finnish Project Gutenberg volunteers.
Although [[Projekt Gutenberg-DE]] was given permission to use the Gutenberg name years ago, not everyone considers it to be an affiliated project, because of philosophical differences. Projekt Gutenberg-DE copyrights its product and limits access to browsable web-versions of its texts.
For a list of other similar projects, some of which have been inspired by Project Gutenberg, see the [[list of digital library projects]].


==Notes==
==Notes==
<references/>
<references/>[[Category:Suggestion Bot Tag]]
 
== See also ==
* [[Google Book Search]]
* [[Open Content Alliance]]
 
== External links ==
* [http://www.gutenberg.org/ Official website]
* [http://www.pgdp.net/ Distributed Proofreaders] a worldwide group of volunteer editors which are now the main source of ebooks for Project Gutenberg
* [http://gutenberg.hwg.org/ HTML Writers Guild] provides guidance in using XHTML and XML markup for Project Gutenberg
* {{gutenberg author| id=Project+Gutenberg | name=Project Gutenberg}} (note that many of these have been renamed to Project Gutenberg for trademark concerns, and are not original with the Project)
* [http://www.sandroid.org/GutenMark/index.html GutenMark] &mdash; a tool for automatically creating high-quality HTML or LaTeX markup from Project Gutenberg etexts. (not affiliated with Project Gutenberg in any way.)
 
[[Category:Library and Information Science Workgroup]]
[[Category:Media Workgroup]]
[[Category:CZ Live]]

Latest revision as of 16:00, 7 October 2024

This article is a stub and thus not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
 
This editable Main Article is under development and subject to a disclaimer.
This article is about Project Gutenberg. For other uses of the term Gutenberg, please see Gutenberg (disambiguation).

Project Gutenberg is an online library of over 60,000 free eBooks[1]. It started as a volunteer effort to digitize, archive, and distribute cultural works, including the original text and any available translations. Founded in 1971, it is the oldest free digital library. Most of its items are the full texts of public domain books. The project tries to make the items in its collection as free as possible, in long-lasting, open formats that can be used on almost any computer. Scanned books are available in a variety of formats, including plain text, PDF, ePub and Kindle formats. The library includes some of the world’s great literature, with focus on older works for which U.S. copyright has expired.

Use of the site is entirely free and does not even required a logon account to be created.

History

Project Gutenberg was started by Michael Hart in 1971. Hart, a student at the University of Illinois, obtained access to a Xerox Sigma V mainframe computer in the university's Materials Research Lab. Through friendly operators, he received an account with a virtually unlimited amount of computer time; its value has since been variously estimated at $100,000 or $100,000,000. Hart has said he wanted to "give back" this gift by doing something that could be considered to be of great value.

This particular computer happened to be one of the 15 nodes on the computer network that would become the Internet. Hart believed that computers would one day be accessible to the general public and decided to make works of literature available in electronic form for free. He happened to have a copy of the United States Declaration of Independence in his backpack, and this became the first Project Gutenberg e-text.

He named the project for Johannes Gutenberg, the fifteenth-century German printer who propelled the movable-type printing press revolution.

By the mid-1990s, Hart was running Project Gutenberg from Illinois Benedictine College. More volunteers had joined the effort. Most text was entered manually until image scanners and optical character recognition software improved and became more widely available.

Hart later came to an arrangement with Carnegie Mellon University, which agreed to administer Project Gutenberg's finances. As the volume of e-texts increased, volunteers began to take over the project's day-to-day operations that Hart had run. In 1999 Walnut Creek CDROM released a two-CD set of the PG texts as of August 1999, titled Project Gutenberg: A Library containing over 1,600 Electronic Texts from the Project Gutenberg at Carnegie Mellon University.

In 2000, a non-profit corporation, the Project Gutenberg Literary Archive Foundation, Inc. was chartered in Mississippi to handle the project's legal needs. Donations to it are tax-deductible. Long-time Project Gutenberg volunteer Gregory Newby became the foundation's first CEO.

Also in 2000, Charles Franks founded Distributed Proofreaders, which allowed the proofreading of scanned texts to be distributed among many volunteers over the Internet. This effort greatly increased the number and variety of texts being added to Project Gutenberg, as well as making it easier for new volunteers to start contributing.

Pietro Di Miceli, an Italian volunteer, developed and administered the first Project Gutenberg website and started the development of the Project's online catalog. In his ten years in this role (1994–2004), the Project web pages won a number of awards, often being featured in "best of the Web" listings, and contributing to the Project popularity [1].

Starting in 2004, an improved online catalog made Project Gutenberg content easier to browse, access, and link to.

Project Gutenberg is now hosted by ibiblio at the University of North Carolina at Chapel Hill.

Scope of collection

In August 2006 Project Gutenberg claimed to have over 19,000 items in its collection, with an average of over fifty new eBooks being added each week.

Those were primarily works of literature from the Western cultural tradition. In addition to literature such as novels, poetry, short stories, and drama, Project Gutenberg also has cookbooks, reference works and issues of periodicals. The Project Gutenberg collection also has a few non-text items such as audio files and music notation files.

Most releases are in English, but there are also significant numbers in many other languages. In August 2006 the non-English languages most represented were (in order): French, German, Finnish, Dutch, and Spanish.

Whenever possible, Gutenberg releases are available in plain text, mainly using US-ASCII character encoding but frequently extended to ISO-8859-1. Other formats may be released as well, when submitted by volunteers, with the most common being HTML. Formats which are not easily editable, such as PDF, are generally not considered to fit in with the goals of Project Gutenberg, although a few have been added to the collection. For years, there has been discussion of using some type of XML, although progress on that has been slow.

Ideals

Michael Hart said in 2004: "The mission of Project Gutenberg is simple: 'To encourage the creation and distribution of eBooks.'" [2]

A slogan of the project is: "break down the bars of ignorance and illiteracy", because its volunteers aim to continue spreading public literacy and appreciation for the literary heritage just as public libraries began to do in the early twentieth century.

Project Gutenberg is intentionally decentralized. For example, there is no selection policy dictating what texts to add. Instead, individual volunteers work on what they are interested in, or have available.

The Project Gutenberg collection is intended to preserve items for the long term, so they cannot be lost by any one localized accident. In an effort to ensure this, the entire collection is backed-up regularly and mirrored on servers in many different locations.

Copyright issues

Project Gutenberg is careful to verify the status of its ebooks according to U.S. copyright law. Material is added to the Project Gutenberg archive only after it has received a copyright clearance, and records of these clearances are saved for future reference.

Unlike some other digital library projects, Project Gutenberg does not claim new copyright on titles it publishes. Instead, it encourages their free reproduction and distribution.

Most books in the Project Gutenberg collection are distributed as public domain under U.S. copyright law. The legalese included with each eBook puts some restrictions on what can be done with the texts (such as distributing them in modified form, or for commercial purposes) as long as the Project Gutenberg trademark is used. If the header is stripped and the trademark not used, then the public domain texts can be reused without any restrictions.

There are also a few copyrighted texts that Project Gutenberg distributes with permission. These are subject to further restrictions as specified by the copyright holder.

In 1998 the Sonny Bono Copyright Term Extension Act extended the duration of already-existing copyright by twenty years. This has prevented Project Gutenberg from adding many titles that would otherwise have become public domain in the U.S.

Criticism

Project Gutenberg has been criticized for lack of scholarly rigor in its e-texts: for example, in inadequate detailing of editions used and in the omission of original published prefaces and critical apparatus. A marked improvement in preserving such text can be seen by comparing earlier texts with newer ones; most new e-texts preserve edition information and prefaces.

Notes