The uncompressed image of a page is twenty megabytes,
which means that a 300-page book occupies six gigabytes.
A million books, then, would consume 6 petabytes
(the scale goes mega, giga, tera, peta), which is
a challenge with today's technology.
The Internet Archive designed a system specifically for
preservation and access to petabyte-size collections,
called the Petabox. A commercial company, Capricorn
Technologies, now manufactures these machines.
Preservation of this data requires moving it onto new
systems every 3 years and replicating in multiple locations.
The Internet Archive has partnerships in Europe and Egypt
to help ensure the long-term care of these digital artifacts.
[[25]]
p024 _
-chap- _
toc-1 _
p025w _
toc-2 _
+chap+ _
p026