From gbnewby at pglaf.org  Wed Aug  5 16:55:11 2009
From: gbnewby at pglaf.org (Greg Newby)
Date: Wed, 5 Aug 2009 16:55:11 -0700
Subject: [gutvol-d] eBook readers for Linux desktop
Message-ID: <20090805235511.GA8221@mail.pglaf.org>

Nice article I encountered.  It's PDF online, but I actually
got the dead trees edition :)

Collabora, GutenPY and FBReader are all featured.

 http://www.linux-magazine.com/Issues/2009/104/FRESH-FROM-THE-PRESS

  -- Greg

From hart at pobox.com  Thu Aug  6 08:25:08 2009
From: hart at pobox.com (Michael S. Hart)
Date: Thu, 6 Aug 2009 07:25:08 -0800 (AKDT)
Subject: [gutvol-d] 25,000th English eBook
Message-ID: <alpine.DEB.1.00.0908060721430.31061@snowy.arsc.alaska.edu>


As of today:

24879 in  English


At our usual rate of progress we should reach #25,000
sometime this month.


Have we created a list of possible candidates yet?


Suggestions welcome.


Thanks!!!


Thanks!!!


Michael S. Hart
Founder
Project Gutenberg

From hart at pobox.com  Sun Aug  9 07:19:20 2009
From: hart at pobox.com (Michael S. Hart)
Date: Sun, 9 Aug 2009 06:19:20 -0800 (AKDT)
Subject: [gutvol-d] !@! Gutenberg's 25,000th English eBook
Message-ID: <alpine.DEB.1.00.0908090618590.11685@snowy.arsc.alaska.edu>


As of today:

24902   English


We are asking for suggestions for #25,000!!!


We probably have just over 10 days. . . .


Thanks!!!


Michael

From marcello at perathoner.de  Sun Aug  9 12:01:30 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Sun, 09 Aug 2009 21:01:30 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <891262632.99249.1248790575502.JavaMail.mail@webmail05>
References: <891262632.99249.1248790575502.JavaMail.mail@webmail05>
Message-ID: <4A7F1D0A.8060500@perathoner.de>

Joshua Hutchinson wrote:

> Any chance of creating on the fly zips of some of the books?  For instance, the 
> audio books are huge and usually divided along chapter lines.  Single file zips 
> are very useful (and something we've done on some of them manually) but the 
> space waste is huge.  On the fly zipping of those files would save huge in 
> storage space.

The other way round would be much more useful: store only a zipped 
version of everything.

All browsers support unzipping on-the-fly (all bibrec pages we serve are 
in a compressed format since years now) so the only people who'd notice 
anything was the ftp users. (But he who has mastered to type "ftp" can 
also be made to type "unzip", so no problem there.)

From marcello at perathoner.de  Sun Aug  9 12:04:25 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Sun, 09 Aug 2009 21:04:25 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <a82cdbb90907280917h26fda315he470a9520b07d65c@mail.gmail.com>
References: <891262632.99249.1248790575502.JavaMail.mail@webmail05>	<20090728143306.GA28986@mail.pglaf.org>
	<a82cdbb90907280917h26fda315he470a9520b07d65c@mail.gmail.com>
Message-ID: <4A7F1DB9.9070404@perathoner.de>

David A. Desrosiers wrote:

> If you're looking at it at that level, why not just offer some
> streaming audio of the books as well? 

What's the advantage of streaming audio vs. getting the whole file at once?


From marcello at perathoner.de  Sun Aug  9 12:16:54 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Sun, 09 Aug 2009 21:16:54 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <212322090907291511r2d2a2ebfw198542bca2c2eb76@mail.gmail.com>
References: <891262632.99249.1248790575502.JavaMail.mail@webmail05>
	<20090728143306.GA28986@mail.pglaf.org>	<a82cdbb90907280917h26fda315he470a9520b07d65c@mail.gmail.com>
	<20090729172510.GC8946@mail.pglaf.org>
	<212322090907291511r2d2a2ebfw198542bca2c2eb76@mail.gmail.com>
Message-ID: <4A7F20A6.2020003@perathoner.de>

Paulo Levi wrote:
> Tell me, please, if the gutenberg rtf index file, besides being
> autogenerated, is also sorted.
> 
> What i mean is, i'm indexing parts of the file, and i gain a major
> speed up of treating the file as:
> 
> a massive list of pgterms:etext definitions
> 
> followed by a (more massive) list of pgterms:file definitions
> 
> This allows the string comparisons i have to do to be lower (about
> n/2), but at the cost that any etext record in between the second list
> is not picked up.
> 
> That won't be changed, but is only for my peace of mind.
> Also, in the pgterms:file records, are the records referring to the
> same file consecutive ? I ask because if so, i could do the same sort
> of filtering Aaron Cannon is doing in its dvd project, to speed up the
> index some more and remove duplicates.
> (If they aren't consecutive i would have to  issue queries  between
> building the index to see if they were already inserted).
> 
> I have nothing against xpath, indeed i think the scanning of the file
> in lucene already uses something similar. But i need free text
> searches, and they have to be fast (i'm already experimenting with a
> memory cache after the query too, and it works okish for my
> application)


All future changes to the file will be backward-compatible in an XML way.

Meaning: the same XPath queries will yield (supersets of) the same 
result sets.

I do NOT guarantee that the sequence of entities will be sorted the same 
way. Your sorting needs will probably be much different from the next 
guy's, so the sorting is up to you.


From marcello at perathoner.de  Sun Aug  9 12:35:59 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Sun, 09 Aug 2009 21:35:59 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <79902BC0-1EED-4021-B529-60A6CE413F38@ark.in-berlin.de>
References: <212322090907151725x194f5361j2741f331dbf775f1@mail.gmail.com>	<212322090907151728x84b224by697776eec6e265a4@mail.gmail.com>	<a82cdbb90907261925l1ba27052m1813b59b14f2e785@mail.gmail.com>	<4A6D5639.5080600@perathoner.de>	<a82cdbb90907270734k24ecd8b5y6a75e3b3e8297eb6@mail.gmail.com>	<002494FB-121D-4CC8-80B0-AE3EFA238ADB@ark.in-berlin.de>	<a82cdbb90907271142p34e72bbdv86211a862580560f@mail.gmail.com>
	<79902BC0-1EED-4021-B529-60A6CE413F38@ark.in-berlin.de>
Message-ID: <4A7F251F.7000605@perathoner.de>

Ralf Stephan wrote:

> I confirm that neither the Plucker nor the Mobile formats
> are mentioned in the catalog file. Do you have an
> explanation, Marcello?

Because the plucker files are not in the archive. They are generated 
on-the-fly (sort of) and then cached for some time. We don't know which 
files are in the cache at any actual moment.

This design was changed for the epub and mobi files. They are generated 
manually and stay in the cache (until manually rebuilt). They could be 
added to the database as well and to the RDF file too. Just a small 
matter of programming ...

Just now I'm concentrating on making better epubs. By the time I'll be 
satisfied, I'll tackle the databse code.


From marcello at perathoner.de  Sun Aug  9 13:00:23 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Sun, 09 Aug 2009 22:00:23 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <20090728143306.GA28986@mail.pglaf.org>
References: <891262632.99249.1248790575502.JavaMail.mail@webmail05>
	<20090728143306.GA28986@mail.pglaf.org>
Message-ID: <4A7F2AD7.5010801@perathoner.de>

Greg Newby wrote:

> A more general approach would be to let visitors to www.gutenberg.org
> put their selected files (including those generated on-the-fly)
> on a bookshelf (i.e., shopping cart), then download in one big file,
> or several small ones.  

Or we could tell them how to use the download manager in their browsers ...

Seriously, whenever I download a couple of books, I just click away and 
let the downloads complete in the background.


I concur that there has something to be done for the audio files. Even 
just clicking on 10+ files can be bothersome.

But I always favor the lowest-tech solution.

So why not add a M3U playlist to the directory? M3U just a text file 
with a list of urls and is supported by Winamp and many other players.

   http://gonze.com/playlists/playlist-format-survey.html


From marcello at perathoner.de  Sun Aug  9 13:17:57 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Sun, 09 Aug 2009 22:17:57 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <a82cdbb90907271142p34e72bbdv86211a862580560f@mail.gmail.com>
References: <212322090907151725x194f5361j2741f331dbf775f1@mail.gmail.com>	<212322090907151728x84b224by697776eec6e265a4@mail.gmail.com>	<a82cdbb90907261925l1ba27052m1813b59b14f2e785@mail.gmail.com>	<4A6D5639.5080600@perathoner.de>	<a82cdbb90907270734k24ecd8b5y6a75e3b3e8297eb6@mail.gmail.com>	<002494FB-121D-4CC8-80B0-AE3EFA238ADB@ark.in-berlin.de>
	<a82cdbb90907271142p34e72bbdv86211a862580560f@mail.gmail.com>
Message-ID: <4A7F2EF5.1050101@perathoner.de>

David A. Desrosiers wrote:
> On Mon, Jul 27, 2009 at 1:45 PM, Ralf Stephan<ralf at ark.in-berlin.de> wrote:
>> My, can't we admit that XPath is a bit over our head,
>> so we prefer confronting the admin we're supposed
>> to be cooperating with? Wrt resources, my guess it's
>> about par traffic-wise (1-5k per book vs. megabytes
>> of RDF) but much better CPU-wise. That is, if you don't
>> want the RDF for other fine things like metadata etc.
> 
> I think you've missed my point.
> 
> The RDF flat-out cannot tell me which of the target _formats_ are
> available for immediate download to the users. I'm not looking for
> which _titles_ are available in the catalog, I'm looking for which
> _formats_ are available. Also note that I'm already parsing the feeds
> to see what the top 'n' titles are already, so parsing XML via
> whatever methods I need is not the blocker here.
> 
> Let me give you an example of two titles available in the catalog:
> 
> Verg?nglichkeit by Sigmund Freud
> http://www.gutenberg.org/cache/plucker/29514/29514
> 
> The Lost Word by Henry Van Dyke
> http://www.gutenberg.org/cache/plucker/4384/4384
> 
> Both of these _titles_ are available in the Gutenberg catalog, but the
> second one is not available in the Plucker _format_ for immediate
> download. Big difference from parsing title availability from the
> catalog.rdf file.

So you are doing a HEAD on the cache location? I hope you don't have 
many of these in the field, because you're going to look very sorry 
whenever the location of the cache changes. (It will! I give you fair 
notice for free :-) )


> Make sense now?

No.

Why is that "immediate download" bit so important for you? You will get 
a completely random set of files. (A cached plucker file expires 7 days 
after *generation* not after the last *access*. So all you get is the 
set of files generated in the last 7 days.)

And a wrong set too. The first file could have been deleted on the 
server long before you finished your barrage of HEAD request.


From joshua at hutchinson.net  Mon Aug 10 05:47:10 2009
From: joshua at hutchinson.net (Joshua Hutchinson)
Date: Mon, 10 Aug 2009 12:47:10 +0000 (GMT)
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
Message-ID: <85614071.121560.1249908430271.JavaMail.mail@webmail09>

An HTML attachment was scrubbed...
URL: <http://lists.pglaf.org/mailman/private/gutvol-d/attachments/20090810/ed1f56c5/attachment.html>

From joshua at hutchinson.net  Mon Aug 10 05:52:15 2009
From: joshua at hutchinson.net (Joshua Hutchinson)
Date: Mon, 10 Aug 2009 12:52:15 +0000 (GMT)
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
Message-ID: <2011678691.121630.1249908735714.JavaMail.mail@webmail09>

An HTML attachment was scrubbed...
URL: <http://lists.pglaf.org/mailman/private/gutvol-d/attachments/20090810/7d6d226d/attachment.html>

From marcello at perathoner.de  Mon Aug 10 06:03:20 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Mon, 10 Aug 2009 15:03:20 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <85614071.121560.1249908430271.JavaMail.mail@webmail09>
References: <85614071.121560.1249908430271.JavaMail.mail@webmail09>
Message-ID: <4A801A98.8020603@perathoner.de>

Joshua Hutchinson wrote:

> Well, the one problem I can see if that it becomes very hard to link to a 
> specific file within the zip file. 

Internet Archive already has a solution. They can extract a jpg from a 
zip on the fly. Look at this url:

 
http://ia331340.us.archive.org/zipview.php?zip=/3/items/goodytwoshoes00newyiala/goodytwoshoes00newyiala_flippy.zip&file=0002.jpg


"Borrowing" that code should be easy.


> For instance, with the audio files, we have index files that let you link 
> directly to a particular section/chapter.  If that file is within a zip file, 
> that becomes a problem.

No problem. Keep the index outside the zip file then.


From marcello at perathoner.de  Mon Aug 10 13:05:02 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Mon, 10 Aug 2009 22:05:02 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <85614071.121560.1249908430271.JavaMail.mail@webmail09>
References: <85614071.121560.1249908430271.JavaMail.mail@webmail09>
Message-ID: <4A807D6E.2040409@perathoner.de>

Joshua Hutchinson wrote:


> Well, the one problem I can see if that it becomes very hard to link to a 
> specific file within the zip file. 

Actually it takes just a hundred lines of code:
 
http://www.gutenberg.org/zipcat/files/20686/20686-ogg.zip?ogg/20686-01.ogg


From joshua at hutchinson.net  Mon Aug 10 13:22:14 2009
From: joshua at hutchinson.net (Joshua Hutchinson)
Date: Mon, 10 Aug 2009 20:22:14 +0000 (GMT)
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
Message-ID: <2088833402.128411.1249935734280.JavaMail.mail@webmail05>

An HTML attachment was scrubbed...
URL: <http://lists.pglaf.org/mailman/private/gutvol-d/attachments/20090810/1683b306/attachment.html>

From marcello at perathoner.de  Mon Aug 10 14:47:25 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Mon, 10 Aug 2009 23:47:25 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <2088833402.128411.1249935734280.JavaMail.mail@webmail05>
References: <2088833402.128411.1249935734280.JavaMail.mail@webmail05>
Message-ID: <4A80956D.7010700@perathoner.de>

Joshua Hutchinson wrote:

> Hey, very nice!  Can I assume this functionality is now live?  If so, I can save 
> a bunch of space in the audio files.  :)

Something of this kind will be live soon.

I'd like it to use exactly the same url as the url to the file, so the 
users won't even notice we changed something.

Audio books will need just the zips for the different audio codecs, one 
readme, one index file and maybe one m3u playlist for every codec.

I noticed the file structure for audio book zips doesn't follow exactly 
the structure for html zips. The directories ogg/ mp3/ spx/ should 
actually be named 12345-ogg/ 12345-mp3/ 12345-spx/. That would be more 
in line with the 12345-h/ convention we use for html. Also the top 
directory inside the zipped files should be 12345-ogg/ etc.


From user5013 at aol.com  Mon Aug 10 15:19:33 2009
From: user5013 at aol.com (Christa & Jay Toser)
Date: Mon, 10 Aug 2009 17:19:33 -0500
Subject: [gutvol-d] Book #25,000
Message-ID: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com>

I may be ignorant on this, but

I cannot find a complete English copy of the Koran (or however they  
spell it) anywhere on Project Gutenberg.

So also, HOW do they spell it in English?

Jay Toser

From hart at pobox.com  Mon Aug 10 15:34:54 2009
From: hart at pobox.com (Michael S. Hart)
Date: Mon, 10 Aug 2009 14:34:54 -0800 (AKDT)
Subject: [gutvol-d] Re: Book #25,000
In-Reply-To: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com>
References: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com>
Message-ID: <alpine.DEB.1.00.0908101427430.3906@snowy.arsc.alaska.edu>


On Mon, 10 Aug 2009, Christa & Jay Toser wrote:

> I may be ignorant on this, but
>
> I cannot find a complete English copy of the Koran (or however they spell it)
> anywhere on Project Gutenberg.
>
> So also, HOW do they spell it in English?
>
> Jay Toser


As with anything translated from another language,
particularly one with another alphabet, spelling a
word or phrase in English may be inconsistent:

Koran
Kuran
Quran
Q'ran
Qu'ran
Q'uran
Qur'a-n
Alcoran
Al-Qur'a-n

with various accents, all seem to pop up, but this
list is neither complete nor in best order; sorry,
but I am sure I could come up with better if I had
enough time for some serious research.

As for doing this one as #25,000, I'm afraid there
are too many copyright questions already on what a
public domain version might be. . .and. . .in fact
a number of people have told me what I interpret a
non-legal opinion as:

There is no public domain version of The Koran for
Project Gutenberg to work with in English.

So far we have not had any closure on this.

Hence my apologies,


Michael


From ajhaines at shaw.ca  Mon Aug 10 15:46:58 2009
From: ajhaines at shaw.ca (Al Haines (shaw))
Date: Mon, 10 Aug 2009 15:46:58 -0700
Subject: [gutvol-d] Re: Book #25,000
References: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com>
Message-ID: <FE8C3BB3574C4A1E9779C75ED086F1E2@alp2400>

I got seven matches for "Koran" on a "Title Word" search, none for the 
alternates I could think of--Kuran, Qoran, Quran.  I leave it to others to 
speak to their completeness.

FYI--there's a raft of "Koran" matches in Internet Archive 
(http://www.archive.org/search.php?query=title%3A%28koran%29


----- Original Message ----- 
From: "Christa & Jay Toser" <user5013 at aol.com>
To: <gutvol-d at lists.pglaf.org>
Sent: Monday, August 10, 2009 3:19 PM
Subject: [gutvol-d] Book #25,000


>I may be ignorant on this, but
>
> I cannot find a complete English copy of the Koran (or however they  spell 
> it) anywhere on Project Gutenberg.
>
> So also, HOW do they spell it in English?
>
> Jay Toser
> _______________________________________________
> gutvol-d mailing list
> gutvol-d at lists.pglaf.org
> http://lists.pglaf.org/mailman/listinfo/gutvol-d
> 


From jtinsley at pobox.com  Mon Aug 10 16:03:35 2009
From: jtinsley at pobox.com (Jim Tinsley)
Date: Mon, 10 Aug 2009 19:03:35 -0400
Subject: [gutvol-d] Re: Book #25,000
In-Reply-To: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com>
References: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com>
Message-ID: <20090810230334.GA12208@panix.com>

On Mon, 10 Aug 2009 17:19:33 -0500, Christa & Jay Toser <user5013 at aol.com> wrote:

>I may be ignorant on this, but
>
>I cannot find a complete English copy of the Koran (or however they  
>spell it) anywhere on Project Gutenberg.
>
>So also, HOW do they spell it in English?

I look in for the first time in forever, and see a question I can 
answer - at length, if nobody stops me. :-) I've had some emailed 
discussion with people who were quite vehement on the subject.

My formatting is out of date, but the catalog does still seem to find the books:

Three Translations of The Koran (Al-Qur'an) side by side                 16955
  16955.txt  16955.zip

Feb 2005 The Koran, Translated by George Sale              [koranxxb.xxx] 7440
    koran09b.txt    koran09b.zip   koran09br.zip
  (Note: previously incorrectly listed as The Koran, by Mohammed/Mohammad)

Sep 2002 The Koran, Translated by J.M Rodwell              [koranxxa.xxx] 3434
    koran10a.txt    koran10a.zip    koran11a.txt    koran11a.zip    koran12a.txt
    koran12a.zip
  (Note: previously incorrectly listed as The Koran, by Mohammed/Mohammad)

Sep 2001 The Koran, Translated by J.M. Rodwell             [koranxxx.xxx] 2800
     koran10.txt     koran10.zip
  (Note:  previously incorrectly listed as The Koran, by Mohammed/Mohammad)

jim


From gbnewby at pglaf.org  Mon Aug 10 22:25:17 2009
From: gbnewby at pglaf.org (Greg Newby)
Date: Mon, 10 Aug 2009 22:25:17 -0700
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <4A7F2AD7.5010801@perathoner.de>
References: <891262632.99249.1248790575502.JavaMail.mail@webmail05>
	<20090728143306.GA28986@mail.pglaf.org>
	<4A7F2AD7.5010801@perathoner.de>
Message-ID: <20090811052517.GB2868@mail.pglaf.org>

On Sun, Aug 09, 2009 at 10:00:23PM +0200, Marcello Perathoner wrote:
> Greg Newby wrote:
>
>> A more general approach would be to let visitors to www.gutenberg.org
>> put their selected files (including those generated on-the-fly)
>> on a bookshelf (i.e., shopping cart), then download in one big file,
>> or several small ones.  
>
> Or we could tell them how to use the download manager in their browsers ...
>
> Seriously, whenever I download a couple of books, I just click away and  
> let the downloads complete in the background.

A bookshelf (or shopping cart, or call it something else) isn't just
about managing downloads started immediately upon finding something
of interest. 

It's about keeping track of books to download in the future,
books already downloaded, and books to start downloading later
(say, overnight or when someone goes from home to a location
with higher bandwidth).

Sure, people can use browser bookmarks for some of this, but:
 - bookmarks need to then be managed, for after the book is downloaded
 - bookmarking the bibrec page is easy, but right-click "bookmark
   target as..." to actually select a file format is harder for our
   many naive users
 - A larger # of bookmarks for future reading won't have very 
   useful functionality; a shopping cart model at www.gutenberg.org
   could have some more useful features

Probably another site will implement this for the Gutenberg
content, eventually.  Some of the resellers already have this type
of thing.
  -- Greg

PS: I really like adding an m3u playlist file to audio content


> I concur that there has something to be done for the audio files. Even  
> just clicking on 10+ files can be bothersome.
>
> But I always favor the lowest-tech solution.
>
> So why not add a M3U playlist to the directory? M3U just a text file  
> with a list of urls and is supported by Winamp and many other players.
>
>   http://gonze.com/playlists/playlist-format-survey.html

From gbnewby at pglaf.org  Mon Aug 10 22:30:45 2009
From: gbnewby at pglaf.org (Greg Newby)
Date: Mon, 10 Aug 2009 22:30:45 -0700
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <4A7F1DB9.9070404@perathoner.de>
References: <891262632.99249.1248790575502.JavaMail.mail@webmail05>
	<20090728143306.GA28986@mail.pglaf.org>
	<a82cdbb90907280917h26fda315he470a9520b07d65c@mail.gmail.com>
	<4A7F1DB9.9070404@perathoner.de>
Message-ID: <20090811053045.GD2868@mail.pglaf.org>

On Sun, Aug 09, 2009 at 09:04:25PM +0200, Marcello Perathoner wrote:
> David A. Desrosiers wrote:
>
>> If you're looking at it at that level, why not just offer some
>> streaming audio of the books as well? 
>
> What's the advantage of streaming audio vs. getting the whole file at once?

Listening while downloading.  This seems to happen for some people,
but not others.

I don't know whether differences are in content-type, the server,
or somewhere else, but a frequent email Q sent to help@ is, "I clicked
your MP3, and I got a message that it downloaded.  Where is it, and how
to I listen?"
  In that case, the browser downloaded the file without opening it
in a player.

I get the opposite question, too: "I clicked your MP3 and it played,
but now I want to save it to my computer to transfer to my MP3 player."

I don't know enough about the mix of protocols, server settings, browsers,
players, etc. to know whether there is a single solution.  But from our 
reader/listener's point of view, I think two links per audiobook file
(MP3, OGG, etc.) would be desirable:

   a. Listen now (streaming)
   b. Download for later

  -- Greg

From marcello at perathoner.de  Tue Aug 11 00:49:17 2009
From: marcello at perathoner.de (Marcello Perathoner)
Date: Tue, 11 Aug 2009 09:49:17 +0200
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
In-Reply-To: <20090811053045.GD2868@mail.pglaf.org>
References: <891262632.99249.1248790575502.JavaMail.mail@webmail05>	<20090728143306.GA28986@mail.pglaf.org>	<a82cdbb90907280917h26fda315he470a9520b07d65c@mail.gmail.com>	<4A7F1DB9.9070404@perathoner.de>
	<20090811053045.GD2868@mail.pglaf.org>
Message-ID: <4A81227D.10307@perathoner.de>

Greg Newby wrote:

> On Sun, Aug 09, 2009 at 09:04:25PM +0200, Marcello Perathoner wrote:
>> David A. Desrosiers wrote:
>>
>>> If you're looking at it at that level, why not just offer some
>>> streaming audio of the books as well? 
>> What's the advantage of streaming audio vs. getting the whole file at once?
> 
> Listening while downloading.  This seems to happen for some people,
> but not others.

Many players can play while downloading from a file.

Firefox on Linux starts "reading" within 3 seconds of clicking on a 30 
Megs mp3.

Streaming has its only advantage if you don't have enough disc to store 
the file on the device or if you use multicast protocols, both of which 
we can rule out for PG.


> I don't know whether differences are in content-type, the server,
> or somewhere else, but a frequent email Q sent to help@ is, "I clicked
> your MP3, and I got a message that it downloaded.  Where is it, and how
> to I listen?"
>   In that case, the browser downloaded the file without opening it
> in a player.
> 
> I get the opposite question, too: "I clicked your MP3 and it played,
> but now I want to save it to my computer to transfer to my MP3 player."

This entirely depends on the settings of the browser. The server only 
sends a header: "content-type: audio/ogg" and the browser decides what 
to do with it.

Somebody ought to write an illustrated how-to for users, explaining how 
to config their browser for immediate playback on left-click and how to 
use right-click/save-as for saving to disc.


> I don't know enough about the mix of protocols, server settings, browsers,
> players, etc. to know whether there is a single solution.  But from our 
> reader/listener's point of view, I think two links per audiobook file
> (MP3, OGG, etc.) would be desirable:
> 
>    a. Listen now (streaming)
>    b. Download for later

This *may* work using the non-standard "content-disposition" header, 
which different browsers implement in slightly different ways ... so 
then we'll have a swimming-pool of snakes.


From hart at pobox.com  Tue Aug 11 06:28:09 2009
From: hart at pobox.com (Michael S. Hart)
Date: Tue, 11 Aug 2009 05:28:09 -0800 (AKDT)
Subject: [gutvol-d] 81 Left to English eBook #25,000
Message-ID: <alpine.DEB.1.00.0908110527260.22176@snowy.arsc.alaska.edu>


We still don't have a winner!


From joshua at hutchinson.net  Tue Aug 11 07:52:52 2009
From: joshua at hutchinson.net (Joshua Hutchinson)
Date: Tue, 11 Aug 2009 14:52:52 +0000 (GMT)
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
Message-ID: <1275438696.7865.1250002372944.JavaMail.mail@webmail03>

An HTML attachment was scrubbed...
URL: <http://lists.pglaf.org/mailman/private/gutvol-d/attachments/20090811/29622cda/attachment.html>

From hart at pobox.com  Tue Aug 11 14:03:21 2009
From: hart at pobox.com (Michael S. Hart)
Date: Tue, 11 Aug 2009 13:03:21 -0800 (AKDT)
Subject: [gutvol-d] Re: A Gutenberg text.A few poss. typos (fwd)
Message-ID: <alpine.DEB.1.00.0908111302280.30663@snowy.arsc.alaska.edu>


My email isn't getting through to bugs or errors at pglaf.org

So I'm forwarding this here.


Michael


---------- Forwarded message ----------
Date: Tue, 11 Aug 2009 13:17:09 -0400
From: Whitney Landon <swlandon at prexar.com>
To: Michael S. Hart <hart at pglaf.org>
Subject: Re: A Gutenberg text.A few poss. typos

Happenstantial more-than-a-few-weeks-later find of this -- there's been no
contact from  bugs at pglaf.org . Thanks again.
WL
----- Original Message ----- From: "Michael S. Hart" <hart at pglaf.org>
To: "Whitney Landon" <swlandon at prexar.com>
Cc: <hart at pobox.com>
Sent: Thursday, March 12, 2009 6:06 AM
Subject: Re: A Gutenberg text.A few poss. typos


>
> I forwarded to bugs at pglaf.org for you.
>
> Let me know if you don't hear from them in a few weeks,
> which is entirely possible, and I may take of it for you.
>
> Many thanks!!!
>
>
> Michael
>
>
>
> On Thu, 12 Mar 2009, Whitney Landon wrote:
>
> > A few probable/possible typos [in brackets, below] and a possible
> > omission from:Title: The Great God Success
> >
> > Author: John Graham (David Graham Phillips)
> >
> > Release Date: April, 2005 [EBook #7989]
> > [Yes, we are more than one year ahead of schedule]
> > [This file was first posted on June 10, 2003]
> >
> > Edition: 10
> >
> > 8gtgd10.txtOriginally published in 1901, per Wikipedia. No text date in
> > Gutenberg download."Precisely. I think I will put you next [to]
> > him."Washington Square, South, about--three years [that double-dash
> > punctuation is weird; looking at it now it could well be the way it was
> > published, having seen it a lot, but it's still weird]I do not want your
> > friends[.] I do not want to be
> > respectable. Howard held her away from him and studied her face, surprise
> > in his eyes.
> > "Isn't it strange?" he said.[paragraph break? Howard continues ...]
> > "Here I've been seeing youopen window and she W[w]as propped with
> > pillowsHoward took his rooms in the Royalton. ... his club where he was
> > living for a few days [two don't mesh; hard to imagine how it's a typo,
> > tho]_do_ [many such constructions: translate to italic if possible; can
> > "search" for the _ (underline) to find them; maybe impossible with
> > font/face]his power of concentration[.] Hea S[s]ense of relief and of
> > calmness [probably]Provost and her niece, Miss Marion[typo? jab at
> > newspapers' proclivities for typos? "Marian" with two "a"s, everywhere
> > else]At a[ ]bound it had becomeHe drew a[ ]long breath
> > and-------------------Thanks for the book. I learned about the author for
> > the first time in the Times to/yesterday
> > http://www.nytimes.com/2009/03/11/us/politics/11senate.html?_r=1followed
> > that with some Wikipedia, then came (only 2nd or 3rd time, but with
> > confidence) to PG .... Finished in a sitting. Quite an experience. Thanks
> > again.Whitney LandonBrooksville, Maine
> >


--------------------------------------------------------------------------------


No virus found in this incoming message.
Checked by AVG - www.avg.com
Version: 8.0.237 / Virus Database: 270.11.9/1991 - Release Date: 03/09/09
07:14:00

From ajhaines at shaw.ca  Tue Aug 11 14:49:10 2009
From: ajhaines at shaw.ca (Al Haines (shaw))
Date: Tue, 11 Aug 2009 14:49:10 -0700
Subject: [gutvol-d] Re: A Gutenberg text.A few poss. typos (fwd)
References: <alpine.DEB.1.00.0908111302280.30663@snowy.arsc.alaska.edu>
Message-ID: <3054A343AAF943E284F361387ACC295E@alp2400>

I've forwarded it to Errata, which is currently auto-forwarded to the WWers.


----- Original Message ----- 
From: "Michael S. Hart" <hart at pobox.com>
To: "The gutvol-d Mailing List" <gutvol-d at lists.pglaf.org>
Sent: Tuesday, August 11, 2009 2:03 PM
Subject: [gutvol-d] Re: A Gutenberg text.A few poss. typos (fwd)


>
>
> My email isn't getting through to bugs or errors at pglaf.org
>
> So I'm forwarding this here.
>
>
> Michael
>
>
> ---------- Forwarded message ----------
> Date: Tue, 11 Aug 2009 13:17:09 -0400
> From: Whitney Landon <swlandon at prexar.com>
> To: Michael S. Hart <hart at pglaf.org>
> Subject: Re: A Gutenberg text.A few poss. typos
>
> Happenstantial more-than-a-few-weeks-later find of this -- there's been no
> contact from  bugs at pglaf.org . Thanks again.
> WL
> ----- Original Message ----- From: "Michael S. Hart" <hart at pglaf.org>
> To: "Whitney Landon" <swlandon at prexar.com>
> Cc: <hart at pobox.com>
> Sent: Thursday, March 12, 2009 6:06 AM
> Subject: Re: A Gutenberg text.A few poss. typos
>
>
>>
>> I forwarded to bugs at pglaf.org for you.
>>
>> Let me know if you don't hear from them in a few weeks,
>> which is entirely possible, and I may take of it for you.
>>
>> Many thanks!!!
>>
>>
>> Michael
>>
>>
>>
>> On Thu, 12 Mar 2009, Whitney Landon wrote:
>>
>> > A few probable/possible typos [in brackets, below] and a possible
>> > omission from:Title: The Great God Success
>> >
>> > Author: John Graham (David Graham Phillips)
>> >
>> > Release Date: April, 2005 [EBook #7989]
>> > [Yes, we are more than one year ahead of schedule]
>> > [This file was first posted on June 10, 2003]
>> >
>> > Edition: 10
>> >
>> > 8gtgd10.txtOriginally published in 1901, per Wikipedia. No text date in
>> > Gutenberg download."Precisely. I think I will put you next [to]
>> > him."Washington Square, South, about--three years [that double-dash
>> > punctuation is weird; looking at it now it could well be the way it was
>> > published, having seen it a lot, but it's still weird]I do not want 
>> > your
>> > friends[.] I do not want to be
>> > respectable. Howard held her away from him and studied her face, 
>> > surprise
>> > in his eyes.
>> > "Isn't it strange?" he said.[paragraph break? Howard continues ...]
>> > "Here I've been seeing youopen window and she W[w]as propped with
>> > pillowsHoward took his rooms in the Royalton. ... his club where he was
>> > living for a few days [two don't mesh; hard to imagine how it's a typo,
>> > tho]_do_ [many such constructions: translate to italic if possible; can
>> > "search" for the _ (underline) to find them; maybe impossible with
>> > font/face]his power of concentration[.] Hea S[s]ense of relief and of
>> > calmness [probably]Provost and her niece, Miss Marion[typo? jab at
>> > newspapers' proclivities for typos? "Marian" with two "a"s, everywhere
>> > else]At a[ ]bound it had becomeHe drew a[ ]long breath
>> > and-------------------Thanks for the book. I learned about the author 
>> > for
>> > the first time in the Times to/yesterday
>> > http://www.nytimes.com/2009/03/11/us/politics/11senate.html?_r=1followed
>> > that with some Wikipedia, then came (only 2nd or 3rd time, but with
>> > confidence) to PG .... Finished in a sitting. Quite an experience. 
>> > Thanks
>> > again.Whitney LandonBrooksville, Maine
>> >
>
>
> --------------------------------------------------------------------------------
>
>
>
> No virus found in this incoming message.
> Checked by AVG - www.avg.com
> Version: 8.0.237 / Virus Database: 270.11.9/1991 - Release Date: 03/09/09
> 07:14:00
> _______________________________________________
> gutvol-d mailing list
> gutvol-d at lists.pglaf.org
> http://lists.pglaf.org/mailman/listinfo/gutvol-d
> 


From joshua at hutchinson.net  Wed Aug 12 13:18:58 2009
From: joshua at hutchinson.net (Joshua Hutchinson)
Date: Wed, 12 Aug 2009 20:18:58 +0000 (GMT)
Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg
Message-ID: <1210883944.40773.1250108338335.JavaMail.mail@webmail03>

An HTML attachment was scrubbed...
URL: <http://lists.pglaf.org/mailman/private/gutvol-d/attachments/20090812/031fd42f/attachment.html>

From pterandon at gmail.com  Sun Aug 16 19:09:43 2009
From: pterandon at gmail.com (Greg M. Johnson)
Date: Sun, 16 Aug 2009 22:09:43 -0400
Subject: [gutvol-d] Re: Suggestions for the DVD must haves?
Message-ID: <a0bf3e960908161909k188d2912oba918f2d6e893523@mail.gmail.com>

Here's an updated idea of how to do it:


1) Put in the complete, where available:
i) Melvyn Bragg's Books that Changed the World
ii) Norwegian Book Clubs in Oslo (2002)
http://www.gutenberg.org/wiki/Best_Books_Ever_Listings_%28Bookshelf%29

2) Put in the Harvard Classics
http://www.gutenberg.org/wiki/Harvard_Classics_%28Bookshelf%29

3) Put in the BestSellers 1900-1917:
http://www.gutenberg.org/wiki/Bestsellers%2C_American%2C_1900-1922_%28Bookshelf%29


4) The Most Popular 100 books from the last 30 days:
http://www.gutenberg.org/browse/scores/top#books-last30

5) Then, use a program to alternate between these three below until you run
out of room:

i) One randomly chosen item off of a randomly chosen Bookshelves
http://www.gutenberg.org/wiki/Category:Bookshelf

ii) The complete works of one author under your Most popular Authors last 30
days:
http://www.gutenberg.org/browse/scores/top#authors-last30


Just an idea. But that might be a cool, comprehensive way to make a
collection.  Candy + cod liver oil.


-- 
Greg M. Johnson
http://pterandon.blogspot.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.pglaf.org/mailman/private/gutvol-d/attachments/20090816/5e3870e9/attachment.html>

From hart at pobox.com  Mon Aug 24 19:26:07 2009
From: hart at pobox.com (Michael S. Hart)
Date: Mon, 24 Aug 2009 19:26:07 -0700 (PDT)
Subject: [gutvol-d] Rosetta Stone as eBook
Message-ID: <alpine.DEB.2.00.0908241923170.2447@mail.pglaf.org>


We are exploring doing something on the order of public domain
translations of the various Ancient Greek, Coptic, Hieroglyphs
et al from The Rosetta Stone.

Please advise if you would be interested in helping.


We put this in Preprints:

  http://preprints.readingroo.ms/rosetta/

Including a README.TXT

Original source:

  http://books.google.com/books?id=Q5Ow9Ub_BIgC&oe=UTF-8

it's also at archive.org, with raw OCR


Many thanks!!!


  -- Greg and Michael

From greg at durendal.org  Mon Aug 31 16:30:10 2009
From: greg at durendal.org (Greg Weeks)
Date: Mon, 31 Aug 2009 19:30:10 -0400 (EDT)
Subject: [gutvol-d] Grosly Broken browser or wiki
Message-ID: <Pine.LNX.4.63.0908311929090.32579@durendal.durendal.org>


Something is messed up. Can someone undo the last edit I just did for the 
Science fiction bookshelf wiki page. It wiped it clean.

-- 
Greg Weeks
http://durendal.org:8080/greg/


From kevin.pulliam at gmail.com  Mon Aug 31 18:02:36 2009
From: kevin.pulliam at gmail.com (Kevin Pulliam)
Date: Mon, 31 Aug 2009 20:02:36 -0500
Subject: [gutvol-d] Re: Grosly Broken browser or wiki
In-Reply-To: <Pine.LNX.4.63.0908311929090.32579@durendal.durendal.org>
References: <Pine.LNX.4.63.0908311929090.32579@durendal.durendal.org>
Message-ID: <78defb40908311802g1c2c3f55m170172232611b74c@mail.gmail.com>

Apparently I don't have the permission to fix what happened, but I did
snag a save of the state of the page immediately prior to your edit.
(That way if an auto process trims the history it can still be
restored by someone with high enough permissions)

Note: I tried in both IE and Firefox.

Kevin Pulliam


On 8/31/09, Greg Weeks <greg at durendal.org> wrote:
>
>  Something is messed up. Can someone undo the last edit I just did for the
> Science fiction bookshelf wiki page. It wiped it clean.
>
>  --
>  Greg Weeks
>  http://durendal.org:8080/greg/
>
>  _______________________________________________
>  gutvol-d mailing list
>  gutvol-d at lists.pglaf.org
>  http://lists.pglaf.org/mailman/listinfo/gutvol-d
>

From greg at durendal.org  Mon Aug 31 18:40:04 2009
From: greg at durendal.org (Greg Weeks)
Date: Mon, 31 Aug 2009 21:40:04 -0400 (EDT)
Subject: [gutvol-d] Re: Grosly Broken browser or wiki
In-Reply-To: <78defb40908311802g1c2c3f55m170172232611b74c@mail.gmail.com>
References: <Pine.LNX.4.63.0908311929090.32579@durendal.durendal.org>
	<78defb40908311802g1c2c3f55m170172232611b74c@mail.gmail.com>
Message-ID: <Pine.LNX.4.63.0908312139180.3657@durendal.durendal.org>

On Mon, 31 Aug 2009, Kevin Pulliam wrote:

> Apparently I don't have the permission to fix what happened, but I did
> snag a save of the state of the page immediately prior to your edit.
> (That way if an auto process trims the history it can still be
> restored by someone with high enough permissions)
>
> Note: I tried in both IE and Firefox.

I tried IE and two different versions of Firefox. All three ended back 
with a blank page. :(


>
> On 8/31/09, Greg Weeks <greg at durendal.org> wrote:
>>
>>  Something is messed up. Can someone undo the last edit I just did for the
>> Science fiction bookshelf wiki page. It wiped it clean.
>>
>>  --
>>  Greg Weeks
>>  http://durendal.org:8080/greg/
>>
>>  _______________________________________________
>>  gutvol-d mailing list
>>  gutvol-d at lists.pglaf.org
>>  http://lists.pglaf.org/mailman/listinfo/gutvol-d
>>
>

-- 
Greg Weeks
http://durendal.org:8080/greg/