From gbnewby at pglaf.org Wed Aug 5 16:55:11 2009 From: gbnewby at pglaf.org (Greg Newby) Date: Wed, 5 Aug 2009 16:55:11 -0700 Subject: [gutvol-d] eBook readers for Linux desktop Message-ID: <20090805235511.GA8221@mail.pglaf.org> Nice article I encountered. It's PDF online, but I actually got the dead trees edition :) Collabora, GutenPY and FBReader are all featured. http://www.linux-magazine.com/Issues/2009/104/FRESH-FROM-THE-PRESS -- Greg From hart at pobox.com Thu Aug 6 08:25:08 2009 From: hart at pobox.com (Michael S. Hart) Date: Thu, 6 Aug 2009 07:25:08 -0800 (AKDT) Subject: [gutvol-d] 25,000th English eBook Message-ID: As of today: 24879 in English At our usual rate of progress we should reach #25,000 sometime this month. Have we created a list of possible candidates yet? Suggestions welcome. Thanks!!! Thanks!!! Michael S. Hart Founder Project Gutenberg From hart at pobox.com Sun Aug 9 07:19:20 2009 From: hart at pobox.com (Michael S. Hart) Date: Sun, 9 Aug 2009 06:19:20 -0800 (AKDT) Subject: [gutvol-d] !@! Gutenberg's 25,000th English eBook Message-ID: As of today: 24902 English We are asking for suggestions for #25,000!!! We probably have just over 10 days. . . . Thanks!!! Michael From marcello at perathoner.de Sun Aug 9 12:01:30 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Sun, 09 Aug 2009 21:01:30 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <891262632.99249.1248790575502.JavaMail.mail@webmail05> References: <891262632.99249.1248790575502.JavaMail.mail@webmail05> Message-ID: <4A7F1D0A.8060500@perathoner.de> Joshua Hutchinson wrote: > Any chance of creating on the fly zips of some of the books? For instance, the > audio books are huge and usually divided along chapter lines. Single file zips > are very useful (and something we've done on some of them manually) but the > space waste is huge. On the fly zipping of those files would save huge in > storage space. The other way round would be much more useful: store only a zipped version of everything. All browsers support unzipping on-the-fly (all bibrec pages we serve are in a compressed format since years now) so the only people who'd notice anything was the ftp users. (But he who has mastered to type "ftp" can also be made to type "unzip", so no problem there.) From marcello at perathoner.de Sun Aug 9 12:04:25 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Sun, 09 Aug 2009 21:04:25 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: References: <891262632.99249.1248790575502.JavaMail.mail@webmail05> <20090728143306.GA28986@mail.pglaf.org> Message-ID: <4A7F1DB9.9070404@perathoner.de> David A. Desrosiers wrote: > If you're looking at it at that level, why not just offer some > streaming audio of the books as well? What's the advantage of streaming audio vs. getting the whole file at once? From marcello at perathoner.de Sun Aug 9 12:16:54 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Sun, 09 Aug 2009 21:16:54 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <212322090907291511r2d2a2ebfw198542bca2c2eb76@mail.gmail.com> References: <891262632.99249.1248790575502.JavaMail.mail@webmail05> <20090728143306.GA28986@mail.pglaf.org> <20090729172510.GC8946@mail.pglaf.org> <212322090907291511r2d2a2ebfw198542bca2c2eb76@mail.gmail.com> Message-ID: <4A7F20A6.2020003@perathoner.de> Paulo Levi wrote: > Tell me, please, if the gutenberg rtf index file, besides being > autogenerated, is also sorted. > > What i mean is, i'm indexing parts of the file, and i gain a major > speed up of treating the file as: > > a massive list of pgterms:etext definitions > > followed by a (more massive) list of pgterms:file definitions > > This allows the string comparisons i have to do to be lower (about > n/2), but at the cost that any etext record in between the second list > is not picked up. > > That won't be changed, but is only for my peace of mind. > Also, in the pgterms:file records, are the records referring to the > same file consecutive ? I ask because if so, i could do the same sort > of filtering Aaron Cannon is doing in its dvd project, to speed up the > index some more and remove duplicates. > (If they aren't consecutive i would have to issue queries between > building the index to see if they were already inserted). > > I have nothing against xpath, indeed i think the scanning of the file > in lucene already uses something similar. But i need free text > searches, and they have to be fast (i'm already experimenting with a > memory cache after the query too, and it works okish for my > application) All future changes to the file will be backward-compatible in an XML way. Meaning: the same XPath queries will yield (supersets of) the same result sets. I do NOT guarantee that the sequence of entities will be sorted the same way. Your sorting needs will probably be much different from the next guy's, so the sorting is up to you. From marcello at perathoner.de Sun Aug 9 12:35:59 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Sun, 09 Aug 2009 21:35:59 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <79902BC0-1EED-4021-B529-60A6CE413F38@ark.in-berlin.de> References: <212322090907151725x194f5361j2741f331dbf775f1@mail.gmail.com> <212322090907151728x84b224by697776eec6e265a4@mail.gmail.com> <4A6D5639.5080600@perathoner.de> <002494FB-121D-4CC8-80B0-AE3EFA238ADB@ark.in-berlin.de> <79902BC0-1EED-4021-B529-60A6CE413F38@ark.in-berlin.de> Message-ID: <4A7F251F.7000605@perathoner.de> Ralf Stephan wrote: > I confirm that neither the Plucker nor the Mobile formats > are mentioned in the catalog file. Do you have an > explanation, Marcello? Because the plucker files are not in the archive. They are generated on-the-fly (sort of) and then cached for some time. We don't know which files are in the cache at any actual moment. This design was changed for the epub and mobi files. They are generated manually and stay in the cache (until manually rebuilt). They could be added to the database as well and to the RDF file too. Just a small matter of programming ... Just now I'm concentrating on making better epubs. By the time I'll be satisfied, I'll tackle the databse code. From marcello at perathoner.de Sun Aug 9 13:00:23 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Sun, 09 Aug 2009 22:00:23 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <20090728143306.GA28986@mail.pglaf.org> References: <891262632.99249.1248790575502.JavaMail.mail@webmail05> <20090728143306.GA28986@mail.pglaf.org> Message-ID: <4A7F2AD7.5010801@perathoner.de> Greg Newby wrote: > A more general approach would be to let visitors to www.gutenberg.org > put their selected files (including those generated on-the-fly) > on a bookshelf (i.e., shopping cart), then download in one big file, > or several small ones. Or we could tell them how to use the download manager in their browsers ... Seriously, whenever I download a couple of books, I just click away and let the downloads complete in the background. I concur that there has something to be done for the audio files. Even just clicking on 10+ files can be bothersome. But I always favor the lowest-tech solution. So why not add a M3U playlist to the directory? M3U just a text file with a list of urls and is supported by Winamp and many other players. http://gonze.com/playlists/playlist-format-survey.html From marcello at perathoner.de Sun Aug 9 13:17:57 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Sun, 09 Aug 2009 22:17:57 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: References: <212322090907151725x194f5361j2741f331dbf775f1@mail.gmail.com> <212322090907151728x84b224by697776eec6e265a4@mail.gmail.com> <4A6D5639.5080600@perathoner.de> <002494FB-121D-4CC8-80B0-AE3EFA238ADB@ark.in-berlin.de> Message-ID: <4A7F2EF5.1050101@perathoner.de> David A. Desrosiers wrote: > On Mon, Jul 27, 2009 at 1:45 PM, Ralf Stephan wrote: >> My, can't we admit that XPath is a bit over our head, >> so we prefer confronting the admin we're supposed >> to be cooperating with? Wrt resources, my guess it's >> about par traffic-wise (1-5k per book vs. megabytes >> of RDF) but much better CPU-wise. That is, if you don't >> want the RDF for other fine things like metadata etc. > > I think you've missed my point. > > The RDF flat-out cannot tell me which of the target _formats_ are > available for immediate download to the users. I'm not looking for > which _titles_ are available in the catalog, I'm looking for which > _formats_ are available. Also note that I'm already parsing the feeds > to see what the top 'n' titles are already, so parsing XML via > whatever methods I need is not the blocker here. > > Let me give you an example of two titles available in the catalog: > > Verg?nglichkeit by Sigmund Freud > http://www.gutenberg.org/cache/plucker/29514/29514 > > The Lost Word by Henry Van Dyke > http://www.gutenberg.org/cache/plucker/4384/4384 > > Both of these _titles_ are available in the Gutenberg catalog, but the > second one is not available in the Plucker _format_ for immediate > download. Big difference from parsing title availability from the > catalog.rdf file. So you are doing a HEAD on the cache location? I hope you don't have many of these in the field, because you're going to look very sorry whenever the location of the cache changes. (It will! I give you fair notice for free :-) ) > Make sense now? No. Why is that "immediate download" bit so important for you? You will get a completely random set of files. (A cached plucker file expires 7 days after *generation* not after the last *access*. So all you get is the set of files generated in the last 7 days.) And a wrong set too. The first file could have been deleted on the server long before you finished your barrage of HEAD request. From joshua at hutchinson.net Mon Aug 10 05:47:10 2009 From: joshua at hutchinson.net (Joshua Hutchinson) Date: Mon, 10 Aug 2009 12:47:10 +0000 (GMT) Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg Message-ID: <85614071.121560.1249908430271.JavaMail.mail@webmail09> An HTML attachment was scrubbed... URL: From joshua at hutchinson.net Mon Aug 10 05:52:15 2009 From: joshua at hutchinson.net (Joshua Hutchinson) Date: Mon, 10 Aug 2009 12:52:15 +0000 (GMT) Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg Message-ID: <2011678691.121630.1249908735714.JavaMail.mail@webmail09> An HTML attachment was scrubbed... URL: From marcello at perathoner.de Mon Aug 10 06:03:20 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Mon, 10 Aug 2009 15:03:20 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <85614071.121560.1249908430271.JavaMail.mail@webmail09> References: <85614071.121560.1249908430271.JavaMail.mail@webmail09> Message-ID: <4A801A98.8020603@perathoner.de> Joshua Hutchinson wrote: > Well, the one problem I can see if that it becomes very hard to link to a > specific file within the zip file. Internet Archive already has a solution. They can extract a jpg from a zip on the fly. Look at this url: http://ia331340.us.archive.org/zipview.php?zip=/3/items/goodytwoshoes00newyiala/goodytwoshoes00newyiala_flippy.zip&file=0002.jpg "Borrowing" that code should be easy. > For instance, with the audio files, we have index files that let you link > directly to a particular section/chapter. If that file is within a zip file, > that becomes a problem. No problem. Keep the index outside the zip file then. From marcello at perathoner.de Mon Aug 10 13:05:02 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Mon, 10 Aug 2009 22:05:02 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <85614071.121560.1249908430271.JavaMail.mail@webmail09> References: <85614071.121560.1249908430271.JavaMail.mail@webmail09> Message-ID: <4A807D6E.2040409@perathoner.de> Joshua Hutchinson wrote: > Well, the one problem I can see if that it becomes very hard to link to a > specific file within the zip file. Actually it takes just a hundred lines of code: http://www.gutenberg.org/zipcat/files/20686/20686-ogg.zip?ogg/20686-01.ogg From joshua at hutchinson.net Mon Aug 10 13:22:14 2009 From: joshua at hutchinson.net (Joshua Hutchinson) Date: Mon, 10 Aug 2009 20:22:14 +0000 (GMT) Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg Message-ID: <2088833402.128411.1249935734280.JavaMail.mail@webmail05> An HTML attachment was scrubbed... URL: From marcello at perathoner.de Mon Aug 10 14:47:25 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Mon, 10 Aug 2009 23:47:25 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <2088833402.128411.1249935734280.JavaMail.mail@webmail05> References: <2088833402.128411.1249935734280.JavaMail.mail@webmail05> Message-ID: <4A80956D.7010700@perathoner.de> Joshua Hutchinson wrote: > Hey, very nice! Can I assume this functionality is now live? If so, I can save > a bunch of space in the audio files. :) Something of this kind will be live soon. I'd like it to use exactly the same url as the url to the file, so the users won't even notice we changed something. Audio books will need just the zips for the different audio codecs, one readme, one index file and maybe one m3u playlist for every codec. I noticed the file structure for audio book zips doesn't follow exactly the structure for html zips. The directories ogg/ mp3/ spx/ should actually be named 12345-ogg/ 12345-mp3/ 12345-spx/. That would be more in line with the 12345-h/ convention we use for html. Also the top directory inside the zipped files should be 12345-ogg/ etc. From user5013 at aol.com Mon Aug 10 15:19:33 2009 From: user5013 at aol.com (Christa & Jay Toser) Date: Mon, 10 Aug 2009 17:19:33 -0500 Subject: [gutvol-d] Book #25,000 Message-ID: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com> I may be ignorant on this, but I cannot find a complete English copy of the Koran (or however they spell it) anywhere on Project Gutenberg. So also, HOW do they spell it in English? Jay Toser From hart at pobox.com Mon Aug 10 15:34:54 2009 From: hart at pobox.com (Michael S. Hart) Date: Mon, 10 Aug 2009 14:34:54 -0800 (AKDT) Subject: [gutvol-d] Re: Book #25,000 In-Reply-To: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com> References: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com> Message-ID: On Mon, 10 Aug 2009, Christa & Jay Toser wrote: > I may be ignorant on this, but > > I cannot find a complete English copy of the Koran (or however they spell it) > anywhere on Project Gutenberg. > > So also, HOW do they spell it in English? > > Jay Toser As with anything translated from another language, particularly one with another alphabet, spelling a word or phrase in English may be inconsistent: Koran Kuran Quran Q'ran Qu'ran Q'uran Qur'a-n Alcoran Al-Qur'a-n with various accents, all seem to pop up, but this list is neither complete nor in best order; sorry, but I am sure I could come up with better if I had enough time for some serious research. As for doing this one as #25,000, I'm afraid there are too many copyright questions already on what a public domain version might be. . .and. . .in fact a number of people have told me what I interpret a non-legal opinion as: There is no public domain version of The Koran for Project Gutenberg to work with in English. So far we have not had any closure on this. Hence my apologies, Michael From ajhaines at shaw.ca Mon Aug 10 15:46:58 2009 From: ajhaines at shaw.ca (Al Haines (shaw)) Date: Mon, 10 Aug 2009 15:46:58 -0700 Subject: [gutvol-d] Re: Book #25,000 References: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com> Message-ID: I got seven matches for "Koran" on a "Title Word" search, none for the alternates I could think of--Kuran, Qoran, Quran. I leave it to others to speak to their completeness. FYI--there's a raft of "Koran" matches in Internet Archive (http://www.archive.org/search.php?query=title%3A%28koran%29 ----- Original Message ----- From: "Christa & Jay Toser" To: Sent: Monday, August 10, 2009 3:19 PM Subject: [gutvol-d] Book #25,000 >I may be ignorant on this, but > > I cannot find a complete English copy of the Koran (or however they spell > it) anywhere on Project Gutenberg. > > So also, HOW do they spell it in English? > > Jay Toser > _______________________________________________ > gutvol-d mailing list > gutvol-d at lists.pglaf.org > http://lists.pglaf.org/mailman/listinfo/gutvol-d > From jtinsley at pobox.com Mon Aug 10 16:03:35 2009 From: jtinsley at pobox.com (Jim Tinsley) Date: Mon, 10 Aug 2009 19:03:35 -0400 Subject: [gutvol-d] Re: Book #25,000 In-Reply-To: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com> References: <02973144-1592-47E6-924E-16B7DACA21F2@aol.com> Message-ID: <20090810230334.GA12208@panix.com> On Mon, 10 Aug 2009 17:19:33 -0500, Christa & Jay Toser wrote: >I may be ignorant on this, but > >I cannot find a complete English copy of the Koran (or however they >spell it) anywhere on Project Gutenberg. > >So also, HOW do they spell it in English? I look in for the first time in forever, and see a question I can answer - at length, if nobody stops me. :-) I've had some emailed discussion with people who were quite vehement on the subject. My formatting is out of date, but the catalog does still seem to find the books: Three Translations of The Koran (Al-Qur'an) side by side 16955 16955.txt 16955.zip Feb 2005 The Koran, Translated by George Sale [koranxxb.xxx] 7440 koran09b.txt koran09b.zip koran09br.zip (Note: previously incorrectly listed as The Koran, by Mohammed/Mohammad) Sep 2002 The Koran, Translated by J.M Rodwell [koranxxa.xxx] 3434 koran10a.txt koran10a.zip koran11a.txt koran11a.zip koran12a.txt koran12a.zip (Note: previously incorrectly listed as The Koran, by Mohammed/Mohammad) Sep 2001 The Koran, Translated by J.M. Rodwell [koranxxx.xxx] 2800 koran10.txt koran10.zip (Note: previously incorrectly listed as The Koran, by Mohammed/Mohammad) jim From gbnewby at pglaf.org Mon Aug 10 22:25:17 2009 From: gbnewby at pglaf.org (Greg Newby) Date: Mon, 10 Aug 2009 22:25:17 -0700 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <4A7F2AD7.5010801@perathoner.de> References: <891262632.99249.1248790575502.JavaMail.mail@webmail05> <20090728143306.GA28986@mail.pglaf.org> <4A7F2AD7.5010801@perathoner.de> Message-ID: <20090811052517.GB2868@mail.pglaf.org> On Sun, Aug 09, 2009 at 10:00:23PM +0200, Marcello Perathoner wrote: > Greg Newby wrote: > >> A more general approach would be to let visitors to www.gutenberg.org >> put their selected files (including those generated on-the-fly) >> on a bookshelf (i.e., shopping cart), then download in one big file, >> or several small ones. > > Or we could tell them how to use the download manager in their browsers ... > > Seriously, whenever I download a couple of books, I just click away and > let the downloads complete in the background. A bookshelf (or shopping cart, or call it something else) isn't just about managing downloads started immediately upon finding something of interest. It's about keeping track of books to download in the future, books already downloaded, and books to start downloading later (say, overnight or when someone goes from home to a location with higher bandwidth). Sure, people can use browser bookmarks for some of this, but: - bookmarks need to then be managed, for after the book is downloaded - bookmarking the bibrec page is easy, but right-click "bookmark target as..." to actually select a file format is harder for our many naive users - A larger # of bookmarks for future reading won't have very useful functionality; a shopping cart model at www.gutenberg.org could have some more useful features Probably another site will implement this for the Gutenberg content, eventually. Some of the resellers already have this type of thing. -- Greg PS: I really like adding an m3u playlist file to audio content > I concur that there has something to be done for the audio files. Even > just clicking on 10+ files can be bothersome. > > But I always favor the lowest-tech solution. > > So why not add a M3U playlist to the directory? M3U just a text file > with a list of urls and is supported by Winamp and many other players. > > http://gonze.com/playlists/playlist-format-survey.html From gbnewby at pglaf.org Mon Aug 10 22:30:45 2009 From: gbnewby at pglaf.org (Greg Newby) Date: Mon, 10 Aug 2009 22:30:45 -0700 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <4A7F1DB9.9070404@perathoner.de> References: <891262632.99249.1248790575502.JavaMail.mail@webmail05> <20090728143306.GA28986@mail.pglaf.org> <4A7F1DB9.9070404@perathoner.de> Message-ID: <20090811053045.GD2868@mail.pglaf.org> On Sun, Aug 09, 2009 at 09:04:25PM +0200, Marcello Perathoner wrote: > David A. Desrosiers wrote: > >> If you're looking at it at that level, why not just offer some >> streaming audio of the books as well? > > What's the advantage of streaming audio vs. getting the whole file at once? Listening while downloading. This seems to happen for some people, but not others. I don't know whether differences are in content-type, the server, or somewhere else, but a frequent email Q sent to help@ is, "I clicked your MP3, and I got a message that it downloaded. Where is it, and how to I listen?" In that case, the browser downloaded the file without opening it in a player. I get the opposite question, too: "I clicked your MP3 and it played, but now I want to save it to my computer to transfer to my MP3 player." I don't know enough about the mix of protocols, server settings, browsers, players, etc. to know whether there is a single solution. But from our reader/listener's point of view, I think two links per audiobook file (MP3, OGG, etc.) would be desirable: a. Listen now (streaming) b. Download for later -- Greg From marcello at perathoner.de Tue Aug 11 00:49:17 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Tue, 11 Aug 2009 09:49:17 +0200 Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg In-Reply-To: <20090811053045.GD2868@mail.pglaf.org> References: <891262632.99249.1248790575502.JavaMail.mail@webmail05> <20090728143306.GA28986@mail.pglaf.org> <4A7F1DB9.9070404@perathoner.de> <20090811053045.GD2868@mail.pglaf.org> Message-ID: <4A81227D.10307@perathoner.de> Greg Newby wrote: > On Sun, Aug 09, 2009 at 09:04:25PM +0200, Marcello Perathoner wrote: >> David A. Desrosiers wrote: >> >>> If you're looking at it at that level, why not just offer some >>> streaming audio of the books as well? >> What's the advantage of streaming audio vs. getting the whole file at once? > > Listening while downloading. This seems to happen for some people, > but not others. Many players can play while downloading from a file. Firefox on Linux starts "reading" within 3 seconds of clicking on a 30 Megs mp3. Streaming has its only advantage if you don't have enough disc to store the file on the device or if you use multicast protocols, both of which we can rule out for PG. > I don't know whether differences are in content-type, the server, > or somewhere else, but a frequent email Q sent to help@ is, "I clicked > your MP3, and I got a message that it downloaded. Where is it, and how > to I listen?" > In that case, the browser downloaded the file without opening it > in a player. > > I get the opposite question, too: "I clicked your MP3 and it played, > but now I want to save it to my computer to transfer to my MP3 player." This entirely depends on the settings of the browser. The server only sends a header: "content-type: audio/ogg" and the browser decides what to do with it. Somebody ought to write an illustrated how-to for users, explaining how to config their browser for immediate playback on left-click and how to use right-click/save-as for saving to disc. > I don't know enough about the mix of protocols, server settings, browsers, > players, etc. to know whether there is a single solution. But from our > reader/listener's point of view, I think two links per audiobook file > (MP3, OGG, etc.) would be desirable: > > a. Listen now (streaming) > b. Download for later This *may* work using the non-standard "content-disposition" header, which different browsers implement in slightly different ways ... so then we'll have a swimming-pool of snakes. From hart at pobox.com Tue Aug 11 06:28:09 2009 From: hart at pobox.com (Michael S. Hart) Date: Tue, 11 Aug 2009 05:28:09 -0800 (AKDT) Subject: [gutvol-d] 81 Left to English eBook #25,000 Message-ID: We still don't have a winner! From joshua at hutchinson.net Tue Aug 11 07:52:52 2009 From: joshua at hutchinson.net (Joshua Hutchinson) Date: Tue, 11 Aug 2009 14:52:52 +0000 (GMT) Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg Message-ID: <1275438696.7865.1250002372944.JavaMail.mail@webmail03> An HTML attachment was scrubbed... URL: From hart at pobox.com Tue Aug 11 14:03:21 2009 From: hart at pobox.com (Michael S. Hart) Date: Tue, 11 Aug 2009 13:03:21 -0800 (AKDT) Subject: [gutvol-d] Re: A Gutenberg text.A few poss. typos (fwd) Message-ID: My email isn't getting through to bugs or errors at pglaf.org So I'm forwarding this here. Michael ---------- Forwarded message ---------- Date: Tue, 11 Aug 2009 13:17:09 -0400 From: Whitney Landon To: Michael S. Hart Subject: Re: A Gutenberg text.A few poss. typos Happenstantial more-than-a-few-weeks-later find of this -- there's been no contact from bugs at pglaf.org . Thanks again. WL ----- Original Message ----- From: "Michael S. Hart" To: "Whitney Landon" Cc: Sent: Thursday, March 12, 2009 6:06 AM Subject: Re: A Gutenberg text.A few poss. typos > > I forwarded to bugs at pglaf.org for you. > > Let me know if you don't hear from them in a few weeks, > which is entirely possible, and I may take of it for you. > > Many thanks!!! > > > Michael > > > > On Thu, 12 Mar 2009, Whitney Landon wrote: > > > A few probable/possible typos [in brackets, below] and a possible > > omission from:Title: The Great God Success > > > > Author: John Graham (David Graham Phillips) > > > > Release Date: April, 2005 [EBook #7989] > > [Yes, we are more than one year ahead of schedule] > > [This file was first posted on June 10, 2003] > > > > Edition: 10 > > > > 8gtgd10.txtOriginally published in 1901, per Wikipedia. No text date in > > Gutenberg download."Precisely. I think I will put you next [to] > > him."Washington Square, South, about--three years [that double-dash > > punctuation is weird; looking at it now it could well be the way it was > > published, having seen it a lot, but it's still weird]I do not want your > > friends[.] I do not want to be > > respectable. Howard held her away from him and studied her face, surprise > > in his eyes. > > "Isn't it strange?" he said.[paragraph break? Howard continues ...] > > "Here I've been seeing youopen window and she W[w]as propped with > > pillowsHoward took his rooms in the Royalton. ... his club where he was > > living for a few days [two don't mesh; hard to imagine how it's a typo, > > tho]_do_ [many such constructions: translate to italic if possible; can > > "search" for the _ (underline) to find them; maybe impossible with > > font/face]his power of concentration[.] Hea S[s]ense of relief and of > > calmness [probably]Provost and her niece, Miss Marion[typo? jab at > > newspapers' proclivities for typos? "Marian" with two "a"s, everywhere > > else]At a[ ]bound it had becomeHe drew a[ ]long breath > > and-------------------Thanks for the book. I learned about the author for > > the first time in the Times to/yesterday > > http://www.nytimes.com/2009/03/11/us/politics/11senate.html?_r=1followed > > that with some Wikipedia, then came (only 2nd or 3rd time, but with > > confidence) to PG .... Finished in a sitting. Quite an experience. Thanks > > again.Whitney LandonBrooksville, Maine > > -------------------------------------------------------------------------------- No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.0.237 / Virus Database: 270.11.9/1991 - Release Date: 03/09/09 07:14:00 From ajhaines at shaw.ca Tue Aug 11 14:49:10 2009 From: ajhaines at shaw.ca (Al Haines (shaw)) Date: Tue, 11 Aug 2009 14:49:10 -0700 Subject: [gutvol-d] Re: A Gutenberg text.A few poss. typos (fwd) References: Message-ID: <3054A343AAF943E284F361387ACC295E@alp2400> I've forwarded it to Errata, which is currently auto-forwarded to the WWers. ----- Original Message ----- From: "Michael S. Hart" To: "The gutvol-d Mailing List" Sent: Tuesday, August 11, 2009 2:03 PM Subject: [gutvol-d] Re: A Gutenberg text.A few poss. typos (fwd) > > > My email isn't getting through to bugs or errors at pglaf.org > > So I'm forwarding this here. > > > Michael > > > ---------- Forwarded message ---------- > Date: Tue, 11 Aug 2009 13:17:09 -0400 > From: Whitney Landon > To: Michael S. Hart > Subject: Re: A Gutenberg text.A few poss. typos > > Happenstantial more-than-a-few-weeks-later find of this -- there's been no > contact from bugs at pglaf.org . Thanks again. > WL > ----- Original Message ----- From: "Michael S. Hart" > To: "Whitney Landon" > Cc: > Sent: Thursday, March 12, 2009 6:06 AM > Subject: Re: A Gutenberg text.A few poss. typos > > >> >> I forwarded to bugs at pglaf.org for you. >> >> Let me know if you don't hear from them in a few weeks, >> which is entirely possible, and I may take of it for you. >> >> Many thanks!!! >> >> >> Michael >> >> >> >> On Thu, 12 Mar 2009, Whitney Landon wrote: >> >> > A few probable/possible typos [in brackets, below] and a possible >> > omission from:Title: The Great God Success >> > >> > Author: John Graham (David Graham Phillips) >> > >> > Release Date: April, 2005 [EBook #7989] >> > [Yes, we are more than one year ahead of schedule] >> > [This file was first posted on June 10, 2003] >> > >> > Edition: 10 >> > >> > 8gtgd10.txtOriginally published in 1901, per Wikipedia. No text date in >> > Gutenberg download."Precisely. I think I will put you next [to] >> > him."Washington Square, South, about--three years [that double-dash >> > punctuation is weird; looking at it now it could well be the way it was >> > published, having seen it a lot, but it's still weird]I do not want >> > your >> > friends[.] I do not want to be >> > respectable. Howard held her away from him and studied her face, >> > surprise >> > in his eyes. >> > "Isn't it strange?" he said.[paragraph break? Howard continues ...] >> > "Here I've been seeing youopen window and she W[w]as propped with >> > pillowsHoward took his rooms in the Royalton. ... his club where he was >> > living for a few days [two don't mesh; hard to imagine how it's a typo, >> > tho]_do_ [many such constructions: translate to italic if possible; can >> > "search" for the _ (underline) to find them; maybe impossible with >> > font/face]his power of concentration[.] Hea S[s]ense of relief and of >> > calmness [probably]Provost and her niece, Miss Marion[typo? jab at >> > newspapers' proclivities for typos? "Marian" with two "a"s, everywhere >> > else]At a[ ]bound it had becomeHe drew a[ ]long breath >> > and-------------------Thanks for the book. I learned about the author >> > for >> > the first time in the Times to/yesterday >> > http://www.nytimes.com/2009/03/11/us/politics/11senate.html?_r=1followed >> > that with some Wikipedia, then came (only 2nd or 3rd time, but with >> > confidence) to PG .... Finished in a sitting. Quite an experience. >> > Thanks >> > again.Whitney LandonBrooksville, Maine >> > > > > -------------------------------------------------------------------------------- > > > > No virus found in this incoming message. > Checked by AVG - www.avg.com > Version: 8.0.237 / Virus Database: 270.11.9/1991 - Release Date: 03/09/09 > 07:14:00 > _______________________________________________ > gutvol-d mailing list > gutvol-d at lists.pglaf.org > http://lists.pglaf.org/mailman/listinfo/gutvol-d > From joshua at hutchinson.net Wed Aug 12 13:18:58 2009 From: joshua at hutchinson.net (Joshua Hutchinson) Date: Wed, 12 Aug 2009 20:18:58 +0000 (GMT) Subject: [gutvol-d] Re: Fwd: Programmatic fetching books from Gutenberg Message-ID: <1210883944.40773.1250108338335.JavaMail.mail@webmail03> An HTML attachment was scrubbed... URL: From pterandon at gmail.com Sun Aug 16 19:09:43 2009 From: pterandon at gmail.com (Greg M. Johnson) Date: Sun, 16 Aug 2009 22:09:43 -0400 Subject: [gutvol-d] Re: Suggestions for the DVD must haves? Message-ID: Here's an updated idea of how to do it: 1) Put in the complete, where available: i) Melvyn Bragg's Books that Changed the World ii) Norwegian Book Clubs in Oslo (2002) http://www.gutenberg.org/wiki/Best_Books_Ever_Listings_%28Bookshelf%29 2) Put in the Harvard Classics http://www.gutenberg.org/wiki/Harvard_Classics_%28Bookshelf%29 3) Put in the BestSellers 1900-1917: http://www.gutenberg.org/wiki/Bestsellers%2C_American%2C_1900-1922_%28Bookshelf%29 4) The Most Popular 100 books from the last 30 days: http://www.gutenberg.org/browse/scores/top#books-last30 5) Then, use a program to alternate between these three below until you run out of room: i) One randomly chosen item off of a randomly chosen Bookshelves http://www.gutenberg.org/wiki/Category:Bookshelf ii) The complete works of one author under your Most popular Authors last 30 days: http://www.gutenberg.org/browse/scores/top#authors-last30 Just an idea. But that might be a cool, comprehensive way to make a collection. Candy + cod liver oil. -- Greg M. Johnson http://pterandon.blogspot.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From hart at pobox.com Mon Aug 24 19:26:07 2009 From: hart at pobox.com (Michael S. Hart) Date: Mon, 24 Aug 2009 19:26:07 -0700 (PDT) Subject: [gutvol-d] Rosetta Stone as eBook Message-ID: We are exploring doing something on the order of public domain translations of the various Ancient Greek, Coptic, Hieroglyphs et al from The Rosetta Stone. Please advise if you would be interested in helping. We put this in Preprints: http://preprints.readingroo.ms/rosetta/ Including a README.TXT Original source: http://books.google.com/books?id=Q5Ow9Ub_BIgC&oe=UTF-8 it's also at archive.org, with raw OCR Many thanks!!! -- Greg and Michael From greg at durendal.org Mon Aug 31 16:30:10 2009 From: greg at durendal.org (Greg Weeks) Date: Mon, 31 Aug 2009 19:30:10 -0400 (EDT) Subject: [gutvol-d] Grosly Broken browser or wiki Message-ID: Something is messed up. Can someone undo the last edit I just did for the Science fiction bookshelf wiki page. It wiped it clean. -- Greg Weeks http://durendal.org:8080/greg/ From kevin.pulliam at gmail.com Mon Aug 31 18:02:36 2009 From: kevin.pulliam at gmail.com (Kevin Pulliam) Date: Mon, 31 Aug 2009 20:02:36 -0500 Subject: [gutvol-d] Re: Grosly Broken browser or wiki In-Reply-To: References: Message-ID: <78defb40908311802g1c2c3f55m170172232611b74c@mail.gmail.com> Apparently I don't have the permission to fix what happened, but I did snag a save of the state of the page immediately prior to your edit. (That way if an auto process trims the history it can still be restored by someone with high enough permissions) Note: I tried in both IE and Firefox. Kevin Pulliam On 8/31/09, Greg Weeks wrote: > > Something is messed up. Can someone undo the last edit I just did for the > Science fiction bookshelf wiki page. It wiped it clean. > > -- > Greg Weeks > http://durendal.org:8080/greg/ > > _______________________________________________ > gutvol-d mailing list > gutvol-d at lists.pglaf.org > http://lists.pglaf.org/mailman/listinfo/gutvol-d > From greg at durendal.org Mon Aug 31 18:40:04 2009 From: greg at durendal.org (Greg Weeks) Date: Mon, 31 Aug 2009 21:40:04 -0400 (EDT) Subject: [gutvol-d] Re: Grosly Broken browser or wiki In-Reply-To: <78defb40908311802g1c2c3f55m170172232611b74c@mail.gmail.com> References: <78defb40908311802g1c2c3f55m170172232611b74c@mail.gmail.com> Message-ID: On Mon, 31 Aug 2009, Kevin Pulliam wrote: > Apparently I don't have the permission to fix what happened, but I did > snag a save of the state of the page immediately prior to your edit. > (That way if an auto process trims the history it can still be > restored by someone with high enough permissions) > > Note: I tried in both IE and Firefox. I tried IE and two different versions of Firefox. All three ended back with a blank page. :( > > On 8/31/09, Greg Weeks wrote: >> >> Something is messed up. Can someone undo the last edit I just did for the >> Science fiction bookshelf wiki page. It wiped it clean. >> >> -- >> Greg Weeks >> http://durendal.org:8080/greg/ >> >> _______________________________________________ >> gutvol-d mailing list >> gutvol-d at lists.pglaf.org >> http://lists.pglaf.org/mailman/listinfo/gutvol-d >> > -- Greg Weeks http://durendal.org:8080/greg/