From Bowerbird at aol.com Mon Mar 2 02:12:19 2009 From: Bowerbird at aol.com (Bowerbird at aol.com) Date: Mon, 2 Mar 2009 05:12:19 EST Subject: [gutvol-d] publishers get to decide if they want d.r.m. or not Message-ID: "publishers get to decide if they want d.r.m. or not." that's what jeff bezos said when he was on the daily show last week, appearing on the very day the kindle 2 shipped. some small publishers and e-publishers are griping now, saying that no, actually amazon forces them to use d.r.m. they are confused. mightily confused. and it's amusing... see, when bezos talks about "publishers", he's not talking about _them_... he's talking about the _big_ publishers... you know, the major houses, owned by the corporations. small publishers are nothing but small potatoes to amazon, which -- you will recall -- is the world's biggest bookstore, named after one of the mightiest rivers on the whole planet. small publishers are an afterthought to amazon, much like the impulse items by the checkstand at your grocery store... and those big publishers? they _want_ d.r.m. _demand_ it. wouldn't sell any books at amazon if it didn't give 'em d.r.m. amazon is all about making those major publishers happy. when they balked at text-to-speech, amazon pulled it back. and if those corporate publishers balked at d.r.m., you can bet your sweet bippy jeff would pull that back just as fast... seriously, these small publishers think they're real important because they have a blog. you're nothing, kids. move over, get out of the way, and make some room for the big guys... oh yeah, jeff wasn't lying either. if you, as a publisher, decide that you do not want to have d.r.m. on your books, you can always sell unencumbered mobipocket books to customers. or plain-text-books, .rtf, or .html. there's all kinds of options. amazon will even pay the expense of doing any conversions. (since, remember?, you're just a drop in the bucket to them.) but you say you want to sell non-d.r.m. books _at_ amazon? well, like any retailer, they decide what they want to sell, and if you want to sell in their store, you accept their conditions. if you don't want to accept them, you don't have to sell there. and no, they're not going to make an "exception" for you... at least, not unless their major partners -- big publishers -- tell them that they should. and why would the big houses let _you_ have the advantage of selling non-d.r.m. books? they wouldn't. so they make amazon d.r.m. all the books. and amazon obliges, because amazon wants them happy. and they couldn't care less about whether _you_ are happy. that's how it works... money makes the world go 'round... -bowerbird ************** A Good Credit Score is 700 or Above. See yours in just 2 easy steps! (http://pr.atwola.com/promoclk/100126575x1218822736x1201267884/aol?redir=http:%2F%2Fwww.freecreditreport.com%2Fpm%2Fdefault.aspx%3Fsc%3D668072%26hmpgID %3D62%26bcd%3DfebemailfooterNO62) -------------- next part -------------- An HTML attachment was scrubbed... URL: From Bowerbird at aol.com Thu Mar 5 15:21:22 2009 From: Bowerbird at aol.com (Bowerbird at aol.com) Date: Thu, 5 Mar 2009 18:21:22 EST Subject: [gutvol-d] need a book scanned? Message-ID: this has been available for a few months now, but hasn't gotten the attention that it deserves... in conjunction with the boston public library, the open library -- or is it the internet archive?, or is it the open content alliance?, i can never tell -- is now offering a "scan on demand" service... if a public-domain book in the boston pubic library hasn't yet been scanned, you can bump its priority up to the top of the list by requesting it be scanned. within "3-5 days", it'll be scanned and placed online, with all the various formats, which includes the o.c.r. what a wonderful present, eh? "check it out" (so to speak): > http://openlibrary.org/bpl -bowerbird ************** A Good Credit Score is 700 or Above. See yours in just 2 easy steps! (http://pr.atwola.com/promoclk/100126575x1219957551x1201325337/aol?redir=http:%2F%2Fwww.freecreditreport.com%2Fpm%2Fdefault.aspx%3Fsc%3D668072%26hmpgID %3D62%26bcd%3DfebemailfooterNO62) -------------- next part -------------- An HTML attachment was scrubbed... URL: From Seekgod7 at aol.com Thu Mar 5 21:47:45 2009 From: Seekgod7 at aol.com (Seekgod7 at aol.com) Date: Fri, 6 Mar 2009 00:47:45 EST Subject: [gutvol-d] need a book scanned? Message-ID: In a message dated 3/5/2009 3:27:44 PM Pacific Standard Time, Bowerbird at aol.com writes: http://openlibrary.org/bpl Is it possible to get copies of ebooks such as As a man thinkith or Think and grow Rich along with the KJV Bible in print for a fee naturally. Thanks Scott Richards 951 807 5658 PST 1-7 PST _seekgod7 at aol.com_ (http://seekgod7 at aol.com/) Global Prayer Warriors Major Intercession from around the world if your in need of prayer please let us know We hate Spam as much as you do we only want to send these messages who are friends of Global Prayer Warriors or personal friends of Scott Richards, if you would like to be removed from these mailings kindly let me know and I will take the appropriate steps to make you happy. God Bless you. **************A Good Credit Score is 700 or Above. See yours in just 2 easy steps! (http://pr.atwola.com/promoclk/100126575x1219957551x1201325337/aol?redir=http:%2F%2Fwww.freecreditreport.com%2Fpm%2Fdefault.aspx%3Fsc%3D668072%26hmpgID %3D62%26bcd%3DfebemailfooterNO62) -------------- next part -------------- An HTML attachment was scrubbed... URL: From ajhaines at shaw.ca Thu Mar 5 22:17:59 2009 From: ajhaines at shaw.ca (Al Haines (shaw)) Date: Thu, 5 Mar 2009 22:17:59 -0800 Subject: [gutvol-d] need a book scanned? References: Message-ID: "As a Man Thinketh" (note the correct spelling of "thinketh") is in the Project Gutenberg collection - http://www.gutenberg.org/etext/4507 (This was pointed out to you several weeks ago.) The King James Bible (each Book is a separate file) is also in the PG collection--search on the title words "king james" at PG's main page. Both the above are freely downloadable. If you want to donate to Project Gutenberg, go here: http://www.gutenberg.org/wiki/Gutenberg:Project_Gutenberg_Needs_Your_Donation "Think and Grow Rich" is probably not in the public domain in the US. You can purchase it at Amazon Books. By the way, the "your" in the last line of your signature block should be "you're". ----- Original Message ----- From: Seekgod7 at aol.com To: gutvol-d at lists.pglaf.org Sent: Thursday, March 05, 2009 9:47 PM Subject: Re: [gutvol-d] need a book scanned? In a message dated 3/5/2009 3:27:44 PM Pacific Standard Time, Bowerbird at aol.com writes: http://openlibrary.org/bpl Is it possible to get copies of ebooks such as As a man thinkith or Think and grow Rich along with the KJV Bible in print for a fee naturally. Thanks Scott Richards 951 807 5658 PST 1-7 PST seekgod7 at aol.com Global Prayer Warriors Major Intercession from around the world if your in need of prayer please let us know We hate Spam as much as you do we only want to send these messages who are friends of Global Prayer Warriors or personal friends of Scott Richards, if you would like to be removed from these mailings kindly let me know and I will take the appropriate steps to make you happy. God Bless you. ------------------------------------------------------------------------------ A Good Credit Score is 700 or Above. See yours in just 2 easy steps! ------------------------------------------------------------------------------ _______________________________________________ gutvol-d mailing list gutvol-d at lists.pglaf.org http://lists.pglaf.org/listinfo.cgi/gutvol-d -------------- next part -------------- An HTML attachment was scrubbed... URL: From klofstrom at gmail.com Thu Mar 5 22:21:46 2009 From: klofstrom at gmail.com (Karen Lofstrom) Date: Thu, 5 Mar 2009 20:21:46 -1000 Subject: [gutvol-d] need a book scanned? In-Reply-To: References: Message-ID: <1e8e65080903052221k58429a6bnb7134c133b93813b@mail.gmail.com> On Thu, Mar 5, 2009 at 8:17 PM, Al Haines (shaw) wrote: > Both the above are freely downloadable. I'm starting to suspect that this fellow is posting "questions" to the list just to get his long and proselytizing sig on view. -- Zora From joshua at hutchinson.net Wed Mar 11 05:57:34 2009 From: joshua at hutchinson.net (Joshua Hutchinson) Date: Wed, 11 Mar 2009 12:57:34 +0000 (GMT) Subject: [gutvol-d] Kindle/Mobipocket support Message-ID: <2102467441.71155.1236776254129.JavaMail.mail@webmail02> An HTML attachment was scrubbed... URL: From ricardofdiogo at gmail.com Wed Mar 11 06:40:22 2009 From: ricardofdiogo at gmail.com (Ricardo F Diogo) Date: Wed, 11 Mar 2009 13:40:22 +0000 Subject: [gutvol-d] Kindle/Mobipocket support In-Reply-To: <2102467441.71155.1236776254129.JavaMail.mail@webmail02> References: <2102467441.71155.1236776254129.JavaMail.mail@webmail02> Message-ID: <9c6138c50903110640m5e11eb4eo938bf05b77c4da51@mail.gmail.com> 2009/3/11 Joshua Hutchinson : > Morning all! > > I know we have some books with Mobipocket formats in the catalog, so I know > we *can* add that format. > > My question is what is our policy on existing books.? For instance, are we > interested in adding new formats to existing ebook numbers? > > > Disclaimer: Yes, I just got a Kindle for my birthday.? Yes, I love the > thing.? So, yes, I have a vested interest in the answer. Happy birthday Joshua and congrats for your Kindle. I'd say nothing actually prevents you from sending in a mobipocket file. PG however prefers open and editable formats (which is not the case of mobipocket files). There are other websites that specialize in converting PG's files into a wider range of formats (see Manybooks.net). However: PG is now experimenting with EPUB files. I'd suggest you to contact Marcello (in case he doesn't jump by) in order to create an EPUB version of your favourite ebook. EPUB is meant to become the new standard of ebooks. (In my opinion PG as a great role here). EPUB, by the way, can be read by your Kindle, by Sony's PRS and by other devices as well. You can convert your files using a a free, open software called Calibre developed by Kovid Goyal. I'm not sure why PG's EPUBs don't support images, thought. (Guess it's a server space thing). Ricardo From bruce at zuhause.org Wed Mar 11 07:46:23 2009 From: bruce at zuhause.org (Bruce Albrecht) Date: Wed, 11 Mar 2009 09:46:23 -0500 Subject: [gutvol-d] Kindle/Mobipocket support In-Reply-To: <9c6138c50903110640m5e11eb4eo938bf05b77c4da51@mail.gmail.com> References: <2102467441.71155.1236776254129.JavaMail.mail@webmail02> <9c6138c50903110640m5e11eb4eo938bf05b77c4da51@mail.gmail.com> Message-ID: <49B7CEBF.4010703@zuhause.org> Ricardo F Diogo wrote: > I'm not sure why PG's EPUBs don't support images, thought. (Guess it's > a server space thing). Perhaps the PG EPubs are getting converted from the text edition and not the HTML edition. I believe that's true for the plucker versions as well. From marcello at perathoner.de Wed Mar 11 11:59:26 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Wed, 11 Mar 2009 19:59:26 +0100 Subject: [gutvol-d] Kindle/Mobipocket support In-Reply-To: <9c6138c50903110640m5e11eb4eo938bf05b77c4da51@mail.gmail.com> References: <2102467441.71155.1236776254129.JavaMail.mail@webmail02> <9c6138c50903110640m5e11eb4eo938bf05b77c4da51@mail.gmail.com> Message-ID: <49B80A0E.4030807@perathoner.de> Ricardo F Diogo wrote: > PG is now experimenting with EPUB files. I'd suggest you to contact > Marcello (in case he doesn't jump by) in order to create an EPUB > version of your favourite ebook. The converter is running in the background. Sadly, at this early stage, it dies often ... #55 is converted, though. > EPUB, by the way, can be read by your Kindle, by Sony's PRS and by > other devices as well. Directly? Or after conversion? > You can convert your files using a a free, open software called > Calibre developed by Kovid Goyal. Or by mobigen from mobipocket.com. > I'm not sure why PG's EPUBs don't support images, thought. (Guess it's > a server space thing). Because they are meant to be read on memory-impaired devices like mobile phones. I plan to add EPUBs with (reduced size) images later on. But what is the optimal size? -- Marcello Perathoner, Cologne, Germany webmaster at gutenberg.org From lee at novomail.net Wed Mar 11 14:21:48 2009 From: lee at novomail.net (Lee Passey) Date: Wed, 11 Mar 2009 15:21:48 -0600 Subject: [gutvol-d] Kindle/Mobipocket support In-Reply-To: <2102467441.71155.1236776254129.JavaMail.mail@webmail02> References: <2102467441.71155.1236776254129.JavaMail.mail@webmail02> Message-ID: <49B82B6C.8070103@novomail.net> Joshua Hutchinson wrote: > Morning all! > > I know we have some books with Mobipocket formats in the catalog, so I > know we *can* add that format. > > My question is what is our policy on existing books. For instance, are > we interested in adding new formats to existing ebook numbers? According to a continuing thread on The Ebook Community list, the Kindle directly supports (i.e. can read natively without conversion) .epub, Mobipocket's flavor of .prc, and HTML (among others). This is not surprising. The .epub format is little more than a common zip file containing a .opf file to provide navigation and one or more HTML (and other included) files that can be said to be an entire publication. Mobipocket .prc (the new version) is a single file that contains one or more dumbed-down HTML files using a variation of the LZ compression algorithm. My memory is a little fuzzy on this point, but I believe that during conversion from .epub it extracts some of the data from the .opf file and stores it in a proprietary header format. Apparently, the underlying rendering engine of the Kindle is HTML-based. For MobiPocket's .prc format it extracts the HTML and passes it to the rendering engine. In the case of .epub it extracts the HTML and passes it to the rendering engine. In the case of HTML, it just renders it directly. While some people chafe at the notion, it is apparent to me that HTML is the de facto universal e-book format. Because Gutenberg serves up many files in HTML format, I would say that Kindle support is already present. In fact, if you're happy with Gutenberg's ITF format (yes, I know that that's redundant) I believe the Kindle will render that natively as well. If you are so inclined, I look forward to a report on just how well the Kindle handles these various formats, and I suspect many others are as well. From marcello at perathoner.de Thu Mar 12 00:15:49 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Thu, 12 Mar 2009 08:15:49 +0100 Subject: [gutvol-d] Kindle/Mobipocket support In-Reply-To: <49B82B6C.8070103@novomail.net> References: <2102467441.71155.1236776254129.JavaMail.mail@webmail02> <49B82B6C.8070103@novomail.net> Message-ID: <49B8B6A5.7010307@perathoner.de> Lee Passey wrote: > According to a continuing thread on The Ebook Community list, the Kindle > directly supports (i.e. can read natively without conversion) .epub, > Mobipocket's flavor of .prc, and HTML (among others). Josh? Did you hear that? I'm looking forward to some user report on how to read PG .epub files on the Kindle. And a nice picture or two for the front page ... :-) -- Marcello Perathoner, Cologne, Germany webmaster at gutenberg.org From joshua at hutchinson.net Fri Mar 13 05:34:23 2009 From: joshua at hutchinson.net (Joshua Hutchinson) Date: Fri, 13 Mar 2009 12:34:23 +0000 (GMT) Subject: [gutvol-d] Kindle/Mobipocket support Message-ID: <1535618809.100827.1236947663987.JavaMail.mail@webmail02> An HTML attachment was scrubbed... URL: From joshua at hutchinson.net Fri Mar 13 06:44:08 2009 From: joshua at hutchinson.net (Joshua Hutchinson) Date: Fri, 13 Mar 2009 13:44:08 +0000 (GMT) Subject: [gutvol-d] Kindle/Mobipocket support Message-ID: <32340587.101763.1236951848993.JavaMail.mail@webmail02> An HTML attachment was scrubbed... URL: From joshua at hutchinson.net Fri Mar 13 08:32:40 2009 From: joshua at hutchinson.net (Joshua Hutchinson) Date: Fri, 13 Mar 2009 15:32:40 +0000 (GMT) Subject: [gutvol-d] Kindle/Mobipocket support Message-ID: <1147779187.103689.1236958360354.JavaMail.mail@webmail02> An HTML attachment was scrubbed... URL: From Bowerbird at aol.com Fri Mar 13 09:46:19 2009 From: Bowerbird at aol.com (Bowerbird at aol.com) Date: Fri, 13 Mar 2009 12:46:19 EDT Subject: [gutvol-d] kindle support Message-ID: my spam folder has lots of posts in it on "kindle support". i haven't bothered to read them, but... not a problem! just convert your .tei files to that format... what's that? you say you don't have many .tei files? really? what happened to your big plans, dating back to 2001? the whole library was gonna be .tei, according to them. surely they have come to fruition by now? no? hmmm. oh well, i guess you can just search the kindle catalog, where you will find that amazon has converted many of the plain-ascii e-texts for availability on the kindle. hell, some of 'em might even have _your_name_ in 'em! michael hart's vision _once_again_ proves to be 20/20... (michael, you must be getting bored with this scenario.) but there sure are a lot of cracked crystal balls around here. idiots... -bowerbird p.s. by the way, i think michael's birthday is about now? ************** A Good Credit Score is 700 or Above. See yours in just 2 easy steps! (http://pr.atwola.com/promoclk/100126575x1220439616x1201372437/aol?redir=http:%2F%2Fwww.freecreditreport.com%2Fpm%2Fdefault.aspx%3Fsc%3D668072%26hmpgID %3D62%26bcd%3DfebemailfooterNO62) -------------- next part -------------- An HTML attachment was scrubbed... URL: From marcello at perathoner.de Tue Mar 17 15:55:13 2009 From: marcello at perathoner.de (Marcello Perathoner) Date: Tue, 17 Mar 2009 23:55:13 +0100 Subject: [gutvol-d] NOTICE: gutenberg wiki user database cleanup Message-ID: <49C02A51.3070200@perathoner.de> In a few days I'm going to delete all inactive users of the wiki (who didn't make an edit in the last 3 months). If you want to keep your account, go and make an edit to your user page. -- Marcello Perathoner, Cologne, Germany webmaster at gutenberg.org From prosfilaes at gmail.com Tue Mar 17 16:25:10 2009 From: prosfilaes at gmail.com (David Starner) Date: Tue, 17 Mar 2009 16:25:10 -0700 Subject: [gutvol-d] NOTICE: gutenberg wiki user database cleanup In-Reply-To: <49C02A51.3070200@perathoner.de> References: <49C02A51.3070200@perathoner.de> Message-ID: <6d99d1fd0903171625h54a4a153y9d5e8f7f705512b1@mail.gmail.com> On Tue, Mar 17, 2009 at 3:55 PM, Marcello Perathoner wrote: > In a few days I'm going to delete all inactive users of the wiki (who didn't > make an edit in the last 3 months). > > If you want to keep your account, go and make an edit to your user page. Why? Is there any real need for us to delete inactive users, unlike every other wiki? Under the most common interpretation of the GFDL as it applies to wikis, to delete a user and let a new user take that name would violate the requirement to keep attribution and an accurate history. -- ?u ne ekzistas balzamo en Gilead? From hart at pglaf.org Tue Mar 17 17:47:46 2009 From: hart at pglaf.org (Michael S. Hart) Date: Tue, 17 Mar 2009 17:47:46 -0700 (PDT) Subject: [gutvol-d] !@! Re: NOTICE: gutenberg wiki user database cleanup In-Reply-To: <6d99d1fd0903171625h54a4a153y9d5e8f7f705512b1@mail.gmail.com> References: <49C02A51.3070200@perathoner.de> <6d99d1fd0903171625h54a4a153y9d5e8f7f705512b1@mail.gmail.com> Message-ID: We have plenty of volunteers who only show up much more rarely than every three months, perhaps up to three years, but even then, I say leave them be. . .better to have them welcome after 10 years than to make them feel unwelcome. Thanks!!! Michael S. Hart Founder Project Gutenberg On Tue, 17 Mar 2009, David Starner wrote: > On Tue, Mar 17, 2009 at 3:55 PM, Marcello Perathoner wrote: > In a few days I'm going to delete all inactive users of the wiki (who didn't > make an edit in the last 3 months). > > If you want to keep your account, go and make an edit to your user page. Why? Is there any real need for us to delete inactive users, unlike every other wiki? Under the most common interpretation of the GFDL as it applies to wikis, to delete a user and let a new user take that name would violate the requirement to keep attribution and an accurate history. -- ??u ne ekzistas balzamo en Gilead? _______________________________________________ gutvol-d mailing list gutvol-d at lists.pglaf.org http://lists.pglaf.org/listinfo.cgi/gutvol-d From gbnewby at pglaf.org Thu Mar 19 18:11:42 2009 From: gbnewby at pglaf.org (Greg Newby) Date: Thu, 19 Mar 2009 18:11:42 -0700 Subject: [gutvol-d] NOTICE: gutenberg wiki user database cleanup In-Reply-To: <6d99d1fd0903171625h54a4a153y9d5e8f7f705512b1@mail.gmail.com> References: <49C02A51.3070200@perathoner.de> <6d99d1fd0903171625h54a4a153y9d5e8f7f705512b1@mail.gmail.com> Message-ID: <20090320011142.GA1468@mail.pglaf.org> On Tue, Mar 17, 2009 at 04:25:10PM -0700, David Starner wrote: > On Tue, Mar 17, 2009 at 3:55 PM, Marcello Perathoner > wrote: > > In a few days I'm going to delete all inactive users of the wiki (who didn't > > make an edit in the last 3 months). > > > > If you want to keep your account, go and make an edit to your user page. > > > Why? Is there any real need for us to delete inactive users, unlike > every other wiki? Under the most common interpretation of the GFDL as > it applies to wikis, to delete a user and let a new user take that > name would violate the requirement to keep attribution and an accurate > history. It seems to me that we can leave the usernames in place, but disable or reset their passwords so that people will need to re-authenticate when they try to access the system again. I agree that (a) we shouldn't leave inactive people with active usernames, but (b) we should not anonymize or disable authors/contributors, lest they some day want to regain access. -- Greg From gbnewby at pglaf.org Thu Mar 19 18:18:21 2009 From: gbnewby at pglaf.org (Greg Newby) Date: Thu, 19 Mar 2009 18:18:21 -0700 Subject: [gutvol-d] Fwd: gutenberg:reformatting Message-ID: <20090320011821.GA1773@mail.pglaf.org> Anyone want to try this? It's a pretty simplistic approach. ---------- Forwarded message ---------- Date: Thu, 19 Mar 2009 12:23:14 +0100 (CET) From: jek at kortext.com To: hart at pobox.com Subject: gutenberg:reformatting Not all devices can handle any plain vanilla text file. My mp3 player can read ebooks, but words are broken in inappropriate places. Files have to be preformatted for a 14 characters wide display, lines have to be broken with dos style line endings (\r\n) and all file names must end with txt. I wrote the enclosed awk script to bypass these limitations. I converted about 3000 ebooks last night. It took about an hour and a half on my eeepc. The script can be freely distributed in case other people have similar problems. Regards, Jon-Egil Korsvold ----- End forwarded message ----- #!/usr/bin/mawk -f # Written by Jon-Egil Korsvold on friday the 13th of March 2009. Mare is short for Mawk Reformatter. The program can # reformat text files to increase readability on small devices with dumb ebook readers. My mp3 player has a 14 characters # wide display, and the ebook reader breaks the words in inappropriate places. This program doesn't split long words, # but the line is broken after each long word, so they won't mess up the display for more than a few lines. # # This program can be freely distributed. You may give away copies of it, but you may not sell it or remove my name from it. # Use at your own risk!! Run the program without arguments to get the manual _before_ you attempt anything else! You may # need too edit the path to mawk above and md some of the commands below. No warranty, have fun! This program has not been # extensively tested. It should be considered beta software. # # # Jon-Egil Korsvold Korsvold 15th of March 2009 # # BEGIN { tempfile="/tmp/mare.txt" fc1="find -L " fc2=" -noleaf|egrep txt$\|htm$\|html$ >> "tempfile rm="rm "tempfile md="mkdir -p " #for directories sep="/" x=0 #Holds the current line position in characters y=0 #Holds the length of the current word val=0 #Holds the return value, if greater than 0, the help text is printed os="err" #Dos or *nix #Exit if less than four arguments were used (width of display in characters, -d/-u, output dir and source dir) if (ARGC > 3) { # Get and set width in characters, exit with error message unless the value is a number count=ARGV[1] ARGV[1]="" if (count !~ /[0-9]+/) { val=1 exit } #The os value is initially "err". Set it to dos or nix if the appropriate switch was used. Define line endings #accordingly. Exit with error message if os=err (No switch was used) if (ARGV[2] ~ /^-d$/) { os="dos" nl="\r\n" } else { if (ARGV[2] ~ /^-u$/) { os="nix" nl="\n" } } if (os ~ /^err$/) { print ("You have to use -d or -u as the second argument!") val=1 exit } ARGV[2]="" #Get and set output directory. Add a trailing slash if necessary. odir=ARGV[3] ARGV[3]="" if (odir ~ /\./) { print ("The third argument has to be a directory. A file won't do!") val=1 exit } if (odir !~ sep"$") { odir=odir""sep } #Loop through the rest of the command line arguments. Call find and grep to get the files in directories, #but write files to tempfile directly. Skip unsopported file types with a warning. fctr=4 while (fctr < ARGC) { idir=ARGV[fctr] ARGV[fctr]="" if (idir ~ /\./) { if (idir ~ /\.txt|\.htm|\.phtml|\.shtml|\.htm/) { system ("echo " idir " >> " tempfile) } else { print ("The file type of " idir " isn't supported!") } } else { system(fc1 idir fc2) } fctr++ } FS=sep fctr=0 #Exit with error message if tempfile is empty or doesn't exist. if (getline < tempfile < 1) { print ("No files found!") val=1 exit } close (tempfile) #Traverse tempfile line by line and use slash as field separator. The whole line is stored in pa (path array) #which holds the input files. The last field holds the file name without the path, and it is stored i fa #(file array). The field before the last field holds directory information. It is stored in da (directory array). #Directories are created as needed below. while (getline < tempfile > 0) { x=NF fa[fctr]=$x #file array if (x > 1) { x-- da[fctr]=$x #directory array (odir/da[actr]/) if (da[fctr] !~ sep"$") { da[fctr]=da[fctr]""sep } } else { da[fctr]="" } system (md odir""da[fctr]) pa[fctr]=$0 #path array (for input files) fctr++ } #Reduce by one to get the last element of the arrays. Reset field separator to get words. Remove tempfile. fctr-- FS=" " system (rm) #Loop through the arrays from the last to the first element (0). Try to open the elements in pa as files #and print a warning on errors. while (fctr >= 0) { if (getline < pa[fctr] < 1) { print ("Error processing "pa[fctr]) } close(pa[fctr]) #Loop through the words in each line. while (getline < pa[fctr] > 0) { gsub ("\r", "") #Remove dos endings ctr=1 #Used to reference fields in the current record #Set output file, i.e. edit the path, add format information and change the #file type to txt. ofile=fa[fctr] gsub(/\..*/,"",ofile) ofile=odir""da[fctr]"fmt-"count"-"ofile".txt" #Keep track of the length of current word (y) and the position on the line (x), break lines #accordingly with the content of nl (dos or nix endings) #Skip lines starting and ending with css or html commands while (ctr <= NF && $0 !~ /^\<.*\>$/ && $0 !~ /^\{.*\}$/) { y=length($ctr) x=x+y if (x < count) #Increment x to account for trailing space { x++ } else { printf("%s",nl) > ofile x=y+1 } #Remove some embedded html and css commands and superfluous spaces gsub (/\<.*\>/, "") gsub (/\{.*\}/, "") gsub (/[ ][ ]+/, " ") printf("%s ",$ctr) > ofile ctr++ #Increment to reference next field (word) and loop } if (NF == 0 && $0 !~ /^\<.*\>$/ && $0 !~ /^\{.*\}$/) #Print a double newline to make a paragraph if the record was empty { printf("%s%s", nl, nl) > ofile x=0 } } printf("%s%s", nl, nl) > ofile print("Writing to "ofile) close(ofile) fctr-- #Next file in array } exit } else { #exit with error message if less than four arguments were used val=1 exit } } #Exit with the help text in case of errors END{ if (val == 1) { print ("\n\nMare (mawk reformatter) reformats ebooks for viewing on small displays.\n") print ("Width in characters, option, output directory, input directories or files") print ("Example: mare 20 -d ebooks /mnt/sda2/gutenberg /mnt/sda2/freeread") print ("Reformat all text and html files in the last two directories.") print ("Use 20 characters per line and dos style line endings.") print ("The resulting files are written to the last level of the original") print ("directory tree in the directory ebooks in the current directory.") print ("Run the program without arguments to get this help!\n") print ("Valid options:") print ("-d\tUse dos style line endings") print ("-u\tUse *nix style line endings\n\n") print ("Requirements:") print ("-\tmawk") print ("-\ta *nix version of find") print ("-\ta *nix version of mkdir") print ("-\techo") print ("-\tegrep") print ("-\trm\n") print ("The target os can be dos/win or *nix.") print ("The host os probably has to be *nix.\n") print ("Written in March 2009 by Jon-Egil Korsvold.") print ("Use at your own risk, no warranty!") print ("The program can be freely distributed with author information,") print ("but not sold. Happy reading!") } } From ricardofdiogo at gmail.com Sat Mar 21 22:27:32 2009 From: ricardofdiogo at gmail.com (Ricardo F Diogo) Date: Sun, 22 Mar 2009 05:27:32 +0000 Subject: [gutvol-d] Epub experiment Message-ID: <9c6138c50903212227r33918bfdq3cfea231a2b31956@mail.gmail.com> The original page numbers of p-books are very often kept in PG's HTML files as side notes as . Distributed Proofreaders' Post Processors seam to have adopted this practice as a rule of accuracy. When I open an EPUB file using DE the text seems to be piled up at left with the side numbers at right. Should DP produce different HTMLs for cleaner epubs? How different? Should the EPUB conversion tool be altered to overcome this formatting practice? The posting team also forces in the beginning of the etext (specially for credits). The EPUBs we are autogenerating seam to dislike it. (See #16429). Ricardo From Seekgod7 at aol.com Sun Mar 22 18:35:06 2009 From: Seekgod7 at aol.com (Seekgod7 at aol.com) Date: Sun, 22 Mar 2009 21:35:06 EDT Subject: [gutvol-d] Epub experiment Message-ID: thank you Scott Richards 951 807 5658 PST 1-7 PST _seekgod7 at aol.com_ (http://seekgod7 at aol.com/) Global Prayer Warriors Major Intercession from around the world if your in need of prayer please let us know We hate Spam as much as you do we only want to send these messages who are friends of Global Prayer Warriors or personal friends of Scott Richards, if you would like to be removed from these mailings kindly let me know and I will take the appropriate steps to make you happy. God Bless you. **************A Good Credit Score is 700 or Above. See yours in just 2 easy steps! (http://pr.atwola.com/promoclk/100126575x1220685763x1201394209/aol?redir=http:%2F%2Fwww.freecreditreport.com%2Fpm%2Fdefault.aspx%3Fsc%3D668072%26hmpgID %3D62%26bcd%3DMarchfooterNO62) -------------- next part -------------- An HTML attachment was scrubbed... URL: