Google turns classic books into free gibberish eBooks
Posted on 27 Aug 2009 at 17:01
Google has added free eBook downloads to its Books service.
Google Books was originally conceived as a way to make titles available to everyone online free of charge. Google borrows the original titles and scans them, so readers can see the books with their original designs and fonts.
Now Google is offering free eBook downloads in ePub format for titles that are out of copyright. It sounds like a good idea in theory, but to create an ePub eBook, Google has had to run optical character recognition (OCR) on the original scans to extract the text.
Sadly, nobody at Google seems to have bothered to check the final versions, leaving some texts impossible to understand. That makes a mockery of Google's statement that "Digitizing books allows us to provide more access to great literature for a wider set of the world's population".
Gulliver's Travels is one book that has been through the conversion process and come out completely unreadable, as this extract shows.
"The j.-.i;-i had lined it on all sides with the solu.-.?i cloth she could get, well quilted Uih!l; neath, furnished it with her baby's bi d. Provided me with linen and other necessaries. And made every thing as convenient as sl.c could."
This appalling conversion makes no sense and can't compete with the excellent Project Gutenberg, which offers out-of-copyright books in ePub format that have been scanned and properly corrected.
The Project Gutenberg version of Gulliver's Travels renders the same paragraph correctly.
"The girl had lined it on all sides with the softest cloth she could get, well quilted underneath, furnished it with her baby's bed, provided me with linen and other necessaries, and made everything as convenient as she could."
Some of Google's other ePub books are better, including Treasure Island, but we found that even this was littered with mistakes and odd characters. Until Google decides to check the text after it has been through the OCR process, Project Gutenberg remains the best destination for free eBooks.
Author: David Ludlow
Printed from www.expertreviews.co.uk