Google turns classic books into free gibberish eBooks
Posted on 27 Aug 2009 at 17:01
Google has added free eBook downloads to its Books service.
Google Books was originally conceived as a way to make titles available to everyone online free of charge. It works by Google borrowing the original titles and then scanning them in, allowing readers to see the books with their original designs and fonts.
Now Google is offering free eBook downloads in ePub format for titles that are out of copyright. It sounds like a good idea in theory, but to create an ePub eBook, Google has had to use OCR on the original scans in order to extract the text.
Sadly, nobody at Google seems to have bothered to have checked the final version, making some texts impossible to understand. It seems to make a mockery of Google's statement that "Digitizing books allows us to provide more access to great literature for a wider set of the world's population".
Gulliver's Travels is one such book to go through the translation process, making it completely unreadable, as you can see from this extract.
"The j.-.i;-i had lined it on all sides with the solu.-.?i cloth she could get, well quilted Uih!l; neath, furnished it with her baby's bi d. Provided me with linen and other necessaries. And made every thing as convenient as sl.c could."
This appalling translation makes no sense and can't compete with the excellent Project Gutenberg, which offers out-of-copyright books that have been scanned and corrected properly in ePub format.
The Project Gutenberg version of Gulliver's Travels correctly turns the above paragraph into text.
"The girl had lined it on all sides with the softest cloth she could get, well quilted underneath, furnished it with her baby's bed, provided me with linen and other necessaries, and made everything as convenient as she could."
Some of Google's other ePub books are better, including Treasure Island, but we still found that it was littered with mistakes and odd characters. Until Google decides it's going to check the text after it's been through the OCR process, Project Gutenberg remains the best destination for free eBooks.
Author: David Ludlow
Find a review
advertisement
Serif Digital Scrapbook Artist 2
Category: SoftwareRating:
Price: £25
Corel CorelDraw Graphics Suite X5
Category: SoftwareRating:
Price: £357
VMWare WorkStation 7
Category: SoftwareRating:
Price: £144
HDRSoft Photomatix Light
Category: SoftwareRating:
Price: £28
Mass Effect 2
Category: SoftwareRating:
Price: £20
- Batman: Arkham Asylum wins top award at Bafta Video Game awards
- Google plans to launch a TV platform
- Facebook "stalking" apps are all rogue
- PowerDVD 10 provides '3D upscaling'
- Outrage as Bioshock 2 publisher 2K Games charges extra for content already on install discs
- Malware found on new Vodafone smartphone
- Microsoft shows how a game can be played on Xbox or PC and continued on a Windows Phone 7 Series handset
- China says it's willing to prosecute Google hackers
- Monkey Island 2 special edition rumoured to be in development
- Google's latest acquisition targets Microsoft's most profitable business
advertisement
Compare 30+ mobile broadband deals




Printed from www.expertreviews.co.uk
Social Bookmark this article: What is this?