ULTRA MOBILE PC TIPS: Google Books

Thursday, August 31, 2006

Google Books

Now you can download old books from Googles in PDF format.

But here is the best part,check the following picture:

Google has scanned every single page, including the empty ones of old books, then they have inserted these images in PDF files. The result are huge files where is impossible to do a search for a word or sentence; you can't look in a PDF for a word that's part of an image. They should have converted each scanned image into text and then create with that text much smaller PDF files with full search capability.

What a waste of HDD space and resources!

4 comments:

Anonymous9:32 PM, August 31, 2006
If you want searchable texts of the same books, stick with the group that has been doing it for decades. Project Gutenberg.
ReplyDelete
Replies
Frank J Garcia9:42 PM, August 31, 2006
Exactly, and a lot more efficiently!
ReplyDelete
Replies
Anonymous9:21 AM, September 01, 2006
To be fair to Google on this, OCR isn't really accurate enough or fast enough to scan books on such a massive scale, especially when you consider that the older books they are using may have less clear printing, illustrations etc.
ReplyDelete
Replies
Frank J Garcia9:34 AM, September 01, 2006
That´s why I like Project Gutenberg, because they have taken the time to clean all texts after OCRs and bring us copies of this texts in different formats.

If you look in Project Gutenberg you will find all texts now available in Google in TXT format. So my question is, why this waste of time in a product that is not good. Because to have pictures of each book page is useless if you can´t search in this texts.
ReplyDelete
Replies

Add comment

Spam will be deleted, do not waste your time.