"The only way to be truly satisfied, is to do what you believe is great work. And the only way to do great work is to love what you do." Steve Jobs
Once "inking" gets into your veins you will never be able to live without it. Frank J. Garcia

Thursday, August 31, 2006

Google Books

Now you can download old books from Google in PDF format.

But here is the best part. Check the following picture:

[Picture: book2]


Google has scanned every single page of these old books, including the empty ones, and inserted the images into PDF files. The result is huge files in which it is impossible to search for a word or sentence; you can't search a PDF for a word that is part of an image. They should have converted each scanned image into text and then created much smaller PDF files from that text, with full search capability.
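
To get a feel for the size difference, here is a rough back-of-the-envelope sketch in Python. Every figure in it is an assumption of mine for illustration (a 6 x 9 inch page, a 300 dpi black-and-white scan, about 2,000 characters of text per page), not a measurement of Google's files.

```python
# Rough, assumed figures for one scanned book page (not measurements).
DPI = 300                 # assumed scan resolution
PAGE_INCHES = (6, 9)      # assumed page size, width x height
CHARS_PER_PAGE = 2000     # assumed number of characters on a page


def image_bytes(dpi, size_inches, bits_per_pixel=1):
    """Uncompressed size of a scanned page image, in bytes."""
    width_px = dpi * size_inches[0]
    height_px = dpi * size_inches[1]
    return width_px * height_px * bits_per_pixel // 8


def text_bytes(chars):
    """Size of the same page as plain text, assuming one byte per character."""
    return chars


img = image_bytes(DPI, PAGE_INCHES)
txt = text_bytes(CHARS_PER_PAGE)
print(f"image: {img} bytes, text: {txt} bytes, ratio: {img // txt}x")
```

Real PDFs compress their page images, so the actual gap is smaller than this uncompressed estimate, but a page stored as a picture typically still takes far more space than the same page stored as text.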

What a waste of HDD space and resources!

4 comments:

  1. If you want searchable texts of the same books, stick with the group that has been doing it for decades: Project Gutenberg.

  2. Exactly, and a lot more efficiently!

  3. To be fair to Google on this, OCR isn't really accurate enough or fast enough to scan books on such a massive scale, especially when you consider that the older books they are using may have less clear printing, illustrations, etc.

  4. That's why I like Project Gutenberg: they have taken the time to clean up all the texts after OCR and to bring us copies of these texts in different formats.

    If you look in Project Gutenberg, you will find all the texts now available in Google in TXT format. So my question is, why waste this time on a product that is not good? Having pictures of each book page is useless if you can't search these texts.

