Well, this is rather cool, wouldn't you say?
A significant extension of our groundbreaking Look Inside the Book feature, Search Inside the Book allows you to search millions of pages to find exactly the book you want to buy. Now instead of just displaying books whose title, author, or publisher-provided keywords match your search terms, your search results will surface titles based on every word inside the book. Using Search Inside the Book is as simple as running an Amazon.com search.
Posted by jzawodn at October 23, 2003 07:50 PM
Hmmm, I wonder if Amazon is taking advantage of MySql's fulltext search?
:)
MySQL, get real...
And I pity the poor third-world workers who slaved away to scan in all those pages ;P.
It might be likely they actually got the pages from the publishers.
Actually, they probably had to type in all the text. In binary.
Kind of cool, but doesn't seem to work very well. Maybe not all books are indexed.
Amazon's page for publishers says they need to submit a "physical copy" of each book, so I think they're just OCR'ing the lot of them. I've seen some evidence of this, too - the excerpts in the results sometimes include the headers at the top of the page, something a text copy from the publisher isn't likely to have.
I've been keeping an eye on this for a while. They have scanned the lot of them. I pity the people who did that for a living.
Great tool to search after a book, but it's only in English;-(
I pity no one that has a job. I used to. But given the number of people that would gladly run a scanner for money to pay bills and buy food, pitying those that do seems foolish.
tingilinde found a good article: "update ... here is a Wired article on the effort. The article is particularly interesting as it goes into the efforts of Brewster Kahle."
That's like saying not to pity the kids chained to looms that make rugs because they have a job. Nah, I still pity them. I would rather they have a job that they liked, and used their talents, and were good at. It's just a matter of scale to pity the poor bored souls running the scanner.
I won't be long when all books published will be accompanied with a simple text file and all online purveyors will have access or the ability to do full text searches.
I hope for the day when all scientific/engineering journals are searchable and available.
Anything Udi Manber is part of is bound to be exceptionally clever.
I think the Amazon search inside the book feature is the first step towards the full digitization of all books -- a universal digital library!
See my own post on this: http://www.mediajunk.com/public/archives/000201.html
Oops, I should have said see my own post on this.
Can anyone post an example of this search.
It's either I'm blind or they removed it.