Google Search By Image

Pewpewarrows · on June 14, 2011

TinEye and Google Goggles applied to the entire Internet? Yes please. I can't wait to play around with this.

What will be even more interesting is if they release an API for it in the future. Sites like imgur and reddit could then suggest if you're uploading or submitting a similar image to one that already exists.

wccrawford · on June 14, 2011

I just tried TinEye (hadn't heard of it before) and it appears to only look for images that are modifications of the image being searched for.

I wonder if Google's will work the same, or if it will be possible to find other images that are similar, but not based on the same image? That's what I really want.

Jun8 · on June 14, 2011

The "similarity" of two images is a very complicated concept, when people are asked to tag images the overlap rarely goes above 20% on average (e.g. check out http://images.google.com/imagelabeler/ and try your hand at it). Think about it: you may think at the object level (both images have cars), concept level (both are happy images), color, etc. This is why large online stock image sellers still rely on tags extensively.

As a rough analogy, consider a textual example: Find a sentence similar to "It is a truth universally acknowledged, that a single man in possession of a good fortune, must be in want of a wife." Now, if you enter this sentence in Google, it retrieves documents that contain it, in TinyEye fashion. What other similarity is desired? Should it retrieve essays on Austen, on marriage, 17th century English literature...?

If you are interested in image similarity search, check out the Pascal challenge (http://pascallin.ecs.soton.ac.uk/challenges/VOC/voc2011/inde...). The advances in the last 5-6 years on object detection and visual feature extraction (which image similarity relies on) is amazing.

ch0wn · on June 14, 2011

Here's an interesting article about image hashing and fingerprinting algorithms: http://www.hackerfactor.com/blog/index.php?/archives/432-Loo...

safeaim · on June 14, 2011

Everything from that blog is interesting, do you or anyone else got any suggestions for similar blogs?

tilt · on June 14, 2011

Actually the presentation showed a query for an old picture returning images taken from the same place. It's being said that they're doing it by matching images' basic lines and shapes.

Andrenid · on June 15, 2011

An API would be amazing. Could make a service that tells you if any of your Flickr/dA/Picasa photos are being used anywhere else on the web, etc. I'd pay good money for that service (as a photographer who has had photos stolen/used-without-permission).

BoppreH · on June 14, 2011

I use TinEye a lot and I think this will be great. My typical uses of TinEye, that will probably be improved with the bigger stock:

- somebody put text over a nice image and I want the unmodified version

- searching for bigger, better quality versions of an image (e.g. wallpapers)

- finding other images from the same author/gallery (since it links to the sites that hosts the copies)

- finding the name of the movie, person or object pictured (because copies will be hosted with different, probably meaningful names and in pages with subtitles)

dmix · on June 14, 2011

From my understanding, TinEye makes money from corporate B2B deals. Their consumer product is mostly just a technology demo/marketing piece rather than a part of their core business.

So they probably don't have much to worry about.

cpeterso · on June 14, 2011

Retrievr is a prototype of a similar idea: search for Flickr photos by drawing (or uploaded image).

http://labs.systemone.at/retrievr/

blauwbilgorgel · on June 15, 2011

Though I do sometimes use these services, since a few years I block these spiders on my own sites and those of clients.

Often this would happen: Client or user finds an image through Google Image Search on another blog or website, with unclear copyright. Then the client or user would upload this to the server. Then the client, or me, would receive a letter from a (GettyImages) lawyer: If we would please pay for the full licensing right of that 160x160 pixel image.

Thinking about it, I find accommodating to these services can lead to nothing but trouble: Either legal trouble, or hit-and-run users stealing your images, because you paid for higher resolution.

I hope I can separately block this Google service from Google Image Search. Although Google Image Search isn't as good to webmasters as it used to be (especially for those that rely on advertisement clicks) and the users it can send can be negligible: Ranking higher in Google Image Search seems to correlate to ranking higher in Google Web Search.

If it is part and parcel of Google Image Search, I might reconsider my robots.txt directive for Googlebot-Image. They just now opened this up for the public, but it is likely they are already using this internally to gauge (media) quality factors on-page.

P.S.: It would be interesting to see what happens when Google doesn't partner up with GettyImages or iStockPhoto, like in the early days of TinEye you could abuse that service to find the same stock images without watermarks, on the sites of people that already paid for that image.

P.P.S.: Now you can add RDFa or Microdata to mark up your images with a copyright statement, what would happen to sites that host copyrighted images, tagged "not for reproduction"? Google should be able to find the canonical image and "punish" those that don't comply with its copyright.

tilt · on June 14, 2011

From QAs: it won't perform face recognition

GMali · on June 14, 2011

It actually should. Probably put FBI out of business too.

jnhnum1 · on June 14, 2011

Clearly it's not being omitted for technical reasons. http://www.huffingtonpost.com/2011/06/01/facial-recognition-...

apu · on June 15, 2011

Maybe what Schmidt said about Google's intentions was true, but another reason they haven't done it is because it's not possible right now, or at least not in a fully-automated way across most images on the web. The state-of-the-art on the easier "verification" problem ("are these two images of the same person?") are shown here: http://vis-www.cs.umass.edu/lfw/results.html

The best results are under 90% accuracy, which sounds pretty good, until you realize that random chance is 50%, and for recognition ("who is this person?"), you're essentially exponentiating that 90% by the number of different people you want to recognize.

iansinke · on June 15, 2011

This reminds me of apps like Shazam. While they definitely serve an amazing purpose (recognizing songs by finding a similar region of sound) what I would really like is an app that could recognize my humming a song--which probably sounds nothing like the actual song itself (different key, speed, entirely different voice, etc.)

skimbrel · on June 15, 2011

Check out SoundHound: http://www.soundhound.com/

It does exactly what you ask for.

derobert · on June 14, 2011

I wonder if this is based on the search-by-image that Google Goggles (on Android) already does.

krisw · on June 14, 2011

Either way, hopefully they've improved it - I was never able to get Goggles to work very well at all (versus TinEye).

tilt · on June 14, 2011

From QAs: submitted images will be treated like any other query and they'll stay private

itswindy · on June 15, 2011

Rightheaven clones jump from joy :)

damonpace · on June 14, 2011

Love this! But I really only see a mobile use for this, rather than a desktop use.

antihero · on June 14, 2011

I wonder how it'll compare with TinEye.

tropin · on June 14, 2011

Easy, indepently of how good is, just trying to fight against Google resources will crush TinEye. Today must be a terrible day for them.

guyzero · on June 14, 2011

Idee's business model has little to do with TinEye. They're probably not very worried.

ratzinho87 · on June 14, 2011

The article said the technology is very similar to Google Goggles. From what I could deduce, Goggles uses some kind of local invariant descriptors (like SIFT or SURF), and TinEye uses some kind of global descriptor (maybe something like this: http://www.hackerfactor.com/blog/index.php?/archives/432-Loo... ).

If that is right, Google will be able to retrieve different photos in which the same object appears, whereas TinEye only retrieves the same image, with or without some changes. So, they're quite different beasts.

Can't wait to see if I am right...

pixcavator · on June 14, 2011

Actually, TinEye is pretty good at partial matching.

krisw · on June 14, 2011

I compared/contrasted TinEye against Google CBIR for a bunch of images (I use TinEye a lot) and I have to say TinEye looks better than Google CBIR so far. TinEye has less "zero results" and more search results overall, I get the feeling TinEye deals with "Photoshopped images" better somehow.