Life is a game, take it seriously

How things work: CamFind

In Computer Vision on May 11, 2013 at 3:45 pm

by Gooly (Li Yang Ku)

camfind algorithm

If you remember, I talked about how the image search app Amazon Flow might work last year, and promised I would talk about Google Goggle later, which I didn’t (and I blame my unpredictable life and laziness).  To compensate that I am gonna talk about a similar app, but before downloading the app please read the whole post for your own sake.

The app I am gonna discuss today is CamFind. Someone recommended the app on Facebook so I decided to give it a try. And then I went through the whole sequence of emotion from 1. Curious, 2. Amazed of the accuracy, 3. Skeptical, 4. Super Amazed, 5. Embarrassed about how stupid I am compared to the vision team behind the app, 6. Did some more test, skeptical++, 7. Search and relieved.

So now the story. I downloaded the app and took a picture of my Adventure Time Spanish poster (hell yeah I Love Adventure Time), it took a few seconds but returned an accurate result. It is good but Amazon Flow can also do that. Then I started to test the limit, magazine check, rice vinegar bottle check, mug check, weird looking vacuum check. By this time I am pretty shocked.

adventure time

At first I suspect some one must be looking at the picture and replying the result. To tell the truth this is a common practice among start-ups. You don’t have the technology or business yet but want to test if a concept works, so you fake the technology and business, if it works you get money from VCs, if it doesn’t they also have a strategy for that “fail fast”. One of the famous examples is the first car sold on the internet. The guy that started the website literally went to buy a car and delivered it himself on the first order. Ton’s of business started this way.

However the response speed is far faster than I thought it would be if someone is behind and I was pretty sure nobody would use this strategy on this kind of business. So I did a few more tests and it got every thing reasonable well, except for recognizing my plastic chair as a plastic table and my chipmunk doll as a bear doll. It even got my Dilbert sticker right, yeah it says it’s a “Dilbert sticker”. I was so shocked that I felt embarrassed that I don’t know such technology exists. After reading so many papers in the past few years, I can’t believe I missed out some of the most amazing ones.

So I desperately searched on the web, at the same time dreaming of buying the company just to know how they did it. And I finally found the missing piece. Seth Geib posted the following comment in one of the reviews.

I have entered a few images that there is pretty much no way any algorithm could detect and it returned a spot on result. Also I get different, very specific results if I submit an image multiple times. I am positive they have a team of people screening these images, which is a definite privacy concern.

UPDATE: Upon asking this question to the CamFind team on their FB page they responded with the following:

“Hi, to answer your question, CamFind uses a combination of computer vision and human crowdsourcing to identify the object photographed.”

So I guess just be aware that any images you take will be screened by individuals. They should really note this in their app first and foremost upon opening it.

What a relief.

regret

I am not sure what their plan is, but I am not sure if this is a concept needed to be tested on with this strategy. It’s like asking if you want an artificial secretary that would understand all your needs and give you information in a few seconds with no cost. Faking a robot with human to test if human needs a robot is kind of a weird concept. Even if you proved that this is a good business model, how do you build the vision technology? You can’t just throw money at people and get vision algorithms that work like human. Some business guys just don’t get it.

It’s possible that with more people using this app, they would obtain a large labeled image database, and can train their algorithms based on that and improve the automatic part to be the best image search on the market. But I can tell you, nope, there is no way their algorithms gonna be half as good as having a human behind. We humans didn’t learn to recognize by scanning through billions of photos, and the fastest path to build a human like vision system would be to have them learn as how we learned.

And by the way, I am glad I didn’t take any pictures that I shouldn’t be taking with the app. I hope you didn’t either.

  1. I think your problem is that you talk about something you have no idea about. You have not even tested an app, and already wrote a blog post. May be next time you can make a post about automated control stagnant systems in mechanics since you write stuff you have no clue about. I’ve been using this app for almost 1 weeks, and so far I enjoy it. It gives me price comparison and reviews. I know the app has so much more, I just have not tested it yet. It’s cool to search for a product without a need to type it or describe it. Imagine what it does for blind people?! So instead of discouraging people to use it, make yourself useful, write a blog post so people can share this info with blind community.

  2. No worries, not that many people read this blog.

  3. downloading the app now!

    • This is the best search engine app in the world. Next time do your homework. They utilize a combination of computer vision and human crowdsourcing. I use this app multiple times a day. It’s free and the most useful app on my Iphone. I heard they’re working on an Android version next.

  4. More people will certainly read a blog based on subject. I was wondering as well how it worked and it does creep me out a little. But it obviously works well. Whether or not the screening is discouraging is up to the user but it’s the same concept as any search engine really. I find this a helpful discretion. I feel a bit embarrassed taking pictures of my face haha. It’s ok. Now I feel bad for bothering them with my crude hand-drawn picture of a phallus and the f-word and trying to get them to say it. They said “White napkin with writing on it”. But once they did say “Fuck”. I may write an apology later. ;p

  5. I also happens to know something about computer vision, and is amazed of the ability of these apps, just like you did: how come such algorithms exist without me having any clue about how it works?
    I even found a list of image recognition APIs. http://blog.mashape.com/post/63663156973/list-of-14-image-recognition-apis
    I screened through them, mainly out of curiosity, and they nearly all have one common thing: they say no word about the technology, and none of the founders appear to have a research background in computer vision.
    but well, we really can’t tell whats behind their servers. may be they used pigeons http://www.google.com.hk/technology/pigeonrank.html
    so maybe we can perform a more scientific analysis to these technologies if we have the leisure.

  6. I’ve only heard of things such as this lately and admit this is a bit surprising. I’ve been part of a human crowdsourcing team before(captcha recognition) and I admit this is a good use of people looking for work.

  7. Thanks for writing this – after using the app a bit I felt very skeptical too and knew there had to be a human behind the responses, thanks for providing the explanation.

  8. Just downloaded the app and withing 30 min I got a notice that 4 people were following me. So does that mean they are watching my searches? So much for privacy. I wrote to the app group and if I don’t get a good response, will delete it, considering it to be possible malware.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: