facebook rss twitter

Google announces advanced image recognition software

by Mark Tyson on 19 November 2014, 13:05

Tags: Google (NASDAQ:GOOG)

Quick Link: HEXUS.net/qaclqj

Add to My Vault: x

Researchers at Google and Stanford University have announced an advanced image recognition software development that is capable of describing and captioning complex photos with far greater accuracy than ever before, reports the BBC.

Most currently available image recognition software is limited to recognising individual objects. However algorithms written by the Google/Stanford team are said to be able to describe photos with near-human levels of understanding, automatically producing captions that identifies entire scenes with a very high degree of accuracy. For example descriptions such as "a group of young people playing a game of frisbee" or "a herd of elephants marching on a grassy plain," were accurately generated by the software.

The research could "eventually help visually impaired people understand pictures, provide alternate text for images in parts of the world where mobile connections are slow, and make it easier for everyone to search on Google for images," explained Google in a blog post.

The system uses two neural networks: one which deals with image recognition, and another with natural language processing capabilities. It is capable of learning and can gradually be trained to identify how sentences relate to what the image shows, and as a result, making the captions produced around twice as precise as any previous software could.

As you might expect, the newly developed system is still not perfect, and as you can see from the examples below, it can still get things wrong. Nevertheless, the resulting software makes a huge leap in this AI field, and by training computers to mimic how the human brain works, the research could be beneficial to future breakthroughs in imagery and perhaps even speech identifying software.

"A picture may be worth a thousand words," Google wrote. "But sometimes it's the words that are the most useful - so it's important we figure out ways to translate from images to words automatically and accurately."

Interested? You can find more further details about this project in this paper.



HEXUS Forums :: 4 Comments

Login with Forum Account

Don't have an account? Register today!
This may be very useful for automating image analysis of child abuse etc for law enforcement uses, making irritating flesh-tone based firewall and image blockers redundant as well as taking humans who work to weed out unsuitable images manually from social media sites (recent article in Wired about that, well worth finding a way to avoid subjecting workers from the worst excesses of humanity, which are posted on the internet).
Looks like a good development - okay, so there's false readings, but surely that's to be expected in a system of this complexity? I'm sure that, with time, the “failure” rate will fall.
Yes they will use it to spy on everyone and put a face to there database, with all your details,
your D.O.B, Medical records, Social security number,
Bank details,
and every bit of information they can retrieve about you
link all this to there G.P.S. SPY in the sky,
so they can track every movement you make,
use it to frame you or anything they want, of course the Americans will have the
RF.ID CHIP before any one else
this is the way the world going, THE MARK OF THE BEAST,

so please don't think it would be used for the good of mankind it will not
it will be used by the corrupt government's American and the British

they are the most corrupt.
anyone whom says its not is lying to him/her self
Tom G
Don't forget the lizard people.