Google's Image Recognition Software Can Now Describe Entire Scenes
Image recognition was already good —but it's getting way, way better. A research collaboration between Google and Stanford University is producing software that increasingly describes the entire scene portrayed in a picture, not just individual objects. The New York Times reports that algorithms written by the team attempt to explain what's happening in images—in language that actually makes sense. So it spits out sentences like "a group of young people playing a game of frisbee" or "a person riding a motorcycle on a dirt road." It does that using two neural networks: one deals with image recognition, the other with natural language processing. The system uses computer learning, so it's fed a series of captioned images and it gradually learns how sentences relate to what the image shows. The resulting software is, according to the team, about twice as accurate as any software to have gone before it. It's not, however, perfect. Check, for instance, the image above:Read full article from Google's Image Recognition Software Can Now Describe Entire Scenes
No comments:
Post a Comment