facebook rss twitter

Microsoft AI can draw an object from your description

by Mark Tyson on 19 January 2018, 12:31

Tags: Microsoft (NASDAQ:MSFT)

Quick Link: HEXUS.net/qadpzg

Add to My Vault: x

Microsoft has a new 'drawing bot' that can create pictures, pixel by pixel, of objects based upon user description. According to a blog post by Microsoft Research, the new bot has "produced a nearly three-fold boost in image quality compared to the previous state-of-the-art technique for text-to-image generation," using an industry standard test.

Above you can see a picture of an ordinary looking bird. The image was built up by Microsoft's latest AI from a brief textual description of a yellow bird with black wings and a short beak resting upon a branch. As pictured, there may or may not be such a bird in existence on earth. That might be a rather humdrum example as the researchers say the AI can paint, from scratch, "everything from ordinary pastoral scenes, such as grazing livestock, to the absurd, such as a floating double-decker bus". The AI does in effect use its own imagination when filling in the gaps in a description of a scene.

Behind the new AI drawing bot technology is a Generative Adversarial Network, or GAN. In this case the 'adversaries' are two machine learning models; one that generates the imagery from text descriptions, and another that uses text descriptions to judge the authenticity of images. "Working together, the discriminator pushes the generator toward perfection," says the blog.

Previous AIs would tend to choke on too much detail, says Microsoft. For example, if told to draw a bird with a green crown, yellow wings and red belly a smudgy image would usually be generated, even though the same previous AI's could do simple bird drawings very well. Microsoft's new attentional GAN, or AttnGAN, is designed to improve drawing accuracy when given more textual detail - it has more focus and 'common sense' which is accrued in its training stages.

Real world applications of the new 'drawing bot' could be as a sketch assistant for painters or interior designers, or it could be a tool for voice-activated photo refinement. With further development time and processing power it is possible that the AI could go on to create animations based upon story board descriptions, reducing a lot of the everyday animation labour in studios.



HEXUS Forums :: 5 Comments

Login with Forum Account

Don't have an account? Register today!
soon the AI will be able to create a 3 hour visual stunning movie from scratch “cortana I am bored create for me a movie with spaceships”
How anyone is going to resits getting it to paint rude pictures is beyond me. ;)
Corky34
How anyone is going to resits getting it to paint rude pictures is beyond me. ;)

“This bird is a male domesticated red junglefowl…”
Fake news will rise to new levels. “It can not fake if they have picture of it” Sometimes I already am surprised what people believe in
I think there are many positive applications for AI, if only to show humans they are beatable at board games they developed thousands of years ago. I think there are some areas that could be left to humans, like art.

If they can develop this, what would be it's real purpose. Often that only becomes clear when people get to play with it. How would it compare to a criminal sketch artist?