facebook rss twitter

Listen through soundproof glass by watching a crisp-bag vibrate

by Mark Tyson on 5 August 2014, 11:26

Tags: MIT, Adobe (NASDAQ:ADBE), Microsoft (NASDAQ:MSFT)

Quick Link: HEXUS.net/qachfb

Add to My Vault: x

Please log in to view Printer Friendly Layout

You've been through your counter-espionage check list and scanned the room for hidden microphones and other kinds of surveillance devices. You've even gone as far as making sure you are in a room with soundproof glass and will hold your conversation so your lips aren't visible to anyone that could see you talking through the window. However you've missed one critical vulnerability, a bag of crisps (potato chips) on the desk beside you…

Salt & Shake & Vibrate & leak your private conversations

Sound waves produced when we talk move the air and these air movements are picked up by a listeners' ear-drums which receive the air vibrations. But other objects in a room also vibrate in sympathy with the sound waves. Something like a crisp packet makes an ideal observable surface to see these air vibrations it seems.

MIT News reports that "researchers at MIT, Microsoft, and Adobe have developed an algorithm that can reconstruct an audio signal by analyzing minute vibrations of objects depicted in video". Experiments using the algorithm show that the researchers were able to listen in to speech by filming the vibration of the surface of a bag of crisps in the room from a distance of 15ft, though a barrier of soundproof glass. The camera bnecomes a 'visual microphone'.

Abe Davis, a graduate student in electrical engineering and computer science at MIT and first author on the new paper explained "The motion of this vibration creates a very subtle visual signal that’s usually invisible to the naked eye. People didn’t realize that this information was there."

To be able to reconstruct audio from the video of the vibrating object the video FPS rate ideally needs to be higher than the frequency of the audio signal - otherwise there is a cut off which can make it hard to decode what is being said. The researchers found that a frame rate of between 2,000 and 6,000 FPS worked well. However even an "ordinary digital camera" with a 60fps video mode could also be useful for determining key info about the people speaking in a room.

If you're thinking James Bond would never be seen in the same room as a packet of crisps - so he would be safe - the scientists have also successfully snooped on audio via the leaves of a pot plant and the surface of a glass of water - so a martini would also be able to give away the secrets discussed in this closed soundproofed room too.

The researchers will present their findings and further details of their experiments at SIGGRAPH computer graphics conference.

HEXUS Forums :: 17 Comments

Login with Forum Account

Don't have an account? Register today!
So…what's the point of this?
What's the point of this?! You realise people said the exact same thing about pretty much every major invention at its first publication? Pppf this electricity malarky, what's the point of that?

The ramifications of this technology could be very far reaching. From the obvious security applications (and consequent privacy implications), to probably hundreds of other far more obscure applications which may not now be obvious, but with further development will be logical next steps. This type of analysis could have applications in photography, film, materials science, for example.

I think it's pretty cool.
So…what's the point of this?

Vibration analysis of machinery perhaps? It might be possible to use the same techniques to work out which panels of a car resonate and help soundproof it, or even to reduce metal fatigue.
So…what's the point of this?

The best applications are the ones we don't even know exist yet.
So we can expect this to be added to the Kinect v3 :)