Sonar-enabled glasses read wearers' silently spoken voice commands

The EchoSpeech glasses (modeled here by lead scientist Ruidong Zhang) could be utilized to control devices, or even to help the voiceless communicate

Some people lack the power of speech, while others may find themselves in noisy settings where speaking voice commands out loud just won't work. Both groups might have a use for the EchoSpeech glasses, which read their wearer's silently spoken words.

The experimental eyewear is being developed by a team at Cornell University's Smart Computer Interfaces for Future Interactions (SciFi) Lab.

Two downwards-facing miniature speakers are mounted on the underside of the frame beneath one lens, while two mini microphones are located beneath the other. The speakers emit inaudible sound waves, which are reflected off the wearer's moving mouth and back up to the mics.

Those echoes are analyzed in real time by a deep learning algorithm on a wirelessly linked smartphone. That algorithm was trained to associate specific echoes with specific mouth movements, which are in turn associated with specific silently spoken commands.
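
The article doesn't detail the signal processing, so what follows is only a rough sketch of the general idea behind sonar-style sensing, not the Cornell team's actual method. It emits a short near-ultrasonic sweep, simulates one attenuated echo, and finds the echo's delay by cross-correlating the received signal with the emitted one. The function names, the 18-21 kHz sweep, the 48 kHz sample rate and the single-echo model are all assumptions made for illustration.

```python
import math

def make_chirp(n_samples, f_start, f_end, sample_rate):
    """Linear frequency sweep (assumed waveform; the source names none)."""
    duration = n_samples / sample_rate
    sweep_rate = (f_end - f_start) / duration
    chirp = []
    for i in range(n_samples):
        t = i / sample_rate
        chirp.append(math.sin(2 * math.pi * (f_start * t + 0.5 * sweep_rate * t * t)))
    return chirp

def simulate_echo(emitted, delay, attenuation, total_len):
    """Hypothetical microphone signal: one delayed, weakened copy of the chirp."""
    received = [0.0] * total_len
    for i, sample in enumerate(emitted):
        received[delay + i] += attenuation * sample
    return received

def cross_correlate(received, emitted):
    """Slide the emitted chirp along the received signal, one value per lag."""
    n_lags = len(received) - len(emitted) + 1
    return [
        sum(received[lag + i] * emitted[i] for i in range(len(emitted)))
        for lag in range(n_lags)
    ]

def estimate_echo_delay(received, emitted):
    """Return the lag (in samples) where the correlation magnitude peaks."""
    profile = cross_correlate(received, emitted)
    return max(range(len(profile)), key=lambda lag: abs(profile[lag]))

if __name__ == "__main__":
    sample_rate = 48000.0  # assumed; a common audio sample rate
    chirp = make_chirp(64, 18000.0, 21000.0, sample_rate)
    mic = simulate_echo(chirp, delay=37, attenuation=0.3, total_len=200)
    lag = estimate_echo_delay(mic, chirp)
    # Round trip at roughly 343 m/s, halved for the one-way distance.
    distance_cm = lag / sample_rate * 343.0 / 2 * 100
    print(f"echo delay: {lag} samples, about {distance_cm:.1f} cm away")
```

In a real system, a sequence of such correlation profiles over time would presumably be what the learning model sees, with changes between profiles reflecting mouth movement. That step is left out here because the source gives no detail about it.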

Because the EchoSpeech glasses don't utilize video files (which can be quite large), all of the data processing can be done on a paired smartphone

EchoSpeech can currently recognize 31 such commands with about 95% accuracy, and requires only a few minutes of training for each user. Importantly for people with privacy concerns, the system doesn't incorporate any cameras, nor does it send any information to the internet.

What's more, because it doesn't use a power-hungry camera, it can run for up to 10 hours on a single battery charge. By contrast, the researchers claim that experimental camera-based systems are only good for about 30 minutes of use per charge.

The university is now working on commercializing the technology.

"For people who cannot vocalize sound, this silent speech technology could be an excellent input for a voice synthesizer," said doctoral student Ruidong Zhang, who is leading the study. "It could give patients their voices back."

The SciFi Lab previously developed a somewhat similar system called EarIO, which uses a sonar-equipped ear-worn device to capture the wearer's facial expressions – although it's used mainly to create digital avatars. The University at Buffalo's EarCommand system, by contrast, does read silently spoken words, via an earbud that detects the distinctive ear canal deformations produced by specific mouth movements.

EchoSpeech is demonstrated in the following video.

EchoSpeech: AI-equipped eyeglasses can read silent speech

Source: Cornell University

1 comment
Karmudjun
Lip reading glasses! I can see how they might be of use for the deaf, but the glasses work on the wearer, not on the person looking at them. Besides - the deaf might be able to take the synthesized speech to print in order to assist their lack of hearing, but technically this is just useful for those unable to vocalize. I'm sure there is a niche, but I sure was dismayed as I read through the article. There has to be a niche - and I hope it gets publicized for that niche population - but I don't think it affects too many. Stroke patients, for example, would need quite a bit of speech therapy to benefit from such a device. But hey, research is research and may it help someone as intended!