Étiquette : speech recognition

17 avril 2018 / noflux

«People are remarkably good at focusing their attention on a particular person in a noisy environment, mentally “muting” all other voices and sounds. Known as the cocktail party effect, this capability comes natural to us humans. However, automatic speech separation — separating an audio signal into its individual speech sources — while a well-studied problem, remains a significant challenge for computers. In “Looking to Listen at the Cocktail Party”, we present a deep learning audio-visual model for isolating a single speech signal from a mixture of sounds such as other voices and background noise».

Source : Research Blog: Looking to Listen: Audio-Visual Speech Separation

7 novembre 2016 / noflux

#VoCo allows you to change words in a voiceover simply by typing new words. Presented live during the Adobe MAX 2016 Sneak Peeks, co-hosted by Jordan Peele.

via Adobe Creative Cloud

2 septembre 2016 / noflux

For English, speech recognition was three times faster than typing, and the error rate was 20.4 percent lower. In Mandarin Chinese, speech was 2.8 times faster, with an error rate 63.4 percent lower than typing.

Source : Speech recognition faster at texting | Stanford News

Étiquette : speech recognition

Archives

Catégories

Méta