Moutaz Fakhry, Yuri Granik, et al.
SPIE Photomask Technology + EUV Lithography 2011
Automatic speech recognition (ASR) research has traditionally focused on single-talker recognition. In many scenarios, however, the signal of interest is obscured by acoustic interference, including speech from other talkers. The human auditory system takes advantage of stereo inputs our ears to spatially filter the acoustic environment. Microphone array techniques can also take advantage of multiple inputs. However, even when restricted to a single channel, multiple talkers are still parsed remarkably well by humans but are indecipherable to conventional single-talker ASR systems. In fact, robustness to noise, reverberation, and interfering speakers is considered to be one of the six remaining grand challenges of ASR [47], [48]. © 2010 IEEE.
Moutaz Fakhry, Yuri Granik, et al.
SPIE Photomask Technology + EUV Lithography 2011
Arnon Amir, Michael Lindenbaum
IEEE Transactions on Pattern Analysis and Machine Intelligence
Kenneth L. Clarkson, K. Georg Hampel, et al.
VTC Spring 2007
W.C. Tang, H. Rosen, et al.
SPIE Optics, Electro-Optics, and Laser Applications in Science and Engineering 1991