Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface

A method of detecting speech events in a multiple-sound-source condition using audio and video information is proposed. For detecting speech events, sound localization using a microphone array and human tracking by stereo vision are combined by a Bayesian network. From the inference results of the Ba...

Bibliographic Details
Main Authors:	Futoshi Asano, Kiyoshi Yamamoto, Isao Hara, Jun Ogata, Takashi Yoshimura, Yoichi Motomura, Naoyuki Ichimura, Hideki Asoh
Format:	Article
Language:	English
Published:	SpringerOpen 2004-09-01
Series:	EURASIP Journal on Advances in Signal Processing
Subjects:	information fusion; sound localization; human tracking; adaptive beamformer; speech recognition
Online Access:	http://dx.doi.org/10.1155/S1110865704402303
