Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface

A method of detecting speech events in a multiple-sound-source condition using audio and video information is proposed. For detecting speech events, sound localization using a microphone array and human tracking by stereo vision is combined by a Bayesian network. From the inference results of the Ba...

Full description

Bibliographic Details
Main Authors: Futoshi Asano, Kiyoshi Yamamoto, Isao Hara, Jun Ogata, Takashi Yoshimura, Yoichi Motomura, Naoyuki Ichimura, Hideki Asoh
Format: Article
Language:English
Published: SpringerOpen 2004-09-01
Series:EURASIP Journal on Advances in Signal Processing
Subjects:
Online Access:http://dx.doi.org/10.1155/S1110865704402303