Single channel blind source separation

Single channel blind source separation (SCBSS) is an intensively researched field with numerous important applications. This research sets out to investigate the separation of monaural mixed audio recordings without relying on training knowledge. This research proposes a novel method based on variab...

Full description

Bibliographic Details
Main Author:	Gao, Bin
Published:	University of Newcastle Upon Tyne 2011
Subjects:	621.3828
Online Access:	http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.566865

id	ndltd-bl.uk-oai-ethos.bl.uk-566865
record_format	oai_dc
spelling	ndltd-bl.uk-oai-ethos.bl.uk-5668652015-03-20T03:35:34ZSingle channel blind source separationGao, Bin2011Single channel blind source separation (SCBSS) is an intensively researched field with numerous important applications. This research sets out to investigate the separation of monaural mixed audio recordings without relying on training knowledge. This research proposes a novel method based on variable regularised sparse nonnegative matrix factorization which decomposes an information-bearing matrix into two-dimensional convolution of factor matrices that represent the spectral basis and temporal code of the sources. In this work, a variational Bayesian approach has been developed for computing the sparsity parameters of the matrix factorization. To further improve the previous work, this research proposes a new method based on decomposing the mixture into a series of oscillatory components termed as the intrinsic mode functions (IMF). It is shown that IMFs have several desirable properties unique to SCBSS problem and how these properties can be advantaged to relax the constraints posed by the problem. In addition, this research develops a novel method for feature extraction using psycho-acoustic model. The monaural mixed signal is transformed to a cochleagram using the gammatone filterbank, whose bandwidths increase incrementally as the center frequency increases; thus resulting to non-uniform time-frequency (TF) resolution in the analysis of audio signal. Within this domain, a family of Itakura-Saito (IS) divergence based novel two-dimensional matrix factorization has been developed. The proposed matrix factorizations have the property of scale invariant which enables lower energy components in the cochleagram to be treated with equal importance as the high energy ones. Results show that all the developed algorithms presented in this thesis have outperformed conventional methods.621.3828University of Newcastle Upon Tynehttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.566865http://hdl.handle.net/10443/1300Electronic Thesis or Dissertation
collection	NDLTD
sources	NDLTD
topic	621.3828
spellingShingle	621.3828 Gao, Bin Single channel blind source separation
description	Single channel blind source separation (SCBSS) is an intensively researched field with numerous important applications. This research sets out to investigate the separation of monaural mixed audio recordings without relying on training knowledge. This research proposes a novel method based on variable regularised sparse nonnegative matrix factorization which decomposes an information-bearing matrix into two-dimensional convolution of factor matrices that represent the spectral basis and temporal code of the sources. In this work, a variational Bayesian approach has been developed for computing the sparsity parameters of the matrix factorization. To further improve the previous work, this research proposes a new method based on decomposing the mixture into a series of oscillatory components termed as the intrinsic mode functions (IMF). It is shown that IMFs have several desirable properties unique to SCBSS problem and how these properties can be advantaged to relax the constraints posed by the problem. In addition, this research develops a novel method for feature extraction using psycho-acoustic model. The monaural mixed signal is transformed to a cochleagram using the gammatone filterbank, whose bandwidths increase incrementally as the center frequency increases; thus resulting to non-uniform time-frequency (TF) resolution in the analysis of audio signal. Within this domain, a family of Itakura-Saito (IS) divergence based novel two-dimensional matrix factorization has been developed. The proposed matrix factorizations have the property of scale invariant which enables lower energy components in the cochleagram to be treated with equal importance as the high energy ones. Results show that all the developed algorithms presented in this thesis have outperformed conventional methods.
author	Gao, Bin
author_facet	Gao, Bin
author_sort	Gao, Bin
title	Single channel blind source separation
title_short	Single channel blind source separation
title_full	Single channel blind source separation
title_fullStr	Single channel blind source separation
title_full_unstemmed	Single channel blind source separation
title_sort	single channel blind source separation
publisher	University of Newcastle Upon Tyne
publishDate	2011
url	http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.566865
work_keys_str_mv	AT gaobin singlechannelblindsourceseparation
_version_	1716782036706918400

Single channel blind source separation

Similar Items