Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking

碩士 === 國立暨南國際大學 === 電機工程學系 === 107 === In this study, we focus on the issue of noise distortion in speech signals, and develop two novel unsupervised speech enhancement algorithms including temporal lowpass filtering (TLF) and relative-to-maximum masking (RMM). Both of these two algorithms are condu...

Full description

Bibliographic Details
Main Authors:	LIU,KUAN-YI, 劉冠毅
Other Authors:	HUNG,JEIH-WEHI
Format:	Others
Language:	en_US
Published:	2019
Online Access:	http://ndltd.ncl.edu.tw/handle/f6vwk9

id	ndltd-TW-107NCNU0442024
record_format	oai_dc
spelling	ndltd-TW-107NCNU04420242019-09-20T03:25:53Z http://ndltd.ncl.edu.tw/handle/f6vwk9 Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking 整合全卷積神經網路、時序低通濾波與時頻遮罩法之語音強化 LIU,KUAN-YI 劉冠毅碩士國立暨南國際大學電機工程學系 107 In this study, we focus on the issue of noise distortion in speech signals, and develop two novel unsupervised speech enhancement algorithms including temporal lowpass filtering (TLF) and relative-to-maximum masking (RMM). Both of these two algorithms are conducted on the magnitude spectrogram of speech signals. TLF uses a simple moving-average filter to emphasize the low modulation frequencies of speech signals, which are believed to contain richer linguistic information and exhibit higher signal-to-noise ratios (SNR). Comparatively, in RMM we apply a mask that is directly multiplied with the speech spectrogram in a point-wise manner, and the used masking value is directly proportional to the magnitude of each temporal-frequency (T-F) point in the spectrogram. The preliminary experiments conducted on a subset of TIMIT database show that the two novel methods can promote the quality of noise-corrupted speech signals significantly, and both of them can be integrated with a well-known supervised speech enhancement scenario, namely fully convolutional network, to achieve even better perceptual speech quality values. HUNG,JEIH-WEHI 洪志偉 2019 學位論文 ; thesis 50 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立暨南國際大學 === 電機工程學系 === 107 === In this study, we focus on the issue of noise distortion in speech signals, and develop two novel unsupervised speech enhancement algorithms including temporal lowpass filtering (TLF) and relative-to-maximum masking (RMM). Both of these two algorithms are conducted on the magnitude spectrogram of speech signals. TLF uses a simple moving-average filter to emphasize the low modulation frequencies of speech signals, which are believed to contain richer linguistic information and exhibit higher signal-to-noise ratios (SNR). Comparatively, in RMM we apply a mask that is directly multiplied with the speech spectrogram in a point-wise manner, and the used masking value is directly proportional to the magnitude of each temporal-frequency (T-F) point in the spectrogram. The preliminary experiments conducted on a subset of TIMIT database show that the two novel methods can promote the quality of noise-corrupted speech signals significantly, and both of them can be integrated with a well-known supervised speech enhancement scenario, namely fully convolutional network, to achieve even better perceptual speech quality values.
author2	HUNG,JEIH-WEHI
author_facet	HUNG,JEIH-WEHI LIU,KUAN-YI 劉冠毅
author	LIU,KUAN-YI 劉冠毅
spellingShingle	LIU,KUAN-YI 劉冠毅 Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking
author_sort	LIU,KUAN-YI
title	Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking
title_short	Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking
title_full	Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking
title_fullStr	Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking
title_full_unstemmed	Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking
title_sort	speech enhancement based on the integration of fully convolutional network, temporal lowpass filtering and spectrogram masking
publishDate	2019
url	http://ndltd.ncl.edu.tw/handle/f6vwk9
work_keys_str_mv	AT liukuanyi speechenhancementbasedontheintegrationoffullyconvolutionalnetworktemporallowpassfilteringandspectrogrammasking AT liúguānyì speechenhancementbasedontheintegrationoffullyconvolutionalnetworktemporallowpassfilteringandspectrogrammasking AT liukuanyi zhěnghéquánjuǎnjīshénjīngwǎnglùshíxùdītōnglǜbōyǔshípínzhēzhàofǎzhīyǔyīnqiánghuà AT liúguānyì zhěnghéquánjuǎnjīshénjīngwǎnglùshíxùdītōnglǜbōyǔshípínzhēzhàofǎzhīyǔyīnqiánghuà
_version_	1719252755653066752

Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking

Similar Items