Multi-Sinusoidal Excitation Model in Audio Coding

碩士 === 國立交通大學 === 電信研究所 === 81 === Current research on high-quality audio coding focuses on either subband or transform coding whereby the masking effect is exploited in dynamic bit allocation to result in an inaudible noise spectrum. Howev...

Full description

Bibliographic Details
Main Authors: Li-Wei Wang, 王禮瑋
Other Authors: Wen-Whei Chang
Format: Others
Language:en_US
Published: 1993
Online Access:http://ndltd.ncl.edu.tw/handle/37237233673152760964
id ndltd-TW-081NCTU0436026
record_format oai_dc
spelling ndltd-TW-081NCTU04360262016-07-20T04:11:37Z http://ndltd.ncl.edu.tw/handle/37237233673152760964 Multi-Sinusoidal Excitation Model in Audio Coding 多重正弦激勵的寬頻音訊編碼 Li-Wei Wang 王禮瑋 碩士 國立交通大學 電信研究所 81 Current research on high-quality audio coding focuses on either subband or transform coding whereby the masking effect is exploited in dynamic bit allocation to result in an inaudible noise spectrum. However, the frequency-domain coding approaches consume heavy computations and the redundancy remains to be removed further. On the other hand, time-domain coders have the potential to achieve transparent reproduction by incorporating an appropriate excitation structure. In this thesis, we propose a multi-sinusoids excitation structure which consists of the sum of gain-scaled sinusoidal functions, rather than the white noise as in Code-Excited LPC (CELP). The performance results show significant improvement not only in the conventional distortion measure such as SNR and segSNR, but also in the newly proposed objective quality measure called generalized bark spectral distortion (GBSD). Further more, a multi- sinusoids excitation structure also simplifies the computation load in about 90 times. Wen-Whei Chang 張文輝 1993 學位論文 ; thesis 60 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電信研究所 === 81 === Current research on high-quality audio coding focuses on either subband or transform coding whereby the masking effect is exploited in dynamic bit allocation to result in an inaudible noise spectrum. However, the frequency-domain coding approaches consume heavy computations and the redundancy remains to be removed further. On the other hand, time-domain coders have the potential to achieve transparent reproduction by incorporating an appropriate excitation structure. In this thesis, we propose a multi-sinusoids excitation structure which consists of the sum of gain-scaled sinusoidal functions, rather than the white noise as in Code-Excited LPC (CELP). The performance results show significant improvement not only in the conventional distortion measure such as SNR and segSNR, but also in the newly proposed objective quality measure called generalized bark spectral distortion (GBSD). Further more, a multi- sinusoids excitation structure also simplifies the computation load in about 90 times.
author2 Wen-Whei Chang
author_facet Wen-Whei Chang
Li-Wei Wang
王禮瑋
author Li-Wei Wang
王禮瑋
spellingShingle Li-Wei Wang
王禮瑋
Multi-Sinusoidal Excitation Model in Audio Coding
author_sort Li-Wei Wang
title Multi-Sinusoidal Excitation Model in Audio Coding
title_short Multi-Sinusoidal Excitation Model in Audio Coding
title_full Multi-Sinusoidal Excitation Model in Audio Coding
title_fullStr Multi-Sinusoidal Excitation Model in Audio Coding
title_full_unstemmed Multi-Sinusoidal Excitation Model in Audio Coding
title_sort multi-sinusoidal excitation model in audio coding
publishDate 1993
url http://ndltd.ncl.edu.tw/handle/37237233673152760964
work_keys_str_mv AT liweiwang multisinusoidalexcitationmodelinaudiocoding
AT wánglǐwěi multisinusoidalexcitationmodelinaudiocoding
AT liweiwang duōzhòngzhèngxiánjīlìdekuānpínyīnxùnbiānmǎ
AT wánglǐwěi duōzhòngzhèngxiánjīlìdekuānpínyīnxùnbiānmǎ
_version_ 1718354691367108608