Transform based pitch-scale modification algorithms for speech

碩士 === 南台科技大學 === 資訊工程系 === 99 === Pitch scale modification (PSM) has been widely used in language learning, kara ok, Text-to-Speech (TTS), entertainment and digital learning. This thesis proposed three PSM algorithms: PSM based on DCT (DCT-PSM), SOLA based linear prediction PPSM (SOLA-LPDCT) and Mo...

Full description

Bibliographic Details
Main Authors: Yu-Hao Chang, 張育豪
Other Authors: Fu-Kun Chen
Format: Others
Language:zh-TW
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/61218956144197743637
id ndltd-TW-099STUT8392002
record_format oai_dc
spelling ndltd-TW-099STUT83920022016-11-22T04:13:40Z http://ndltd.ncl.edu.tw/handle/61218956144197743637 Transform based pitch-scale modification algorithms for speech 基於轉換域之語音音高調整演算法 Yu-Hao Chang 張育豪 碩士 南台科技大學 資訊工程系 99 Pitch scale modification (PSM) has been widely used in language learning, kara ok, Text-to-Speech (TTS), entertainment and digital learning. This thesis proposed three PSM algorithms: PSM based on DCT (DCT-PSM), SOLA based linear prediction PPSM (SOLA-LPDCT) and Modified SOLA-LPDCT. The proposed DCT-PSM algorithm uses discrete cosine transform (DCT) instead of discrete Fourier transform (DFT) to rearrange the spectrum for pitch shifting. The SOLA-LPDCT algorithm uses the time-scale modification algorithm to capture the suitable frame for PSM processing. And then, the linear prediction coding (LPC) is performed for PSM processing in the LPC residual signal domain. Finally, the modified SOLA-LPDCT approach tries to improve the performance by using the first order interpolation in the PSM processing of SOLA-LPDCT method. Simulation results show that the proposed algorithms not only reduce the computational complexity but also improve the performance while comparing the mentioned methods in literature. Fu-Kun Chen 陳福坤 2011 學位論文 ; thesis 100 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 南台科技大學 === 資訊工程系 === 99 === Pitch scale modification (PSM) has been widely used in language learning, kara ok, Text-to-Speech (TTS), entertainment and digital learning. This thesis proposed three PSM algorithms: PSM based on DCT (DCT-PSM), SOLA based linear prediction PPSM (SOLA-LPDCT) and Modified SOLA-LPDCT. The proposed DCT-PSM algorithm uses discrete cosine transform (DCT) instead of discrete Fourier transform (DFT) to rearrange the spectrum for pitch shifting. The SOLA-LPDCT algorithm uses the time-scale modification algorithm to capture the suitable frame for PSM processing. And then, the linear prediction coding (LPC) is performed for PSM processing in the LPC residual signal domain. Finally, the modified SOLA-LPDCT approach tries to improve the performance by using the first order interpolation in the PSM processing of SOLA-LPDCT method. Simulation results show that the proposed algorithms not only reduce the computational complexity but also improve the performance while comparing the mentioned methods in literature.
author2 Fu-Kun Chen
author_facet Fu-Kun Chen
Yu-Hao Chang
張育豪
author Yu-Hao Chang
張育豪
spellingShingle Yu-Hao Chang
張育豪
Transform based pitch-scale modification algorithms for speech
author_sort Yu-Hao Chang
title Transform based pitch-scale modification algorithms for speech
title_short Transform based pitch-scale modification algorithms for speech
title_full Transform based pitch-scale modification algorithms for speech
title_fullStr Transform based pitch-scale modification algorithms for speech
title_full_unstemmed Transform based pitch-scale modification algorithms for speech
title_sort transform based pitch-scale modification algorithms for speech
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/61218956144197743637
work_keys_str_mv AT yuhaochang transformbasedpitchscalemodificationalgorithmsforspeech
AT zhāngyùháo transformbasedpitchscalemodificationalgorithmsforspeech
AT yuhaochang jīyúzhuǎnhuànyùzhīyǔyīnyīngāodiàozhěngyǎnsuànfǎ
AT zhāngyùháo jīyúzhuǎnhuànyùzhīyǔyīnyīngāodiàozhěngyǎnsuànfǎ
_version_ 1718397561046302720