Transform based pitch-scale modification algorithms for speech
碩士 === 南台科技大學 === 資訊工程系 === 99 === Pitch scale modification (PSM) has been widely used in language learning, kara ok, Text-to-Speech (TTS), entertainment and digital learning. This thesis proposed three PSM algorithms: PSM based on DCT (DCT-PSM), SOLA based linear prediction PPSM (SOLA-LPDCT) and Mo...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/61218956144197743637 |
id |
ndltd-TW-099STUT8392002 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099STUT83920022016-11-22T04:13:40Z http://ndltd.ncl.edu.tw/handle/61218956144197743637 Transform based pitch-scale modification algorithms for speech 基於轉換域之語音音高調整演算法 Yu-Hao Chang 張育豪 碩士 南台科技大學 資訊工程系 99 Pitch scale modification (PSM) has been widely used in language learning, kara ok, Text-to-Speech (TTS), entertainment and digital learning. This thesis proposed three PSM algorithms: PSM based on DCT (DCT-PSM), SOLA based linear prediction PPSM (SOLA-LPDCT) and Modified SOLA-LPDCT. The proposed DCT-PSM algorithm uses discrete cosine transform (DCT) instead of discrete Fourier transform (DFT) to rearrange the spectrum for pitch shifting. The SOLA-LPDCT algorithm uses the time-scale modification algorithm to capture the suitable frame for PSM processing. And then, the linear prediction coding (LPC) is performed for PSM processing in the LPC residual signal domain. Finally, the modified SOLA-LPDCT approach tries to improve the performance by using the first order interpolation in the PSM processing of SOLA-LPDCT method. Simulation results show that the proposed algorithms not only reduce the computational complexity but also improve the performance while comparing the mentioned methods in literature. Fu-Kun Chen 陳福坤 2011 學位論文 ; thesis 100 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 南台科技大學 === 資訊工程系 === 99 === Pitch scale modification (PSM) has been widely used in language learning, kara ok, Text-to-Speech (TTS), entertainment and digital learning. This thesis proposed three PSM algorithms: PSM based on DCT (DCT-PSM), SOLA based linear prediction PPSM (SOLA-LPDCT) and Modified SOLA-LPDCT. The proposed DCT-PSM algorithm uses discrete cosine transform (DCT) instead of discrete Fourier transform (DFT) to rearrange the spectrum for pitch shifting. The SOLA-LPDCT algorithm uses the time-scale modification algorithm to capture the suitable frame for PSM processing. And then, the linear prediction coding (LPC) is performed for PSM processing in the LPC residual signal domain. Finally, the modified SOLA-LPDCT approach tries to improve the performance by using the first order interpolation in the PSM processing of SOLA-LPDCT method. Simulation results show that the proposed algorithms not only reduce the computational complexity but also improve the performance while comparing the mentioned methods in literature.
|
author2 |
Fu-Kun Chen |
author_facet |
Fu-Kun Chen Yu-Hao Chang 張育豪 |
author |
Yu-Hao Chang 張育豪 |
spellingShingle |
Yu-Hao Chang 張育豪 Transform based pitch-scale modification algorithms for speech |
author_sort |
Yu-Hao Chang |
title |
Transform based pitch-scale modification algorithms for speech |
title_short |
Transform based pitch-scale modification algorithms for speech |
title_full |
Transform based pitch-scale modification algorithms for speech |
title_fullStr |
Transform based pitch-scale modification algorithms for speech |
title_full_unstemmed |
Transform based pitch-scale modification algorithms for speech |
title_sort |
transform based pitch-scale modification algorithms for speech |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/61218956144197743637 |
work_keys_str_mv |
AT yuhaochang transformbasedpitchscalemodificationalgorithmsforspeech AT zhāngyùháo transformbasedpitchscalemodificationalgorithmsforspeech AT yuhaochang jīyúzhuǎnhuànyùzhīyǔyīnyīngāodiàozhěngyǎnsuànfǎ AT zhāngyùháo jīyúzhuǎnhuànyùzhīyǔyīnyīngāodiàozhěngyǎnsuànfǎ |
_version_ |
1718397561046302720 |