Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
In this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR‐E), and neural networks (NNs) have been simulated and implemented in real ti...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Electronics and Telecommunications Research Institute (ETRI)
2020-07-01
|
Series: | ETRI Journal |
Subjects: | |
Online Access: | https://doi.org/10.4218/etrij.2019-0364 |
id |
doaj-f6d8266c24fa4cb0993a612853131602 |
---|---|
record_format |
Article |
spelling |
doaj-f6d8266c24fa4cb0993a6128531316022021-02-25T03:49:35ZengElectronics and Telecommunications Research Institute (ETRI)ETRI Journal1225-64632020-07-01431829410.4218/etrij.2019-036410.4218/etrij.2019-0364Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesisSandeep KumarIn this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR‐E), and neural networks (NNs) have been simulated and implemented in real time using the TMS320C6713 DSP starter kit. These speech classifiers have been integrated into a linear‐predictive‐coding‐based speech analysis‐synthesis system and their performance has been compared in terms of the percentage of the voiced/unvoiced classification accuracy, speech quality, and computation time. The results of the percentage of the voiced/unvoiced classification accuracy and speech quality show that the NN‐based speech classifier performs better than the ACF‐, AMDF‐, cepstrum‐, WACF‐ and ZCR‐E‐based speech classifiers for both clean and noisy environments. The computation time results show that the AMDF‐based speech classifier is computationally simple, and thus its computation time is less than that of other speech classifiers, while that of the NN‐based speech classifier is greater compared with other classifiers.https://doi.org/10.4218/etrij.2019-0364acfamdfcepstrumneural networkreal‐time systemspeech classificationwacfzcr‐e |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sandeep Kumar |
spellingShingle |
Sandeep Kumar Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis ETRI Journal acf amdf cepstrum neural network real‐time system speech classification wacf zcr‐e |
author_facet |
Sandeep Kumar |
author_sort |
Sandeep Kumar |
title |
Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis |
title_short |
Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis |
title_full |
Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis |
title_fullStr |
Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis |
title_full_unstemmed |
Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis |
title_sort |
real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis |
publisher |
Electronics and Telecommunications Research Institute (ETRI) |
series |
ETRI Journal |
issn |
1225-6463 |
publishDate |
2020-07-01 |
description |
In this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR‐E), and neural networks (NNs) have been simulated and implemented in real time using the TMS320C6713 DSP starter kit. These speech classifiers have been integrated into a linear‐predictive‐coding‐based speech analysis‐synthesis system and their performance has been compared in terms of the percentage of the voiced/unvoiced classification accuracy, speech quality, and computation time. The results of the percentage of the voiced/unvoiced classification accuracy and speech quality show that the NN‐based speech classifier performs better than the ACF‐, AMDF‐, cepstrum‐, WACF‐ and ZCR‐E‐based speech classifiers for both clean and noisy environments. The computation time results show that the AMDF‐based speech classifier is computationally simple, and thus its computation time is less than that of other speech classifiers, while that of the NN‐based speech classifier is greater compared with other classifiers. |
topic |
acf amdf cepstrum neural network real‐time system speech classification wacf zcr‐e |
url |
https://doi.org/10.4218/etrij.2019-0364 |
work_keys_str_mv |
AT sandeepkumar realtimeimplementationandperformanceevaluationofspeechclassifiersinspeechanalysissynthesis |
_version_ |
1724252246184558592 |