Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM
碩士 === 國立成功大學 === 電機工程學系碩博士班 === 96 === A novel architecture for ubiquitous and robust of text-independent speaker recognition based on SVM approach is proposed. In this architecture, multiple far-field microphones of configuration is adopted to receive the pervasive speech signals, and the distance...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2008
|
Online Access: | http://ndltd.ncl.edu.tw/handle/86629603830179768114 |
id |
ndltd-TW-096NCKU5442155 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-096NCKU54421552017-07-20T04:35:15Z http://ndltd.ncl.edu.tw/handle/86629603830179768114 Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM 泛在環境與具強健性之非文字相關語者辨識及SVM/SMO演算法之FPGA設計與實現 Gaung-Hui Gu 古光輝 碩士 國立成功大學 電機工程學系碩博士班 96 A novel architecture for ubiquitous and robust of text-independent speaker recognition based on SVM approach is proposed. In this architecture, multiple far-field microphones of configuration is adopted to receive the pervasive speech signals, and the distance effect between speaker and microphone is supposed to be ignored. Then the multi-channel speech signals are added together through a mixer. In a ubiquitous computing environment, the received speech signal is usually heavily corrupted by background noises. An SNR-aware subspace speech of enhancement approach is used as a pre-processing to enhance the mixed informational signal as well as suppressing the noise. Considering the text-independent speaker recognition, this proposed work applies multi-class support vectors machine (SVM) instead of using conventional Gaussian mixture models (GMMs). In our experiments, the speaker recognition rate up to 97.2% with the proposed ubiquitous architecture of speaker recognition system. Additionally, we proposed a hardware realization of speaker identification system based on sequential minimal optimization (SMO) algorithm of SVM. We also proposed more efficient method of cache table utilization, and intend to save more then one half of cache table space as well as to reduce processing time of kernel function. Moreover, the heuristics selection method of SMO algorithm is implemented into hardware design to reduce the training time. In our experiments, the training time can reduce 2.17 times less than non-use of heuristics selection method on PC. And our finding shows that the identification ratio up to 92.5% of accuracy and reduced 53% of training time in hardware implementation. Jhing-Fa Wang 王駿發 2008 學位論文 ; thesis 75 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 電機工程學系碩博士班 === 96 === A novel architecture for ubiquitous and robust of text-independent speaker recognition based on SVM approach is proposed. In this architecture, multiple far-field microphones of configuration is adopted to receive the pervasive speech signals, and the distance effect between speaker and microphone is supposed to be ignored. Then the multi-channel speech signals are added together through a mixer. In a ubiquitous computing environment, the received speech signal is usually heavily corrupted by background noises. An SNR-aware subspace speech of enhancement approach is used as a pre-processing to enhance the mixed informational signal as well as suppressing the noise. Considering the text-independent speaker recognition, this proposed work applies multi-class support vectors machine (SVM) instead of using conventional Gaussian mixture models (GMMs). In our experiments, the speaker recognition rate up to 97.2% with the proposed ubiquitous architecture of speaker recognition system.
Additionally, we proposed a hardware realization of speaker identification system based on sequential minimal optimization (SMO) algorithm of SVM. We also proposed more efficient method of cache table utilization, and intend to save more then one half of cache table space as well as to reduce processing time of kernel function. Moreover, the heuristics selection method of SMO algorithm is implemented into hardware design to reduce the training time. In our experiments, the training time can reduce 2.17 times less than non-use of heuristics selection method on PC. And our finding shows that the identification ratio up to 92.5% of accuracy and reduced 53% of training time in hardware implementation.
|
author2 |
Jhing-Fa Wang |
author_facet |
Jhing-Fa Wang Gaung-Hui Gu 古光輝 |
author |
Gaung-Hui Gu 古光輝 |
spellingShingle |
Gaung-Hui Gu 古光輝 Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM |
author_sort |
Gaung-Hui Gu |
title |
Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM |
title_short |
Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM |
title_full |
Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM |
title_fullStr |
Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM |
title_full_unstemmed |
Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM |
title_sort |
ubiquitous and robust text-independent speaker recognition and fpga implementation for smo algorithm of svm |
publishDate |
2008 |
url |
http://ndltd.ncl.edu.tw/handle/86629603830179768114 |
work_keys_str_mv |
AT gaunghuigu ubiquitousandrobusttextindependentspeakerrecognitionandfpgaimplementationforsmoalgorithmofsvm AT gǔguānghuī ubiquitousandrobusttextindependentspeakerrecognitionandfpgaimplementationforsmoalgorithmofsvm AT gaunghuigu fànzàihuánjìngyǔjùqiángjiànxìngzhīfēiwénzìxiāngguānyǔzhěbiànshíjísvmsmoyǎnsuànfǎzhīfpgashèjìyǔshíxiàn AT gǔguānghuī fànzàihuánjìngyǔjùqiángjiànxìngzhīfēiwénzìxiāngguānyǔzhěbiànshíjísvmsmoyǎnsuànfǎzhīfpgashèjìyǔshíxiàn |
_version_ |
1718502101815918592 |