Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM

碩士 === 國立成功大學 === 電機工程學系碩博士班 === 96 === A novel architecture for ubiquitous and robust of text-independent speaker recognition based on SVM approach is proposed. In this architecture, multiple far-field microphones of configuration is adopted to receive the pervasive speech signals, and the distance...

Full description

Bibliographic Details
Main Authors: Gaung-Hui Gu, 古光輝
Other Authors: Jhing-Fa Wang
Format: Others
Language:en_US
Published: 2008
Online Access:http://ndltd.ncl.edu.tw/handle/86629603830179768114
id ndltd-TW-096NCKU5442155
record_format oai_dc
spelling ndltd-TW-096NCKU54421552017-07-20T04:35:15Z http://ndltd.ncl.edu.tw/handle/86629603830179768114 Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM 泛在環境與具強健性之非文字相關語者辨識及SVM/SMO演算法之FPGA設計與實現 Gaung-Hui Gu 古光輝 碩士 國立成功大學 電機工程學系碩博士班 96 A novel architecture for ubiquitous and robust of text-independent speaker recognition based on SVM approach is proposed. In this architecture, multiple far-field microphones of configuration is adopted to receive the pervasive speech signals, and the distance effect between speaker and microphone is supposed to be ignored. Then the multi-channel speech signals are added together through a mixer. In a ubiquitous computing environment, the received speech signal is usually heavily corrupted by background noises. An SNR-aware subspace speech of enhancement approach is used as a pre-processing to enhance the mixed informational signal as well as suppressing the noise. Considering the text-independent speaker recognition, this proposed work applies multi-class support vectors machine (SVM) instead of using conventional Gaussian mixture models (GMMs). In our experiments, the speaker recognition rate up to 97.2% with the proposed ubiquitous architecture of speaker recognition system. Additionally, we proposed a hardware realization of speaker identification system based on sequential minimal optimization (SMO) algorithm of SVM. We also proposed more efficient method of cache table utilization, and intend to save more then one half of cache table space as well as to reduce processing time of kernel function. Moreover, the heuristics selection method of SMO algorithm is implemented into hardware design to reduce the training time. In our experiments, the training time can reduce 2.17 times less than non-use of heuristics selection method on PC. And our finding shows that the identification ratio up to 92.5% of accuracy and reduced 53% of training time in hardware implementation. Jhing-Fa Wang 王駿發 2008 學位論文 ; thesis 75 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 電機工程學系碩博士班 === 96 === A novel architecture for ubiquitous and robust of text-independent speaker recognition based on SVM approach is proposed. In this architecture, multiple far-field microphones of configuration is adopted to receive the pervasive speech signals, and the distance effect between speaker and microphone is supposed to be ignored. Then the multi-channel speech signals are added together through a mixer. In a ubiquitous computing environment, the received speech signal is usually heavily corrupted by background noises. An SNR-aware subspace speech of enhancement approach is used as a pre-processing to enhance the mixed informational signal as well as suppressing the noise. Considering the text-independent speaker recognition, this proposed work applies multi-class support vectors machine (SVM) instead of using conventional Gaussian mixture models (GMMs). In our experiments, the speaker recognition rate up to 97.2% with the proposed ubiquitous architecture of speaker recognition system. Additionally, we proposed a hardware realization of speaker identification system based on sequential minimal optimization (SMO) algorithm of SVM. We also proposed more efficient method of cache table utilization, and intend to save more then one half of cache table space as well as to reduce processing time of kernel function. Moreover, the heuristics selection method of SMO algorithm is implemented into hardware design to reduce the training time. In our experiments, the training time can reduce 2.17 times less than non-use of heuristics selection method on PC. And our finding shows that the identification ratio up to 92.5% of accuracy and reduced 53% of training time in hardware implementation.
author2 Jhing-Fa Wang
author_facet Jhing-Fa Wang
Gaung-Hui Gu
古光輝
author Gaung-Hui Gu
古光輝
spellingShingle Gaung-Hui Gu
古光輝
Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM
author_sort Gaung-Hui Gu
title Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM
title_short Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM
title_full Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM
title_fullStr Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM
title_full_unstemmed Ubiquitous and Robust Text-Independent Speaker Recognition and FPGA Implementation for SMO algorithm of SVM
title_sort ubiquitous and robust text-independent speaker recognition and fpga implementation for smo algorithm of svm
publishDate 2008
url http://ndltd.ncl.edu.tw/handle/86629603830179768114
work_keys_str_mv AT gaunghuigu ubiquitousandrobusttextindependentspeakerrecognitionandfpgaimplementationforsmoalgorithmofsvm
AT gǔguānghuī ubiquitousandrobusttextindependentspeakerrecognitionandfpgaimplementationforsmoalgorithmofsvm
AT gaunghuigu fànzàihuánjìngyǔjùqiángjiànxìngzhīfēiwénzìxiāngguānyǔzhěbiànshíjísvmsmoyǎnsuànfǎzhīfpgashèjìyǔshíxiàn
AT gǔguānghuī fànzàihuánjìngyǔjùqiángjiànxìngzhīfēiwénzìxiāngguānyǔzhěbiànshíjísvmsmoyǎnsuànfǎzhīfpgashèjìyǔshíxiàn
_version_ 1718502101815918592