The Research and Realization of Speaker Recognition on VLSI Architecture Design and Intelligent Porch Application

博士 === 國立成功大學 === 電機工程學系碩博士班 === 100 === In smart life, the development of smart portable devices and smart home appliances, have attracted the researchers to improve in their tiny size, high performance, interactive application, and powerful functionality. The speaker recognition plays the importan...

Full description

Bibliographic Details
Main Authors:	Ta-WenKuan, 官大文
Other Authors:	Jhing-Fa Wang
Format:	Others
Language:	en_US
Published:	2012
Online Access:	http://ndltd.ncl.edu.tw/handle/88636292185407496374

id	ndltd-TW-100NCKU5442033
record_format	oai_dc
spelling	ndltd-TW-100NCKU54420332015-10-13T21:33:11Z http://ndltd.ncl.edu.tw/handle/88636292185407496374 The Research and Realization of Speaker Recognition on VLSI Architecture Design and Intelligent Porch Application 語者識別之超大型積體電路架構設計及其在智慧玄關應用之研究與實現 Ta-WenKuan 官大文博士國立成功大學電機工程學系碩博士班 100 In smart life, the development of smart portable devices and smart home appliances, have attracted the researchers to improve in their tiny size, high performance, interactive application, and powerful functionality. The speaker recognition plays the important role for the owner recognition in mobile device, and the enrollment authentication at smart home. In this dissertation, we explore the speaker recognition in two fields, that is, the hardware implementation and the smart home application. In hardware realization, multiple platforms, such as ARM platform, FPGA platform and ARM+FPGA platform, are adopted to explore the speaker recognition, and realize into the embedded SoC system, VLSI architecture design and Hardware/Software co-design. In smart home, the speaker recognition is investigated in intelligent porch system to attain the nature way for home user authentication and to interact smartly with home appliances. However, the adverse and mismatch conditions influence the speaker expert, therefore, the speaker expert is proposed to fuse with other human cues, such as, speech expert, face expert and height detector, to reach the multi-modal and biometric recognition system for smart home. In general, the speaker recognition can be categorized in two modalities, i.e. speaker identification and speaker verification. The speaker identification scores and determines the target speaker’s identity from unknown speaker in a close set of trained models, whereas the speaker verification verifies the claimed voice with corresponding claimed identity, through a confident threshold to determine the target speaker, such a task can be regarded as an open set. Two critical phases are commonly addressed in speaker recognition, that is, model training and speaker recognition. Generally, the model training is time-consuming particularly in mobile device. This motives us to examine the training phase in hardware implementation to accelerate the training performance. In this dissertation, the Support Vector Machine (SVM) is exhibited for the speaker model training and classification, and the Sequential Minimum Optimization (SMO) algorithm in SVM, is used to accelerate the speaker model training. In order to realize the complex SMO algorithm on multiple hardware platforms, the SMO algorithm is analyzed and modified prior to the feasible steps and blocks, and then realized on several hardware platforms. The experimental results show that the VLSI design of SMO algorithm indeed accelerates the training speed, and the accuracy in speaker identification has no big difference compared with software simulation. Jhing-Fa Wang 王駿發 2012 學位論文 ; thesis 98 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	博士 === 國立成功大學 === 電機工程學系碩博士班 === 100 === In smart life, the development of smart portable devices and smart home appliances, have attracted the researchers to improve in their tiny size, high performance, interactive application, and powerful functionality. The speaker recognition plays the important role for the owner recognition in mobile device, and the enrollment authentication at smart home. In this dissertation, we explore the speaker recognition in two fields, that is, the hardware implementation and the smart home application. In hardware realization, multiple platforms, such as ARM platform, FPGA platform and ARM+FPGA platform, are adopted to explore the speaker recognition, and realize into the embedded SoC system, VLSI architecture design and Hardware/Software co-design. In smart home, the speaker recognition is investigated in intelligent porch system to attain the nature way for home user authentication and to interact smartly with home appliances. However, the adverse and mismatch conditions influence the speaker expert, therefore, the speaker expert is proposed to fuse with other human cues, such as, speech expert, face expert and height detector, to reach the multi-modal and biometric recognition system for smart home. In general, the speaker recognition can be categorized in two modalities, i.e. speaker identification and speaker verification. The speaker identification scores and determines the target speaker’s identity from unknown speaker in a close set of trained models, whereas the speaker verification verifies the claimed voice with corresponding claimed identity, through a confident threshold to determine the target speaker, such a task can be regarded as an open set. Two critical phases are commonly addressed in speaker recognition, that is, model training and speaker recognition. Generally, the model training is time-consuming particularly in mobile device. This motives us to examine the training phase in hardware implementation to accelerate the training performance. In this dissertation, the Support Vector Machine (SVM) is exhibited for the speaker model training and classification, and the Sequential Minimum Optimization (SMO) algorithm in SVM, is used to accelerate the speaker model training. In order to realize the complex SMO algorithm on multiple hardware platforms, the SMO algorithm is analyzed and modified prior to the feasible steps and blocks, and then realized on several hardware platforms. The experimental results show that the VLSI design of SMO algorithm indeed accelerates the training speed, and the accuracy in speaker identification has no big difference compared with software simulation.
author2	Jhing-Fa Wang
author_facet	Jhing-Fa Wang Ta-WenKuan 官大文
author	Ta-WenKuan 官大文
spellingShingle	Ta-WenKuan 官大文 The Research and Realization of Speaker Recognition on VLSI Architecture Design and Intelligent Porch Application
author_sort	Ta-WenKuan
title	The Research and Realization of Speaker Recognition on VLSI Architecture Design and Intelligent Porch Application
title_short	The Research and Realization of Speaker Recognition on VLSI Architecture Design and Intelligent Porch Application
title_full	The Research and Realization of Speaker Recognition on VLSI Architecture Design and Intelligent Porch Application
title_fullStr	The Research and Realization of Speaker Recognition on VLSI Architecture Design and Intelligent Porch Application
title_full_unstemmed	The Research and Realization of Speaker Recognition on VLSI Architecture Design and Intelligent Porch Application
title_sort	research and realization of speaker recognition on vlsi architecture design and intelligent porch application
publishDate	2012
url	http://ndltd.ncl.edu.tw/handle/88636292185407496374
work_keys_str_mv	AT tawenkuan theresearchandrealizationofspeakerrecognitiononvlsiarchitecturedesignandintelligentporchapplication AT guāndàwén theresearchandrealizationofspeakerrecognitiononvlsiarchitecturedesignandintelligentporchapplication AT tawenkuan yǔzhěshíbiézhīchāodàxíngjītǐdiànlùjiàgòushèjìjíqízàizhìhuìxuánguānyīngyòngzhīyánjiūyǔshíxiàn AT guāndàwén yǔzhěshíbiézhīchāodàxíngjītǐdiànlùjiàgòushèjìjíqízàizhìhuìxuánguānyīngyòngzhīyánjiūyǔshíxiàn AT tawenkuan researchandrealizationofspeakerrecognitiononvlsiarchitecturedesignandintelligentporchapplication AT guāndàwén researchandrealizationofspeakerrecognitiononvlsiarchitecturedesignandintelligentporchapplication
_version_	1718065925261885440

The Research and Realization of Speaker Recognition on VLSI Architecture Design and Intelligent Porch Application

Similar Items