Implementation of ASSR System Based on HMM and Syllable Models on FPGA

碩士 === 國立成功大學 === 電機工程學系 === 104 === Hidden Markov Models (HMMs) is one of the most popular methods for modern speech recognition. In this thesis, we propose an Automatic Speech-Speaker Recognition (ASSR) system on a FPGA platform. The ASSR system includes four parts: 1) pre-processing, 2) feature e...

Full description

Bibliographic Details
Main Authors: Wei-XiangLiao, 廖韋翔
Other Authors: Jhing-Fa Wang
Format: Others
Language:en_US
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/7ppebg
id ndltd-TW-104NCKU5442137
record_format oai_dc
spelling ndltd-TW-104NCKU54421372019-05-15T22:54:11Z http://ndltd.ncl.edu.tw/handle/7ppebg Implementation of ASSR System Based on HMM and Syllable Models on FPGA 以FPGA實現基於HMM之音節模型組成之語音辨識系統 Wei-XiangLiao 廖韋翔 碩士 國立成功大學 電機工程學系 104 Hidden Markov Models (HMMs) is one of the most popular methods for modern speech recognition. In this thesis, we propose an Automatic Speech-Speaker Recognition (ASSR) system on a FPGA platform. The ASSR system includes four parts: 1) pre-processing, 2) feature extraction, 3) speech and speaker recognition and 4) Out-of-Vocabulary (OOV) and Out-of-Speaker (OOS) detection. This study adopts the Mel-frequency cepstral coefficients (MFCCs) as the features for feature extraction module. We use Hidden Markov Model (HMM) to build the acoustic model for each phoneme, and evaluate our approaches on two databases: the THCHS-30 (Tsinghua Chinese 30 hour database) and the CMU ARCTIC Databases. The binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. The last part of ASSR uses the grammar to detect OOV, and the OOS detection algorithm to detect OOS. The experiments are conducted on two types of platforms including PC and Xilinx Spartan-6 FPGA. The experimental results indicate that the proposed work can achieve 90.8% of Mandarin speech recognition and 86.6% of English speech recognition rate, respectively. The work can achieve 88.7% of OOV detection rate of Mandarin and 84.9% of OOV detection rate of English as well. The speaker recognition rate also reaches to 81.3% and OOS detection rate reaches to 80.8%, respectively. Jhing-Fa Wang 王駿發 2016 學位論文 ; thesis 64 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 電機工程學系 === 104 === Hidden Markov Models (HMMs) is one of the most popular methods for modern speech recognition. In this thesis, we propose an Automatic Speech-Speaker Recognition (ASSR) system on a FPGA platform. The ASSR system includes four parts: 1) pre-processing, 2) feature extraction, 3) speech and speaker recognition and 4) Out-of-Vocabulary (OOV) and Out-of-Speaker (OOS) detection. This study adopts the Mel-frequency cepstral coefficients (MFCCs) as the features for feature extraction module. We use Hidden Markov Model (HMM) to build the acoustic model for each phoneme, and evaluate our approaches on two databases: the THCHS-30 (Tsinghua Chinese 30 hour database) and the CMU ARCTIC Databases. The binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. The last part of ASSR uses the grammar to detect OOV, and the OOS detection algorithm to detect OOS. The experiments are conducted on two types of platforms including PC and Xilinx Spartan-6 FPGA. The experimental results indicate that the proposed work can achieve 90.8% of Mandarin speech recognition and 86.6% of English speech recognition rate, respectively. The work can achieve 88.7% of OOV detection rate of Mandarin and 84.9% of OOV detection rate of English as well. The speaker recognition rate also reaches to 81.3% and OOS detection rate reaches to 80.8%, respectively.
author2 Jhing-Fa Wang
author_facet Jhing-Fa Wang
Wei-XiangLiao
廖韋翔
author Wei-XiangLiao
廖韋翔
spellingShingle Wei-XiangLiao
廖韋翔
Implementation of ASSR System Based on HMM and Syllable Models on FPGA
author_sort Wei-XiangLiao
title Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_short Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_full Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_fullStr Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_full_unstemmed Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_sort implementation of assr system based on hmm and syllable models on fpga
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/7ppebg
work_keys_str_mv AT weixiangliao implementationofassrsystembasedonhmmandsyllablemodelsonfpga
AT liàowéixiáng implementationofassrsystembasedonhmmandsyllablemodelsonfpga
AT weixiangliao yǐfpgashíxiànjīyúhmmzhīyīnjiémóxíngzǔchéngzhīyǔyīnbiànshíxìtǒng
AT liàowéixiáng yǐfpgashíxiànjīyúhmmzhīyīnjiémóxíngzǔchéngzhīyǔyīnbiànshíxìtǒng
_version_ 1719137463583113216