VLSI Architectures for Home Environmental Sound Recognition Based on MPEG-7 Features

Master's === National Cheng Kung University === Department of Electrical Engineering, Master's and Doctoral Program === 91 === This thesis proposes an environmental sound recognition system based on MPEG-7 features (centroid, spread, and flatness [1]) and its corresponding VLSI architectures. Traditional sound recognizers use decision-tree-based methods, which cause a problem...

Bibliographic Details
Main Authors: Tze-Hsuan Huang, 黃子軒
Other Authors: Jhing-Fa Wang
Format: Others
Language: en_US
Published: 2003
Online Access: http://ndltd.ncl.edu.tw/handle/50906995482258528290
id ndltd-TW-091NCKU5442211
record_format oai_dc
spelling ndltd-TW-091NCKU5442211 2016-06-22T04:14:02Z http://ndltd.ncl.edu.tw/handle/50906995482258528290 VLSI Architectures for Home Environmental Sound Recognition Based on MPEG-7 Features 以MPEG-7特徵為基礎的居家環境聲音辨識器之超大型積體電路架構設計 Tze-Hsuan Huang 黃子軒 Master's National Cheng Kung University Department of Electrical Engineering, Master's and Doctoral Program 91 Jhing-Fa Wang 王駿發 2003 thesis 57 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Master's === National Cheng Kung University === Department of Electrical Engineering, Master's and Doctoral Program === 91 === This thesis proposes an environmental sound recognition system based on MPEG-7 features (centroid, spread, and flatness [1]) and its corresponding VLSI architectures. Traditional sound recognizers use decision-tree-based methods, which cause a problem: their parameters do not generalize [2~5]. An HMM-based sound recognizer was introduced in [8] to resolve this drawback. However, it uses spectrum parameters, which results in high-dimensional feature vectors. This thesis addresses that shortcoming by applying basis extraction. The recognition rate is about 82% when only the spectrogram is used as the parameter, and improves to about 95% when the three MPEG-7 audio features mentioned above are used as the parameters of our environmental sound recognizer. VLSI architectures for this sound recognition system are also proposed. The first is the feature extraction module, whose most complicated computations are the division and nth-root operations. We use the CORDIC method to devise a divider, and for the nth-root operation we design a dedicated circuit based on the Brahmagupta iteration algorithm. A dedicated hardware architecture is also presented for the Viterbi algorithm. It is based on the 4-step fully Viterbi algorithm, and the speed-up of this module is also attributed to its fully pipelined systolic array architecture.
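As an illustration of why division and nth-root operations dominate the feature extraction module, the following is a minimal software sketch, not the thesis's circuit. It assumes simplified linear-frequency definitions of centroid, spread, and flatness rather than the normative MPEG-7 descriptors; the function names `cordic_divide` and `spectral_features` are hypothetical. The divider uses linear-mode CORDIC, and the nth root (geometric mean) is computed with logarithms here, standing in for the Brahmagupta-based circuit, whose details the abstract does not give.

```python
import math

def cordic_divide(y, x, iterations=24):
    """Approximate y / x with linear-mode CORDIC (vectoring).

    Assumes x > 0 and |y / x| < 2. In hardware, `x * step` would be a
    right shift of x, so only shifts and adds are needed.
    """
    q = 0.0
    step = 1.0  # 2**-i for iteration i
    for _ in range(iterations):
        if y >= 0:
            y -= x * step   # drive the residual y toward zero
            q += step
        else:
            y += x * step
            q -= step
        step *= 0.5
    return q

def spectral_features(power, freqs):
    """Return (centroid, spread, flatness) of one power-spectrum frame.

    Simplified definitions (an assumption, not the MPEG-7 normative ones):
    centroid = power-weighted mean frequency,
    spread   = power-weighted standard deviation around the centroid,
    flatness = geometric mean / arithmetic mean of the power values.
    All power values must be strictly positive.
    """
    n = len(power)
    total = sum(power)
    centroid = sum(f * p for f, p in zip(freqs, power)) / total
    variance = sum((f - centroid) ** 2 * p for f, p in zip(freqs, power)) / total
    spread = math.sqrt(variance)
    # Geometric mean = nth root of the product, computed via logs to avoid overflow.
    geo_mean = math.exp(sum(math.log(p) for p in power) / n)
    arith_mean = total / n
    # The ratio lies in (0, 1], which is inside the CORDIC divider's range.
    flatness = cordic_divide(geo_mean, arith_mean)
    return centroid, spread, flatness

if __name__ == "__main__":
    freqs = [100.0, 200.0, 300.0, 400.0]
    print(spectral_features([1.0, 1.0, 1.0, 1.0], freqs))    # flat spectrum
    print(spectral_features([1e-6, 1e-6, 1.0, 1e-6], freqs)) # one dominant bin
```

A flat spectrum gives a flatness close to 1, while a spectrum dominated by a single bin gives a flatness close to 0. With 24 iterations the CORDIC quotient is accurate to roughly 2^-23.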
author2 Jhing-Fa Wang
author_facet Jhing-Fa Wang
Tze-Hsuan Huang
黃子軒
author Tze-Hsuan Huang
黃子軒
spellingShingle Tze-Hsuan Huang
黃子軒
VLSI Architectures for Home Environmental Sound Recognition Based on MPEG-7 Features
author_sort Tze-Hsuan Huang
title VLSI Architectures for Home Environmental Sound Recognition Based on MPEG-7 Features
title_short VLSI Architectures for Home Environmental Sound Recognition Based on MPEG-7 Features
title_full VLSI Architectures for Home Environmental Sound Recognition Based on MPEG-7 Features
title_fullStr VLSI Architectures for Home Environmental Sound Recognition Based on MPEG-7 Features
title_full_unstemmed VLSI Architectures for Home Environmental Sound Recognition Based on MPEG-7 Features
title_sort vlsi architectures for home environmental sound recognition based on mpeg-7 features
publishDate 2003
url http://ndltd.ncl.edu.tw/handle/50906995482258528290
work_keys_str_mv AT tzehsuanhuang vlsiarchitecturesforhomeenvironmentalsoundrecognitionbasedonmpeg7features
AT huángzixuān vlsiarchitecturesforhomeenvironmentalsoundrecognitionbasedonmpeg7features
AT tzehsuanhuang yǐmpeg7tèzhēngwèijīchǔdejūjiāhuánjìngshēngyīnbiànshíqìzhīchāodàxíngjītǐdiànlùjiàgòushèjì
AT huángzixuān yǐmpeg7tèzhēngwèijīchǔdejūjiāhuánjìngshēngyīnbiànshíqìzhīchāodàxíngjītǐdiànlùjiàgòushèjì
_version_ 1718314394335576064