Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System
With the fast development of computer and information technology, multimedia data has become the most important form of information media. Auditory information plays an important role in information location, this comes from the fact that it can be difficult to find useful information. Thus audio cl...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9072167/ |
id |
doaj-a306e82ae32e4bff87460723083b962b |
---|---|
record_format |
Article |
spelling |
doaj-a306e82ae32e4bff87460723083b962b2021-03-30T01:39:41ZengIEEEIEEE Access2169-35362020-01-018786307863810.1109/ACCESS.2020.29886869072167Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine SystemLinyuan Fan0https://orcid.org/0000-0001-9905-4188School of Statistics, Capital University of Economics and Business, Beijing, ChinaWith the fast development of computer and information technology, multimedia data has become the most important form of information media. Auditory information plays an important role in information location, this comes from the fact that it can be difficult to find useful information. Thus audio classification becomes more important in audio analysis as it prepares for content-based audio retrieval. There is quite a bit of research on the topic of audio classification methods, audio feature analysis, and extraction based on audio classification. Many works of literature extract features of audio signals based on time or Fourier transform frequency domain. The emergence of the wavelet theory provides a time-frequency analysis tool for signal analysis. Wavelet transformation is a local transformation of the signal in time and frequency which can effectively extract information from the signal, and perform multi-scale refinement analysis on functions or signals through operations such as stretching and translation instead of the traditional Fourier transformation. In the time-frequency analysis of the signal, the wavelet analysis captures the local time and frequency characters of the signal which can improve the ability of signal analysis. It can also change certain locals of the signal without affecting other aspects of it. In this paper, the frequency domain features are combined with the wavelet domain features. At the same time that the MFCC features are extracted, the discrete wavelet transform is used to extract the features of the wavelet domain. Then the statistical features are extracted for each audio example, and the SVM model is used to realize the different forms of audio classification identification.https://ieeexplore.ieee.org/document/9072167/Content audiowavelet transformaudio featureaudio processing |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Linyuan Fan |
spellingShingle |
Linyuan Fan Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System IEEE Access Content audio wavelet transform audio feature audio processing |
author_facet |
Linyuan Fan |
author_sort |
Linyuan Fan |
title |
Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System |
title_short |
Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System |
title_full |
Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System |
title_fullStr |
Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System |
title_full_unstemmed |
Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System |
title_sort |
audio example recognition and retrieval based on geometric incremental learning support vector machine system |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
With the fast development of computer and information technology, multimedia data has become the most important form of information media. Auditory information plays an important role in information location, this comes from the fact that it can be difficult to find useful information. Thus audio classification becomes more important in audio analysis as it prepares for content-based audio retrieval. There is quite a bit of research on the topic of audio classification methods, audio feature analysis, and extraction based on audio classification. Many works of literature extract features of audio signals based on time or Fourier transform frequency domain. The emergence of the wavelet theory provides a time-frequency analysis tool for signal analysis. Wavelet transformation is a local transformation of the signal in time and frequency which can effectively extract information from the signal, and perform multi-scale refinement analysis on functions or signals through operations such as stretching and translation instead of the traditional Fourier transformation. In the time-frequency analysis of the signal, the wavelet analysis captures the local time and frequency characters of the signal which can improve the ability of signal analysis. It can also change certain locals of the signal without affecting other aspects of it. In this paper, the frequency domain features are combined with the wavelet domain features. At the same time that the MFCC features are extracted, the discrete wavelet transform is used to extract the features of the wavelet domain. Then the statistical features are extracted for each audio example, and the SVM model is used to realize the different forms of audio classification identification. |
topic |
Content audio wavelet transform audio feature audio processing |
url |
https://ieeexplore.ieee.org/document/9072167/ |
work_keys_str_mv |
AT linyuanfan audioexamplerecognitionandretrievalbasedongeometricincrementallearningsupportvectormachinesystem |
_version_ |
1724186696383201280 |