Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System

With the fast development of computer and information technology, multimedia data has become the most important form of information media. Auditory information plays an important role in information location, this comes from the fact that it can be difficult to find useful information. Thus audio cl...

Full description

Bibliographic Details
Main Author: Linyuan Fan
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9072167/
id doaj-a306e82ae32e4bff87460723083b962b
record_format Article
spelling doaj-a306e82ae32e4bff87460723083b962b2021-03-30T01:39:41ZengIEEEIEEE Access2169-35362020-01-018786307863810.1109/ACCESS.2020.29886869072167Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine SystemLinyuan Fan0https://orcid.org/0000-0001-9905-4188School of Statistics, Capital University of Economics and Business, Beijing, ChinaWith the fast development of computer and information technology, multimedia data has become the most important form of information media. Auditory information plays an important role in information location, this comes from the fact that it can be difficult to find useful information. Thus audio classification becomes more important in audio analysis as it prepares for content-based audio retrieval. There is quite a bit of research on the topic of audio classification methods, audio feature analysis, and extraction based on audio classification. Many works of literature extract features of audio signals based on time or Fourier transform frequency domain. The emergence of the wavelet theory provides a time-frequency analysis tool for signal analysis. Wavelet transformation is a local transformation of the signal in time and frequency which can effectively extract information from the signal, and perform multi-scale refinement analysis on functions or signals through operations such as stretching and translation instead of the traditional Fourier transformation. In the time-frequency analysis of the signal, the wavelet analysis captures the local time and frequency characters of the signal which can improve the ability of signal analysis. It can also change certain locals of the signal without affecting other aspects of it. In this paper, the frequency domain features are combined with the wavelet domain features. At the same time that the MFCC features are extracted, the discrete wavelet transform is used to extract the features of the wavelet domain. Then the statistical features are extracted for each audio example, and the SVM model is used to realize the different forms of audio classification identification.https://ieeexplore.ieee.org/document/9072167/Content audiowavelet transformaudio featureaudio processing
collection DOAJ
language English
format Article
sources DOAJ
author Linyuan Fan
spellingShingle Linyuan Fan
Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System
IEEE Access
Content audio
wavelet transform
audio feature
audio processing
author_facet Linyuan Fan
author_sort Linyuan Fan
title Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System
title_short Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System
title_full Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System
title_fullStr Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System
title_full_unstemmed Audio Example Recognition and Retrieval Based on Geometric Incremental Learning Support Vector Machine System
title_sort audio example recognition and retrieval based on geometric incremental learning support vector machine system
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description With the fast development of computer and information technology, multimedia data has become the most important form of information media. Auditory information plays an important role in information location, this comes from the fact that it can be difficult to find useful information. Thus audio classification becomes more important in audio analysis as it prepares for content-based audio retrieval. There is quite a bit of research on the topic of audio classification methods, audio feature analysis, and extraction based on audio classification. Many works of literature extract features of audio signals based on time or Fourier transform frequency domain. The emergence of the wavelet theory provides a time-frequency analysis tool for signal analysis. Wavelet transformation is a local transformation of the signal in time and frequency which can effectively extract information from the signal, and perform multi-scale refinement analysis on functions or signals through operations such as stretching and translation instead of the traditional Fourier transformation. In the time-frequency analysis of the signal, the wavelet analysis captures the local time and frequency characters of the signal which can improve the ability of signal analysis. It can also change certain locals of the signal without affecting other aspects of it. In this paper, the frequency domain features are combined with the wavelet domain features. At the same time that the MFCC features are extracted, the discrete wavelet transform is used to extract the features of the wavelet domain. Then the statistical features are extracted for each audio example, and the SVM model is used to realize the different forms of audio classification identification.
topic Content audio
wavelet transform
audio feature
audio processing
url https://ieeexplore.ieee.org/document/9072167/
work_keys_str_mv AT linyuanfan audioexamplerecognitionandretrievalbasedongeometricincrementallearningsupportvectormachinesystem
_version_ 1724186696383201280