Multimodal Semantic Analysis and Annotation for Basketball Video
<p/> <p>This paper presents a new multiple-modality method for extracting semantic information from basketball video. The visual, motion, and audio information are extracted from video to first generate some low-level video segmentation and classification. Domain knowledge is further exp...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2006-01-01
|
Series: | EURASIP Journal on Advances in Signal Processing |
Online Access: | http://dx.doi.org/10.1155/ASP/2006/32135 |
id |
doaj-c2d3303a3bab433bb2bf6bbd90c1e43e |
---|---|
record_format |
Article |
spelling |
doaj-c2d3303a3bab433bb2bf6bbd90c1e43e2020-11-25T00:19:07ZengSpringerOpenEURASIP Journal on Advances in Signal Processing1687-61721687-61802006-01-0120061032135Multimodal Semantic Analysis and Annotation for Basketball VideoLiu SongXu MinYi HaoranChia Liang-TienRajan Deepu<p/> <p>This paper presents a new multiple-modality method for extracting semantic information from basketball video. The visual, motion, and audio information are extracted from video to first generate some low-level video segmentation and classification. Domain knowledge is further exploited for detecting interesting events in the basketball video. For video, both visual and motion prediction information are utilized for shot and scene boundary detection algorithm; this will be followed by scene classification. For audio, audio keysounds are sets of specific audio sounds related to semantic events and a classification method based on hidden Markov model (HMM) is used for audio keysound identification. Subsequently, by analyzing the multimodal information, the positions of potential semantic events, such as "foul" and "shot at the basket," are located with additional domain knowledge. Finally, a video annotation is generated according to MPEG-7 multimedia description schemes (MDSs). Experimental results demonstrate the effectiveness of the proposed method.</p> http://dx.doi.org/10.1155/ASP/2006/32135 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Liu Song Xu Min Yi Haoran Chia Liang-Tien Rajan Deepu |
spellingShingle |
Liu Song Xu Min Yi Haoran Chia Liang-Tien Rajan Deepu Multimodal Semantic Analysis and Annotation for Basketball Video EURASIP Journal on Advances in Signal Processing |
author_facet |
Liu Song Xu Min Yi Haoran Chia Liang-Tien Rajan Deepu |
author_sort |
Liu Song |
title |
Multimodal Semantic Analysis and Annotation for Basketball Video |
title_short |
Multimodal Semantic Analysis and Annotation for Basketball Video |
title_full |
Multimodal Semantic Analysis and Annotation for Basketball Video |
title_fullStr |
Multimodal Semantic Analysis and Annotation for Basketball Video |
title_full_unstemmed |
Multimodal Semantic Analysis and Annotation for Basketball Video |
title_sort |
multimodal semantic analysis and annotation for basketball video |
publisher |
SpringerOpen |
series |
EURASIP Journal on Advances in Signal Processing |
issn |
1687-6172 1687-6180 |
publishDate |
2006-01-01 |
description |
<p/> <p>This paper presents a new multiple-modality method for extracting semantic information from basketball video. The visual, motion, and audio information are extracted from video to first generate some low-level video segmentation and classification. Domain knowledge is further exploited for detecting interesting events in the basketball video. For video, both visual and motion prediction information are utilized for shot and scene boundary detection algorithm; this will be followed by scene classification. For audio, audio keysounds are sets of specific audio sounds related to semantic events and a classification method based on hidden Markov model (HMM) is used for audio keysound identification. Subsequently, by analyzing the multimodal information, the positions of potential semantic events, such as "foul" and "shot at the basket," are located with additional domain knowledge. Finally, a video annotation is generated according to MPEG-7 multimedia description schemes (MDSs). Experimental results demonstrate the effectiveness of the proposed method.</p> |
url |
http://dx.doi.org/10.1155/ASP/2006/32135 |
work_keys_str_mv |
AT liusong multimodalsemanticanalysisandannotationforbasketballvideo AT xumin multimodalsemanticanalysisandannotationforbasketballvideo AT yihaoran multimodalsemanticanalysisandannotationforbasketballvideo AT chialiangtien multimodalsemanticanalysisandannotationforbasketballvideo AT rajandeepu multimodalsemanticanalysisandannotationforbasketballvideo |
_version_ |
1725373215610503168 |