Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring

碩士 === 國立交通大學 === 資訊工程系 === 90 === In this paper, we propose a novel approach to generate the table of video content based on shot description of motion activity and textual information of closed caption in MPEG-Ⅱ sports videos. In order to speed up in scene change detection, instead of e...

Full description

Bibliographic Details
Main Authors: Shu-Jiuan Lin, 林淑娟
Other Authors: Suh-Yin Lee
Format: Others
Language:en_US
Published: 2002
Online Access:http://ndltd.ncl.edu.tw/handle/90085530504310189590
id ndltd-TW-090NCTU0392028
record_format oai_dc
spelling ndltd-TW-090NCTU03920282016-06-27T16:08:59Z http://ndltd.ncl.edu.tw/handle/90085530504310189590 Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring 利用運動強度分析,影片片段辨識及畫面字幕偵測建構視訊影片內容結構之研究 Shu-Jiuan Lin 林淑娟 碩士 國立交通大學 資訊工程系 90 In this paper, we propose a novel approach to generate the table of video content based on shot description of motion activity and textual information of closed caption in MPEG-Ⅱ sports videos. In order to speed up in scene change detection, instead of examining scene cut frame by frame, GOP-based approach first checks video streams GOP by GOP and then finds out the actual scene boundaries in the frame level. Segmented shots are described by the proposed object-based motion activity descriptor. The descriptor is computed based on the object 2D-histogram, in which long-term consistency of spatial-temporal relationship of moving objects within video shots is considered. Utilizing the characterized features of motion activity in video shots, video clips are recognized by the proposed algorithm of shot identification. Subsequently, the specific shots of interest are selected and the proposed mechanism of closed caption localization is exploited to detect captions in these shots. Moreover, the SOM (Self-Organization Map) based algorithm is designed as a filter to distinguish the superimposed closed captions from the high-textured background regions. Finally, we can construct a sports video content visualization system and provide the table of video content composed of the hierarchical structure of story units, consecutive shots and closed captions. Furthermore, we supply users with the dynamic tree structure of video content. The experimental results show the effectiveness of the proposed system and reveal the feasibility of the hierarchical structuring of video content. Suh-Yin Lee 李素瑛 2002 學位論文 ; thesis 79 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 資訊工程系 === 90 === In this paper, we propose a novel approach to generate the table of video content based on shot description of motion activity and textual information of closed caption in MPEG-Ⅱ sports videos. In order to speed up in scene change detection, instead of examining scene cut frame by frame, GOP-based approach first checks video streams GOP by GOP and then finds out the actual scene boundaries in the frame level. Segmented shots are described by the proposed object-based motion activity descriptor. The descriptor is computed based on the object 2D-histogram, in which long-term consistency of spatial-temporal relationship of moving objects within video shots is considered. Utilizing the characterized features of motion activity in video shots, video clips are recognized by the proposed algorithm of shot identification. Subsequently, the specific shots of interest are selected and the proposed mechanism of closed caption localization is exploited to detect captions in these shots. Moreover, the SOM (Self-Organization Map) based algorithm is designed as a filter to distinguish the superimposed closed captions from the high-textured background regions. Finally, we can construct a sports video content visualization system and provide the table of video content composed of the hierarchical structure of story units, consecutive shots and closed captions. Furthermore, we supply users with the dynamic tree structure of video content. The experimental results show the effectiveness of the proposed system and reveal the feasibility of the hierarchical structuring of video content.
author2 Suh-Yin Lee
author_facet Suh-Yin Lee
Shu-Jiuan Lin
林淑娟
author Shu-Jiuan Lin
林淑娟
spellingShingle Shu-Jiuan Lin
林淑娟
Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
author_sort Shu-Jiuan Lin
title Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_short Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_full Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_fullStr Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_full_unstemmed Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_sort motion activity based shot identification and closed caption localization for video structuring
publishDate 2002
url http://ndltd.ncl.edu.tw/handle/90085530504310189590
work_keys_str_mv AT shujiuanlin motionactivitybasedshotidentificationandclosedcaptionlocalizationforvideostructuring
AT línshūjuān motionactivitybasedshotidentificationandclosedcaptionlocalizationforvideostructuring
AT shujiuanlin lìyòngyùndòngqiángdùfēnxīyǐngpiànpiànduànbiànshíjíhuàmiànzìmùzhēncèjiàngòushìxùnyǐngpiànnèiróngjiégòuzhīyánjiū
AT línshūjuān lìyòngyùndòngqiángdùfēnxīyǐngpiànpiànduànbiànshíjíhuàmiànzìmùzhēncèjiàngòushìxùnyǐngpiànnèiróngjiégòuzhīyánjiū
_version_ 1718324440131960832