Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring

碩士 === 國立交通大學 === 資訊工程系 === 90 === In this paper, we propose a novel approach to generate the table of video content based on shot description of motion activity and textual information of closed caption in MPEG-Ⅱ sports videos. In order to speed up in scene change detection, instead of e...

Full description

Bibliographic Details
Main Authors:	Shu-Jiuan Lin, 林淑娟
Other Authors:	Suh-Yin Lee
Format:	Others
Language:	en_US
Published:	2002
Online Access:	http://ndltd.ncl.edu.tw/handle/90085530504310189590

id	ndltd-TW-090NCTU0392028
record_format	oai_dc
spelling	ndltd-TW-090NCTU03920282016-06-27T16:08:59Z http://ndltd.ncl.edu.tw/handle/90085530504310189590 Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring 利用運動強度分析,影片片段辨識及畫面字幕偵測建構視訊影片內容結構之研究 Shu-Jiuan Lin 林淑娟碩士國立交通大學資訊工程系 90 In this paper, we propose a novel approach to generate the table of video content based on shot description of motion activity and textual information of closed caption in MPEG-Ⅱ sports videos. In order to speed up in scene change detection, instead of examining scene cut frame by frame, GOP-based approach first checks video streams GOP by GOP and then finds out the actual scene boundaries in the frame level. Segmented shots are described by the proposed object-based motion activity descriptor. The descriptor is computed based on the object 2D-histogram, in which long-term consistency of spatial-temporal relationship of moving objects within video shots is considered. Utilizing the characterized features of motion activity in video shots, video clips are recognized by the proposed algorithm of shot identification. Subsequently, the specific shots of interest are selected and the proposed mechanism of closed caption localization is exploited to detect captions in these shots. Moreover, the SOM (Self-Organization Map) based algorithm is designed as a filter to distinguish the superimposed closed captions from the high-textured background regions. Finally, we can construct a sports video content visualization system and provide the table of video content composed of the hierarchical structure of story units, consecutive shots and closed captions. Furthermore, we supply users with the dynamic tree structure of video content. The experimental results show the effectiveness of the proposed system and reveal the feasibility of the hierarchical structuring of video content. Suh-Yin Lee 李素瑛 2002 學位論文 ; thesis 79 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立交通大學 === 資訊工程系 === 90 === In this paper, we propose a novel approach to generate the table of video content based on shot description of motion activity and textual information of closed caption in MPEG-Ⅱ sports videos. In order to speed up in scene change detection, instead of examining scene cut frame by frame, GOP-based approach first checks video streams GOP by GOP and then finds out the actual scene boundaries in the frame level. Segmented shots are described by the proposed object-based motion activity descriptor. The descriptor is computed based on the object 2D-histogram, in which long-term consistency of spatial-temporal relationship of moving objects within video shots is considered. Utilizing the characterized features of motion activity in video shots, video clips are recognized by the proposed algorithm of shot identification. Subsequently, the specific shots of interest are selected and the proposed mechanism of closed caption localization is exploited to detect captions in these shots. Moreover, the SOM (Self-Organization Map) based algorithm is designed as a filter to distinguish the superimposed closed captions from the high-textured background regions. Finally, we can construct a sports video content visualization system and provide the table of video content composed of the hierarchical structure of story units, consecutive shots and closed captions. Furthermore, we supply users with the dynamic tree structure of video content. The experimental results show the effectiveness of the proposed system and reveal the feasibility of the hierarchical structuring of video content.
author2	Suh-Yin Lee
author_facet	Suh-Yin Lee Shu-Jiuan Lin 林淑娟
author	Shu-Jiuan Lin 林淑娟
spellingShingle	Shu-Jiuan Lin 林淑娟 Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
author_sort	Shu-Jiuan Lin
title	Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_short	Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_full	Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_fullStr	Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_full_unstemmed	Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring
title_sort	motion activity based shot identification and closed caption localization for video structuring
publishDate	2002
url	http://ndltd.ncl.edu.tw/handle/90085530504310189590
work_keys_str_mv	AT shujiuanlin motionactivitybasedshotidentificationandclosedcaptionlocalizationforvideostructuring AT línshūjuān motionactivitybasedshotidentificationandclosedcaptionlocalizationforvideostructuring AT shujiuanlin lìyòngyùndòngqiángdùfēnxīyǐngpiànpiànduànbiànshíjíhuàmiànzìmùzhēncèjiàngòushìxùnyǐngpiànnèiróngjiégòuzhīyánjiū AT línshūjuān lìyòngyùndòngqiángdùfēnxīyǐngpiànpiànduànbiànshíjíhuàmiànzìmùzhēncèjiàngòushìxùnyǐngpiànnèiróngjiégòuzhīyánjiū
_version_	1718324440131960832

Motion Activity Based Shot Identification and Closed Caption Localization for Video Structuring

Similar Items