News Story Clustering with Bag of Word Model and Affinity Propagation

碩士 === 國立中正大學 === 資訊工程研究所 === 99 === The 24-hour news TV channels repeat the same news stories again and again. To skip browsing repeated news stories, we propose a framework to cluster topic-related news stories together, and thus facilitate efficient browsing and summarization. Our proposed system...

Full description

Bibliographic Details
Main Authors: Chin-Chao Huang, 黃朝琴
Other Authors: Wei-Ta Chu
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/90083697719839941592
id ndltd-TW-099CCU00392060
record_format oai_dc
spelling ndltd-TW-099CCU003920602016-04-13T04:16:57Z http://ndltd.ncl.edu.tw/handle/90083697719839941592 News Story Clustering with Bag of Word Model and Affinity Propagation 基於袋字模型及親和性互動演算法之新聞報導分群 Chin-Chao Huang 黃朝琴 碩士 國立中正大學 資訊工程研究所 99 The 24-hour news TV channels repeat the same news stories again and again. To skip browsing repeated news stories, we propose a framework to cluster topic-related news stories together, and thus facilitate efficient browsing and summarization. Our proposed system continuously monitors news broadcast, and automatically segments news videos into shots, removes commercial breaks, and detects anchorpersons. Each news story is represented by the bag of visual word (BoW) model, the bag of trajectory (BoT) model to describe what and how objects present in it. We also utilize concept detectors to detect concepts in news stories, and apply them to construct semantic features. We measure similarity between news stories by the earth mover’s distance, and the affinity propagation algorithm is used to cluster stories of the same topic together. Four news videos captured from Taiwanese news TV channels are studied in this thesis. We evaluate news story clustering in a news TV channel and across different channels. We also show performance of automatic news story segmentation. We verify that the news story clustering problem is much harder than near-duplicate detection and video copy detection, because video content of news stories with the same topic may vary. With the proposed methods, we conclude that various news stories can be effectively clustered. Wei-Ta Chu 朱威達 2011 學位論文 ; thesis 76 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立中正大學 === 資訊工程研究所 === 99 === The 24-hour news TV channels repeat the same news stories again and again. To skip browsing repeated news stories, we propose a framework to cluster topic-related news stories together, and thus facilitate efficient browsing and summarization. Our proposed system continuously monitors news broadcast, and automatically segments news videos into shots, removes commercial breaks, and detects anchorpersons. Each news story is represented by the bag of visual word (BoW) model, the bag of trajectory (BoT) model to describe what and how objects present in it. We also utilize concept detectors to detect concepts in news stories, and apply them to construct semantic features. We measure similarity between news stories by the earth mover’s distance, and the affinity propagation algorithm is used to cluster stories of the same topic together. Four news videos captured from Taiwanese news TV channels are studied in this thesis. We evaluate news story clustering in a news TV channel and across different channels. We also show performance of automatic news story segmentation. We verify that the news story clustering problem is much harder than near-duplicate detection and video copy detection, because video content of news stories with the same topic may vary. With the proposed methods, we conclude that various news stories can be effectively clustered.
author2 Wei-Ta Chu
author_facet Wei-Ta Chu
Chin-Chao Huang
黃朝琴
author Chin-Chao Huang
黃朝琴
spellingShingle Chin-Chao Huang
黃朝琴
News Story Clustering with Bag of Word Model and Affinity Propagation
author_sort Chin-Chao Huang
title News Story Clustering with Bag of Word Model and Affinity Propagation
title_short News Story Clustering with Bag of Word Model and Affinity Propagation
title_full News Story Clustering with Bag of Word Model and Affinity Propagation
title_fullStr News Story Clustering with Bag of Word Model and Affinity Propagation
title_full_unstemmed News Story Clustering with Bag of Word Model and Affinity Propagation
title_sort news story clustering with bag of word model and affinity propagation
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/90083697719839941592
work_keys_str_mv AT chinchaohuang newsstoryclusteringwithbagofwordmodelandaffinitypropagation
AT huángcháoqín newsstoryclusteringwithbagofwordmodelandaffinitypropagation
AT chinchaohuang jīyúdàizìmóxíngjíqīnhéxìnghùdòngyǎnsuànfǎzhīxīnwénbàodǎofēnqún
AT huángcháoqín jīyúdàizìmóxíngjíqīnhéxìnghùdòngyǎnsuànfǎzhīxīnwénbàodǎofēnqún
_version_ 1718222253430145024