Semi-supervised method for Improving Stance Classification on Insufficient Labeled Chinese Newspaper

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === We aim at developing an intelligent program to classify the stance on the Chinese news article on several controversial topics based on the former crawled data. The difficulty in this problem is the insufficient labeled news so that the model cannot learn enoug...

Full description

Bibliographic Details
Main Authors: Yu Ran, 冉昱
Other Authors: Shou-De Lin
Format: Others
Language:en_US
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/94458318292215345926
id ndltd-TW-105NTU05392006
record_format oai_dc
spelling ndltd-TW-105NTU053920062017-11-12T04:38:58Z http://ndltd.ncl.edu.tw/handle/94458318292215345926 Semi-supervised method for Improving Stance Classification on Insufficient Labeled Chinese Newspaper 半監督學習模型以改善標籤數稀少的中文新聞的立場偵測分類 Yu Ran 冉昱 碩士 國立臺灣大學 資訊工程學研究所 105 We aim at developing an intelligent program to classify the stance on the Chinese news article on several controversial topics based on the former crawled data. The difficulty in this problem is the insufficient labeled news so that the model cannot learn enough knowledge. Wei-Ming mainly focus on the feature division, feature clustering to reduct the feature dimension and get higher accuracy with supervised method. We aimed at how to make full use of unlabeled data and use deep learning representation vector as feature to get the result beyond the Wei-Ming’s method. We first use paragraph vector as news’ feature and compare them with word feature and dependency feature, then we use the semi-supervised method, that is self-learning and ladder network with paragraph vector feature. We get the better result in topic 2 with self-learning and other 3 topics beyond the Wei-Ming’s method. Shou-De Lin 林守德 2017 學位論文 ; thesis 23 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === We aim at developing an intelligent program to classify the stance on the Chinese news article on several controversial topics based on the former crawled data. The difficulty in this problem is the insufficient labeled news so that the model cannot learn enough knowledge. Wei-Ming mainly focus on the feature division, feature clustering to reduct the feature dimension and get higher accuracy with supervised method. We aimed at how to make full use of unlabeled data and use deep learning representation vector as feature to get the result beyond the Wei-Ming’s method. We first use paragraph vector as news’ feature and compare them with word feature and dependency feature, then we use the semi-supervised method, that is self-learning and ladder network with paragraph vector feature. We get the better result in topic 2 with self-learning and other 3 topics beyond the Wei-Ming’s method.
author2 Shou-De Lin
author_facet Shou-De Lin
Yu Ran
冉昱
author Yu Ran
冉昱
spellingShingle Yu Ran
冉昱
Semi-supervised method for Improving Stance Classification on Insufficient Labeled Chinese Newspaper
author_sort Yu Ran
title Semi-supervised method for Improving Stance Classification on Insufficient Labeled Chinese Newspaper
title_short Semi-supervised method for Improving Stance Classification on Insufficient Labeled Chinese Newspaper
title_full Semi-supervised method for Improving Stance Classification on Insufficient Labeled Chinese Newspaper
title_fullStr Semi-supervised method for Improving Stance Classification on Insufficient Labeled Chinese Newspaper
title_full_unstemmed Semi-supervised method for Improving Stance Classification on Insufficient Labeled Chinese Newspaper
title_sort semi-supervised method for improving stance classification on insufficient labeled chinese newspaper
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/94458318292215345926
work_keys_str_mv AT yuran semisupervisedmethodforimprovingstanceclassificationoninsufficientlabeledchinesenewspaper
AT rǎnyù semisupervisedmethodforimprovingstanceclassificationoninsufficientlabeledchinesenewspaper
AT yuran bànjiāndūxuéxímóxíngyǐgǎishànbiāoqiānshùxīshǎodezhōngwénxīnwéndelìchǎngzhēncèfēnlèi
AT rǎnyù bànjiāndūxuéxímóxíngyǐgǎishànbiāoqiānshùxīshǎodezhōngwénxīnwéndelìchǎngzhēncèfēnlèi
_version_ 1718561710018658304