Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity

博士 === 國立臺灣大學 === 資訊工程學研究所 === 102 === Discourse relation is the rhetorical relation between two discourse units (i.e. clauses, sentences, or blocks of sentences). The famous discourse relations include Temporal, Contingency, Comparison, Expansion, and so on. A discourse relation indicates how its t...

Full description

Bibliographic Details
Main Authors: Hen-Hsen Huang, 黃瀚萱
Other Authors: Hsin-Hsi Chen
Format: Others
Language:en_US
Published: 2014
Online Access:http://ndltd.ncl.edu.tw/handle/52225480080083270297
id ndltd-TW-102NTU05392119
record_format oai_dc
spelling ndltd-TW-102NTU053921192016-03-09T04:24:22Z http://ndltd.ncl.edu.tw/handle/52225480080083270297 Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity 中文語篇標記解釋與語篇關係辨識及其在意見極性分析之研究 Hen-Hsen Huang 黃瀚萱 博士 國立臺灣大學 資訊工程學研究所 102 Discourse relation is the rhetorical relation between two discourse units (i.e. clauses, sentences, or blocks of sentences). The famous discourse relations include Temporal, Contingency, Comparison, Expansion, and so on. A discourse relation indicates how its two discourse units cohere, and this information influences the meaning of text. Discourse relation is important clue to many applications such as summarization, opinion mining, textual entailment, and event recognition. Recently the research on automatically English discourse relation recognition is rapid growth due to the release of corpora like Rhetoric Structure Theory Discourse Treebank (RST-DT) and Penn Discourse Treebank (PDTB). Unlike English, Chinese discourse relation recognition is more challenging because of the lack of resources and the special issues in Chinese. In this dissertation, we give an in-depth study on Chinese discourse relation analysis. We propose a statistical algorithm to recognize the discourse relation in both levels of inter-sentential and intra-sentential. We also show our preliminary results on Chinese discourse parsing at sentence level. In Chinese, many long sentences contain more than two clauses and form complex discourse structures. Discourse parsing fetches the hierarchical structure and relation among the clauses in a given sentence. Discourse markers are key clue to discourse process, but the use of Chinese discourse marker is inherent ambiguity. To interpret the ambiguous Chinese discourse markers, we propose a semi-supervised framework to estimate the distribution of each Chinese discourse marker from a large-sized corpus, the ClueWeb09. This semi-supervised framework with the estimated distributions finally improve the performance of Chinese discourse relation recognition. Discourse relations and sentiment polarities are interactive in text. We investigate their correlation with ClueWeb09. A moderate-sized data annotated by human are analyzed and compared with the huge data heuristically labeled by machine. As a result, the association between sentiment and discourse is validated. In this dissertation, we focus on the four-way discourse relation classification. We will investigate the finer-grained classification on discourse relations in the future. In addition, we will further tackle the issue of Chinese discourse parsing at paragraph level and document level. Hsin-Hsi Chen 陳信希 2014 學位論文 ; thesis 102 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 博士 === 國立臺灣大學 === 資訊工程學研究所 === 102 === Discourse relation is the rhetorical relation between two discourse units (i.e. clauses, sentences, or blocks of sentences). The famous discourse relations include Temporal, Contingency, Comparison, Expansion, and so on. A discourse relation indicates how its two discourse units cohere, and this information influences the meaning of text. Discourse relation is important clue to many applications such as summarization, opinion mining, textual entailment, and event recognition. Recently the research on automatically English discourse relation recognition is rapid growth due to the release of corpora like Rhetoric Structure Theory Discourse Treebank (RST-DT) and Penn Discourse Treebank (PDTB). Unlike English, Chinese discourse relation recognition is more challenging because of the lack of resources and the special issues in Chinese. In this dissertation, we give an in-depth study on Chinese discourse relation analysis. We propose a statistical algorithm to recognize the discourse relation in both levels of inter-sentential and intra-sentential. We also show our preliminary results on Chinese discourse parsing at sentence level. In Chinese, many long sentences contain more than two clauses and form complex discourse structures. Discourse parsing fetches the hierarchical structure and relation among the clauses in a given sentence. Discourse markers are key clue to discourse process, but the use of Chinese discourse marker is inherent ambiguity. To interpret the ambiguous Chinese discourse markers, we propose a semi-supervised framework to estimate the distribution of each Chinese discourse marker from a large-sized corpus, the ClueWeb09. This semi-supervised framework with the estimated distributions finally improve the performance of Chinese discourse relation recognition. Discourse relations and sentiment polarities are interactive in text. We investigate their correlation with ClueWeb09. A moderate-sized data annotated by human are analyzed and compared with the huge data heuristically labeled by machine. As a result, the association between sentiment and discourse is validated. In this dissertation, we focus on the four-way discourse relation classification. We will investigate the finer-grained classification on discourse relations in the future. In addition, we will further tackle the issue of Chinese discourse parsing at paragraph level and document level.
author2 Hsin-Hsi Chen
author_facet Hsin-Hsi Chen
Hen-Hsen Huang
黃瀚萱
author Hen-Hsen Huang
黃瀚萱
spellingShingle Hen-Hsen Huang
黃瀚萱
Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity
author_sort Hen-Hsen Huang
title Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity
title_short Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity
title_full Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity
title_fullStr Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity
title_full_unstemmed Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity
title_sort interpretation of chinese discourse markers, discourse relation recognition, and their relationships with sentiment polarity
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/52225480080083270297
work_keys_str_mv AT henhsenhuang interpretationofchinesediscoursemarkersdiscourserelationrecognitionandtheirrelationshipswithsentimentpolarity
AT huánghànxuān interpretationofchinesediscoursemarkersdiscourserelationrecognitionandtheirrelationshipswithsentimentpolarity
AT henhsenhuang zhōngwényǔpiānbiāojìjiěshìyǔyǔpiānguānxìbiànshíjíqízàiyìjiànjíxìngfēnxīzhīyánjiū
AT huánghànxuān zhōngwényǔpiānbiāojìjiěshìyǔyǔpiānguānxìbiànshíjíqízàiyìjiànjíxìngfēnxīzhīyánjiū
_version_ 1718200988686352384