Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity
博士 === 國立臺灣大學 === 資訊工程學研究所 === 102 === Discourse relation is the rhetorical relation between two discourse units (i.e. clauses, sentences, or blocks of sentences). The famous discourse relations include Temporal, Contingency, Comparison, Expansion, and so on. A discourse relation indicates how its t...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2014
|
Online Access: | http://ndltd.ncl.edu.tw/handle/52225480080083270297 |
id |
ndltd-TW-102NTU05392119 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-102NTU053921192016-03-09T04:24:22Z http://ndltd.ncl.edu.tw/handle/52225480080083270297 Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity 中文語篇標記解釋與語篇關係辨識及其在意見極性分析之研究 Hen-Hsen Huang 黃瀚萱 博士 國立臺灣大學 資訊工程學研究所 102 Discourse relation is the rhetorical relation between two discourse units (i.e. clauses, sentences, or blocks of sentences). The famous discourse relations include Temporal, Contingency, Comparison, Expansion, and so on. A discourse relation indicates how its two discourse units cohere, and this information influences the meaning of text. Discourse relation is important clue to many applications such as summarization, opinion mining, textual entailment, and event recognition. Recently the research on automatically English discourse relation recognition is rapid growth due to the release of corpora like Rhetoric Structure Theory Discourse Treebank (RST-DT) and Penn Discourse Treebank (PDTB). Unlike English, Chinese discourse relation recognition is more challenging because of the lack of resources and the special issues in Chinese. In this dissertation, we give an in-depth study on Chinese discourse relation analysis. We propose a statistical algorithm to recognize the discourse relation in both levels of inter-sentential and intra-sentential. We also show our preliminary results on Chinese discourse parsing at sentence level. In Chinese, many long sentences contain more than two clauses and form complex discourse structures. Discourse parsing fetches the hierarchical structure and relation among the clauses in a given sentence. Discourse markers are key clue to discourse process, but the use of Chinese discourse marker is inherent ambiguity. To interpret the ambiguous Chinese discourse markers, we propose a semi-supervised framework to estimate the distribution of each Chinese discourse marker from a large-sized corpus, the ClueWeb09. This semi-supervised framework with the estimated distributions finally improve the performance of Chinese discourse relation recognition. Discourse relations and sentiment polarities are interactive in text. We investigate their correlation with ClueWeb09. A moderate-sized data annotated by human are analyzed and compared with the huge data heuristically labeled by machine. As a result, the association between sentiment and discourse is validated. In this dissertation, we focus on the four-way discourse relation classification. We will investigate the finer-grained classification on discourse relations in the future. In addition, we will further tackle the issue of Chinese discourse parsing at paragraph level and document level. Hsin-Hsi Chen 陳信希 2014 學位論文 ; thesis 102 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 國立臺灣大學 === 資訊工程學研究所 === 102 === Discourse relation is the rhetorical relation between two discourse units (i.e. clauses, sentences, or blocks of sentences). The famous discourse relations include Temporal, Contingency, Comparison, Expansion, and so on. A discourse relation indicates how its two discourse units cohere, and this information influences the meaning of text. Discourse relation is important clue to many applications such as summarization, opinion mining, textual entailment, and event recognition.
Recently the research on automatically English discourse relation recognition is rapid growth due to the release of corpora like Rhetoric Structure Theory Discourse Treebank (RST-DT) and Penn Discourse Treebank (PDTB). Unlike English, Chinese discourse relation recognition is more challenging because of the lack of resources and the special issues in Chinese.
In this dissertation, we give an in-depth study on Chinese discourse relation analysis. We propose a statistical algorithm to recognize the discourse relation in both levels of inter-sentential and intra-sentential. We also show our preliminary results on Chinese discourse parsing at sentence level. In Chinese, many long sentences contain more than two clauses and form complex discourse structures. Discourse parsing fetches the hierarchical structure and relation among the clauses in a given sentence.
Discourse markers are key clue to discourse process, but the use of Chinese discourse marker is inherent ambiguity. To interpret the ambiguous Chinese discourse markers, we propose a semi-supervised framework to estimate the distribution of each Chinese discourse marker from a large-sized corpus, the ClueWeb09. This semi-supervised framework with the estimated distributions finally improve the performance of Chinese discourse relation recognition.
Discourse relations and sentiment polarities are interactive in text. We investigate their correlation with ClueWeb09. A moderate-sized data annotated by human are analyzed and compared with the huge data heuristically labeled by machine. As a result, the association between sentiment and discourse is validated.
In this dissertation, we focus on the four-way discourse relation classification. We will investigate the finer-grained classification on discourse relations in the future. In addition, we will further tackle the issue of Chinese discourse parsing at paragraph level and document level.
|
author2 |
Hsin-Hsi Chen |
author_facet |
Hsin-Hsi Chen Hen-Hsen Huang 黃瀚萱 |
author |
Hen-Hsen Huang 黃瀚萱 |
spellingShingle |
Hen-Hsen Huang 黃瀚萱 Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity |
author_sort |
Hen-Hsen Huang |
title |
Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity |
title_short |
Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity |
title_full |
Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity |
title_fullStr |
Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity |
title_full_unstemmed |
Interpretation of Chinese Discourse Markers, Discourse Relation Recognition, and their Relationships with Sentiment Polarity |
title_sort |
interpretation of chinese discourse markers, discourse relation recognition, and their relationships with sentiment polarity |
publishDate |
2014 |
url |
http://ndltd.ncl.edu.tw/handle/52225480080083270297 |
work_keys_str_mv |
AT henhsenhuang interpretationofchinesediscoursemarkersdiscourserelationrecognitionandtheirrelationshipswithsentimentpolarity AT huánghànxuān interpretationofchinesediscoursemarkersdiscourserelationrecognitionandtheirrelationshipswithsentimentpolarity AT henhsenhuang zhōngwényǔpiānbiāojìjiěshìyǔyǔpiānguānxìbiànshíjíqízàiyìjiànjíxìngfēnxīzhīyánjiū AT huánghànxuān zhōngwényǔpiānbiāojìjiěshìyǔyǔpiānguānxìbiànshíjíqízàiyìjiànjíxìngfēnxīzhīyánjiū |
_version_ |
1718200988686352384 |