AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === In this thesis, we attempt to identify algorithms mentioned in the paper abstract. We further want to discriminate the algorithm proposed in this paper from algorithms only mentioned or compared, since we are more interested in the former. We model this task as...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2017
|
Online Access: | http://ndltd.ncl.edu.tw/handle/p3grat |
id |
ndltd-TW-105NTU05392131 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-105NTU053921312019-05-15T23:39:46Z http://ndltd.ncl.edu.tw/handle/p3grat AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract 最佳化AUC之LSTM-CRF於論文中演算法識別應用 Brian Chen 陳柏穎 碩士 國立臺灣大學 資訊工程學研究所 105 In this thesis, we attempt to identify algorithms mentioned in the paper abstract. We further want to discriminate the algorithm proposed in this paper from algorithms only mentioned or compared, since we are more interested in the former. We model this task as a sequential labeled task and propose to use a state-of-the-art deep learning model LSTM-CRF as our solution. However, the data or labels are generally imbalanced since not all the sentence in the abstract is describing its algorithm. That is, the ratio between different labels is skewed. As a result, it is not suitable to use traditional LSTM-CRF model since it only optimizes accuracy. Instead, it is more reasonable to optimize AUC in imbalanced data because it can deal with skewed labels and perform better in predicting rare labels. Our experiment shows that the proposed AUC-optimized LSTM-CRF outperforms the traditional LSTM-CRF. We also show the ranking of algorithms used currently, and find the trend of different algorithms used in recent years. Moreover, we are able to discover some new algorithms that do not exist in our training data. 林守德 2017 學位論文 ; thesis 33 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === In this thesis, we attempt to identify algorithms mentioned in the paper abstract. We further want to discriminate the algorithm proposed in this paper from algorithms only mentioned or compared, since we are more interested in the former. We model this task as a sequential labeled task and propose to use a state-of-the-art deep learning model LSTM-CRF as our solution. However, the data or labels are generally imbalanced since not all the sentence in the abstract is describing its algorithm. That is, the ratio between different labels is skewed. As a result, it is not suitable to use traditional LSTM-CRF model since it only optimizes accuracy. Instead, it is more reasonable to optimize AUC in imbalanced data because it can deal with skewed labels and perform better in predicting rare labels. Our experiment shows that the proposed AUC-optimized LSTM-CRF outperforms the traditional LSTM-CRF. We also show the ranking of algorithms used currently, and find the trend of different algorithms used in recent years. Moreover, we are able to discover some new algorithms that do not exist in our training data.
|
author2 |
林守德 |
author_facet |
林守德 Brian Chen 陳柏穎 |
author |
Brian Chen 陳柏穎 |
spellingShingle |
Brian Chen 陳柏穎 AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract |
author_sort |
Brian Chen |
title |
AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract |
title_short |
AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract |
title_full |
AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract |
title_fullStr |
AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract |
title_full_unstemmed |
AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract |
title_sort |
auc oriented bidirectional lstm-crf models to identify algorithms described in an abstract |
publishDate |
2017 |
url |
http://ndltd.ncl.edu.tw/handle/p3grat |
work_keys_str_mv |
AT brianchen aucorientedbidirectionallstmcrfmodelstoidentifyalgorithmsdescribedinanabstract AT chénbǎiyǐng aucorientedbidirectionallstmcrfmodelstoidentifyalgorithmsdescribedinanabstract AT brianchen zuìjiāhuàauczhīlstmcrfyúlùnwénzhōngyǎnsuànfǎshíbiéyīngyòng AT chénbǎiyǐng zuìjiāhuàauczhīlstmcrfyúlùnwénzhōngyǎnsuànfǎshíbiéyīngyòng |
_version_ |
1719152291183853568 |