AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === In this thesis, we attempt to identify algorithms mentioned in the paper abstract. We further want to discriminate the algorithm proposed in this paper from algorithms only mentioned or compared, since we are more interested in the former. We model this task as...

Full description

Bibliographic Details
Main Authors: Brian Chen, 陳柏穎
Other Authors: 林守德
Format: Others
Language:en_US
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/p3grat
id ndltd-TW-105NTU05392131
record_format oai_dc
spelling ndltd-TW-105NTU053921312019-05-15T23:39:46Z http://ndltd.ncl.edu.tw/handle/p3grat AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract 最佳化AUC之LSTM-CRF於論文中演算法識別應用 Brian Chen 陳柏穎 碩士 國立臺灣大學 資訊工程學研究所 105 In this thesis, we attempt to identify algorithms mentioned in the paper abstract. We further want to discriminate the algorithm proposed in this paper from algorithms only mentioned or compared, since we are more interested in the former. We model this task as a sequential labeled task and propose to use a state-of-the-art deep learning model LSTM-CRF as our solution. However, the data or labels are generally imbalanced since not all the sentence in the abstract is describing its algorithm. That is, the ratio between different labels is skewed. As a result, it is not suitable to use traditional LSTM-CRF model since it only optimizes accuracy. Instead, it is more reasonable to optimize AUC in imbalanced data because it can deal with skewed labels and perform better in predicting rare labels. Our experiment shows that the proposed AUC-optimized LSTM-CRF outperforms the traditional LSTM-CRF. We also show the ranking of algorithms used currently, and find the trend of different algorithms used in recent years. Moreover, we are able to discover some new algorithms that do not exist in our training data. 林守德 2017 學位論文 ; thesis 33 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === In this thesis, we attempt to identify algorithms mentioned in the paper abstract. We further want to discriminate the algorithm proposed in this paper from algorithms only mentioned or compared, since we are more interested in the former. We model this task as a sequential labeled task and propose to use a state-of-the-art deep learning model LSTM-CRF as our solution. However, the data or labels are generally imbalanced since not all the sentence in the abstract is describing its algorithm. That is, the ratio between different labels is skewed. As a result, it is not suitable to use traditional LSTM-CRF model since it only optimizes accuracy. Instead, it is more reasonable to optimize AUC in imbalanced data because it can deal with skewed labels and perform better in predicting rare labels. Our experiment shows that the proposed AUC-optimized LSTM-CRF outperforms the traditional LSTM-CRF. We also show the ranking of algorithms used currently, and find the trend of different algorithms used in recent years. Moreover, we are able to discover some new algorithms that do not exist in our training data.
author2 林守德
author_facet 林守德
Brian Chen
陳柏穎
author Brian Chen
陳柏穎
spellingShingle Brian Chen
陳柏穎
AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
author_sort Brian Chen
title AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_short AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_full AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_fullStr AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_full_unstemmed AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_sort auc oriented bidirectional lstm-crf models to identify algorithms described in an abstract
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/p3grat
work_keys_str_mv AT brianchen aucorientedbidirectionallstmcrfmodelstoidentifyalgorithmsdescribedinanabstract
AT chénbǎiyǐng aucorientedbidirectionallstmcrfmodelstoidentifyalgorithmsdescribedinanabstract
AT brianchen zuìjiāhuàauczhīlstmcrfyúlùnwénzhōngyǎnsuànfǎshíbiéyīngyòng
AT chénbǎiyǐng zuìjiāhuàauczhīlstmcrfyúlùnwénzhōngyǎnsuànfǎshíbiéyīngyòng
_version_ 1719152291183853568