AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === In this thesis, we attempt to identify algorithms mentioned in the paper abstract. We further want to discriminate the algorithm proposed in this paper from algorithms only mentioned or compared, since we are more interested in the former. We model this task as...

Full description

Bibliographic Details
Main Authors:	Brian Chen, 陳柏穎
Other Authors:	林守德
Format:	Others
Language:	en_US
Published:	2017
Online Access:	http://ndltd.ncl.edu.tw/handle/p3grat

id	ndltd-TW-105NTU05392131
record_format	oai_dc
spelling	ndltd-TW-105NTU053921312019-05-15T23:39:46Z http://ndltd.ncl.edu.tw/handle/p3grat AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract 最佳化AUC之LSTM-CRF於論文中演算法識別應用 Brian Chen 陳柏穎碩士國立臺灣大學資訊工程學研究所 105 In this thesis, we attempt to identify algorithms mentioned in the paper abstract. We further want to discriminate the algorithm proposed in this paper from algorithms only mentioned or compared, since we are more interested in the former. We model this task as a sequential labeled task and propose to use a state-of-the-art deep learning model LSTM-CRF as our solution. However, the data or labels are generally imbalanced since not all the sentence in the abstract is describing its algorithm. That is, the ratio between different labels is skewed. As a result, it is not suitable to use traditional LSTM-CRF model since it only optimizes accuracy. Instead, it is more reasonable to optimize AUC in imbalanced data because it can deal with skewed labels and perform better in predicting rare labels. Our experiment shows that the proposed AUC-optimized LSTM-CRF outperforms the traditional LSTM-CRF. We also show the ranking of algorithms used currently, and find the trend of different algorithms used in recent years. Moreover, we are able to discover some new algorithms that do not exist in our training data. 林守德 2017 學位論文 ; thesis 33 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === In this thesis, we attempt to identify algorithms mentioned in the paper abstract. We further want to discriminate the algorithm proposed in this paper from algorithms only mentioned or compared, since we are more interested in the former. We model this task as a sequential labeled task and propose to use a state-of-the-art deep learning model LSTM-CRF as our solution. However, the data or labels are generally imbalanced since not all the sentence in the abstract is describing its algorithm. That is, the ratio between different labels is skewed. As a result, it is not suitable to use traditional LSTM-CRF model since it only optimizes accuracy. Instead, it is more reasonable to optimize AUC in imbalanced data because it can deal with skewed labels and perform better in predicting rare labels. Our experiment shows that the proposed AUC-optimized LSTM-CRF outperforms the traditional LSTM-CRF. We also show the ranking of algorithms used currently, and find the trend of different algorithms used in recent years. Moreover, we are able to discover some new algorithms that do not exist in our training data.
author2	林守德
author_facet	林守德 Brian Chen 陳柏穎
author	Brian Chen 陳柏穎
spellingShingle	Brian Chen 陳柏穎 AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
author_sort	Brian Chen
title	AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_short	AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_full	AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_fullStr	AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_full_unstemmed	AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract
title_sort	auc oriented bidirectional lstm-crf models to identify algorithms described in an abstract
publishDate	2017
url	http://ndltd.ncl.edu.tw/handle/p3grat
work_keys_str_mv	AT brianchen aucorientedbidirectionallstmcrfmodelstoidentifyalgorithmsdescribedinanabstract AT chénbǎiyǐng aucorientedbidirectionallstmcrfmodelstoidentifyalgorithmsdescribedinanabstract AT brianchen zuìjiāhuàauczhīlstmcrfyúlùnwénzhōngyǎnsuànfǎshíbiéyīngyòng AT chénbǎiyǐng zuìjiāhuàauczhīlstmcrfyúlùnwénzhōngyǎnsuànfǎshíbiéyīngyòng
_version_	1719152291183853568

AUC oriented Bidirectional LSTM-CRF Models to Identify Algorithms Described in an Abstract

Similar Items