A Boundary Assembling Method for Nested Biomedical Named Entity Recognition

Biomedical named entity recognition (BNER) is an important task in biomedical natural language processing, in which neologisms (new terms, words) are coined constantly. Most of the existing work can only identify biomedical named entities with flattened structures and ignore nested biomedical named...

Full description

Bibliographic Details
Main Authors: Yanping Chen, Ying Hu, Yijing Li, Ruizhang Huang, Yongbin Qin, Yuefei Wu, Qinghua Zheng, Ping Chen
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9268950/
id doaj-b8bcd04a0e38416f9e9bb89875ae2fdd
record_format Article
spelling doaj-b8bcd04a0e38416f9e9bb89875ae2fdd2021-03-30T03:51:10ZengIEEEIEEE Access2169-35362020-01-01821414121415210.1109/ACCESS.2020.30401829268950A Boundary Assembling Method for Nested Biomedical Named Entity RecognitionYanping Chen0https://orcid.org/0000-0002-9946-3157Ying Hu1https://orcid.org/0000-0002-4113-4369Yijing Li2Ruizhang Huang3Yongbin Qin4https://orcid.org/0000-0002-1960-8628Yuefei Wu5https://orcid.org/0000-0002-1708-4074Qinghua Zheng6Ping Chen7https://orcid.org/0000-0003-3789-7686Guizhou Provincial Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, ChinaGuizhou Provincial Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, ChinaGuizhou Provincial Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, ChinaGuizhou Provincial Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, ChinaGuizhou Provincial Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, ChinaGuizhou Provincial Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, ChinaDepartment of Computer Science, University of Massachusetts Boston, Boston, MA, USADepartment of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, ChinaBiomedical named entity recognition (BNER) is an important task in biomedical natural language processing, in which neologisms (new terms, words) are coined constantly. Most of the existing work can only identify biomedical named entities with flattened structures and ignore nested biomedical named entities and discontinuous biomedical named entities. Because biomedical domains often use nested structures to represent semantic information of named entities, existing methods fail to utilize abundant information when processing biomedical texts. This paper focuses on identifying nested biomedical named entities using a boundary assembly (BA) model, which is a cascading framework consisting of three steps. First, start and end named entity boundaries are identified and then assembled into named entity candidates. Finally, a classifier is implemented for filtering false named entities. Our approach is effective in handling nesting and discontinuous problems in biomedical named entity recognition tasks. It improves the performance considerably, achieving an F1-score of 81.34% on the GENIA dataset.https://ieeexplore.ieee.org/document/9268950/Biomedical nested named entity recognitiondeep learninginformation extraction
collection DOAJ
language English
format Article
sources DOAJ
author Yanping Chen
Ying Hu
Yijing Li
Ruizhang Huang
Yongbin Qin
Yuefei Wu
Qinghua Zheng
Ping Chen
spellingShingle Yanping Chen
Ying Hu
Yijing Li
Ruizhang Huang
Yongbin Qin
Yuefei Wu
Qinghua Zheng
Ping Chen
A Boundary Assembling Method for Nested Biomedical Named Entity Recognition
IEEE Access
Biomedical nested named entity recognition
deep learning
information extraction
author_facet Yanping Chen
Ying Hu
Yijing Li
Ruizhang Huang
Yongbin Qin
Yuefei Wu
Qinghua Zheng
Ping Chen
author_sort Yanping Chen
title A Boundary Assembling Method for Nested Biomedical Named Entity Recognition
title_short A Boundary Assembling Method for Nested Biomedical Named Entity Recognition
title_full A Boundary Assembling Method for Nested Biomedical Named Entity Recognition
title_fullStr A Boundary Assembling Method for Nested Biomedical Named Entity Recognition
title_full_unstemmed A Boundary Assembling Method for Nested Biomedical Named Entity Recognition
title_sort boundary assembling method for nested biomedical named entity recognition
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Biomedical named entity recognition (BNER) is an important task in biomedical natural language processing, in which neologisms (new terms, words) are coined constantly. Most of the existing work can only identify biomedical named entities with flattened structures and ignore nested biomedical named entities and discontinuous biomedical named entities. Because biomedical domains often use nested structures to represent semantic information of named entities, existing methods fail to utilize abundant information when processing biomedical texts. This paper focuses on identifying nested biomedical named entities using a boundary assembly (BA) model, which is a cascading framework consisting of three steps. First, start and end named entity boundaries are identified and then assembled into named entity candidates. Finally, a classifier is implemented for filtering false named entities. Our approach is effective in handling nesting and discontinuous problems in biomedical named entity recognition tasks. It improves the performance considerably, achieving an F1-score of 81.34% on the GENIA dataset.
topic Biomedical nested named entity recognition
deep learning
information extraction
url https://ieeexplore.ieee.org/document/9268950/
work_keys_str_mv AT yanpingchen aboundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT yinghu aboundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT yijingli aboundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT ruizhanghuang aboundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT yongbinqin aboundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT yuefeiwu aboundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT qinghuazheng aboundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT pingchen aboundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT yanpingchen boundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT yinghu boundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT yijingli boundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT ruizhanghuang boundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT yongbinqin boundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT yuefeiwu boundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT qinghuazheng boundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
AT pingchen boundaryassemblingmethodfornestedbiomedicalnamedentityrecognition
_version_ 1724182776180113408