Improving deep learning method for biomedical named entity recognition by using entity definition information

Background: Biomedical named entity recognition (NER) is a fundamental task of biomedical text mining that finds the boundaries of entity mentions in biomedical text and determines their entity type. To accelerate the development of biomedical NER techniques in Spanish, the PharmaCoNER organizers la...

Full description

Bibliographic Details
Main Authors:	Chen, Q. (Author), Chen, S. (Author), Tang, B. (Author), Wang, X. (Author), Xiong, Y. (Author), Yan, J. (Author), Zhou, Y. (Author)
Format:	Article
Language:	English
Published:	BioMed Central Ltd 2021
Subjects:	article Biomedical named entity recognition Character recognition Data mining deep learning Deep learning Deep Learning Entity definition information Entity-types F1 scores Information use Learning methods Machine reading comprehension Natural language processing systems One-pass methods Query processing reading Reading comprehension recall Span-level one-pass method State of the art Text processing
Online Access:	View Fulltext in Publisher


LEADER	04101nam a2200517Ia 4500
001	10.1186-s12859-021-04236-y
008	220427s2021 CNT 000 0 und d
020			\|a 14712105 (ISSN)
245	1	0	\|a Improving deep learning method for biomedical named entity recognition by using entity definition information
260		0	\|b BioMed Central Ltd \|c 2021
856			\|z View Fulltext in Publisher \|u https://doi.org/10.1186/s12859-021-04236-y
520	3		\|a Background: Biomedical named entity recognition (NER) is a fundamental task of biomedical text mining that finds the boundaries of entity mentions in biomedical text and determines their entity type. To accelerate the development of biomedical NER techniques in Spanish, the PharmaCoNER organizers launched a competition to recognize pharmacological substances, compounds, and proteins. Biomedical NER is usually recognized as a sequence labeling task, and almost all state-of-the-art sequence labeling methods ignore the meaning of different entity types. In this paper, we investigate some methods to introduce the meaning of entity types in deep learning methods for biomedical NER and apply them to the PharmaCoNER 2019 challenge. The meaning of each entity type is represented by its definition information. Material and method: We investigate how to use entity definition information in the following two methods: (1) SQuad-style machine reading comprehension (MRC) methods that treat entity definition information as query and biomedical text as context and predict answer spans as entities. (2) Span-level one-pass (SOne) methods that predict entity spans of one type by one type and introduce entity type meaning, which is represented by entity definition information. All models are trained and tested on the PharmaCoNER 2019 corpus, and their performance is evaluated by strict micro-average precision, recall, and F1-score. Results: Entity definition information brings improvements to both SQuad-style MRC and SOne methods by about 0.003 in micro-averaged F1-score. The SQuad-style MRC model using entity definition information as query achieves the best performance with a micro-averaged precision of 0.9225, a recall of 0.9050, and an F1-score of 0.9137, respectively. It outperforms the best model of the PharmaCoNER 2019 challenge by 0.0032 in F1-score. Compared with the state-of-the-art model without using manually-crafted features, our model obtains a 1% improvement in F1-score, which is significant. These results indicate that entity definition information is useful for deep learning methods on biomedical NER. Conclusion: Our entity definition information enhanced models achieve the state-of-the-art micro-average F1 score of 0.9137, which implies that entity definition information has a positive impact on biomedical NER detection. In the future, we will explore more entity definition information from knowledge graph. © 2021, The Author(s).
650	0	4	\|a article
650	0	4	\|a Biomedical named entity recognition
650	0	4	\|a Biomedical named entity recognition
650	0	4	\|a Character recognition
650	0	4	\|a Data mining
650	0	4	\|a deep learning
650	0	4	\|a Deep learning
650	0	4	\|a Deep Learning
650	0	4	\|a Entity definition information
650	0	4	\|a Entity definition information
650	0	4	\|a Entity-types
650	0	4	\|a F1 scores
650	0	4	\|a Information use
650	0	4	\|a Learning methods
650	0	4	\|a Machine reading comprehension
650	0	4	\|a Machine reading comprehension
650	0	4	\|a Natural language processing systems
650	0	4	\|a One-pass methods
650	0	4	\|a Query processing
650	0	4	\|a reading
650	0	4	\|a Reading comprehension
650	0	4	\|a recall
650	0	4	\|a Span-level one-pass method
650	0	4	\|a Span-level one-pass method
650	0	4	\|a State of the art
650	0	4	\|a Text processing
700	1		\|a Chen, Q. \|e author
700	1		\|a Chen, S. \|e author
700	1		\|a Tang, B. \|e author
700	1		\|a Wang, X. \|e author
700	1		\|a Xiong, Y. \|e author
700	1		\|a Yan, J. \|e author
700	1		\|a Zhou, Y. \|e author
773			\|t BMC Bioinformatics

Improving deep learning method for biomedical named entity recognition by using entity definition information

Similar Items