A Bidirectional Iterative Algorithm for Nested Named Entity Recognition

Nested named entity recognition (NER) is a special case of structured prediction in which annotated sequences can be contained inside each other. It is a challenging and significant problem in natural language processing. In this paper, we propose a novel framework for nested named entity recognitio...

Full description

Bibliographic Details
Main Authors: Slawomir Dadas, Jaroslaw Protasiewicz
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9146650/
id doaj-0f517a1bac0f4c2d9f45d6ba803e03b9
record_format Article
spelling doaj-0f517a1bac0f4c2d9f45d6ba803e03b92021-03-30T03:24:31ZengIEEEIEEE Access2169-35362020-01-01813509113510210.1109/ACCESS.2020.30115989146650A Bidirectional Iterative Algorithm for Nested Named Entity RecognitionSlawomir Dadas0https://orcid.org/0000-0002-9177-6685Jaroslaw Protasiewicz1https://orcid.org/0000-0002-9204-921XNational Information Processing Institute, Warsaw, PolandNational Information Processing Institute, Warsaw, PolandNested named entity recognition (NER) is a special case of structured prediction in which annotated sequences can be contained inside each other. It is a challenging and significant problem in natural language processing. In this paper, we propose a novel framework for nested named entity recognition tasks. Our approach is based on a deep learning model which can be called in an iterative way, expanding the set of predicted entity mentions with each subsequent iteration. The proposed framework combines two such models trained to identify named entities in different directions: from general to specific (outside-in), and from specific to general (inside-out). The predictions of both models are then aggregated by a selection policy. We propose and evaluate several selection policies which can be used with our algorithm. Our method does not impose any restrictions on the length of entity mentions, number of entity classes, depth, or structure of the predicted output. The framework has been validated experimentally on four well-known nested named entity recognition datasets: GENIA, NNE, PolEval, and GermEval. The datasets differ in terms of domain (biomedical, news, mixed), language (English, Polish, German), and the structure of nesting (simple, complex). Through extensive tests, we prove that the approach we have proposed outperforms existing methods for nested named entity recognition.https://ieeexplore.ieee.org/document/9146650/Information extractionnatural language processingnested named entity recognition
collection DOAJ
language English
format Article
sources DOAJ
author Slawomir Dadas
Jaroslaw Protasiewicz
spellingShingle Slawomir Dadas
Jaroslaw Protasiewicz
A Bidirectional Iterative Algorithm for Nested Named Entity Recognition
IEEE Access
Information extraction
natural language processing
nested named entity recognition
author_facet Slawomir Dadas
Jaroslaw Protasiewicz
author_sort Slawomir Dadas
title A Bidirectional Iterative Algorithm for Nested Named Entity Recognition
title_short A Bidirectional Iterative Algorithm for Nested Named Entity Recognition
title_full A Bidirectional Iterative Algorithm for Nested Named Entity Recognition
title_fullStr A Bidirectional Iterative Algorithm for Nested Named Entity Recognition
title_full_unstemmed A Bidirectional Iterative Algorithm for Nested Named Entity Recognition
title_sort bidirectional iterative algorithm for nested named entity recognition
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Nested named entity recognition (NER) is a special case of structured prediction in which annotated sequences can be contained inside each other. It is a challenging and significant problem in natural language processing. In this paper, we propose a novel framework for nested named entity recognition tasks. Our approach is based on a deep learning model which can be called in an iterative way, expanding the set of predicted entity mentions with each subsequent iteration. The proposed framework combines two such models trained to identify named entities in different directions: from general to specific (outside-in), and from specific to general (inside-out). The predictions of both models are then aggregated by a selection policy. We propose and evaluate several selection policies which can be used with our algorithm. Our method does not impose any restrictions on the length of entity mentions, number of entity classes, depth, or structure of the predicted output. The framework has been validated experimentally on four well-known nested named entity recognition datasets: GENIA, NNE, PolEval, and GermEval. The datasets differ in terms of domain (biomedical, news, mixed), language (English, Polish, German), and the structure of nesting (simple, complex). Through extensive tests, we prove that the approach we have proposed outperforms existing methods for nested named entity recognition.
topic Information extraction
natural language processing
nested named entity recognition
url https://ieeexplore.ieee.org/document/9146650/
work_keys_str_mv AT slawomirdadas abidirectionaliterativealgorithmfornestednamedentityrecognition
AT jaroslawprotasiewicz abidirectionaliterativealgorithmfornestednamedentityrecognition
AT slawomirdadas bidirectionaliterativealgorithmfornestednamedentityrecognition
AT jaroslawprotasiewicz bidirectionaliterativealgorithmfornestednamedentityrecognition
_version_ 1724183483695235072