A Bidirectional Iterative Algorithm for Nested Named Entity Recognition
Nested named entity recognition (NER) is a special case of structured prediction in which annotated sequences can be contained inside each other. It is a challenging and significant problem in natural language processing. In this paper, we propose a novel framework for nested named entity recognitio...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9146650/ |
id |
doaj-0f517a1bac0f4c2d9f45d6ba803e03b9 |
---|---|
record_format |
Article |
spelling |
doaj-0f517a1bac0f4c2d9f45d6ba803e03b92021-03-30T03:24:31ZengIEEEIEEE Access2169-35362020-01-01813509113510210.1109/ACCESS.2020.30115989146650A Bidirectional Iterative Algorithm for Nested Named Entity RecognitionSlawomir Dadas0https://orcid.org/0000-0002-9177-6685Jaroslaw Protasiewicz1https://orcid.org/0000-0002-9204-921XNational Information Processing Institute, Warsaw, PolandNational Information Processing Institute, Warsaw, PolandNested named entity recognition (NER) is a special case of structured prediction in which annotated sequences can be contained inside each other. It is a challenging and significant problem in natural language processing. In this paper, we propose a novel framework for nested named entity recognition tasks. Our approach is based on a deep learning model which can be called in an iterative way, expanding the set of predicted entity mentions with each subsequent iteration. The proposed framework combines two such models trained to identify named entities in different directions: from general to specific (outside-in), and from specific to general (inside-out). The predictions of both models are then aggregated by a selection policy. We propose and evaluate several selection policies which can be used with our algorithm. Our method does not impose any restrictions on the length of entity mentions, number of entity classes, depth, or structure of the predicted output. The framework has been validated experimentally on four well-known nested named entity recognition datasets: GENIA, NNE, PolEval, and GermEval. The datasets differ in terms of domain (biomedical, news, mixed), language (English, Polish, German), and the structure of nesting (simple, complex). Through extensive tests, we prove that the approach we have proposed outperforms existing methods for nested named entity recognition.https://ieeexplore.ieee.org/document/9146650/Information extractionnatural language processingnested named entity recognition |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Slawomir Dadas Jaroslaw Protasiewicz |
spellingShingle |
Slawomir Dadas Jaroslaw Protasiewicz A Bidirectional Iterative Algorithm for Nested Named Entity Recognition IEEE Access Information extraction natural language processing nested named entity recognition |
author_facet |
Slawomir Dadas Jaroslaw Protasiewicz |
author_sort |
Slawomir Dadas |
title |
A Bidirectional Iterative Algorithm for Nested Named Entity Recognition |
title_short |
A Bidirectional Iterative Algorithm for Nested Named Entity Recognition |
title_full |
A Bidirectional Iterative Algorithm for Nested Named Entity Recognition |
title_fullStr |
A Bidirectional Iterative Algorithm for Nested Named Entity Recognition |
title_full_unstemmed |
A Bidirectional Iterative Algorithm for Nested Named Entity Recognition |
title_sort |
bidirectional iterative algorithm for nested named entity recognition |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Nested named entity recognition (NER) is a special case of structured prediction in which annotated sequences can be contained inside each other. It is a challenging and significant problem in natural language processing. In this paper, we propose a novel framework for nested named entity recognition tasks. Our approach is based on a deep learning model which can be called in an iterative way, expanding the set of predicted entity mentions with each subsequent iteration. The proposed framework combines two such models trained to identify named entities in different directions: from general to specific (outside-in), and from specific to general (inside-out). The predictions of both models are then aggregated by a selection policy. We propose and evaluate several selection policies which can be used with our algorithm. Our method does not impose any restrictions on the length of entity mentions, number of entity classes, depth, or structure of the predicted output. The framework has been validated experimentally on four well-known nested named entity recognition datasets: GENIA, NNE, PolEval, and GermEval. The datasets differ in terms of domain (biomedical, news, mixed), language (English, Polish, German), and the structure of nesting (simple, complex). Through extensive tests, we prove that the approach we have proposed outperforms existing methods for nested named entity recognition. |
topic |
Information extraction natural language processing nested named entity recognition |
url |
https://ieeexplore.ieee.org/document/9146650/ |
work_keys_str_mv |
AT slawomirdadas abidirectionaliterativealgorithmfornestednamedentityrecognition AT jaroslawprotasiewicz abidirectionaliterativealgorithmfornestednamedentityrecognition AT slawomirdadas bidirectionaliterativealgorithmfornestednamedentityrecognition AT jaroslawprotasiewicz bidirectionaliterativealgorithmfornestednamedentityrecognition |
_version_ |
1724183483695235072 |