Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology

Weaponry equipment names belong to an important military naming entity that is difficult to identify because of features, such as complex components, miscellaneous, and scarce annotation corpus. Here, the automatic recognition of weaponry equipment names is specifically explored, a NER (Named Entity...

Full description

Bibliographic Details
Main Authors: Chenguang Liu, Yongli Yu, Xingxin Li, Peng Wang
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9528359/
id doaj-cc8874e2fa1749caa9eeaac89aab8b35
record_format Article
spelling doaj-cc8874e2fa1749caa9eeaac89aab8b352021-09-17T23:00:22ZengIEEEIEEE Access2169-35362021-01-01912672812673410.1109/ACCESS.2021.31099119528359Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction TechnologyChenguang Liu0https://orcid.org/0000-0002-0058-8073Yongli Yu1Xingxin Li2Peng Wang3https://orcid.org/0000-0002-5931-8852Army Engineering University of the People’s Liberation Army, Shijiazhang, ChinaArmy Engineering University of the People’s Liberation Army, Shijiazhang, ChinaArmy Engineering University of the People’s Liberation Army, Shijiazhang, ChinaTest and Training Research Centre, Army Test and Training Base, Jinzhou, ChinaWeaponry equipment names belong to an important military naming entity that is difficult to identify because of features, such as complex components, miscellaneous, and scarce annotation corpus. Here, the automatic recognition of weaponry equipment names is specifically explored, a NER (Named Entity Recognition) algorithm is proposed based on BI-LSTM-CRF (Bi-directional Long Short Term Memory Conditional Random Field), thereby demonstrating the effectiveness of domain features in domain-specific entity recognition. Firstly, Chinese characters are represented by word embedding and input into the model. Then, the input feature vector sequence is processed by BI-LSTM (Bi-directional Long Short Term Memory) NN (Neural Network) to extract context semantic learning features. Finally, the learned features are connected to the linear CRF (Conditional Random Field), the NEs (Named Entities) in the equipment support field are labeled, and the NER results are obtained and output. The experimental results show that the accuracy of the NER algorithm based on the BI-LSTM-CRF model is 92.02%, the recall rate is 93.21%, and the F1 value reaches 93.88%. The effect of this model is better than the BI-LSTM NN model and LSTM-CRF (Long Short Term Memory Conditional Random Field) NN model. The proposed model provides some references for entity recognition in the field of equipment support.https://ieeexplore.ieee.org/document/9528359/Text information extractionnamed entity recognitionequipment supportautomatic word segmentationinformation identification
collection DOAJ
language English
format Article
sources DOAJ
author Chenguang Liu
Yongli Yu
Xingxin Li
Peng Wang
spellingShingle Chenguang Liu
Yongli Yu
Xingxin Li
Peng Wang
Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology
IEEE Access
Text information extraction
named entity recognition
equipment support
automatic word segmentation
information identification
author_facet Chenguang Liu
Yongli Yu
Xingxin Li
Peng Wang
author_sort Chenguang Liu
title Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology
title_short Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology
title_full Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology
title_fullStr Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology
title_full_unstemmed Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology
title_sort named entity recognition in equipment support field using tri-training algorithm and text information extraction technology
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Weaponry equipment names belong to an important military naming entity that is difficult to identify because of features, such as complex components, miscellaneous, and scarce annotation corpus. Here, the automatic recognition of weaponry equipment names is specifically explored, a NER (Named Entity Recognition) algorithm is proposed based on BI-LSTM-CRF (Bi-directional Long Short Term Memory Conditional Random Field), thereby demonstrating the effectiveness of domain features in domain-specific entity recognition. Firstly, Chinese characters are represented by word embedding and input into the model. Then, the input feature vector sequence is processed by BI-LSTM (Bi-directional Long Short Term Memory) NN (Neural Network) to extract context semantic learning features. Finally, the learned features are connected to the linear CRF (Conditional Random Field), the NEs (Named Entities) in the equipment support field are labeled, and the NER results are obtained and output. The experimental results show that the accuracy of the NER algorithm based on the BI-LSTM-CRF model is 92.02%, the recall rate is 93.21%, and the F1 value reaches 93.88%. The effect of this model is better than the BI-LSTM NN model and LSTM-CRF (Long Short Term Memory Conditional Random Field) NN model. The proposed model provides some references for entity recognition in the field of equipment support.
topic Text information extraction
named entity recognition
equipment support
automatic word segmentation
information identification
url https://ieeexplore.ieee.org/document/9528359/
work_keys_str_mv AT chenguangliu namedentityrecognitioninequipmentsupportfieldusingtritrainingalgorithmandtextinformationextractiontechnology
AT yongliyu namedentityrecognitioninequipmentsupportfieldusingtritrainingalgorithmandtextinformationextractiontechnology
AT xingxinli namedentityrecognitioninequipmentsupportfieldusingtritrainingalgorithmandtextinformationextractiontechnology
AT pengwang namedentityrecognitioninequipmentsupportfieldusingtritrainingalgorithmandtextinformationextractiontechnology
_version_ 1717377095811727360