Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology
Weaponry equipment names belong to an important military naming entity that is difficult to identify because of features, such as complex components, miscellaneous, and scarce annotation corpus. Here, the automatic recognition of weaponry equipment names is specifically explored, a NER (Named Entity...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9528359/ |
id |
doaj-cc8874e2fa1749caa9eeaac89aab8b35 |
---|---|
record_format |
Article |
spelling |
doaj-cc8874e2fa1749caa9eeaac89aab8b352021-09-17T23:00:22ZengIEEEIEEE Access2169-35362021-01-01912672812673410.1109/ACCESS.2021.31099119528359Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction TechnologyChenguang Liu0https://orcid.org/0000-0002-0058-8073Yongli Yu1Xingxin Li2Peng Wang3https://orcid.org/0000-0002-5931-8852Army Engineering University of the People’s Liberation Army, Shijiazhang, ChinaArmy Engineering University of the People’s Liberation Army, Shijiazhang, ChinaArmy Engineering University of the People’s Liberation Army, Shijiazhang, ChinaTest and Training Research Centre, Army Test and Training Base, Jinzhou, ChinaWeaponry equipment names belong to an important military naming entity that is difficult to identify because of features, such as complex components, miscellaneous, and scarce annotation corpus. Here, the automatic recognition of weaponry equipment names is specifically explored, a NER (Named Entity Recognition) algorithm is proposed based on BI-LSTM-CRF (Bi-directional Long Short Term Memory Conditional Random Field), thereby demonstrating the effectiveness of domain features in domain-specific entity recognition. Firstly, Chinese characters are represented by word embedding and input into the model. Then, the input feature vector sequence is processed by BI-LSTM (Bi-directional Long Short Term Memory) NN (Neural Network) to extract context semantic learning features. Finally, the learned features are connected to the linear CRF (Conditional Random Field), the NEs (Named Entities) in the equipment support field are labeled, and the NER results are obtained and output. The experimental results show that the accuracy of the NER algorithm based on the BI-LSTM-CRF model is 92.02%, the recall rate is 93.21%, and the F1 value reaches 93.88%. The effect of this model is better than the BI-LSTM NN model and LSTM-CRF (Long Short Term Memory Conditional Random Field) NN model. The proposed model provides some references for entity recognition in the field of equipment support.https://ieeexplore.ieee.org/document/9528359/Text information extractionnamed entity recognitionequipment supportautomatic word segmentationinformation identification |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Chenguang Liu Yongli Yu Xingxin Li Peng Wang |
spellingShingle |
Chenguang Liu Yongli Yu Xingxin Li Peng Wang Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology IEEE Access Text information extraction named entity recognition equipment support automatic word segmentation information identification |
author_facet |
Chenguang Liu Yongli Yu Xingxin Li Peng Wang |
author_sort |
Chenguang Liu |
title |
Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology |
title_short |
Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology |
title_full |
Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology |
title_fullStr |
Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology |
title_full_unstemmed |
Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology |
title_sort |
named entity recognition in equipment support field using tri-training algorithm and text information extraction technology |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
Weaponry equipment names belong to an important military naming entity that is difficult to identify because of features, such as complex components, miscellaneous, and scarce annotation corpus. Here, the automatic recognition of weaponry equipment names is specifically explored, a NER (Named Entity Recognition) algorithm is proposed based on BI-LSTM-CRF (Bi-directional Long Short Term Memory Conditional Random Field), thereby demonstrating the effectiveness of domain features in domain-specific entity recognition. Firstly, Chinese characters are represented by word embedding and input into the model. Then, the input feature vector sequence is processed by BI-LSTM (Bi-directional Long Short Term Memory) NN (Neural Network) to extract context semantic learning features. Finally, the learned features are connected to the linear CRF (Conditional Random Field), the NEs (Named Entities) in the equipment support field are labeled, and the NER results are obtained and output. The experimental results show that the accuracy of the NER algorithm based on the BI-LSTM-CRF model is 92.02%, the recall rate is 93.21%, and the F1 value reaches 93.88%. The effect of this model is better than the BI-LSTM NN model and LSTM-CRF (Long Short Term Memory Conditional Random Field) NN model. The proposed model provides some references for entity recognition in the field of equipment support. |
topic |
Text information extraction named entity recognition equipment support automatic word segmentation information identification |
url |
https://ieeexplore.ieee.org/document/9528359/ |
work_keys_str_mv |
AT chenguangliu namedentityrecognitioninequipmentsupportfieldusingtritrainingalgorithmandtextinformationextractiontechnology AT yongliyu namedentityrecognitioninequipmentsupportfieldusingtritrainingalgorithmandtextinformationextractiontechnology AT xingxinli namedentityrecognitioninequipmentsupportfieldusingtritrainingalgorithmandtextinformationextractiontechnology AT pengwang namedentityrecognitioninequipmentsupportfieldusingtritrainingalgorithmandtextinformationextractiontechnology |
_version_ |
1717377095811727360 |