Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning
Manufacturing text often exists as unlabeled data; the entity is fine-grained and the extraction is difficult. The above problems mean that the manufacturing industry knowledge utilization rate is low. This paper proposes a novel Chinese fine-grained NER (named entity recognition) method based on sy...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-11-01
|
Series: | Symmetry |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-8994/12/12/1986 |
id |
doaj-f8230cf85f294f6d8aa51d68c0736177 |
---|---|
record_format |
Article |
spelling |
doaj-f8230cf85f294f6d8aa51d68c07361772020-12-01T00:02:58ZengMDPI AGSymmetry2073-89942020-11-01121986198610.3390/sym12121986Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer LearningLiguo Yao0Haisong Huang1Kuan-Wei Wang2Shih-Huan Chen3Qiaoqiao Xiong4Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, ChinaKey Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, ChinaDepartment of Business Administration, National Central University, Taoyuan 320003, TaiwanDepartment of Industrial Engineering and Management, Yuan Ze University, Taoyuan 320003, TaiwanDepartment of Mechanical and Manufacturing Engineering, Faculty of Engineering, University Putra Malaysia, Serdang, Selangor 43400, MalaysiaManufacturing text often exists as unlabeled data; the entity is fine-grained and the extraction is difficult. The above problems mean that the manufacturing industry knowledge utilization rate is low. This paper proposes a novel Chinese fine-grained NER (named entity recognition) method based on symmetry lightweight deep multinetwork collaboration (ALBERT-AttBiLSTM-CRF) and model transfer considering active learning (MTAL) to research fine-grained named entity recognition of a few labeled Chinese textual data types. The method is divided into two stages. In the first stage, the ALBERT-AttBiLSTM-CRF was applied for verification in the CLUENER2020 dataset (Public dataset) to get a pretrained model; the experiments show that the model obtains an F1 score of 0.8962, which is better than the best baseline algorithm, an improvement of 9.2%. In the second stage, the pretrained model was transferred into the Manufacturing-NER dataset (our dataset), and we used the active learning strategy to optimize the model effect. The final F1 result of Manufacturing-NER was 0.8931 after the model transfer (it was higher than 0.8576 before the model transfer); so, this method represents an improvement of 3.55%. Our method effectively transfers the existing knowledge from public source data to scientific target data, solving the problem of named entity recognition with scarce labeled domain data, and proves its effectiveness.https://www.mdpi.com/2073-8994/12/12/1986Chinese named entity recognitionfine-grainedlightweight deep multi-networkmodel transferNLP |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Liguo Yao Haisong Huang Kuan-Wei Wang Shih-Huan Chen Qiaoqiao Xiong |
spellingShingle |
Liguo Yao Haisong Huang Kuan-Wei Wang Shih-Huan Chen Qiaoqiao Xiong Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning Symmetry Chinese named entity recognition fine-grained lightweight deep multi-network model transfer NLP |
author_facet |
Liguo Yao Haisong Huang Kuan-Wei Wang Shih-Huan Chen Qiaoqiao Xiong |
author_sort |
Liguo Yao |
title |
Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning |
title_short |
Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning |
title_full |
Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning |
title_fullStr |
Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning |
title_full_unstemmed |
Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning |
title_sort |
fine-grained mechanical chinese named entity recognition based on albert-attbilstm-crf and transfer learning |
publisher |
MDPI AG |
series |
Symmetry |
issn |
2073-8994 |
publishDate |
2020-11-01 |
description |
Manufacturing text often exists as unlabeled data; the entity is fine-grained and the extraction is difficult. The above problems mean that the manufacturing industry knowledge utilization rate is low. This paper proposes a novel Chinese fine-grained NER (named entity recognition) method based on symmetry lightweight deep multinetwork collaboration (ALBERT-AttBiLSTM-CRF) and model transfer considering active learning (MTAL) to research fine-grained named entity recognition of a few labeled Chinese textual data types. The method is divided into two stages. In the first stage, the ALBERT-AttBiLSTM-CRF was applied for verification in the CLUENER2020 dataset (Public dataset) to get a pretrained model; the experiments show that the model obtains an F1 score of 0.8962, which is better than the best baseline algorithm, an improvement of 9.2%. In the second stage, the pretrained model was transferred into the Manufacturing-NER dataset (our dataset), and we used the active learning strategy to optimize the model effect. The final F1 result of Manufacturing-NER was 0.8931 after the model transfer (it was higher than 0.8576 before the model transfer); so, this method represents an improvement of 3.55%. Our method effectively transfers the existing knowledge from public source data to scientific target data, solving the problem of named entity recognition with scarce labeled domain data, and proves its effectiveness. |
topic |
Chinese named entity recognition fine-grained lightweight deep multi-network model transfer NLP |
url |
https://www.mdpi.com/2073-8994/12/12/1986 |
work_keys_str_mv |
AT liguoyao finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning AT haisonghuang finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning AT kuanweiwang finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning AT shihhuanchen finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning AT qiaoqiaoxiong finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning |
_version_ |
1724411292974841856 |