Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning

Manufacturing text often exists as unlabeled data; the entity is fine-grained and the extraction is difficult. The above problems mean that the manufacturing industry knowledge utilization rate is low. This paper proposes a novel Chinese fine-grained NER (named entity recognition) method based on sy...

Full description

Bibliographic Details
Main Authors: Liguo Yao, Haisong Huang, Kuan-Wei Wang, Shih-Huan Chen, Qiaoqiao Xiong
Format: Article
Language:English
Published: MDPI AG 2020-11-01
Series:Symmetry
Subjects:
NLP
Online Access:https://www.mdpi.com/2073-8994/12/12/1986
id doaj-f8230cf85f294f6d8aa51d68c0736177
record_format Article
spelling doaj-f8230cf85f294f6d8aa51d68c07361772020-12-01T00:02:58ZengMDPI AGSymmetry2073-89942020-11-01121986198610.3390/sym12121986Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer LearningLiguo Yao0Haisong Huang1Kuan-Wei Wang2Shih-Huan Chen3Qiaoqiao Xiong4Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, ChinaKey Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, ChinaDepartment of Business Administration, National Central University, Taoyuan 320003, TaiwanDepartment of Industrial Engineering and Management, Yuan Ze University, Taoyuan 320003, TaiwanDepartment of Mechanical and Manufacturing Engineering, Faculty of Engineering, University Putra Malaysia, Serdang, Selangor 43400, MalaysiaManufacturing text often exists as unlabeled data; the entity is fine-grained and the extraction is difficult. The above problems mean that the manufacturing industry knowledge utilization rate is low. This paper proposes a novel Chinese fine-grained NER (named entity recognition) method based on symmetry lightweight deep multinetwork collaboration (ALBERT-AttBiLSTM-CRF) and model transfer considering active learning (MTAL) to research fine-grained named entity recognition of a few labeled Chinese textual data types. The method is divided into two stages. In the first stage, the ALBERT-AttBiLSTM-CRF was applied for verification in the CLUENER2020 dataset (Public dataset) to get a pretrained model; the experiments show that the model obtains an F1 score of 0.8962, which is better than the best baseline algorithm, an improvement of 9.2%. In the second stage, the pretrained model was transferred into the Manufacturing-NER dataset (our dataset), and we used the active learning strategy to optimize the model effect. The final F1 result of Manufacturing-NER was 0.8931 after the model transfer (it was higher than 0.8576 before the model transfer); so, this method represents an improvement of 3.55%. Our method effectively transfers the existing knowledge from public source data to scientific target data, solving the problem of named entity recognition with scarce labeled domain data, and proves its effectiveness.https://www.mdpi.com/2073-8994/12/12/1986Chinese named entity recognitionfine-grainedlightweight deep multi-networkmodel transferNLP
collection DOAJ
language English
format Article
sources DOAJ
author Liguo Yao
Haisong Huang
Kuan-Wei Wang
Shih-Huan Chen
Qiaoqiao Xiong
spellingShingle Liguo Yao
Haisong Huang
Kuan-Wei Wang
Shih-Huan Chen
Qiaoqiao Xiong
Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning
Symmetry
Chinese named entity recognition
fine-grained
lightweight deep multi-network
model transfer
NLP
author_facet Liguo Yao
Haisong Huang
Kuan-Wei Wang
Shih-Huan Chen
Qiaoqiao Xiong
author_sort Liguo Yao
title Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning
title_short Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning
title_full Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning
title_fullStr Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning
title_full_unstemmed Fine-Grained Mechanical Chinese Named Entity Recognition Based on ALBERT-AttBiLSTM-CRF and Transfer Learning
title_sort fine-grained mechanical chinese named entity recognition based on albert-attbilstm-crf and transfer learning
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2020-11-01
description Manufacturing text often exists as unlabeled data; the entity is fine-grained and the extraction is difficult. The above problems mean that the manufacturing industry knowledge utilization rate is low. This paper proposes a novel Chinese fine-grained NER (named entity recognition) method based on symmetry lightweight deep multinetwork collaboration (ALBERT-AttBiLSTM-CRF) and model transfer considering active learning (MTAL) to research fine-grained named entity recognition of a few labeled Chinese textual data types. The method is divided into two stages. In the first stage, the ALBERT-AttBiLSTM-CRF was applied for verification in the CLUENER2020 dataset (Public dataset) to get a pretrained model; the experiments show that the model obtains an F1 score of 0.8962, which is better than the best baseline algorithm, an improvement of 9.2%. In the second stage, the pretrained model was transferred into the Manufacturing-NER dataset (our dataset), and we used the active learning strategy to optimize the model effect. The final F1 result of Manufacturing-NER was 0.8931 after the model transfer (it was higher than 0.8576 before the model transfer); so, this method represents an improvement of 3.55%. Our method effectively transfers the existing knowledge from public source data to scientific target data, solving the problem of named entity recognition with scarce labeled domain data, and proves its effectiveness.
topic Chinese named entity recognition
fine-grained
lightweight deep multi-network
model transfer
NLP
url https://www.mdpi.com/2073-8994/12/12/1986
work_keys_str_mv AT liguoyao finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning
AT haisonghuang finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning
AT kuanweiwang finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning
AT shihhuanchen finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning
AT qiaoqiaoxiong finegrainedmechanicalchinesenamedentityrecognitionbasedonalbertattbilstmcrfandtransferlearning
_version_ 1724411292974841856