Intelligent prescription-generating models of traditional chinese medicine based on deep learning

Objective: This study aimed to construct an intelligent prescription-generating (IPG) model based on deep-learning natural language processing (NLP) technology for multiple prescriptions in Chinese medicine. Materials and Methods: We selected the Treatise on Febrile Diseases and the Synopsis of Gold...

Full description

Bibliographic Details
Main Authors: Qing-Yang Shi, Li-Zi Tan, Lim Lian Seng, Hui-Jun Wang
Format: Article
Language:English
Published: Wolters Kluwer Medknow Publications 2021-01-01
Series:World Journal of Traditional Chinese Medicine
Subjects:
Online Access:http://www.wjtcm.net/article.asp?issn=2311-8571;year=2021;volume=7;issue=3;spage=361;epage=369;aulast=Shi
id doaj-43d2a92316934b1f900f98fc64012268
record_format Article
spelling doaj-43d2a92316934b1f900f98fc640122682021-08-20T06:16:16ZengWolters Kluwer Medknow PublicationsWorld Journal of Traditional Chinese Medicine2311-85712021-01-017336136910.4103/wjtcm.wjtcm_54_21Intelligent prescription-generating models of traditional chinese medicine based on deep learningQing-Yang ShiLi-Zi TanLim Lian SengHui-Jun WangObjective: This study aimed to construct an intelligent prescription-generating (IPG) model based on deep-learning natural language processing (NLP) technology for multiple prescriptions in Chinese medicine. Materials and Methods: We selected the Treatise on Febrile Diseases and the Synopsis of Golden Chamber as basic datasets with EDA data augmentation, and the Yellow Emperor's Canon of Internal Medicine, the Classic of the Miraculous Pivot, and the Classic on Medical Problems as supplementary datasets for fine-tuning. We selected the word-embedding model based on the Imperial Collection of Four, the bidirectional encoder representations from transformers (BERT) model based on the Chinese Wikipedia, and the robustly optimized BERT approach (RoBERTa) model based on the Chinese Wikipedia and a general database. In addition, the BERT model was fine-tuned using the supplementary datasets to generate a Traditional Chinese Medicine-BERT model. Multiple IPG models were constructed based on the pretraining strategy and experiments were performed. Metrics of precision, recall, and F1-score were used to assess the model performance. Based on the trained models, we extracted and visualized the semantic features of some typical texts from treatise on febrile diseases and investigated the patterns. Results: Among all the trained models, the RoBERTa-large model performed the best, with a test set precision of 92.22%, recall of 86.71%, and F1-score of 89.38% and 10-fold cross-validation precision of 94.5% ± 2.5%, recall of 90.47% ± 4.1%, and F1-score of 92.38% ± 2.8%. The semantic feature extraction results based on this model showed that the model was intelligently stratified based on different meanings such that the within-layer's patterns showed the associations of symptom–symptoms, disease–symptoms, and symptom–punctuations, while the between-layer's patterns showed a progressive or dynamic symptom and disease transformation. Conclusions: Deep-learning-based NLP technology significantly improves the performance of IPG model. In addition, NLP-based semantic feature extraction may be vital to further investigate the ancient Chinese medicine texts.http://www.wjtcm.net/article.asp?issn=2311-8571;year=2021;volume=7;issue=3;spage=361;epage=369;aulast=Shiancient books of chinese medicinebidirectional encoder representations from transformersdeep learningintelligent prescription-generating modelspretrained models
collection DOAJ
language English
format Article
sources DOAJ
author Qing-Yang Shi
Li-Zi Tan
Lim Lian Seng
Hui-Jun Wang
spellingShingle Qing-Yang Shi
Li-Zi Tan
Lim Lian Seng
Hui-Jun Wang
Intelligent prescription-generating models of traditional chinese medicine based on deep learning
World Journal of Traditional Chinese Medicine
ancient books of chinese medicine
bidirectional encoder representations from transformers
deep learning
intelligent prescription-generating models
pretrained models
author_facet Qing-Yang Shi
Li-Zi Tan
Lim Lian Seng
Hui-Jun Wang
author_sort Qing-Yang Shi
title Intelligent prescription-generating models of traditional chinese medicine based on deep learning
title_short Intelligent prescription-generating models of traditional chinese medicine based on deep learning
title_full Intelligent prescription-generating models of traditional chinese medicine based on deep learning
title_fullStr Intelligent prescription-generating models of traditional chinese medicine based on deep learning
title_full_unstemmed Intelligent prescription-generating models of traditional chinese medicine based on deep learning
title_sort intelligent prescription-generating models of traditional chinese medicine based on deep learning
publisher Wolters Kluwer Medknow Publications
series World Journal of Traditional Chinese Medicine
issn 2311-8571
publishDate 2021-01-01
description Objective: This study aimed to construct an intelligent prescription-generating (IPG) model based on deep-learning natural language processing (NLP) technology for multiple prescriptions in Chinese medicine. Materials and Methods: We selected the Treatise on Febrile Diseases and the Synopsis of Golden Chamber as basic datasets with EDA data augmentation, and the Yellow Emperor's Canon of Internal Medicine, the Classic of the Miraculous Pivot, and the Classic on Medical Problems as supplementary datasets for fine-tuning. We selected the word-embedding model based on the Imperial Collection of Four, the bidirectional encoder representations from transformers (BERT) model based on the Chinese Wikipedia, and the robustly optimized BERT approach (RoBERTa) model based on the Chinese Wikipedia and a general database. In addition, the BERT model was fine-tuned using the supplementary datasets to generate a Traditional Chinese Medicine-BERT model. Multiple IPG models were constructed based on the pretraining strategy and experiments were performed. Metrics of precision, recall, and F1-score were used to assess the model performance. Based on the trained models, we extracted and visualized the semantic features of some typical texts from treatise on febrile diseases and investigated the patterns. Results: Among all the trained models, the RoBERTa-large model performed the best, with a test set precision of 92.22%, recall of 86.71%, and F1-score of 89.38% and 10-fold cross-validation precision of 94.5% ± 2.5%, recall of 90.47% ± 4.1%, and F1-score of 92.38% ± 2.8%. The semantic feature extraction results based on this model showed that the model was intelligently stratified based on different meanings such that the within-layer's patterns showed the associations of symptom–symptoms, disease–symptoms, and symptom–punctuations, while the between-layer's patterns showed a progressive or dynamic symptom and disease transformation. Conclusions: Deep-learning-based NLP technology significantly improves the performance of IPG model. In addition, NLP-based semantic feature extraction may be vital to further investigate the ancient Chinese medicine texts.
topic ancient books of chinese medicine
bidirectional encoder representations from transformers
deep learning
intelligent prescription-generating models
pretrained models
url http://www.wjtcm.net/article.asp?issn=2311-8571;year=2021;volume=7;issue=3;spage=361;epage=369;aulast=Shi
work_keys_str_mv AT qingyangshi intelligentprescriptiongeneratingmodelsoftraditionalchinesemedicinebasedondeeplearning
AT lizitan intelligentprescriptiongeneratingmodelsoftraditionalchinesemedicinebasedondeeplearning
AT limlianseng intelligentprescriptiongeneratingmodelsoftraditionalchinesemedicinebasedondeeplearning
AT huijunwang intelligentprescriptiongeneratingmodelsoftraditionalchinesemedicinebasedondeeplearning
_version_ 1721201388851036160