Intelligent prescription-generating models of Traditional Chinese Medicine based on deep learning

Objective: This study aimed to construct an intelligent prescription-generating (IPG) model based on deep-learning natural language processing (NLP) technology for multiple prescriptions in Chinese medicine. Materials and Methods: We selected the Treatise on Febrile Diseases and the Synopsis of Gold...


Bibliographic Details
Main Authors:	Qing-Yang Shi, Li-Zi Tan, Lim Lian Seng, Hui-Jun Wang
Format:	Article
Language:	English
Published:	Wolters Kluwer Medknow Publications 2021-01-01
Series:	World Journal of Traditional Chinese Medicine
Subjects:	ancient books of Chinese medicine; bidirectional encoder representations from transformers; deep learning; intelligent prescription-generating models; pretrained models
Online Access:	http://www.wjtcm.net/article.asp?issn=2311-8571;year=2021;volume=7;issue=3;spage=361;epage=369;aulast=Shi

id	doaj-43d2a92316934b1f900f98fc64012268
record_format	Article
spelling	doaj-43d2a92316934b1f900f98fc64012268; 2021-08-20T06:16:16Z; eng; Wolters Kluwer Medknow Publications; World Journal of Traditional Chinese Medicine; ISSN 2311-8571; 2021-01-01; vol. 7, no. 3, pp. 361-369; doi:10.4103/wjtcm.wjtcm_54_21
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Qing-Yang Shi, Li-Zi Tan, Lim Lian Seng, Hui-Jun Wang
author_sort	Qing-Yang Shi
title	Intelligent prescription-generating models of Traditional Chinese Medicine based on deep learning
title_sort	intelligent prescription-generating models of traditional chinese medicine based on deep learning
publisher	Wolters Kluwer Medknow Publications
series	World Journal of Traditional Chinese Medicine
issn	2311-8571
publishDate	2021-01-01
description	Objective: This study aimed to construct an intelligent prescription-generating (IPG) model for multiple Chinese medicine prescriptions, based on deep-learning natural language processing (NLP) technology. Materials and Methods: We selected the Treatise on Febrile Diseases and the Synopsis of the Golden Chamber as basic datasets, with EDA data augmentation, and the Yellow Emperor's Canon of Internal Medicine, the Classic of the Miraculous Pivot, and the Classic on Medical Problems as supplementary datasets for fine-tuning. We selected a word-embedding model based on the Imperial Collection of Four, a bidirectional encoder representations from transformers (BERT) model based on the Chinese Wikipedia, and a robustly optimized BERT approach (RoBERTa) model based on the Chinese Wikipedia and a general database. In addition, the BERT model was fine-tuned on the supplementary datasets to produce a Traditional Chinese Medicine-BERT model. Multiple IPG models were constructed based on the pretraining strategy, and experiments were performed. Precision, recall, and F1-score were used to assess model performance. Using the trained models, we extracted and visualized the semantic features of some typical texts from the Treatise on Febrile Diseases and investigated the patterns. Results: Among all the trained models, the RoBERTa-large model performed best, with a test-set precision of 92.22%, recall of 86.71%, and F1-score of 89.38%, and a 10-fold cross-validation precision of 94.5% ± 2.5%, recall of 90.47% ± 4.1%, and F1-score of 92.38% ± 2.8%. The semantic feature extraction results based on this model showed that the model stratified the text by meaning: the within-layer patterns showed symptom–symptom, disease–symptom, and symptom–punctuation associations, while the between-layer patterns showed a progressive or dynamic transformation of symptoms and diseases. Conclusions: Deep-learning-based NLP technology significantly improves the performance of the IPG model. In addition, NLP-based semantic feature extraction may be vital for further investigation of ancient Chinese medicine texts.
topic	ancient books of Chinese medicine; bidirectional encoder representations from transformers; deep learning; intelligent prescription-generating models; pretrained models
url	http://www.wjtcm.net/article.asp?issn=2311-8571;year=2021;volume=7;issue=3;spage=361;epage=369;aulast=Shi
_version_	1721201388851036160
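
The test-set F1-score quoted in the description field can be checked against the reported precision and recall, because F1 is their harmonic mean. The following is a minimal Python sketch of that check. The function name `f1_from_precision_recall` is hypothetical and does not come from the article, and the sketch assumes the reported F1 was derived from the aggregate precision and recall rather than averaged per sample.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        # Avoid division by zero when both metrics are zero.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Test-set values reported for the RoBERTa-large model, in percent.
test_f1 = f1_from_precision_recall(92.22, 86.71)
print(f"{test_f1:.2f}")  # compare with the reported test-set F1 of 89.38
```

The 10-fold cross-validation figures are means over folds, so the harmonic mean of the mean precision and mean recall need not match the reported mean F1-score exactly.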


Similar Items