Mining microbe–disease interactions from literature via a transfer learning model

Background: Interactions of microbes and diseases are of great importance for biomedical research. However, large-scale of microbe–disease interactions are hidden in the biomedical literature. The structured databases for microbe–disease interactions are in limited amounts. In this paper, we aim to...

Full description

Bibliographic Details
Main Authors: Chen, J.X (Author), Qiu, Y. (Author), Wu, C. (Author), Xiao, X. (Author), Yang, C. (Author), Yi, J. (Author)
Format: Article
Language:English
Published: BioMed Central Ltd 2021
Subjects:
Online Access:View Fulltext in Publisher
LEADER 03538nam a2200541Ia 4500
001 10.1186-s12859-021-04346-7
008 220427s2021 CNT 000 0 und d
020 |a 14712105 (ISSN) 
245 1 0 |a Mining microbe–disease interactions from literature via a transfer learning model 
260 0 |b BioMed Central Ltd  |c 2021 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1186/s12859-021-04346-7 
520 3 |a Background: Interactions of microbes and diseases are of great importance for biomedical research. However, large-scale of microbe–disease interactions are hidden in the biomedical literature. The structured databases for microbe–disease interactions are in limited amounts. In this paper, we aim to construct a large-scale database for microbe–disease interactions automatically. We attained this goal via applying text mining methods based on a deep learning model with a moderate curation cost. We also built a user-friendly web interface that allows researchers to navigate and query required information. Results: Firstly, we manually constructed a golden-standard corpus and a sliver-standard corpus (SSC) for microbe–disease interactions for curation. Moreover, we proposed a text mining framework for microbe–disease interaction extraction based on a pretrained model BERE. We applied named entity recognition tools to detect microbe and disease mentions from the free biomedical texts. After that, we fine-tuned the pretrained model BERE to recognize relations between targeted entities, which was originally built for drug–target interactions or drug–drug interactions. The introduction of SSC for model fine-tuning greatly improved detection performance for microbe–disease interactions, with an average reduction in error of approximately 10%. The MDIDB website offers data browsing, custom searching for specific diseases or microbes, and batch downloading. Conclusions: Evaluation results demonstrate that our method outperform the baseline model (rule-based PKDE4J) with an average F1-score of 73.81%. For further validation, we randomly sampled nearly 1000 predicted interactions by our model, and manually checked the correctness of each interaction, which gives a 73% accuracy. The MDIDB webiste is freely avaliable throuth http://dbmdi.com/index/. © 2021, The Author(s). 
650 0 4 |a Bacteria 
650 0 4 |a Biomedical literature 
650 0 4 |a Biomedical research 
650 0 4 |a Biomedical Research 
650 0 4 |a data mining 
650 0 4 |a Data Mining 
650 0 4 |a Deep learning 
650 0 4 |a Detection performance 
650 0 4 |a Drug interactions 
650 0 4 |a Evaluation results 
650 0 4 |a Interaction extraction 
650 0 4 |a Large-scale database 
650 0 4 |a Learning systems 
650 0 4 |a machine learning 
650 0 4 |a Machine Learning 
650 0 4 |a medical research 
650 0 4 |a Microbe–disease interactions 
650 0 4 |a Named entity recognition 
650 0 4 |a Named-entity recognition 
650 0 4 |a Natural language processing systems 
650 0 4 |a publication 
650 0 4 |a Publications 
650 0 4 |a Query processing 
650 0 4 |a Relation extraction 
650 0 4 |a Structured database 
650 0 4 |a Text mining 
650 0 4 |a Transfer learning 
650 0 4 |a Transfer learning 
650 0 4 |a User interfaces 
700 1 |a Chen, J.X.  |e author 
700 1 |a Qiu, Y.  |e author 
700 1 |a Wu, C.  |e author 
700 1 |a Xiao, X.  |e author 
700 1 |a Yang, C.  |e author 
700 1 |a Yi, J.  |e author 
773 |t BMC Bioinformatics