Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration

Abstract Background Prediction of drug-disease interactions is promising for either drug repositioning or disease treatment fields. The discovery of novel drug-disease interactions, on one hand can help to find novel indictions for the approved drugs; on the other hand can provide new therapeutic ap...

Full description

Bibliographic Details
Main Authors: Guangsheng Wu, Juan Liu, Caihua Wang
Format: Article
Language:English
Published: BMC 2017-12-01
Series:BMC Medical Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12920-017-0311-0
id doaj-512ce64134b745d3a5f66fc7719755c5
record_format Article
spelling doaj-512ce64134b745d3a5f66fc7719755c52021-04-02T04:08:33ZengBMCBMC Medical Genomics1755-87942017-12-0110S5173010.1186/s12920-017-0311-0Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integrationGuangsheng Wu0Juan Liu1Caihua Wang2State Key Laboratory of Software Engineering, School of Computer Science, Wuhan UniversityState Key Laboratory of Software Engineering, School of Computer Science, Wuhan UniversityState Key Laboratory of Software Engineering, School of Computer Science, Wuhan UniversityAbstract Background Prediction of drug-disease interactions is promising for either drug repositioning or disease treatment fields. The discovery of novel drug-disease interactions, on one hand can help to find novel indictions for the approved drugs; on the other hand can provide new therapeutic approaches for the diseases. Recently, computational methods for finding drug-disease interactions have attracted lots of attention because of their far more higher efficiency and lower cost than the traditional wet experiment methods. However, they still face several challenges, such as the organization of the heterogeneous data, the performance of the model, and so on. Methods In this work, we present to hierarchically integrate the heterogeneous data into three layers. The drug-drug and disease-disease similarities are first calculated separately in each layer, and then the similarities from three layers are linearly fused into comprehensive drug similarities and disease similarities, which can then be used to measure the similarities between two drug-disease pairs. We construct a novel weighted drug-disease pair network, where a node is a drug-disease pair with known or unknown treatment relation, an edge represents the node-node relation which is weighted with the similarity score between two pairs. Now that similar drug-disease pairs are supposed to show similar treatment patterns, we can find the optimal graph cut of the network. The drug-disease pair with unknown relation can then be considered to have similar treatment relation with that within the same cut. Therefore, we develop a semi-supervised graph cut algorithm, SSGC, to find the optimal graph cut, based on which we can identify the potential drug-disease treatment interactions. Results By comparing with three representative network-based methods, SSGC achieves the highest performances, in terms of both AUC score and the identification rates of true drug-disease pairs. The experiments with different integration strategies also demonstrate that considering several sources of data can improve the performances of the predictors. Further case studies on four diseases, the top-ranked drug-disease associations have been confirmed by KEGG, CTD database and the literature, illustrating the usefulness of SSGC. Conclusions The proposed comprehensive similarity scores from multi-views and multiple layers and the graph-cut based algorithm can greatly improve the prediction performances of drug-disease associations.http://link.springer.com/article/10.1186/s12920-017-0311-0Drug-disease interactionIntegration strategySimilarityGraph cutGuilt-by-association
collection DOAJ
language English
format Article
sources DOAJ
author Guangsheng Wu
Juan Liu
Caihua Wang
spellingShingle Guangsheng Wu
Juan Liu
Caihua Wang
Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration
BMC Medical Genomics
Drug-disease interaction
Integration strategy
Similarity
Graph cut
Guilt-by-association
author_facet Guangsheng Wu
Juan Liu
Caihua Wang
author_sort Guangsheng Wu
title Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration
title_short Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration
title_full Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration
title_fullStr Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration
title_full_unstemmed Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration
title_sort predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration
publisher BMC
series BMC Medical Genomics
issn 1755-8794
publishDate 2017-12-01
description Abstract Background Prediction of drug-disease interactions is promising for either drug repositioning or disease treatment fields. The discovery of novel drug-disease interactions, on one hand can help to find novel indictions for the approved drugs; on the other hand can provide new therapeutic approaches for the diseases. Recently, computational methods for finding drug-disease interactions have attracted lots of attention because of their far more higher efficiency and lower cost than the traditional wet experiment methods. However, they still face several challenges, such as the organization of the heterogeneous data, the performance of the model, and so on. Methods In this work, we present to hierarchically integrate the heterogeneous data into three layers. The drug-drug and disease-disease similarities are first calculated separately in each layer, and then the similarities from three layers are linearly fused into comprehensive drug similarities and disease similarities, which can then be used to measure the similarities between two drug-disease pairs. We construct a novel weighted drug-disease pair network, where a node is a drug-disease pair with known or unknown treatment relation, an edge represents the node-node relation which is weighted with the similarity score between two pairs. Now that similar drug-disease pairs are supposed to show similar treatment patterns, we can find the optimal graph cut of the network. The drug-disease pair with unknown relation can then be considered to have similar treatment relation with that within the same cut. Therefore, we develop a semi-supervised graph cut algorithm, SSGC, to find the optimal graph cut, based on which we can identify the potential drug-disease treatment interactions. Results By comparing with three representative network-based methods, SSGC achieves the highest performances, in terms of both AUC score and the identification rates of true drug-disease pairs. The experiments with different integration strategies also demonstrate that considering several sources of data can improve the performances of the predictors. Further case studies on four diseases, the top-ranked drug-disease associations have been confirmed by KEGG, CTD database and the literature, illustrating the usefulness of SSGC. Conclusions The proposed comprehensive similarity scores from multi-views and multiple layers and the graph-cut based algorithm can greatly improve the prediction performances of drug-disease associations.
topic Drug-disease interaction
Integration strategy
Similarity
Graph cut
Guilt-by-association
url http://link.springer.com/article/10.1186/s12920-017-0311-0
work_keys_str_mv AT guangshengwu predictingdrugdiseaseinteractionsbysemisupervisedgraphcutalgorithmandthreelayerdataintegration
AT juanliu predictingdrugdiseaseinteractionsbysemisupervisedgraphcutalgorithmandthreelayerdataintegration
AT caihuawang predictingdrugdiseaseinteractionsbysemisupervisedgraphcutalgorithmandthreelayerdataintegration
_version_ 1724173461991981056