Predicting combinative drug pairs towards realistic screening via integrating heterogeneous features

Abstract Background Drug Combination is one of the effective approaches for treating complex diseases. However, determining combinative drug pairs in clinical trials is still costly. Thus, computational approaches are used to identify potential drug pairs in advance. Existing computational approache...

Full description

Bibliographic Details
Main Authors: Jian-Yu Shi, Jia-Xin Li, Ke Gao, Peng Lei, Siu-Ming Yiu
Format: Article
Language:English
Published: BMC 2017-10-01
Series:BMC Bioinformatics
Online Access:http://link.springer.com/article/10.1186/s12859-017-1818-2
id doaj-cc5cbb653a8d4ba78cb0ef087d0f5767
record_format Article
spelling doaj-cc5cbb653a8d4ba78cb0ef087d0f57672020-11-24T21:51:10ZengBMCBMC Bioinformatics1471-21052017-10-0118S121910.1186/s12859-017-1818-2Predicting combinative drug pairs towards realistic screening via integrating heterogeneous featuresJian-Yu Shi0Jia-Xin Li1Ke Gao2Peng Lei3Siu-Ming Yiu4School of Life Sciences, Northwestern Polytechnical UniversitySchool of Life Sciences, Northwestern Polytechnical UniversitySchool of Computer Science, Northwestern Polytechnical UniversityDepartment of Chinese Medicine, Shaanxi Provincial People’s HospitalDepartment of Computer Science, the University of Hong KongAbstract Background Drug Combination is one of the effective approaches for treating complex diseases. However, determining combinative drug pairs in clinical trials is still costly. Thus, computational approaches are used to identify potential drug pairs in advance. Existing computational approaches have the following shortcomings: (i) the lack of an effective integration of heterogeneous features leads to a time-consuming training and even results in an over-fitted classifier; and (ii) the narrow consideration of predicting potential drug combinations only among known drugs having known combinations cannot meet the demand of realistic screenings, which pay more attention to potential combinative pairs among newly-coming drugs that have no approved combination with other drugs at all. Results In this paper, to tackle the above two problems, we propose a novel drug-driven approach for predicting potential combinative pairs on a large scale. We define four new features based on heterogeneous data and design an efficient fusion scheme to integrate these feature. Moreover importantly, we elaborate appropriate cross-validations towards realistic screening scenarios of drug combinations involving both known drugs and new drugs. In addition, we perform an extra investigation to show how each kind of heterogeneous features is related to combinative drug pairs. The investigation inspires the design of our approach. Experiments on real data demonstrate the effectiveness of our fusion scheme for integrating heterogeneous features and its predicting power in three scenarios of realistic screening. In terms of both AUC and AUPR, the prediction among known drugs achieves 0.954 and 0.821, that between known drugs and new drugs achieves 0.909 and 0.635, and that among new drugs achieves 0.809 and 0.592 respectively. Conclusions Our approach provides not only an effective tool to integrate heterogeneous features but also the first tool to predict potential combinative pairs among new drugs.http://link.springer.com/article/10.1186/s12859-017-1818-2
collection DOAJ
language English
format Article
sources DOAJ
author Jian-Yu Shi
Jia-Xin Li
Ke Gao
Peng Lei
Siu-Ming Yiu
spellingShingle Jian-Yu Shi
Jia-Xin Li
Ke Gao
Peng Lei
Siu-Ming Yiu
Predicting combinative drug pairs towards realistic screening via integrating heterogeneous features
BMC Bioinformatics
author_facet Jian-Yu Shi
Jia-Xin Li
Ke Gao
Peng Lei
Siu-Ming Yiu
author_sort Jian-Yu Shi
title Predicting combinative drug pairs towards realistic screening via integrating heterogeneous features
title_short Predicting combinative drug pairs towards realistic screening via integrating heterogeneous features
title_full Predicting combinative drug pairs towards realistic screening via integrating heterogeneous features
title_fullStr Predicting combinative drug pairs towards realistic screening via integrating heterogeneous features
title_full_unstemmed Predicting combinative drug pairs towards realistic screening via integrating heterogeneous features
title_sort predicting combinative drug pairs towards realistic screening via integrating heterogeneous features
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2017-10-01
description Abstract Background Drug Combination is one of the effective approaches for treating complex diseases. However, determining combinative drug pairs in clinical trials is still costly. Thus, computational approaches are used to identify potential drug pairs in advance. Existing computational approaches have the following shortcomings: (i) the lack of an effective integration of heterogeneous features leads to a time-consuming training and even results in an over-fitted classifier; and (ii) the narrow consideration of predicting potential drug combinations only among known drugs having known combinations cannot meet the demand of realistic screenings, which pay more attention to potential combinative pairs among newly-coming drugs that have no approved combination with other drugs at all. Results In this paper, to tackle the above two problems, we propose a novel drug-driven approach for predicting potential combinative pairs on a large scale. We define four new features based on heterogeneous data and design an efficient fusion scheme to integrate these feature. Moreover importantly, we elaborate appropriate cross-validations towards realistic screening scenarios of drug combinations involving both known drugs and new drugs. In addition, we perform an extra investigation to show how each kind of heterogeneous features is related to combinative drug pairs. The investigation inspires the design of our approach. Experiments on real data demonstrate the effectiveness of our fusion scheme for integrating heterogeneous features and its predicting power in three scenarios of realistic screening. In terms of both AUC and AUPR, the prediction among known drugs achieves 0.954 and 0.821, that between known drugs and new drugs achieves 0.909 and 0.635, and that among new drugs achieves 0.809 and 0.592 respectively. Conclusions Our approach provides not only an effective tool to integrate heterogeneous features but also the first tool to predict potential combinative pairs among new drugs.
url http://link.springer.com/article/10.1186/s12859-017-1818-2
work_keys_str_mv AT jianyushi predictingcombinativedrugpairstowardsrealisticscreeningviaintegratingheterogeneousfeatures
AT jiaxinli predictingcombinativedrugpairstowardsrealisticscreeningviaintegratingheterogeneousfeatures
AT kegao predictingcombinativedrugpairstowardsrealisticscreeningviaintegratingheterogeneousfeatures
AT penglei predictingcombinativedrugpairstowardsrealisticscreeningviaintegratingheterogeneousfeatures
AT siumingyiu predictingcombinativedrugpairstowardsrealisticscreeningviaintegratingheterogeneousfeatures
_version_ 1725880072916697088