driveR: a novel method for prioritizing cancer driver genes using somatic genomics data
Abstract Background Cancer develops due to “driver” alterations. Numerous approaches exist for predicting cancer drivers from cohort-scale genomics data. However, methods for personalized analysis of driver genes are underdeveloped. In this study, we developed a novel personalized/batch analysis app...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-05-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-021-04203-7 |
id |
doaj-c67ed18962014c568dbc1348bd00006c |
---|---|
record_format |
Article |
spelling |
doaj-c67ed18962014c568dbc1348bd00006c2021-05-30T11:52:43ZengBMCBMC Bioinformatics1471-21052021-05-0122111710.1186/s12859-021-04203-7driveR: a novel method for prioritizing cancer driver genes using somatic genomics dataEge Ülgen0O. Uğur Sezerman1Department of Biostatistics and Medical Informatics, School of Medicine, Acibadem Mehmet Ali Aydinlar UniversityDepartment of Biostatistics and Medical Informatics, School of Medicine, Acibadem Mehmet Ali Aydinlar UniversityAbstract Background Cancer develops due to “driver” alterations. Numerous approaches exist for predicting cancer drivers from cohort-scale genomics data. However, methods for personalized analysis of driver genes are underdeveloped. In this study, we developed a novel personalized/batch analysis approach for driver gene prioritization utilizing somatic genomics data, called driveR. Results Combining genomics information and prior biological knowledge, driveR accurately prioritizes cancer driver genes via a multi-task learning model. Testing on 28 different datasets, this study demonstrates that driveR performs adequately, achieving a median AUC of 0.684 (range 0.651–0.861) on the 28 batch analysis test datasets, and a median AUC of 0.773 (range 0–1) on the 5157 personalized analysis test samples. Moreover, it outperforms existing approaches, achieving a significantly higher median AUC than all of MutSigCV (Wilcoxon rank-sum test p < 0.001), DriverNet (p < 0.001), OncodriveFML (p < 0.001) and MutPanning (p < 0.001) on batch analysis test datasets, and a significantly higher median AUC than DawnRank (p < 0.001) and PRODIGY (p < 0.001) on personalized analysis datasets. Conclusions This study demonstrates that the proposed method is an accurate and easy-to-utilize approach for prioritizing driver genes in cancer genomes in personalized or batch analyses. driveR is available on CRAN: https://cran.r-project.org/package=driveR .https://doi.org/10.1186/s12859-021-04203-7Driver genePrioritizationCancerSomatic |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ege Ülgen O. Uğur Sezerman |
spellingShingle |
Ege Ülgen O. Uğur Sezerman driveR: a novel method for prioritizing cancer driver genes using somatic genomics data BMC Bioinformatics Driver gene Prioritization Cancer Somatic |
author_facet |
Ege Ülgen O. Uğur Sezerman |
author_sort |
Ege Ülgen |
title |
driveR: a novel method for prioritizing cancer driver genes using somatic genomics data |
title_short |
driveR: a novel method for prioritizing cancer driver genes using somatic genomics data |
title_full |
driveR: a novel method for prioritizing cancer driver genes using somatic genomics data |
title_fullStr |
driveR: a novel method for prioritizing cancer driver genes using somatic genomics data |
title_full_unstemmed |
driveR: a novel method for prioritizing cancer driver genes using somatic genomics data |
title_sort |
driver: a novel method for prioritizing cancer driver genes using somatic genomics data |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2021-05-01 |
description |
Abstract Background Cancer develops due to “driver” alterations. Numerous approaches exist for predicting cancer drivers from cohort-scale genomics data. However, methods for personalized analysis of driver genes are underdeveloped. In this study, we developed a novel personalized/batch analysis approach for driver gene prioritization utilizing somatic genomics data, called driveR. Results Combining genomics information and prior biological knowledge, driveR accurately prioritizes cancer driver genes via a multi-task learning model. Testing on 28 different datasets, this study demonstrates that driveR performs adequately, achieving a median AUC of 0.684 (range 0.651–0.861) on the 28 batch analysis test datasets, and a median AUC of 0.773 (range 0–1) on the 5157 personalized analysis test samples. Moreover, it outperforms existing approaches, achieving a significantly higher median AUC than all of MutSigCV (Wilcoxon rank-sum test p < 0.001), DriverNet (p < 0.001), OncodriveFML (p < 0.001) and MutPanning (p < 0.001) on batch analysis test datasets, and a significantly higher median AUC than DawnRank (p < 0.001) and PRODIGY (p < 0.001) on personalized analysis datasets. Conclusions This study demonstrates that the proposed method is an accurate and easy-to-utilize approach for prioritizing driver genes in cancer genomes in personalized or batch analyses. driveR is available on CRAN: https://cran.r-project.org/package=driveR . |
topic |
Driver gene Prioritization Cancer Somatic |
url |
https://doi.org/10.1186/s12859-021-04203-7 |
work_keys_str_mv |
AT egeulgen driveranovelmethodforprioritizingcancerdrivergenesusingsomaticgenomicsdata AT ougursezerman driveranovelmethodforprioritizingcancerdrivergenesusingsomaticgenomicsdata |
_version_ |
1721419971187179520 |