driveR: a novel method for prioritizing cancer driver genes using somatic genomics data

Abstract Background Cancer develops due to “driver” alterations. Numerous approaches exist for predicting cancer drivers from cohort-scale genomics data. However, methods for personalized analysis of driver genes are underdeveloped. In this study, we developed a novel personalized/batch analysis app...

Full description

Bibliographic Details
Main Authors: Ege Ülgen, O. Uğur Sezerman
Format: Article
Language:English
Published: BMC 2021-05-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-021-04203-7
id doaj-c67ed18962014c568dbc1348bd00006c
record_format Article
spelling doaj-c67ed18962014c568dbc1348bd00006c2021-05-30T11:52:43ZengBMCBMC Bioinformatics1471-21052021-05-0122111710.1186/s12859-021-04203-7driveR: a novel method for prioritizing cancer driver genes using somatic genomics dataEge Ülgen0O. Uğur Sezerman1Department of Biostatistics and Medical Informatics, School of Medicine, Acibadem Mehmet Ali Aydinlar UniversityDepartment of Biostatistics and Medical Informatics, School of Medicine, Acibadem Mehmet Ali Aydinlar UniversityAbstract Background Cancer develops due to “driver” alterations. Numerous approaches exist for predicting cancer drivers from cohort-scale genomics data. However, methods for personalized analysis of driver genes are underdeveloped. In this study, we developed a novel personalized/batch analysis approach for driver gene prioritization utilizing somatic genomics data, called driveR. Results Combining genomics information and prior biological knowledge, driveR accurately prioritizes cancer driver genes via a multi-task learning model. Testing on 28 different datasets, this study demonstrates that driveR performs adequately, achieving a median AUC of 0.684 (range 0.651–0.861) on the 28 batch analysis test datasets, and a median AUC of 0.773 (range 0–1) on the 5157 personalized analysis test samples. Moreover, it outperforms existing approaches, achieving a significantly higher median AUC than all of MutSigCV (Wilcoxon rank-sum test p < 0.001), DriverNet (p < 0.001), OncodriveFML (p < 0.001) and MutPanning (p < 0.001) on batch analysis test datasets, and a significantly higher median AUC than DawnRank (p < 0.001) and PRODIGY (p < 0.001) on personalized analysis datasets. Conclusions This study demonstrates that the proposed method is an accurate and easy-to-utilize approach for prioritizing driver genes in cancer genomes in personalized or batch analyses. driveR is available on CRAN: https://cran.r-project.org/package=driveR .https://doi.org/10.1186/s12859-021-04203-7Driver genePrioritizationCancerSomatic
collection DOAJ
language English
format Article
sources DOAJ
author Ege Ülgen
O. Uğur Sezerman
spellingShingle Ege Ülgen
O. Uğur Sezerman
driveR: a novel method for prioritizing cancer driver genes using somatic genomics data
BMC Bioinformatics
Driver gene
Prioritization
Cancer
Somatic
author_facet Ege Ülgen
O. Uğur Sezerman
author_sort Ege Ülgen
title driveR: a novel method for prioritizing cancer driver genes using somatic genomics data
title_short driveR: a novel method for prioritizing cancer driver genes using somatic genomics data
title_full driveR: a novel method for prioritizing cancer driver genes using somatic genomics data
title_fullStr driveR: a novel method for prioritizing cancer driver genes using somatic genomics data
title_full_unstemmed driveR: a novel method for prioritizing cancer driver genes using somatic genomics data
title_sort driver: a novel method for prioritizing cancer driver genes using somatic genomics data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2021-05-01
description Abstract Background Cancer develops due to “driver” alterations. Numerous approaches exist for predicting cancer drivers from cohort-scale genomics data. However, methods for personalized analysis of driver genes are underdeveloped. In this study, we developed a novel personalized/batch analysis approach for driver gene prioritization utilizing somatic genomics data, called driveR. Results Combining genomics information and prior biological knowledge, driveR accurately prioritizes cancer driver genes via a multi-task learning model. Testing on 28 different datasets, this study demonstrates that driveR performs adequately, achieving a median AUC of 0.684 (range 0.651–0.861) on the 28 batch analysis test datasets, and a median AUC of 0.773 (range 0–1) on the 5157 personalized analysis test samples. Moreover, it outperforms existing approaches, achieving a significantly higher median AUC than all of MutSigCV (Wilcoxon rank-sum test p < 0.001), DriverNet (p < 0.001), OncodriveFML (p < 0.001) and MutPanning (p < 0.001) on batch analysis test datasets, and a significantly higher median AUC than DawnRank (p < 0.001) and PRODIGY (p < 0.001) on personalized analysis datasets. Conclusions This study demonstrates that the proposed method is an accurate and easy-to-utilize approach for prioritizing driver genes in cancer genomes in personalized or batch analyses. driveR is available on CRAN: https://cran.r-project.org/package=driveR .
topic Driver gene
Prioritization
Cancer
Somatic
url https://doi.org/10.1186/s12859-021-04203-7
work_keys_str_mv AT egeulgen driveranovelmethodforprioritizingcancerdrivergenesusingsomaticgenomicsdata
AT ougursezerman driveranovelmethodforprioritizingcancerdrivergenesusingsomaticgenomicsdata
_version_ 1721419971187179520