Simultaneous identification of multiple driver pathways in cancer.
Distinguishing the somatic mutations responsible for cancer (driver mutations) from random, passenger mutations is a key challenge in cancer genomics. Driver mutations generally target cellular signaling and regulatory pathways consisting of multiple genes. This heterogeneity complicates the identif...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2013-01-01
|
Series: | PLoS Computational Biology |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23717195/pdf/?tool=EBI |
id |
doaj-18f6e798e1c848eeb0f54886920e4b89 |
---|---|
record_format |
Article |
spelling |
doaj-18f6e798e1c848eeb0f54886920e4b892021-04-21T15:09:24ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582013-01-0195e100305410.1371/journal.pcbi.1003054Simultaneous identification of multiple driver pathways in cancer.Mark D M LeisersonDima BlokhRoded SharanBenjamin J RaphaelDistinguishing the somatic mutations responsible for cancer (driver mutations) from random, passenger mutations is a key challenge in cancer genomics. Driver mutations generally target cellular signaling and regulatory pathways consisting of multiple genes. This heterogeneity complicates the identification of driver mutations by their recurrence across samples, as different combinations of mutations in driver pathways are observed in different samples. We introduce the Multi-Dendrix algorithm for the simultaneous identification of multiple driver pathways de novo in somatic mutation data from a cohort of cancer samples. The algorithm relies on two combinatorial properties of mutations in a driver pathway: high coverage and mutual exclusivity. We derive an integer linear program that finds set of mutations exhibiting these properties. We apply Multi-Dendrix to somatic mutations from glioblastoma, breast cancer, and lung cancer samples. Multi-Dendrix identifies sets of mutations in genes that overlap with known pathways - including Rb, p53, PI(3)K, and cell cycle pathways - and also novel sets of mutually exclusive mutations, including mutations in several transcription factors or other genes involved in transcriptional regulation. These sets are discovered directly from mutation data with no prior knowledge of pathways or gene interactions. We show that Multi-Dendrix outperforms other algorithms for identifying combinations of mutations and is also orders of magnitude faster on genome-scale data. Software available at: http://compbio.cs.brown.edu/software.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23717195/pdf/?tool=EBI |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mark D M Leiserson Dima Blokh Roded Sharan Benjamin J Raphael |
spellingShingle |
Mark D M Leiserson Dima Blokh Roded Sharan Benjamin J Raphael Simultaneous identification of multiple driver pathways in cancer. PLoS Computational Biology |
author_facet |
Mark D M Leiserson Dima Blokh Roded Sharan Benjamin J Raphael |
author_sort |
Mark D M Leiserson |
title |
Simultaneous identification of multiple driver pathways in cancer. |
title_short |
Simultaneous identification of multiple driver pathways in cancer. |
title_full |
Simultaneous identification of multiple driver pathways in cancer. |
title_fullStr |
Simultaneous identification of multiple driver pathways in cancer. |
title_full_unstemmed |
Simultaneous identification of multiple driver pathways in cancer. |
title_sort |
simultaneous identification of multiple driver pathways in cancer. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Computational Biology |
issn |
1553-734X 1553-7358 |
publishDate |
2013-01-01 |
description |
Distinguishing the somatic mutations responsible for cancer (driver mutations) from random, passenger mutations is a key challenge in cancer genomics. Driver mutations generally target cellular signaling and regulatory pathways consisting of multiple genes. This heterogeneity complicates the identification of driver mutations by their recurrence across samples, as different combinations of mutations in driver pathways are observed in different samples. We introduce the Multi-Dendrix algorithm for the simultaneous identification of multiple driver pathways de novo in somatic mutation data from a cohort of cancer samples. The algorithm relies on two combinatorial properties of mutations in a driver pathway: high coverage and mutual exclusivity. We derive an integer linear program that finds set of mutations exhibiting these properties. We apply Multi-Dendrix to somatic mutations from glioblastoma, breast cancer, and lung cancer samples. Multi-Dendrix identifies sets of mutations in genes that overlap with known pathways - including Rb, p53, PI(3)K, and cell cycle pathways - and also novel sets of mutually exclusive mutations, including mutations in several transcription factors or other genes involved in transcriptional regulation. These sets are discovered directly from mutation data with no prior knowledge of pathways or gene interactions. We show that Multi-Dendrix outperforms other algorithms for identifying combinations of mutations and is also orders of magnitude faster on genome-scale data. Software available at: http://compbio.cs.brown.edu/software. |
url |
https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23717195/pdf/?tool=EBI |
work_keys_str_mv |
AT markdmleiserson simultaneousidentificationofmultipledriverpathwaysincancer AT dimablokh simultaneousidentificationofmultipledriverpathwaysincancer AT rodedsharan simultaneousidentificationofmultipledriverpathwaysincancer AT benjaminjraphael simultaneousidentificationofmultipledriverpathwaysincancer |
_version_ |
1714667870812635136 |