DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways
Cataloging mutated driver genes that confer a selective growth advantage for tumor cells from sporadic passenger mutations is a critical problem in cancer genomic research. Previous studies have reported that some driver genes are not highly frequently mutated and cannot be tested as statistically s...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
PeerJ Inc.
2017-10-01
|
Series: | PeerJ Computer Science |
Subjects: | |
Online Access: | https://peerj.com/articles/cs-133.pdf |
id |
doaj-b42bcaec88f241579bafe172ef4dd578 |
---|---|
record_format |
Article |
spelling |
doaj-b42bcaec88f241579bafe172ef4dd5782020-11-24T23:23:02ZengPeerJ Inc.PeerJ Computer Science2376-59922017-10-013e13310.7717/peerj-cs.133DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathwaysJianing Xi0Minghui Wang1Ao Li2School of Information Science and Technology, University of Science and Technology of China, Hefei, ChinaSchool of Information Science and Technology, University of Science and Technology of China, Hefei, ChinaSchool of Information Science and Technology, University of Science and Technology of China, Hefei, ChinaCataloging mutated driver genes that confer a selective growth advantage for tumor cells from sporadic passenger mutations is a critical problem in cancer genomic research. Previous studies have reported that some driver genes are not highly frequently mutated and cannot be tested as statistically significant, which complicates the identification of driver genes. To address this issue, some existing approaches incorporate prior knowledge from an interactome to detect driver genes which may be dysregulated by interaction network context. However, altered operations of many pathways in cancer progression have been frequently observed, and prior knowledge from pathways is not exploited in the driver gene identification task. In this paper, we introduce a driver gene prioritization method called driver gene identification through pathway and interactome information (DGPathinter), which is based on knowledge-based matrix factorization model with prior knowledge from both interactome and pathways incorporated. When DGPathinter is applied on somatic mutation datasets of three types of cancers and evaluated by known driver genes, the prioritizing performances of DGPathinter are better than the existing interactome driven methods. The top ranked genes detected by DGPathinter are also significantly enriched for known driver genes. Moreover, most of the top ranked scored pathways given by DGPathinter are also cancer progression-associated pathways. These results suggest that DGPathinter is a useful tool to identify potential driver genes.https://peerj.com/articles/cs-133.pdfMatrix factorizationPrior knowledgeBioinformaticsData mining |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jianing Xi Minghui Wang Ao Li |
spellingShingle |
Jianing Xi Minghui Wang Ao Li DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways PeerJ Computer Science Matrix factorization Prior knowledge Bioinformatics Data mining |
author_facet |
Jianing Xi Minghui Wang Ao Li |
author_sort |
Jianing Xi |
title |
DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways |
title_short |
DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways |
title_full |
DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways |
title_fullStr |
DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways |
title_full_unstemmed |
DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways |
title_sort |
dgpathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways |
publisher |
PeerJ Inc. |
series |
PeerJ Computer Science |
issn |
2376-5992 |
publishDate |
2017-10-01 |
description |
Cataloging mutated driver genes that confer a selective growth advantage for tumor cells from sporadic passenger mutations is a critical problem in cancer genomic research. Previous studies have reported that some driver genes are not highly frequently mutated and cannot be tested as statistically significant, which complicates the identification of driver genes. To address this issue, some existing approaches incorporate prior knowledge from an interactome to detect driver genes which may be dysregulated by interaction network context. However, altered operations of many pathways in cancer progression have been frequently observed, and prior knowledge from pathways is not exploited in the driver gene identification task. In this paper, we introduce a driver gene prioritization method called driver gene identification through pathway and interactome information (DGPathinter), which is based on knowledge-based matrix factorization model with prior knowledge from both interactome and pathways incorporated. When DGPathinter is applied on somatic mutation datasets of three types of cancers and evaluated by known driver genes, the prioritizing performances of DGPathinter are better than the existing interactome driven methods. The top ranked genes detected by DGPathinter are also significantly enriched for known driver genes. Moreover, most of the top ranked scored pathways given by DGPathinter are also cancer progression-associated pathways. These results suggest that DGPathinter is a useful tool to identify potential driver genes. |
topic |
Matrix factorization Prior knowledge Bioinformatics Data mining |
url |
https://peerj.com/articles/cs-133.pdf |
work_keys_str_mv |
AT jianingxi dgpathinteranovelmodelforidentifyingdrivergenesviaknowledgedrivenmatrixfactorizationwithpriorknowledgefrominteractomeandpathways AT minghuiwang dgpathinteranovelmodelforidentifyingdrivergenesviaknowledgedrivenmatrixfactorizationwithpriorknowledgefrominteractomeandpathways AT aoli dgpathinteranovelmodelforidentifyingdrivergenesviaknowledgedrivenmatrixfactorizationwithpriorknowledgefrominteractomeandpathways |
_version_ |
1725565792255213568 |