DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways

Cataloging mutated driver genes that confer a selective growth advantage for tumor cells from sporadic passenger mutations is a critical problem in cancer genomic research. Previous studies have reported that some driver genes are not highly frequently mutated and cannot be tested as statistically s...

Full description

Bibliographic Details
Main Authors: Jianing Xi, Minghui Wang, Ao Li
Format: Article
Language:English
Published: PeerJ Inc. 2017-10-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-133.pdf
id doaj-b42bcaec88f241579bafe172ef4dd578
record_format Article
spelling doaj-b42bcaec88f241579bafe172ef4dd5782020-11-24T23:23:02ZengPeerJ Inc.PeerJ Computer Science2376-59922017-10-013e13310.7717/peerj-cs.133DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathwaysJianing Xi0Minghui Wang1Ao Li2School of Information Science and Technology, University of Science and Technology of China, Hefei, ChinaSchool of Information Science and Technology, University of Science and Technology of China, Hefei, ChinaSchool of Information Science and Technology, University of Science and Technology of China, Hefei, ChinaCataloging mutated driver genes that confer a selective growth advantage for tumor cells from sporadic passenger mutations is a critical problem in cancer genomic research. Previous studies have reported that some driver genes are not highly frequently mutated and cannot be tested as statistically significant, which complicates the identification of driver genes. To address this issue, some existing approaches incorporate prior knowledge from an interactome to detect driver genes which may be dysregulated by interaction network context. However, altered operations of many pathways in cancer progression have been frequently observed, and prior knowledge from pathways is not exploited in the driver gene identification task. In this paper, we introduce a driver gene prioritization method called driver gene identification through pathway and interactome information (DGPathinter), which is based on knowledge-based matrix factorization model with prior knowledge from both interactome and pathways incorporated. When DGPathinter is applied on somatic mutation datasets of three types of cancers and evaluated by known driver genes, the prioritizing performances of DGPathinter are better than the existing interactome driven methods. The top ranked genes detected by DGPathinter are also significantly enriched for known driver genes. Moreover, most of the top ranked scored pathways given by DGPathinter are also cancer progression-associated pathways. These results suggest that DGPathinter is a useful tool to identify potential driver genes.https://peerj.com/articles/cs-133.pdfMatrix factorizationPrior knowledgeBioinformaticsData mining
collection DOAJ
language English
format Article
sources DOAJ
author Jianing Xi
Minghui Wang
Ao Li
spellingShingle Jianing Xi
Minghui Wang
Ao Li
DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways
PeerJ Computer Science
Matrix factorization
Prior knowledge
Bioinformatics
Data mining
author_facet Jianing Xi
Minghui Wang
Ao Li
author_sort Jianing Xi
title DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways
title_short DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways
title_full DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways
title_fullStr DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways
title_full_unstemmed DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways
title_sort dgpathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways
publisher PeerJ Inc.
series PeerJ Computer Science
issn 2376-5992
publishDate 2017-10-01
description Cataloging mutated driver genes that confer a selective growth advantage for tumor cells from sporadic passenger mutations is a critical problem in cancer genomic research. Previous studies have reported that some driver genes are not highly frequently mutated and cannot be tested as statistically significant, which complicates the identification of driver genes. To address this issue, some existing approaches incorporate prior knowledge from an interactome to detect driver genes which may be dysregulated by interaction network context. However, altered operations of many pathways in cancer progression have been frequently observed, and prior knowledge from pathways is not exploited in the driver gene identification task. In this paper, we introduce a driver gene prioritization method called driver gene identification through pathway and interactome information (DGPathinter), which is based on knowledge-based matrix factorization model with prior knowledge from both interactome and pathways incorporated. When DGPathinter is applied on somatic mutation datasets of three types of cancers and evaluated by known driver genes, the prioritizing performances of DGPathinter are better than the existing interactome driven methods. The top ranked genes detected by DGPathinter are also significantly enriched for known driver genes. Moreover, most of the top ranked scored pathways given by DGPathinter are also cancer progression-associated pathways. These results suggest that DGPathinter is a useful tool to identify potential driver genes.
topic Matrix factorization
Prior knowledge
Bioinformatics
Data mining
url https://peerj.com/articles/cs-133.pdf
work_keys_str_mv AT jianingxi dgpathinteranovelmodelforidentifyingdrivergenesviaknowledgedrivenmatrixfactorizationwithpriorknowledgefrominteractomeandpathways
AT minghuiwang dgpathinteranovelmodelforidentifyingdrivergenesviaknowledgedrivenmatrixfactorizationwithpriorknowledgefrominteractomeandpathways
AT aoli dgpathinteranovelmodelforidentifyingdrivergenesviaknowledgedrivenmatrixfactorizationwithpriorknowledgefrominteractomeandpathways
_version_ 1725565792255213568