Towards the identification of protein complexes and functional modules by integrating PPI network and gene expression data

Abstract. Background: Identification of protein complexes and functional modules from protein-protein interaction (PPI) networks is crucial to understanding the principles of cellular organization and predicting protein functions. In the past few years, m...


Bibliographic Details
Main Authors: Li Min, Wu Xuehong, Wang Jianxin, Pan Yi
Format: Article
Language: English
Published: BMC 2012-05-01
Series: BMC Bioinformatics
Online Access: http://www.biomedcentral.com/1471-2105/13/109
id doaj-f8f9438fcc8740059a5399323b8a6fb9
record_format Article
spelling doaj-f8f9438fcc8740059a5399323b8a6fb9
2020-11-24T20:53:40Z
eng
BMC
BMC Bioinformatics
1471-2105
2012-05-01
13
1
109
10.1186/1471-2105-13-109
Towards the identification of protein complexes and functional modules by integrating PPI network and gene expression data
Li Min
Wu Xuehong
Wang Jianxin
Pan Yi
http://www.biomedcentral.com/1471-2105/13/109
collection DOAJ
language English
format Article
sources DOAJ
author Li Min
Wu Xuehong
Wang Jianxin
Pan Yi
title Towards the identification of protein complexes and functional modules by integrating PPI network and gene expression data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2012-05-01
description Abstract
Background: Identification of protein complexes and functional modules from protein-protein interaction (PPI) networks is crucial to understanding the principles of cellular organization and predicting protein functions. In the past few years, many computational methods have been proposed. However, most of them treat PPI networks as static graphs and overlook the dynamics inherent in these networks. Moreover, few of them can distinguish between protein complexes and functional modules.
Results: In this paper, a new framework is proposed to distinguish between protein complexes and functional modules by integrating gene expression data with protein-protein interaction (PPI) data. A series of time-sequenced subnetworks (TSNs) is constructed according to the times at which the interactions were activated. The algorithm TSN-PCD was then developed to identify protein complexes from these TSNs. As protein complexes are significantly related to functional modules, a new algorithm, DFM-CIN, is proposed to discover functional modules based on the identified complexes. The experimental results show that combining temporal gene expression data with PPI data helps identify protein complexes more precisely. A quantitative comparison based on f-measure reveals that our algorithm TSN-PCD outperforms previous protein complex discovery algorithms. Furthermore, we evaluate the identified functional modules using the “Biological Process” annotations in GO (Gene Ontology). The validation shows that the identified functional modules are statistically significant in terms of “Biological Process”. More importantly, the relationship between protein complexes and functional modules is studied.
Conclusions: The proposed framework, based on the integration of PPI data and gene expression data, makes it possible to identify protein complexes and functional modules more effectively. Moreover, the proposed framework and algorithms can distinguish between protein complexes and functional modules. Our findings suggest that functional modules are closely related to protein complexes and that a functional module may consist of one or multiple protein complexes. The program is available at http://netlab.csu.edu.cn/bioinfomatics/limin/DFM-CIN/index.html.
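The abstract does not say how an interaction is judged "activated" at a given time, so the following is only a minimal sketch of the general idea of time-sequenced subnetworks, not the authors' TSN-PCD procedure. It assumes, purely for illustration, that a protein is active at a time point when its expression value reaches a fixed threshold, and that an interaction is active when both of its endpoints are. The names `build_tsns`, `expression`, and `threshold` are hypothetical. The `f_measure` helper is the standard harmonic mean of precision and recall; how the paper matches predicted complexes to known ones is not stated here.

```python
from typing import Dict, List, Set, Tuple

Edge = Tuple[str, str]


def build_tsns(
    edges: List[Edge],
    expression: Dict[str, List[float]],
    threshold: float,
) -> List[Set[Edge]]:
    """Return one set of active edges per time point.

    Assumption (not from the source): a protein is active at time t when
    its expression value is >= threshold, and an edge is active when both
    of its endpoints are active.
    """
    # Use the shortest series so every protein has a value at each t.
    n_times = min((len(s) for s in expression.values()), default=0)
    tsns: List[Set[Edge]] = []
    for t in range(n_times):
        active = {p for p, series in expression.items() if series[t] >= threshold}
        tsns.append({(u, v) for u, v in edges if u in active and v in active})
    return tsns


def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

With edges `[("A", "B"), ("B", "C")]`, expression series `{"A": [1.0, 0.1], "B": [0.9, 0.8], "C": [0.2, 0.9]}`, and a threshold of 0.5, this sketch yields one subnetwork containing only A–B and a second containing only B–C.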
url http://www.biomedcentral.com/1471-2105/13/109