Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks

<p>Abstract</p> <p>Background</p> <p>Biological processes are carried out by coordinated modules of interacting molecules. As clustering methods demonstrate that genes with similar expression display increased likelihood of being associated with a common functional modu...

Full description

Bibliographic Details
Main Authors: Kohane Isaac S, Wolfe Cecily J, Butte Atul J
Format: Article
Language:English
Published: BMC 2005-09-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/6/227
id doaj-3db600b1a9014aeeb8517c373a67bbc0
record_format Article
spelling doaj-3db600b1a9014aeeb8517c373a67bbc02020-11-24T22:30:37ZengBMCBMC Bioinformatics1471-21052005-09-016122710.1186/1471-2105-6-227Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networksKohane Isaac SWolfe Cecily JButte Atul J<p>Abstract</p> <p>Background</p> <p>Biological processes are carried out by coordinated modules of interacting molecules. As clustering methods demonstrate that genes with similar expression display increased likelihood of being associated with a common functional module, networks of coexpressed genes provide one framework for assigning gene function. This has informed the guilt-by-association (GBA) heuristic, widely invoked in functional genomics. Yet although the idea of GBA is accepted, the breadth of GBA applicability is uncertain.</p> <p>Results</p> <p>We developed methods to systematically explore the breadth of GBA across a large and varied corpus of expression data to answer the following question: To what extent is the GBA heuristic broadly applicable to the transcriptome and conversely how broadly is GBA captured by <it>a priori </it>knowledge represented in the Gene Ontology (GO)? Our study provides an investigation of the functional organization of five coexpression networks using data from three mammalian organisms. Our method calculates a probabilistic score between each gene and each Gene Ontology category that reflects coexpression enrichment of a GO module. For each GO category we use Receiver Operating Curves to assess whether these probabilistic scores reflect GBA. This methodology applied to five different coexpression networks demonstrates that the signature of guilt-by-association is ubiquitous and reproducible and that the GBA heuristic is broadly applicable across the population of nine hundred Gene Ontology categories. We also demonstrate the existence of highly reproducible patterns of coexpression between some pairs of GO categories.</p> <p>Conclusion</p> <p>We conclude that GBA has universal value and that transcriptional control may be more modular than previously realized. Our analyses also suggest that methodologies combining coexpression measurements across multiple genes in a biologically-defined module can aid in characterizing gene function or in characterizing whether pairs of functions operate together.</p> http://www.biomedcentral.com/1471-2105/6/227
collection DOAJ
language English
format Article
sources DOAJ
author Kohane Isaac S
Wolfe Cecily J
Butte Atul J
spellingShingle Kohane Isaac S
Wolfe Cecily J
Butte Atul J
Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks
BMC Bioinformatics
author_facet Kohane Isaac S
Wolfe Cecily J
Butte Atul J
author_sort Kohane Isaac S
title Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks
title_short Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks
title_full Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks
title_fullStr Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks
title_full_unstemmed Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks
title_sort systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2005-09-01
description <p>Abstract</p> <p>Background</p> <p>Biological processes are carried out by coordinated modules of interacting molecules. As clustering methods demonstrate that genes with similar expression display increased likelihood of being associated with a common functional module, networks of coexpressed genes provide one framework for assigning gene function. This has informed the guilt-by-association (GBA) heuristic, widely invoked in functional genomics. Yet although the idea of GBA is accepted, the breadth of GBA applicability is uncertain.</p> <p>Results</p> <p>We developed methods to systematically explore the breadth of GBA across a large and varied corpus of expression data to answer the following question: To what extent is the GBA heuristic broadly applicable to the transcriptome and conversely how broadly is GBA captured by <it>a priori </it>knowledge represented in the Gene Ontology (GO)? Our study provides an investigation of the functional organization of five coexpression networks using data from three mammalian organisms. Our method calculates a probabilistic score between each gene and each Gene Ontology category that reflects coexpression enrichment of a GO module. For each GO category we use Receiver Operating Curves to assess whether these probabilistic scores reflect GBA. This methodology applied to five different coexpression networks demonstrates that the signature of guilt-by-association is ubiquitous and reproducible and that the GBA heuristic is broadly applicable across the population of nine hundred Gene Ontology categories. We also demonstrate the existence of highly reproducible patterns of coexpression between some pairs of GO categories.</p> <p>Conclusion</p> <p>We conclude that GBA has universal value and that transcriptional control may be more modular than previously realized. Our analyses also suggest that methodologies combining coexpression measurements across multiple genes in a biologically-defined module can aid in characterizing gene function or in characterizing whether pairs of functions operate together.</p>
url http://www.biomedcentral.com/1471-2105/6/227
work_keys_str_mv AT kohaneisaacs systematicsurveyrevealsgeneralapplicabilityofguiltbyassociationwithingenecoexpressionnetworks
AT wolfececilyj systematicsurveyrevealsgeneralapplicabilityofguiltbyassociationwithingenecoexpressionnetworks
AT butteatulj systematicsurveyrevealsgeneralapplicabilityofguiltbyassociationwithingenecoexpressionnetworks
_version_ 1725740108218368000