R/BHC: fast Bayesian hierarchical clustering for microarray data

<p>Abstract</p> <p>Background</p> <p>Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data analysis, little attention has been paid to uncertainty in the results obtain...

Full description

Bibliographic Details
Main Authors: Grant Murray, Truman William M, Ghahramani Zoubin, Xu Yang, Heller Katherine, Savage Richard S, Denby Katherine J, Wild David L
Format: Article
Language:English
Published: BMC 2009-08-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/10/242
id doaj-39cf0cf8047944bea1bdc384c02d6126
record_format Article
spelling doaj-39cf0cf8047944bea1bdc384c02d61262020-11-25T00:42:33ZengBMCBMC Bioinformatics1471-21052009-08-0110124210.1186/1471-2105-10-242R/BHC: fast Bayesian hierarchical clustering for microarray dataGrant MurrayTruman William MGhahramani ZoubinXu YangHeller KatherineSavage Richard SDenby Katherine JWild David L<p>Abstract</p> <p>Background</p> <p>Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data analysis, little attention has been paid to uncertainty in the results obtained.</p> <p>Results</p> <p>We present an R/Bioconductor port of a fast novel algorithm for Bayesian agglomerative hierarchical clustering and demonstrate its use in clustering gene expression microarray data. The method performs bottom-up hierarchical clustering, using a Dirichlet Process (infinite mixture) to model uncertainty in the data and Bayesian model selection to decide at each step which clusters to merge.</p> <p>Conclusion</p> <p>Biologically plausible results are presented from a well studied data set: expression profiles of <it>A. thaliana </it>subjected to a variety of biotic and abiotic stresses. Our method avoids several limitations of traditional methods, for example how many clusters there should be and how to choose a principled distance metric.</p> http://www.biomedcentral.com/1471-2105/10/242
collection DOAJ
language English
format Article
sources DOAJ
author Grant Murray
Truman William M
Ghahramani Zoubin
Xu Yang
Heller Katherine
Savage Richard S
Denby Katherine J
Wild David L
spellingShingle Grant Murray
Truman William M
Ghahramani Zoubin
Xu Yang
Heller Katherine
Savage Richard S
Denby Katherine J
Wild David L
R/BHC: fast Bayesian hierarchical clustering for microarray data
BMC Bioinformatics
author_facet Grant Murray
Truman William M
Ghahramani Zoubin
Xu Yang
Heller Katherine
Savage Richard S
Denby Katherine J
Wild David L
author_sort Grant Murray
title R/BHC: fast Bayesian hierarchical clustering for microarray data
title_short R/BHC: fast Bayesian hierarchical clustering for microarray data
title_full R/BHC: fast Bayesian hierarchical clustering for microarray data
title_fullStr R/BHC: fast Bayesian hierarchical clustering for microarray data
title_full_unstemmed R/BHC: fast Bayesian hierarchical clustering for microarray data
title_sort r/bhc: fast bayesian hierarchical clustering for microarray data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2009-08-01
description <p>Abstract</p> <p>Background</p> <p>Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data analysis, little attention has been paid to uncertainty in the results obtained.</p> <p>Results</p> <p>We present an R/Bioconductor port of a fast novel algorithm for Bayesian agglomerative hierarchical clustering and demonstrate its use in clustering gene expression microarray data. The method performs bottom-up hierarchical clustering, using a Dirichlet Process (infinite mixture) to model uncertainty in the data and Bayesian model selection to decide at each step which clusters to merge.</p> <p>Conclusion</p> <p>Biologically plausible results are presented from a well studied data set: expression profiles of <it>A. thaliana </it>subjected to a variety of biotic and abiotic stresses. Our method avoids several limitations of traditional methods, for example how many clusters there should be and how to choose a principled distance metric.</p>
url http://www.biomedcentral.com/1471-2105/10/242
work_keys_str_mv AT grantmurray rbhcfastbayesianhierarchicalclusteringformicroarraydata
AT trumanwilliamm rbhcfastbayesianhierarchicalclusteringformicroarraydata
AT ghahramanizoubin rbhcfastbayesianhierarchicalclusteringformicroarraydata
AT xuyang rbhcfastbayesianhierarchicalclusteringformicroarraydata
AT hellerkatherine rbhcfastbayesianhierarchicalclusteringformicroarraydata
AT savagerichards rbhcfastbayesianhierarchicalclusteringformicroarraydata
AT denbykatherinej rbhcfastbayesianhierarchicalclusteringformicroarraydata
AT wilddavidl rbhcfastbayesianhierarchicalclusteringformicroarraydata
_version_ 1725281727364988928