R/BHC: fast Bayesian hierarchical clustering for microarray data
<p>Abstract</p> <p>Background</p> <p>Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data analysis, little attention has been paid to uncertainty in the results obtain...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2009-08-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/10/242 |
id |
doaj-39cf0cf8047944bea1bdc384c02d6126 |
---|---|
record_format |
Article |
spelling |
doaj-39cf0cf8047944bea1bdc384c02d61262020-11-25T00:42:33ZengBMCBMC Bioinformatics1471-21052009-08-0110124210.1186/1471-2105-10-242R/BHC: fast Bayesian hierarchical clustering for microarray dataGrant MurrayTruman William MGhahramani ZoubinXu YangHeller KatherineSavage Richard SDenby Katherine JWild David L<p>Abstract</p> <p>Background</p> <p>Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data analysis, little attention has been paid to uncertainty in the results obtained.</p> <p>Results</p> <p>We present an R/Bioconductor port of a fast novel algorithm for Bayesian agglomerative hierarchical clustering and demonstrate its use in clustering gene expression microarray data. The method performs bottom-up hierarchical clustering, using a Dirichlet Process (infinite mixture) to model uncertainty in the data and Bayesian model selection to decide at each step which clusters to merge.</p> <p>Conclusion</p> <p>Biologically plausible results are presented from a well studied data set: expression profiles of <it>A. thaliana </it>subjected to a variety of biotic and abiotic stresses. Our method avoids several limitations of traditional methods, for example how many clusters there should be and how to choose a principled distance metric.</p> http://www.biomedcentral.com/1471-2105/10/242 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Grant Murray Truman William M Ghahramani Zoubin Xu Yang Heller Katherine Savage Richard S Denby Katherine J Wild David L |
spellingShingle |
Grant Murray Truman William M Ghahramani Zoubin Xu Yang Heller Katherine Savage Richard S Denby Katherine J Wild David L R/BHC: fast Bayesian hierarchical clustering for microarray data BMC Bioinformatics |
author_facet |
Grant Murray Truman William M Ghahramani Zoubin Xu Yang Heller Katherine Savage Richard S Denby Katherine J Wild David L |
author_sort |
Grant Murray |
title |
R/BHC: fast Bayesian hierarchical clustering for microarray data |
title_short |
R/BHC: fast Bayesian hierarchical clustering for microarray data |
title_full |
R/BHC: fast Bayesian hierarchical clustering for microarray data |
title_fullStr |
R/BHC: fast Bayesian hierarchical clustering for microarray data |
title_full_unstemmed |
R/BHC: fast Bayesian hierarchical clustering for microarray data |
title_sort |
r/bhc: fast bayesian hierarchical clustering for microarray data |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2009-08-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data analysis, little attention has been paid to uncertainty in the results obtained.</p> <p>Results</p> <p>We present an R/Bioconductor port of a fast novel algorithm for Bayesian agglomerative hierarchical clustering and demonstrate its use in clustering gene expression microarray data. The method performs bottom-up hierarchical clustering, using a Dirichlet Process (infinite mixture) to model uncertainty in the data and Bayesian model selection to decide at each step which clusters to merge.</p> <p>Conclusion</p> <p>Biologically plausible results are presented from a well studied data set: expression profiles of <it>A. thaliana </it>subjected to a variety of biotic and abiotic stresses. Our method avoids several limitations of traditional methods, for example how many clusters there should be and how to choose a principled distance metric.</p> |
url |
http://www.biomedcentral.com/1471-2105/10/242 |
work_keys_str_mv |
AT grantmurray rbhcfastbayesianhierarchicalclusteringformicroarraydata AT trumanwilliamm rbhcfastbayesianhierarchicalclusteringformicroarraydata AT ghahramanizoubin rbhcfastbayesianhierarchicalclusteringformicroarraydata AT xuyang rbhcfastbayesianhierarchicalclusteringformicroarraydata AT hellerkatherine rbhcfastbayesianhierarchicalclusteringformicroarraydata AT savagerichards rbhcfastbayesianhierarchicalclusteringformicroarraydata AT denbykatherinej rbhcfastbayesianhierarchicalclusteringformicroarraydata AT wilddavidl rbhcfastbayesianhierarchicalclusteringformicroarraydata |
_version_ |
1725281727364988928 |