Bayesian Biclustering on Discrete Data: Variable Selection Methods

Biclustering is a technique for clustering rows and columns of a data matrix simultaneously. Over the past few years, we have seen its applications in biology-related fields, as well as in many data mining projects. As opposed to classical clustering methods, biclustering groups objects that are sim...

Full description

Bibliographic Details
Main Author: Guo, Lei
Other Authors: Liu, Jun
Language:en_US
Published: Harvard University 2013
Subjects:
Online Access:http://dissertations.umi.com/gsas.harvard:11201
http://nrs.harvard.edu/urn-3:HUL.InstRepos:11181216
Description
Summary:Biclustering is a technique for clustering rows and columns of a data matrix simultaneously. Over the past few years, we have seen its applications in biology-related fields, as well as in many data mining projects. As opposed to classical clustering methods, biclustering groups objects that are similar only on a subset of variables. Many biclustering algorithms on continuous data have emerged over the last decade. In this dissertation, we will focus on two Bayesian biclustering algorithms we developed for discrete data, more specifically categorical data and ordinal data. === Statistics