The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling Data

Motivation: Identifying genes with bimodal expression patterns from large-scale expression profiling data is an important analytical task. Model-based clustering is popular for this purpose. That technique commonly uses the Bayesian information criterion (BIC) for model selection. In practice, howev...

Full description

Bibliographic Details
Main Authors: Jing Wang, Sijin Wen, W. Fraser Symmans, Lajos Pusztai, Kevin R. Coombes
Format: Article
Language:English
Published: SAGE Publishing 2009-01-01
Series:Cancer Informatics
Subjects:
Online Access:http://www.la-press.com/the-bimodality-index-a-criterion-for-discoveringnbsp-and-ranking-bimod-a1579
id doaj-598fc70332d740418d837c9c4f359d4b
record_format Article
spelling doaj-598fc70332d740418d837c9c4f359d4b2020-11-25T02:49:38ZengSAGE PublishingCancer Informatics1176-93512009-01-017199216The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling DataJing WangSijin WenW. Fraser SymmansLajos PusztaiKevin R. CoombesMotivation: Identifying genes with bimodal expression patterns from large-scale expression profiling data is an important analytical task. Model-based clustering is popular for this purpose. That technique commonly uses the Bayesian information criterion (BIC) for model selection. In practice, however, BIC appears to be overly sensitive and may lead to the identification of bimodally expressed genes that are unreliable or not clinically useful. We propose using a novel criterion, the bimodality index, not only to identify but also to rank meaningful and reliable bimodal patterns. The bimodality index can be computed using either a mixture model-based algorithm or Markov chain Monte Carlo techniques.Results: We carried out simulation studies and applied the method to real data from a cancer gene expression profiling study. Our findings suggest that BIC behaves like a lax cutoff based on the bimodality index, and that the bimodality index provides an objective measure to identify and rank meaningful and reliable bimodal patterns from large-scale gene expression datasets. R code to compute the bimodality index is included in the ClassDiscovery package of the Object-Oriented Microarray and Proteomic Analysis (OOMPA) suite available at the web site http://bioinformatics.mdanderson.org/Software/OOMPA. http://www.la-press.com/the-bimodality-index-a-criterion-for-discoveringnbsp-and-ranking-bimod-a1579gene expression profilingcancerbimodality index
collection DOAJ
language English
format Article
sources DOAJ
author Jing Wang
Sijin Wen
W. Fraser Symmans
Lajos Pusztai
Kevin R. Coombes
spellingShingle Jing Wang
Sijin Wen
W. Fraser Symmans
Lajos Pusztai
Kevin R. Coombes
The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling Data
Cancer Informatics
gene expression profiling
cancer
bimodality index
author_facet Jing Wang
Sijin Wen
W. Fraser Symmans
Lajos Pusztai
Kevin R. Coombes
author_sort Jing Wang
title The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling Data
title_short The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling Data
title_full The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling Data
title_fullStr The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling Data
title_full_unstemmed The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling Data
title_sort bimodality index: a criterion for discovering and ranking bimodal signatures from cancer gene expression profiling data
publisher SAGE Publishing
series Cancer Informatics
issn 1176-9351
publishDate 2009-01-01
description Motivation: Identifying genes with bimodal expression patterns from large-scale expression profiling data is an important analytical task. Model-based clustering is popular for this purpose. That technique commonly uses the Bayesian information criterion (BIC) for model selection. In practice, however, BIC appears to be overly sensitive and may lead to the identification of bimodally expressed genes that are unreliable or not clinically useful. We propose using a novel criterion, the bimodality index, not only to identify but also to rank meaningful and reliable bimodal patterns. The bimodality index can be computed using either a mixture model-based algorithm or Markov chain Monte Carlo techniques.Results: We carried out simulation studies and applied the method to real data from a cancer gene expression profiling study. Our findings suggest that BIC behaves like a lax cutoff based on the bimodality index, and that the bimodality index provides an objective measure to identify and rank meaningful and reliable bimodal patterns from large-scale gene expression datasets. R code to compute the bimodality index is included in the ClassDiscovery package of the Object-Oriented Microarray and Proteomic Analysis (OOMPA) suite available at the web site http://bioinformatics.mdanderson.org/Software/OOMPA.
topic gene expression profiling
cancer
bimodality index
url http://www.la-press.com/the-bimodality-index-a-criterion-for-discoveringnbsp-and-ranking-bimod-a1579
work_keys_str_mv AT jingwang thebimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
AT sijinwen thebimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
AT wfrasersymmans thebimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
AT lajospusztai thebimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
AT kevinrcoombes thebimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
AT jingwang bimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
AT sijinwen bimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
AT wfrasersymmans bimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
AT lajospusztai bimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
AT kevinrcoombes bimodalityindexacriterionfordiscoveringandrankingbimodalsignaturesfromcancergeneexpressionprofilingdata
_version_ 1724742277998313472