Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors

<p>Abstract</p> <p>Background</p> <p>In the analysis of high-throughput data with a clinical outcome, researchers mostly focus on genes/proteins that show first-order relations with the clinical outcome. While this approach yields biomarkers and biological mechanisms th...

Full description

Bibliographic Details
Main Authors: Yu Tianwei, Bai Yun
Format: Article
Language:English
Published: BMC 2011-11-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/12/563
id doaj-8c85bbbc65fc42879aa35f84f275f0ed
record_format Article
spelling doaj-8c85bbbc65fc42879aa35f84f275f0ed2020-11-25T01:00:59ZengBMCBMC Genomics1471-21642011-11-0112156310.1186/1471-2164-12-563Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factorsYu TianweiBai Yun<p>Abstract</p> <p>Background</p> <p>In the analysis of high-throughput data with a clinical outcome, researchers mostly focus on genes/proteins that show first-order relations with the clinical outcome. While this approach yields biomarkers and biological mechanisms that are easily interpretable, it may miss information that is important to the understanding of disease mechanism and/or treatment response. Here we test the hypothesis that unobserved factors can be mobilized by the living system to coordinate the response to the clinical factors.</p> <p>Results</p> <p>We developed a computational method named Guided Latent Factor Discovery (GLFD) to identify hidden factors that act in combination with the observed clinical factors to control gene modules. In simulation studies, the method recovered masked factors effectively. Using real microarray data, we demonstrate that the method identifies latent factors that are biologically relevant, and extracts more information than analyzing only the first-order response to the clinical outcome.</p> <p>Conclusions</p> <p>Finding latent factors using GLFD brings extra insight into the mechanisms of the disease/drug response. The R code of the method is available at <url>http://userwww.service.emory.edu/~tyu8/GLFD</url>.</p> http://www.biomedcentral.com/1471-2164/12/563
collection DOAJ
language English
format Article
sources DOAJ
author Yu Tianwei
Bai Yun
spellingShingle Yu Tianwei
Bai Yun
Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors
BMC Genomics
author_facet Yu Tianwei
Bai Yun
author_sort Yu Tianwei
title Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors
title_short Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors
title_full Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors
title_fullStr Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors
title_full_unstemmed Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors
title_sort improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2011-11-01
description <p>Abstract</p> <p>Background</p> <p>In the analysis of high-throughput data with a clinical outcome, researchers mostly focus on genes/proteins that show first-order relations with the clinical outcome. While this approach yields biomarkers and biological mechanisms that are easily interpretable, it may miss information that is important to the understanding of disease mechanism and/or treatment response. Here we test the hypothesis that unobserved factors can be mobilized by the living system to coordinate the response to the clinical factors.</p> <p>Results</p> <p>We developed a computational method named Guided Latent Factor Discovery (GLFD) to identify hidden factors that act in combination with the observed clinical factors to control gene modules. In simulation studies, the method recovered masked factors effectively. Using real microarray data, we demonstrate that the method identifies latent factors that are biologically relevant, and extracts more information than analyzing only the first-order response to the clinical outcome.</p> <p>Conclusions</p> <p>Finding latent factors using GLFD brings extra insight into the mechanisms of the disease/drug response. The R code of the method is available at <url>http://userwww.service.emory.edu/~tyu8/GLFD</url>.</p>
url http://www.biomedcentral.com/1471-2164/12/563
work_keys_str_mv AT yutianwei improvinggeneexpressiondatainterpretationbyfindinglatentfactorsthatcoregulategenemoduleswithclinicalfactors
AT baiyun improvinggeneexpressiondatainterpretationbyfindinglatentfactorsthatcoregulategenemoduleswithclinicalfactors
_version_ 1725211532501975040