A regression analysis of gene expression in ES cells reveals two gene classes that are significantly different in epigenetic patterns

<p>Abstract</p> <p>Background</p> <p>To understand the gene regulatory system that governs the self-renewal and pluripotency of embryonic stem cells (ESCs) is an important step for promoting regenerative medicine. In it, the role of several core transcription factors (T...

Full description

Bibliographic Details
Main Authors: Nakai Kenta, Park Sung-Joon
Format: Article
Language:English
Published: BMC 2011-02-01
Series:BMC Bioinformatics
id doaj-c11b05c4c5cc449ea573dbd2bb66f89a
record_format Article
spelling doaj-c11b05c4c5cc449ea573dbd2bb66f89a2020-11-25T00:27:26ZengBMCBMC Bioinformatics1471-21052011-02-0112Suppl 1S5010.1186/1471-2105-12-S1-S50A regression analysis of gene expression in ES cells reveals two gene classes that are significantly different in epigenetic patternsNakai KentaPark Sung-Joon<p>Abstract</p> <p>Background</p> <p>To understand the gene regulatory system that governs the self-renewal and pluripotency of embryonic stem cells (ESCs) is an important step for promoting regenerative medicine. In it, the role of several core transcription factors (TFs), such as Oct4, Sox2 and Nanog, has been intensively investigated, details of their involvement in the genome-wide gene regulation are still not well clarified.</p> <p>Methods</p> <p>We constructed a predictive model of genome-wide gene expression in mouse ESCs from publicly available ChIP-seq data of 12 core TFs. The tag sequences were remapped on the genome by various alignment tools. Then, the binding density of each TF is calculated from the genome-wide bona fide TF binding sites. The TF-binding data was combined with the data of several epigenetic states (DNA methylation, several histone modifications, and CpG island) of promoter regions. These data as well as the ordinary peak intensity data were used as predictors of a simple linear regression model that predicts absolute gene expression. We also developed a pipeline for analyzing the effects of predictors and their interactions.</p> <p>Results</p> <p>Through our analysis, we identified two classes of genes that are either well explained or inefficiently explained by our model. The latter class seems to be genes that are not directly regulated by the core TFs. The regulatory regions of these gene classes show apparently distinct patterns of DNA methylation, histone modifications, existence of CpG islands, and gene ontology terms, suggesting the relative importance of epigenetic effects. Furthermore, we identified statistically significant TF interactions correlated with the epigenetic modification patterns.</p> <p>Conclusions</p> <p>Here, we proposed an improved prediction method in explaining the ESC-specific gene expression. Our study implies that the majority of genes are more or less directly regulated by the core TFs. In addition, our result is consistent with the general idea of relative importance of epigenetic effects in ESCs.</p>
collection DOAJ
language English
format Article
sources DOAJ
author Nakai Kenta
Park Sung-Joon
spellingShingle Nakai Kenta
Park Sung-Joon
A regression analysis of gene expression in ES cells reveals two gene classes that are significantly different in epigenetic patterns
BMC Bioinformatics
author_facet Nakai Kenta
Park Sung-Joon
author_sort Nakai Kenta
title A regression analysis of gene expression in ES cells reveals two gene classes that are significantly different in epigenetic patterns
title_short A regression analysis of gene expression in ES cells reveals two gene classes that are significantly different in epigenetic patterns
title_full A regression analysis of gene expression in ES cells reveals two gene classes that are significantly different in epigenetic patterns
title_fullStr A regression analysis of gene expression in ES cells reveals two gene classes that are significantly different in epigenetic patterns
title_full_unstemmed A regression analysis of gene expression in ES cells reveals two gene classes that are significantly different in epigenetic patterns
title_sort regression analysis of gene expression in es cells reveals two gene classes that are significantly different in epigenetic patterns
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-02-01
description <p>Abstract</p> <p>Background</p> <p>To understand the gene regulatory system that governs the self-renewal and pluripotency of embryonic stem cells (ESCs) is an important step for promoting regenerative medicine. In it, the role of several core transcription factors (TFs), such as Oct4, Sox2 and Nanog, has been intensively investigated, details of their involvement in the genome-wide gene regulation are still not well clarified.</p> <p>Methods</p> <p>We constructed a predictive model of genome-wide gene expression in mouse ESCs from publicly available ChIP-seq data of 12 core TFs. The tag sequences were remapped on the genome by various alignment tools. Then, the binding density of each TF is calculated from the genome-wide bona fide TF binding sites. The TF-binding data was combined with the data of several epigenetic states (DNA methylation, several histone modifications, and CpG island) of promoter regions. These data as well as the ordinary peak intensity data were used as predictors of a simple linear regression model that predicts absolute gene expression. We also developed a pipeline for analyzing the effects of predictors and their interactions.</p> <p>Results</p> <p>Through our analysis, we identified two classes of genes that are either well explained or inefficiently explained by our model. The latter class seems to be genes that are not directly regulated by the core TFs. The regulatory regions of these gene classes show apparently distinct patterns of DNA methylation, histone modifications, existence of CpG islands, and gene ontology terms, suggesting the relative importance of epigenetic effects. Furthermore, we identified statistically significant TF interactions correlated with the epigenetic modification patterns.</p> <p>Conclusions</p> <p>Here, we proposed an improved prediction method in explaining the ESC-specific gene expression. Our study implies that the majority of genes are more or less directly regulated by the core TFs. In addition, our result is consistent with the general idea of relative importance of epigenetic effects in ESCs.</p>
work_keys_str_mv AT nakaikenta aregressionanalysisofgeneexpressioninescellsrevealstwogeneclassesthataresignificantlydifferentinepigeneticpatterns
AT parksungjoon aregressionanalysisofgeneexpressioninescellsrevealstwogeneclassesthataresignificantlydifferentinepigeneticpatterns
AT nakaikenta regressionanalysisofgeneexpressioninescellsrevealstwogeneclassesthataresignificantlydifferentinepigeneticpatterns
AT parksungjoon regressionanalysisofgeneexpressioninescellsrevealstwogeneclassesthataresignificantlydifferentinepigeneticpatterns
_version_ 1725339807814516736