All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissues

<p>Abstract</p> <p>Background</p> <p>The promoters of housekeeping genes are well-bound by RNA polymerase II (RNAP) in different tissues. Although the promoters of these genes are known to contain CpG islands, the specific DNA sequences that are associated with high RNA...

Full description

Bibliographic Details
Main Authors: Myakishev Maxim V, Rishi Vikas, Glass Kimberly, Shlyakhtenko Andrey, Rozenberg Julian M, FitzGerald Peter C, Vinson Charles
Format: Article
Language:English
Published: BMC 2008-02-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/9/67
id doaj-aa2f6c930f4d4d0f96f82d9284e29173
record_format Article
spelling doaj-aa2f6c930f4d4d0f96f82d9284e291732020-11-24T21:40:16ZengBMCBMC Genomics1471-21642008-02-01916710.1186/1471-2164-9-67All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissuesMyakishev Maxim VRishi VikasGlass KimberlyShlyakhtenko AndreyRozenberg Julian MFitzGerald Peter CVinson Charles<p>Abstract</p> <p>Background</p> <p>The promoters of housekeeping genes are well-bound by RNA polymerase II (RNAP) in different tissues. Although the promoters of these genes are known to contain CpG islands, the specific DNA sequences that are associated with high RNAP binding to housekeeping promoters has not been described.</p> <p>Results</p> <p>ChIP-chip experiments from three mouse tissues, liver, heart ventricles, and primary keratinocytes, indicate that 94% of promoters have similar RNAP binding, ranging from well-bound to poorly-bound in all tissues. Using all 8-base pair long sequences as a test set, we have identified the DNA sequences that are enriched in promoters of housekeeping genes, focusing on those DNA sequences which are preferentially localized in the proximal promoter. We observe a bimodal distribution. Virtually all sequences enriched in promoters with high RNAP binding values contain a CpG dinucleotide. These results suggest that only transcription factor binding sites (TFBS) that contain the CpG dinucleotide are involved in RNAP binding to housekeeping promoters while TFBS that do not contain a CpG are involved in regulated promoter activity. Abundant 8-mers that are preferentially localized in the proximal promoters and exhibit the best enrichment in RNAP bound promoters are all variants of six known CpG-containing TFBS: ETS, NRF-1, BoxA, SP1, CRE, and E-Box. The frequency of these six DNA motifs can predict housekeeping promoters as accurately as the presence of a CpG island, suggesting that they are the structural elements critical for CpG island function. Experimental EMSA results demonstrate that methylation of the CpG in the ETS, NRF-1, and SP1 motifs prevent DNA binding in nuclear extracts in both keratinocytes and liver.</p> <p>Conclusion</p> <p>In general, TFBS that do not contain a CpG are involved in regulated gene expression while TFBS that contain a CpG are involved in constitutive gene expression with some CpG containing sequences also involved in inducible and tissue specific gene regulation. These TFBS are not bound when the CpG is methylated. Unmethylated CpG dinucleotides in the TFBS in CpG islands allow the transcription factors to find their binding sites which occur only in promoters, in turn localizing RNAP to promoters.</p> http://www.biomedcentral.com/1471-2164/9/67
collection DOAJ
language English
format Article
sources DOAJ
author Myakishev Maxim V
Rishi Vikas
Glass Kimberly
Shlyakhtenko Andrey
Rozenberg Julian M
FitzGerald Peter C
Vinson Charles
spellingShingle Myakishev Maxim V
Rishi Vikas
Glass Kimberly
Shlyakhtenko Andrey
Rozenberg Julian M
FitzGerald Peter C
Vinson Charles
All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissues
BMC Genomics
author_facet Myakishev Maxim V
Rishi Vikas
Glass Kimberly
Shlyakhtenko Andrey
Rozenberg Julian M
FitzGerald Peter C
Vinson Charles
author_sort Myakishev Maxim V
title All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissues
title_short All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissues
title_full All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissues
title_fullStr All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissues
title_full_unstemmed All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissues
title_sort all and only cpg containing sequences are enriched in promoters abundantly bound by rna polymerase ii in multiple tissues
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2008-02-01
description <p>Abstract</p> <p>Background</p> <p>The promoters of housekeeping genes are well-bound by RNA polymerase II (RNAP) in different tissues. Although the promoters of these genes are known to contain CpG islands, the specific DNA sequences that are associated with high RNAP binding to housekeeping promoters has not been described.</p> <p>Results</p> <p>ChIP-chip experiments from three mouse tissues, liver, heart ventricles, and primary keratinocytes, indicate that 94% of promoters have similar RNAP binding, ranging from well-bound to poorly-bound in all tissues. Using all 8-base pair long sequences as a test set, we have identified the DNA sequences that are enriched in promoters of housekeeping genes, focusing on those DNA sequences which are preferentially localized in the proximal promoter. We observe a bimodal distribution. Virtually all sequences enriched in promoters with high RNAP binding values contain a CpG dinucleotide. These results suggest that only transcription factor binding sites (TFBS) that contain the CpG dinucleotide are involved in RNAP binding to housekeeping promoters while TFBS that do not contain a CpG are involved in regulated promoter activity. Abundant 8-mers that are preferentially localized in the proximal promoters and exhibit the best enrichment in RNAP bound promoters are all variants of six known CpG-containing TFBS: ETS, NRF-1, BoxA, SP1, CRE, and E-Box. The frequency of these six DNA motifs can predict housekeeping promoters as accurately as the presence of a CpG island, suggesting that they are the structural elements critical for CpG island function. Experimental EMSA results demonstrate that methylation of the CpG in the ETS, NRF-1, and SP1 motifs prevent DNA binding in nuclear extracts in both keratinocytes and liver.</p> <p>Conclusion</p> <p>In general, TFBS that do not contain a CpG are involved in regulated gene expression while TFBS that contain a CpG are involved in constitutive gene expression with some CpG containing sequences also involved in inducible and tissue specific gene regulation. These TFBS are not bound when the CpG is methylated. Unmethylated CpG dinucleotides in the TFBS in CpG islands allow the transcription factors to find their binding sites which occur only in promoters, in turn localizing RNAP to promoters.</p>
url http://www.biomedcentral.com/1471-2164/9/67
work_keys_str_mv AT myakishevmaximv allandonlycpgcontainingsequencesareenrichedinpromotersabundantlyboundbyrnapolymeraseiiinmultipletissues
AT rishivikas allandonlycpgcontainingsequencesareenrichedinpromotersabundantlyboundbyrnapolymeraseiiinmultipletissues
AT glasskimberly allandonlycpgcontainingsequencesareenrichedinpromotersabundantlyboundbyrnapolymeraseiiinmultipletissues
AT shlyakhtenkoandrey allandonlycpgcontainingsequencesareenrichedinpromotersabundantlyboundbyrnapolymeraseiiinmultipletissues
AT rozenbergjulianm allandonlycpgcontainingsequencesareenrichedinpromotersabundantlyboundbyrnapolymeraseiiinmultipletissues
AT fitzgeraldpeterc allandonlycpgcontainingsequencesareenrichedinpromotersabundantlyboundbyrnapolymeraseiiinmultipletissues
AT vinsoncharles allandonlycpgcontainingsequencesareenrichedinpromotersabundantlyboundbyrnapolymeraseiiinmultipletissues
_version_ 1725926999857299456