Discovery of stress responsive DNA regulatory motifs in Arabidopsis.

The discovery of DNA regulatory motifs in the sequenced genomes using computational methods remains challenging. Here, we present MotifIndexer--a comprehensive strategy for de novo identification of DNA regulatory motifs at a genome level. Using word-counting methods, we indexed the existence of eve...

Full description

Bibliographic Details
Main Authors: Shisong Ma, Shawn Bachan, Matthew Porto, Hans J Bohnert, Michael Snyder, Savithramma P Dinesh-Kumar
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3418279?pdf=render
id doaj-85c77bc3056a40daa7d2b49d47859814
record_format Article
spelling doaj-85c77bc3056a40daa7d2b49d478598142020-11-25T02:42:26ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0178e4319810.1371/journal.pone.0043198Discovery of stress responsive DNA regulatory motifs in Arabidopsis.Shisong MaShawn BachanMatthew PortoHans J BohnertMichael SnyderSavithramma P Dinesh-KumarThe discovery of DNA regulatory motifs in the sequenced genomes using computational methods remains challenging. Here, we present MotifIndexer--a comprehensive strategy for de novo identification of DNA regulatory motifs at a genome level. Using word-counting methods, we indexed the existence of every 8-mer oligo composed of bases A, C, G, T, r, y, s, w, m, k, n or 12-mer oligo composed of A, C, G, T, n, in the promoters of all predicted genes of Arabidopsis thaliana genome and of selected stress-induced co-expressed genes. From this analysis, we identified number of over-represented motifs. Among these, major critical motifs were identified using a position filter. We used a model based on uniform distribution and the z-scores derived from this model to describe position bias. Interestingly, many motifs showed position bias towards the transcription start site. We extended this model to show biased distribution of motifs in the genomes of both A. thaliana and rice. We also used MotifIndexer to identify conserved motifs in co-expressed gene groups from two Arabidopsis species, A. thaliana and A. lyrata. This new comparative genomics method does not depend on alignments of homologous gene promoter sequences.http://europepmc.org/articles/PMC3418279?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Shisong Ma
Shawn Bachan
Matthew Porto
Hans J Bohnert
Michael Snyder
Savithramma P Dinesh-Kumar
spellingShingle Shisong Ma
Shawn Bachan
Matthew Porto
Hans J Bohnert
Michael Snyder
Savithramma P Dinesh-Kumar
Discovery of stress responsive DNA regulatory motifs in Arabidopsis.
PLoS ONE
author_facet Shisong Ma
Shawn Bachan
Matthew Porto
Hans J Bohnert
Michael Snyder
Savithramma P Dinesh-Kumar
author_sort Shisong Ma
title Discovery of stress responsive DNA regulatory motifs in Arabidopsis.
title_short Discovery of stress responsive DNA regulatory motifs in Arabidopsis.
title_full Discovery of stress responsive DNA regulatory motifs in Arabidopsis.
title_fullStr Discovery of stress responsive DNA regulatory motifs in Arabidopsis.
title_full_unstemmed Discovery of stress responsive DNA regulatory motifs in Arabidopsis.
title_sort discovery of stress responsive dna regulatory motifs in arabidopsis.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2012-01-01
description The discovery of DNA regulatory motifs in the sequenced genomes using computational methods remains challenging. Here, we present MotifIndexer--a comprehensive strategy for de novo identification of DNA regulatory motifs at a genome level. Using word-counting methods, we indexed the existence of every 8-mer oligo composed of bases A, C, G, T, r, y, s, w, m, k, n or 12-mer oligo composed of A, C, G, T, n, in the promoters of all predicted genes of Arabidopsis thaliana genome and of selected stress-induced co-expressed genes. From this analysis, we identified number of over-represented motifs. Among these, major critical motifs were identified using a position filter. We used a model based on uniform distribution and the z-scores derived from this model to describe position bias. Interestingly, many motifs showed position bias towards the transcription start site. We extended this model to show biased distribution of motifs in the genomes of both A. thaliana and rice. We also used MotifIndexer to identify conserved motifs in co-expressed gene groups from two Arabidopsis species, A. thaliana and A. lyrata. This new comparative genomics method does not depend on alignments of homologous gene promoter sequences.
url http://europepmc.org/articles/PMC3418279?pdf=render
work_keys_str_mv AT shisongma discoveryofstressresponsivednaregulatorymotifsinarabidopsis
AT shawnbachan discoveryofstressresponsivednaregulatorymotifsinarabidopsis
AT matthewporto discoveryofstressresponsivednaregulatorymotifsinarabidopsis
AT hansjbohnert discoveryofstressresponsivednaregulatorymotifsinarabidopsis
AT michaelsnyder discoveryofstressresponsivednaregulatorymotifsinarabidopsis
AT savithrammapdineshkumar discoveryofstressresponsivednaregulatorymotifsinarabidopsis
_version_ 1724774028833456128