Quantitative analysis of correlation between AT and GC biases among bacterial genomes.

Due to different replication mechanisms between the leading and lagging strands, nucleotide composition asymmetries widely exist in bacterial genomes. A general consideration reveals that the leading strand is enriched in Guanine (G) and Thymine (T), and the lagging strand shows richness in Adenine...

Full description

Bibliographic Details
Main Authors: Ge Zhang, Feng Gao
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5291525?pdf=render
id doaj-3fd7f136ea884806bb8fe672aaa379b7
record_format Article
spelling doaj-3fd7f136ea884806bb8fe672aaa379b72020-11-25T01:38:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01122e017140810.1371/journal.pone.0171408Quantitative analysis of correlation between AT and GC biases among bacterial genomes.Ge ZhangFeng GaoDue to different replication mechanisms between the leading and lagging strands, nucleotide composition asymmetries widely exist in bacterial genomes. A general consideration reveals that the leading strand is enriched in Guanine (G) and Thymine (T), and the lagging strand shows richness in Adenine (A) and Cytosine (C). However, some bacteria like Bacillus subtilis have been discovered composing more A than T in the leading strand. To investigate the difference, we analyze the nucleotide asymmetry from the aspect of AT and GC bias correlations. In this study, we propose a windowless method, the Z-curve Correlation Coefficient (ZCC) index, based on the Z-curve method, and analyzed more than 2000 bacterial genomes. We find that the majority of bacteria reveal negative correlations between AT and GC biases, while most genomes in Firmicutes and Tenericutes have positive ZCC indexes. The presence of PolC, purine asymmetry and stronger genes preference in the leading strand are not confined to Firmicutes, but also likely to happen in other phyla dominated by positive ZCC indexes. This method also provides a new insight into other relevant features like aerobism, and can be applied to analyze the correlation between RY (Purine and Pyrimidine) and MK (Amino and Keto) bias and so on.http://europepmc.org/articles/PMC5291525?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Ge Zhang
Feng Gao
spellingShingle Ge Zhang
Feng Gao
Quantitative analysis of correlation between AT and GC biases among bacterial genomes.
PLoS ONE
author_facet Ge Zhang
Feng Gao
author_sort Ge Zhang
title Quantitative analysis of correlation between AT and GC biases among bacterial genomes.
title_short Quantitative analysis of correlation between AT and GC biases among bacterial genomes.
title_full Quantitative analysis of correlation between AT and GC biases among bacterial genomes.
title_fullStr Quantitative analysis of correlation between AT and GC biases among bacterial genomes.
title_full_unstemmed Quantitative analysis of correlation between AT and GC biases among bacterial genomes.
title_sort quantitative analysis of correlation between at and gc biases among bacterial genomes.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2017-01-01
description Due to different replication mechanisms between the leading and lagging strands, nucleotide composition asymmetries widely exist in bacterial genomes. A general consideration reveals that the leading strand is enriched in Guanine (G) and Thymine (T), and the lagging strand shows richness in Adenine (A) and Cytosine (C). However, some bacteria like Bacillus subtilis have been discovered composing more A than T in the leading strand. To investigate the difference, we analyze the nucleotide asymmetry from the aspect of AT and GC bias correlations. In this study, we propose a windowless method, the Z-curve Correlation Coefficient (ZCC) index, based on the Z-curve method, and analyzed more than 2000 bacterial genomes. We find that the majority of bacteria reveal negative correlations between AT and GC biases, while most genomes in Firmicutes and Tenericutes have positive ZCC indexes. The presence of PolC, purine asymmetry and stronger genes preference in the leading strand are not confined to Firmicutes, but also likely to happen in other phyla dominated by positive ZCC indexes. This method also provides a new insight into other relevant features like aerobism, and can be applied to analyze the correlation between RY (Purine and Pyrimidine) and MK (Amino and Keto) bias and so on.
url http://europepmc.org/articles/PMC5291525?pdf=render
work_keys_str_mv AT gezhang quantitativeanalysisofcorrelationbetweenatandgcbiasesamongbacterialgenomes
AT fenggao quantitativeanalysisofcorrelationbetweenatandgcbiasesamongbacterialgenomes
_version_ 1725055790366064640