CpG usage in RNA viruses: data and hypotheses.

CpG repression in RNA viruses has been known for decades, but a reasonable explanation has not yet been proposed to explain this phenomenon. In this study, we calculated the CpG odds ratio of all RNA viruses that have available genome sequences and analyzed the correlation with their genome polarity...

Full description

Bibliographic Details
Main Authors: Xiaofei Cheng, Nasar Virk, Wei Chen, Shuqin Ji, Shuxian Ji, Yuqiang Sun, Xiaoyun Wu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3781069?pdf=render
id doaj-4a570777d61f431d8e3ba59b13e79654
record_format Article
spelling doaj-4a570777d61f431d8e3ba59b13e796542020-11-25T00:47:04ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0189e7410910.1371/journal.pone.0074109CpG usage in RNA viruses: data and hypotheses.Xiaofei ChengNasar VirkWei ChenShuqin JiShuxian JiYuqiang SunXiaoyun WuCpG repression in RNA viruses has been known for decades, but a reasonable explanation has not yet been proposed to explain this phenomenon. In this study, we calculated the CpG odds ratio of all RNA viruses that have available genome sequences and analyzed the correlation with their genome polarity, base composition, synonymous codon usage, phylogenetic relationship, and host. The results indicated that the viral base composition, synonymous codon usage and host selection were the dominant factors that determined the CpG bias in RNA viruses. CpG usage variation between the different viral groups was caused by different combinations of these pressures, which also differed from each other in strength. The consistent under-representation of CpG usage in -ssRNA viruses is determined predominantly by base composition, which may be a consequence of the U/A preferred mutation bias of -ssRNA viruses, whereas the CpG usage of +ssRNA viruses is affected greatly by their hosts. As a result, most +ssRNA viruses mimic their hosts' CpG usage. Unbiased CpG usage in dsRNA viruses is most likely a result of their dsRNA genome, which allows the viruses to escape from the host-driven CpG elimination pressure. CpG was under-represented in all reverse-transcribing viruses (RT viruses), suggesting that DNA methylation is an important factor affecting the CpG usage of retroviruses. However, vertebrate-infecting RT viruses may also suffer host' CpG elimination pressure that also acts on +ssRNA viruses, which results in further under-representation of CpG in the vertebrate-infecting RT viruses.http://europepmc.org/articles/PMC3781069?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Xiaofei Cheng
Nasar Virk
Wei Chen
Shuqin Ji
Shuxian Ji
Yuqiang Sun
Xiaoyun Wu
spellingShingle Xiaofei Cheng
Nasar Virk
Wei Chen
Shuqin Ji
Shuxian Ji
Yuqiang Sun
Xiaoyun Wu
CpG usage in RNA viruses: data and hypotheses.
PLoS ONE
author_facet Xiaofei Cheng
Nasar Virk
Wei Chen
Shuqin Ji
Shuxian Ji
Yuqiang Sun
Xiaoyun Wu
author_sort Xiaofei Cheng
title CpG usage in RNA viruses: data and hypotheses.
title_short CpG usage in RNA viruses: data and hypotheses.
title_full CpG usage in RNA viruses: data and hypotheses.
title_fullStr CpG usage in RNA viruses: data and hypotheses.
title_full_unstemmed CpG usage in RNA viruses: data and hypotheses.
title_sort cpg usage in rna viruses: data and hypotheses.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description CpG repression in RNA viruses has been known for decades, but a reasonable explanation has not yet been proposed to explain this phenomenon. In this study, we calculated the CpG odds ratio of all RNA viruses that have available genome sequences and analyzed the correlation with their genome polarity, base composition, synonymous codon usage, phylogenetic relationship, and host. The results indicated that the viral base composition, synonymous codon usage and host selection were the dominant factors that determined the CpG bias in RNA viruses. CpG usage variation between the different viral groups was caused by different combinations of these pressures, which also differed from each other in strength. The consistent under-representation of CpG usage in -ssRNA viruses is determined predominantly by base composition, which may be a consequence of the U/A preferred mutation bias of -ssRNA viruses, whereas the CpG usage of +ssRNA viruses is affected greatly by their hosts. As a result, most +ssRNA viruses mimic their hosts' CpG usage. Unbiased CpG usage in dsRNA viruses is most likely a result of their dsRNA genome, which allows the viruses to escape from the host-driven CpG elimination pressure. CpG was under-represented in all reverse-transcribing viruses (RT viruses), suggesting that DNA methylation is an important factor affecting the CpG usage of retroviruses. However, vertebrate-infecting RT viruses may also suffer host' CpG elimination pressure that also acts on +ssRNA viruses, which results in further under-representation of CpG in the vertebrate-infecting RT viruses.
url http://europepmc.org/articles/PMC3781069?pdf=render
work_keys_str_mv AT xiaofeicheng cpgusageinrnavirusesdataandhypotheses
AT nasarvirk cpgusageinrnavirusesdataandhypotheses
AT weichen cpgusageinrnavirusesdataandhypotheses
AT shuqinji cpgusageinrnavirusesdataandhypotheses
AT shuxianji cpgusageinrnavirusesdataandhypotheses
AT yuqiangsun cpgusageinrnavirusesdataandhypotheses
AT xiaoyunwu cpgusageinrnavirusesdataandhypotheses
_version_ 1725262148940070912