Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19

The ongoing outbreak of coronavirus disease COVID-19 is significantly implicated by global heterogeneity in the genome organization of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The causative agents of global heterogeneity in the whole genome of SARS-CoV-2 are not well characteriz...

Full description

Bibliographic Details
Main Authors: Saadullah Khattak, Mohd Ahmar Rauf, Qamar Zaman, Yasir Ali, Shabeen Fatima, Pir Muhammad, Tao Li, Hamza Ali Khan, Azhar Abbas Khan, Ebenezeri Erasto Ngowi, Dong-Dong Wu, Xin-Ying Ji
Format: Article
Language:English
Published: MDPI AG 2021-06-01
Series:Biomolecules
Subjects:
Online Access:https://www.mdpi.com/2218-273X/11/6/912
id doaj-c429d4cbef7a4080a9b132e327f78203
record_format Article
spelling doaj-c429d4cbef7a4080a9b132e327f782032021-07-01T00:34:16ZengMDPI AGBiomolecules2218-273X2021-06-011191291210.3390/biom11060912Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19Saadullah Khattak0Mohd Ahmar Rauf1Qamar Zaman2Yasir Ali3Shabeen Fatima4Pir Muhammad5Tao Li6Hamza Ali Khan7Azhar Abbas Khan8Ebenezeri Erasto Ngowi9Dong-Dong Wu10Xin-Ying Ji11Henan International Joint Laboratory for Nuclear Protein Regulation, School of Basic Medical Sciences, Henan University, Kaifeng 475004, Henan, ChinaSchool of Life Sciences, Henan University, Kaifeng 475004, Henan, ChinaDepartment of Bioinformatics, Hazara University, Mansehra 21120, PakistanNational Center for Bioinformatics, Quaid-i-Azam University, Islamabad 45320, PakistanDepartment of Biosciences and Bioinformatics, Capital University of Science and Technology, Islamabad 44000, PakistanHenan-Macquarie University Joint Centre for Biomedical Innovation, School of Life Sciences, Henan University, Kaifeng 475004, Henan, ChinaHenan International Joint Laboratory for Nuclear Protein Regulation, School of Basic Medical Sciences, Henan University, Kaifeng 475004, Henan, ChinaDepartment of Bioinformatics, Hazara University, Mansehra 21120, PakistanHenan International Joint Laboratory for Nuclear Protein Regulation, School of Basic Medical Sciences, Henan University, Kaifeng 475004, Henan, ChinaHenan International Joint Laboratory for Nuclear Protein Regulation, School of Basic Medical Sciences, Henan University, Kaifeng 475004, Henan, ChinaHenan International Joint Laboratory for Nuclear Protein Regulation, School of Basic Medical Sciences, Henan University, Kaifeng 475004, Henan, ChinaHenan International Joint Laboratory for Nuclear Protein Regulation, School of Basic Medical Sciences, Henan University, Kaifeng 475004, Henan, ChinaThe ongoing outbreak of coronavirus disease COVID-19 is significantly implicated by global heterogeneity in the genome organization of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The causative agents of global heterogeneity in the whole genome of SARS-CoV-2 are not well characterized due to the lack of comparative study of a large enough sample size from around the globe to reduce the standard deviation to the acceptable margin of error. To better understand the SARS-CoV-2 genome architecture, we have performed a comprehensive analysis of codon usage bias of sixty (60) strains to get a snapshot of its global heterogeneity. Our study shows a relatively low codon usage bias in the SARS-CoV-2 viral genome globally, with nearly all the over-preferred codons’ A.U. ended. We concluded that the SARS-CoV-2 genome is primarily shaped by mutation pressure; however, marginal selection pressure cannot be overlooked. Within the A/U rich virus genomes of SARS-CoV-2, the standard deviation in G.C. (42.91% ± 5.84%) and the GC3 value (30.14% ± 6.93%) points towards global heterogeneity of the virus. Several SARS-CoV-2 viral strains were originated from different viral lineages at the exact geographic location also supports this fact. Taking all together, these findings suggest that the general root ancestry of the global genomes are different with different genome’s level adaptation to host. This research may provide new insights into the codon patterns, host adaptation, and global heterogeneity of SARS-CoV-2.https://www.mdpi.com/2218-273X/11/6/912CoronavirusSARS-CoV-2Codon usage biasCOVID-19heterogeneity of COVID-19mutational bias
collection DOAJ
language English
format Article
sources DOAJ
author Saadullah Khattak
Mohd Ahmar Rauf
Qamar Zaman
Yasir Ali
Shabeen Fatima
Pir Muhammad
Tao Li
Hamza Ali Khan
Azhar Abbas Khan
Ebenezeri Erasto Ngowi
Dong-Dong Wu
Xin-Ying Ji
spellingShingle Saadullah Khattak
Mohd Ahmar Rauf
Qamar Zaman
Yasir Ali
Shabeen Fatima
Pir Muhammad
Tao Li
Hamza Ali Khan
Azhar Abbas Khan
Ebenezeri Erasto Ngowi
Dong-Dong Wu
Xin-Ying Ji
Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19
Biomolecules
Coronavirus
SARS-CoV-2
Codon usage bias
COVID-19
heterogeneity of COVID-19
mutational bias
author_facet Saadullah Khattak
Mohd Ahmar Rauf
Qamar Zaman
Yasir Ali
Shabeen Fatima
Pir Muhammad
Tao Li
Hamza Ali Khan
Azhar Abbas Khan
Ebenezeri Erasto Ngowi
Dong-Dong Wu
Xin-Ying Ji
author_sort Saadullah Khattak
title Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19
title_short Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19
title_full Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19
title_fullStr Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19
title_full_unstemmed Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19
title_sort genome-wide analysis of codon usage patterns of sars-cov-2 virus reveals global heterogeneity of covid-19
publisher MDPI AG
series Biomolecules
issn 2218-273X
publishDate 2021-06-01
description The ongoing outbreak of coronavirus disease COVID-19 is significantly implicated by global heterogeneity in the genome organization of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The causative agents of global heterogeneity in the whole genome of SARS-CoV-2 are not well characterized due to the lack of comparative study of a large enough sample size from around the globe to reduce the standard deviation to the acceptable margin of error. To better understand the SARS-CoV-2 genome architecture, we have performed a comprehensive analysis of codon usage bias of sixty (60) strains to get a snapshot of its global heterogeneity. Our study shows a relatively low codon usage bias in the SARS-CoV-2 viral genome globally, with nearly all the over-preferred codons’ A.U. ended. We concluded that the SARS-CoV-2 genome is primarily shaped by mutation pressure; however, marginal selection pressure cannot be overlooked. Within the A/U rich virus genomes of SARS-CoV-2, the standard deviation in G.C. (42.91% ± 5.84%) and the GC3 value (30.14% ± 6.93%) points towards global heterogeneity of the virus. Several SARS-CoV-2 viral strains were originated from different viral lineages at the exact geographic location also supports this fact. Taking all together, these findings suggest that the general root ancestry of the global genomes are different with different genome’s level adaptation to host. This research may provide new insights into the codon patterns, host adaptation, and global heterogeneity of SARS-CoV-2.
topic Coronavirus
SARS-CoV-2
Codon usage bias
COVID-19
heterogeneity of COVID-19
mutational bias
url https://www.mdpi.com/2218-273X/11/6/912
work_keys_str_mv AT saadullahkhattak genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT mohdahmarrauf genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT qamarzaman genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT yasirali genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT shabeenfatima genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT pirmuhammad genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT taoli genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT hamzaalikhan genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT azharabbaskhan genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT ebenezerierastongowi genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT dongdongwu genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
AT xinyingji genomewideanalysisofcodonusagepatternsofsarscov2virusrevealsglobalheterogeneityofcovid19
_version_ 1721348319658115072