Massively parallel gene expression variation measurement of a synonymous codon library

Abstract Background Cell-to-cell variation in gene expression strongly affects population behavior and is key to multiple biological processes. While codon usage is known to affect ensemble gene expression, how codon usage influences variation in gene expression between single cells is not well unde...

Full description

Bibliographic Details
Main Authors: Alexander Schmitz, Fuzhong Zhang
Format: Article
Language:English
Published: BMC 2021-03-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-021-07462-z
id doaj-dc4bcd0a7afb46c18729e0c9f00e60fd
record_format Article
spelling doaj-dc4bcd0a7afb46c18729e0c9f00e60fd2021-03-11T11:53:44ZengBMCBMC Genomics1471-21642021-03-0122111210.1186/s12864-021-07462-zMassively parallel gene expression variation measurement of a synonymous codon libraryAlexander Schmitz0Fuzhong Zhang1Department of Energy, Environmental and Chemical Engineering, Washington University in St. LouisDepartment of Energy, Environmental and Chemical Engineering, Washington University in St. LouisAbstract Background Cell-to-cell variation in gene expression strongly affects population behavior and is key to multiple biological processes. While codon usage is known to affect ensemble gene expression, how codon usage influences variation in gene expression between single cells is not well understood. Results Here, we used a Sort-seq based massively parallel strategy to quantify gene expression variation from a green fluorescent protein (GFP) library containing synonymous codons in Escherichia coli. We found that sequences containing codons with higher tRNA Adaptation Index (TAI) scores, and higher codon adaptation index (CAI) scores, have higher GFP variance. This trend is not observed for codons with high Normalized Translation Efficiency Index (nTE) scores nor from the free energy of folding of the mRNA secondary structure. GFP noise, or squared coefficient of variance (CV2), scales with mean protein abundance for low-abundant proteins but does not change at high mean protein abundance. Conclusions Our results suggest that the main source of noise for high-abundance proteins is likely not originating at translation elongation. Additionally, the drastic change in mean protein abundance with small changes in protein noise seen from our library implies that codon optimization can be performed without concerning gene expression noise for biotechnology applications.https://doi.org/10.1186/s12864-021-07462-zSort-seqProtein abundanceCodon usageSingle-cellGene expression variation
collection DOAJ
language English
format Article
sources DOAJ
author Alexander Schmitz
Fuzhong Zhang
spellingShingle Alexander Schmitz
Fuzhong Zhang
Massively parallel gene expression variation measurement of a synonymous codon library
BMC Genomics
Sort-seq
Protein abundance
Codon usage
Single-cell
Gene expression variation
author_facet Alexander Schmitz
Fuzhong Zhang
author_sort Alexander Schmitz
title Massively parallel gene expression variation measurement of a synonymous codon library
title_short Massively parallel gene expression variation measurement of a synonymous codon library
title_full Massively parallel gene expression variation measurement of a synonymous codon library
title_fullStr Massively parallel gene expression variation measurement of a synonymous codon library
title_full_unstemmed Massively parallel gene expression variation measurement of a synonymous codon library
title_sort massively parallel gene expression variation measurement of a synonymous codon library
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2021-03-01
description Abstract Background Cell-to-cell variation in gene expression strongly affects population behavior and is key to multiple biological processes. While codon usage is known to affect ensemble gene expression, how codon usage influences variation in gene expression between single cells is not well understood. Results Here, we used a Sort-seq based massively parallel strategy to quantify gene expression variation from a green fluorescent protein (GFP) library containing synonymous codons in Escherichia coli. We found that sequences containing codons with higher tRNA Adaptation Index (TAI) scores, and higher codon adaptation index (CAI) scores, have higher GFP variance. This trend is not observed for codons with high Normalized Translation Efficiency Index (nTE) scores nor from the free energy of folding of the mRNA secondary structure. GFP noise, or squared coefficient of variance (CV2), scales with mean protein abundance for low-abundant proteins but does not change at high mean protein abundance. Conclusions Our results suggest that the main source of noise for high-abundance proteins is likely not originating at translation elongation. Additionally, the drastic change in mean protein abundance with small changes in protein noise seen from our library implies that codon optimization can be performed without concerning gene expression noise for biotechnology applications.
topic Sort-seq
Protein abundance
Codon usage
Single-cell
Gene expression variation
url https://doi.org/10.1186/s12864-021-07462-z
work_keys_str_mv AT alexanderschmitz massivelyparallelgeneexpressionvariationmeasurementofasynonymouscodonlibrary
AT fuzhongzhang massivelyparallelgeneexpressionvariationmeasurementofasynonymouscodonlibrary
_version_ 1724225119399706624