Massively parallel gene expression variation measurement of a synonymous codon library
Abstract Background Cell-to-cell variation in gene expression strongly affects population behavior and is key to multiple biological processes. While codon usage is known to affect ensemble gene expression, how codon usage influences variation in gene expression between single cells is not well unde...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-03-01
|
Series: | BMC Genomics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12864-021-07462-z |
id |
doaj-dc4bcd0a7afb46c18729e0c9f00e60fd |
---|---|
record_format |
Article |
spelling |
doaj-dc4bcd0a7afb46c18729e0c9f00e60fd2021-03-11T11:53:44ZengBMCBMC Genomics1471-21642021-03-0122111210.1186/s12864-021-07462-zMassively parallel gene expression variation measurement of a synonymous codon libraryAlexander Schmitz0Fuzhong Zhang1Department of Energy, Environmental and Chemical Engineering, Washington University in St. LouisDepartment of Energy, Environmental and Chemical Engineering, Washington University in St. LouisAbstract Background Cell-to-cell variation in gene expression strongly affects population behavior and is key to multiple biological processes. While codon usage is known to affect ensemble gene expression, how codon usage influences variation in gene expression between single cells is not well understood. Results Here, we used a Sort-seq based massively parallel strategy to quantify gene expression variation from a green fluorescent protein (GFP) library containing synonymous codons in Escherichia coli. We found that sequences containing codons with higher tRNA Adaptation Index (TAI) scores, and higher codon adaptation index (CAI) scores, have higher GFP variance. This trend is not observed for codons with high Normalized Translation Efficiency Index (nTE) scores nor from the free energy of folding of the mRNA secondary structure. GFP noise, or squared coefficient of variance (CV2), scales with mean protein abundance for low-abundant proteins but does not change at high mean protein abundance. Conclusions Our results suggest that the main source of noise for high-abundance proteins is likely not originating at translation elongation. Additionally, the drastic change in mean protein abundance with small changes in protein noise seen from our library implies that codon optimization can be performed without concerning gene expression noise for biotechnology applications.https://doi.org/10.1186/s12864-021-07462-zSort-seqProtein abundanceCodon usageSingle-cellGene expression variation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Alexander Schmitz Fuzhong Zhang |
spellingShingle |
Alexander Schmitz Fuzhong Zhang Massively parallel gene expression variation measurement of a synonymous codon library BMC Genomics Sort-seq Protein abundance Codon usage Single-cell Gene expression variation |
author_facet |
Alexander Schmitz Fuzhong Zhang |
author_sort |
Alexander Schmitz |
title |
Massively parallel gene expression variation measurement of a synonymous codon library |
title_short |
Massively parallel gene expression variation measurement of a synonymous codon library |
title_full |
Massively parallel gene expression variation measurement of a synonymous codon library |
title_fullStr |
Massively parallel gene expression variation measurement of a synonymous codon library |
title_full_unstemmed |
Massively parallel gene expression variation measurement of a synonymous codon library |
title_sort |
massively parallel gene expression variation measurement of a synonymous codon library |
publisher |
BMC |
series |
BMC Genomics |
issn |
1471-2164 |
publishDate |
2021-03-01 |
description |
Abstract Background Cell-to-cell variation in gene expression strongly affects population behavior and is key to multiple biological processes. While codon usage is known to affect ensemble gene expression, how codon usage influences variation in gene expression between single cells is not well understood. Results Here, we used a Sort-seq based massively parallel strategy to quantify gene expression variation from a green fluorescent protein (GFP) library containing synonymous codons in Escherichia coli. We found that sequences containing codons with higher tRNA Adaptation Index (TAI) scores, and higher codon adaptation index (CAI) scores, have higher GFP variance. This trend is not observed for codons with high Normalized Translation Efficiency Index (nTE) scores nor from the free energy of folding of the mRNA secondary structure. GFP noise, or squared coefficient of variance (CV2), scales with mean protein abundance for low-abundant proteins but does not change at high mean protein abundance. Conclusions Our results suggest that the main source of noise for high-abundance proteins is likely not originating at translation elongation. Additionally, the drastic change in mean protein abundance with small changes in protein noise seen from our library implies that codon optimization can be performed without concerning gene expression noise for biotechnology applications. |
topic |
Sort-seq Protein abundance Codon usage Single-cell Gene expression variation |
url |
https://doi.org/10.1186/s12864-021-07462-z |
work_keys_str_mv |
AT alexanderschmitz massivelyparallelgeneexpressionvariationmeasurementofasynonymouscodonlibrary AT fuzhongzhang massivelyparallelgeneexpressionvariationmeasurementofasynonymouscodonlibrary |
_version_ |
1724225119399706624 |