Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains

Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and...

Full description

Bibliographic Details
Main Authors: Jenny L. Draper, Lori M. Hansen, David L. Bernick, Samar Abedrabbo, Jason G. Underwood, Nguyet Kong, Bihua C. Huang, Allison M. Weis, Bart C. Weimer, Arnoud H. M. van Vliet, Nader Pourmand, Jay V. Solnick, Kevin Karplus, Karen M. Ottemann, Claire M. Fraser
Format: Article
Language:English
Published: American Society for Microbiology 2017-02-01
Series:mBio
Online Access:http://mbio.asm.org/cgi/content/full/8/1/e02321-16
id doaj-2f293b07c80d49af8e65f8c970701bb9
record_format Article
spelling doaj-2f293b07c80d49af8e65f8c970701bb92021-07-02T09:06:33ZengAmerican Society for MicrobiologymBio2150-75112017-02-0181e02321-1610.1128/mBio.02321-16Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori StrainsJenny L. DraperLori M. HansenDavid L. BernickSamar AbedrabboJason G. UnderwoodNguyet KongBihua C. HuangAllison M. WeisBart C. WeimerArnoud H. M. van VlietNader PourmandJay V. SolnickKevin KarplusKaren M. OttemannClaire M. FraserMany bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.http://mbio.asm.org/cgi/content/full/8/1/e02321-16
collection DOAJ
language English
format Article
sources DOAJ
author Jenny L. Draper
Lori M. Hansen
David L. Bernick
Samar Abedrabbo
Jason G. Underwood
Nguyet Kong
Bihua C. Huang
Allison M. Weis
Bart C. Weimer
Arnoud H. M. van Vliet
Nader Pourmand
Jay V. Solnick
Kevin Karplus
Karen M. Ottemann
Claire M. Fraser
spellingShingle Jenny L. Draper
Lori M. Hansen
David L. Bernick
Samar Abedrabbo
Jason G. Underwood
Nguyet Kong
Bihua C. Huang
Allison M. Weis
Bart C. Weimer
Arnoud H. M. van Vliet
Nader Pourmand
Jay V. Solnick
Kevin Karplus
Karen M. Ottemann
Claire M. Fraser
Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains
mBio
author_facet Jenny L. Draper
Lori M. Hansen
David L. Bernick
Samar Abedrabbo
Jason G. Underwood
Nguyet Kong
Bihua C. Huang
Allison M. Weis
Bart C. Weimer
Arnoud H. M. van Vliet
Nader Pourmand
Jay V. Solnick
Kevin Karplus
Karen M. Ottemann
Claire M. Fraser
author_sort Jenny L. Draper
title Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains
title_short Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains
title_full Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains
title_fullStr Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains
title_full_unstemmed Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains
title_sort fallacy of the unique genome: sequence diversity within single helicobacter pylori strains
publisher American Society for Microbiology
series mBio
issn 2150-7511
publishDate 2017-02-01
description Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.
url http://mbio.asm.org/cgi/content/full/8/1/e02321-16
work_keys_str_mv AT jennyldraper fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT lorimhansen fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT davidlbernick fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT samarabedrabbo fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT jasongunderwood fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT nguyetkong fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT bihuachuang fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT allisonmweis fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT bartcweimer fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT arnoudhmvanvliet fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT naderpourmand fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT jayvsolnick fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT kevinkarplus fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT karenmottemann fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
AT clairemfraser fallacyoftheuniquegenomesequencediversitywithinsinglehelicobacterpyloristrains
_version_ 1721333487595683840