Improving Characterization of Understudied Human Microbiomes Using Targeted Phylogenetics

The human microbiome plays a critically important role in health and disease, but current understanding of the mechanisms underlying the interactions between the varying microbiome and the different host environments is lacking. Having access to a database of fully sequenced bacterial genomes provid...

Full description

Bibliographic Details
Main Authors: Bruce A. Rosa, Kathie Mihindukulasuriya, Kymberlie Hallsworth-Pepin, Aye Wollam, John Martin, Caroline Snowden, William Michael Dunne, George M. Weinstock, Carey-Ann D. Burnham, Makedonka Mitreva
Format: Article
Language:English
Published: American Society for Microbiology 2020-02-01
Series:mSystems
Subjects:
hmp
Online Access:https://doi.org/10.1128/mSystems.00096-20
id doaj-b29d230d368e4e19bd2a4a9d2b76aa20
record_format Article
spelling doaj-b29d230d368e4e19bd2a4a9d2b76aa202020-11-25T02:11:11ZengAmerican Society for MicrobiologymSystems2379-50772020-02-0151e00096-2010.1128/mSystems.00096-20Improving Characterization of Understudied Human Microbiomes Using Targeted PhylogeneticsBruce A. RosaKathie MihindukulasuriyaKymberlie Hallsworth-PepinAye WollamJohn MartinCaroline SnowdenWilliam Michael DunneGeorge M. WeinstockCarey-Ann D. BurnhamMakedonka MitrevaThe human microbiome plays a critically important role in health and disease, but current understanding of the mechanisms underlying the interactions between the varying microbiome and the different host environments is lacking. Having access to a database of fully sequenced bacterial genomes provides invaluable insights into microbial functions, but currently sequenced genomes for the human microbiome have largely come from a limited number of body sites (primarily feces), while other sites such as the skin, respiratory tract, and urinary tract are underrepresented, resulting in as little as 13% of bacterium-derived reads mapping to known bacterial genomes. Here, we sequenced and assembled 665 new bacterial genomes, prioritized from a larger database to select underrepresented body sites and bacterial taxa in the existing databases. As a result, we substantially improve mapping rates for samples from the Human Microbiome Project and provide an important contribution to human bacterial genomic databases for future studies.Whole-genome bacterial sequences are required to better understand microbial functions, niche-specific bacterial metabolism, and disease states. Although genomic sequences are available for many of the human-associated bacteria from commonly tested body habitats (e.g., feces), as few as 13% of bacterium-derived reads from other sites such as the skin map to known bacterial genomes. To facilitate a better characterization of metagenomic shotgun reads from underrepresented body sites, we collected over 10,000 bacterial isolates originating from 14 human body habitats, identified novel taxonomic groups based on full-length 16S rRNA gene sequences, clustered the sequences to ensure that no individual taxonomic group was overselected for sequencing, prioritized bacteria from underrepresented body sites (such as skin and respiratory and urinary tracts), and sequenced and assembled genomes for 665 new bacterial strains. Here, we show that addition of these genomes improved read mapping rates of Human Microbiome Project (HMP) metagenomic samples by nearly 30% for the previously underrepresented phylum Fusobacteria, and 27.5% of the novel genomes generated here had high representation in at least one of the tested HMP samples, compared to 12.5% of the sequences in the public databases, indicating an enrichment of useful novel genomic sequences resulting from the prioritization procedure. As our understanding of the human microbiome continues to improve and to enter the realm of therapy developments, targeted approaches such as this to improve genomic databases will increase in importance from both an academic and a clinical perspective.https://doi.org/10.1128/mSystems.00096-20hmpgenomehuman microbiomemicrobiomeresource
collection DOAJ
language English
format Article
sources DOAJ
author Bruce A. Rosa
Kathie Mihindukulasuriya
Kymberlie Hallsworth-Pepin
Aye Wollam
John Martin
Caroline Snowden
William Michael Dunne
George M. Weinstock
Carey-Ann D. Burnham
Makedonka Mitreva
spellingShingle Bruce A. Rosa
Kathie Mihindukulasuriya
Kymberlie Hallsworth-Pepin
Aye Wollam
John Martin
Caroline Snowden
William Michael Dunne
George M. Weinstock
Carey-Ann D. Burnham
Makedonka Mitreva
Improving Characterization of Understudied Human Microbiomes Using Targeted Phylogenetics
mSystems
hmp
genome
human microbiome
microbiome
resource
author_facet Bruce A. Rosa
Kathie Mihindukulasuriya
Kymberlie Hallsworth-Pepin
Aye Wollam
John Martin
Caroline Snowden
William Michael Dunne
George M. Weinstock
Carey-Ann D. Burnham
Makedonka Mitreva
author_sort Bruce A. Rosa
title Improving Characterization of Understudied Human Microbiomes Using Targeted Phylogenetics
title_short Improving Characterization of Understudied Human Microbiomes Using Targeted Phylogenetics
title_full Improving Characterization of Understudied Human Microbiomes Using Targeted Phylogenetics
title_fullStr Improving Characterization of Understudied Human Microbiomes Using Targeted Phylogenetics
title_full_unstemmed Improving Characterization of Understudied Human Microbiomes Using Targeted Phylogenetics
title_sort improving characterization of understudied human microbiomes using targeted phylogenetics
publisher American Society for Microbiology
series mSystems
issn 2379-5077
publishDate 2020-02-01
description The human microbiome plays a critically important role in health and disease, but current understanding of the mechanisms underlying the interactions between the varying microbiome and the different host environments is lacking. Having access to a database of fully sequenced bacterial genomes provides invaluable insights into microbial functions, but currently sequenced genomes for the human microbiome have largely come from a limited number of body sites (primarily feces), while other sites such as the skin, respiratory tract, and urinary tract are underrepresented, resulting in as little as 13% of bacterium-derived reads mapping to known bacterial genomes. Here, we sequenced and assembled 665 new bacterial genomes, prioritized from a larger database to select underrepresented body sites and bacterial taxa in the existing databases. As a result, we substantially improve mapping rates for samples from the Human Microbiome Project and provide an important contribution to human bacterial genomic databases for future studies.Whole-genome bacterial sequences are required to better understand microbial functions, niche-specific bacterial metabolism, and disease states. Although genomic sequences are available for many of the human-associated bacteria from commonly tested body habitats (e.g., feces), as few as 13% of bacterium-derived reads from other sites such as the skin map to known bacterial genomes. To facilitate a better characterization of metagenomic shotgun reads from underrepresented body sites, we collected over 10,000 bacterial isolates originating from 14 human body habitats, identified novel taxonomic groups based on full-length 16S rRNA gene sequences, clustered the sequences to ensure that no individual taxonomic group was overselected for sequencing, prioritized bacteria from underrepresented body sites (such as skin and respiratory and urinary tracts), and sequenced and assembled genomes for 665 new bacterial strains. Here, we show that addition of these genomes improved read mapping rates of Human Microbiome Project (HMP) metagenomic samples by nearly 30% for the previously underrepresented phylum Fusobacteria, and 27.5% of the novel genomes generated here had high representation in at least one of the tested HMP samples, compared to 12.5% of the sequences in the public databases, indicating an enrichment of useful novel genomic sequences resulting from the prioritization procedure. As our understanding of the human microbiome continues to improve and to enter the realm of therapy developments, targeted approaches such as this to improve genomic databases will increase in importance from both an academic and a clinical perspective.
topic hmp
genome
human microbiome
microbiome
resource
url https://doi.org/10.1128/mSystems.00096-20
work_keys_str_mv AT brucearosa improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
AT kathiemihindukulasuriya improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
AT kymberliehallsworthpepin improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
AT ayewollam improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
AT johnmartin improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
AT carolinesnowden improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
AT williammichaeldunne improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
AT georgemweinstock improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
AT careyanndburnham improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
AT makedonkamitreva improvingcharacterizationofunderstudiedhumanmicrobiomesusingtargetedphylogenetics
_version_ 1715551556641751040