High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED.

Profiling microbial community function from metagenomic sequencing data remains a computationally challenging problem. Mapping millions of DNA reads from such samples to reference protein databases requires long run-times, and short read lengths can result in spurious hits to unrelated proteins (los...

Full description

Bibliographic Details
Main Authors: James Kaminski, Molly K Gibson, Eric A Franzosa, Nicola Segata, Gautam Dantas, Curtis Huttenhower
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-12-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1004557
id doaj-114882a8150b440f88b8c89d4aa0982c
record_format Article
spelling doaj-114882a8150b440f88b8c89d4aa0982c2021-04-21T15:34:13ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582015-12-011112e100455710.1371/journal.pcbi.1004557High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED.James KaminskiMolly K GibsonEric A FranzosaNicola SegataGautam DantasCurtis HuttenhowerProfiling microbial community function from metagenomic sequencing data remains a computationally challenging problem. Mapping millions of DNA reads from such samples to reference protein databases requires long run-times, and short read lengths can result in spurious hits to unrelated proteins (loss of specificity). We developed ShortBRED (Short, Better Representative Extract Dataset) to address these challenges, facilitating fast, accurate functional profiling of metagenomic samples. ShortBRED consists of two components: (i) a method that reduces reference proteins of interest to short, highly representative amino acid sequences ("markers") and (ii) a search step that maps reads to these markers to quantify the relative abundance of their associated proteins. After evaluating ShortBRED on synthetic data, we applied it to profile antibiotic resistance protein families in the gut microbiomes of individuals from the United States, China, Malawi, and Venezuela. Our results support antibiotic resistance as a core function in the human gut microbiome, with tetracycline-resistant ribosomal protection proteins and Class A beta-lactamases being the most widely distributed resistance mechanisms worldwide. ShortBRED markers are applicable to other homology-based search tasks, which we demonstrate here by identifying phylogenetic signatures of antibiotic resistance across more than 3,000 microbial isolate genomes. ShortBRED can be applied to profile a wide variety of protein families of interest; the software, source code, and documentation are available for download at http://huttenhower.sph.harvard.edu/shortbred.https://doi.org/10.1371/journal.pcbi.1004557
collection DOAJ
language English
format Article
sources DOAJ
author James Kaminski
Molly K Gibson
Eric A Franzosa
Nicola Segata
Gautam Dantas
Curtis Huttenhower
spellingShingle James Kaminski
Molly K Gibson
Eric A Franzosa
Nicola Segata
Gautam Dantas
Curtis Huttenhower
High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED.
PLoS Computational Biology
author_facet James Kaminski
Molly K Gibson
Eric A Franzosa
Nicola Segata
Gautam Dantas
Curtis Huttenhower
author_sort James Kaminski
title High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED.
title_short High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED.
title_full High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED.
title_fullStr High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED.
title_full_unstemmed High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED.
title_sort high-specificity targeted functional profiling in microbial communities with shortbred.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2015-12-01
description Profiling microbial community function from metagenomic sequencing data remains a computationally challenging problem. Mapping millions of DNA reads from such samples to reference protein databases requires long run-times, and short read lengths can result in spurious hits to unrelated proteins (loss of specificity). We developed ShortBRED (Short, Better Representative Extract Dataset) to address these challenges, facilitating fast, accurate functional profiling of metagenomic samples. ShortBRED consists of two components: (i) a method that reduces reference proteins of interest to short, highly representative amino acid sequences ("markers") and (ii) a search step that maps reads to these markers to quantify the relative abundance of their associated proteins. After evaluating ShortBRED on synthetic data, we applied it to profile antibiotic resistance protein families in the gut microbiomes of individuals from the United States, China, Malawi, and Venezuela. Our results support antibiotic resistance as a core function in the human gut microbiome, with tetracycline-resistant ribosomal protection proteins and Class A beta-lactamases being the most widely distributed resistance mechanisms worldwide. ShortBRED markers are applicable to other homology-based search tasks, which we demonstrate here by identifying phylogenetic signatures of antibiotic resistance across more than 3,000 microbial isolate genomes. ShortBRED can be applied to profile a wide variety of protein families of interest; the software, source code, and documentation are available for download at http://huttenhower.sph.harvard.edu/shortbred.
url https://doi.org/10.1371/journal.pcbi.1004557
work_keys_str_mv AT jameskaminski highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT mollykgibson highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT ericafranzosa highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT nicolasegata highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT gautamdantas highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT curtishuttenhower highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
_version_ 1714667230360240128