Accuracy of microbial community diversity estimated by closed- and open-reference OTUs

Next-generation sequencing of 16S ribosomal RNA is widely used to survey microbial communities. Sequences are typically assigned to Operational Taxonomic Units (OTUs). Closed- and open-reference OTU assignment matches reads to a reference database at 97% identity (closed), then clusters unmatched re...

Full description

Bibliographic Details
Main Author:	Robert C. Edgar
Format:	Article
Language:	English
Published:	PeerJ Inc. 2017-10-01
Series:	PeerJ
Subjects:	OTU Alpha diversity Beta diversity QIIME Open-reference Closed-reference
Online Access:	https://peerj.com/articles/3889.pdf

id	doaj-117e612f8e424517bf48a31b4c384282
record_format	Article
spelling	doaj-117e612f8e424517bf48a31b4c3842822020-11-25T02:03:43ZengPeerJ Inc.PeerJ2167-83592017-10-015e388910.7717/peerj.3889Accuracy of microbial community diversity estimated by closed- and open-reference OTUsRobert C. Edgar0Sonoma, CA, United States of AmericaNext-generation sequencing of 16S ribosomal RNA is widely used to survey microbial communities. Sequences are typically assigned to Operational Taxonomic Units (OTUs). Closed- and open-reference OTU assignment matches reads to a reference database at 97% identity (closed), then clusters unmatched reads using a de novo method (open). Implementations of these methods in the QIIME package were tested on several mock community datasets with 20 strains using different sequencing technologies and primers. Richness (number of reported OTUs) was often greatly exaggerated, with hundreds or thousands of OTUs generated on Illumina datasets. Between-sample diversity was also found to be highly exaggerated in many cases, with weighted Jaccard distances between identical mock samples often close to one, indicating very low similarity. Non-overlapping hyper-variable regions in 70% of species were assigned to different OTUs. On mock communities with Illumina V4 reads, 56% to 88% of predicted genus names were false positives. Biological inferences obtained using these methods are therefore not reliable.https://peerj.com/articles/3889.pdfOTUAlpha diversityBeta diversityQIIMEOpen-referenceClosed-reference
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Robert C. Edgar
spellingShingle	Robert C. Edgar Accuracy of microbial community diversity estimated by closed- and open-reference OTUs PeerJ OTU Alpha diversity Beta diversity QIIME Open-reference Closed-reference
author_facet	Robert C. Edgar
author_sort	Robert C. Edgar
title	Accuracy of microbial community diversity estimated by closed- and open-reference OTUs
title_short	Accuracy of microbial community diversity estimated by closed- and open-reference OTUs
title_full	Accuracy of microbial community diversity estimated by closed- and open-reference OTUs
title_fullStr	Accuracy of microbial community diversity estimated by closed- and open-reference OTUs
title_full_unstemmed	Accuracy of microbial community diversity estimated by closed- and open-reference OTUs
title_sort	accuracy of microbial community diversity estimated by closed- and open-reference otus
publisher	PeerJ Inc.
series	PeerJ
issn	2167-8359
publishDate	2017-10-01
description	Next-generation sequencing of 16S ribosomal RNA is widely used to survey microbial communities. Sequences are typically assigned to Operational Taxonomic Units (OTUs). Closed- and open-reference OTU assignment matches reads to a reference database at 97% identity (closed), then clusters unmatched reads using a de novo method (open). Implementations of these methods in the QIIME package were tested on several mock community datasets with 20 strains using different sequencing technologies and primers. Richness (number of reported OTUs) was often greatly exaggerated, with hundreds or thousands of OTUs generated on Illumina datasets. Between-sample diversity was also found to be highly exaggerated in many cases, with weighted Jaccard distances between identical mock samples often close to one, indicating very low similarity. Non-overlapping hyper-variable regions in 70% of species were assigned to different OTUs. On mock communities with Illumina V4 reads, 56% to 88% of predicted genus names were false positives. Biological inferences obtained using these methods are therefore not reliable.
topic	OTU Alpha diversity Beta diversity QIIME Open-reference Closed-reference
url	https://peerj.com/articles/3889.pdf
work_keys_str_mv	AT robertcedgar accuracyofmicrobialcommunitydiversityestimatedbyclosedandopenreferenceotus
_version_	1724946166240509952

Accuracy of microbial community diversity estimated by closed- and open-reference OTUs

Similar Items