Five simple guidelines for establishing basic authenticity and reliability of newly generated fungal ITS sequences

Molecular data form an important research tool in most branches of mycology. A non-trivial proportion of the public fungal DNA sequences are, however, compromised in terms of quality and reliability, contributing noise and bias to sequence-borne inferences such as phylogenetic analysis, diversity as...

Bibliographic Details
Main Authors: R. Henrik Nilsson, Leho Tedersoo, Kessy Abarenkov, Martin Ryberg, Erik Kristiansson, Martin Hartmann, Conrad L. Schoch, Johan A. A. Nylander, Johannes Bergsten, Teresita M. Porter, Ari Jumpponen, Parag Vaishampayan, Otso Ovaskainen, Nils Hallenberg, Johan Bengtsson-Palme, K. Martin Eriksson, Karl-Henrik Larsson, Ellen Larsson, Urmas Kõljalg
Format: Article
Language: English
Published: Pensoft Publishers 2012-09-01
Series: MycoKeys
Online Access: http://mycokeys.pensoft.net/lib/ajax_srv/article_elements_srv.php?action=download_pdf&item_id=1186
id doaj-91fd1a8d5e1749d4a33d539bf6328b9c
record_format Article
spelling doaj-91fd1a8d5e1749d4a33d539bf6328b9c 2020-11-24T23:24:38Z eng Pensoft Publishers MycoKeys 1314-4057 1314-4049 2012-09-01 4 0 37 63 10.3897/mycokeys.4.3606 1186
collection DOAJ
language English
format Article
sources DOAJ
author R. Henrik Nilsson
Leho Tedersoo
Kessy Abarenkov
Martin Ryberg
Erik Kristiansson
Martin Hartmann
Conrad L. Schoch
Johan A. A. Nylander
Johannes Bergsten
Teresita M. Porter
Ari Jumpponen
Parag Vaishampayan
Otso Ovaskainen
Nils Hallenberg
Johan Bengtsson-Palme
K. Martin Eriksson
Karl-Henrik Larsson
Ellen Larsson
Urmas Kõljalg
author_sort R. Henrik Nilsson
title Five simple guidelines for establishing basic authenticity and reliability of newly generated fungal ITS sequences
publisher Pensoft Publishers
series MycoKeys
issn 1314-4057
1314-4049
publishDate 2012-09-01
description Molecular data form an important research tool in most branches of mycology. A non-trivial proportion of the public fungal DNA sequences are, however, compromised in terms of quality and reliability, contributing noise and bias to sequence-borne inferences such as phylogenetic analysis, diversity assessment, and barcoding. In this paper we discuss various aspects and pitfalls of sequence quality assessment. Based on our observations, we provide a set of guidelines to assist in manual quality management of newly generated, near-full-length (Sanger-derived) fungal ITS sequences and to some extent also sequences of shorter read lengths, other genes or markers, and groups of organisms. The guidelines are intentionally non-technical and do not require substantial bioinformatics skills or significant computational power. Despite their simple nature, we feel they would have caught the vast majority of the severely compromised ITS sequences in the public corpus. Our guidelines are nevertheless not infallible, and common sense and intuition remain important elements in the pursuit of compromised sequence data. The guidelines focus on basic sequence authenticity and reliability of the newly generated sequences, and the user may want to consider additional resources and steps to accomplish the best possible quality control. A discussion on the technical resources for further sequence quality management is therefore provided in the supplementary material.
url http://mycokeys.pensoft.net/lib/ajax_srv/article_elements_srv.php?action=download_pdf&item_id=1186
_version_ 1725559665635360768