EukRef: Phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution.

Environmental sequencing has greatly expanded our knowledge of micro-eukaryotic diversity and ecology by revealing previously unknown lineages and their distribution. However, the value of these data is critically dependent on the quality of the reference databases used to assign an identity to envi...

Full description

Bibliographic Details
Main Authors: Javier Del Campo, Martin Kolisko, Vittorio Boscaro, Luciana F Santoferrara, Serafim Nenarokov, Ramon Massana, Laure Guillou, Alastair Simpson, Cedric Berney, Colomban de Vargas, Matthew W Brown, Patrick J Keeling, Laura Wegener Parfrey
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-09-01
Series:PLoS Biology
Online Access:http://europepmc.org/articles/PMC6160240?pdf=render
id doaj-f1e5fe69263c476fa2c9e9ebae18ef34
record_format Article
spelling doaj-f1e5fe69263c476fa2c9e9ebae18ef342021-07-02T11:33:16ZengPublic Library of Science (PLoS)PLoS Biology1544-91731545-78852018-09-01169e200584910.1371/journal.pbio.2005849EukRef: Phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution.Javier Del CampoMartin KoliskoVittorio BoscaroLuciana F SantoferraraSerafim NenarokovRamon MassanaLaure GuillouAlastair SimpsonCedric BerneyColomban de VargasMatthew W BrownPatrick J KeelingLaura Wegener ParfreyEnvironmental sequencing has greatly expanded our knowledge of micro-eukaryotic diversity and ecology by revealing previously unknown lineages and their distribution. However, the value of these data is critically dependent on the quality of the reference databases used to assign an identity to environmental sequences. Existing databases contain errors and struggle to keep pace with rapidly changing eukaryotic taxonomy, the influx of novel diversity, and computational challenges related to assembling the high-quality alignments and trees needed for accurate characterization of lineage diversity. EukRef (eukref.org) is an ongoing community-driven initiative that addresses these challenges by bringing together taxonomists with expertise spanning the eukaryotic tree of life and microbial ecologists, who use environmental sequence data to develop reliable reference databases across the diversity of microbial eukaryotes. EukRef organizes and facilitates rigorous mining and annotation of sequence data by providing protocols, guidelines, and tools. The EukRef pipeline and tools allow users interested in a particular group of microbial eukaryotes to retrieve all sequences belonging to that group from International Nucleotide Sequence Database Collaboration (INSDC) (GenBank, the European Nucleotide Archive [ENA], or the DNA DataBank of Japan [DDBJ]), to place those sequences in a phylogenetic tree, and to curate taxonomic and environmental information for the group. We provide guidelines to facilitate the process and to standardize taxonomic annotations. The final outputs of this process are (1) a reference tree and alignment, (2) a reference sequence database, including taxonomic and environmental information, and (3) a list of putative chimeras and other artifactual sequences. These products will be useful for the broad community as they become publicly available (at eukref.org) and are shared with existing reference databases.http://europepmc.org/articles/PMC6160240?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Javier Del Campo
Martin Kolisko
Vittorio Boscaro
Luciana F Santoferrara
Serafim Nenarokov
Ramon Massana
Laure Guillou
Alastair Simpson
Cedric Berney
Colomban de Vargas
Matthew W Brown
Patrick J Keeling
Laura Wegener Parfrey
spellingShingle Javier Del Campo
Martin Kolisko
Vittorio Boscaro
Luciana F Santoferrara
Serafim Nenarokov
Ramon Massana
Laure Guillou
Alastair Simpson
Cedric Berney
Colomban de Vargas
Matthew W Brown
Patrick J Keeling
Laura Wegener Parfrey
EukRef: Phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution.
PLoS Biology
author_facet Javier Del Campo
Martin Kolisko
Vittorio Boscaro
Luciana F Santoferrara
Serafim Nenarokov
Ramon Massana
Laure Guillou
Alastair Simpson
Cedric Berney
Colomban de Vargas
Matthew W Brown
Patrick J Keeling
Laura Wegener Parfrey
author_sort Javier Del Campo
title EukRef: Phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution.
title_short EukRef: Phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution.
title_full EukRef: Phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution.
title_fullStr EukRef: Phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution.
title_full_unstemmed EukRef: Phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution.
title_sort eukref: phylogenetic curation of ribosomal rna to enhance understanding of eukaryotic diversity and distribution.
publisher Public Library of Science (PLoS)
series PLoS Biology
issn 1544-9173
1545-7885
publishDate 2018-09-01
description Environmental sequencing has greatly expanded our knowledge of micro-eukaryotic diversity and ecology by revealing previously unknown lineages and their distribution. However, the value of these data is critically dependent on the quality of the reference databases used to assign an identity to environmental sequences. Existing databases contain errors and struggle to keep pace with rapidly changing eukaryotic taxonomy, the influx of novel diversity, and computational challenges related to assembling the high-quality alignments and trees needed for accurate characterization of lineage diversity. EukRef (eukref.org) is an ongoing community-driven initiative that addresses these challenges by bringing together taxonomists with expertise spanning the eukaryotic tree of life and microbial ecologists, who use environmental sequence data to develop reliable reference databases across the diversity of microbial eukaryotes. EukRef organizes and facilitates rigorous mining and annotation of sequence data by providing protocols, guidelines, and tools. The EukRef pipeline and tools allow users interested in a particular group of microbial eukaryotes to retrieve all sequences belonging to that group from International Nucleotide Sequence Database Collaboration (INSDC) (GenBank, the European Nucleotide Archive [ENA], or the DNA DataBank of Japan [DDBJ]), to place those sequences in a phylogenetic tree, and to curate taxonomic and environmental information for the group. We provide guidelines to facilitate the process and to standardize taxonomic annotations. The final outputs of this process are (1) a reference tree and alignment, (2) a reference sequence database, including taxonomic and environmental information, and (3) a list of putative chimeras and other artifactual sequences. These products will be useful for the broad community as they become publicly available (at eukref.org) and are shared with existing reference databases.
url http://europepmc.org/articles/PMC6160240?pdf=render
work_keys_str_mv AT javierdelcampo eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT martinkolisko eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT vittorioboscaro eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT lucianafsantoferrara eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT serafimnenarokov eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT ramonmassana eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT laureguillou eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT alastairsimpson eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT cedricberney eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT colombandevargas eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT matthewwbrown eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT patrickjkeeling eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
AT laurawegenerparfrey eukrefphylogeneticcurationofribosomalrnatoenhanceunderstandingofeukaryoticdiversityanddistribution
_version_ 1721331049823207424