Collaborative Annotation Redefines Gene Sets for Crucial Phytopathogens

Accurate and comprehensive annotation of genomic sequences underpins advances in managing plant disease. However, important plant pathogens still have incomplete and inconsistent gene sets and lack dedicated funding or teams to improve this annotation. This paper describes a collaborative approach t...

Full description

Bibliographic Details
Main Authors: Helder Pedro, Andrew D. Yates, Paul J. Kersey, Nishadi H. De Silva
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-11-01
Series:Frontiers in Microbiology
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fmicb.2019.02477/full
id doaj-411c14aa6eee4d42842b542ed9bba0f7
record_format Article
spelling doaj-411c14aa6eee4d42842b542ed9bba0f72020-11-25T00:53:43ZengFrontiers Media S.A.Frontiers in Microbiology1664-302X2019-11-011010.3389/fmicb.2019.02477489155Collaborative Annotation Redefines Gene Sets for Crucial PhytopathogensHelder PedroAndrew D. YatesPaul J. KerseyNishadi H. De SilvaAccurate and comprehensive annotation of genomic sequences underpins advances in managing plant disease. However, important plant pathogens still have incomplete and inconsistent gene sets and lack dedicated funding or teams to improve this annotation. This paper describes a collaborative approach to gene curation to address this shortcoming. In the first instance, over 40 members of the Botrytis cinerea community from eight countries, with training and infrastructural support from Ensembl Fungi, used the gene editing tool Apollo to systematically review the entire gene set (11,707 protein coding genes) in 6–7 months. This has subsequently been checked and disseminated. Following this, a similar project for another pathogen, Blumeria graminis f. sp. hordei, also led to a completely redefined gene set. Currently, we are working with the Zymoseptoria tritici community to enable them to achieve the same. While the tangible outcome of these projects is improved gene sets, it is apparent that the inherent agreement and ownership of a single gene set by research teams as they undergo this curation process are consequential to the acceleration of research in the field. With the generation of large data sets increasingly affordable, there is value in unifying both the divergent data sets and their associated research teams, pooling time, expertise, and resources. Community-driven annotation efforts can pave the way for a new kind of collaboration among pathogen research communities to generate well-annotated reference data sets, beneficial not just for the genome being examined but for related species and the refinement of automatic gene prediction tools.https://www.frontiersin.org/article/10.3389/fmicb.2019.02477/fullcommunity gene annotationapolloEnsembl genomesfungiplant pathogens
collection DOAJ
language English
format Article
sources DOAJ
author Helder Pedro
Andrew D. Yates
Paul J. Kersey
Nishadi H. De Silva
spellingShingle Helder Pedro
Andrew D. Yates
Paul J. Kersey
Nishadi H. De Silva
Collaborative Annotation Redefines Gene Sets for Crucial Phytopathogens
Frontiers in Microbiology
community gene annotation
apollo
Ensembl genomes
fungi
plant pathogens
author_facet Helder Pedro
Andrew D. Yates
Paul J. Kersey
Nishadi H. De Silva
author_sort Helder Pedro
title Collaborative Annotation Redefines Gene Sets for Crucial Phytopathogens
title_short Collaborative Annotation Redefines Gene Sets for Crucial Phytopathogens
title_full Collaborative Annotation Redefines Gene Sets for Crucial Phytopathogens
title_fullStr Collaborative Annotation Redefines Gene Sets for Crucial Phytopathogens
title_full_unstemmed Collaborative Annotation Redefines Gene Sets for Crucial Phytopathogens
title_sort collaborative annotation redefines gene sets for crucial phytopathogens
publisher Frontiers Media S.A.
series Frontiers in Microbiology
issn 1664-302X
publishDate 2019-11-01
description Accurate and comprehensive annotation of genomic sequences underpins advances in managing plant disease. However, important plant pathogens still have incomplete and inconsistent gene sets and lack dedicated funding or teams to improve this annotation. This paper describes a collaborative approach to gene curation to address this shortcoming. In the first instance, over 40 members of the Botrytis cinerea community from eight countries, with training and infrastructural support from Ensembl Fungi, used the gene editing tool Apollo to systematically review the entire gene set (11,707 protein coding genes) in 6–7 months. This has subsequently been checked and disseminated. Following this, a similar project for another pathogen, Blumeria graminis f. sp. hordei, also led to a completely redefined gene set. Currently, we are working with the Zymoseptoria tritici community to enable them to achieve the same. While the tangible outcome of these projects is improved gene sets, it is apparent that the inherent agreement and ownership of a single gene set by research teams as they undergo this curation process are consequential to the acceleration of research in the field. With the generation of large data sets increasingly affordable, there is value in unifying both the divergent data sets and their associated research teams, pooling time, expertise, and resources. Community-driven annotation efforts can pave the way for a new kind of collaboration among pathogen research communities to generate well-annotated reference data sets, beneficial not just for the genome being examined but for related species and the refinement of automatic gene prediction tools.
topic community gene annotation
apollo
Ensembl genomes
fungi
plant pathogens
url https://www.frontiersin.org/article/10.3389/fmicb.2019.02477/full
work_keys_str_mv AT helderpedro collaborativeannotationredefinesgenesetsforcrucialphytopathogens
AT andrewdyates collaborativeannotationredefinesgenesetsforcrucialphytopathogens
AT pauljkersey collaborativeannotationredefinesgenesetsforcrucialphytopathogens
AT nishadihdesilva collaborativeannotationredefinesgenesetsforcrucialphytopathogens
_version_ 1725236858668974080