Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomes

Abstract Background With the availability of complete genome sequences of both human and non-human Plasmodium parasites, it is now possible to use comparative genomics to look for orthology across Plasmodium species and for species specific genes. This comparative analyses could provide important cl...

Full description

Bibliographic Details
Main Authors: Juliana Bernardes, Catherine Vaquero, Alessandra Carbone
Format: Article
Language:English
Published: BMC 2017-06-01
Series:Malaria Journal
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12936-017-1887-8
id doaj-ea449857e4d845289034d4874066285a
record_format Article
spelling doaj-ea449857e4d845289034d4874066285a2020-11-24T21:41:21ZengBMCMalaria Journal1475-28752017-06-0116111010.1186/s12936-017-1887-8Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomesJuliana Bernardes0Catherine Vaquero1Alessandra Carbone2Laboratoire de Biologie Computationnelle et Quantitative, UMR 7238, IBPS, CNRS, UPMC Univ-Paris 6, Sorbonne UniversitésCentre d’ Immunologie et des Maladies Infectieuses (CIMI-Paris), CNRS ERL 8255, INSERM U1135, UPMC Univ-Paris 6, Sorbonne UniversitésLaboratoire de Biologie Computationnelle et Quantitative, UMR 7238, IBPS, CNRS, UPMC Univ-Paris 6, Sorbonne UniversitésAbstract Background With the availability of complete genome sequences of both human and non-human Plasmodium parasites, it is now possible to use comparative genomics to look for orthology across Plasmodium species and for species specific genes. This comparative analyses could provide important clues for the development of new strategies to prevent and treat malaria in humans, however, the number of functionally annotated proteins is still low for all Plasmodium species. In the context of genomes that are hard to annotate because of sequence divergence, such as Plasmodium, domain co-occurrence becomes particularly important to trust predictions. In particular, domain architecture prediction can be used to improve the performance of existing annotation methods since homologous proteins might share their architectural context. Results Plasmobase is a unique database designed for the comparative study of Plasmodium genomes. Domain architecture reconstruction in Plasmobase relies on DAMA, the state-of-the-art method in architecture prediction, while domain annotation is realised with CLADE, a novel annotation tool based on a multi-source strategy. Plasmobase significantly increases the Pfam domain coverage of all Plasmodium genomes, it proposes new domain architectures as well as new domain families that have never been reported before for these genomes. It proposes a visualization of domain architectures and allows for an easy comparison among architectures within Plasmodium species and with other species, described in UniProt. Conclusions Plasmobase is a valuable new resource for domain annotation in Plasmodium genomes. Its graphical presentation of protein sequences, based on domain architectures, will hopefully be of interest for comparative genomic studies. It should help to discover species-specific genes, possibly underlying important phenotypic differences between parasites, and orthologous gene families for deciphering the biology of these complex and important Apicomplexan organisms. In conclusion, Plasmobase is a flexible and rich site where any biologist can find something of his/her own interest. Availability Plasmobase is accessible at http://genome.lcqb.upmc.fr/plasmobase/ .http://link.springer.com/article/10.1186/s12936-017-1887-8PlasmodiumGenomeProtein architectureDomain architecture predictionAnnotationGenome comparison
collection DOAJ
language English
format Article
sources DOAJ
author Juliana Bernardes
Catherine Vaquero
Alessandra Carbone
spellingShingle Juliana Bernardes
Catherine Vaquero
Alessandra Carbone
Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomes
Malaria Journal
Plasmodium
Genome
Protein architecture
Domain architecture prediction
Annotation
Genome comparison
author_facet Juliana Bernardes
Catherine Vaquero
Alessandra Carbone
author_sort Juliana Bernardes
title Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomes
title_short Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomes
title_full Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomes
title_fullStr Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomes
title_full_unstemmed Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomes
title_sort plasmobase: a comparative database of predicted domain architectures for plasmodium genomes
publisher BMC
series Malaria Journal
issn 1475-2875
publishDate 2017-06-01
description Abstract Background With the availability of complete genome sequences of both human and non-human Plasmodium parasites, it is now possible to use comparative genomics to look for orthology across Plasmodium species and for species specific genes. This comparative analyses could provide important clues for the development of new strategies to prevent and treat malaria in humans, however, the number of functionally annotated proteins is still low for all Plasmodium species. In the context of genomes that are hard to annotate because of sequence divergence, such as Plasmodium, domain co-occurrence becomes particularly important to trust predictions. In particular, domain architecture prediction can be used to improve the performance of existing annotation methods since homologous proteins might share their architectural context. Results Plasmobase is a unique database designed for the comparative study of Plasmodium genomes. Domain architecture reconstruction in Plasmobase relies on DAMA, the state-of-the-art method in architecture prediction, while domain annotation is realised with CLADE, a novel annotation tool based on a multi-source strategy. Plasmobase significantly increases the Pfam domain coverage of all Plasmodium genomes, it proposes new domain architectures as well as new domain families that have never been reported before for these genomes. It proposes a visualization of domain architectures and allows for an easy comparison among architectures within Plasmodium species and with other species, described in UniProt. Conclusions Plasmobase is a valuable new resource for domain annotation in Plasmodium genomes. Its graphical presentation of protein sequences, based on domain architectures, will hopefully be of interest for comparative genomic studies. It should help to discover species-specific genes, possibly underlying important phenotypic differences between parasites, and orthologous gene families for deciphering the biology of these complex and important Apicomplexan organisms. In conclusion, Plasmobase is a flexible and rich site where any biologist can find something of his/her own interest. Availability Plasmobase is accessible at http://genome.lcqb.upmc.fr/plasmobase/ .
topic Plasmodium
Genome
Protein architecture
Domain architecture prediction
Annotation
Genome comparison
url http://link.springer.com/article/10.1186/s12936-017-1887-8
work_keys_str_mv AT julianabernardes plasmobaseacomparativedatabaseofpredicteddomainarchitecturesforplasmodiumgenomes
AT catherinevaquero plasmobaseacomparativedatabaseofpredicteddomainarchitecturesforplasmodiumgenomes
AT alessandracarbone plasmobaseacomparativedatabaseofpredicteddomainarchitecturesforplasmodiumgenomes
_version_ 1725922509267664896