Agronomic Linked Data (AgroLD): A knowledge-based system to enable integrative biology in agronomy.

Recent advances in high-throughput technologies have resulted in a tremendous increase in the amount of omics data produced in plant science. This increase, combined with the heterogeneity and variability of the data, makes adopting an integrative research approach a major challenge.

Bibliographic Details
Main Authors: Aravind Venkatesan, Gildas Tagny Ngompe, Nordine El Hassouni, Imene Chentli, Valentin Guignon, Clement Jonquet, Manuel Ruiz, Pierre Larmande
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2018-01-01
Series: PLoS ONE
Online Access: https://doi.org/10.1371/journal.pone.0198270
id doaj-6c7f1f21cb6f4d838f551020647c9767
record_format Article
collection DOAJ
language English
format Article
sources DOAJ
author Aravind Venkatesan
Gildas Tagny Ngompe
Nordine El Hassouni
Imene Chentli
Valentin Guignon
Clement Jonquet
Manuel Ruiz
Pierre Larmande
title Agronomic Linked Data (AgroLD): A knowledge-based system to enable integrative biology in agronomy.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2018-01-01
description Recent advances in high-throughput technologies have resulted in a tremendous increase in the amount of omics data produced in plant science. This increase, combined with the heterogeneity and variability of the data, makes adopting an integrative research approach a major challenge. We are facing an urgent need to effectively integrate and assimilate complementary datasets to understand the biological system as a whole. The Semantic Web offers technologies for integrating heterogeneous data and transforming them into explicit knowledge by means of ontologies. We have developed Agronomic Linked Data (AgroLD, www.agrold.org), a knowledge-based system that relies on Semantic Web technologies and exploits standard domain ontologies to integrate data about plant species of high interest to the plant science community, e.g., rice, wheat, and Arabidopsis. We present some integration results of the project, which initially focused on genomics, proteomics and phenomics. AgroLD is now an RDF (Resource Description Framework) knowledge base of 100M triples created by annotating and integrating more than 50 datasets from 10 data sources, such as Gramene.org and TropGeneDB, with 10 ontologies, such as the Gene Ontology and Plant Trait Ontology. Our evaluation results show that users appreciate the multiple query modes, which support different use cases. AgroLD's objective is to offer a domain-specific knowledge platform for solving complex biological and agronomical questions related to the implication of genes/proteins in, for instance, plant disease resistance or high-yield traits. We expect the resolution of these questions to facilitate the formulation of new scientific hypotheses to be validated with a knowledge-oriented approach.
url https://doi.org/10.1371/journal.pone.0198270