LinkedImm: a linked data graph database for integrating immunological data

Abstract Background Many systems biology studies leverage the integration of multiple data types (across different data sources) to offer a more comprehensive view of the biological system being studied. While SQL (Structured Query Language) databases are popular in the biomedical domain, NoSQL data...

Full description

Bibliographic Details
Main Authors: Syed Ahmad Chan Bukhari, Shrikant Pawar, Jeff Mandell, Steven H. Kleinstein, Kei-Hoi Cheung
Format: Article
Language:English
Published: BMC 2021-08-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-021-04031-9
id doaj-1c737d6f0b8e4731b3f689c8787d4e2b
record_format Article
spelling doaj-1c737d6f0b8e4731b3f689c8787d4e2b2021-08-29T11:15:33ZengBMCBMC Bioinformatics1471-21052021-08-0122S911410.1186/s12859-021-04031-9LinkedImm: a linked data graph database for integrating immunological dataSyed Ahmad Chan Bukhari0Shrikant Pawar1Jeff Mandell2Steven H. Kleinstein3Kei-Hoi Cheung4Division of Computer Science, Mathematics and Science, Collins College of Professional Studies, St. John’s UniversityDepartment of Genetics, Yale School of MedicineProgram in Computational Biology and Bioinformatics, Yale School of MedicineProgram in Computational Biology and Bioinformatics, Yale School of MedicineProgram in Computational Biology and Bioinformatics, Yale School of MedicineAbstract Background Many systems biology studies leverage the integration of multiple data types (across different data sources) to offer a more comprehensive view of the biological system being studied. While SQL (Structured Query Language) databases are popular in the biomedical domain, NoSQL database technologies have been used as a more relationship-based, flexible and scalable method of data integration. Results We have created a graph database integrating data from multiple sources. In addition to using a graph-based query language (Cypher) for data retrieval, we have developed a web-based dashboard that allows users to easily browse and plot data without the need to learn Cypher. We have also implemented a visual graph query interface for users to browse graph data. Finally, we have built a prototype to allow the user to query the graph database in natural language. Conclusion We have demonstrated the feasibility and flexibility of using a graph database for storing and querying immunological data with complex biological relationships. Querying a graph database through such relationships has the potential to discover novel relationships among heterogeneous biological data and metadata.https://doi.org/10.1186/s12859-021-04031-9OntologyKnowledgebaseGraph databaseImmunologyInfluenza vaccine
collection DOAJ
language English
format Article
sources DOAJ
author Syed Ahmad Chan Bukhari
Shrikant Pawar
Jeff Mandell
Steven H. Kleinstein
Kei-Hoi Cheung
spellingShingle Syed Ahmad Chan Bukhari
Shrikant Pawar
Jeff Mandell
Steven H. Kleinstein
Kei-Hoi Cheung
LinkedImm: a linked data graph database for integrating immunological data
BMC Bioinformatics
Ontology
Knowledgebase
Graph database
Immunology
Influenza vaccine
author_facet Syed Ahmad Chan Bukhari
Shrikant Pawar
Jeff Mandell
Steven H. Kleinstein
Kei-Hoi Cheung
author_sort Syed Ahmad Chan Bukhari
title LinkedImm: a linked data graph database for integrating immunological data
title_short LinkedImm: a linked data graph database for integrating immunological data
title_full LinkedImm: a linked data graph database for integrating immunological data
title_fullStr LinkedImm: a linked data graph database for integrating immunological data
title_full_unstemmed LinkedImm: a linked data graph database for integrating immunological data
title_sort linkedimm: a linked data graph database for integrating immunological data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2021-08-01
description Abstract Background Many systems biology studies leverage the integration of multiple data types (across different data sources) to offer a more comprehensive view of the biological system being studied. While SQL (Structured Query Language) databases are popular in the biomedical domain, NoSQL database technologies have been used as a more relationship-based, flexible and scalable method of data integration. Results We have created a graph database integrating data from multiple sources. In addition to using a graph-based query language (Cypher) for data retrieval, we have developed a web-based dashboard that allows users to easily browse and plot data without the need to learn Cypher. We have also implemented a visual graph query interface for users to browse graph data. Finally, we have built a prototype to allow the user to query the graph database in natural language. Conclusion We have demonstrated the feasibility and flexibility of using a graph database for storing and querying immunological data with complex biological relationships. Querying a graph database through such relationships has the potential to discover novel relationships among heterogeneous biological data and metadata.
topic Ontology
Knowledgebase
Graph database
Immunology
Influenza vaccine
url https://doi.org/10.1186/s12859-021-04031-9
work_keys_str_mv AT syedahmadchanbukhari linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata
AT shrikantpawar linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata
AT jeffmandell linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata
AT stevenhkleinstein linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata
AT keihoicheung linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata
_version_ 1721187010480177152