LinkedImm: a linked data graph database for integrating immunological data
Abstract Background Many systems biology studies leverage the integration of multiple data types (across different data sources) to offer a more comprehensive view of the biological system being studied. While SQL (Structured Query Language) databases are popular in the biomedical domain, NoSQL data...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-08-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-021-04031-9 |
id |
doaj-1c737d6f0b8e4731b3f689c8787d4e2b |
---|---|
record_format |
Article |
spelling |
doaj-1c737d6f0b8e4731b3f689c8787d4e2b2021-08-29T11:15:33ZengBMCBMC Bioinformatics1471-21052021-08-0122S911410.1186/s12859-021-04031-9LinkedImm: a linked data graph database for integrating immunological dataSyed Ahmad Chan Bukhari0Shrikant Pawar1Jeff Mandell2Steven H. Kleinstein3Kei-Hoi Cheung4Division of Computer Science, Mathematics and Science, Collins College of Professional Studies, St. John’s UniversityDepartment of Genetics, Yale School of MedicineProgram in Computational Biology and Bioinformatics, Yale School of MedicineProgram in Computational Biology and Bioinformatics, Yale School of MedicineProgram in Computational Biology and Bioinformatics, Yale School of MedicineAbstract Background Many systems biology studies leverage the integration of multiple data types (across different data sources) to offer a more comprehensive view of the biological system being studied. While SQL (Structured Query Language) databases are popular in the biomedical domain, NoSQL database technologies have been used as a more relationship-based, flexible and scalable method of data integration. Results We have created a graph database integrating data from multiple sources. In addition to using a graph-based query language (Cypher) for data retrieval, we have developed a web-based dashboard that allows users to easily browse and plot data without the need to learn Cypher. We have also implemented a visual graph query interface for users to browse graph data. Finally, we have built a prototype to allow the user to query the graph database in natural language. Conclusion We have demonstrated the feasibility and flexibility of using a graph database for storing and querying immunological data with complex biological relationships. Querying a graph database through such relationships has the potential to discover novel relationships among heterogeneous biological data and metadata.https://doi.org/10.1186/s12859-021-04031-9OntologyKnowledgebaseGraph databaseImmunologyInfluenza vaccine |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Syed Ahmad Chan Bukhari Shrikant Pawar Jeff Mandell Steven H. Kleinstein Kei-Hoi Cheung |
spellingShingle |
Syed Ahmad Chan Bukhari Shrikant Pawar Jeff Mandell Steven H. Kleinstein Kei-Hoi Cheung LinkedImm: a linked data graph database for integrating immunological data BMC Bioinformatics Ontology Knowledgebase Graph database Immunology Influenza vaccine |
author_facet |
Syed Ahmad Chan Bukhari Shrikant Pawar Jeff Mandell Steven H. Kleinstein Kei-Hoi Cheung |
author_sort |
Syed Ahmad Chan Bukhari |
title |
LinkedImm: a linked data graph database for integrating immunological data |
title_short |
LinkedImm: a linked data graph database for integrating immunological data |
title_full |
LinkedImm: a linked data graph database for integrating immunological data |
title_fullStr |
LinkedImm: a linked data graph database for integrating immunological data |
title_full_unstemmed |
LinkedImm: a linked data graph database for integrating immunological data |
title_sort |
linkedimm: a linked data graph database for integrating immunological data |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2021-08-01 |
description |
Abstract Background Many systems biology studies leverage the integration of multiple data types (across different data sources) to offer a more comprehensive view of the biological system being studied. While SQL (Structured Query Language) databases are popular in the biomedical domain, NoSQL database technologies have been used as a more relationship-based, flexible and scalable method of data integration. Results We have created a graph database integrating data from multiple sources. In addition to using a graph-based query language (Cypher) for data retrieval, we have developed a web-based dashboard that allows users to easily browse and plot data without the need to learn Cypher. We have also implemented a visual graph query interface for users to browse graph data. Finally, we have built a prototype to allow the user to query the graph database in natural language. Conclusion We have demonstrated the feasibility and flexibility of using a graph database for storing and querying immunological data with complex biological relationships. Querying a graph database through such relationships has the potential to discover novel relationships among heterogeneous biological data and metadata. |
topic |
Ontology Knowledgebase Graph database Immunology Influenza vaccine |
url |
https://doi.org/10.1186/s12859-021-04031-9 |
work_keys_str_mv |
AT syedahmadchanbukhari linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata AT shrikantpawar linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata AT jeffmandell linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata AT stevenhkleinstein linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata AT keihoicheung linkedimmalinkeddatagraphdatabaseforintegratingimmunologicaldata |
_version_ |
1721187010480177152 |