Network graph representation of COVID-19 scientific publications to aid knowledge discovery

Introduction Numerous scientific journal articles related to COVID-19 have been rapidly published, making navigation and understanding of relationships difficult.Methods A graph network was constructed from the publicly available COVID-19 Open Research Dataset (CORD-19) of COVID-19-related publicati...

Full description

Bibliographic Details
Main Authors: Neil J Sebire, George Cernile, Trevor Heritage, Taralyn Schwering, Shana Kazemlou, Yulia Borecki
Format: Article
Language:English
Published: BMJ Publishing Group 2021-07-01
Series:BMJ Health & Care Informatics
Online Access:https://informatics.bmj.com/content/28/1/e100254.full
id doaj-184e93895f434b01b31f84f5c9600165
record_format Article
spelling doaj-184e93895f434b01b31f84f5c96001652021-07-31T09:30:06ZengBMJ Publishing GroupBMJ Health & Care Informatics2632-10092021-07-0128110.1136/bmjhci-2020-100254Network graph representation of COVID-19 scientific publications to aid knowledge discoveryNeil J Sebire0George Cernile1Trevor Heritage2Taralyn Schwering3Shana Kazemlou4Yulia Borecki5HDRUK, London, UKInspirata, Tampa, Florida, USAInspirata, Tampa, Florida, USAInspirata, Tampa, Florida, USAInspirata, Tampa, Florida, USAInspirata, Tampa, Florida, USAIntroduction Numerous scientific journal articles related to COVID-19 have been rapidly published, making navigation and understanding of relationships difficult.Methods A graph network was constructed from the publicly available COVID-19 Open Research Dataset (CORD-19) of COVID-19-related publications using an engine leveraging medical knowledge bases to identify discrete medical concepts and an open-source tool (Gephi) to visualise the network.Results The network shows connections between diseases, medications and procedures identified from the title and abstract of 195 958 COVID-19-related publications (CORD-19 Dataset). Connections between terms with few publications, those unconnected to the main network and those irrelevant were not displayed. Nodes were coloured by knowledge base and the size of the node related to the number of publications containing the term. The data set and visualisations were made publicly accessible via a webtool.Conclusion Knowledge management approaches (text mining and graph networks) can effectively allow rapid navigation and exploration of entity inter-relationships to improve understanding of diseases such as COVID-19.https://informatics.bmj.com/content/28/1/e100254.full
collection DOAJ
language English
format Article
sources DOAJ
author Neil J Sebire
George Cernile
Trevor Heritage
Taralyn Schwering
Shana Kazemlou
Yulia Borecki
spellingShingle Neil J Sebire
George Cernile
Trevor Heritage
Taralyn Schwering
Shana Kazemlou
Yulia Borecki
Network graph representation of COVID-19 scientific publications to aid knowledge discovery
BMJ Health & Care Informatics
author_facet Neil J Sebire
George Cernile
Trevor Heritage
Taralyn Schwering
Shana Kazemlou
Yulia Borecki
author_sort Neil J Sebire
title Network graph representation of COVID-19 scientific publications to aid knowledge discovery
title_short Network graph representation of COVID-19 scientific publications to aid knowledge discovery
title_full Network graph representation of COVID-19 scientific publications to aid knowledge discovery
title_fullStr Network graph representation of COVID-19 scientific publications to aid knowledge discovery
title_full_unstemmed Network graph representation of COVID-19 scientific publications to aid knowledge discovery
title_sort network graph representation of covid-19 scientific publications to aid knowledge discovery
publisher BMJ Publishing Group
series BMJ Health & Care Informatics
issn 2632-1009
publishDate 2021-07-01
description Introduction Numerous scientific journal articles related to COVID-19 have been rapidly published, making navigation and understanding of relationships difficult.Methods A graph network was constructed from the publicly available COVID-19 Open Research Dataset (CORD-19) of COVID-19-related publications using an engine leveraging medical knowledge bases to identify discrete medical concepts and an open-source tool (Gephi) to visualise the network.Results The network shows connections between diseases, medications and procedures identified from the title and abstract of 195 958 COVID-19-related publications (CORD-19 Dataset). Connections between terms with few publications, those unconnected to the main network and those irrelevant were not displayed. Nodes were coloured by knowledge base and the size of the node related to the number of publications containing the term. The data set and visualisations were made publicly accessible via a webtool.Conclusion Knowledge management approaches (text mining and graph networks) can effectively allow rapid navigation and exploration of entity inter-relationships to improve understanding of diseases such as COVID-19.
url https://informatics.bmj.com/content/28/1/e100254.full
work_keys_str_mv AT neiljsebire networkgraphrepresentationofcovid19scientificpublicationstoaidknowledgediscovery
AT georgecernile networkgraphrepresentationofcovid19scientificpublicationstoaidknowledgediscovery
AT trevorheritage networkgraphrepresentationofcovid19scientificpublicationstoaidknowledgediscovery
AT taralynschwering networkgraphrepresentationofcovid19scientificpublicationstoaidknowledgediscovery
AT shanakazemlou networkgraphrepresentationofcovid19scientificpublicationstoaidknowledgediscovery
AT yuliaborecki networkgraphrepresentationofcovid19scientificpublicationstoaidknowledgediscovery
_version_ 1721246901740765184