An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights

With an unprecedented growth in the biomedical literature, keeping up to date with the new developments presents an immense challenge. Publications are often studied in isolation of the established literature, with interpretation being subjective and often introducing human bias. With ontology-drive...

Full description

Bibliographic Details
Main Authors: Orges Koci, Michael Logan, Vaios Svolos, Richard K. Russell, Konstantinos Gerasimidis, Umer Zeeshan Ijaz
Format: Article
Language:English
Published: PeerJ Inc. 2018-07-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/5047.pdf
id doaj-31c7e001b2a746dbbeb9874a65141e99
record_format Article
spelling doaj-31c7e001b2a746dbbeb9874a65141e992020-11-24T20:51:58ZengPeerJ Inc.PeerJ2167-83592018-07-016e504710.7717/peerj.5047An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insightsOrges Koci0Michael Logan1Vaios Svolos2Richard K. Russell3Konstantinos Gerasimidis4Umer Zeeshan Ijaz5Human Nutrition, School of Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UKInfrastructure and Environment Research Division, School of Engineering, University of Glasgow, Glasgow, UKHuman Nutrition, School of Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UKDepartment of Paediatric Gastroenterology, Hepatology and Nutrition, Royal Hospital for Children, Glasgow, UKHuman Nutrition, School of Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UKInfrastructure and Environment Research Division, School of Engineering, University of Glasgow, Glasgow, UKWith an unprecedented growth in the biomedical literature, keeping up to date with the new developments presents an immense challenge. Publications are often studied in isolation of the established literature, with interpretation being subjective and often introducing human bias. With ontology-driven annotation of biomedical data gaining popularity in recent years and online databases offering metatags with rich textual information, it is now possible to automatically text-mine ontological terms and complement the laborious task of manual management, interpretation, and analysis of the accumulated literature with downstream statistical analysis. In this paper, we have formulated an automated workflow through which we have identified ontological information, including nutrition-related terms in PubMed abstracts (from 1991 to 2016) for two main types of Inflammatory Bowel Diseases: Crohn’s Disease and Ulcerative Colitis; and two other gastrointestinal (GI) diseases, namely, Coeliac Disease and Irritable Bowel Syndrome. Our analysis reveals unique clustering patterns as well as spatial and temporal trends inherent to the considered GI diseases in terms of literature that has been accumulated so far. Although automated interpretation cannot replace human judgement, the developed workflow shows promising results and can be a useful tool in systematic literature reviews. The workflow is available at https://github.com/KociOrges/pytag.https://peerj.com/articles/5047.pdfOntologyInflammatory bowel diseaseText miningEcological statisticsHuman nutritionOrdination
collection DOAJ
language English
format Article
sources DOAJ
author Orges Koci
Michael Logan
Vaios Svolos
Richard K. Russell
Konstantinos Gerasimidis
Umer Zeeshan Ijaz
spellingShingle Orges Koci
Michael Logan
Vaios Svolos
Richard K. Russell
Konstantinos Gerasimidis
Umer Zeeshan Ijaz
An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights
PeerJ
Ontology
Inflammatory bowel disease
Text mining
Ecological statistics
Human nutrition
Ordination
author_facet Orges Koci
Michael Logan
Vaios Svolos
Richard K. Russell
Konstantinos Gerasimidis
Umer Zeeshan Ijaz
author_sort Orges Koci
title An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights
title_short An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights
title_full An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights
title_fullStr An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights
title_full_unstemmed An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights
title_sort automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights
publisher PeerJ Inc.
series PeerJ
issn 2167-8359
publishDate 2018-07-01
description With an unprecedented growth in the biomedical literature, keeping up to date with the new developments presents an immense challenge. Publications are often studied in isolation of the established literature, with interpretation being subjective and often introducing human bias. With ontology-driven annotation of biomedical data gaining popularity in recent years and online databases offering metatags with rich textual information, it is now possible to automatically text-mine ontological terms and complement the laborious task of manual management, interpretation, and analysis of the accumulated literature with downstream statistical analysis. In this paper, we have formulated an automated workflow through which we have identified ontological information, including nutrition-related terms in PubMed abstracts (from 1991 to 2016) for two main types of Inflammatory Bowel Diseases: Crohn’s Disease and Ulcerative Colitis; and two other gastrointestinal (GI) diseases, namely, Coeliac Disease and Irritable Bowel Syndrome. Our analysis reveals unique clustering patterns as well as spatial and temporal trends inherent to the considered GI diseases in terms of literature that has been accumulated so far. Although automated interpretation cannot replace human judgement, the developed workflow shows promising results and can be a useful tool in systematic literature reviews. The workflow is available at https://github.com/KociOrges/pytag.
topic Ontology
Inflammatory bowel disease
Text mining
Ecological statistics
Human nutrition
Ordination
url https://peerj.com/articles/5047.pdf
work_keys_str_mv AT orgeskoci anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT michaellogan anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT vaiossvolos anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT richardkrussell anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT konstantinosgerasimidis anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT umerzeeshanijaz anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT orgeskoci automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT michaellogan automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT vaiossvolos automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT richardkrussell automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT konstantinosgerasimidis automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
AT umerzeeshanijaz automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights
_version_ 1716800695981572096