An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights
With an unprecedented growth in the biomedical literature, keeping up to date with the new developments presents an immense challenge. Publications are often studied in isolation of the established literature, with interpretation being subjective and often introducing human bias. With ontology-drive...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
PeerJ Inc.
2018-07-01
|
Series: | PeerJ |
Subjects: | |
Online Access: | https://peerj.com/articles/5047.pdf |
id |
doaj-31c7e001b2a746dbbeb9874a65141e99 |
---|---|
record_format |
Article |
spelling |
doaj-31c7e001b2a746dbbeb9874a65141e992020-11-24T20:51:58ZengPeerJ Inc.PeerJ2167-83592018-07-016e504710.7717/peerj.5047An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insightsOrges Koci0Michael Logan1Vaios Svolos2Richard K. Russell3Konstantinos Gerasimidis4Umer Zeeshan Ijaz5Human Nutrition, School of Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UKInfrastructure and Environment Research Division, School of Engineering, University of Glasgow, Glasgow, UKHuman Nutrition, School of Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UKDepartment of Paediatric Gastroenterology, Hepatology and Nutrition, Royal Hospital for Children, Glasgow, UKHuman Nutrition, School of Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UKInfrastructure and Environment Research Division, School of Engineering, University of Glasgow, Glasgow, UKWith an unprecedented growth in the biomedical literature, keeping up to date with the new developments presents an immense challenge. Publications are often studied in isolation of the established literature, with interpretation being subjective and often introducing human bias. With ontology-driven annotation of biomedical data gaining popularity in recent years and online databases offering metatags with rich textual information, it is now possible to automatically text-mine ontological terms and complement the laborious task of manual management, interpretation, and analysis of the accumulated literature with downstream statistical analysis. In this paper, we have formulated an automated workflow through which we have identified ontological information, including nutrition-related terms in PubMed abstracts (from 1991 to 2016) for two main types of Inflammatory Bowel Diseases: Crohn’s Disease and Ulcerative Colitis; and two other gastrointestinal (GI) diseases, namely, Coeliac Disease and Irritable Bowel Syndrome. Our analysis reveals unique clustering patterns as well as spatial and temporal trends inherent to the considered GI diseases in terms of literature that has been accumulated so far. Although automated interpretation cannot replace human judgement, the developed workflow shows promising results and can be a useful tool in systematic literature reviews. The workflow is available at https://github.com/KociOrges/pytag.https://peerj.com/articles/5047.pdfOntologyInflammatory bowel diseaseText miningEcological statisticsHuman nutritionOrdination |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Orges Koci Michael Logan Vaios Svolos Richard K. Russell Konstantinos Gerasimidis Umer Zeeshan Ijaz |
spellingShingle |
Orges Koci Michael Logan Vaios Svolos Richard K. Russell Konstantinos Gerasimidis Umer Zeeshan Ijaz An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights PeerJ Ontology Inflammatory bowel disease Text mining Ecological statistics Human nutrition Ordination |
author_facet |
Orges Koci Michael Logan Vaios Svolos Richard K. Russell Konstantinos Gerasimidis Umer Zeeshan Ijaz |
author_sort |
Orges Koci |
title |
An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights |
title_short |
An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights |
title_full |
An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights |
title_fullStr |
An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights |
title_full_unstemmed |
An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights |
title_sort |
automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights |
publisher |
PeerJ Inc. |
series |
PeerJ |
issn |
2167-8359 |
publishDate |
2018-07-01 |
description |
With an unprecedented growth in the biomedical literature, keeping up to date with the new developments presents an immense challenge. Publications are often studied in isolation of the established literature, with interpretation being subjective and often introducing human bias. With ontology-driven annotation of biomedical data gaining popularity in recent years and online databases offering metatags with rich textual information, it is now possible to automatically text-mine ontological terms and complement the laborious task of manual management, interpretation, and analysis of the accumulated literature with downstream statistical analysis. In this paper, we have formulated an automated workflow through which we have identified ontological information, including nutrition-related terms in PubMed abstracts (from 1991 to 2016) for two main types of Inflammatory Bowel Diseases: Crohn’s Disease and Ulcerative Colitis; and two other gastrointestinal (GI) diseases, namely, Coeliac Disease and Irritable Bowel Syndrome. Our analysis reveals unique clustering patterns as well as spatial and temporal trends inherent to the considered GI diseases in terms of literature that has been accumulated so far. Although automated interpretation cannot replace human judgement, the developed workflow shows promising results and can be a useful tool in systematic literature reviews. The workflow is available at https://github.com/KociOrges/pytag. |
topic |
Ontology Inflammatory bowel disease Text mining Ecological statistics Human nutrition Ordination |
url |
https://peerj.com/articles/5047.pdf |
work_keys_str_mv |
AT orgeskoci anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT michaellogan anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT vaiossvolos anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT richardkrussell anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT konstantinosgerasimidis anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT umerzeeshanijaz anautomatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT orgeskoci automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT michaellogan automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT vaiossvolos automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT richardkrussell automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT konstantinosgerasimidis automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights AT umerzeeshanijaz automatedidentificationandanalysisofontologicaltermsingastrointestinaldiseasesandnutritionrelatedliteratureprovidesusefulinsights |
_version_ |
1716800695981572096 |