Using Free Text From Medical Notes To Enrich a Longitudinal Cohort Study

Introduction The Cleft Collective Cohort study is the world's largest multidisciplinary cleft lip/palate research programme. Despite being one of the most common birth anomalies, the causes of clefting are unknown. Treatment involves a considerable burden of care from birth onwards, together w...

Full description

Bibliographic Details
Main Authors: Kerry Humphries, Amy Davies, Yvonne Wren, Jonathan Sandy
Format: Article
Language:English
Published: Swansea University 2018-08-01
Series:International Journal of Population Data Science
Online Access:https://ijpds.org/article/view/653
id doaj-5045f2e9eb8a485890cdbfdef64f0166
record_format Article
spelling doaj-5045f2e9eb8a485890cdbfdef64f01662020-11-24T21:47:07ZengSwansea UniversityInternational Journal of Population Data Science2399-49082018-08-013410.23889/ijpds.v3i4.653Using Free Text From Medical Notes To Enrich a Longitudinal Cohort StudyKerry Humphries0Amy Davies1Yvonne Wren2Jonathan Sandy3University of BristolUniversity of BristolUniversity of BristolUniversity of Bristol Introduction The Cleft Collective Cohort study is the world's largest multidisciplinary cleft lip/palate research programme. Despite being one of the most common birth anomalies, the causes of clefting are unknown. Treatment involves a considerable burden of care from birth onwards, together with a variety of social and psychological challenges. Objectives and Approach Our aim is to create the infrastructure and resources necessary to gain important new knowledge that will advance our understanding of the causes of cleft lip/palate, inform treatment and improve the lives of people born with cleft; data linkage is a key aspect of this. There are challenges associated with linking to multiple data sources (NHS Digital, Cleft National Registry). However, using the consent obtained, we are also able to link directly to the participant’s medical notes held by their NHS cleft team; enabling us to access a variety of data, including free text. Results The study already collects social and demographic data via questionnaires and genetic data from biological samples. Data linkage enriches this but also enables us to validate and address missing data problems. However, linkage to external sources brings many challenges, including, governance, costs, access issues. By gaining direct access to cleft team medical notes these issues are significantly reduced and provide us with rich phenotype data that cannot be obtained elsewhere, for example, via ‘read codes’ within electronic medical records. By tailoring our own data collection tool we can collect specific cleft data to enhance the resource and allow for subtype analyses. This process can be repeated throughout the duration of this longitudinal study for subsequent data. Conclusion/Implications Data linkage is a valuable resource but comes with many challenges. One route to overcome many of these issues is by accessing free text data directly from participants medical notes. The richness of these data allows for more in depth phenotypic analyses. https://ijpds.org/article/view/653
collection DOAJ
language English
format Article
sources DOAJ
author Kerry Humphries
Amy Davies
Yvonne Wren
Jonathan Sandy
spellingShingle Kerry Humphries
Amy Davies
Yvonne Wren
Jonathan Sandy
Using Free Text From Medical Notes To Enrich a Longitudinal Cohort Study
International Journal of Population Data Science
author_facet Kerry Humphries
Amy Davies
Yvonne Wren
Jonathan Sandy
author_sort Kerry Humphries
title Using Free Text From Medical Notes To Enrich a Longitudinal Cohort Study
title_short Using Free Text From Medical Notes To Enrich a Longitudinal Cohort Study
title_full Using Free Text From Medical Notes To Enrich a Longitudinal Cohort Study
title_fullStr Using Free Text From Medical Notes To Enrich a Longitudinal Cohort Study
title_full_unstemmed Using Free Text From Medical Notes To Enrich a Longitudinal Cohort Study
title_sort using free text from medical notes to enrich a longitudinal cohort study
publisher Swansea University
series International Journal of Population Data Science
issn 2399-4908
publishDate 2018-08-01
description Introduction The Cleft Collective Cohort study is the world's largest multidisciplinary cleft lip/palate research programme. Despite being one of the most common birth anomalies, the causes of clefting are unknown. Treatment involves a considerable burden of care from birth onwards, together with a variety of social and psychological challenges. Objectives and Approach Our aim is to create the infrastructure and resources necessary to gain important new knowledge that will advance our understanding of the causes of cleft lip/palate, inform treatment and improve the lives of people born with cleft; data linkage is a key aspect of this. There are challenges associated with linking to multiple data sources (NHS Digital, Cleft National Registry). However, using the consent obtained, we are also able to link directly to the participant’s medical notes held by their NHS cleft team; enabling us to access a variety of data, including free text. Results The study already collects social and demographic data via questionnaires and genetic data from biological samples. Data linkage enriches this but also enables us to validate and address missing data problems. However, linkage to external sources brings many challenges, including, governance, costs, access issues. By gaining direct access to cleft team medical notes these issues are significantly reduced and provide us with rich phenotype data that cannot be obtained elsewhere, for example, via ‘read codes’ within electronic medical records. By tailoring our own data collection tool we can collect specific cleft data to enhance the resource and allow for subtype analyses. This process can be repeated throughout the duration of this longitudinal study for subsequent data. Conclusion/Implications Data linkage is a valuable resource but comes with many challenges. One route to overcome many of these issues is by accessing free text data directly from participants medical notes. The richness of these data allows for more in depth phenotypic analyses.
url https://ijpds.org/article/view/653
work_keys_str_mv AT kerryhumphries usingfreetextfrommedicalnotestoenrichalongitudinalcohortstudy
AT amydavies usingfreetextfrommedicalnotestoenrichalongitudinalcohortstudy
AT yvonnewren usingfreetextfrommedicalnotestoenrichalongitudinalcohortstudy
AT jonathansandy usingfreetextfrommedicalnotestoenrichalongitudinalcohortstudy
_version_ 1725899248251174912