Toward the Development of Data Governance Standards for Using Clinical Free-Text Data in Health Research: Position Paper

BackgroundClinical free-text data (eg, outpatient letters or nursing notes) represent a vast, untapped source of rich information that, if more accessible for research, would clarify and supplement information coded in structured data fields. Data usually need to be deidentif...

Full description

Bibliographic Details
Main Authors: Jones, Kerina H, Ford, Elizabeth M, Lea, Nathan, Griffiths, Lucy J, Hassan, Lamiece, Heys, Sharon, Squires, Emma, Nenadic, Goran
Format: Article
Language:English
Published: JMIR Publications 2020-06-01
Series:Journal of Medical Internet Research
Online Access:http://www.jmir.org/2020/6/e16760/
id doaj-2d2d899d0ca040d1891d93942e3d05ff
record_format Article
spelling doaj-2d2d899d0ca040d1891d93942e3d05ff2021-04-02T18:40:58ZengJMIR PublicationsJournal of Medical Internet Research1438-88712020-06-01226e1676010.2196/16760Toward the Development of Data Governance Standards for Using Clinical Free-Text Data in Health Research: Position PaperJones, Kerina HFord, Elizabeth MLea, NathanGriffiths, Lucy JHassan, LamieceHeys, SharonSquires, EmmaNenadic, Goran BackgroundClinical free-text data (eg, outpatient letters or nursing notes) represent a vast, untapped source of rich information that, if more accessible for research, would clarify and supplement information coded in structured data fields. Data usually need to be deidentified or anonymized before they can be reused for research, but there is a lack of established guidelines to govern effective deidentification and use of free-text information and avoid damaging data utility as a by-product. ObjectiveThis study aimed to develop recommendations for the creation of data governance standards to integrate with existing frameworks for personal data use, to enable free-text data to be used safely for research for patient and public benefit. MethodsWe outlined data protection legislation and regulations relating to the United Kingdom for context and conducted a rapid literature review and UK-based case studies to explore data governance models used in working with free-text data. We also engaged with stakeholders, including text-mining researchers and the general public, to explore perceived barriers and solutions in working with clinical free-text. ResultsWe proposed a set of recommendations, including the need for authoritative guidance on data governance for the reuse of free-text data, to ensure public transparency in data flows and uses, to treat deidentified free-text data as potentially identifiable with use limited to accredited data safe havens, and to commit to a culture of continuous improvement to understand the relationships between the efficacy of deidentification and reidentification risks, so this can be communicated to all stakeholders. ConclusionsBy drawing together the findings of a combination of activities, we present a position paper to contribute to the development of data governance standards for the reuse of clinical free-text data for secondary purposes. While working in accordance with existing data governance frameworks, there is a need for further work to take forward the recommendations we have proposed, with commitment and investment, to assure and expand the safe reuse of clinical free-text data for public benefit.http://www.jmir.org/2020/6/e16760/
collection DOAJ
language English
format Article
sources DOAJ
author Jones, Kerina H
Ford, Elizabeth M
Lea, Nathan
Griffiths, Lucy J
Hassan, Lamiece
Heys, Sharon
Squires, Emma
Nenadic, Goran
spellingShingle Jones, Kerina H
Ford, Elizabeth M
Lea, Nathan
Griffiths, Lucy J
Hassan, Lamiece
Heys, Sharon
Squires, Emma
Nenadic, Goran
Toward the Development of Data Governance Standards for Using Clinical Free-Text Data in Health Research: Position Paper
Journal of Medical Internet Research
author_facet Jones, Kerina H
Ford, Elizabeth M
Lea, Nathan
Griffiths, Lucy J
Hassan, Lamiece
Heys, Sharon
Squires, Emma
Nenadic, Goran
author_sort Jones, Kerina H
title Toward the Development of Data Governance Standards for Using Clinical Free-Text Data in Health Research: Position Paper
title_short Toward the Development of Data Governance Standards for Using Clinical Free-Text Data in Health Research: Position Paper
title_full Toward the Development of Data Governance Standards for Using Clinical Free-Text Data in Health Research: Position Paper
title_fullStr Toward the Development of Data Governance Standards for Using Clinical Free-Text Data in Health Research: Position Paper
title_full_unstemmed Toward the Development of Data Governance Standards for Using Clinical Free-Text Data in Health Research: Position Paper
title_sort toward the development of data governance standards for using clinical free-text data in health research: position paper
publisher JMIR Publications
series Journal of Medical Internet Research
issn 1438-8871
publishDate 2020-06-01
description BackgroundClinical free-text data (eg, outpatient letters or nursing notes) represent a vast, untapped source of rich information that, if more accessible for research, would clarify and supplement information coded in structured data fields. Data usually need to be deidentified or anonymized before they can be reused for research, but there is a lack of established guidelines to govern effective deidentification and use of free-text information and avoid damaging data utility as a by-product. ObjectiveThis study aimed to develop recommendations for the creation of data governance standards to integrate with existing frameworks for personal data use, to enable free-text data to be used safely for research for patient and public benefit. MethodsWe outlined data protection legislation and regulations relating to the United Kingdom for context and conducted a rapid literature review and UK-based case studies to explore data governance models used in working with free-text data. We also engaged with stakeholders, including text-mining researchers and the general public, to explore perceived barriers and solutions in working with clinical free-text. ResultsWe proposed a set of recommendations, including the need for authoritative guidance on data governance for the reuse of free-text data, to ensure public transparency in data flows and uses, to treat deidentified free-text data as potentially identifiable with use limited to accredited data safe havens, and to commit to a culture of continuous improvement to understand the relationships between the efficacy of deidentification and reidentification risks, so this can be communicated to all stakeholders. ConclusionsBy drawing together the findings of a combination of activities, we present a position paper to contribute to the development of data governance standards for the reuse of clinical free-text data for secondary purposes. While working in accordance with existing data governance frameworks, there is a need for further work to take forward the recommendations we have proposed, with commitment and investment, to assure and expand the safe reuse of clinical free-text data for public benefit.
url http://www.jmir.org/2020/6/e16760/
work_keys_str_mv AT joneskerinah towardthedevelopmentofdatagovernancestandardsforusingclinicalfreetextdatainhealthresearchpositionpaper
AT fordelizabethm towardthedevelopmentofdatagovernancestandardsforusingclinicalfreetextdatainhealthresearchpositionpaper
AT leanathan towardthedevelopmentofdatagovernancestandardsforusingclinicalfreetextdatainhealthresearchpositionpaper
AT griffithslucyj towardthedevelopmentofdatagovernancestandardsforusingclinicalfreetextdatainhealthresearchpositionpaper
AT hassanlamiece towardthedevelopmentofdatagovernancestandardsforusingclinicalfreetextdatainhealthresearchpositionpaper
AT heyssharon towardthedevelopmentofdatagovernancestandardsforusingclinicalfreetextdatainhealthresearchpositionpaper
AT squiresemma towardthedevelopmentofdatagovernancestandardsforusingclinicalfreetextdatainhealthresearchpositionpaper
AT nenadicgoran towardthedevelopmentofdatagovernancestandardsforusingclinicalfreetextdatainhealthresearchpositionpaper
_version_ 1721551131868397568