Application of openEHR archetypes to automate data quality rules for electronic health records: a case study

Abstract Background Ensuring data is of appropriate quality is essential for the secondary use of electronic health records (EHRs) in research and clinical decision support. An effective method of data quality assessment (DQA) is automating data quality rules (DQRs) to replace the time-consuming, la...

Full description

Bibliographic Details
Main Authors: Qi Tian, Zhexi Han, Ping Yu, Jiye An, Xudong Lu, Huilong Duan
Format: Article
Language:English
Published: BMC 2021-04-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:https://doi.org/10.1186/s12911-021-01481-2
id doaj-c2c7e42a1317408c85a7f21ac458213b
record_format Article
spelling doaj-c2c7e42a1317408c85a7f21ac458213b2021-04-04T11:39:28ZengBMCBMC Medical Informatics and Decision Making1472-69472021-04-0121111110.1186/s12911-021-01481-2Application of openEHR archetypes to automate data quality rules for electronic health records: a case studyQi Tian0Zhexi Han1Ping Yu2Jiye An3Xudong Lu4Huilong Duan5College of Biomedical Engineering and Instrument Science, Zhejiang UniversityCollege of Biomedical Engineering and Instrument Science, Zhejiang UniversityCentre for Digital Transformation, School of Computing and Information Technology, University of WollongongCollege of Biomedical Engineering and Instrument Science, Zhejiang UniversityCollege of Biomedical Engineering and Instrument Science, Zhejiang UniversityCollege of Biomedical Engineering and Instrument Science, Zhejiang UniversityAbstract Background Ensuring data is of appropriate quality is essential for the secondary use of electronic health records (EHRs) in research and clinical decision support. An effective method of data quality assessment (DQA) is automating data quality rules (DQRs) to replace the time-consuming, labor-intensive manual process of creating DQRs, which is difficult to guarantee standard and comparable DQA results. This paper presents a case study of automatically creating DQRs based on openEHR archetypes in a Chinese hospital to investigate the feasibility and challenges of automating DQA for EHR data. Methods The clinical data repository (CDR) of the Shanxi Dayi Hospital is an archetype-based relational database. Four steps are undertaken to automatically create DQRs in this CDR database. First, the keywords and features relevant to DQA of archetypes were identified via mapping them to a well-established DQA framework, Kahn’s DQA framework. Second, the templates of DQRs in correspondence with these identified keywords and features were created in the structured query language (SQL). Third, the quality constraints were retrieved from archetypes. Fourth, these quality constraints were automatically converted to DQRs according to the pre-designed templates and mapping relationships of archetypes and data tables. We utilized the archetypes of the CDR to automatically create DQRs to meet quality requirements of the Chinese Application-Level Ranking Standard for EHR Systems (CARSES) and evaluated their coverage by comparing with expert-created DQRs. Results We used 27 archetypes to automatically create 359 DQRs. 319 of them are in agreement with the expert-created DQRs, covering 84.97% (311/366) requirements of the CARSES. The auto-created DQRs had varying levels of coverage of the four quality domains mandated by the CARSES: 100% (45/45) of consistency, 98.11% (208/212) of completeness, 54.02% (57/87) of conformity, and 50% (11/22) of timeliness. Conclusion It’s feasible to create DQRs automatically based on openEHR archetypes. This study evaluated the coverage of the auto-created DQRs to a typical DQA task of Chinese hospitals, the CARSES. The challenges of automating DQR creation were identified, such as quality requirements based on semantic, and complex constraints of multiple elements. This research can enlighten the exploration of DQR auto-creation and contribute to the automatic DQA.https://doi.org/10.1186/s12911-021-01481-2Data quality assessmentData quality ruleOpenEHR archetypesAutomaticSecondary use of EHR
collection DOAJ
language English
format Article
sources DOAJ
author Qi Tian
Zhexi Han
Ping Yu
Jiye An
Xudong Lu
Huilong Duan
spellingShingle Qi Tian
Zhexi Han
Ping Yu
Jiye An
Xudong Lu
Huilong Duan
Application of openEHR archetypes to automate data quality rules for electronic health records: a case study
BMC Medical Informatics and Decision Making
Data quality assessment
Data quality rule
OpenEHR archetypes
Automatic
Secondary use of EHR
author_facet Qi Tian
Zhexi Han
Ping Yu
Jiye An
Xudong Lu
Huilong Duan
author_sort Qi Tian
title Application of openEHR archetypes to automate data quality rules for electronic health records: a case study
title_short Application of openEHR archetypes to automate data quality rules for electronic health records: a case study
title_full Application of openEHR archetypes to automate data quality rules for electronic health records: a case study
title_fullStr Application of openEHR archetypes to automate data quality rules for electronic health records: a case study
title_full_unstemmed Application of openEHR archetypes to automate data quality rules for electronic health records: a case study
title_sort application of openehr archetypes to automate data quality rules for electronic health records: a case study
publisher BMC
series BMC Medical Informatics and Decision Making
issn 1472-6947
publishDate 2021-04-01
description Abstract Background Ensuring data is of appropriate quality is essential for the secondary use of electronic health records (EHRs) in research and clinical decision support. An effective method of data quality assessment (DQA) is automating data quality rules (DQRs) to replace the time-consuming, labor-intensive manual process of creating DQRs, which is difficult to guarantee standard and comparable DQA results. This paper presents a case study of automatically creating DQRs based on openEHR archetypes in a Chinese hospital to investigate the feasibility and challenges of automating DQA for EHR data. Methods The clinical data repository (CDR) of the Shanxi Dayi Hospital is an archetype-based relational database. Four steps are undertaken to automatically create DQRs in this CDR database. First, the keywords and features relevant to DQA of archetypes were identified via mapping them to a well-established DQA framework, Kahn’s DQA framework. Second, the templates of DQRs in correspondence with these identified keywords and features were created in the structured query language (SQL). Third, the quality constraints were retrieved from archetypes. Fourth, these quality constraints were automatically converted to DQRs according to the pre-designed templates and mapping relationships of archetypes and data tables. We utilized the archetypes of the CDR to automatically create DQRs to meet quality requirements of the Chinese Application-Level Ranking Standard for EHR Systems (CARSES) and evaluated their coverage by comparing with expert-created DQRs. Results We used 27 archetypes to automatically create 359 DQRs. 319 of them are in agreement with the expert-created DQRs, covering 84.97% (311/366) requirements of the CARSES. The auto-created DQRs had varying levels of coverage of the four quality domains mandated by the CARSES: 100% (45/45) of consistency, 98.11% (208/212) of completeness, 54.02% (57/87) of conformity, and 50% (11/22) of timeliness. Conclusion It’s feasible to create DQRs automatically based on openEHR archetypes. This study evaluated the coverage of the auto-created DQRs to a typical DQA task of Chinese hospitals, the CARSES. The challenges of automating DQR creation were identified, such as quality requirements based on semantic, and complex constraints of multiple elements. This research can enlighten the exploration of DQR auto-creation and contribute to the automatic DQA.
topic Data quality assessment
Data quality rule
OpenEHR archetypes
Automatic
Secondary use of EHR
url https://doi.org/10.1186/s12911-021-01481-2
work_keys_str_mv AT qitian applicationofopenehrarchetypestoautomatedataqualityrulesforelectronichealthrecordsacasestudy
AT zhexihan applicationofopenehrarchetypestoautomatedataqualityrulesforelectronichealthrecordsacasestudy
AT pingyu applicationofopenehrarchetypestoautomatedataqualityrulesforelectronichealthrecordsacasestudy
AT jiyean applicationofopenehrarchetypestoautomatedataqualityrulesforelectronichealthrecordsacasestudy
AT xudonglu applicationofopenehrarchetypestoautomatedataqualityrulesforelectronichealthrecordsacasestudy
AT huilongduan applicationofopenehrarchetypestoautomatedataqualityrulesforelectronichealthrecordsacasestudy
_version_ 1721542545543004160