Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse

Abstract Background Medication trend studies show the changes of medication over the years and may be replicated using a clinical Data Warehouse (CDW). Even nowadays, a lot of the patient information, like medication data, in the EHR is stored in the format of free text. As the conventional approach...

Full description

Bibliographic Details
Main Authors: Georg Dietrich, Jonathan Krebs, Leon Liman, Georg Fette, Maximilian Ertl, Mathias Kaspar, Stefan Störk, Frank Puppe
Format: Article
Language:English
Published: BMC 2019-01-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12911-018-0729-0
id doaj-0e535d9cb0054e7b9b39d40ed4c9639b
record_format Article
spelling doaj-0e535d9cb0054e7b9b39d40ed4c9639b2020-11-25T02:03:46ZengBMCBMC Medical Informatics and Decision Making1472-69472019-01-0119112110.1186/s12911-018-0729-0Replicating medication trend studies using ad hoc information extraction in a clinical data warehouseGeorg Dietrich0Jonathan Krebs1Leon Liman2Georg Fette3Maximilian Ertl4Mathias Kaspar5Stefan Störk6Frank Puppe7Computer Science, Unviversity of WürzburgComputer Science, Unviversity of WürzburgComputer Science, Unviversity of WürzburgComputer Science, Unviversity of WürzburgService Center Medical Informatics, University Hospital of WürzburgComprehensive Heart Failure Center, University and University Hospital Hospital of WürzburgComprehensive Heart Failure Center, University and University Hospital Hospital of WürzburgComputer Science, Unviversity of WürzburgAbstract Background Medication trend studies show the changes of medication over the years and may be replicated using a clinical Data Warehouse (CDW). Even nowadays, a lot of the patient information, like medication data, in the EHR is stored in the format of free text. As the conventional approach of information extraction (IE) demands a high developmental effort, we used ad hoc IE instead. This technique queries information and extracts it on the fly from texts contained in the CDW. Methods We present a generalizable approach of ad hoc IE for pharmacotherapy (medications and their daily dosage) presented in hospital discharge letters. We added import and query features to the CDW system, like error tolerant queries to deal with misspellings and proximity search for the extraction of the daily dosage. During the data integration process in the CDW, negated, historical and non-patient context data are filtered. For the replication studies, we used a drug list grouped by ATC (Anatomical Therapeutic Chemical Classification System) codes as input for queries to the CDW. Results We achieve an F1 score of 0.983 (precision 0.997, recall 0.970) for extracting medication from discharge letters and an F1 score of 0.974 (precision 0.977, recall 0.972) for extracting the dosage. We replicated three published medical trend studies for hypertension, atrial fibrillation and chronic kidney disease. Overall, 93% of the main findings could be replicated, 68% of sub-findings, and 75% of all findings. One study could be completely replicated with all main and sub-findings. Conclusion A novel approach for ad hoc IE is presented. It is very suitable for basic medical texts like discharge letters and finding reports. Ad hoc IE is by definition more limited than conventional IE and does not claim to replace it, but it substantially exceeds the search capabilities of many CDWs and it is convenient to conduct replication studies fast and with high quality.http://link.springer.com/article/10.1186/s12911-018-0729-0Data warehouseMedication extractionInformation extraction
collection DOAJ
language English
format Article
sources DOAJ
author Georg Dietrich
Jonathan Krebs
Leon Liman
Georg Fette
Maximilian Ertl
Mathias Kaspar
Stefan Störk
Frank Puppe
spellingShingle Georg Dietrich
Jonathan Krebs
Leon Liman
Georg Fette
Maximilian Ertl
Mathias Kaspar
Stefan Störk
Frank Puppe
Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
BMC Medical Informatics and Decision Making
Data warehouse
Medication extraction
Information extraction
author_facet Georg Dietrich
Jonathan Krebs
Leon Liman
Georg Fette
Maximilian Ertl
Mathias Kaspar
Stefan Störk
Frank Puppe
author_sort Georg Dietrich
title Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
title_short Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
title_full Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
title_fullStr Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
title_full_unstemmed Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
title_sort replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
publisher BMC
series BMC Medical Informatics and Decision Making
issn 1472-6947
publishDate 2019-01-01
description Abstract Background Medication trend studies show the changes of medication over the years and may be replicated using a clinical Data Warehouse (CDW). Even nowadays, a lot of the patient information, like medication data, in the EHR is stored in the format of free text. As the conventional approach of information extraction (IE) demands a high developmental effort, we used ad hoc IE instead. This technique queries information and extracts it on the fly from texts contained in the CDW. Methods We present a generalizable approach of ad hoc IE for pharmacotherapy (medications and their daily dosage) presented in hospital discharge letters. We added import and query features to the CDW system, like error tolerant queries to deal with misspellings and proximity search for the extraction of the daily dosage. During the data integration process in the CDW, negated, historical and non-patient context data are filtered. For the replication studies, we used a drug list grouped by ATC (Anatomical Therapeutic Chemical Classification System) codes as input for queries to the CDW. Results We achieve an F1 score of 0.983 (precision 0.997, recall 0.970) for extracting medication from discharge letters and an F1 score of 0.974 (precision 0.977, recall 0.972) for extracting the dosage. We replicated three published medical trend studies for hypertension, atrial fibrillation and chronic kidney disease. Overall, 93% of the main findings could be replicated, 68% of sub-findings, and 75% of all findings. One study could be completely replicated with all main and sub-findings. Conclusion A novel approach for ad hoc IE is presented. It is very suitable for basic medical texts like discharge letters and finding reports. Ad hoc IE is by definition more limited than conventional IE and does not claim to replace it, but it substantially exceeds the search capabilities of many CDWs and it is convenient to conduct replication studies fast and with high quality.
topic Data warehouse
Medication extraction
Information extraction
url http://link.springer.com/article/10.1186/s12911-018-0729-0
work_keys_str_mv AT georgdietrich replicatingmedicationtrendstudiesusingadhocinformationextractioninaclinicaldatawarehouse
AT jonathankrebs replicatingmedicationtrendstudiesusingadhocinformationextractioninaclinicaldatawarehouse
AT leonliman replicatingmedicationtrendstudiesusingadhocinformationextractioninaclinicaldatawarehouse
AT georgfette replicatingmedicationtrendstudiesusingadhocinformationextractioninaclinicaldatawarehouse
AT maximilianertl replicatingmedicationtrendstudiesusingadhocinformationextractioninaclinicaldatawarehouse
AT mathiaskaspar replicatingmedicationtrendstudiesusingadhocinformationextractioninaclinicaldatawarehouse
AT stefanstork replicatingmedicationtrendstudiesusingadhocinformationextractioninaclinicaldatawarehouse
AT frankpuppe replicatingmedicationtrendstudiesusingadhocinformationextractioninaclinicaldatawarehouse
_version_ 1724945922142502912