Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health Knowledge Graph

© 2019 The Authors. Increasingly large electronic health records (EHRs) provide an opportunity to algorithmi-cally learn medical knowledge. In one prominent example, a causal health knowledge graph could learn relationships between diseases and symptoms and then serve as a diagnostic tool to be refi...

Full description

Bibliographic Details
Main Authors: Chen, Irene Y (Author), Agrawal, Monica (Author), Horng, Steven (Author), Sontag, David (Author)
Format: Article
Language:English
Published: World Scientific Pub Co Pte Lt, 2021-11-08T17:13:57Z.
Subjects:
Online Access:Get fulltext
LEADER 01819 am a22001933u 4500
001 137717
042 |a dc 
100 1 0 |a Chen, Irene Y  |e author 
700 1 0 |a Agrawal, Monica  |e author 
700 1 0 |a Horng, Steven  |e author 
700 1 0 |a Sontag, David  |e author 
245 0 0 |a Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health Knowledge Graph 
260 |b World Scientific Pub Co Pte Lt,   |c 2021-11-08T17:13:57Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/137717 
520 |a © 2019 The Authors. Increasingly large electronic health records (EHRs) provide an opportunity to algorithmi-cally learn medical knowledge. In one prominent example, a causal health knowledge graph could learn relationships between diseases and symptoms and then serve as a diagnostic tool to be refined with additional clinical input. Prior research has demonstrated the ability to construct such a graph from over 270,000 emergency department patient visits. In this work, we describe methods to evaluate a health knowledge graph for robustness. Moving beyond precision and recall, we analyze for which diseases and for which patients the graph is most accurate. We identify sample size and unmeasured confounders as major sources of error in the health knowledge graph. We introduce a method to leverage non-linear functions in building the causal graph to better understand existing model assumptions. Finally, to assess model generalizability, we extend to a larger set of complete patient visits within a hospital system. We conclude with a discussion on how to robustly extract medical knowl-edge from EHRs. Supplementary material: http://clinicalml.org/papers/ChenEtAl PSB20 suppl.pdf. 
546 |a en 
655 7 |a Article 
773 |t 10.1142/9789811215636_0003 
773 |t Pacific Symposium on Biocomputing