Machine Learning and Rule-based Approaches to Assertion Classification

Objectives: The authors study two approaches to assertion classification. One of these approaches, Extended NegEx (ENegEx), extends the rule-based NegEx algorithm to cover alter-association assertions; the other, Statistical Assertion Classifier (StAC), presents a machine learning solution to asserti...


Bibliographic Details
Main Authors: Uzuner, Ozlem (Contributor), Zhang, Xiaoran (Contributor), Sibanda, Tawanda (Contributor)
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor)
Format: Article
Language: English
Published: BMJ Publishing Group, 2010-03-09T21:43:46Z.
Online Access: Get fulltext (http://hdl.handle.net/1721.1/52450)
LEADER 02325 am a22002293u 4500
001 52450
042 |a dc 
100 1 0 |a Uzuner, Ozlem  |e author 
100 1 0 |a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory  |e contributor 
100 1 0 |a Uzuner, Ozlem  |e contributor 
100 1 0 |a Zhang, Xiaoran  |e contributor 
100 1 0 |a Sibanda, Tawanda  |e contributor 
700 1 0 |a Zhang, Xiaoran  |e author 
700 1 0 |a Sibanda, Tawanda  |e author 
245 0 0 |a Machine Learning and Rule-based Approaches to Assertion Classification 
260 |b BMJ Publishing Group,   |c 2010-03-09T21:43:46Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/52450 
520 |a Objectives: The authors study two approaches to assertion classification. One of these approaches, Extended NegEx (ENegEx), extends the rule-based NegEx algorithm to cover alter-association assertions; the other, Statistical Assertion Classifier (StAC), presents a machine learning solution to assertion classification. Design: For each mention of each medical problem, both approaches determine whether the problem, as asserted by the context of that mention, is present, absent, or uncertain in the patient, or associated with someone other than the patient. The authors use these two systems to (1) extend negation and uncertainty extraction to recognition of alter-association assertions, (2) determine the contribution of lexical and syntactic context to assertion classification, and (3) test if a machine learning approach to assertion classification can be as generally applicable and useful as its rule-based counterparts. Measurements: The authors evaluated assertion classification approaches with precision, recall, and F-measure. Results: The ENegEx algorithm is a general algorithm that can be directly applied to new corpora. Despite being based on machine learning, StAC can also be applied out-of-the-box to new corpora and achieve similar generality. Conclusion: The StAC models that are developed on discharge summaries can be successfully applied to radiology reports. These models benefit the most from words found in the ±4-word window of the target and can outperform ENegEx. 
546 |a en_US 
655 7 |a Article 
773 |t Journal of the American Medical Informatics Association