Ellipsis Handling in A Medical Diagnosis Dialog System

碩士 === 國立臺灣海洋大學 === 資訊工程學系 === 100 === Computerized virtual patient (CVP) is a domain specific dialog system. In this system, we handle ellipsis in medical diagnosis. Virtual patient is an important teaching method for medical college’s student. It can help student to learn how to judge patient’s co...

Full description

Bibliographic Details
Main Author: 鮑建威
Other Authors: Chuan-Jie Lin
Format: Others
Language:zh-TW
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/50928352094819149040
id ndltd-TW-100NTOU5394004
record_format oai_dc
spelling ndltd-TW-100NTOU53940042015-10-13T22:01:07Z http://ndltd.ncl.edu.tw/handle/50928352094819149040 Ellipsis Handling in A Medical Diagnosis Dialog System 醫療問診對話系統中省略現象之處理 鮑建威 碩士 國立臺灣海洋大學 資訊工程學系 100 Computerized virtual patient (CVP) is a domain specific dialog system. In this system, we handle ellipsis in medical diagnosis. Virtual patient is an important teaching method for medical college’s student. It can help student to learn how to judge patient’s condition from medical diagnosis. CVP need to resolve oral phenomenon something like our goal ellipsis. If we don’t handle ellipsis which is not easy to find corresponding problems and answers in the standard problem set of teaching text. Ellipsis handling includes ellipsis detection, type classification and recovery. There are many domain specific dialog system, but no one similar ours. Ellipses in our thesis are classified according to omitted element. Medical diagnosis template saves necessary information from dialogs. Our system is a hybrid system, rule-based module and machine learning. Rule-based module uses information from template to detect, classify and recover ellipsis. If some ellipsis can’t be detected by rule-based module, machine learning will implement. We learn a classifier for detecting ellipsis. Features include lexical surface, word information, POS, verb tense, punctuation and special terms from observation. These features also can be combined. After detection, rule-based module classifies and recovers ellipsis. The training and testing data are from virtual patient’s teaching record and medical diagnosis record in the hospital. Our machine learning method is Condition Random Field(CRF). Training is performed in 10-fold-cross-validation. In training, when using best features and feature combination, ellipsis detection classifier with a f-value of 86.73%, then recover by rule-based module with a f-value of 78.95%. Using information in diagnosis template to detect and recover ellipsis with a f-value of 82.58%. Total ellipsis system with a f-value of 85.54%. In testing, ellipsis system with a recall of 77.4%, a precision of 79.36% and a f-value of 78.35% Chuan-Jie Lin 林川傑 2012 學位論文 ; thesis 57 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣海洋大學 === 資訊工程學系 === 100 === Computerized virtual patient (CVP) is a domain specific dialog system. In this system, we handle ellipsis in medical diagnosis. Virtual patient is an important teaching method for medical college’s student. It can help student to learn how to judge patient’s condition from medical diagnosis. CVP need to resolve oral phenomenon something like our goal ellipsis. If we don’t handle ellipsis which is not easy to find corresponding problems and answers in the standard problem set of teaching text. Ellipsis handling includes ellipsis detection, type classification and recovery. There are many domain specific dialog system, but no one similar ours. Ellipses in our thesis are classified according to omitted element. Medical diagnosis template saves necessary information from dialogs. Our system is a hybrid system, rule-based module and machine learning. Rule-based module uses information from template to detect, classify and recover ellipsis. If some ellipsis can’t be detected by rule-based module, machine learning will implement. We learn a classifier for detecting ellipsis. Features include lexical surface, word information, POS, verb tense, punctuation and special terms from observation. These features also can be combined. After detection, rule-based module classifies and recovers ellipsis. The training and testing data are from virtual patient’s teaching record and medical diagnosis record in the hospital. Our machine learning method is Condition Random Field(CRF). Training is performed in 10-fold-cross-validation. In training, when using best features and feature combination, ellipsis detection classifier with a f-value of 86.73%, then recover by rule-based module with a f-value of 78.95%. Using information in diagnosis template to detect and recover ellipsis with a f-value of 82.58%. Total ellipsis system with a f-value of 85.54%. In testing, ellipsis system with a recall of 77.4%, a precision of 79.36% and a f-value of 78.35%
author2 Chuan-Jie Lin
author_facet Chuan-Jie Lin
鮑建威
author 鮑建威
spellingShingle 鮑建威
Ellipsis Handling in A Medical Diagnosis Dialog System
author_sort 鮑建威
title Ellipsis Handling in A Medical Diagnosis Dialog System
title_short Ellipsis Handling in A Medical Diagnosis Dialog System
title_full Ellipsis Handling in A Medical Diagnosis Dialog System
title_fullStr Ellipsis Handling in A Medical Diagnosis Dialog System
title_full_unstemmed Ellipsis Handling in A Medical Diagnosis Dialog System
title_sort ellipsis handling in a medical diagnosis dialog system
publishDate 2012
url http://ndltd.ncl.edu.tw/handle/50928352094819149040
work_keys_str_mv AT bàojiànwēi ellipsishandlinginamedicaldiagnosisdialogsystem
AT bàojiànwēi yīliáowènzhěnduìhuàxìtǒngzhōngshěnglüèxiànxiàngzhīchùlǐ
_version_ 1718071621129863168