Derive Predictive Scoring by Nomogram of the Feature Selection from Logistic Regression Model

碩士 === 國立臺灣大學 === 醫學工程學研究所 === 100 === Most information system had been developed for many years, which various kinds of data are stored in the database system of computer. As various kinds of predictive model are booming. The applications of predictive model in sociology, economics and clinical...

Full description

Bibliographic Details
Main Authors:	Min-Yen Lin, 林旻延
Other Authors:	翁昭旼教授
Format:	Others
Language:	zh-TW
Published:	2012
Online Access:	http://ndltd.ncl.edu.tw/handle/77194049536789773794

id	ndltd-TW-100NTU05530040
record_format	oai_dc
spelling	ndltd-TW-100NTU055300402015-10-13T21:50:18Z http://ndltd.ncl.edu.tw/handle/77194049536789773794 Derive Predictive Scoring by Nomogram of the Feature Selection from Logistic Regression Model 利用邏輯斯迴歸模型進行特徵選擇延伸諾模圖以獲得預測計分 Min-Yen Lin 林旻延碩士國立臺灣大學醫學工程學研究所 100 Most information system had been developed for many years, which various kinds of data are stored in the database system of computer. As various kinds of predictive model are booming. The applications of predictive model in sociology, economics and clinical medicine are commonly used. In clinical medicine, to estimate the patient''s risk of death or the probability of recurrence for particular disease is very important. The process of model construction may explore some unexpected factors which had been ignored. If the researchers want to retrieve some important information which had been hidden in mass data, particular methods for computing and analysis are need. For this purpose, several data mining algorithms are created. In the process of data mining, in order to enhancing generalization capability, speeding up learning process and improving model interpretability, feature selection becomes more and more in attention. For high dimensions database, feature selection is important way to reduce the redundant variables before processing the data mining. Taking above arguments, the main purpose of present research is to derive the predictive scoring function for patients’ outcome from nomogram which is a visualization tool extended from logistic regression model. We used the database of “CT finding and cranial surgery report” from NTUH as material, and apply several methods to obtain the feature subset : first, we use likelihood ratio to provide a summary of how many times more (or less) likely patients with the good outcome are to have that particular result than with poor outcome for each feature variables. After dropping the irrelevant features. Second, we apply the c-index selection to repeatedly calculate the c-index of the logistic regression model which drop a different feature variable a time, and select the subset with maximum c-index of the model. In this thesis, we also compare it with other common variable selection criterion of logistic regression model, and validate the predictive scoring by clinicians with expert knowledge. 翁昭旼教授 2012 學位論文 ; thesis 43 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立臺灣大學 === 醫學工程學研究所 === 100 === Most information system had been developed for many years, which various kinds of data are stored in the database system of computer. As various kinds of predictive model are booming. The applications of predictive model in sociology, economics and clinical medicine are commonly used. In clinical medicine, to estimate the patient''s risk of death or the probability of recurrence for particular disease is very important. The process of model construction may explore some unexpected factors which had been ignored. If the researchers want to retrieve some important information which had been hidden in mass data, particular methods for computing and analysis are need. For this purpose, several data mining algorithms are created. In the process of data mining, in order to enhancing generalization capability, speeding up learning process and improving model interpretability, feature selection becomes more and more in attention. For high dimensions database, feature selection is important way to reduce the redundant variables before processing the data mining. Taking above arguments, the main purpose of present research is to derive the predictive scoring function for patients’ outcome from nomogram which is a visualization tool extended from logistic regression model. We used the database of “CT finding and cranial surgery report” from NTUH as material, and apply several methods to obtain the feature subset : first, we use likelihood ratio to provide a summary of how many times more (or less) likely patients with the good outcome are to have that particular result than with poor outcome for each feature variables. After dropping the irrelevant features. Second, we apply the c-index selection to repeatedly calculate the c-index of the logistic regression model which drop a different feature variable a time, and select the subset with maximum c-index of the model. In this thesis, we also compare it with other common variable selection criterion of logistic regression model, and validate the predictive scoring by clinicians with expert knowledge.
author2	翁昭旼教授
author_facet	翁昭旼教授 Min-Yen Lin 林旻延
author	Min-Yen Lin 林旻延
spellingShingle	Min-Yen Lin 林旻延 Derive Predictive Scoring by Nomogram of the Feature Selection from Logistic Regression Model
author_sort	Min-Yen Lin
title	Derive Predictive Scoring by Nomogram of the Feature Selection from Logistic Regression Model
title_short	Derive Predictive Scoring by Nomogram of the Feature Selection from Logistic Regression Model
title_full	Derive Predictive Scoring by Nomogram of the Feature Selection from Logistic Regression Model
title_fullStr	Derive Predictive Scoring by Nomogram of the Feature Selection from Logistic Regression Model
title_full_unstemmed	Derive Predictive Scoring by Nomogram of the Feature Selection from Logistic Regression Model
title_sort	derive predictive scoring by nomogram of the feature selection from logistic regression model
publishDate	2012
url	http://ndltd.ncl.edu.tw/handle/77194049536789773794
work_keys_str_mv	AT minyenlin derivepredictivescoringbynomogramofthefeatureselectionfromlogisticregressionmodel AT línmínyán derivepredictivescoringbynomogramofthefeatureselectionfromlogisticregressionmodel AT minyenlin lìyòngluójísīhuíguīmóxíngjìnxíngtèzhēngxuǎnzéyánshēnnuòmótúyǐhuòdéyùcèjìfēn AT línmínyán lìyòngluójísīhuíguīmóxíngjìnxíngtèzhēngxuǎnzéyánshēnnuòmótúyǐhuòdéyùcèjìfēn
_version_	1718069089649295360

Derive Predictive Scoring by Nomogram of the Feature Selection from Logistic Regression Model

Similar Items