Missing treatment for Mass missing data-A case study on Propensity Score Matching Method

碩士 === 國立臺北大學 === 統計學系 === 102 === With more and more emphasis on handling missing data, missing data can not only be rounded up or replaced by mean, mode, instead, should be to select the appropriate imputation through the missing pattern. It will not only solve the affect by missing data, but al...

Full description

Bibliographic Details
Main Authors: CHEN,PO-CHANG, 陳柏彰
Other Authors: WANG, HUNG-LUNG
Format: Others
Language:zh-TW
Published: 2014
Online Access:http://ndltd.ncl.edu.tw/handle/dysenk
id ndltd-TW-102NTPU0337035
record_format oai_dc
spelling ndltd-TW-102NTPU03370352019-05-15T21:23:53Z http://ndltd.ncl.edu.tw/handle/dysenk Missing treatment for Mass missing data-A case study on Propensity Score Matching Method 大量資料遺漏下缺失處理方法之研究-以傾向分數配對法為例 CHEN,PO-CHANG 陳柏彰 碩士 國立臺北大學 統計學系 102 With more and more emphasis on handling missing data, missing data can not only be rounded up or replaced by mean, mode, instead, should be to select the appropriate imputation through the missing pattern. It will not only solve the affect by missing data, but also to enhance the credibility and efficiency of the analysis results.This study use the first wave to the third wave of survey data in "Taiwan Education Panel Survey" to investigate the effect of missing treatment.Since the third wave of student data in the "Core Panel", the proportion of missing data samples are relatively high.Using this type of the missing data to analyze, would better reveal the importance of missing data processing methods. In the imputation, we start to find significant variables that may affect the missingness, and then we use of these significant variables to construct 50 groups imitation missing data sets from the complete data set.Then we compare the four missing treatment methods (list-wise deletion, discriminant function imputation, logistic regression imputation , Monte Carlo - Markov chain single imputation) regarding the changes and influence in variables and coefficients of the binary logistic regression model.The results show that Monte Carlo - Markov chain single imputation method has relatively better performance and more stable imputation methods.Finally, we use Monte Carlo - Markov chain single imputation to imputate the original data and analysis, and discuss whether the length of the tutorial can enhance learning through multiple regression analysis and the average treatment effect on the treated (ATT) by completed data after imputated. WANG, HUNG-LUNG 王鴻龍 2014 學位論文 ; thesis 50 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺北大學 === 統計學系 === 102 === With more and more emphasis on handling missing data, missing data can not only be rounded up or replaced by mean, mode, instead, should be to select the appropriate imputation through the missing pattern. It will not only solve the affect by missing data, but also to enhance the credibility and efficiency of the analysis results.This study use the first wave to the third wave of survey data in "Taiwan Education Panel Survey" to investigate the effect of missing treatment.Since the third wave of student data in the "Core Panel", the proportion of missing data samples are relatively high.Using this type of the missing data to analyze, would better reveal the importance of missing data processing methods. In the imputation, we start to find significant variables that may affect the missingness, and then we use of these significant variables to construct 50 groups imitation missing data sets from the complete data set.Then we compare the four missing treatment methods (list-wise deletion, discriminant function imputation, logistic regression imputation , Monte Carlo - Markov chain single imputation) regarding the changes and influence in variables and coefficients of the binary logistic regression model.The results show that Monte Carlo - Markov chain single imputation method has relatively better performance and more stable imputation methods.Finally, we use Monte Carlo - Markov chain single imputation to imputate the original data and analysis, and discuss whether the length of the tutorial can enhance learning through multiple regression analysis and the average treatment effect on the treated (ATT) by completed data after imputated.
author2 WANG, HUNG-LUNG
author_facet WANG, HUNG-LUNG
CHEN,PO-CHANG
陳柏彰
author CHEN,PO-CHANG
陳柏彰
spellingShingle CHEN,PO-CHANG
陳柏彰
Missing treatment for Mass missing data-A case study on Propensity Score Matching Method
author_sort CHEN,PO-CHANG
title Missing treatment for Mass missing data-A case study on Propensity Score Matching Method
title_short Missing treatment for Mass missing data-A case study on Propensity Score Matching Method
title_full Missing treatment for Mass missing data-A case study on Propensity Score Matching Method
title_fullStr Missing treatment for Mass missing data-A case study on Propensity Score Matching Method
title_full_unstemmed Missing treatment for Mass missing data-A case study on Propensity Score Matching Method
title_sort missing treatment for mass missing data-a case study on propensity score matching method
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/dysenk
work_keys_str_mv AT chenpochang missingtreatmentformassmissingdataacasestudyonpropensityscorematchingmethod
AT chénbǎizhāng missingtreatmentformassmissingdataacasestudyonpropensityscorematchingmethod
AT chenpochang dàliàngzīliàoyílòuxiàquēshīchùlǐfāngfǎzhīyánjiūyǐqīngxiàngfēnshùpèiduìfǎwèilì
AT chénbǎizhāng dàliàngzīliàoyílòuxiàquēshīchùlǐfāngfǎzhīyánjiūyǐqīngxiàngfēnshùpèiduìfǎwèilì
_version_ 1719114251427119104