Missing Data - A Gentle Introduction
This thesis provides an introduction to methods for handling missing data. A thorough review of earlier methods and the development of the field of missing data is provided. The thesis present the methods suggested in today’s literature, multiple imputation and maximum likelihood estimation. A simul...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
Uppsala universitet, Statistiska institutionen
2020
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413984 |
id |
ndltd-UPSALLA1-oai-DiVA.org-uu-413984 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-uu-4139842020-06-23T03:32:41ZMissing Data - A Gentle IntroductionengÖsterlund, VilgotUppsala universitet, Statistiska institutionen2020Missing dataSmall samplesMultiple imputationMaximum likelihoodListwise deletionMissing at randomMissing completely at randomLinear regressionLogistic regression.Probability Theory and StatisticsSannolikhetsteori och statistikThis thesis provides an introduction to methods for handling missing data. A thorough review of earlier methods and the development of the field of missing data is provided. The thesis present the methods suggested in today’s literature, multiple imputation and maximum likelihood estimation. A simulation study is performed to see if there are circumstances in small samples when any of the two methods are to be preferred. To show the importance of handling missing data, multiple imputation and maximum likelihood are compared to listwise deletion. The results from the simulation study does not show any crucial differences between multiple imputation and maximum likelihood when it comes to point estimates. Some differences are seen in the estimation of the confidence intervals, talking in favour of multiple imputation. The difference is decreasing with an increasing sample size and more studies are needed to draw definite conclusions. Further, the results shows that listwise deletion lead to biased estimations under a missing at random mechanism. The methods are also applied to a real dataset, the Swedish enrollment registry, to show how the methods work in a practical application. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413984application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Missing data Small samples Multiple imputation Maximum likelihood Listwise deletion Missing at random Missing completely at random Linear regression Logistic regression. Probability Theory and Statistics Sannolikhetsteori och statistik |
spellingShingle |
Missing data Small samples Multiple imputation Maximum likelihood Listwise deletion Missing at random Missing completely at random Linear regression Logistic regression. Probability Theory and Statistics Sannolikhetsteori och statistik Österlund, Vilgot Missing Data - A Gentle Introduction |
description |
This thesis provides an introduction to methods for handling missing data. A thorough review of earlier methods and the development of the field of missing data is provided. The thesis present the methods suggested in today’s literature, multiple imputation and maximum likelihood estimation. A simulation study is performed to see if there are circumstances in small samples when any of the two methods are to be preferred. To show the importance of handling missing data, multiple imputation and maximum likelihood are compared to listwise deletion. The results from the simulation study does not show any crucial differences between multiple imputation and maximum likelihood when it comes to point estimates. Some differences are seen in the estimation of the confidence intervals, talking in favour of multiple imputation. The difference is decreasing with an increasing sample size and more studies are needed to draw definite conclusions. Further, the results shows that listwise deletion lead to biased estimations under a missing at random mechanism. The methods are also applied to a real dataset, the Swedish enrollment registry, to show how the methods work in a practical application. |
author |
Österlund, Vilgot |
author_facet |
Österlund, Vilgot |
author_sort |
Österlund, Vilgot |
title |
Missing Data - A Gentle Introduction |
title_short |
Missing Data - A Gentle Introduction |
title_full |
Missing Data - A Gentle Introduction |
title_fullStr |
Missing Data - A Gentle Introduction |
title_full_unstemmed |
Missing Data - A Gentle Introduction |
title_sort |
missing data - a gentle introduction |
publisher |
Uppsala universitet, Statistiska institutionen |
publishDate |
2020 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413984 |
work_keys_str_mv |
AT osterlundvilgot missingdataagentleintroduction |
_version_ |
1719323075662577664 |