Missing Data - A Gentle Introduction

This thesis provides an introduction to methods for handling missing data. A thorough review of earlier methods and the development of the field of missing data is provided. The thesis present the methods suggested in today’s literature, multiple imputation and maximum likelihood estimation. A simul...

Full description

Bibliographic Details
Main Author: Österlund, Vilgot
Format: Others
Language:English
Published: Uppsala universitet, Statistiska institutionen 2020
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413984
id ndltd-UPSALLA1-oai-DiVA.org-uu-413984
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-uu-4139842020-06-23T03:32:41ZMissing Data - A Gentle IntroductionengÖsterlund, VilgotUppsala universitet, Statistiska institutionen2020Missing dataSmall samplesMultiple imputationMaximum likelihoodListwise deletionMissing at randomMissing completely at randomLinear regressionLogistic regression.Probability Theory and StatisticsSannolikhetsteori och statistikThis thesis provides an introduction to methods for handling missing data. A thorough review of earlier methods and the development of the field of missing data is provided. The thesis present the methods suggested in today’s literature, multiple imputation and maximum likelihood estimation. A simulation study is performed to see if there are circumstances in small samples when any of the two methods are to be preferred. To show the importance of handling missing data, multiple imputation and maximum likelihood are compared to listwise deletion. The results from the simulation study does not show any crucial differences between multiple imputation and maximum likelihood when it comes to point estimates. Some differences are seen in the estimation of the confidence intervals, talking in favour of multiple imputation. The difference is decreasing with an increasing sample size and more studies are needed to draw definite conclusions. Further, the results shows that listwise deletion lead to biased estimations under a missing at random mechanism. The methods are also applied to a real dataset, the Swedish enrollment registry, to show how the methods work in a practical application. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413984application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Missing data
Small samples
Multiple imputation
Maximum likelihood
Listwise deletion
Missing at random
Missing completely at random
Linear regression
Logistic regression.
Probability Theory and Statistics
Sannolikhetsteori och statistik
spellingShingle Missing data
Small samples
Multiple imputation
Maximum likelihood
Listwise deletion
Missing at random
Missing completely at random
Linear regression
Logistic regression.
Probability Theory and Statistics
Sannolikhetsteori och statistik
Österlund, Vilgot
Missing Data - A Gentle Introduction
description This thesis provides an introduction to methods for handling missing data. A thorough review of earlier methods and the development of the field of missing data is provided. The thesis present the methods suggested in today’s literature, multiple imputation and maximum likelihood estimation. A simulation study is performed to see if there are circumstances in small samples when any of the two methods are to be preferred. To show the importance of handling missing data, multiple imputation and maximum likelihood are compared to listwise deletion. The results from the simulation study does not show any crucial differences between multiple imputation and maximum likelihood when it comes to point estimates. Some differences are seen in the estimation of the confidence intervals, talking in favour of multiple imputation. The difference is decreasing with an increasing sample size and more studies are needed to draw definite conclusions. Further, the results shows that listwise deletion lead to biased estimations under a missing at random mechanism. The methods are also applied to a real dataset, the Swedish enrollment registry, to show how the methods work in a practical application.
author Österlund, Vilgot
author_facet Österlund, Vilgot
author_sort Österlund, Vilgot
title Missing Data - A Gentle Introduction
title_short Missing Data - A Gentle Introduction
title_full Missing Data - A Gentle Introduction
title_fullStr Missing Data - A Gentle Introduction
title_full_unstemmed Missing Data - A Gentle Introduction
title_sort missing data - a gentle introduction
publisher Uppsala universitet, Statistiska institutionen
publishDate 2020
url http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413984
work_keys_str_mv AT osterlundvilgot missingdataagentleintroduction
_version_ 1719323075662577664