Missing Data - A Gentle Introduction

This thesis provides an introduction to methods for handling missing data. A thorough review of earlier methods and the development of the field of missing data is provided. The thesis present the methods suggested in today’s literature, multiple imputation and maximum likelihood estimation. A simul...

Full description

Bibliographic Details
Main Author:	Österlund, Vilgot
Format:	Others
Language:	English
Published:	Uppsala universitet, Statistiska institutionen 2020
Subjects:	Missing data Small samples Multiple imputation Maximum likelihood Listwise deletion Missing at random Missing completely at random Linear regression Logistic regression. Probability Theory and Statistics Sannolikhetsteori och statistik
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413984

id	ndltd-UPSALLA1-oai-DiVA.org-uu-413984
record_format	oai_dc
spelling	ndltd-UPSALLA1-oai-DiVA.org-uu-4139842020-06-23T03:32:41ZMissing Data - A Gentle IntroductionengÖsterlund, VilgotUppsala universitet, Statistiska institutionen2020Missing dataSmall samplesMultiple imputationMaximum likelihoodListwise deletionMissing at randomMissing completely at randomLinear regressionLogistic regression.Probability Theory and StatisticsSannolikhetsteori och statistikThis thesis provides an introduction to methods for handling missing data. A thorough review of earlier methods and the development of the field of missing data is provided. The thesis present the methods suggested in today’s literature, multiple imputation and maximum likelihood estimation. A simulation study is performed to see if there are circumstances in small samples when any of the two methods are to be preferred. To show the importance of handling missing data, multiple imputation and maximum likelihood are compared to listwise deletion. The results from the simulation study does not show any crucial differences between multiple imputation and maximum likelihood when it comes to point estimates. Some differences are seen in the estimation of the confidence intervals, talking in favour of multiple imputation. The difference is decreasing with an increasing sample size and more studies are needed to draw definite conclusions. Further, the results shows that listwise deletion lead to biased estimations under a missing at random mechanism. The methods are also applied to a real dataset, the Swedish enrollment registry, to show how the methods work in a practical application. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413984application/pdfinfo:eu-repo/semantics/openAccess
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	Missing data Small samples Multiple imputation Maximum likelihood Listwise deletion Missing at random Missing completely at random Linear regression Logistic regression. Probability Theory and Statistics Sannolikhetsteori och statistik
spellingShingle	Missing data Small samples Multiple imputation Maximum likelihood Listwise deletion Missing at random Missing completely at random Linear regression Logistic regression. Probability Theory and Statistics Sannolikhetsteori och statistik Österlund, Vilgot Missing Data - A Gentle Introduction
description	This thesis provides an introduction to methods for handling missing data. A thorough review of earlier methods and the development of the field of missing data is provided. The thesis present the methods suggested in today’s literature, multiple imputation and maximum likelihood estimation. A simulation study is performed to see if there are circumstances in small samples when any of the two methods are to be preferred. To show the importance of handling missing data, multiple imputation and maximum likelihood are compared to listwise deletion. The results from the simulation study does not show any crucial differences between multiple imputation and maximum likelihood when it comes to point estimates. Some differences are seen in the estimation of the confidence intervals, talking in favour of multiple imputation. The difference is decreasing with an increasing sample size and more studies are needed to draw definite conclusions. Further, the results shows that listwise deletion lead to biased estimations under a missing at random mechanism. The methods are also applied to a real dataset, the Swedish enrollment registry, to show how the methods work in a practical application.
author	Österlund, Vilgot
author_facet	Österlund, Vilgot
author_sort	Österlund, Vilgot
title	Missing Data - A Gentle Introduction
title_short	Missing Data - A Gentle Introduction
title_full	Missing Data - A Gentle Introduction
title_fullStr	Missing Data - A Gentle Introduction
title_full_unstemmed	Missing Data - A Gentle Introduction
title_sort	missing data - a gentle introduction
publisher	Uppsala universitet, Statistiska institutionen
publishDate	2020
url	http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413984
work_keys_str_mv	AT osterlundvilgot missingdataagentleintroduction
_version_	1719323075662577664

Missing Data - A Gentle Introduction

Similar Items