Working with collinearity in epidemiology : development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regression

Collinearity plays an integral role in regression studies involving epidemiological data. Variables often form part of a common biological mechanism or measure the same element of a latent structure. It is a natural feature of most data and as such it is rarely possible to physically control for col...

Full description

Bibliographic Details
Main Author: Woolston, Andrew Stephen
Other Authors: Gilthorpe, M. ; Tu, Y. K. ; Baxter, P.
Published: University of Leeds 2012
Subjects:
Online Access:http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.559148
id ndltd-bl.uk-oai-ethos.bl.uk-559148
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-5591482017-10-04T03:35:46ZWorking with collinearity in epidemiology : development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regressionWoolston, Andrew StephenGilthorpe, M. ; Tu, Y. K. ; Baxter, P.2012Collinearity plays an integral role in regression studies involving epidemiological data. Variables often form part of a common biological mechanism or measure the same element of a latent structure. It is a natural feature of most data and as such it is rarely possible to physically control for collinearity in data collection. A focus is placed on the analytical assessment of the data. Departures from independence can severely distort the interpretation of a model and the role of each covariate. This leads to increased inaccuracy as expressed through the regression coefficients and increased uncertainty as expressed through coefficient standard errors. Such a feature has the potential to impact on the clinical conclusions formed from regression studies. The work in this thesis first considers an assessment of the impact of collinearity on model parameters and the conclusions formed. A new collinearity index is developed which incorporates the role of the response in moderating the impact of collinearity. The idea for the new index is developed using vector geometry and extended to a general measure. The work in collinearity is later extended to consider the formation of a dependency structure from a collection of collinear variables. A novel methodology, labelled the matroid approach, is coded and implemented on a metabolic syndrome dataset to extract a latent structure that could represent this clinical construct. Comparisons are subsequently made to existing exploratory factor analysis and clustering methods in the literature. Finally, the unique problem of perfect collinearity is considered in a lifecourse and age-period-cohort setting. The justification of constraint and non-constraint regression methods is considered in an attempt to provide ‘solutions’ to the identification problem generated by collinearity.614.4University of Leedshttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.559148http://etheses.whiterose.ac.uk/2951/Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 614.4
spellingShingle 614.4
Woolston, Andrew Stephen
Working with collinearity in epidemiology : development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regression
description Collinearity plays an integral role in regression studies involving epidemiological data. Variables often form part of a common biological mechanism or measure the same element of a latent structure. It is a natural feature of most data and as such it is rarely possible to physically control for collinearity in data collection. A focus is placed on the analytical assessment of the data. Departures from independence can severely distort the interpretation of a model and the role of each covariate. This leads to increased inaccuracy as expressed through the regression coefficients and increased uncertainty as expressed through coefficient standard errors. Such a feature has the potential to impact on the clinical conclusions formed from regression studies. The work in this thesis first considers an assessment of the impact of collinearity on model parameters and the conclusions formed. A new collinearity index is developed which incorporates the role of the response in moderating the impact of collinearity. The idea for the new index is developed using vector geometry and extended to a general measure. The work in collinearity is later extended to consider the formation of a dependency structure from a collection of collinear variables. A novel methodology, labelled the matroid approach, is coded and implemented on a metabolic syndrome dataset to extract a latent structure that could represent this clinical construct. Comparisons are subsequently made to existing exploratory factor analysis and clustering methods in the literature. Finally, the unique problem of perfect collinearity is considered in a lifecourse and age-period-cohort setting. The justification of constraint and non-constraint regression methods is considered in an attempt to provide ‘solutions’ to the identification problem generated by collinearity.
author2 Gilthorpe, M. ; Tu, Y. K. ; Baxter, P.
author_facet Gilthorpe, M. ; Tu, Y. K. ; Baxter, P.
Woolston, Andrew Stephen
author Woolston, Andrew Stephen
author_sort Woolston, Andrew Stephen
title Working with collinearity in epidemiology : development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regression
title_short Working with collinearity in epidemiology : development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regression
title_full Working with collinearity in epidemiology : development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regression
title_fullStr Working with collinearity in epidemiology : development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regression
title_full_unstemmed Working with collinearity in epidemiology : development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regression
title_sort working with collinearity in epidemiology : development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regression
publisher University of Leeds
publishDate 2012
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.559148
work_keys_str_mv AT woolstonandrewstephen workingwithcollinearityinepidemiologydevelopmentofcollinearitydiagnosticsidentifyinglatentconstructsinexploratoryresearchanddealingwithperfectlycollinearvariablesinregression
_version_ 1718545166165344256