Information Matrices in Estimating Function Approach: Tests for Model Misspecification and Model Selection

Estimating functions have been widely used for parameter estimation in various statistical problems. Regular estimating functions produce parameter estimators which have desirable properties, such as consistency and asymptotic normality. In quasi-likelihood inference, an important example of estimat...

Full description

Bibliographic Details
Main Author: Zhou, Qian
Language:en
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/10012/4614
id ndltd-LACETR-oai-collectionscanada.gc.ca-OWTU.10012-4614
record_format oai_dc
spelling ndltd-LACETR-oai-collectionscanada.gc.ca-OWTU.10012-46142013-10-04T04:09:08ZZhou, Qian2009-08-26T20:41:50Z2009-08-26T20:41:50Z2009-08-26T20:41:50Z2009http://hdl.handle.net/10012/4614Estimating functions have been widely used for parameter estimation in various statistical problems. Regular estimating functions produce parameter estimators which have desirable properties, such as consistency and asymptotic normality. In quasi-likelihood inference, an important example of estimating functions, correct specification of the first two moments of the underlying distribution leads to the information unbiasedness, which states that two forms of the information matrix: the negative sensitivity matrix (negative expectation of the first order derivative of an estimating function) and the variability matrix (variance of an estimating function) are equal, or in other words, the analogue of the Fisher information is equivalent to the Godambe information. Consequently, the information unbiasedness indicates that the model-based covariance matrix estimator and sandwich covariance matrix estimator are equivalent. By comparing the model-based and sandwich variance estimators, we propose information ratio (IR) statistics for testing model misspecification of variance/covariance structure under correctly specified mean structure, in the context of linear regression models, generalized linear regression models and generalized estimating equations. Asymptotic properties of the IR statistics are discussed. In addition, through intensive simulation studies, we show that the IR statistics are powerful in various applications: test for heteroscedasticity in linear regression models, test for overdispersion in count data, and test for misspecified variance function and/or misspecified working correlation structure. Moreover, the IR statistics appear more powerful than the classical information matrix test proposed by White (1982). In the literature, model selection criteria have been intensively discussed, but almost all of them target choosing the optimal mean structure. In this thesis, two model selection procedures are proposed for selecting the optimal variance/covariance structure among a collection of candidate structures. One is based on a sequence of the IR tests for all the competing variance/covariance structures. The other is based on an ``information discrepancy criterion" (IDC), which provides a measurement of discrepancy between the negative sensitivity matrix and the variability matrix. In fact, this IDC characterizes the relative efficiency loss when using a certain candidate variance/covariance structure, compared with the true but unknown structure. Through simulation studies and analyses of two data sets, it is shown that the two proposed model selection methods both have a high rate of detecting the true/optimal variance/covariance structure. In particular, since the IDC magnifies the difference among the competing structures, it is highly sensitive to detect the most appropriate variance/covariance structure.enEstimating functionsInformation unbiasednessQuasi-likelihood inferenceModel misspecficationModel selectionInformation Matrices in Estimating Function Approach: Tests for Model Misspecification and Model SelectionThesis or DissertationStatistics and Actuarial ScienceDoctor of PhilosophyStatistics (Biostatistics)
collection NDLTD
language en
sources NDLTD
topic Estimating functions
Information unbiasedness
Quasi-likelihood inference
Model misspecfication
Model selection
Statistics (Biostatistics)
spellingShingle Estimating functions
Information unbiasedness
Quasi-likelihood inference
Model misspecfication
Model selection
Statistics (Biostatistics)
Zhou, Qian
Information Matrices in Estimating Function Approach: Tests for Model Misspecification and Model Selection
description Estimating functions have been widely used for parameter estimation in various statistical problems. Regular estimating functions produce parameter estimators which have desirable properties, such as consistency and asymptotic normality. In quasi-likelihood inference, an important example of estimating functions, correct specification of the first two moments of the underlying distribution leads to the information unbiasedness, which states that two forms of the information matrix: the negative sensitivity matrix (negative expectation of the first order derivative of an estimating function) and the variability matrix (variance of an estimating function) are equal, or in other words, the analogue of the Fisher information is equivalent to the Godambe information. Consequently, the information unbiasedness indicates that the model-based covariance matrix estimator and sandwich covariance matrix estimator are equivalent. By comparing the model-based and sandwich variance estimators, we propose information ratio (IR) statistics for testing model misspecification of variance/covariance structure under correctly specified mean structure, in the context of linear regression models, generalized linear regression models and generalized estimating equations. Asymptotic properties of the IR statistics are discussed. In addition, through intensive simulation studies, we show that the IR statistics are powerful in various applications: test for heteroscedasticity in linear regression models, test for overdispersion in count data, and test for misspecified variance function and/or misspecified working correlation structure. Moreover, the IR statistics appear more powerful than the classical information matrix test proposed by White (1982). In the literature, model selection criteria have been intensively discussed, but almost all of them target choosing the optimal mean structure. In this thesis, two model selection procedures are proposed for selecting the optimal variance/covariance structure among a collection of candidate structures. One is based on a sequence of the IR tests for all the competing variance/covariance structures. The other is based on an ``information discrepancy criterion" (IDC), which provides a measurement of discrepancy between the negative sensitivity matrix and the variability matrix. In fact, this IDC characterizes the relative efficiency loss when using a certain candidate variance/covariance structure, compared with the true but unknown structure. Through simulation studies and analyses of two data sets, it is shown that the two proposed model selection methods both have a high rate of detecting the true/optimal variance/covariance structure. In particular, since the IDC magnifies the difference among the competing structures, it is highly sensitive to detect the most appropriate variance/covariance structure.
author Zhou, Qian
author_facet Zhou, Qian
author_sort Zhou, Qian
title Information Matrices in Estimating Function Approach: Tests for Model Misspecification and Model Selection
title_short Information Matrices in Estimating Function Approach: Tests for Model Misspecification and Model Selection
title_full Information Matrices in Estimating Function Approach: Tests for Model Misspecification and Model Selection
title_fullStr Information Matrices in Estimating Function Approach: Tests for Model Misspecification and Model Selection
title_full_unstemmed Information Matrices in Estimating Function Approach: Tests for Model Misspecification and Model Selection
title_sort information matrices in estimating function approach: tests for model misspecification and model selection
publishDate 2009
url http://hdl.handle.net/10012/4614
work_keys_str_mv AT zhouqian informationmatricesinestimatingfunctionapproachtestsformodelmisspecificationandmodelselection
_version_ 1716600246967992320