A case study in handling over-dispersion in nematode count data

Master of Science === Department of Statistics === Leigh W. Murray === Traditionally the Poisson process is used to model count response variables. However, a problem arises when the particular response variable contains an inordinate number of both zeros and large observations, relative to the mean...

Full description

Bibliographic Details
Main Author: Kreider, Scott Edwin Douglas
Language:en_US
Published: Kansas State University 2010
Subjects:
Online Access:http://hdl.handle.net/2097/4248
id ndltd-KSU-oai-krex.k-state.edu-2097-4248
record_format oai_dc
spelling ndltd-KSU-oai-krex.k-state.edu-2097-42482016-03-01T03:50:44Z A case study in handling over-dispersion in nematode count data Kreider, Scott Edwin Douglas Overdispersion Poisson Hurdle Zero Inflated Poisson Generalized Poisson Nematode Statistics (0463) Master of Science Department of Statistics Leigh W. Murray Traditionally the Poisson process is used to model count response variables. However, a problem arises when the particular response variable contains an inordinate number of both zeros and large observations, relative to the mean, for a typical Poisson process. In cases such as these, the variance of the data is greater than the mean and as such the data are over-dispersed with respect to the Poisson distribution due to the fact that the mean equals the variance for the Poisson distribution. This case study looks at several common and uncommon ways to attempt to properly account for this over-dispersion in a specific set of nematode count data using various procedures in SAS 9.2. These methods include but are not limited to a basic linear regression model, a generalized linear (log-linear) model, a zero-inflated Poisson model, a generalized Poisson model, and a Poisson hurdle model. Based on the AIC statistics the generalized log-linear models with the Pearson-scale and deviance-scale corrections perform the best. However, based on residual plots, none of the models appear to fit the data adequately. Further work with non-parametric methods or the negative binomial distribution may yield more ideal results. 2010-07-01T12:53:45Z 2010-07-01T12:53:45Z 2010-07-01T12:53:45Z 2010 August Report http://hdl.handle.net/2097/4248 en_US Kansas State University
collection NDLTD
language en_US
sources NDLTD
topic Overdispersion
Poisson Hurdle
Zero Inflated Poisson
Generalized Poisson
Nematode
Statistics (0463)
spellingShingle Overdispersion
Poisson Hurdle
Zero Inflated Poisson
Generalized Poisson
Nematode
Statistics (0463)
Kreider, Scott Edwin Douglas
A case study in handling over-dispersion in nematode count data
description Master of Science === Department of Statistics === Leigh W. Murray === Traditionally the Poisson process is used to model count response variables. However, a problem arises when the particular response variable contains an inordinate number of both zeros and large observations, relative to the mean, for a typical Poisson process. In cases such as these, the variance of the data is greater than the mean and as such the data are over-dispersed with respect to the Poisson distribution due to the fact that the mean equals the variance for the Poisson distribution. This case study looks at several common and uncommon ways to attempt to properly account for this over-dispersion in a specific set of nematode count data using various procedures in SAS 9.2. These methods include but are not limited to a basic linear regression model, a generalized linear (log-linear) model, a zero-inflated Poisson model, a generalized Poisson model, and a Poisson hurdle model. Based on the AIC statistics the generalized log-linear models with the Pearson-scale and deviance-scale corrections perform the best. However, based on residual plots, none of the models appear to fit the data adequately. Further work with non-parametric methods or the negative binomial distribution may yield more ideal results.
author Kreider, Scott Edwin Douglas
author_facet Kreider, Scott Edwin Douglas
author_sort Kreider, Scott Edwin Douglas
title A case study in handling over-dispersion in nematode count data
title_short A case study in handling over-dispersion in nematode count data
title_full A case study in handling over-dispersion in nematode count data
title_fullStr A case study in handling over-dispersion in nematode count data
title_full_unstemmed A case study in handling over-dispersion in nematode count data
title_sort case study in handling over-dispersion in nematode count data
publisher Kansas State University
publishDate 2010
url http://hdl.handle.net/2097/4248
work_keys_str_mv AT kreiderscottedwindouglas acasestudyinhandlingoverdispersioninnematodecountdata
AT kreiderscottedwindouglas casestudyinhandlingoverdispersioninnematodecountdata
_version_ 1718196496847863808