A Bayesian approach for analysis of ordered categorical responses subject to misclassification.

Ordinal categorical responses are frequently collected in survey studies, human medicine, and animal and plant improvement programs, just to mention a few. Errors in this type of data are neither rare nor easy to detect. These errors tend to bias the inference, reduce the statistical power and ultim...

Full description

Bibliographic Details
Main Authors:	Ashley Ling, El Hamidi Hay, Samuel E Aggrey, Romdhane Rekaya
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2018-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0208433

id	doaj-ecda0ea2ca254dd7ad0bd8042708e248
record_format	Article
spelling	doaj-ecda0ea2ca254dd7ad0bd8042708e2482021-03-03T21:02:48ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-011312e020843310.1371/journal.pone.0208433A Bayesian approach for analysis of ordered categorical responses subject to misclassification.Ashley LingEl Hamidi HaySamuel E AggreyRomdhane RekayaOrdinal categorical responses are frequently collected in survey studies, human medicine, and animal and plant improvement programs, just to mention a few. Errors in this type of data are neither rare nor easy to detect. These errors tend to bias the inference, reduce the statistical power and ultimately the efficiency of the decision-making process. Contrarily to the binary situation where misclassification occurs between two response classes, noise in ordinal categorical data is more complex due to the increased number of categories, diversity and asymmetry of errors. Although several approaches have been presented for dealing with misclassification in binary data, only limited practical methods have been proposed to analyze noisy categorical responses. A latent variable model implemented within a Bayesian framework was proposed to analyze ordinal categorical data subject to misclassification using simulated and real datasets. The simulated scenario consisted of a discrete response with three categories and a symmetric error rate of 5% between any two classes. The real data consisted of calving ease records of beef cows. Using real and simulated data, ignoring misclassification resulted in substantial bias in the estimation of genetic parameters and reduction of the accuracy of predicted breeding values. Using our proposed approach, a significant reduction in bias and increase in accuracy ranging from 11% to 17% was observed. Furthermore, most of the misclassified observations (in the simulated data) were identified with a substantially higher probability. Similar results were observed for a scenario with asymmetric misclassification. While the extension to traits with more categories between adjacent classes is straightforward, it could be computationally costly. For traits with high heritability, the performance of the methodology would be expected to improve.https://doi.org/10.1371/journal.pone.0208433
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Ashley Ling El Hamidi Hay Samuel E Aggrey Romdhane Rekaya
spellingShingle	Ashley Ling El Hamidi Hay Samuel E Aggrey Romdhane Rekaya A Bayesian approach for analysis of ordered categorical responses subject to misclassification. PLoS ONE
author_facet	Ashley Ling El Hamidi Hay Samuel E Aggrey Romdhane Rekaya
author_sort	Ashley Ling
title	A Bayesian approach for analysis of ordered categorical responses subject to misclassification.
title_short	A Bayesian approach for analysis of ordered categorical responses subject to misclassification.
title_full	A Bayesian approach for analysis of ordered categorical responses subject to misclassification.
title_fullStr	A Bayesian approach for analysis of ordered categorical responses subject to misclassification.
title_full_unstemmed	A Bayesian approach for analysis of ordered categorical responses subject to misclassification.
title_sort	bayesian approach for analysis of ordered categorical responses subject to misclassification.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2018-01-01
description	Ordinal categorical responses are frequently collected in survey studies, human medicine, and animal and plant improvement programs, just to mention a few. Errors in this type of data are neither rare nor easy to detect. These errors tend to bias the inference, reduce the statistical power and ultimately the efficiency of the decision-making process. Contrarily to the binary situation where misclassification occurs between two response classes, noise in ordinal categorical data is more complex due to the increased number of categories, diversity and asymmetry of errors. Although several approaches have been presented for dealing with misclassification in binary data, only limited practical methods have been proposed to analyze noisy categorical responses. A latent variable model implemented within a Bayesian framework was proposed to analyze ordinal categorical data subject to misclassification using simulated and real datasets. The simulated scenario consisted of a discrete response with three categories and a symmetric error rate of 5% between any two classes. The real data consisted of calving ease records of beef cows. Using real and simulated data, ignoring misclassification resulted in substantial bias in the estimation of genetic parameters and reduction of the accuracy of predicted breeding values. Using our proposed approach, a significant reduction in bias and increase in accuracy ranging from 11% to 17% was observed. Furthermore, most of the misclassified observations (in the simulated data) were identified with a substantially higher probability. Similar results were observed for a scenario with asymmetric misclassification. While the extension to traits with more categories between adjacent classes is straightforward, it could be computationally costly. For traits with high heritability, the performance of the methodology would be expected to improve.
url	https://doi.org/10.1371/journal.pone.0208433
work_keys_str_mv	AT ashleyling abayesianapproachforanalysisoforderedcategoricalresponsessubjecttomisclassification AT elhamidihay abayesianapproachforanalysisoforderedcategoricalresponsessubjecttomisclassification AT samueleaggrey abayesianapproachforanalysisoforderedcategoricalresponsessubjecttomisclassification AT romdhanerekaya abayesianapproachforanalysisoforderedcategoricalresponsessubjecttomisclassification AT ashleyling bayesianapproachforanalysisoforderedcategoricalresponsessubjecttomisclassification AT elhamidihay bayesianapproachforanalysisoforderedcategoricalresponsessubjecttomisclassification AT samueleaggrey bayesianapproachforanalysisoforderedcategoricalresponsessubjecttomisclassification AT romdhanerekaya bayesianapproachforanalysisoforderedcategoricalresponsessubjecttomisclassification
_version_	1714819098801602560

A Bayesian approach for analysis of ordered categorical responses subject to misclassification.

Similar Items