A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment.
We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the mo...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2018-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC5882162?pdf=render |
id |
doaj-63a3882e2dea4652aa6a11aeb5281760 |
---|---|
record_format |
Article |
spelling |
doaj-63a3882e2dea4652aa6a11aeb52817602020-11-25T02:47:06ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01134e019529710.1371/journal.pone.0195297A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment.Kaja ZupancErik ŠtrumbeljWe propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the model provides a good fit for both the total scores and when applied to individual rubrics. We estimate the median impact of rater effects on the final grade to be ± 2 points on a 50 point scale, while 10% of essays would receive a score at least ± 5 different from their actual quality. Most of the impact is due to rater unreliability, not rater bias.http://europepmc.org/articles/PMC5882162?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kaja Zupanc Erik Štrumbelj |
spellingShingle |
Kaja Zupanc Erik Štrumbelj A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment. PLoS ONE |
author_facet |
Kaja Zupanc Erik Štrumbelj |
author_sort |
Kaja Zupanc |
title |
A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment. |
title_short |
A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment. |
title_full |
A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment. |
title_fullStr |
A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment. |
title_full_unstemmed |
A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment. |
title_sort |
bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2018-01-01 |
description |
We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the model provides a good fit for both the total scores and when applied to individual rubrics. We estimate the median impact of rater effects on the final grade to be ± 2 points on a 50 point scale, while 10% of essays would receive a score at least ± 5 different from their actual quality. Most of the impact is due to rater unreliability, not rater bias. |
url |
http://europepmc.org/articles/PMC5882162?pdf=render |
work_keys_str_mv |
AT kajazupanc abayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment AT erikstrumbelj abayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment AT kajazupanc bayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment AT erikstrumbelj bayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment |
_version_ |
1724754499254353920 |