Measuring Individual Growth With Conventional and Adaptive Tests

Measuring individuals or groups longitudinally is frequently necessary in social science research and applications. Substantial research and discussion has focused on the statistical properties of measures of change and some of the psychometric problems involved This monte-carlo simulation study foc...

Full description

Bibliographic Details
Main Authors: David J. Weiss, Shannon Von Minden
Format: Article
Language:English
Published: University of Arizona Libraries 2011-03-01
Series:Journal of Methods and Measurement in the Social Sciences
Subjects:
Online Access:https://journals.uair.arizona.edu/index.php/jmmss/article/view/15990
id doaj-bf3e0629077042af96f2abdc1f93c46d
record_format Article
spelling doaj-bf3e0629077042af96f2abdc1f93c46d2020-11-25T03:11:27ZengUniversity of Arizona LibrariesJournal of Methods and Measurement in the Social Sciences2159-78552011-03-01228010110.2458/v2i2.1599015887Measuring Individual Growth With Conventional and Adaptive TestsDavid J. Weiss0Shannon Von Minden1University of MinnesotaUniversity of MinnesotaMeasuring individuals or groups longitudinally is frequently necessary in social science research and applications. Substantial research and discussion has focused on the statistical properties of measures of change and some of the psychometric problems involved This monte-carlo simulation study focused on properties of the measurement instruments used for obtaining scores that represent change or growth over five time points and examined how well scores from conventional tests and computerized adaptive tests used to measure individual growth curves reflect true change. Data representing four different patterns of individual change and a baseline no-change condition were generated from an item response theory (IRT) model. Different tests simulated were conventional peaked tests with narrow and wider difficulties and three levels of discrimination, and computerized adaptive tests (CATs) drawn from banks with the same levels of discrimination. Conventional tests were scored by number correct and IRT weighted maximum likelihood. Results showed that as the examinees’ scores moved from the difficulty levels at which the tests were concentrated, number-correct scores over-estimated true change and had increasing amounts of error. High discrimination conventional tests had the poorest recovery of change for both groups and individuals. IRT scoring of the conventional tests improved recovery of change somewhat. By contrast, CATs consistently estimated growth with minimum and consistent error and performed best with highly discriminating items.   DOI:10.2458/azu_jmmss_v2i2_weisshttps://journals.uair.arizona.edu/index.php/jmmss/article/view/15990adaptive testing, computerized adaptive tests, conventional tests, individual growth, item response theory, measuring change, measuring growth, off-target tests
collection DOAJ
language English
format Article
sources DOAJ
author David J. Weiss
Shannon Von Minden
spellingShingle David J. Weiss
Shannon Von Minden
Measuring Individual Growth With Conventional and Adaptive Tests
Journal of Methods and Measurement in the Social Sciences
adaptive testing, computerized adaptive tests, conventional tests, individual growth, item response theory, measuring change, measuring growth, off-target tests
author_facet David J. Weiss
Shannon Von Minden
author_sort David J. Weiss
title Measuring Individual Growth With Conventional and Adaptive Tests
title_short Measuring Individual Growth With Conventional and Adaptive Tests
title_full Measuring Individual Growth With Conventional and Adaptive Tests
title_fullStr Measuring Individual Growth With Conventional and Adaptive Tests
title_full_unstemmed Measuring Individual Growth With Conventional and Adaptive Tests
title_sort measuring individual growth with conventional and adaptive tests
publisher University of Arizona Libraries
series Journal of Methods and Measurement in the Social Sciences
issn 2159-7855
publishDate 2011-03-01
description Measuring individuals or groups longitudinally is frequently necessary in social science research and applications. Substantial research and discussion has focused on the statistical properties of measures of change and some of the psychometric problems involved This monte-carlo simulation study focused on properties of the measurement instruments used for obtaining scores that represent change or growth over five time points and examined how well scores from conventional tests and computerized adaptive tests used to measure individual growth curves reflect true change. Data representing four different patterns of individual change and a baseline no-change condition were generated from an item response theory (IRT) model. Different tests simulated were conventional peaked tests with narrow and wider difficulties and three levels of discrimination, and computerized adaptive tests (CATs) drawn from banks with the same levels of discrimination. Conventional tests were scored by number correct and IRT weighted maximum likelihood. Results showed that as the examinees’ scores moved from the difficulty levels at which the tests were concentrated, number-correct scores over-estimated true change and had increasing amounts of error. High discrimination conventional tests had the poorest recovery of change for both groups and individuals. IRT scoring of the conventional tests improved recovery of change somewhat. By contrast, CATs consistently estimated growth with minimum and consistent error and performed best with highly discriminating items.   DOI:10.2458/azu_jmmss_v2i2_weiss
topic adaptive testing, computerized adaptive tests, conventional tests, individual growth, item response theory, measuring change, measuring growth, off-target tests
url https://journals.uair.arizona.edu/index.php/jmmss/article/view/15990
work_keys_str_mv AT davidjweiss measuringindividualgrowthwithconventionalandadaptivetests
AT shannonvonminden measuringindividualgrowthwithconventionalandadaptivetests
_version_ 1724654126078361600