Implications of using Likert data in multiple regression analysis

Many of the measures obtained in educational research are Likert-type responses on questionnaires. These Likert-type variables are sometimes used in ordinary least-squares regression analysis. However, among the key implications of the assumptions of regression is that the criterion is continuous. L...

Full description

Bibliographic Details
Main Author: Owuor, Charles Ochieng
Format: Others
Language:English
Published: 2009
Online Access:http://hdl.handle.net/2429/13778
id ndltd-UBC-oai-circle.library.ubc.ca-2429-13778
record_format oai_dc
spelling ndltd-UBC-oai-circle.library.ubc.ca-2429-137782018-01-05T17:36:58Z Implications of using Likert data in multiple regression analysis Owuor, Charles Ochieng Many of the measures obtained in educational research are Likert-type responses on questionnaires. These Likert-type variables are sometimes used in ordinary least-squares regression analysis. However, among the key implications of the assumptions of regression is that the criterion is continuous. Little research has been done to examine how much information is lost and how inappropriate it is to use Likert variables in ordinary least-squares multiple regression. Therefore, this study examined the effect of Likert-type responses in the criterion variable and predictors for various scale points, on the accuracy of regression models using normal and skewed observed response patterns. This was done for the case of three predictors and one criterion. Similarly, eight levels of Likert-type categorization ranging from two to nine scale points were considered for both predictors and criterion variables. It was found that the largest bias in the estimation of the model R-squared, the relative Pratt Index, and Pearson correlation coefficient occurred for two or three-point Likert scales. The bias did not substantially reduce any further beyond the four-point Likert scale. Type of correlation matrix had no effect on the model fit. However, skewed response distribution resulted in large biases in both R² and Pearson correlation, but not in Relative Pratt index, which was not affected by the response distribution. Practical contribution and significance of the study is that it has provided information and insight on how much information is lost due to bias, and the extent to which accuracy is compromised in using Likert data in linear regression models in education and social science research. It is recommended that researchers and practitioners should recognize the extent of the bias in ordinary least-squares regression models with Likert data, resulting in substantial loss of information. For variable importance, the relative Pratt index should be used given that it is robust to Likert conditions and response distributions. Finally, when interpreting reported regression results in the research literature one should recognize that the reported R-squared values are underestimated and that the Pearson correlations are also typically underestimated and sometimes substantially underestimated. Education, Faculty of Educational and Counselling Psychology, and Special Education (ECPS), Department of Graduate 2009-10-08T21:23:38Z 2009-10-08T21:23:38Z 2001 2001-05 Text Thesis/Dissertation http://hdl.handle.net/2429/13778 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use. 6496528 bytes application/pdf
collection NDLTD
language English
format Others
sources NDLTD
description Many of the measures obtained in educational research are Likert-type responses on questionnaires. These Likert-type variables are sometimes used in ordinary least-squares regression analysis. However, among the key implications of the assumptions of regression is that the criterion is continuous. Little research has been done to examine how much information is lost and how inappropriate it is to use Likert variables in ordinary least-squares multiple regression. Therefore, this study examined the effect of Likert-type responses in the criterion variable and predictors for various scale points, on the accuracy of regression models using normal and skewed observed response patterns. This was done for the case of three predictors and one criterion. Similarly, eight levels of Likert-type categorization ranging from two to nine scale points were considered for both predictors and criterion variables. It was found that the largest bias in the estimation of the model R-squared, the relative Pratt Index, and Pearson correlation coefficient occurred for two or three-point Likert scales. The bias did not substantially reduce any further beyond the four-point Likert scale. Type of correlation matrix had no effect on the model fit. However, skewed response distribution resulted in large biases in both R² and Pearson correlation, but not in Relative Pratt index, which was not affected by the response distribution. Practical contribution and significance of the study is that it has provided information and insight on how much information is lost due to bias, and the extent to which accuracy is compromised in using Likert data in linear regression models in education and social science research. It is recommended that researchers and practitioners should recognize the extent of the bias in ordinary least-squares regression models with Likert data, resulting in substantial loss of information. For variable importance, the relative Pratt index should be used given that it is robust to Likert conditions and response distributions. Finally, when interpreting reported regression results in the research literature one should recognize that the reported R-squared values are underestimated and that the Pearson correlations are also typically underestimated and sometimes substantially underestimated. === Education, Faculty of === Educational and Counselling Psychology, and Special Education (ECPS), Department of === Graduate
author Owuor, Charles Ochieng
spellingShingle Owuor, Charles Ochieng
Implications of using Likert data in multiple regression analysis
author_facet Owuor, Charles Ochieng
author_sort Owuor, Charles Ochieng
title Implications of using Likert data in multiple regression analysis
title_short Implications of using Likert data in multiple regression analysis
title_full Implications of using Likert data in multiple regression analysis
title_fullStr Implications of using Likert data in multiple regression analysis
title_full_unstemmed Implications of using Likert data in multiple regression analysis
title_sort implications of using likert data in multiple regression analysis
publishDate 2009
url http://hdl.handle.net/2429/13778
work_keys_str_mv AT owuorcharlesochieng implicationsofusinglikertdatainmultipleregressionanalysis
_version_ 1718589451381243904