Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.]

Outliers with respect to the predictor variables are called high leverage points. The observations that are slightly different from all others can drive to a large difference in the results of regression analysis. In regression analysis, the detection of high leverage points is compulsory, as they w...

Full description

Bibliographic Details
Main Authors: Ismail, Nor Azima (Author), Midi, Prof Dr. Habshah (Author), Mohamad Sobri, Norafefah Mohamad Sobri (Author), Zulkifli, Siti Nurani Zulkifli (Author)
Format: Article
Language:English
Published: Unit Penerbitan UiTM Kelantan, 2016-06.
Subjects:
Online Access:Get fulltext
View Fulltext in UiTM IR
LEADER 02151 am a22002173u 4500
001 24068
042 |a dc 
100 1 0 |a Ismail, Nor Azima  |e author 
700 1 0 |a Midi, Prof Dr. Habshah  |e author 
700 1 0 |a Mohamad Sobri, Norafefah Mohamad Sobri  |e author 
700 1 0 |a Zulkifli, Siti Nurani Zulkifli  |e author 
245 0 0 |a Identification of High Leverage Points in Multiple Linear Regression / Noor Azima Ismail... [et al.] 
260 |b Unit Penerbitan UiTM Kelantan,   |c 2016-06. 
856 |z Get fulltext  |u https://ir.uitm.edu.my/id/eprint/24068/1/4 
856 |z View Fulltext in UiTM IR  |u https://ir.uitm.edu.my/id/eprint/24068/ 
520 |a Outliers with respect to the predictor variables are called high leverage points. The observations that are slightly different from all others can drive to a large difference in the results of regression analysis. In regression analysis, the detection of high leverage points is compulsory, as they will give large impact on the estimation values as well as lead to multicollinearity problems. In this situation, robust regression procedure can be very useful to deal with problems arise due to the existence of high leverage points. The aim of this study is to compare the performance of three methods in detecting high leverage points. At first stage, the two well-known data sets are considered. The first data used is artificial data set generated by Hawkins, Bradu and Kass in 1984 and the second data used is stack loss data by Brownlee in 1965. The second stage of this study is to conduct simulation study whereby the data were generated based on clean and contaminated data. The three sets of measures being considered in this study are Leverage methods Ttwice-the-mean-rule), Generalized Potentials and Diagnostic Robust Generalized Approach (DRGP). The result indicates that DRGP successfully proved its ability as a powerful method of detecting high leverage points as compared to the other two methods using both artificial data sets and simulated data. 
546 |a en 
650 0 4 |a Electronic digital computers 
650 0 4 |a Computer software 
650 0 4 |a Operating systems (Computers) 
655 7 |a Article