Robust Multicollinearity Diagnostic Measure For Fixed Effect Panel Data Model




High Leverage Points, Generalized-M estimator, High Leverage Collinearity Enhancing Observations, Multicollinearity, Within Group Least Squares Estimator


It is now evident that high leverage points (HLPs) can induce the multicollinearity pattern of a data in fixed effect panel data model. Those observations that are responsible for this phenomenon are called high leverage collinearity-enhancing observations (HLCEO). The commonly used within group ordinary least squares (WOLS) estimator for estimating the parameters of fixed effect panel data model is easily affected by HLCEOs. In their presence, the WOLS estimates may produce large variances and this would lead to erroneous interpretation. Therefore, it is imperative to detect the multicollinearity which is caused by HLCEOs. The classical Variance Inflation Factor (CVIF) is the commonly used diagnostic method for detecting multicollinearity in panel data. However, it is not correctly diagnosed multicollinearity in the presence of HLCEOs. Hence, in this paper three new robust diagnostic methods of diagnosing multicollinearity in panel data are proposed, namely the RVIF (WGM-FIMGT), RVIF (WGM-DRGP) and RVIF (WMM) and compared their performances with the CVIF. The numerical evidences show that the CVIF incorrectly diagnosed multicollinearity but our proposed methods correctly diagnosed no multicollinearity in the presence of HLCEOs where RVIF (WGM-FIMGT) being the best method as it has the least computational running time.


Kamruzzaman, M. and Imon, A.H.M.R.. High leverage point: another source of multicollinearity. Pakistan Journal of Statistics. 18(3): 435-448 , 2002.

Bagheri, A., and Midi, H. On the performance of robust variance inflation factors. International Journal of Agricultural and Statistical Sciences. 7(1): 31-45, 2011.

Bagheri, A., Habshah M. and Imon, A.H.M.R. A novel collinearity-influential observation diagnostic measure based on a group deletion approach. Communications in Statistics - Simulation and Computation. 41(8): 1379-1396, 2012.

Midi, H., Bagheri, A. and Imon, A.H.M.R. A Monte Carlo simulation study on high leverage collinearity-enhancing observation and its effect on multicollinearity pattern. Sains Malaysiana. 40(12), 1437-1447, 2011.

Midi, H., Ismaeel S. S., and Arasan J. On the performance of fast Robust Variance Inflation factor based on index set equality. Journal of Engineering and Applied Sciences. 13(16): 6634-6638 , 2018.

Habshah, M., Norazan, M.R., and Imon, A.H.M.R. The performance of Diagnostic-Robust Generalized Potentials for the identification of multiple high leverage points in linear regression. Journal of Applied Statistics. 36(5): 507-520, 2009.

Hadi, A.S. Diagnosing collinearity-influential observations. Computational Statistics and Data Analysis. 1988.7(2): 143-159 ,2011.

Sengupta, D. and Bhimasankaram P. On the roles of observations in collinearity in the linear model. Journal of the American Statistical Association. 92(439): 1024-1032 ,1997.

Bramati, M.C. and Croux C. Robust estimators for the fixed effects panel data model. The Econometrics Journal. 10(3): 521-540, 2007.

Baltagi, B. Econometric analysis of panel data. John Wiley & Sons, 2008.

Bakar, N. M. A. and H. Midi. Robust centering in the fixed effect panel data model. Pakistan Journal of Statistics. 31(1):33-48, 2015.

Bagheri A, and Midi H. Diagnostic plot for the identification of high leverage collinearity-influential observations. Statistics and Operations Research Transactions. 39(1): 51-70, 2015.

Ismaeel, S.S., and Midi, H. Robust within group estimator for fixed effect panel data. Pakistan Journal of Statistics. 34(4): 297-310, 2018.

Lim, H. A. and H. Midi. Diagnostic Robust Generalized Potential Based on Index Set Equality (DRGP (ISE)) for the identification of high leverage points in linear model. Computational Statistics. 3(31): 859-877, 2016.

Rousseeuw, P.J., and Van Zomeren, B.C. Unmasking multivariate outliers and leverage points. Journal of the American Statistical Association. 85(411): 633-639, 1990.

Maronna, R.A., Martin, R.D., and Yohai, V.J. Robust Statistics Theory and Methods. New York: Wiley and Sons, 2006.

Krasker, W. S. and Welsch R.E. Efficient bounded-influence regression estimation. Journal of the American Statistical Association. 77(379): 595-604, 1982

Rousseeuw, P.J., and Leroy, A.M. Robust Regression and Outlier Detection. New York: Wiley, 1987.

Coakley, C.W., and Hettmansperger, T.P. A bounded influence, high breakdown, efficient regression estimator. Journal of the American Statistical Association. 88(423): 872-880, 1993.

Midi, H., Sani, M., Ismaeel, S.S., and Arasan, J. Fast Improvised Influential Distance for the Identification of Influential Observations in Multiple Linear Regression. Sains Malaysiana. 50(7), 2085-2094, 2021.

Rousseeuw, P.J. Least median of squares regression. Journal of the American Statistical Association. 79(388): 871-880,1984.

Greene, W.H. Econometric Analysis. 6th Edition, Prentice Hall, 2007.