Application of Principal Component Analysis and Hierarchical Regression Model on Kenya Macroeconomic Indicators
##plugins.themes.bootstrap3.article.main##
The aim of this paper was to apply Principal Component Analysis (PCA) and hierarchical regression model on Kenyan Macroeconomic variables. The study adopted a mixed research design (descriptive and correlational research designs). The 18 macroeconomic variables data were extracted from Kenya National Bureau of Statistics and World Bank for the period 1970 to 2019. The R software was utilized to conduct all the data analysis. Principal Component Analysis was used to reduce the dimensionality of the data, where the original data set matrix was reduced to Eigenvectors and Eigenvalues. A hierarchical regression model was fitted on the extracted components, and R2 was used to determine whether the components were a good fit for predicting economic growth. The results from the study showed that the first component explained 73.605 % of the overall Variance and was highly correlated with 15 original variables. Additionally, the second principal component described approximately 10.03% of the total Variance, while the two variables had a higher positive loading into it. About 6.22% of the overall variance was explained by the third component, which was highly correlated with only one of the original variables. The first, second, and third models had F statistics of 2385.689, 1208.99, and 920.737, respectively, and each with a p-value of 0.0001<5% was hence implying that the models were significant. The third model had the lowest mean square error of 17.296 hence described as the best predictive model. Since component 1 had the highest Variance explained, and model 1 had a lower p-value than other models, Principal component 1 was more reliable in explaining economic growth. Therefore, it was concluded that the macroeconomic variables associated with the monetary economy, the trade and openness of the economy with government activities, the consumption factor of the economy, and the investment factor of the economy predict economic growth in Kenya. The study recommends that PCA should be utilized when dealing with more than 15 variables, and hierarchical regression model building technique be used to determine the partial variance change among the independent variables in regression modeling.
References
-
Pearson K. Principal components analysis. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science. 1901; 6(2): 559.
Google Scholar
1
-
Zou H, Hastie T, Tibshirani R. Sparse Principal Component Analysis. Journal of Computational and Graphical Statistics. 2006; 15(2): 265-286.
Google Scholar
2
-
Corner S. Choosing the right type of rotation in PCA and EFA. JALT Testing & Evaluation SIG newsletter. 2009;13(5): 38-45.
Google Scholar
3
-
Esmaeili A, Shokoohi Z. Assessing the effect of oil price on world food prices: Application of principal component analysis. Energy Policy. 2011; 39(2): 1022-1025.
Google Scholar
4
-
Njoroge E, Njoroge G, Muriithi D. Evaluating Secondary School Examination Results: Application of Principal Component Analysis. Journal of Statistical and Econometric Methods. 2014; 3(2): 31-46.
Google Scholar
5
-
Boligon A, Vicente I, Vaz R, Campos G, Souza F, Carvalheiro R et al. Principal component analysis of breeding values for growth and reproductive traits and genetic association with adult size in beef cattle1. Journal of Animal Science. 2016; 94(12): 5014-5022.
Google Scholar
6
-
Field, A. P. Discovering statistics using SPSS (2nd edition). London: SAGE Publication, 2005.
Google Scholar
7
-
Granato D, Santos J, Escher G, Ferreira B, Maggio R. Use of principal component analysis (PCA) and hierarchical cluster analysis (HCA) for the multivariate association between bioactive compounds and functional properties in foods: A critical perspective. Trends in Food Science & Technology. 2018; 72: 83-90.
Google Scholar
8
-
Richardson D, Hamra G, MacLehose R, Cole S, Chu H. Hierarchical Regression for Analyses of Multiple Outcomes. American Journal of Epidemiology. 2015; 182(5): 459-467.
Google Scholar
9
-
Hussain A, Sabir H, Kashif M. Impact of macroeconomic variables on GDP: Evidence from Pakistan. European Journal of Business and Innovation Research. 2016; 4(3): 38-52.
Google Scholar
10
-
De Roos A, Poole C, Teschke K, Olshan A. An application of hierarchical regression in the investigation of multiple paternal occupational exposures and neuroblastoma in offspring. American Journal of Industrial Medicine. 2001; 39(5): 477-486.
Google Scholar
11
-
Brown J. Choosing the right number of components or factors in PCA and EFA. JALT Testing & Evaluation SIG Newsletter. 2009; 13(3): 20-25.
Google Scholar
12
-
Lever J, Krzywinski M, Altman N. Principal component analysis. Nature Methods. 2017; 14(7): 641-642.
Google Scholar
13
-
Liu Q, Cook N, Bergström A, Hsieh C. A two-stage hierarchical regression model for meta-analysis of epidemiologic nonlinear dose-response data. Computational Statistics & Data Analysis. 2009; 53(12): 4157-4167.
Google Scholar
14
-
Field A. An Adventure in Statistics: The Reality Enigma Ed. 1. London: SAGE Publications, 2016.
Google Scholar
15
-
Marczyk G, DeMatteo D, Festinger D. Essentials of research design and methodology. John Wiley & Sons Inc., 2005.
Google Scholar
16
-
Fan J, Liao Y, Wang W. Projected principal component analysis in factor models. The Annals of Statistics. 2016; 44(1): 219.
Google Scholar
17
-
Trustrum K, Fox J. Regression Diagnostics: An Introduction. The Statistician. 1993; 42(2): 201.
Google Scholar
18
-
Gelman A, Goodrich B, Gabry J, Vehtari A. R-squared for Bayesian Regression Models. The American Statistician. 2019; 73(3): 307-309.
Google Scholar
19
Most read articles by the same author(s)
-
Teddy Mutugi Wanjuki,
Victor Wandera Lumumba,
Emmanuel Koech Kimtai,
Morris Kateeti Mbaluka,
Elizabeth Wambui Njoroge,
Comparative Analysis of GARCH-Based Volatility Models of Financial Market Volatility: A Case of Nairobi Security Market PLC, Kenya , European Journal of Mathematics and Statistics: Vol. 5 No. 4 (2024) -
Teddy Mutugi Wanjuki,
Adolphus Wagala,
Dennis K. Muriithi,
Forecasting Commodity Price Index of Food and Beverages in Kenya Using Seasonal Autoregressive Integrated Moving Average (SARIMA) Models , European Journal of Mathematics and Statistics: Vol. 2 No. 6 (2021) -
Teddy M. Wanjuki,
Adolphus Wagala,
Dennis K. Muriithi,
Evaluating the Predictive Ability of Seasonal Autoregressive Integrated Moving Average (SARIMA) Models using Food and Beverages Price Index in Kenya , European Journal of Mathematics and Statistics: Vol. 3 No. 2 (2022) -
Mburu S. Njoroge,
Gladys G. Njoroge,
Adolphus Wagala,
Vector Error Correction Model: Prediction of Bi-Directional Causality between Gross Domestic Product and Wage Growth in Kenya , European Journal of Mathematics and Statistics: Vol. 3 No. 4 (2022) -
Kimutai K. Emmanuel,
Adolphus Wagala,
Dennis K. Muriithi,
Singular Spectrum Analysis: An Application to Kenya’s Industrial Inputs Price Index , European Journal of Mathematics and Statistics: Vol. 3 No. 1 (2022) -
Eric M. Ndege,
Dennis K. Muriithi,
Adolphus Wagala,
Application of Asymmetric-GARCH Type Models to The Kenyan Exchange Rates , European Journal of Mathematics and Statistics: Vol. 4 No. 4 (2023)