University of Ghana  http://ugspace.ug.edu.gh
UNIVERSITY OF GHANA
A COMPARATIVE ANALYSIS OF FORECAST
PERFORMANCE BETWEEN SARIMA AND SETAR MODELS
USING MACROECONOMIC VARIABLES IN GHANA
BY
NAFISAH BADAWIISE AHMED
(10598809)
A THESIS SUBMITTED TO THE DEPARTMENT OF STATISTICS AND
ACTUARIAL SCIENCE, UNIVERSITY OF GHANA IN PARTIAL FUFILLMENT
OF THE REQUIREMENT FOR THE DEGREE OF MPHIL. STATISTICS
July, 2018
University of Ghana  http://ugspace.ug.edu.gh
DECLARATION
CANDIDATE’S DECLARATION
This is to certify that this thesis is the result of my own extensive research and reading,
under the guidance of my supervisors and that no part of it has been presented for
another degree in this university or elsewhere.
SIGNATURE:............................. DATE:..........................
NAFISAH BADAWIISE AHMED (10598809)
SUPERVISORS’ DECLARATION
We hereby certify that this thesis was prepared from the candidate’s own work and
supervised in accordance with guidelines on supervision of thesis laid down by the
University of Ghana.
SIGNATURE:.................................. DATE:..................................
DR. FELIX OKOE METTLE
(Principal Supervisor)
SIGNATURE:.................................. DATE:..................................
DR. RICHARD MINKAH
(Co-Supervisor)
i
University of Ghana  http://ugspace.ug.edu.gh
DEDICATION
This work is dedicated to my beloved parents Hajia and Mr. Issah Bawourun Ahmed
and also to my dear cousin Mr. Mohammed Nazif Senje.
ii
University of Ghana  http://ugspace.ug.edu.gh
ACKNOWLEDGMENT
This research has had a long development period, to which I am indebted to many
en route. My sincerest gratitude first and fore mostly goes to my supervisors, Dr.
Felix O. Mettle and Dr. Richard Minkah, thanks so much for the fatherly advices and
corrections given me throughout my research. This acknowledgment would not be
complete without the mention of my friends Mr. Osman Sherif, Dr. Nasiru Suleman,
Mr. Abdul Ghanniyu Abubakari, Miss Ama Dickson, Mr. John Essel, Mr. Badi Abdul
Sayibu and Miss Christophelia Adjei-Danso. I am forever grateful for all the help
rendered me in studies and words of encouragement given me each time I needed to
share my frustrations with someone. I pray God grant you all your heart desires. Also,
I am grateful to all the lecturers and staff of the university.
iii
University of Ghana  http://ugspace.ug.edu.gh
ABSTRACT
Most macroeconomic variables such as; inflation, GDP and others have been described
by most financial and economics time series analysts to exhibit nonlinear behaviour.
Therefore, to cater for this behaviour, the nonlinear class of models have been largely
adopted to model and forecast such time series. In this study, the Keenan and
Tsay tests for linearity showed inflation and CIC rates follow threshold nonlinear
processes. Hence, the two-regime SETAR model was adopted to accommodate these
nonlinearities in the datasets. Using the linear SARIMA model as a benchmark
for comparative analysis. Results from both in-sample and out- of- sample forecast
performance using MAE and RMSE measures revealed that, the nonlinear SETAR
model outperformed the linear SARIMA model for inflation. This was however
different for CIC rates, since the Linear SARIMA model turned to outperform the
nonlinear SETAR model. Further analysis of forecast accuracy using the Diebold-
Mariano test showed there was no significant difference between the two models for
inflation but, there was significant difference between both models for CIC rates.
Nevertheless, it is recommended that, continuous monitoring of these models, review
market conditions and necessary adjustments are vital to make realistic use of these
models.
iv
University of Ghana  http://ugspace.ug.edu.gh
CONTENTS
DECLARATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
DEDICATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii
ACKNOWLEDGEMENT . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
ABBREVIATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
1 INTRODUCTION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Background of study . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Problem Statement . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Objectives of the study . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3.1 General objective . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3.2 Specific Objectives . . . . . . . . . . . . . . . . . . . . . . . 5
1.4 Significance of study . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.5 Research Questions . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.6 Organisation of Study . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2 LITERATURE REVIEW . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.2 Brief introduction to SARIMA model . . . . . . . . . . . . . . . . . 7
v
University of Ghana  http://ugspace.ug.edu.gh
2.3 Related studies on forecast performance of SARIMA model . . . . . . 8
2.4 Brief introduction to SETAR model . . . . . . . . . . . . . . . . . . 12
2.5 Related studies on forecast performance of SETAR models . . . . . . 13
2.6 Related studies on comparative studies of other linear and nonlinear
models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.7 Chapter Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
3 METHODOLOGY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3.2 Research Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3.3 Source of Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3.4 Trend Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
3.5 Unit Root Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
3.5.1 Phillips-Perron (PP) Test . . . . . . . . . . . . . . . . . . . . 24
3.5.2 Kwiatkowski-Philips-Schmidt-Shin (KPSS) Test . . . . . . . 25
3.6 Autoregressive AR(p)Model . . . . . . . . . . . . . . . . . . . . . . 27
3.7 Moving Averages (MA(q)) Model . . . . . . . . . . . . . . . . . . . 28
3.8 Autoregressive Moving Averages (ARMA (p,q)) . . . . . . . . . . . . 29
3.9 Autoregressive Integrated Moving Averages (ARIMA) . . . . . . . . 29
3.10 Seasonal Autoregressive Integrated Moving Averages (SARIMA) model 30
3.11 Self Exciting Threshold Autoregressive (SETAR) model . . . . . . . 32
3.12 Linearity Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
3.13 Ramsey Regression Equation Specification Error Test (RESET) . . . . 34
3.14 Keenan-test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
3.15 Tsay’s F -test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
3.16 Criterion for Model Selection . . . . . . . . . . . . . . . . . . . . . . 37
3.17 Model Diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
3.17.1 Ljung-Box Test . . . . . . . . . . . . . . . . . . . . . . . . . 38
3.17.2 ARCH-LM Test . . . . . . . . . . . . . . . . . . . . . . . . . 39
3.18 Diebold-Mariano Test . . . . . . . . . . . . . . . . . . . . . . . . . . 40
vi
University of Ghana  http://ugspace.ug.edu.gh
3.19 Markov Chain . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
3.20 Jarque-Bera Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
3.21 Chapter Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
4 DATA ANALYSIS AND DISCUSSIONS . . . . . . . . . . . . . . . . . . 43
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
4.2 Preliminary Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . 43
4.2.1 Preliminary Analysis of Inflation rate . . . . . . . . . . . . . 44
4.2.2 Preliminary Analysis for CIC rate . . . . . . . . . . . . . . . 46
4.3 Further Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
4.3.1 SARIMA modelling . . . . . . . . . . . . . . . . . . . . . . 49
4.3.2 SETAR Modelling . . . . . . . . . . . . . . . . . . . . . . . 57
4.3.3 Comparative Analysis of forecast between SARIMA and
SETAR models . . . . . . . . . . . . . . . . . . . . . . . . . 63
4.4 Results and Discussion . . . . . . . . . . . . . . . . . . . . . . . . . 67
4.5 Chapter Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
5 CONCLUSION AND RECOMMENDATIONS . . . . . . . . . . . . . . 70
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
5.2 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
5.3 Recommendations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
vii
University of Ghana  http://ugspace.ug.edu.gh
LIST OF ABBREVIATION
AIC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Akaike Information Criterion
AICc . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Akaike Information Criterion Corrected
ACF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Autocorrelation Function
ADF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Augmented Dickey-Fuller
ANN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Artificial Neural Network
AR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Autoregressive
ARCH . . . . . . . . . . . . . . . . . . . . . . . . . . . .Autoregressive Conditional Heteroscedasticity
ARCH-LM . . . . Autoregressive Conditional Heteroscedasticity Lagrange Multiplier
ARIMA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Autoregressive Integrated Moving Average
ARMA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Autoregressive Moving Average
ASEAN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Association of Southeast Asian Nations
BIC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bayesian Information Criterion
BIVARMA . . . . . . . . . . . . . . . . . . . . . . Bilinear Vector Autoregressive Moving Average
BoG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Bank of Ghana
CIC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Currency in Circulation
CNB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Czech National Bank
DF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Dickey-Fuller
DGP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Data Generating Process
ECB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .European Central Bank
viii
University of Ghana  http://ugspace.ug.edu.gh
EGARCH . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Exponential Generalised ARCH
GARCH . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Generalised ARCH
GDP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Gross Domestic Product
GNP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Gross Net Product
KPSS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Kwiatkowski-Phillips-Schmidt-Shin
LCL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Lower Confidence Limit
LSTAR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Linear Smooth Transition Autoregressive
MA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Moving Average
MAE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Mean Absolute Error
MAPE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Mean Absolute Percent Error
Max . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Maximum
Min . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Minimum
MPE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Mean Percent Error
MSE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Mean Square Error
OLS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Ordinary Least Square
PACF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Partial Autocorrelation Function
PP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Phillips -Perron
RESET . . . . . . . . . . . . . . . . . . . . Ramsey Regression Equation Specification Error Test
RMSE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Mean Square Error
RMSPE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Root Mean Square Percentage Error
SARIMA . . . . . . . . . . . . . . . . . . . Seasonal Autoregressive Integrated Moving Average
ix
University of Ghana  http://ugspace.ug.edu.gh
SETAR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Self-Excited Threshold Autoregressive
SSR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Sum of Squared Error
STAR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Smooth Transition Autoregressive
Std . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Standard Deviation
TAR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Threshold Autoregressive
UCL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Upper Confidence Limit
UK . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . United Kingdom
VAR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Vector Autoregressive
VARMA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Vector Autoregressive Moving Average
ZA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Zivot Andrews
x
University of Ghana  http://ugspace.ug.edu.gh
LIST OF TABLES
4.1 Descriptive Statistics of Inflation Rates . . . . . . . . . . . . . . . . . 44
4.2 Descriptive Statistics of CIC Rates . . . . . . . . . . . . . . . . . . . 47
4.3 Unit root test for inflation . . . . . . . . . . . . . . . . . . . . . . . . 49
4.4 AIC, RMSE, MAE and MAPE of candidate seasonal ARIMA Models
for Inflation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
4.5 Estimated Parameters of ARIMA(3,1,3)(1,0,1)12 for inflation . . . . 51
4.6 Estimate Parameters of ARIMA(1,1,2)(1,0,1)12 for inflation . . . . . 52
4.7 Estimate Parameters of ARIMA(3,1,3)(2,0,1)12 for inflation . . . . . 52
4.8 Residual diagnostic tests for Seasonal ARIMA models for inflation . . 53
4.9 Unit Root test for CIC rate . . . . . . . . . . . . . . . . . . . . . . . 54
4.10 AIC, RMSE, MAE and MAPE of candidate seasonal ARIMA Models
for CIC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
4.11 Estimated parameters of ARIMA (0,1,0) (3,0,3)12 for CIC . . . . . . . 55
4.12 Estimated parameters of ARIMA(0,1,0)(1,0,3)12 for CIC . . . . . . 56
4.13 Estimated parameters of ARIMA(0,1,0)(1,0,1)12 for CIC . . . . . . 56
4.14 Residual diagnostic tests for Seasonal ARIMA models for CIC . . . . 56
4.15 Linearity test for inflation rate . . . . . . . . . . . . . . . . . . . . . 58
4.16 AIC and MAPE values of tentative SETAR models . . . . . . . . . . 58
4.17 Estimate Parameters of SETAR (2;14,4) for inflation . . . . . . . . . . 59
4.18 Estimate Parameters of SETAR (2;14,5) for inflation . . . . . . . . . . 60
4.19 Estimate Parameters of SETAR (2;14,13) for inflation . . . . . . . . . 61
4.20 Residual diagnostics of tentative SETAR models for inflation . . . . . 61
4.21 Linearity test for CIC rate . . . . . . . . . . . . . . . . . . . . . . . . 62
4.22 AIC and MAPE candidate models for CIC . . . . . . . . . . . . . . . 62
xi
University of Ghana  http://ugspace.ug.edu.gh
4.23 Estimate Parameters of SETAR(2;13,9) for CIC . . . . . . . . . . . . 63
4.24 Residual Diagnostics of suggested SETAR models for CIC . . . . . . 63
4.25 Forecast results of competing SARIMA models for inflation . . . . . 64
4.26 Forecast results of competing SETAR models for inflation . . . . . . . 65
4.27 Forecast accuracy test for inflation . . . . . . . . . . . . . . . . . . . 65
4.28 Forecast results of competing SARIMA models for CIC . . . . . . . . 66
4.29 Forecast result of SETAR model for CIC . . . . . . . . . . . . . . . . 66
4.30 Forecast accuracy test for CIC . . . . . . . . . . . . . . . . . . . . . 67
xii
University of Ghana  http://ugspace.ug.edu.gh
LIST OF FIGURES
4.1 Time series plot of inflation . . . . . . . . . . . . . . . . . . . . . . . 45
4.2 ACF and PACF plots of inflation rates . . . . . . . . . . . . . . . . . 45
4.3 ACF and PACF plots of differenced inflation . . . . . . . . . . . . . . 46
4.4 Time Series plot of CIC . . . . . . . . . . . . . . . . . . . . . . . . . 47
4.5 ACF and PACF plots of CIC . . . . . . . . . . . . . . . . . . . . . . 48
4.6 ACF and PACF plots of differenced CIC . . . . . . . . . . . . . . . . 48
4.7 ACF plots of residuals for inflation . . . . . . . . . . . . . . . . . . . 54
4.8 ACF plots of residuals for CIC . . . . . . . . . . . . . . . . . . . . . 57
xiii
University of Ghana  http://ugspace.ug.edu.gh
CHAPTER 1
INTRODUCTION
1.1 Background of study
The simple linear model gained popularity in modeling and forecasting most
macroeconomic variables until the 1980’s. Tong (1978) and Tong and Lim (1980)
first proposed the threshold concept of nonlinear class of models after a challenge was
thrown to a group of time series analysts in 1977 during an ordinary Royal Statistical
Society meeting.
For the purpose of this study, macroeconomics is defined as a branch of economics
which deals with the study of the whole economy characterized by variables. Inflation,
Gross Domestic Product (GDP) and unemployment rates are major examples of
macroeconomic variables that have received great attention in most economic and
statistical researches worldwide with Ghana not being an exception. This is so because
macroeconomic variables are the major indicators of an economy. For this reason,
governments, policy makers and investors have developed keen interests in researches
on macroeconomic variables. It is therefore not surprising to hear governments
boasting of attaining single digit inflation, creating jobs especially for the youth
and stabilizing an economy during its regime. Also, it is no lame reason that per
Ghanaian setting, governments appoint a statistician whose sole responsibility is to
constructively advise government on strategies to keep the economy in check and
balance.
According to Marcellino (2007), the World War II crisis socially, economically and
politically had an influence on modelling of macroeconomic variables worldwide. In
1
University of Ghana  http://ugspace.ug.edu.gh
this, many literatures have suggested time varying and non-linear class of models to
be better off in dealing with such macroeconomic variables as compared to linear
specifications in modelling and forecasting.
In a typical Ghanaian market, the dynamics in the business cycle associated with
events such as economic expansion and recession results in discontinuities in a time
series either as a result of fiscal or monetary policies by governments. Researchers
such as Williams and Miller (1990) and Carreno and Madinaveita (1990) proposed
some modifications to deal with these discontinuities. Other researchers such as
Rosas and Guerrero (1994) attempted to cater for discontinuities in a time series data
by examining an exponential smoothing forecast subject to a number of constraints
taking into consideration seasonal variations.
Tsay and Tiao (1984), Campbell and Shiller (2001), Clements and Jeremy (2001) and
Aidoo (2011) have also described most macroeconomic variables as nonlinear and
hence propose the nonlinear class of models as best for forecasting such variables.
However, other researchers such as Gooijer and Kumar (1992) are of divergent view
that there is no clear evidence either in favour of linear or nonlinear models in terms
of their forecast performance.
The fact that time series generated from macroeconomic variables appear irregular by
nature, it is imperative to study, model and detect this asymmetric behaviour. Again,
because of this irregular behaviour of these variables, the simple linear models cannot
be used in their modelling and forecasting to produce accurate results.
Recently, quite a number of nonlinear class of models such as the Artificial Neural
Networks (ANN) and the Threshold Autoregressive (TAR) models, have been
proposed and extensively used for modelling and forecasting time series that appear
not to be linear. These nonlinear models are classified as regime switching models
2
University of Ghana  http://ugspace.ug.edu.gh
but are further divided into “Threshold models” and “Markov Switching models”
depending on the state process evolution between the two models.
The nonlinear models, “threshold models” introduced by Tong (1978) and later
reviewed by Potter (1999) triggered by regime shifts in relation to an unknown
threshold are considered to capture irregularities in the data generation process of a
time series to be used for modelling and forecasting. The Self- Exciting Threshold
Autoregressive (SETAR) models, a special case of the TAR model is accommodating
in terms of structural changes in regimes or discontinuities in the data generation
process. For this reason, the SETAR model is considered in this study to model
some macroeconomic variables in Ghana. Again, according to Marcellino (2007) the
nonlinear ANN model is a very powerful tool for any kind of nonlinear behavior but
they are quite difficult to interpret from an economic perspective. Likewise, seasonal
ARIMA (SARIMA) model is proposed as a benchmark for the study since it is
comparatively easier to model and use for forecasting trended time series at regular
intervals. Researchers such as Swanson and White (1995) have described the in-
sample forecast of the SETAR models to outperform their linear counterpart but cannot
vouch for their out of sample forecast. It is therefore imperative to consider both the in
sample and the out of sample performance of the two models.
1.2 Problem Statement
The Linear class of time series models have received much attention over the years
in both theory and applied research. Although, the linear class of models has widely
been used in several areas of study, they usually do not capture vital features of most
macroeconomic and financial data. Since the business cycle for most economies is
bound to go through either an expansion or recession, data from these variables such
as inflation, unemployment rate and GDP go through a system of structural breaks and
behavioural changes. It is therefore reasonable to assume that each kind of data set
may require a different type of time series model to explain and model it at different
3
University of Ghana  http://ugspace.ug.edu.gh
times.
In detecting and modelling the nonlinear behaviour of a time series, it seems a natural
concept is to consider the nonlinear class of models as proposed by researchers
like Watier and Richardson (1995), Camphell and Shiller (2001), Hsu et al. (2010)
and Gharleghi et al. (2014) to effectively explain irregular behaviourial changes in
macroeconomic and financial time series.
Models that assume the dynamic changes or discontinuities in time series are usually
termed as Autoregressive models. Examples of such models include threshold
autoregressive model, self-exciting threshold autoregressive (SETAR) and smooth
transition autoregressive (STAR) models. The simple autoregressive models are
arguably most frequently used time series models that can be easily estimated and
interpreted. An extension of an AR process to cater for the nonlinear behaviour of
time series makes the entire process nonlinear thereby making the nonlinear models
easy to understand and explain.
Recently, Gharleghi et al. (2014) found that the nonlinear SETAR model outperformed
their linear counterpart, the ARIMA model. According to Aidoo (2011), little of
research has been done in comparing the forecast performance between SARIMA and
SETAR models in an application to Ghana’s inflation rate. It is in this similar spirit that
this research will consider a case study of macroeconomic variables in Ghana. This
research will produce multiple models which will be more accurate in choosing the best
performing model since macroeconomic variables operate under different conditions
though interrelated.
4
University of Ghana  http://ugspace.ug.edu.gh
1.3 Objectives of the study
1.3.1 General objective
To identify macroeconomic variables that satisfy the conditions of SARIMA and
SETAR models and compare their in-sample and out-of-sample forecast performance.
1.3.2 Specific Objectives
i. Identify appropriate linear and nonlinear models for forecasting macroeconomic
variables.
ii. To examine the in-sample forecast performance of both models.
iii. To examine the out- of- sample forecast performance of both models.
1.4 Significance of study
This study will contribute to existing literature for researchers who wish to work on
similar studies in the near future. The study will also identify appropriate models that
will be useful to policy makers at all relevant levels, such as investors, governmental
and non-governmental organisations, in the effective design of economic strategies to
combat expected and unexpected changes in business cycles and also influence their
financial planning.
1.5 Research Questions
i. What is the behavior of these macroeconomic variables?
ii. Which linear SARIMA model is appropriate for modelling each macroeconomic
variable?
ii. Which nonlinear SETAR model is appropriate for modelling each
macroeconomic variable?
5
University of Ghana  http://ugspace.ug.edu.gh
ii. Which of the two models performs better in forecasting?
1.6 Organisation of Study
The structure of the thesis is as follows: Chapter one comprises the introduction
and it caters for the following: background, problem statement, research questions,
objectives of study and organisation of study. Chapter two reviews literature, which
comprises: introduction, brief introductions of SARIMA and SETAR models and
examines existing literature on the performance of both models and other linear and
nonlinear models. It goes further to show what this study seeks to achieve. Chapter
three focuses on the methodology of the study, whereas chapter four deals with data
analysis and results and chapter five deals with conclusions and recommendations of
the study.
6
University of Ghana  http://ugspace.ug.edu.gh
CHAPTER 2
LITERATURE REVIEW
2.1 Introduction
This chapter deals with brief introduction to SARIMA and SETAR models, related
studies on forecast performance of SARIMA model, related studies on forecast
performance of SETAR model and lastly other related studies on comparative forecast
performance between linear and nonlinear models.
2.2 Brief introduction to SARIMA model
The simple linear model Autoregressive Integrated Moving Averages (ARIMA)
has received much attention in modelling and forecasting time series data ranging
from economics, health, agriculture and finance just to mention a few. The ARIMA
model is considered to be a special kind of a regression model which comprises an
Autoregressive (AR) and a Moving Average (MA) component that are dependent on
past values and past errors respectively. A mixture of the AR and the MA models
produces a mixed model called the ARMA model.
The ARIMA model was popularized by Box and Jenkins (1976), the “I” represents a
differencing or an integration of a time series to become stationary. ARIMA models
are comparatively good in modelling and forecasting as compared to other linear time
series models. Researchers such as Meese and Geweke (1984), Marcellino (2007),
Stock and Watson (2003) have confirmed this in their works.
However, these models are not capable of capturing both seasonal and non-seasonal
7
University of Ghana  http://ugspace.ug.edu.gh
patterns in a time series. An extension of these models to capture seasonality is a
Seasonal Autoregressive Integrated moving Average (SARIMA) model. A time series
is described as exhibiting seasonality if there exist a regular pattern in changes that
repeats over a number of time intervals until the patterns repeat again. For monthly
data, even changes usually occurs in specific months in the time series data. SARIMA
models are well known for their statistical modelling and forecasting but are however
unable to extract nonlinear relationships within the time series data.
2.3 Related studies on forecast performance of
SARIMA model
This section reviews existing literature from various fields where the SARIMA model
has been used.
An economics related thesis by Aidoo (2010), revealed that SARIMA(1,1,1)(0,0,1)12
best represent the behaviour of Ghana’s inflation rate considering monthly data
from July 1991 to December 2009 obtained from the Ghana statistical service. In
the study, the best model based on various selection criteria was used to forecast
Ghana’s inflation rate for the next seven months. The forecasted values demonstrated
a decrease in Ghana’s inflation rate for the forecast period with a turning point in the
month of July.
In a related study by Nasiru and Sarpong (2012), SARIMA(3,1,3)(2,1,1)12 was
found to be the most appropriate modelling approach for monthly inflation in Ghana
using Box-Jenkins methodology for the period between January 1990 to January
2012. Fannoh et al. (2014) further revealed SARIMA(0,1,0)(2,0,0)12 to best examine
short term data on inflation from January 2006 to December 2006 for Liberia while
considering seasonality.
8
University of Ghana  http://ugspace.ug.edu.gh
Moreover, Otu et al. (2014) developed a model for forecasting Nigeria’s inflation rate.
In their study, Seasonal ARIMA model was applied to short term monthly inflation
rate collected from November 2003 to October 2013. SARIMA(1,1,1)(0,0,1)12 was
adjudged the best model to forecast inflation rates for Nigeria after meeting all model
assumptions. Forecasted inflationary rates for November 2013 to November 2014
displayed a downwards trend for the first quarter of 2014 but made a U-turn at the
beginning of the second quarter in the same year. Additionally, there was an upwards
trend in inflation values till the month of September. They posited that volatility in
Nigeria’s inflation rates is attributed to many economic factors.
Again, Nasiru et al. (2013) compared the efficiency of ARIMAX and SARIMA
models in predicting monthly currency in circulation for Ghana. Their predicted
results indicated that, both models were good for prediction. Further diagnostic checks
revealed both models were free from conditional heteroscedasticity and high-level
autocorrelations. However, they recommended continuous evaluation of these models
to improve forecasting performance.
Luguterah et al. (2013) proposed SARIMA(0,1,1)(0,1,1)12 as the best model for
predicting currency in circulation for Ghana. They employed Seasonal ARIMA in
their model building and forecasting for Monthly data obtained from the Bank of
Ghana from January 2000 to December 2011. From their results, the model with
minimum values of AIC, BIC and AICc was chosen. Diagnostic checks of the model
residuals with Ljung-Box and ARCH-LM tests showed the model was free from
higher level autocorrelation and conditional heteroscedasticity.
In Agriculture, Akter and Rahman (2010) generated forecasts for milk supply of
dairy cooperative in UK applying seasonal Holt winter’s model and seasonal ARIMA
model. They revealed that the forecasts produced less than three-percent errors and
hence proposed both techniques to be effective in forecasting.
9
University of Ghana  http://ugspace.ug.edu.gh
A health-related study by Martinez and da Silva (2011), resulted in developing
the most appropriate prediction model for incidence of dengue cases in Campinas,
southeast Brazil using SARIMA model in application to reported monthly dengue
data. From their results, SARIMA(2,1,3)(1,1,1)12 was proposed to be the best
model fit in forecasting dengue cases in Brazil after satisfying all model developing
assumptions. By their study, researchers were advised to take into account climate
changes when dealing with data on dengue cases for Brazil in future.
Yet again, a health monitored study by Zhang (2013) predicted the incidence of
typhoid fever in China using SARIMA and three different inspired neural network
models. They applied the various models to data obtained from the Chinese center
for disease control and prevention from 2005 to 2010. Their results showed that the
SARIMA model outperformed the three other models.
An investigative research in the energy sector by Oduro-Gyimah et al. (2012) on
microwave transmission of the Yeji- Salaga link, collated receive signal level data
between January 2006 to December 2010. They found SARIMA(1,1,1)(0,2,2)12
as the best model to predict microwave transmission pattern for 24-months. They
also reported that the SARIMA model was the best model to use for time series with
seasonal complexity.
Another energy sector study by Abledu (2013), pointed SARIMA(1,1,1)(0,1,2)12 as
the best fit model to explain the pattern of short-term energy data collected within the
time frame 2010–2011. The developed model was then used to make predictions for
the year 2013.
In addition, Asamoah-Boaheng (2014) determined an efficient model using seasonal
ARIMA model to forecast average temperature for the Ashanti Region of Ghana with
10
University of Ghana  http://ugspace.ug.edu.gh
data between 1980 to 2013 from the meteorology and climate department. Out of
the three models built in their study, the SARIMA(2,0,2)(2,1,1)12 model had the
least values of AICs and BICs and so was chosen as most adequate for predicting
average temperatures. There was neither a decreasing nor increasing pattern in the
average temperatures but there existed some amount of seasonality in the data after
decomposition of the time series.
While examining the rainfall pattern by Abdul-Aziz et al. (2013) in the Ashanti
region of Ghana, the study revealed rainfall patterns changes over time using data
from 1974 to 2010. Their results further showed a slight decrease in rainfall figures
from August to December months and also that, there was likely to be an increase
in rainfall figures from February to march months. Afrifa-Yamoah et al. (2016)
too, examined the rainfall pattern for Brong Ahafo Region of Ghana and proposed
SARIMA(0,0,0)(1,1,1)12 as the best model to forecast rainfall patterns for the region.
Their results revealed increases in rainfall figures for the months of September and
October and minimum figures for January, February and December months for the
region using data from 1975-2009.
Recently, Dritsaki (2016) examined the precision of forecasting of Greece’s
unemployment rate following procedures of Box-Jenkins methodology and
demonstrated that SARIMA(0,2,1)(1,2,1)12 had the best predictive power and hence
was used to forecast unemployment rates for Greece. Correspondingly, Mahipan et al.
(2013) compared SARIMA and ANN models and found SARIMA(0,1,1)12 to be the
best predictive model for unemployment rates in Thailand after satisfying all model
assumptions. They suggested that the ANN model did not fit the unemployment data
adequately hence the choice of the SARIMA model for them.
11
University of Ghana  http://ugspace.ug.edu.gh
2.4 Brief introduction to SETAR model
Tong in an editorial report, recounts an ordinary royal statistical society meeting in
1977 presided over by the then David Cox. He explained the need to revisit all the
lynx data models widely researched by Mr. Campbell, Dr. Tong and Professor Walker
since they are time reversible. In addition, Dr. Granville Tunnicliffe asked, “would we
not prefer a model which would exhibit stable periodic deterministic behaviour a limit
cycle?” such limit cycles cannot arise from linear models. Long before this meeting,
there hardly existed the concept of nonlinearity in time series since time series at that
time operated on the assumption of linearity.
As events unfolded during the meeting, a challenge was thrown to all time series
analysts to go find a practical explanation to the chaotic nature of time series like data
for lynx which appeared not to be linear. This task was indeed a daunting one, since
the concept was new and therefore needed much experimentation.
Tong determined on raveling the mystery of this “non-linearity”, focused on data of
animal population and river flow. He observed two advantages for these data sets.
Firstly, a better insight to the dynamics underlying the data. Which indeed was a
revelation to Dr. Granville’s comments on “limit cycles”. Secondly, specific sets
of data were needed to constantly check the methodology if it headed in the right
direction. Keen on his quest to finding a better explanation to nonlinearity, between
the years 1977 -78, asked his research student, Lim to carry out a task using simulation
but she misunderstood and showed the results obtained by recursion of a SETAR
model. This actually was a blessing in disguise as it offered Tong the opportunity to
have a glimpse of the SETAR model for the first time. By the later part of 1980, after
reading widely and huge experimentation Tong wrote a paper entitled “Threshold auto
regression, limit cycles and cyclical data” to a UK journal but unfortunately for him it
was rejected. Though saddened, he tried the platform of the Royal Statistical Society
12
University of Ghana  http://ugspace.ug.edu.gh
and fortunately for him the paper was accepted. Although accepted, there was quite a
lot to be done to smoothen the rough edges like “how to choose the threshold variable”
in a time series. As a matter of fact, this threshold concept was a very broad one and
therefore needed further research to explain adequately the behaviour of a time series.
This therefore gave other researchers like Watier and Richardson (1995) and Clements
and Smith (1999) to research further on nonlinear models.
Narrowing the concept of this threshold Autoregression to internal changes within a
time series brought about “SETAR”. The SETAR model actually is a typical example
of the threshold Autoregressive (TAR) model. The threshold model deals with general
discontinuities in a time series unlike the SETAR model, internal discontinuities in
a time series derives the term “self- exciting”. A set of different linear AR process
characterized by regime switches according the value of their threshold makes the
entire process a nonlinear process.
2.5 Related studies on forecast performance of SETAR
models
Quiet a number of literature exists in various areas where SETAR models have been
applied. This section makes a casual review of such literature ranging from health,
agriculture to economics just to mention a few.
Investigating several approaches to obtain multi-step forecast ahead for SETAR
models, Clements and Smith (1997) based their study on existing literature of h-step
ahead forecast values. In their study, they compared the forecast performance of
SETAR models to AR models using various methods of simulation for US Gross
National Product (GNP). Their results expressed some doubt on the robustness of
SETAR models over two sample periods and on the size of the forecast accuracy
there was no doubt SETAR models could deliver. They therefore concluded that
13
University of Ghana  http://ugspace.ug.edu.gh
improvements for forecast performance of linear AR model depended largely on
historical events. They also concluded that, the type of nonlinearity in a time series
contributes to the out of sample forecast performance.
Further investigations by Feng and Liu (2002) revealed features of nonlinearity in
Canadian GDP. In their study, they compared the out of sample forecast performance
of non-linear SETAR model and a standard linear ARIMA model. Employing
techniques for forecasting like the1- step forecast ahead and multi-step forecast ahead,
they revealed that both models appears satisfactory in terms of their MAPEs and
RSMPEs. As a result, they proposed both models for predicting the Canadian GDP.
They however are of the view that, it is more practical to use the multi-step technique
in real life situations.
For more than forty years of existence and application of nonlinear models, Clements
et al. (2004) reviewed the current state -of- art of these models ranging from
estimation, evaluation and selection of best forecasting models among others for
some economic and financial time series. They argued that, evidence of non-linearity
in these time series are patchy and hence proposed a variety of future topics for
upcoming researchers.
In an economics related study, Christos (2005) evaluated the performance of
competing models for UK’s unemployment rate from January 1971 to December
2002. Firstly, it was found that, MA(4)− ARCH(1) provided better forecasts for
unemployment rate. On the other hand, two forecasting samples pointed that MA(4)
model does well, whereas both MA(1) and AR(4) proved to be the best forecasting
models for the other two forecasting periods. Hence, an empirical evidence suggests
that there exist a close relationship between forecasting theory and labour market
conditions. Further findings bring to the conclusion that, forecasting methods are
nearer to the realities of UK labour market.
14
University of Ghana  http://ugspace.ug.edu.gh
Another economics related study by Marcellino (2007), evaluated the relative
performance of a bootstrap algorithm to standard linear models in an application to
bootstrap series of GDP and inflation. Considering a variety of evaluation criteria, the
standard linear models was suggested to be hardly beaten in prediction if only they are
carefully specified.
Similar economics study by Sjoberg (2010), compared the forecast performance of
ARIMA, LSTAR and SETAR models using monthly Swedish Industrial production
from 28 branches over the period January1990 to December 2008. In terms of
their mean square error of prediction, the ARIMA model was adjudged the best
model for majority of the industrial branches. For some industrial branches and in
a few branches, the LSTAR and SETAR models also outperformed the ARIMA model.
Equally, Gharleghi et al. (2014) evaluated the prediction performance of SETAR-
EGARCH, ARIMA-EGARCH and AR-EGARCH models by employing data from
three ASEAN currencies, namely; Indonesian Rupiah, Malaysian Ringgit and Thai
Baht. They revealed that, the non-linear SETAR–EGARCH model fits better than the
other two models considering factors like the data generation process and therefore
concluded that, the data generation process of the variables is nonlinear.
van Ruth (2014), further applied several linear and non-linear models to monthly
inflation rates for Netherlands over the period 1970-2009. In an empirical approach,
the nonlinear STAR model was revealed to outperform all other linear and nonlinear
approaches to explain the dynamics of inflation in the Netherlands.
A comparative study by Gibson and Nur (2011) investigated the performance of a
two regime AR and STAR models. From their results they revealed that, the in-
sample forecast of the STAR remains problematic since it properly does not replicate
15
University of Ghana  http://ugspace.ug.edu.gh
the observed behaviour of the time series like the SETAR model does. They further
revealed that, the SETAR model outperforms the STAR model in terms of their out- of
-sample forecast.
Another comparative study by Akeyede et al. (2015), compared different forecasting
methods proposed in existing literature to obtain h-step ahead forecast. They further
compared the multi- step forecast performance of linear and nonlinear time series
via method of simulation. In conclusion, they proposed AR model to be best for
forecasting AR functions and that LSTAR model supersedes all other non-linear
forecasting methods except for polynomials. SETAR models beats all other models in
terms of their prediction ability.
Likewise, Badawi (2016) in an unpublished master’s thesis, compared the forecast
performance between SARIMA and SETAR models using data on pneumonia cases
in the northern region of Ghana. The data was obtained from the Tamale Teaching
Hospital over the period January, 2000 to October, 2015 based on the forecast values
from the two models, the SETAR model was adjudged the best in terms of their MSE,
RMSE and also other forms of criterion selection processes.
Again, Karlsson and Karlsson (2016) modelled Sweden’s unemployment rate by
exploiting model efficiency of SARIMA, SETAR and VAR models. As a measure of
goodness of fit test, they concluded that, the SARIMA and SETAR models performs
better than the VAR model in terms of their out of sample forecast using quarterly
unemployment data within the life span 1983 to 2010 from OECD. They further
revealed that, short term forecasts are better off than longer term forecasts.
Analysing structural changes in the Taiwan stock market, Hsu et al. (2010) found
the SETAR model to better examine the out of sample forecast of the non-linear
time series. Employing monthly data from January 2005 to December 2009, the best
16
University of Ghana  http://ugspace.ug.edu.gh
model was chosen by performing a unit root test and comparing the out of sample
forecast between the standard linear ARIMA model and the nonlinear SETAR model.
A graphical view of their results showed that the structural change in the time series
occurred in the month of June 2008. Thereby they built a 2–regime SETAR model. In
conclusion the nonlinear SETAR model was found to be superior in forecasting to the
linear ARIMA model in the Taiwan stock market.
Contrary to the norm of comparing the performance of models and prediction, Öznur
and GÜNERİ§ (2015) developed a hybrid models whose components are either
parametric or non- parametric models. They used the technique of back fitting
algorithm based on smoothing spline to producer price index in Turkey. Their results
showed that, the hybrid model AAR and SETAR performs best amongst all other
hybrid models.
Recently, Zak (2017) analysed the forecast performance of nonlinear models in
application to Czech Republic’s exchange rate against EUR. Secondary data from
three sources namely; Czech National Bank (CNB) Board decisions minutes,
Statistical Data Warehouse of European Central Bank (ECB) website and from the
press releases of Governing Council of ECB was obtained within the periods 1999 to
2016 for the study. Employing SETAR models and simple random walk they showed
that the simple random walk model out performs the complex SETAR model. This
further supports the assertion exchange rates are generally difficult to forecast.
Again, Fırat (2017) applied SETAR modelling process to explain the non-linear pattern
in modelling currencies in EUR/USD, EUR/ TRY and USD/TRY parities. The study
compared the forecast performance of the observed values for both linear and nonlinear
models. It was concluded that, the non-linear SETAR model, outperformed the other
forms of nonlinear and linear models.
17
University of Ghana  http://ugspace.ug.edu.gh
2.6 Related studies on comparative studies of other
linear and nonlinear models
In the field of engineering, Rodas et al. (2017) compared forecast performance
between linear and nonlinear models in application to a DC motor. In their study,
they adopted two criteria for measuring the performance of both models namely;
tracking error and the control effort. Their results indicated that, the linear controller
presents a significant advantage over their nonlinear counterpart when experimenting
at operation speeds very close to the operation argument.
A medical evaluation by Murphy and Dietrich (2006), assessed two classes of control
loop algorithms namely; the linear adaptive filter and the adaptive nonlinear neural
networks for highly irregular patient behaviours. From their discussion, they explained
the nonlinear neural networks demonstrated robust adaptability to all of the observed
breathing patterns of patients but their linear counterpart failed.
Motivated by the need to establish a Vector Autoregressive Moving averages
(VARMA) model, Iwok and Etuk (2009) developed a general bivariate Vector
Autoregressive Moving Averages (BIVARMA) model. In their model development,
they graphically compared the two competing models based on their estimates and
residual variances. Furtherly, they established the fact that, the BIVARMA model
performed better than the VARMA model.
Other aspects of human life are not left out in the application of linear and nonlinear
models as demonstrated by Isaac et al. (2016) in modelling road accidents in
Ghana. They used count regression models in their study as they compared several
models like Poisson, Negative Binomial and Conway- Maxwell-Poisson to fit count
data expected in the field of transportation. The best models were then chosen
using Akaike Information Criterion and Bayesian Information Criterion. From their
18
University of Ghana  http://ugspace.ug.edu.gh
results, the Negative Binomial model appeared to perform better than the other models.
A number of peer review researches showed the existence of rich literature on
macroeconomic variables. For instance, van Gysen et al. (2013) comprehensively
compared the predictive accuracy of linear and non-linear models in forecasting
financial returns. In their study, they considered varieties of macroeconomic variables
from Johannesburg’s stock exchange and their results showed that markov switching
models provided the best in- sample fit. Nevertheless, they are of the view that, the
out- of-sample forecast of the mixed models, EGARCH linear models and 2-state
dynamic regression models outperform their alternative models.
As well, Cuestas et al. (2011) considered inflation for a study by applying non-linear
and fractional integration as a hypothesis of empirical inflation for a pool of African
countries. Their discussions revealed that, most African countries do not support the
hypothesis. They therefore proposed that, the LSTAR models are more stable.
Furthermore, Hadrat et al. (2015) compared performance between Artificial Neural
Networks (ANN) and the traditional Autoregressive and Vector Autoregressive in
application to Ghana’s inflation rate. Their results indicated that, ANNs had less errors
than their traditional counter parts. In conclusion, the ANNs models were suggested
to accurately forecast Ghana’s inflation.
Several inflationary studies in Ghana has found nonlinear models to be the best model
but Antwi (2017) is of a divergent view. Firstly, the existence of nonlinear pattern
in monthly inflation for Ghana using nonlinear models was applied. Secondly, the
threshold models; SETAR and LSTAR were applied on the monthly inflation data
obtained from the Ghana statistical service within the period January 1981 to August
2016.
19
University of Ghana  http://ugspace.ug.edu.gh
Finally, the performance of the threshold models was compared to AR(1) and AR(2)
models using minimum values of AIC and BIC. In conclusion, both SETAR and
LSTAR models fitted the data best but in terms of their forecasting ability the simple
AR models outperformed their non-linear alternatives. Also, it was revealed that
Ghana’s inflation was likely to experience double digit in the year 2017.
Lastly, investigations by Nchor et al. (2016) established a linear and nonlinear
relationship between macroeconomic variables and oil prices in Ghana. They
considered six variables namely; real industry value added, real government
expenditure, the real effective exchange rate, real oil price, real imports and inflation.
Vector Auto regression (VAR) and Vector Error Correction Model (VECM) used in the
study pointed that, irrespective of a positive or a negative change in oil prices, there
definitely would be an impact on macroeconomic variables.
2.7 Chapter Conclusion
This chapter has comprehensively dealt with review of literature from other works
relevant to the study. Review of these literatures offered the opportunity to encounter
diverse methodology employed by researchers to forecast and predict various forms
of macroeconomic variables, health related cases, climate related cases and more.
However, in most of the studies, various forms of linear and nonlinear models such
as ARIMA and SETAR, SARIMA and ANN’s have extensively been used. In the
case of Ghana, little has been done in relation to applying macroeconomic variables to
compare their forecast performance. Based on this research gap identified, this study
employed the Seasonal Autoregressive Integrated Moving Average (SARIMA) models
and Self Excited Threshold Autoregressive (SETAR) models in predicting inflation and
Currency in circulation (CIC) of Ghana
20
University of Ghana  http://ugspace.ug.edu.gh
CHAPTER 3
METHODOLOGY
3.1 Introduction
This chapter presents the methods and techniques used to achieve the objectives of
the study. It contains the statistical techniques, analytical procedures and theories that
were employed to achieve the stated objectives of the study. Similarly, the chapter will
be sectioned into source of data, unit root test, trend analysis, AR(p) model, MA(q),
mixed ARMA(p,q) model, ARIMA(p,d,q) model, SARIMA(p,d,q)(P,D,Q)s model,
SETAR(2; p,d) model, linearity test, model estimation, model diagnosis and Diebold-
Mariano test.
3.2 Research Design
This study takes the form of an experimental approach since this concept is imperative
to forecasting phenomenon and research design to explain better the possible trends in
the datasets to be considered.
3.3 Source of Data
The study obtained secondary datasets of two macroeconomic variables; namely
inflation and currency in circulation rates spanning from January 1990 to August 2016,
from the website of the Bank of Ghana (BoG). An analytical method was employed
for the description of these macroeconomic variables and also for comparative
performance of the nonlinear SETAR and linear SARIMA models using R statistical
software.
21
University of Ghana  http://ugspace.ug.edu.gh
3.4 Trend Analysis
Generally, the long- term change in time series levels over time longer than a year
tends to reflect a trend in the time series. Linear, exponential, nonlinear and quadratic
trends are some types of trends exhibited by the time series. This study will evaluate
the performance of the aforementioned trend models in forecasting inflation and
currency in circulation. These models are however, presented either as additive or
multiplicative. The linear trend in the additive model does not depend on the trend
unlike the multiplicative model which intensifies in its variation as the trend increases.
If the trend in the time series is a linear function of time then,
Z(t) = β0 +β1t + εt , t = 1,2, . . . ,n. (3.1)
The logarithmic representation of the linear trend is
logZ(t) = β0 +β1t + εt . (3.2)
And also, for an exponential representation of the series
Z(t) = eβ0+β1t+εt . (3.3)
where Z(t) represents observations of the time series, time (t), (t = 1,2, · · · ,n− 1,n)
and εt is the error term, whereas β0 and β1 are constants.
If the series shows a parabola shape then it exhibits a quadratic trend represented as
Z(t) = β0 +β1t +β2t2 + εt . (3.4)
If β1 < 0, then a plot of the series eventually turns to negative. But if β2 > 0, then a
plot of the series sky-rockets up.
22
University of Ghana  http://ugspace.ug.edu.gh
For a polynomial of order k,
Z(t) = β +β t +β t2 + · · ·+β tk0 1 2 k εt . (3.5)
Equations (3.1) to (3.5) were obtained by the application of the concept of ordinary
least squares (OLS).
3.5 Unit Root Test
As described by Aidoo (2010) and Nasiru et al. (2013), most time series of
macroeconomic variables exhibit trending behaviour in their mean, autocorrelation
and variance through graphical plots of their series. By this, the time series turns
to be non-stationary if the coefficients of the estimated model are in absolute value
less than one. This means that, if the roots of the characteristic equation lie outside
the unit circle, then the series is considered stationary. Principal examples of such
macroeconomic variables include; inflation, exchange rates, stock prices and currency
in circulation.
An important task to dealing with such macroeconomic variables is to first check for
stationarity in the dataset, this idea is what is called a unit root. The unit root test is
usually performed to check for the existence of a stochastic or a deterministic trend in
the series. In practice the ARMA model is non-stationarity, but attains stationarity if
transformed and differenced. This process brings about the integration factor (I) in the
term ARIMA model, this in reality makes the model stationary and effective for any
forecast to produce accurate results.
A good number of time series related research have employed graphical and statistical
tests in checking for the presence of unit root in a series. Dickey-Fuller and Augmented
Dickey-Fuller (DF-ADF), Kwiatkowski, Philips, Schmidt and Shin (KPSS), Zivot
23
University of Ghana  http://ugspace.ug.edu.gh
Andrews (ZA), Phillips-Perron tests are typically used examples for statistical tests. In
the graphical approach an Autocorrelation Function (ACF) plot is observed and if there
exists a strong and slow decaying ACF, then it suggests a deviation from stationarity.
Hence, the series must be transformed and differenced to attain stationarity. However,
one needs to be cautious when differencing a non-stationary series since over
differencing may exclude relevant information to building an appropriate model for
any meaningful predictions. To determine seasonal and non-seasonal behaviours in a
series, various forms of tests must be conducted under both seasonal and non-seasonal
parts each using their ACF and PACF plots. Relevant tests to this study will be
discussed below.
3.5.1 Phillips-Perron (PP) Test
The Phillips- Perron test has received much attention in analysis of most financial time
series. This unit root test is quite different from the Augmented Dickey Fuller (ADF)
test in terms of their approach in dealing with errors arising from serial correlation and
heteroskedasticity. Also, this test unlike the ADF test, makes use of non-parametric
procedures in their test regression. By so doing, they correct any form of serial
correlation in the test regression by modifying the Dickey-Fuller test statistic. The
test regression for the PP tests can be written as follows:
∆yt = βDt +πyt−1 +µt (3.6)
where ∆ is a difference operator, β is a constant, D(t) is a time trend, π is the dickey
fuller test statistic and µt is an error term.
Under the null hypothesis π = 0, this means non-stationarity.
By modifying the Dickey-Fuller test statistic tπ=0 and Tπ , the statistics zt and zπ are
formed. Just like the ADF test statistic, these modified statistics are asymptotically
24
University of Ghana  http://ugspace.ug.edu.gh
distributed and normalized bias with a limiting distribution.
( ) 1 ( ) ( )
σ̂2 2 1 λ̂ 2− σ̂2 2
Zt = ·
T SE(λ̂ )
tπ=0− · 2 (3.7)λ̂ 2 2 λ̂ 2 σ̂
− 1 T
2SE(π̂)
Zπ = T π̂ (λ̂ 22 − σ̂
2) (3.8)
2 σ̂
with parameters λ̂ 2 and σ̂2 as consistent estimates of sample and long-run variances
respectively.
T
σ2 = lim T−1 E[µ2] (3.9)
T−→∞ ∑ tt=1
T
λ 2 = lim ∑ E[T−1S2T ] (3.10)T−→∞ t=1
(3.11)
where ST = ∑Tt=1 µt . The estimators σ̂2 and λ̂ 2, are consistent residual estimates (µt)
of the sample and the long-run variances respectively.
Although, the ADF has some advantages over the PP test, the PP tests are relatively
good in dealing with large samples because of their asymptotic assumption, their
robustness to any form of heteroskedasticity in error terms makes it more advantageous
than the ADF test. Furthermore, there are no mandatory specification for lagged values
as in the case of the ADF.
3.5.2 Kwiatkowski-Philips-Schmidt-Shin (KPSS) Test
The KPSS test is a complementary unit root test for testing a null hypothesis whose
observable time series is trend-stationary against an alternative hypothesis of a unit
root, this however is the opposite hypothesis for most unit root tests. In addition, the
existence of a unit root does not guarantee stationarity but, by design, trend stationary.
25
University of Ghana  http://ugspace.ug.edu.gh
Thus, it has become imperative to distinguish between some behaviour of time series
which may be non-stationary, non-unit root yet trend- stationary. If the data generating
process assumes no linear trend, then
Yt = zt + εt (3.12)
where zt is a random walk i.e. zt = zt−1 +µ 2t , and µt is a random error term ∼ (0,σv )
assumed to be i.i.d and ε is also a stationary error.
The hypothesis to tested is as follows;
H 20 : σv = 0
H1 : σ2v > 0.
If H0 is true, then it means that, Yt becomes a stationary process since it has components
of stationary εt and constant terms. The test statistics proposed by Kwiatkowski et al.
(1992) is given below:
1 T s2
KPSS = ∑ tT 2 σ̂2t=1 ∞
where T is the number of observations, st = ∑ti=1Ŵi with Ŵi = Yi− Ŷ and σ̂2i ∞ is an
estimator of σ2∞ = T−1Var(∑Tt=1 εt) is a long-run variance estimator of the εt process.
The observable time series yt attains stationary with an integration of order one (I(1))
and the denominator quantity in KPSS statistic is an estimator of its variance, which
has a deterministic limit. The term in the denominator ensures that overall; the limiting
distribution is free of unknown nuisance parameters. If, however, it is integrated of
order one (I(1)), the numerator will grow without bounds, causing the statistic to
become large for large sample sizes. The null hypothesis of stationarity is rejected
for large values of KPSS.
26
University of Ghana  http://ugspace.ug.edu.gh
3.6 Autoregressive AR(p)Model
A time series yt is said to be an autoregressive model if future values of the series
depend on past values. Where there exists some amount of correlation between
preceded and succeeded values of the time series, then the model becomes more
effective for prediction. The word “auto” from the term “autoregressive” meaning
“self” only considers past values of a time series to model its behaviour. The AR
model is measured by a simple linear regression model which usually comprises a
time series regression of past values plus an error term. The order of AR model is
paramount to model building since it measures the immediate preceding values in the
time series to be used to predict the succeeding value. In that case, if the preceding
model is of first-order or order one, then it is written as AR(1) and represented as:
yt = φ0 +φ1yt−1 + εt . (3.13)
For second order, AR(2)
yt = φ0 +φ1yt−1 +φ2yt−2 + εt . (3.14)
More generally, for order p, AR(p)
yt = φ0 +φ1yt−1 +φ2yt−2 + · · ·+φpyt−p + εt . (3.15)
where observations of the time series are denoted by the y’s, εt is the random error
at time period t, the terms t − j and φ j are the AR parameters to be estimated with
j = 1,2, · · · , p as AR orders. For simplicity, the backward shift operator B, such that
By 2t = yt−1, B yt = yt−2 and so on is used to simply the above general model of order
p into:
φ(B)yt = εt
27
University of Ghana  http://ugspace.ug.edu.gh
where φ(B) = 1−φ1B−φ B22 −·· ·−φpBp is a polynomial of order p.
For a stationary AR(p) model,
y = φ−1t (B)εt
converges to zero.
3.7 Moving Averages (MA(q)) Model
A time series yt is said to be a moving average process if present values of the series
depends on past errors or white noise values. Usually this model is classified as a
“naive” model due to its ability to stabilise datasets by creating another dataset with
less variations and also it does not show traces of seasonal trend in the dataset. For
this reason, they are designed to provide smoothing effect within a time series which
may be relevant or not to future estimations of a variable. But more importantly, it
provides a “tidier” dataset for estimation.
The general form of the MA model is in the form:
yt = θ1εt−1 +θ2εt−2 + · · ·+θqεt−q + εt . (3.16)
where θ j are the model parameters to be estimated, with j = (1,2, · · · ,q), q is the
order of the model and ε is the error term assumed to be i.i.d with mean zero and
constant variance σ2. No matter the independent average values of the error terms, the
MA process always assumes stationarity.
Equivalently, using the backward shift operator (B), this can be written as:
yt = θ(B)εt
where θ(B) = 1+ θ1B+ θ2B2 + · · ·+ θ Bqq is a polynomial in B of order q, and as
28
University of Ghana  http://ugspace.ug.edu.gh
usual εt is the normally distributed error term with mean zero and a constant variance
σ2. If the characteristic roots of the polynomial mq + θ mq−1 + θ q−21 2m + · · ·+ θq is
an invertible MA(q) process equals to zero, then the absolute terms are less than one.
3.8 Autoregressive Moving Averages (ARMA (p,q))
The mixed model ARMA(p,q) is a linear combination of the Autoregressive (AR) and
Moving Average (MA) model time series models, where p and q are orders of AR and
MA processes respectively. Mostly the ARMA(p,q) model is given as:
yt = φ1yt−1 +φ2yt−2 + · · ·+φpyt−p + εt +θ1εt−1 +θ2εt−2 + · · ·+θqεt−q
where φi and θ j are the respective AR and MA parameters to be estimated with
i = 1,2, · · · , p and j = 1,2, · · · ,q, εt’s are the i.i.d random errors with mean zero and a
constant variance σ2.
For simplicity, the backward shift operator (B) is used to express the ARMA(p,q)
model as:
φ(B)yt = θ(B)εt
with φ(B) and θ(B) as polynomials in B of orders p and q. The ARMA(p,q) process
is said to be stationary if the roots of the AR component of the characteristic function
converges to zero and also, if the polynomial of the MA component is less than one in
absolute terms, then the process is described to be invertible.
3.9 Autoregressive Integrated Moving Averages
(ARIMA)
The ARIMA model has been extensively used in theory and practice to study and
forecast the behaviour of most time series variables with macroeconomic variables not
29
University of Ghana  http://ugspace.ug.edu.gh
being an exception. In general, they are assumed to be the stationary form of ARMA
models by transforming and differencing the time series. However, when differencing
the model to become stationary, care should be taken so as not to over difference
the series otherwise the data might not fit well into the model and hence results in
inaccurate predictions. What the ARIMA model actually does is to remove any form
of “noise” from both the AR and MA components of the model to make it stationary
and then predicts future values with the transformed model. The non-seasonal ARIMA
model is basically denoted as ARIMA(p,d,q) model, where the parameters p,d,q are
the AR, differenced and MA terms greater than or equal to zero respectively.
The general form of the ARIMA model using the backward shift operator is:
φ(B)(1−B)dyt = θ(B)εt .
3.10 Seasonal Autoregressive Integrated Moving
Averages (SARIMA) model
A time series model is said to be a seasonal ARIMA model if it has the structure of an
ARIMA model with two components to capture seasonal and non-seasonal parts of the
model. When a time series is found to exhibit seasonal patterns, it is usually advisable
to carefully identify models that are relevant to capture this behaviour. The Seasonal
ARIMA model is usually denoted in the form ARIMA(p,d,q)∗ (P,D,Q)s where;
p is the number of non-seasonal AR component
q is the number of non-seasonal MA component
d is the order of non-seasonal differencing
P is the number of seasonal AR component
Q is the number of seasonal MA component
30
University of Ghana  http://ugspace.ug.edu.gh
D is order of seasonal differencing
s can either be represented monthly as, s = 12, or on quarterly basis, s = 4 and
so on.
Using the backward shift operator (B), the Seasonal ARIMA is given as:
φ(B)Φ(B)s.
When a time series data exhibit seasonal behaviour, the ARIMA model is usually not
able to capture the behaviour along the seasonal part of the series, hence, the tendency
for wrong order selection for the non-seasonal component. Identification of relevant
models and inclusion of suitable seasonal variables are therefore necessary when a time
series data exhibit periodic patterns. The SARIMA model therefore has the advantage
of capturing both seasonal and non seasonal components. The general SARIMA model
can be expressed using the backward shift operator (B) as:
φ(B)Φ(B)s(1−B)d(1−B)sy = θ(B)Θ(Bst )εt .
where
φ(B) = 1−φ B−φ B2−·· ·−φ Bp1 2 p ,
Φ(Bs) = 1−Φ s1B −Φ 2s2B −·· ·−φPBPs,
θ(B) = 1+θ B+θ B2 q1 2 + · · ·+θqB ,
θ(Bs) = 1+θ1Bs +θ2B2s + · · ·+θ QsQB ,
and
yt is the time series observation at time period (t),
B is the backward shift operator,
εt is a sequence of error term with mean zero and constant variance (σ2),
31
University of Ghana  http://ugspace.ug.edu.gh
φ(i) and Φ j are the non- seasonal and seasonal AR components respectively,
θi and Θ j are the non- seasonal and seasonal MA components respectively.
3.11 Self Exciting Threshold Autoregressive (SETAR)
model
An extension of an AR model results in a special case of the general Threshold
Autoregressive (TAR) model called the SETAR model. This model was first introduced
by Howell Tong in 1977 through a seminal paper and later by Tong and Lim (1980).
Also, this model explains better the data generation process of a time series especially
where changes in behaviour occur from one regime to another. Typically, these models
are employed to understand and predict regime switching behaviours for parameters
of higher orders with flexibility. The switch from one regime to another depends on
the past values of the time series (hence the Self-Exciting portion of the name). This
model is usually presented in the form (k, p) where k indicates the different regime
changes, p also indicates the order of AR. Furthermore, this model can be thought of
as a set of different AR models changing according to their lagged values or threshold
variables triggered by changes in a delay parameter d. The two regime SETAR model
of order p as given by Boero and Marrocu (2004) can be presented as follows:
 (1) p(1) (i) (1)
yt =
φ0 +∑i=1 φ1 yt−i + εt if yt−d ≤ τ
(3.17)
φ (2) ∑p(2) (i) (2)0 + i=1 φ1 yt−i + εt if yt−d > τ
where φ (1) and φ (2)i i are coefficients in lower and higher regimes respectively needed
to be estimated, τ is the threshold value, d is the delay parameter, yt−d is the threshold
variable which describes the switch from one regime to another regime, p(1) and p(2)
are orders of AR in lower and higher regimes respectively and εt’s are random error
terms which are i.i.d with mean zero and constant variance σ2. One major advantage
of the nonlinear SETAR model is its ability to capture the irregular behaviour of a time
32
University of Ghana  http://ugspace.ug.edu.gh
series which cannot be captured by the “naïve” linear models. In this study, the two-
regime SETAR model of the simplest form (2; p;d) will be considered and also the
procedure discussed by Franses and van Dijk (2000) will be used for effective model
selection, furthermore the basic approach and tests for modelling SETAR model will
be discussed below.
3.12 Linearity Test
For an observable time series yt , the SETAR model can only be applied if the series
under consideration is found to be nonlinear or irregular in nature under the hypothesis:
H0 : linearity exists
H1 : nonlinearity exists.
Sometimes, in testing for a certain nonlinear feature in the Data Generating Process
(DGP) of a series, the H1 is specifically pre-chosen but in some cases, the H1 is
relatively general.
In order to apply the SETAR model, one must first check for the existence of nonlinear
behaviour in the series, if there exists any, then an appropriate linear AR(p) model is
specified to test for nonlinearity in the series. A discussion paper by Franses and van
Djik revealed that, the maximum lag order is based on the choice of the minimum
AIC value in an AR(p) model. After careful determination of the linear AR(p), then
various forms of linearity tests such as; Tsay F-test and Keenan test, a special case of
the Ramsey Regression Equation Specification Error Test (RESET) test will be used to
test for linearity. Below are brief discussions of the tests to be used in this study.
33
University of Ghana  http://ugspace.ug.edu.gh
3.13 Ramsey Regression Equation Specification Error
Test (RESET)
Ramsey (1969) proposed linear least square regression analysis for specific tests based
on the view that, nonlinearity occurs in the diagnostics of a fitted model only if the
linear model residuals are correlated in some power terms. Thus, this test focuses on
specific errors in the linear regression including those from unmodeled nonlinearity
and its readily applicable to linear AR(p) models.
Now consider a linear AR(p) model:
yt = φ0 +φ1yt−1 +φ2yt−2 + · · ·+φpyt−p + εt .
In the first stage in a RESET test, the least square estimate φ̂ , the error term which
is denoted by εt = yt − ŷt and sum of square residual given by SSR0 = ∑n 2t=1+p ε̂t are
obtained.
Also, in the Second stage, we consider the linear regression
εt = X ′t−1b+Y
′
t−1d +Vt
where Xt−1 = (1,X 2 s+1t−1, · · · ,Xt−p) and Yt−1 = (Xt , · · · ,Xt ) for s≥ 1.
Then the least square residuals are computed as follows;
V̂t = ε̂t−X ′t−1b̂−Y ′t−1d̂.
Thirdly, SSR1 is computed as:
n
SSR1 = ∑ V̂ 2t
t=1+p
34
University of Ghana  http://ugspace.ug.edu.gh
For an adequate AR(p), b and d parameters must be zero. This can be tested in the
final stage by the usual F statistic given by:
(SSR0−SSR1)/gF = with g = s+ p+1
(SSR1)/(n− p−g)
which under normality and linearity has Fg,n−p−g.
3.14 Keenan-test
Considering an irregularly behaved observation, Keenan (1985) proposed nonlinearity
test for such time series. This test can be well- thought as a special case of
the Ramsey Regression Equation Specification Error Test (RESET) as discussed
above. Precisely, a modification of the second stage of the RESET test using X2t avoids
any form of multicollinearity between X2t and Xt−1. Particularly, Keenan assumes that
the series can be approximated by the Volterra expansion as follows:
∞ ∞ ∞ ∞
Xt = µ + ∑ ∑ θuεt−u + ∑ ∑ θuvεt−uεt−v
u=−∞ v=−∞ u=−∞ v=−∞
where µ is the mean level of the nonlinear observation Xt , with the error terms εt−u and
εt−v. The idea of the Keenan’s test is of a similar view as the F-test and so clearly from
linear approximation ∑∞ ∞u=−∞ ∑v=−∞ θuεt−u = 0. Similarly, the Keenan test follows the
same procedure used in the RESET test above to obtain fitted estimate (X̂t), error terms
(ε̂t) , the residual set (ξt)and SSR. And lastly, we calculate
∑n
η t=p+1
ε̂t ξ̂t
= .
∑n ξ̂ 2i=p+1 t
The test statistic of the Keenan test is:
2 (n−2p−2)F̂ = η
SSR−η2
where;
35
University of Ghana  http://ugspace.ug.edu.gh
P = lag order of the autoregressive process
n = Sample size considered
SSR = Sum of Square Residual of the AR(p) process
and the null hypothesis;
∞ ∞
H0 : ∑ ∑ θuεt−u = 0
u=−∞ v=−∞
with assumption that εt’s has zero mean and constant variance.
When the null hypothesis is satisfied, F̂ is approximately F-distributed with 1 and
n−2p−2 degrees of freedom. The H0 of linearity is rejected if the p – value associated
with F̂ is small (p-value <α) or when the value of F̂ is greater than the selected critical
value of the F- distribution with 1 and n−2p−2 degrees of freedom.
3.15 Tsay’s F -test
Tsay (1989) introduced the Tsay test for detecting nonlinearity in an observable time
series after improving on the power of the Keenan (1985) test in 1986. He generalized
the Keenan test by explicitly checking for quadratic serial dependence in the series,
by doing so, he catered for the disaggregated nonlinear variables. The first stage of
this test is the same as the first stage of the Keenan test. Unlike the second stage
in the Keenan test, the second stage of this test involves regression of the products
Xt−iXt− j, i = j = 1, . . . , p on (1,Xt−1,Xt−2, . . . ,Xt−p). Hence, its corresponding test
statistic F is asymptotically distributed as Fm,n−m−p−1, where m =
p(p−1)
2 , with the
hypothesis as;
H0 : linear AR(p) model
H1 : non-linear threshold model
36
University of Ghana  http://ugspace.ug.edu.gh
3.16 Criterion for Model Selection
In order to select tentative models that fits the datasets well, it is imperative to
consider various model selection criteria. The Akaike Information Criterion (AIC),
the Akaike Information Criterion corrected (AICc) and the Bayesian Information
Criterion (BIC) are examples of model selection criteria frequently used in most
time series research. In this study too, the AIC and other model selection criteria
are used. To select appropriate models from competing models, models with
minimum or least values of AIC are considered for further validation. This is because
these least values have been described as closest to truth. Again, this study considers
models with more parameters as tentative models since it improves goodness of fit test.
The AIC model can be given as:
AIC = 2k−2log(L)
where
K denotes the number of parameters in the model,
L is the maximized value of the likelihood function.
In the case of the SETAR model, the AIC model is given as follows:
AIC (p1, p2) = n1 ln σ̂2 21 +n2 ln σ̂2 +2(p1 +1)+2(p2 +1)
where
n1 and n2 are the observations in regimes1 and 2.
σ̂2 and σ̂21 2 represents the variance of residuals in regimes 1 and 2.
p1and p2 denotes the minimized information criterion selected lag orders in
regimes 1 and 2.
37
University of Ghana  http://ugspace.ug.edu.gh
3.17 Model Diagnostics
After carefully selecting tentative models to be used for forecasting, we check the
residuals of the models to ensure that, their time series satisfy all model assumptions.
This step is paramount to making any meaningful inferences with the models. So,
we employ the Ljung-Box and ARCH-LM tests on the model residuals as discussed
below:
3.17.1 Ljung-Box Test
Ljung and Box (1978), described this test as a diagnostic tool used to check for the
presence or absence of serial correlations in the residuals of a fitted model. Thus, a
time series with any specified lag orders, say order m examines autocorrelations in
the residuals. Instead of testing randomness at each distinct lag, usually it tests the
"overall" randomness based on a number of specified lags, and so for this reason it is
referred as a portmanteau test. The test procedure is given as follows;
The hypothesis to be tested is;
H0 : Residuals are uncorrelated up to order m
H1 : Residuals are uncorrelated up to order m
The test statistic is
m ρ̂2
Qm = n(n+2) ∑ k
k=1 n− k
where
ρ̂2k represents residual autocorrelations of the series at lag m
m is the number lags being tested,
n is the number of residuals.
We reject the null hypothesis if the value of Qm is greater than the chi-square table
value. Hence, residuals are correlated. However, if the value of Qm is less than the chi-
38
University of Ghana  http://ugspace.ug.edu.gh
square table value, we do not reject the null hypothesis. So, we conclude that the model
residuals are uncorrelated and therefore the model fits the dataset well. Otherwise the
whole estimation process has to be repeated again in order to get the most adequate
model.
3.17.2 ARCH-LM Test
For residuals of a fitted model to become adequate enough to produce meaningful
forecast values, the model must satisfy all model assumptions. Thus, the residuals
should have constant variance. Hence, the reason why Engle (1982) proposed
the ARCH-LM test to cater for issues of conditional heteroscedasticity in squared
residuals. The test procedure is as follows;
The hypothesis is;
H0 : There is no heteroscedasticity in the model residuals
H1 : There is heteroscedasticity in the model residuals.
and the test statistic is
LM = nR2 (3.18)
where n is the number of observations and R2 is the coefficient of determination of
auxiliary residual regression.
e2t = β
2 2
0 +β1et−1 +β2et−2 + · · ·+β e2q t−q + εt
where et is the residual and q is the length of ARCH lags.
We reject the null hypothesis if P-values are less than 5% level of significance, and
hence we conclude the existence of heteroscedasticity in the model residuals. Also,
p-values greater than 5% significance level indicates there is no heteroscedasticity.
Hence, we conclude the model has constant variance.
39
University of Ghana  http://ugspace.ug.edu.gh
3.18 Diebold-Mariano Test
Forecasting has been described as the last step to modelling procedures following the
approach of Box-Jenkins. To ensure competing models are statistically significant,
Diebold and Mariano (1995) proposed a test popularly known as the Diebold-Mariano
test. The purpose of this test is to verify forecast accuracy between competing models.
In this test, we assess forecasts between two competing models by considering their
mean squared errors for statistical significance. Nevertheless, lower values of mean
square errors of one forecast in comparison to the other do not necessarily translate
into superiority of that forecast.
The hypothesis is given as;
H0 : There is no significant difference between the two models
H1 : There is significant difference between the two models.
Following a standard normal distribution, the test statistic is given as;
[ ]− 1
DM = V̂ (d̂) 2 d̂ (3.19)
where;
d 2 2t = et1−et2 is the difference between the sets of squared error forecasts from the two
competing models.
And d̂= 1 ∑TT t=1 dt is the mean, T= no of residuals.
40
University of Ghana  http://ugspace.ug.edu.gh
3.19 Markov Chain
The Markov chain is the smallest form of a Markov model. This Chain models the state
of a system with changing random variable throughout time. This model operates on
the Markov property which suggests that, the distribution of a current variable depends
on previous distribution states. The Markov Chain Monte Carlo, an application of
the Markov Chain uses the Markov property to prove that a particular method for
performing a random walk will sample from its joint distribution. The model has the
ability to model the behaviour of a system by constructing consistent block diagrams,
fault trees and more. This model too can handle errors and recovery at detailed levels
in modelling. Despite its advantages, it is cumbersome to handle since it requires large
number of states for analysis. It is also quite difficult to construct and interpret though
it models types of complexity solution techniques that are feasible but for only small
models. In the study of a particular system, it does not offer the required speed and
accuracy for the study.
3.20 Jarque-Bera Test
The Jarque-Bera test, a type of Lagrange multiplier named after Carlos Jarque and Anil
K. Bera is used for checking normality in a data. Normality is one of the assumptions
used in most statistical tests, just like the t or F tests. In order to check normality in
a data, it is prudent to consider values of Skewness, (i.e. perfect symmetry around
the mean) and Kurtosis (i.e. how much tailed a data is and how peaked the data is).
However, it is needless to know the mean or standard deviation for the data before
performing the test.
The test statistic is defined as;
[ ]
b1 (b2−3)
2
JB = n +
6 24
41
University of Ghana  http://ugspace.ug.edu.gh
where;
n is the sample size
b1 is the sample skewness
b2 is the sample Kurtosis
with the hypothesis is given as;
H0 : The data is normally distributed
H1 : The data does not come from a normal distribution.
Generally, the null hypothesis is rejected for smaller p-values less than 5% level of
significance. Hence, the data under consideration is said not to be normally distributed.
3.21 Chapter Conclusion
The chapter has dealt with statistical techniques and methods which include; the unit
root tests, linearity tests and Box-Jenkins methodology for both linear and nonlinear
models. All these techniques are appropriate for the study due to their advantages over
other time series.
Due to the cumbersome nature of large data, the unit root test is preferred for checking
stationarity as compared to the graphical approach. The Box-Jenkins methodology is
also most appropriate in terms of forecasting for shorter periods of time as compared to
other time series like Markov chain which is difficult to interpret. Again, the normality
test is an appropriate approach to understand the distribution of most data rather than
assuming all datasets follow normal distribution.
42
University of Ghana  http://ugspace.ug.edu.gh
CHAPTER 4
DATA ANALYSIS AND DISCUSSIONS
4.1 Introduction
This chapter deals extensively with the results and discussions obtained from the study.
It includes preliminary and further analyses of datasets, the results and discussions.
In the preliminary analysis, the source from which data was obtained and general
characteristics of the time series datasets is explained. Also, in the further analysis,
the various models and techniques employed in derivation of the best model(s) is
explained. Furthermore, in the results and discussion, results obtained from analysis
using the R statistical software are discussed. The chapter ends with the conclusion,
which summarises the entire chapter.
4.2 Preliminary Analysis
Monthly data of Ghana’s inflation and CIC rates were obtained from the website of
the Bank of Ghana ranging from January 1990 to August 2016. The datasets each of
which consists of 320 observations, were divided into two parts namely; the train and
test datasets. A casual view of time series plots of these datasets appeared seasonally
adjusted and asymmetric in nature. Following the Box and Jenkins (1976) procedure
for forecasting, the first 296 points which starts from January 1990 to August 2015
represents the train dataset used to build the model whiles the remaining points of 12
months from September 2015 to August 2016 represented the test dataset retained for
out of sample performance assessment for each variable.
43
University of Ghana  http://ugspace.ug.edu.gh
4.2.1 Preliminary Analysis of Inflation rate
In this section, descriptive statistics of monthly inflation rates is discussed.
By inspection, Table 4.1 recorded a minimum value of 7.3 and a maximum value of
70.8 for inflation rate in March 1992 and December 1995 respectively. The sample
moments of inflation for the time period under consideration appeared positively
skewed and leptokurtic in nature about their mean values. The leptokurtic nature of
the data means that, the inflation values were more peaked about the mean value of the
inflation rate. The Jarque-Bera test for normality was used to confirm the asymmetric
nature of the inflation rate. At 5% significance level, the test confirmed that the test was
not normally distributed as its p-value, 2.2e−16, was less that the significant level.
Table 4.1: Descriptive Statistics of Inflation Rates
Mean Median Std Min Max Skewness Kurtosis Jarque-Bera
(P-value)
21.16 17.30 12.88 7.30 70.80 1.86 3.83 2.2e-16
Figure 4.1, shows both increasing and decreasing pattern in inflation rate overtime.
These fluctuations in the rate could be as a result of unstable market conditions.
Somewhere around 1991, the figure shows volatility in the time series plot. Volatility
in this plot could be as a result of some monetary factors such as poor agricultural
production and world economic crisis as described by Aidoo (2011) and Ocran (2007).
The years 2004 to 2008 saw consistent decline and pattern in inflation rate. This may
have resulted due to the implementation of policies like reduced taxes leading to the
reduction in price of goods and services. Yet again, inflation rates appeared stable
from somewhere around the year 2011 to 2013. A trendy nature in the plot shows
periodic pattern over time in regular intervals.
44
University of Ghana  http://ugspace.ug.edu.gh
Figure 4.1: Time series plot of inflation
Figure 4.2 confirms Ghana’s inflation is non-stationary, since there exists slow decay
in the series for all lag values in the ACF plot. Likewise, the PACF plot shows a
significant spike in lag 1 with marginal spikes in some few lags of the series.
Figure 4.2: ACF and PACF plots of inflation rates
After Figure 4.2 confirmed the series was nonstationary, the data was logarithmically
transformed and differenced to attain stationarity in the series as shown in Figure 4.3.
Significant spikes for both ACF and PACF plots appear to revolve about fixed points,
45
University of Ghana  http://ugspace.ug.edu.gh
this affirms stationarity. Moreover, these plots demonstrate a sine wave-like pattern.
This confirms Ghana’s inflation has both seasonal and non-seasonal components. In
this figure too, the PACF plot shows significant spikes in lags 12, 24 and 36 and at lag
12 for the ACF plot. This affirms the trendy pattern in the series at regular intervals.
For further analysis, the transformed and differenced series is used.
Figure 4.3: ACF and PACF plots of differenced inflation
4.2.2 Preliminary Analysis for CIC rate
An examination of descriptive statistics for the CIC data in Table 4.2, revealed the
value 53.1 as the minimum CIC rate recorded in March 1996 and the value 9536.5 as
the maximum CIC rate recorded in December 2015. The sample moments for the CIC
rate under study appeared positively skewed and platykurtic in nature about the mean
value. The platykurtic nature of the data indicates the CIC values are broadly spread
about the mean value of the CIC rate. Just like the inflation, the test for normality in
the CIC rates confirms the data is far from a normal distribution. So, we conclude the
CIC rate is asymmetric in nature.
46
University of Ghana  http://ugspace.ug.edu.gh
Table 4.2: Descriptive Statistics of CIC Rates
Mean Median Std Min Max Skewness Kurtosis Jarque-Bera
(P-value)
1729.6 730.5 2273.82 53.1 9536.5 1.68 1.75 2.2e-16
Although, the years 1996 and 1997 experienced steep decline in CIC rates, the years
1990 to 1993 recorded constant values for the CIC rate. In later years, CIC rates
picked up until somewhere around the year 2000. From the year 2000 onwards, a
gradual increase occurred in CIC rate at regular intervals. This indicates that as years
go by the total supply of money in Ghana increases. Volatility explains better the
consistent peaks in the time series plot of the CIC rates as shown in Figure 4.4. This
volatility, may be due to instability in prices and market interest rates.
Figure 4.4: Time Series plot of CIC
A graphical view of the CIC rate presented in Figure 4.5 revealed that, Ghana’s CIC
rate is non- stationary. Since, the graphs do not revolve around a zero mean and a
constant variance. In addition, the plot of ACF shows a slow decay in its series. But
then, the PACF plot displays a single significant spike at lag 1 with several marginal
spikes across all other lags.
47
University of Ghana  http://ugspace.ug.edu.gh
Figure 4.5: ACF and PACF plots of CIC
In Figure 4.6, the spikes in plots of ACF and PACF of CIC rate decay rapidly, this
indicates the series is stationary. Stationarity of the series is again confirmed as the
plots are perceived to revolve around fixed points.
Figure 4.6: ACF and PACF plots of differenced CIC
4.3 Further Analysis
To explain the procedures used for model building, evaluating and forecasting in this
study, we derived appropriate model(s) by employing SARIMA and SETAR modelling
48
University of Ghana  http://ugspace.ug.edu.gh
procedures for both inflation and CIC rates. This section too represents and explains
actual statements and observations including tables and figures. Sufficient details of
the study as well are discussed to serve as reference to other related studies.
4.3.1 SARIMA modelling
In this section, the seasonal ARIMA approach is followed to fit appropriate models to
forecast Ghana’s inflation and CIC rates. In this modelling cycle, stationarity is first
checked in the original datasets for both rates using Phillips-Perron and KPSS unit root
tests. Later, we follow the procedures described by Box and Jenkins in model building
and forecasting.
SARIMA Modelling for inflation
To model inflation, we first checked for stationarity in the data using the KPSS and PP
tests for unit root testing. For the PP-test, the null hypothesis of non-stationarity was
not rejected because the resulting P-value was greater than 5% level of significance.
So, we conclude the raw data is not stationary. Hence, the data was logarithmically
transformed and differenced resulting in a P-value smaller than the 5% level of
significance. This indicates the data is now stationary.
On the other hand, the alternative unit root test, the KPSS test was employed to cater
for shocks within the series under the null hypothesis of stationarity. The P-Value that
resulted from this test, suggests the raw data is not stationary. Just as in the PP-test, the
raw data is transformed to attain stationarity. This concept of stationarity is important
to most time series analysts. Since, it enables them to make meaningful predictions.
Table 4.3 summarises the results from the unit root tests performed for inflation.
Table 4.3: Unit root test for inflation
Unit root test Raw Data Transformed Data
Phillips-Perron (p-value) 0.29 0.01
KPSS (p-value) 0.01 0.1
49
University of Ghana  http://ugspace.ug.edu.gh
Now that our data is stationary, our next task is to build appropriate models that fit
the data well. We do so, by further examination of Figure 4.3. Suggested AR and
MA orders for the models under construction are chosen from Figure 4.3. While we
selected AR orders from significant spikes of lag orders in the differenced PACF plot,
MA orders were selected from the differenced ACF plot with significant spike of lag
orders to make up tentative models. In this approach of selecting model components,
we considered all spikes above zero as non-seasonal components and below zero as
the seasonal components. By inspection of the differenced PACF plot in Figure 4.3,
lag orders 1 and 3 were chosen as non-seasonal components and lag orders 1, 2 and 3
as seasonal components of the AR order to be added to the model. In the differenced
ACF plot too, lag orders 1,2 and 3 were chosen as non-seasonal components with lag
order 1 as the seasonal MA component to be added to the model.
The various model combinations are evaluated by considering models with least
values of AIC, RMSE, MAE and MAPE. These are shown in Table 4.4.
Table 4.4: AIC, RMSE, MAE and MAPE of candidate seasonal ARIMA Models for
Inflation
Model AIC RMSE MAE MAPE
ARIMA(1,1,1)(1,0,1)12 -696.55 0.07550 0.05060 1.7985
ARIMA(3,1,1)(1,0,1)12 -693.47 0.07540 0.05043 1.78758
ARIMA(1,1,1)(2,0,1)12 -695.00 0.07543 0.05032 1.78517
ARIMA(3,1,1)(2,0,1)12 -691.82 0.07534 0.05019 1.78018
ARIMA(1,1,2)(1,0,1)12 -694.68 0.07549 0.05050 1.78991
ARIMA(1,1,2)(2,0,1)12 -693.69 0.07542 0.05022 1.78136
ARIMA(3,1,2)(1,0,1)12 -692.64 0.07524 0.05046 1.78775
ARIMA(3,1,2)(2,0,1)12 -691.02 0.07518 0.05021 1.77997
ARIMA(1,1,3)(1,0,1)12 -693.75 0.07537 0.05038 1.78537
ARIMA(1,1,3)(2,0,1)12 -692.07 0.07535 0.05016 1.77862
ARIMA(3,1,3)(1,0,1)12 -698.18 0.07377 0.04940 1.74989***
ARIMA(3,1,3)(2,0,1)12 -694.36 0.07411 0.04974 1.76357
*** means best based on the selection criteria.
Carefully scrutinizing the twelve tentative models in Table 4.4 for inflation,
50
University of Ghana  http://ugspace.ug.edu.gh
ARIMA(3,1,3)(1,0,1)12 could be adjudged the best model in terms of its minimum
values of AIC, RMSE, MAE and MAPE for inflation rate.
In line with a said objective of the study, we sought to make an out of sample forecast
from the best model. Hence, tentative models with the most minimum values were
selected and assessed using the forecast accuracy test. In this test, the method of
maximum likelihood was used to evaluate the values of parameter estimates of the
models with their corresponding standard errors as shown in Table 4.5. Model
variables with either corresponding positive or negative values for parameter estimates
and standard errors are considered significant to the model. However, the greater
the number of variables resulting from either a positive standard error and a positive
estimate and vice-versa, the more accurate the model.
By the inspection of Table 4.5, seven parameter variables demonstrated to be more
significant to the model ARIMA(3,1,3)(1,0,1)12. This means that the variables
AR(1), AR(2), AR(3), MA(1), MA(3), SAR(1) and SMA(1) were added to the model,
since they proved to be significant.
Table 4.5: Estimated Parameters of ARIMA(3,1,3)(1,0,1)12 for inflation
Variable Estimate Standard Error 95% confidence Interval
Lower limit Upper Limit
∗AR(1) -0.8740 0.0651 -1.0016 -0.7464
∗AR (2) 0.5570 0.1067 0.3478 0.7662
∗AR(3) 0.8286 0.0599 0.7112 0.9459
∗MA(1) 1.1159 0.1238 0.8732 1.3585
MA(2) -0.2394 0.1679 -0.5684 0.0897
∗MA(3) -0.6872 0.0886 -0.8610 -0.5135
∗SAR(1) 0.1830 0.0821 0.0221 0.3439
∗SMA (1) -0.8027 0.0592 -0.9186 -0.6867
σ̂2 = 0.00546
∗ means significant at 5% significant level.
51
University of Ghana  http://ugspace.ug.edu.gh
Table 4.6 revealed just SAR (1) model variable as the significant parameter to the
model ARIMA(1,1,2)(1,0,1)12 and hence, this model was considered in the model.
Table 4.6: Estimate Parameters of ARIMA(1,1,2)(1,0,1)12 for inflation
Variable Estimate Standard Error 95% confidence Interval
Lower limit Upper Limit
AR(1) -0.8708 0.0620 0.7493 0.9923
∗MA(1) -0.6669 0.0844 -0.8323 -0.5016
MA(2) -0.0231 0.0642 -0.1490 0.1028
SAR(1) 0.1562 0.0872 -0.0147 0.3270
∗SMA (1) -0.7591 0.0650 -0.8864 -0.6317
σ̂2 = 005717
∗ means significant at 5% significant level.
In terms of the number of significant variables to the model, the model
ARIMA(3,1,3)(2,0,1)12 displayed six significant parameter variables as shown
in Table 4.7. This indicates the model is significant. Hence, we consider the model for
estimation.
Table 4.7: Estimate Parameters of ARIMA(3,1,3)(2,0,1)12 for inflation
Variable Estimate Standard Error 95% confidence Interval
Lower limit Upper Limit
AR(1) 0.0647 0.0644 -0.0616 0.1910
∗AR(2) -0.3129 0.0508 -0.4124 -0.2135
∗AR(3) 0.8572 0.0631 0.7335 0.9810
MA(1) 0.1283 0.0962 -0.0602 0.3169
∗MA(2) 0.4864 0.0784 0.3328 0.6401
∗MA(3) -0.6552 0.0916 -0.8347 -0.4756
SAR(1) 0.1664 0.0976 -0.0250 0.3577
SAR(2) 0.0795 0.0795 -0.0763 0.2353
∗SMA(1) -0.7934 0.0791 -0.9484 -0.6385
σ̂2 = 0.005511
∗ means significant at 5% significant level.
52
University of Ghana  http://ugspace.ug.edu.gh
Obtaining parameter estimates of tentative models is not enough to conclude that,
a particular model best fits a data. An important task therefore is to check if all
assumptions of the SARIMA model are satisfied. This was done by performing
diagnostic checks on model residuals as shown in Table 4.8. Thus, we checked
if residuals have zero mean, constant variance and are also uncorrelated, that is if
residuals follow a white noise process.
The Ljung- Box test, a portmanteau test, was performed on the residuals of the model
under the null hypothesis of residuals being random, P-values resulting from this test
revealed the residuals were random. This was so, because P-values obtained from
the test were greater than 5% level of significance. Equally, the ARCH-LM test was
performed on the model residuals under the null hypothesis of there is no ARCH
effect. This test was included in the study due to its ability to test for variations in
model variances. The resulting P-values greater than the 5% level of significance
indicated, there were no ARCH effect in the model residuals. This therefore, enables
us conclude the presence of constant variance with zero mean in all the tentative
models.
Table 4.8: Residual diagnostic tests for Seasonal ARIMA models for inflation
Model P-values
ARCH-LM test Ljung-test
ARIMA(3,1,3)(1,0,1)12 0.9111 0.8702
ARIMA(1,1,2)(1,0,1)12 0.7321 0.4823
ARIMA(3,1,3)(2,0,1)12 0.6512 0.7015
The ACF plots of the residual models was observed to be zero, depicting residuals are
uncorrelated. Also, it was observed that all spikes of the plots in Figure 4.7 are within
the confidence boundaries. This means the models fits best the data under study.
Clearly, all these tentative models appear to be satisfactory of all SARIMA model
assumptions, thereby we consider all these models for further valuation.
53
University of Ghana  http://ugspace.ug.edu.gh
Figure 4.7: ACF plots of residuals for inflation
SARIMA Modelling for CIC
As shown in Table 4.9, the resulting P-values from the raw data for CIC was
not stationary for both PP and KPSS tests under their respective null hypotheses.
Statistically, if a P-value is greater than 5% significance level, the null hypothesis is not
rejected and vice- versa. Contrarily to the PP test, the KPSS test produced a P-value
of 0.01, which is less than a 5% significance level. This affirms non-stationarity in
the raw data considering both unit root tests. As usual, the non-stationary data was
transformed to attain stationarity in the data.
Table 4.9: Unit Root test for CIC rate
Unit root test Raw Data Transformed Data
Phillips-Perron (p-value) 0.99 0.01
KPSS (p-value) 0.01 0.1
After attaining stationarity in the data, the next task was to select model constituents
to enable us derive appropriate model(s) for further analysis. Just as it was done
earlier for inflation, the same procedure was followed to select our seasonal and
54
University of Ghana  http://ugspace.ug.edu.gh
non-seasonal components of the model(s) to be derived by examining Figure 4.6. On
successfully deriving our models with their respective values of AIC, RMSE, MAE
and MAPE, models with minimum values were considered as the selection criterion
for the tentative models for further assessment. Table 4.10 below provides models
with their respective measure of values.
Table 4.10: AIC, RMSE, MAE and MAPE of candidate seasonal ARIMA Models for
CIC
Model AIC RMSE MAE MAPE
ARIMA(0,1,0)(1,0,1)12 -16.49 0.23235 0.07934 1.32993
ARIMA(0,1,0)(1,0,3)12 -27.72 0.22494 0.08272 1.36114
ARIMA(0,1,0)(3,0,1)12 -26.01 0.22675 0.08745 1.44632
ARIMA(0,1,0)(3,0,3)12 -52.20 0.20165 0.07955 1.30287***
∗∗∗Means minimum values of the selection criterion.
Later, the estimated parameters were of the tentative models were checked for their
significance with the least values of selection criterion. By this, the forecast accuracy
was employed as discussed earlier. Results from Tables 4.11, 4.12 and 4.13 indicates
all estimates of the model components are relevant. Here too, variable parameters
with resulting positive values for both estimates and standard errors or vice-versa were
considered as relevant to the model under construction.
Table 4.11: Estimated parameters of ARIMA (0,1,0) (3,0,3)12 for CIC
Variable Estimate Standard Error 95% confidence Interval
Lower limit Upper Limit
∗SAR(1) -1.0917 0.1203 -1.3274 -0.8559
∗SAR(2) -0.7941 0.1551 -1.0981 -0.4790
∗SAR(3) -0.2543 0.1076 -0.4652 -0.0435
∗SMA(1) 1.0499 0.1509 0.7542 1.3456
∗SMA(2) 0.8201 0.2140 0.4007 1.2394
SMA(3) -0.1499 0.1316 -0.4079 0.1081
σ̂2 = 0.04079
∗ means significant at 5% significant level.
55
University of Ghana  http://ugspace.ug.edu.gh
Table 4.12: Estimated parameters of ARIMA(0,1,0)(1,0,3)12 for CIC
Variable Estimate Standard Error 95% confidence Interval
Lower limit Upper Limit
∗SAR(1) -0.5825 0.0798 -0.7390 -0.4261
∗SMA(1) 0.5118 0.0781 0.3587 0.6650
SMA(2) 0.0455 0.0618 -0.0757 0.1667
∗SMA(3) -0.2883 0.0815 -0.4481 -0.1285
σ̂2 = 0.05076
∗ means significant at 5% significant level.
Table 4.13: Estimated parameters of ARIMA(0,1,0)(1,0,1)12 for CIC
Variable Estimate Standard Error 95% confidence Interval
Lower limit Upper Limit
SAR(1) -0.7978 0.0821 -0.9587 -0.6369
SMA(1) 0.6319 0.1059 0.4243 0.8394
σ̂2 = 0.05416
∗ means significant at 5% significant level.
To certify the models are appropriate SARIMA models, all the model assumptions of
constant variance, zero mean and autocorrelation were checked for residuals of the
models by employing the Ljung- Box and ARCH- LM tests under their respective null
hypothesis. Resulting P-values from Table 4.14 indicates, all models have zero means
and constant variances.
Table 4.14: Residual diagnostic tests for Seasonal ARIMA models for CIC
Model P-values
ARCH-LM test Ljung-test
ARIMA(0,1,0)(3,0,1)12 0.9997 0.9636
ARIMA(0,1,0)(1,0,3)12 0.9094 0.8556
ARIMA(0,1,0)(3,0,3)12 0.9996 0.9006
Lastly, the assumption of autocorrelation were checked. Figure 4.8 revealed the ACF
plots of residuals to be zero, describing residuals are uncorrelated. A supplementary
observation showed except for lag zero that all spikes in all other lags are within the
confidence boundaries. This means that, all the three models fit CIC data well. Hence,
56
University of Ghana  http://ugspace.ug.edu.gh
we consider all the models for further analysis.
Figure 4.8: ACF plots of residuals for CIC
4.3.2 SETAR Modelling
In this section, the SETAR modelling procedure proposed by Franses and Van Dijk
(2000) is followed to model and forecast Ghana’s inflation and CIC rates. The section
includes the two- regime SETAR model and forecast approach for the two rates under
study.
SETAR modelling for inflation rate
In order to model inflation rate with the SETAR model, its time series must first
satisfy the condition of nonlinearity. So, the nonlinearity was checked in this series
by specifying the order of the linear AR(p) model. The order of the linear AR(p)
is chosen as the AR(p) model based on the maximum lag order with the least value
of AIC. Using the Keenan and Tsay tests of linearity, AR(14) was found to be the
linear AR order with the least AIC value. In addition, P-values from these tests were
both less than the 5% significant level, suggesting that Ghana’s inflation rate follow
57
University of Ghana  http://ugspace.ug.edu.gh
threshold nonlinear. While considering the null hypothesis of linearity for Keenan-1-
degree test and the alternative hypothesis of threshold-type nonlinearity in the Tsay
F-test. Employing regime switching models can best account for these discontinuities
and switching of inflation as compared to the simple linear model. Table 4.15 displays
a summary of the linearity tests performed on inflation rate.
Table 4.15: Linearity test for inflation rate
Test Test statistic P- value Order Decision
Keenan- 4.2232 0.0041 14 Reject Linearity
1-degree
Tsay 3.034 1.936e-11 14 No threshold nonlinearity
rejected
Following that, the time series of inflation is confirmed to be nonlinear, the appropriate
SETAR model identified best fit the dataset. In doing so, the lag order p in each regime
and the threshold variable yt−d , here d represents the delay parameter were determined.
After performing grid search of tentative model combinations for SETAR models that
best fit the dataset, lag orders with minimum values of AIC and MAPE were chosen as
the tentative model(s). Tables 4.16 below provides a summary of competing models to
be used in forecasting the two rates. The table suggest that SETAR (2;14,4) with
threshold variable yt−4, SETAR (2;14,5) with threshold variable yt−5 and SETAR
(2;14,13) with threshold yt−13 was the appropriate models in explaining nonlinearity in
Ghana’s inflation rate. Hence, the three models were considered for further assessment.
Table 4.16: AIC and MAPE values of tentative SETAR models
Model AIC MAPE
SETAR (2;14,4) -1541.411 1.5680
SETAR (2;14,5) -1549.031 1.7785
SETAR (2;14,13) -1538.501 1.7000
In accordance with Franses and Dijk’s (2000) approach, the method of conditional
least squares is used to estimate parameters of the candidate models as performed in
Tables 4.17, 4.18 and 4.19 with their corresponding threshold values.
58
University of Ghana  http://ugspace.ug.edu.gh
Resulting proportions from Table 4.17 with delay parameter of four and threshold value
of 0.03944, can attribute structural breaks in inflation to data emanating from the low
regime rather than the high regime as displayed.
Table 4.17: Estimate Parameters of SETAR (2;14,4) for inflation
Low Regime High Regime
Coefficient
Estimate Std Error t- value Estimate Std Error t-value
Constant -0.0055 0.0063 -0.8821 0.0232 0.0171 1.3566
φ 1 0.2311 0.0629 3.6766 -0.1208 0.1617 -0.7467
φ 2 -0.0584 0.0648 -0.9018 0.7412 0.1738 4.2647
φ 3 0.0559 0.0709 0.7890 -0.0279 0.1055 -0.2640
φ 4 0.0544 0.0664 0.8197 0.0228 0.1055 0.2157
φ 5 0.1563 0.0908 1.7218 -0.0773 0.1097 -0.7050
φ 6 -0.0669 0.0789 -0.8479 0.0437 0.0880 0.4965
φ 7 0.0549 0.0688 0.7978 -0.0933 0.1081 -0.8625
φ 8 0.0568 0.0683 0.8306 0.0364 0.0963 0.3776
φ 9 -0.0840 0.0681 -1.2336 0.1066 0.0933 1.1422
φ 10 0.1396 0.0644 2.1678 -0.0387 0.1044 -0.3704
φ 11 -0.0563 0.0617 -0.9115 -0.0807 0.1222 -0.6606
φ 12 -0.3874 0.0628 -6.1712 -0.3996 0.1161 -3.441
φ 13 0.1143 0.0801 1.4268 0.0248 0.1104 0.2249
φ 14 -0.0341 0.0666 -0.5115 0.1700 0.1435 1.1851
Threshold value = 0.03944
Proportion 75.09% 24.91%
In Table 4.18 too, more than half the data recorded switches and discontinues in
the low regime. This indicates the earlier years of the inflation data demonstrates
discontinuities in the dataset given a delay parameter of five with threshold value of
0.01342.
59
University of Ghana  http://ugspace.ug.edu.gh
Table 4.18: Estimate Parameters of SETAR (2;14,5) for inflation
Low Regime High Regime
Coefficient
Estimate Std Error t- value Estimate Std Error t-value
Constant -0.0137 0.0078 -1.7433 0.0225 0.0111 2.0377
φ 1 0.3381 0.0789 4.2820 0.1130 0.0856 1.3205
φ 2 -0.0597 0.0686 -0.8706 0.3225 0.1202 2.6834
φ 3 0.0873 0.0737 1.1841 0.0315 0.0864 0.3648
φ 4 0.1221 0.0776 1.5737 -0.0060 0.0772 -0.0775
φ 5 -0.0176 0.0699 -0.2516 0.0979 0.0944 1.0371
φ 6 -0.0927 0.0989 -0.9375 -0.0280 0.0918 -0.3054
φ 7 0.0680 0.0808 0.8408 -0.0519 0.0769 -0.6746
φ 8 0.0155 0.0712 0.2182 0.1419 0.0912 1.5565
φ 9 -0.0293 0.0755 -0.3884 0.0038 0.0779 0.0490
φ 10 0.2872 0.0820 3.5007 -0.0726 0.0723 -1.0046
φ 11 -0.1464 0.0678 -2.1583 -0.0382 0.0892 -0.4283
φ 12 -0.3134 0.0739 -4.2402 -0.5414 0.0792 -6.8377
φ 13 0.0797 0.0780 1.0217 0.2289 0.0926 2.4729
φ 14 -0.0064 0.0792 -0.0808 0.0311 0.0914 0.3403
Threshold value = 0.01342
Proportion 61.77% 38.23%
Unlike Tables 4.17 and 4.18, Table 4.19 explains that, more than 80% of the inflation
data experienced switching regimes and discontinuities in the higher regimes as
compared to the low regime.
Similarly, in SETAR modelling we check models to confirm if they satisfy all
model assumptions as mentioned earlier. Thus, we check the model residuals
for no auto correlation, zero mean and constant variance of residuals. Also, we
perform the ARCH-LM and Ljung -Box tests on the residuals under their specific
hypotheses as discussed earlier. The results from Tables 4.20 confirms non-existence of
autocorrelation and random residuals in all tentative models under the null hypothesis
of residuals being random and residuals being non-correlated. A critical look at
the table again, reveals SETAR(2;14,4) failed its ARCH-test for lag order 12, this
indicates the possibility of autocorrelation, nevertheless we will still consider this
60
University of Ghana  http://ugspace.ug.edu.gh
model for further assessment.
Table 4.19: Estimate Parameters of SETAR (2;14,13) for inflation
Low Regime High Regime
Coefficient
Estimate Std Error t- value Estimate Std Error t-value
Constant -0.0412 0.0326 -1.2643 -0.0032 0.0053 -0.5957
φ 1 -0.0105 0.1322 -0.0792 0.3050 0.0673 4.5348
φ 2 0.1790 0.0937 1.9099 0.0405 0.0825 0.4916
φ 3 0.2177 0.0862 2.5269 0.0254 0.0740 0.3428
φ 4 -0.0780 0.1049 -0.7434 0.1051 0.0715 1.4697
φ 5 -0.1625 0.1468 -1.1070 0.0750 0.0635 1.1805
φ 6 0.2319 0.1053 2.2024 0.0294 0.0644 0.4567
φ 7 -0.1882 0.0957 -1.9670 0.0611 0.0687 0.8895
φ 8 -0.1880 0.1425 -1.3193 0.0794 0.0612 1.2972
φ 9 -0.0910 0.1105 -0.8232 0.0033 0.0617 0.0532
φ 10 0.0822 0.0967 0.8496 0.1159 0.0682 1.6990
φ 11 -0.1485 0.1287 -1.1543 -0.0950 0.0638 -1.4897
φ 12 -0.3674 0.1453 -2.5280 -0.3932 0.0632 -6.2224
φ 13 0.1225 0.1223 1.0013 0.0935 0.0761 1.2296
φ 14 -0.1817 0.1972 -0.9214 0.0223 0.0804 0.2779
Threshold value = -0.07091
Proportion 17.41% 82.59%
Table 4.20: Residual diagnostics of tentative SETAR models for inflation
Model Ljung-Box test ARCH-LM test
SETAR (2;14,4) 0.8371 0.0030
SETAR (2;14,5) 0.8667 0.4480
SETAR (2;14,13) 0.9776 0.5915
SETAR modelling for CIC rate
We follow the same procedure for testing linearity as we did earlier. In this section,
we found AR (13) to be our AR(p) model order with the least AIC while employing the
Keenan and Tsay tests for linearity. The P-values from Table 4.21 indicates Ghana’s
CIC rate follow Threshold nonlinear under the null hypotheses of Keenan and Tsay
61
University of Ghana  http://ugspace.ug.edu.gh
tests as mentioned earlier. Again, P-values from the Keenan test indicates nonlinearity
in the data as well as the presence of threshold from the Tsay test. Equally, regime
switching models best caters for these discontinuities and switching of inflation as
compared to the simple linear model.
Table 4.21: Linearity test for CIC rate
Test Test statistic P- value Order Decision
Keenan- 29.405 1.236e-07 13 Reject Linearity
1-degree
Tsay 3.905 5.066e-16 13 No threshold nonlinearity
rejected
After confirming the CIC rate is nonlinear, we identified an appropriate SETAR model
that best fits the data as shown in Table 4.22. Interestingly, a grid search of tentative
model combination for SETAR model revealed only one model as the tentative model
with close AIC and MAPE values for the CIC rate. Hence, we consider this model for
further assessment.
Table 4.22: AIC and MAPE candidate models for CIC
Model AIC MAPE
SETAR (2;13,9) 3148.395 0.1662
Looking at Table 4.23, it can be inferred early years of CIC values resulted in more
than 80% of structural breaks in the data. The standard errors from the table are
relatively low in the low regime as compared to the high standard error in the high
regime. The t- values in the table was obtained by dividing the estimate of each
constant by its corresponding standard error.
Again, we check the model if it satisfies the model assumptions of autocorrelation,
zero mean and constant variance. The ARCH -LM and Ljung-Box tests on the model
residual confirms the constructed model satisfied model assumptions after lag 6. Also,
62
University of Ghana  http://ugspace.ug.edu.gh
the ACF plot of the model residual is not correlated. Hence, we conclude the model fit
the data well.
Table 4.23: Estimate Parameters of SETAR(2;13,9) for CIC
Low Regime High Regime
Coefficient
Estimate Std Error t- value Estimate Std Error t-value
Constant 13.6062 15.8325 0.8594 318.4621 94.3598 3.3750
φ 1 0.6990 0.0640 10.9183 0.8922 0.1096 8.1432
φ 2 0.2298 0.0788 2.9180 -0.0661 0.1198 -0.5520
φ 3 -0.0254 0.0801 -0.3164 -0.0229 0.1190 -0.1924
φ 4 0.0106 0.0799 0.1327 -0.0808 0.1187 -0.6807
φ 5 0.0337 0.0800 0.4209 0.1599 0.1177 1.3590
φ 6 0.0317 0.0800 0.3956 -0.2129 0.1198 -1.7772
φ 7 0.0214 0.0804 0.2665 0.1279 0.1269 1.0074
φ 8 -0.0501 0.0832 -0.6020 0.0769 0.1198 0.6416
φ 9 0.0580 0.1015 0.5708 -0.2244 0.0971 -2.3109
φ 10 0.0297 0.1076 0.2761 0.0083 0.0939 0.0884
φ 11 0.0166 0.1072 0.1547 0.2542 0.0919 2.7654
φ 12 0.0367 0.1072 0.3422 0.5410 0.0915 5.9121
φ 13 -0.0922 0.0829 -1.1126 -0.4658 0.0967 -4.8164
Threshold value = 2159
Proportion 82.71% 17.29%
Table 4.24: Residual Diagnostics of suggested SETAR models for CIC
Model Ljung-Box test Arch-test
SETAR (2;13,9) 0.9163 0.4476
4.3.3 Comparative Analysis of forecast between SARIMA and
SETAR models
As described by most researchers, forecasting is an important phenomenon to effective
decision making and policy planning. It is therefore necessary to obtain good models
that provide best forecast values. The main task in this research is to compare the
forecasts performance between the linear SARIMA model and the non-linear SETAR
63
University of Ghana  http://ugspace.ug.edu.gh
Model for both inflation and CIC rates. This is core to the policy maker and the planner
for effective economic growth.
Inflation rate
The tentative models for inflation have shown to adequately fit the inflation data
well and satisfied all model assumptions. Confidently, we use these models to
forecast inflation rate over 12 months period from 2015(8) to 2016(9). Thus, we
compare forecast performance between the tentative models in terms of their RMSE
and MAE. As shown in Table 4.25, none of these models recorded consistent least
values from both in sample and out of sample forecasts. Inconsistently, the model
ARIMA(3,1,3)(1,0,1)12 recorded the most least values of RMSE and MAE for the
in-sample forecasts but failed to record same for the out of sample forecasts. The
ARIMA(1,1,2)(1,0,1)12 model also recorded the most least values for the out of
sample forecasts but recorded otherwise for the in-sample forecasts. This makes
it quite difficult to select any of the models to represent inflation in terms of the
most least error term measures. Hence, the maximum P-values resulting from the
Ljung-Box and ARCH tests in Table 4.20 were relied on to make a choice on the most
appropriate linear SARIMA model to consider for comparison with the nonlinear
SETAR model. Finally, we consider ARIMA(3,1,3)(1,0,1)12 as most appropriate for
comparison because it produced the most maximum P-values from table 4.8. This is
because the greater the P-value from 5%, the better the model. The chosen model as
well satisfies all SARIMA modelling assumptions.
Table 4.25: Forecast results of competing SARIMA models for inflation
Out of sample In-sample
Model
RMSE MAE RMSE MAE
ARIMA(3,1,3)(1,0,1)12 2.7711 2.7709 0.0738 0.0494
ARIMA(1,1,2)(1,0,1)12 2.7686 2.7683 0.0755 0.0506
ARIMA(1,1,1)(2,0,1)12 2.7834 2.7832 0.0754 0.0503
As explained earlier for Table 4.25, Table 4.26 demonstrates similar results. Yet again,
64
University of Ghana  http://ugspace.ug.edu.gh
no particular model shows consistent values for both in sample and out of sample
forecasts. Also, the P-values from Table 4.20 with the largest values from the Ljung-
Box and ARCH -LM tests were relied on to make a choice on the most appropriate
model to consider for valuation. The model SETAR (2;14,13) is chosen to be the
most appropriate model. This is because, it displayed to have the largest P- values.
Table 4.26: Forecast results of competing SETAR models for inflation
Out of sample In-sample
Model
RMSE MAE RMSE MAE
SETAR (2;14,4) 0.0015 0.0350 0.0058 0.0528
SETAR (2;14,5) 0.0021 0.0394 0.0055 0.0514
SETAR (2;14,13) 0.0017 0.0353 0.0057 0.0514
The selected models from the two modelling approaches have shown to fit the inflation
data well and satisfied all model assumptions. Although, the nonlinear SETAR model
outperformed the linear SARIMA model for inflation rates by their least error terms,
the two models were compared to see which of the models is more significant. This
comparison is done using the Diebold - Mariano test under the null hypothesis of
the existence of significant difference between the two models. Results from Table
4.27 indicates all the P-values from the five different forecast horizons was rejected.
Hence, it was concluded that the linear SARIMA and nonlinear SETAR models are
statistically significant to forecast inflation.
Table 4.27: Forecast accuracy test for inflation
Forecast Horizon DM statistic P-value
1 -0.2488 0.8037
2 -0.2397 0.8107
3 -0.2443 0.8071
4 -0.2767 0.7822
5 -0.2958 0.7676
65
University of Ghana  http://ugspace.ug.edu.gh
Currency in Circulation
To choose the most appropriate linear SARIMA model for comparison as we did for
the inflation rate, the three competing SARIMA models shown to satisfy all model
assumptions are examined. In Table 4.28 below, no model seems to have consistent
least values of RMSE and MAE for their out of sample and in sample forecasts over
our sample forecast period from 2015(8) to 2016(9). So, we inferred from the results
of Table 4.14 and adjudged ARIMA(0,1,0)(3,0,1)12 as our best model. We did this,
by choosing the model with the most maximum P-values resulting from the Ljung-
Box and ARCH -LM tests.
Table 4.28: Forecast results of competing SARIMA models for CIC
Out of sample In-sample
Model
RMSE MAE RMSE MAE
ARIMA(0,1,0)(3,0,1)12 8.8428 8.8428 0.2268 0.0874
ARIMA(0,1,0)(1,0,3)12 8.8700 8.8699 0.2249 0.0827
ARIMA(0,1,0)(3,0,3)12 8.9813 8.9812 0.2016 0.0795
Interestingly, the SETAR model for the CIC rate is just one and once it satisfies all
SETAR modelling assumptions, therefore the model was chosen from Table 4.29 for
further investigation.
Table 4.29: Forecast result of SETAR model for CIC
Out of sample In-sample
Model
RMSE MAE RMSE MAE
SETAR (2;13,9) 191086.54 280.35587 23697.61 74.85154
The Diebold - Mariano test was employed to test the forecast accuracy of the two
selected models for SARIMA and SETAR models. The test was performed under the
null hypothesis of the two models being significant. From Table 4.30, five forecasts
were considered to test the statistical significance of the models. The results from
the test indicates only one model is significant to forecast inflation, indicating that the
null hypothesis can be rejected. However, by comparing the error terms from both
66
University of Ghana  http://ugspace.ug.edu.gh
models, out rightly it can be concluded that the linear SARIMA model outperforms
the nonlinear SETAR model.
Table 4.30: Forecast accuracy test for CIC
Forecast Horizon DM statistic P-value
1 -3.0383 0.0026
2 -2.6836 0.0077
3 -2.6397 0.0087
4 -2.6492 0.0085
5 -2.6525 0.0084
4.4 Results and Discussion
An observation of datasets for inflation and CIC rates did not appear to follow a
normal distribution, this was confirmed using the Jarque -Bera test. While inflation
rate had a broader surface due to its positively skewed value greater than three,
CIC rate showed a positively skewed value less than three indicating CIC values
are peaked about the mean value as compared to a normal distribution. The series
for both rates were observed to be nonstationary through their time series plots.
These series were therefore logarithmically transformed and differenced to attain
stationarity. Additionally, the differenced series showed a sine- wave like structure in
the ACF’s of the datasets, indicating the presence of both seasonal and non-seasonal
behaviour in the series. However, after the first non-seasonal differencing the series
became stationary, hence there was no need seasonally differencing the series. These
transformed and differenced series was then used to build the appropriate models
following procedures of Box-Jenkins. All chosen tentative models for both inflation
and CIC performed well in terms of their ACF plots of residuals indicating that any
of these models can serve as a good model for forecasting. This was confirmed
through diagnostic checks on residuals of the tentative models. Further results from
the ARCH-LM and Ljung-Box tests depict all chosen models are good for forecasting
since they were above 5% level of significance and hence, satisfy all assumptions of
67
University of Ghana  http://ugspace.ug.edu.gh
the SARIMA model.
Before fitting the SETAR model, the datasets for both inflation and CIC rates must
be nonlinear. So, a linearity test was performed on both rates using the Keenan and
Tsay tests for linearity under their respective null hypothesis. The resulting P-values
were less than 5%, implying the datasets follow threshold nonlinear. In like manner
as in the SARIMA modelling, a diagnostic check was performed on the residuals of
the fitted model(s) after performing a grid search of tentative models. Again, results
from the ARCH and Ljung -Box tests showed P-values satisfying the assumptions of
SETAR model.
The fear of the “unknown” makes the policy maker and the planner disorganized, since
it becomes difficult to plan into the future. However, as discussed by Box and Jenkins
(1976), forecasting provides the basis for effective economic planning, optimization
of industrial activities and improved inventory and production control. Therefore,
obtaining good models to provide accurate forecasts is core to the researcher, policy
makers, investors and governments. Inflation and CIC rates are some examples of
the major drivers of an economy of which Ghana is not left out. Hence, forecasting
of these variables will serve as a guiding tool to financial and economic analysts
to effectively make decisions and advise key stakeholders to formulate important
policies to reduce inflation rates and regulate CIC rates. As a result, the study adjudges
ARIMA(3,1,3)(1,0,1)12 and SETAR (2;14,13) as best models in predicting inflation
rates. Concurrently, ARIMA(0,1,0)(3,0,1)12 and SETAR(2;13,9)are best models to
predict CIC rates in Ghana.
Further checks on the chosen models in terms of their forecast accuracy using the
Diebold-Mariano test revealed that both SARIMA and SETAR models can be used to
forecast inflation. Since, the test proved there was no significant difference between the
two models. Divergently, there was significant difference between the SARIMA and
68
University of Ghana  http://ugspace.ug.edu.gh
SETAR models used in predicting the CIC rate. This could have resulted due to vast
difference in the criterion used to measure forecast accuracy. Therefore, we predict
inflation using both models for a twelve months period. However, for CIC rates we
prefer to use the SARIMA model for predictions.
4.5 Chapter Conclusion
This chapter has comprehensively dealt with analyses and discussions from the study.
It also represented concise and precise major findings of the study. The chapter
also showed that the two rates considered increased in regular intervals, implying the
datasets are seasonal by nature. Further test using the Jarque-Bera test revealed the
datasets do not follow a normal distribution. In affirmation, the linearity test proposed
by Keenan and Tsay proved the datasets are threshold nonlinear. Additionally, forecast
performance indicates that the behaviour of a dataset does not necessarily inform the
best model fit. This was confirmed in the analysis for the CIC rate. Even though, the
dataset appeared to be nonlinear, the linear model fitted and forecasted the rate better.
Hence, it can be concluded that the SARIMA model is best for forecasting CIC rates.
69
University of Ghana  http://ugspace.ug.edu.gh
CHAPTER 5
CONCLUSION AND RECOMMENDATIONS
5.1 Introduction
This chapter represents conclusion and recommendations of the study.
5.2 Conclusion
In the study, SARIMA and SETAR models were fitted to time series datasets on
monthly inflation and currency in circulation (CIC) rates obtained from the Bank of
Ghana. Observation of the ACF and PACF of the datasets showed that the data were
not stationary. This was confirmed by the KPSS and PP unit root tests on the datasets.
The datasets were logarithmically transformed and differenced non-seasonally to
obtain a stationary time series. In the linear SARIMA modelling, the Box-Jenkins
approach was used and identified twelve and four tentative models for inflation and
CIC rates respectively. Amongst the tentative models, ARIMA(3,1,3)(1,0,1)12 and
ARIMA(0,1,0)(3,0,1)12 were respectively identified as best models for inflation and
CIC rates in the sense that, it gave least values of AIC, MAPE, RMSE and MAE.
Further diagnosis proved that all SARIMA model assumptions have been satisfied.
Thus, the models are adequate.
In the nonlinear SETAR modelling, Keenan and Tsay-F tests showed the datasets
were threshold nonlinear, so a two regime SETAR model was applied. A grid search
of tentative models revealed three and one SETAR models for inflation and CIC,
respectively. SETAR (2;14,13) and SETAR (2;13,9) were identified amongst the
tentative models as the best models. Again, further diagnosis showed the datasets
70
University of Ghana  http://ugspace.ug.edu.gh
satisfied all model assumptions. After modelling both rates, the performance between
the SARIMA and SETAR models were compared for inflation and CIC rates by
employing forecast measures, Root Mean Square Error (RMSE) and Mean Absolute
Error. Based on forecast results for the period of twelve months, the nonlinear SETAR
model outperformed the linear SARIMA model for inflation. However, the case
was different for CIC rates, the Linear SARIMA model outperformed the nonlinear
SETAR model. This confirms the stand of Gooijer and Kumar (1992) that there is no
clear evidence either in favour of linear or nonlinear models in terms of their forecast
performance.
Although, the results of forecast performance between the nonlinear SETAR model
and the linear SARIMA model showed the SETAR model has superiority over the
SARIMA model, the Diebold-Mariano test suggests both models are significant for
forecasting Inflation rates.
Finally, the results for forecast performance between models showed the linear
SARIMA model is superior to the nonlinear SETAR model. Hence, we conclude that
the SARIMA model is best to model and forecast inflation. This is confirmed from the
Diebold-Mariano test for different forecast horizons.
5.3 Recommendations
The research compared the linear SARIMA and nonlinear SETAR models. Since
models approximate reality, it is recommended that more research should be done
considering other linear and nonlinear models.
The results from the research revealed that the SARIMA model is better off in
predicting CIC rates as compared to the SETAR model; it is therefore recommended
close monitoring of financial markets should be done on regular basis in order to
regulate the total amount of money supply in the economy.
71
University of Ghana  http://ugspace.ug.edu.gh
REFERENCES
Abdul-Aziz, A., Anokye, M., Kwame, A., Munyakazi, L., and Nsowah-Nuamah, N.
(2013). Modeling and forecasting rainfall pattern in ghana as a seasonal arima
process: The case of ashanti region. International Journal of Humanities and Social
Science, 3(3):224–233.
Afrifa-Yamoah, E., Saeed, B. I. I., and Karim, A. (2016). Sarima modelling
and forecasting of monthly rainfall in the brong ahafo region of ghana. World
Environment, 6(1):1–9.
Aidoo, E. (2010). Modelling and forecasting inflation rates in ghana: An application
of sarima models. mathesis, School Of Technology And Business Studies.
Aidoo, E. (2011). Forecast performance between sarima and setar models: An
application to ghana inflation rate. mathesis, Uppsala University.
Akeyede, I., Adeleke, B. L., and Yahya, W. B. (2015). On forecast strength of some
linear and non linear time series models for stationary data structure. American
Journal of Mathematics and Statistics, 5(4):163–177.
Akter, S. and Rahman, S. (2010). Agribusiness forecasting with univariate time series
modelling techniques: The case of a dairy cooperative in the uk. Journal of Farm
Management, 13(11):747–764.
Antwi, E. (2017). Modelling and forecasting ghana’s inflation rate under threshold
models. mathesis, University of Venda.
Asamoah-Boaheng, M. (2014). Using sarima to forecast monthly mean surface air
temperature in the ashanti region of ghana. International Journal of Statistics and
Applications, 4(6):292–299.
72
University of Ghana  http://ugspace.ug.edu.gh
Badawi, M. (2016). Comparative analysis of sarima and setar models in predicting
pneumonia cases in the northern region of ghana. mathesis, University For
Development Studies.
Box, G. and Jenkins, G. (1976). Time series analys forecasting and control.
Clements, M. P., Franses, P. H., and Swanson, N. R. (2004). Forecasting economic and
financial time-series with non-linear models. International Journal of Forecasting,
20:169–183.
Cuestas, J. C., Gil-Alana, L. A., and Mourelle, E. (2011). Is there an asymmetric
behaviour in african inflation? a non-linear approach. In of Navarra, U., editor,
Navarra Center for International Development, number 3.
Diebold, F. X. and Mariano, R. S. (1995). Comparing predictive accuracy. Journal of
Business and Economic Statistics, 13(3):253–263.
Dritsaki, C. (2016). Forecast of sarima models: An application to unemployment rates
of greece. American Journal of Applied Mathematics and Statisitcs, 4(5):136–148.
Engle, R. (1982). Autoregressive conditional heteroscedasticity with estimates of the
variance of united kingdom inflation. Econometrica, 50:987–1007.
Fannoh, R., Orwa, G., and Mungatu, J. (2014). Modelling the inflation rates in liberia
sarima approach. International Journal of Science and Research, 3(6):1360–1367.
Feng, H. and Liu, J. (2002). A setar model for canadian gdp: Non-linearities and
forecast comparisons. Working Paper.
Fırat, E. H. (2017). Setar (self-exciting threshold autoregressive) non-linear currency
modelling in eur/usd, eur/try and usd/try parities. Mathematics and Statistics,
5(1):33–55.
Gharleghi, B., Nor, A. H. S. M., and Sarmidi, T. (2014). Application of the threshold
model for modelling and forecasting of exchange rate in selected asean countries.
Sains Malaysiana, 43(10):1609–1622.
73
University of Ghana  http://ugspace.ug.edu.gh
Gibson, D. and Nur, D. (2011). Threshold autoregressive models in finance: A
comparative approach. In University of Wollongong, A., editor, Applied Statistics
Education and Research Collaboration.
Hadrat, Y. M., Isaac, E. N., and Eric, E. S. (2015). Inflation forecasting in ghana -
artificial neural network model approach. International Journal of Economics and
Management science, 4(8):2162–6359.
Hsu, K.-H., Li, J.-F., Lin, Y.-B., Hong, C.-Y., and Huang, Y.-C. (2010). A setar model
for taiwan stock exchange capitalization weighted stock index: Non-linearities and
forecasting comparisons. Working Paper.
Isaac, A. M., Omari-Sasu, A. Y., and Boadi, R. K. (2016). Statistical models for count
data with applications to road accidents in ghana. International Journal of Statistics
and Applications, 6(3):123–137.
Iwok, I. A. and Etuk, E. H. (2009). On comparative performance of pure vector
autoregressive moving average and vector bilinear autoregressive moving average
time series models. Asian Journal of Mathematics and Statistics, 2(2):33–40.
Karlsson, N. and Karlsson, S. (2016). Forecasting of the inflation rates in uganda: A
comparison of arima, sarima and vecm models. mathesis, Örebro University School
of Business , Örebro University.
Keenan, D. M. (1985). A tukey non-additivity-type test for time series nonlinearity.
Biometrika, 72:39–44.
Kwiatkowski, D., Phillips, P. C., Schmidt, P., and Shin, Y. (1992). Testing the null
hypothesis of stationarity against the alternative of a unit-root. how sure are we that
economic time series have a unit-root? Journal of Econometrics, 54:159–178.
Ljung, G. M. and Box, G. E. P. (1978). On a measure of lack of fit in time series
models. Biometrika, 65(2):297–303.
74
University of Ghana  http://ugspace.ug.edu.gh
Luguterah, A., Nasiru, S., and Anzagra, L. (2013). A predictive model for monthly
currency in circulation in ghana. Mathematical Theory and Modeling, 3(4):2224–
5804.
Mahipan, K., Chutiman, N., and Kumphon, B. (2013). A forecasting model for
thailand’s unemployment rate. Modern Applied Science, 7(7):10–16.
Marcellino, M. (2007). A comparison of time series models for forecasting gdp growth
and inflation. Working Paper.
Martinez, E. Z. and da Silva, E. A. S. (2011). Predicting the number of cases of dengue
infection in ribeirão preto, são paulo state, brazil, using a sarima model. Revista da
Sociedade Brasileira de Medicina Tropical, 44(4):436–440.
Meese, R. and Geweke, J. (1984). A comparison of autoregressive univariate
forecasting procedures for macroeconomic time series. Journal of Business and
Economic Statistics, 2:191–200.
Murphy, M. J. and Dietrich, S. (2006). A comparative performance of linear and
non-linear neural networks to predict irregular breathing. Physics in medicine and
Biology, 51(22):5903.
Nasiru, S., Luguterah, A., and Anzagra, L. (2013). The efficacy of arimax and sarima
models in predicting monthly currency in circulation in ghana. Mathematical Theory
and Modeling, 3(5):73–81.
Nasiru, S. and Sarpong, S. (2012). Empirical approach to modelling and forecasting
inflation in ghana. Current Research Journal of Economic Theory, 4(3):83–87.
Nchor, D., Kplepac, V., and Adamec, V. (2016). Effects of oil price shocks on the
ghanaian economy. Acta Universitatis Agriculturae et Silviculturae Mendelinanae,
64(1):315–324.
Oduro-Gyimah, F. K., Harris, E., and Darkwah, K. F. (2012). Sarima time series
75
University of Ghana  http://ugspace.ug.edu.gh
model application to microwave transmission of yeji-salaga (ghana) line-of-sight
link. International Journal of Applied Science and Technology, 2(9):40–51.
Otu, O. A., A., O. G., Jude, O., Ifeyinwa, M. H., and Andrew, I. (2014). Application
of sarima models in modelling and forecasting nigeria’s inflation rates. American
Journal of Applied Mathematics and Statistics, 2(1):16–28.
Potter, S. M. (1999). Nonlinear time series modelling: An introduction. Journal of
Economic Survey, 13:505–528.
Rodas, C. F. R., Casas, N. C., and Montenegro, D. A. B. (2017). A performance
comparison of nonlinear and linear control for a dc series motor. Ciencia en
Desarrollo, 8(1):41–50.
Sjoberg, J. (2010). Prediction of industrial production based on nonlinear time series
models. Examensarbete.
Stock, J. H. and Watson, M. W. (2003). Forecasting output and inflation: The role of
asset prices. Journal of Economic Literature, 41(3):788–829.
Swanson, N. R. and White, H. (1995). A model selection approach to assessing the
information in the term structure using linear models and artificial neural networks.
Journal of Business and Economics Statistics, 13:265–275.
Tong, H. (1978). On a threshold model. Pattern Recognition and Signal Processin,
page 101–141.
Tong, H. and Lim, K. (1980). Threshold autoregression limit cycles and cyclical data.
Journal of the Royal Statistical Society Serie, 42:245–292.
Tsay, R. S. and Tiao, G. (1984). Consistent estimates of autoregressive parameters
and extended sample autocorellation function for stationary and nonstationary arma
models. JASA, 79(385).
76
University of Ghana  http://ugspace.ug.edu.gh
van Gysen, M., Huang, C.-S., and Kruger, R. (2013). The performance of linear
versus non-linear models in forecasting returns on the johannesburg stock exchange.
International Business & Economics Research Journal, 12(8):985 –994.
van Ruth, F. (2014). Inflation dynamics in the netherlands; a linear and non-linear
analysis and the influence of economic conditions. Discussion Paper.
Watier, L. and Richardson, S. (1995). Modelling of an epidemiological time series by
a threshold autoregressive model. The Statistician, 44(3):353–364.
Zak, B. P. (2017). Forecasting the exchange rate in the czech republic using non-linear
threshold models. mathesis, Institute of Economic Studies, Charles University in
Prague.
Zhang, H. (2013). Modeling and forecasting regional gdp in sweden using
autoregressive models. mathesis, School for Technology and Business Studies
,Dalarna University .
Öznur, D. A. and GÜNERİ§, . (2015). Time series prediction using hybridization of
ar, setar and arm models. International Journal of Applied Science and Technology,
5(6):87–96.
77