Table Of ContentHindawi Publishing Corporation
e Scientific World Journal
Volume 2014, Article ID 497941, 21 pages
http://dx.doi.org/10.1155/2014/497941
Research Article
Modeling Markov Switching ARMA-GARCH Neural Networks
Models and an Application to Forecasting Stock Returns
MelikeBildirici1andÖzgürErsin2
1YıldızTechnicalUniversity,DepartmentofEconomics,BarbarosBulvari,Besiktas,34349Istanbul,Turkey
2BeykentUniversity,DepartmentofEconomics,Ayazag˘a,S¸i¸sli,34396Istanbul,Turkey
CorrespondenceshouldbeaddressedtoMelikeBildirici;[email protected]
Received20August2013;Accepted4November2013;Published6April2014
AcademicEditors:T.Chen,Q.Cheng,andJ.Yang
Copyright©2014M.BildiriciandO¨.Ersin.ThisisanopenaccessarticledistributedundertheCreativeCommonsAttribution
License,whichpermitsunrestricteduse,distribution,andreproductioninanymedium,providedtheoriginalworkisproperly
cited.
Thestudyhastwoaims.ThefirstaimistoproposeafamilyofnonlinearGARCHmodelsthatincorporatefractionalintegration
andasymmetricpowerpropertiestoMS-GARCHprocesses.ThesecondpurposeofthestudyistoaugmenttheMS-GARCHtype
modelswithartificialneuralnetworkstobenefitfromtheuniversalapproximationpropertiestoachieveimprovedforecasting
accuracy.Therefore,theproposedMarkov-switchingMS-ARMA-FIGARCH,APGARCH,andFIAPGARCHprocessesarefurther
augmentedwithMLP,RecurrentNN,andHybridNNtypeneuralnetworks.TheMS-ARMA-GARCHfamilyandMS-ARMA-
GARCH-NNfamilyareutilizedformodelingthedailystockreturnsinanemergingmarket,theIstanbulStockIndex(ISE100).
ForecastaccuracyisevaluatedintermsofMAE,MSE,andRMSEerrorcriteriaandDiebold-Marianoequalforecastaccuracytests.
TheresultssuggestthatthefractionallyintegratedandasymmetricpowercounterpartsofGray’sMS-GARCHmodelprovided
promisingresults,whilethebestresultsareobtainedfortheirneuralnetworkbasedcounterparts.Further,amongthemodels
analyzed,themodelsbasedontheHybrid-MLPandRecurrent-NN,theMS-ARMA-FIAPGARCH-HybridMLP,andMS-ARMA-
FIAPGARCH-RNNprovidedthebestforecastperformancesoverthebaselinesingleregimeGARCHmodelsandfurther,overthe
Gray’sMS-GARCHmodel.Therefore,themodelsarepromisingforvariouseconomicapplications.
1.Introduction on volatility, the financial returns are under the influence
of sudden or abrupt changes in the economy. Hence, the
Inthelightofthesignificantimprovementsintheeconomet- volatilityofeconomicdatahasbeenexploredineconometric
ric techniques and in the computer technologies, modeling literature as a result of the need of modelling uncertainty
the financial time series have been subject to accelerated and risk in the financial returns. The relationship between
empiricalinvestigationintheliterature.Accordingly,follow- the financial returns and various important factors such as
ing the developments in the nonlinear techniques, analyses the trade volume, market price of financial assets, and the
focusingonthevolatilityinfinancialreturnsandeconomic relationship between volatility, trade volume, and financial
variables are observed to provide significant contributions. returnshavebeenvigorouslyinvestigated[1–4].
It could be stated that important steps have been taken TheARCHmodelintroducedbyEngle[5]andtheGener-
in terms of nonlinearmeasurement techniques focusing on alizedARCH(GARCH)modelintroducedbyBollerslev[6]
the instability or stability occurring vis-a-vis encountered are generally accepted for measuring volatility in financial
volatility. Further, the determination of stability or insta- models. GARCH models have been used intensively in
bility in terms of volatility in the financial markets gains academicstudies.AtremendousamountofGARCHmodels
importanceespeciallyforanalyzingtheriskencountered.In existandvariousstudiesprovideextendedevaluationofthe
addition to impact of the magnitude and the size of shocks development.
2 TheScientificWorldJournal
Among many, Engle and Bollerslev [7] developed the cited studies discussing high persistence in volatilitydue to
Integrated-GARCH(I-GARCH)processtoincorporateinte- structural changes. Lamoureux and Lastrapes [24] showed
gration properties, AGARCH model, introduced by Engle thattheencounteredhighpersistenceinvolatilityprocesses
[8], allows modeling asymmetric effects of negative and resultedfromthevolumeeffectsthathadnotbeentakeninto
positive innovations. In terms of modeling asymmetries, account.QiaoandWong[25]followedabivariateapproach
GARCH models have been further developed by including andconfirmedthattheLamoureuxandLastrapes[24]effect
asymmetric impacts of the positive and negative shocks
existsduetothevolumeandturnovereffectsonconditional
to capture the asymmetric effects of shocks on volatility
volatility and after the introduction of volume/turnover as
and return series which depend on the type of shocks, i.e.
exogenous variables, it is possible to obtain a significant
either negative or positive. Following the generalization of
decline in the the persistence. Mikosch and Staˇricaˇ [26]
EGARCH model of Nelson [9] that allows modelling the
showedthatstructuralchangeshadanimportantimpactthat
asymmetriesintherelationshipbetweenreturnandvolatility,
leads to accepting an integrated GARCH process. Bauwens
the Glosten et al. [10] noted the importance of asymmetry
etal.[27,28]discussedthatthepersistenceintheestimated
causedbygoodandbadnewsinvolatileseriesandproposeda
single regime GARCH processes could be considered as
modelthatincorporatesthepastnegativeandpositiveinno-
resulting from the misspecification which could be con-
vations with an identity function that leads the conditional
trolled by introducing an MS-GARCH specification where
variancetofollowdifferentprocessesduetoasymmetry.The
theregimeswitchesaregovernedbyahiddenMarkovchain.
finding is a result of the empirical analyses which pointed
Kra¨mer[29]evaluatedtheautocorrelationinthesquared
at the fact that the negative shocks had a larger impact on
errortermsandprovidedanimportantcontribution.Accord-
volatility. Consequently, the bad news have a larger impact ingly, the observed empirical autocorrelations of the 𝜀2 are
compared to the conditional volatility dynamics followed 𝑡
muchlargerthanthetheoreticalautocorrelationsimpliedby
afterthegoodnews.Duetothiseffect,asymmetricGARCH
theestimatedparametersthroughevaluatinganMS-GARCH
modelshaverapidlyexpanded.TheGJR-GARCHmodelwas
modelwheretheautocorrelationproblemcouldbeshownto
developedindependentlybyZakoian[11,12]andGlostenet
accelerate as the transition probabilities approached 1. (For
al.[10].Itshouldbenotedthat,intermsofasymmetry,the
a proof see e.g., Francq and Zako¨ıan [30]; Kra¨mer [29])
ThresholdGARCH(T-GARCH)ofZakoian[12],VGARCH,
(Inparticular,theempiricalautocorrelationsofthe𝜀2 often
and nonlinear asymmetric GARCH models (NAGARCH) 𝑡
seem to indicate longmemory,which is notpossible in the
of Engle and Ng [13] are closely related versions to model
GARCH-model; in fact, in all standard GARCH-models,
asymmetry in financial asset returns. The SQR-GARCH
theoretical autocorrelations must eventually decrease expo-
modelofHestonandNandi[14]andtheAug-GARCHmodel
nentially,solongmemoryisruledout).AlexanderandLazaar
developedbyDuan[15]nestseveralversionsofthemodels
[31]showedthatleverageeffectsareduetoasymmetryinthe
taking asymmetry discussed above. Further, models such
volatilityresponsestothepriceshocksandtheleverageeffect
as the Generalized Quadratic GARCH (GQARCH) model
acceleratesoncethemarketsareinthemorevolatileregime
of Sentana [16] utilize multiplicative error terms to capture
Kra¨merandTameze[32]showedthatasinglestateGARCH
volatilitymoreeffectively.TheFIGARCHmodelofBaillieet
modelhadonlyonemeanreversionwhilebyallowingregime
al.[17]benefitsfromanARFIMAtypefractionalintegration
switching in the GARCH processes, mean reverting effect
representationtobettercapturethelong-rundynamicsinthe
diminishes. In a perspective of volatility, if these shifts are
conditionalvariance(Seefordetailedinformation,Bollerslev
persistent,thentherearetwosourcesofvolatilitypersistence,
[18]).TheAPARCH/APGARCHmodelofDingetal.[19]is
duetoshocksandduetoregime-switchingintheparameters
an asymmetric model that incorporates asymmetric power
ofthevarianceprocess.ByutilizingaMarkovtransformation
terms which are allowed to be estimated directly from the
model, it could be shown that the relationships among the
data.TheAPGARCHmodelalsonestsseveralmodelssuch
regimesbetweentheperiodsof𝑡−1and𝑡couldbeexplained
as the TGARCH, TSGARCH, GJR, and logGARCH. The
andthemostimportantadvantageoftheMSGARCHmodel
FIAPGARCHmodelofTse[20]combinestheFIGARCHand
exposesitselfasthereisnoneedfortheresearcherstoobserve
the APGARCH. Hyperbolic GARCH (HYGARCH) model
the regime changes. The model allows different regimes to
of Davidson [21] nests the ARCH, GARCH, IGARCH, and
revealbyitself[33].
FIGARCHmodels(foranextendedreviewGARCHmodels,
seeBollerslev[18]). The regime switching in light of the Markov switching
EventhoughtheARCH/GARCHmodelscanbeapplied model has interesting properties to be examined such as
quickly for many time series, the shortcomings in these thestationaritybyallowingtheswitchingcourseofvolatility
models were discussed by certain studies. Perez-Quiros inherent in the asset prices. The hidden Markov model
and Timmermann [22] focused on the conditional distri- (HMM)developedbyTaylor[34]isaswitchingmodelthat
butions of financial returns and showed that recessionary benefits from including an unobserved variable to capture
and expansionary periods possess different characteristics, volatilitytobemodeledwithtransitionsbetweenthehidden
while the parameters of a GARCH model are assumed to statesthatpossessdifferentprobabilitydistributionsattached
bestableforthewholeperiod.Certainstudiesdiscussedthe to each state. Hidden Markov model has been applied
highvolatilitypersistenceinheritedinthebaselineGARCH successfully by Alexander and Dimitriu [35], Cheung and
and proposed early signs of regime switches. Diebold [23] Erlandsson [36], Francis and Owyang [37], and by Clarida
and Lamoureux and Lastrapes [24] are two of the highly et al. [38] to capture the switching type of predictions in
TheScientificWorldJournal 3
stock returns, interest rates, and exchange rates. Regime versions: the MS-ARMA-GARCH-HybridNN, MS-ARMA-
switching model has been used extensively for prediction APGARCH-HybridNN, MS-ARMA-FIGARCH-HybridNN,
of returns belonging to different stock market returns in andMS-ARMA-FIAPGARCH-HybridNN.
differenteconomiesandbyfollowingthefactthatthestock
market indices are very sensitive to stock volatility, which
acceleratesespeciallyduringperiodswithmarketturbulences 2.TheMS-GARCHModels
(seefordetailedinformation,AlexanderandKaeck,[39]).
Over long periods, there are many reasons why financial
The conventional statistical techniques for forecasting
seriesexhibitimportantbreaksinbehavior;examplesinclude
reached their limit in applications with nonlinearities, fur-
depression, recession, bankruptcies, natural disasters, and
thermore,recentresultssuggestthatnonlinearmodelstend
market panics, as well as changes in government policies,
to perform better in models for stock returns forecasting
investor expectations, or the political instability resulting
[40]. For this reason, many researchers have used artificial
fromregimechange.
neuralnetworksmethodologiesforfinancialanalysisonthe
Diebold[23]providedathroughoutanalysisonvolatility
stockmarket.LaiandWong[41]contributedtothenonlinear models.Oneoftheimportantfindingsisthefactthatvolatil-
timeseriesmodelingmethodologybymakinguseofsingle- itymodelsthatfailtoadequatelyincorporatenonlinearityare
layer neural network. Further, modeling of NN models for subjecttoanupwardbiasintheparameterestimateswhich
estimation and prediction for time series have important results in strong forms of persistence that occurs especially
contributions. Weigend et al. [42], Weigend and Gershen- inhighvolatilityperiodsinfinancialtimeseries.Asaresult
feld [43], White [44], Hutchinson et al. [45], and Refenes ofthebiasintheparameterestimates,oneimportantresult
et al. [46] contributed to financial analyses, stock market ofthisfactisontheout-of-sampleforecastsofsingleregime
returns estimation, pattern recognition, and optimization. typeGARCHmodels.Accordingly,Schwert[66]proposeda
NNmodelingmethodologyisappliedsuccessfullybyWang model that incorporates regime switching that is governed
et al. [47] and Wang [48] to forecast the value of stock by a two state Markov process, hence the model retains
indices. Similarly, Abhyankar et al. [49], Castiglione [50], differentcharacteristicsintheregimesthataredefinedashigh
Freisleben[51],KimandChun[52],LiuandYao[53],Phua volatilityandlowvolatilityregimes.
Hamilton [67] proposed the early applications of HMC
etal.[54],Refenesetal.[55],Resta[56],R.SitteandJ.Sitte
modelswithinaMarkovswitchingframework.Accordingly,
[57],Tinˇoetal.[58],YaoandPoh[59],andYaoandTan[60]
MS models were estimated by maximum likelihood (ML)
are important investigations focusing on the relationships
wheretheregimeprobabilitiesareobtainedbytheproposed
between stock prices and market volumes and volatility.
Hamilton-filter[68–71].MLestimationofthemodelisbased
For similar applications, see [1–4]. Bildirici and Ersin [61]
on a version of the Expectation Maximization (EM) algo-
modeledNN-GARCHfamilymodelstoforecastdailystock
rithmasdiscussedinHamilton[72],Krolzig[73–76].Inthe
returnsforshortandlongrunhorizonsandtheyshowedthat
MSmodels,regimechangesareunobservedandareadiscrete
GARCH models augmented with artificial neural networks
state of a Markov chain which governs the endogenous
(ANN) architectures and algorithms provided significant
switches between different AR processes throughout time.
forecasting performances. Ou and Wang [62] extended the
By inferring the probabilities of the unobserved regimes
NN-GARCH models to Support Vector Machines. Azadeh
whichareconditionalonaninformationset,itispossibleto
etal.[63]evaluatedNN-GARCHmodelsandproposedthe reconstructtheregimeswitches[77].
integrated ANN models. Bahrammirzaee [64] provided an Furthermore, certain studies aimed at the development
analysisbasedonfinancialmarketstoevaluatetheartificial of modeling techniques which incorporate both the proba-
neuralnetworks,expertsystems,andhybridintelligencesys- bilisticpropertiesandtheestimationofaMarkovswitching
tems.Further,KanasandYannopoulos[65]andKanas[40] ARCHandGARCHmodels.Aconditionforthestationarity
usedMarkovswitchingandNeuralNetworkstechniquesfor of a natural path-dependent Markov switching GARCH
forecasting stock returns; however, their approaches depart model as in Francq et al. [78] and a throughout analysis
fromtheapproachfollowedwithinthisstudy. oftheprobabilisticstructureofthatmodel,withconditions
Inthisstudy,theneuralnetworksandMarkovswitching for the existence of moments of any order, are developed
structuresareaimedtobeintegratedtoaugmenttheARMA- and investigated in Francq and Zako¨ıan [30]. Wong and
GARCH models by incorporating regime switching and Li [79], Alexander and Lazaar [80], and Haas et al. [81–
different neural networks structures. The approach aims at 83] derived stationarity analysis for some mixing models
formulations and estimations of MS-ARMA-GARCH-MLP, of conditional heteroskedasticity [27, 28]. For the Markov
MS-ARMA-APGARCH-MLP, MS-ARMA-FIGARCH-MLP, switchingGARCHmodelsthatavoidthedependencyofthe
MS-ARMA-FIAPGARCH-MLP, MS-ARMA-GARCH-RBF conditional variance on the chain’s history, the stationarity
MS-ARMA-APGARCH-RBF, MS-ARMA-FIGARCH-RBF, conditionsareknownforsomespecialcasesintheliterature
and MS-ARMA-FIAPGARCH-RBF; the recurrent neural [84].Klaassen[85]developedtheconditionsforstationarity
network augmentationsof the models are, namely, the MS- of the model as the special cases of the two regimes.
ARMA-GARCH-RNN MS-ARMA-APGARCH-RNN, MS- A necessary and sufficient stationarity condition has been
ARMA-FIGARCH-RNN, and MS-ARMA-FIAPGARCH- developedbyHaasetal.[81–83]fortheirMarkovswitching
RNN. And lastly, the paper aims at providing Hybrid NN GARCHmodel.Furthermore,Cai[86]showedtheproperties
4 TheScientificWorldJournal
ofBayesianestimationofaMarkovswitchingARCHmodel conditionalvariancesareinthesameregime(fordetails,the
where only the constant in the ARCH equation is allowed readersarereferredtoBauwensetal.[27,28],Klaassen[85],
tohaveregimeswitches.Theapproachhasbeeninvestigated Haas et al. [81–83], Francq and Zako¨ıan [30], Kra¨mer [29],
by Kaufman and Fru¨hwirth-Schnatter [87] and Kaufmann andAlexanderandKaeck[39]).
and Scheicher [88]. Das and Yoo [89] proposed an MCMC Another area of analysis pioneered by Haas [97] and
algorithmforthesamemodel(switchesbeingallowedinthe Chang et al. [98] allow different distributions in order to
constantterm)withasinglestateGARCHtermtoshowthat gainforecastaccuracy.Animportantfindingofthesestudies
gainscouldbeachievedtoovercomepath-dependence.MS- showedthatbyallowingtheregimedensitiestofollowskew-
GARCHmodelsarestudiedbyFrancqandZako¨ıan[30]to normaldistributionwithGaussiantailcharacteristics,several
achievetheirnon-Bayesianestimationpropertiesinlightof return series could be modeled more efficiently in terms
thegeneralizedmethodofmoments.Bauwensetal.[27,28] of forecast accuracy. Liu [99] developed and discussed the
proposed a Bayesian Markov chain Monte Carlo (MCMC) conditions for stationarity in Markov switching GARCH
algorithmthatisdifferentiatedbyincludingthestatevariables structure in Haas et al. [81–83] and proved the existence
in the parameter space to control the path-dependence by of the moments. In addition, Abramson and Cohen [100]
obtainingtheparameterspacewithGibbssampling[90]. discussed and further evaluated the stationarity conditions
ThehighandlowvolatilityprobabilitiesofMS-GARCH in a Markov switching GARCH process and extended the
modelsallowdifferentiatinghighandlowvolatilityperiods. analysis to a general case with m-state Markov chains and
By observing the periods in which volatility is high, it is GARCH(𝑝,𝑞) processes. An evaluation and extension of
possibletoinvestigatetheeconomicandpoliticalreasonsthat the stationarity conditions for a class of nonlinear GARCH
caused increased volatility. If a brief overview is to be pre- models are investigated in Abramson and Cohen [100].
sented,thereareseveralmodelsbasedontheideaofregime Francq and Zako¨ıan [30] derived the conditions for weak
changeswhichshouldbementioned.Schwert[66]explores stationarity and existence of moments of any order MS-
amodelinwhichswitchesbetweenthesestatesthatreturns GARCH model. Bauwens et al. [27, 28] showed that by
canhaveahighorlowvariancearedeterminedbyatwo-state enlarging the parameter space to include space variables,
Markovprocess.LamoureuxandLastrapes[24]suggestthe though maximum likelihood estimation is not feasible, the
useofMarkovswitchingmodelsforawayofidentifyingthe Bayesianestimationoftheextendedprocessisfeasiblefora
timingoftheshiftsintheunconditionalvariance.Hamilton modelwheretheregimechangesaregovernedwithahidden
and Susmel [91] and Cai [86] proposed Markov switching Markov chain. Further, Bauwens et al. [27, 28] accepted
ARCH model to capture the effects of sudden shifts in the mildregularityconditionsunderwhichtheMarkovchainis
conditional variance. Further, Hamilton and Susmel [91] geometricallyergodicandhasfinitemomentsandisstrictly
extended the analysis to a model that allows three regimes, stationary.
which were differentiated between low, moderate and high
volatilityregimes,wherethehigh-volatilityregimecaptured 2.1.MS-ARMA-GARCHModels. Toavoidpath-dependence
theeconomicrecessions.Itisacceptedthattheproposalsof problem,Gray[92]suggestsintegratingouttheunobserved
Cai[86]andHamiltonandSusmel[91]helpedtheresearchers regime path in the GARCH term by using the conditional
tocontrolfortheproblemofpathdependence,whichmakes expectationofthepastvariance.Gray’sMS-GARCHmodel
thecomputationofthelikelihoodfunctionimpossible(The isrepresentedasfollows:
conditionalvarianceattimetdependsontheentiresequence
of regimes up to time t due to the recursive nature of the 𝜎2 =𝑤 +∑𝑞 𝛼 𝜀2 +∑𝑝 𝛽 𝐸( 𝜀𝑡2−𝑗 )
GARCHprocess.InMarkovswitchingmodel,theregimesare 𝑡,(𝑠𝑡) (𝑠𝑡) 𝑖=1 𝑖,(𝑠𝑡) 𝑡−𝑖 𝑗=1 𝑗,(𝑠𝑡) 𝐼𝑡−𝑗−1
unobservable,oneneedstointegrateoverallpossibleregime
paths.Thenumberofpossiblepathsgrowsexponentiallywith 𝑞
t, which renders ML estimation intractable.) (see for detail, =𝑤 +∑𝛼 𝜀2 (1)
(𝑠) 𝑖,(𝑠) 𝑡−𝑖
𝑡 𝑡
Bauwens,etal.[27,28]). 𝑖=1
Gray[92]studyisoneoftheimportantstudieswherea
𝑝 𝑚 𝑠
MarkovswitchingGARCHmodelisproposedtoovercome +∑𝛽 ∑𝑃(𝑠 = 𝑡−𝑗 )𝜎2 ,𝑠 ,
the path dependence problem. According to Gray’s model, 𝑗,(𝑠𝑡) 𝑡−𝑗 𝐼 𝑡−𝑗 𝑡−𝑗
𝑗=1 𝑠𝑡−𝑗=1 𝑡−𝑗−1
once the conditional volatility processes are differentiated
betweenregimes,anaggregationoftheconditionalvariances where 𝑤𝑠𝑡 > 0, 𝛼𝑖,𝑠𝑡 ≥ 0, 𝛽𝑗,𝑠𝑡 ≥ 0, and 𝑖 = 1,...,𝑞,
fortheregimescouldbeusedtoconstructasinglevariance 𝑗 = 1,...,𝑝, 𝑠𝑡 = 1,...,𝑚. The probabilistic structure of
coefficienttoevaluatethepathdependence.Amodification the switching regime indicator 𝑠𝑡 is defined as a first-order
is also conducted by Klaassen [85]. Yang [93], Yao and Markovprocesswithconstanttransitionprobabilities𝜋1and
Attali [94], Yao [95], and Francq and Zako¨ıan [96] derived 𝜋2, respectively (Pr{𝑠𝑡 = 1 | 𝑠𝑡−1 = 1} = 𝜋1, Pr{𝑠𝑡 = 2 |
conditions for the asymptotic stationarity of some AR and 𝑠𝑡−1 = 1} = 1 − 𝜋1, Pr{𝑠𝑡 = 2 | 𝑠𝑡−1 = 2} = 𝜋2, and
ARMA models with Markov switching regimes. Haas et al. Pr{𝑠𝑡 =1|𝑠𝑡−1 =2}=1−𝜋2).
[81–83]investigatedaMS-GARCHmodelbywhichafinite AlthoughDueker[101]acceptsacollapsingprocedureof
state-space Markov chain is assumed to govern the ARCH Kim’s [102] algorithm to overcome path-dependence prob-
parameters, whereas the autoregressive process followed by lem,Dueker[101]adoptsthesameframeworkofGray[92].
theconditionalvarianceissubjecttotheassumptionthatpast Accordingly,themodifiedGARCHversionofDueker[101]is
TheScientificWorldJournal 5
𝜀 =𝐸[𝜀 |𝑠 ,𝑌 ],
acceptedwhichgovernsthedispersioninsteadoftraditional 𝑡−𝑖−1,(𝑠 ) 𝑡−𝑖−1,(𝑠 ) 𝑡−𝑖 𝑡−𝑖−1
𝑡−𝑖 𝑡−𝑖−1
GARCH(1,1)specification. (3)
Yang [103], Yao and Attali [94], Yao [95], and Francq 𝜎𝑡−𝑖−1,(𝑠 ) =𝐸[𝜀𝑡−𝑖−1,(𝑠 ) |𝑠𝑡−𝑖,𝑌𝑡−𝑖−1].
𝑡−𝑖 𝑡−𝑖−1
and Zako¨ıan [96] derived conditions for the asymptotic
stationarity of models with Markov switching regimes (see Thus, the parameters have nonnegativity constraints
for detailed information Bauwens and Rombouts [104]. 𝜙,𝜃,𝜑,𝑤,𝛼,𝛽>0andtheregimesaredeterminedby𝑠𝑡,
The major differences between Markov switching GARCH
𝑇
ims,otdheelscoarneditthioensaplevcairfiicaantcioen𝜎𝑡2of=theVavra(r𝜀i𝑡a/n𝑆c𝑡e).pTroocceossn;sitdheart 𝐿=∏𝑡=1𝑓(𝑦𝑡 |𝑠𝑡 =𝑖,𝑌𝑡−1)Pr[𝑠𝑡 =𝑖|𝑌𝑡−1], (4)
theconditionalvarianceasintheBollerslev’s[105]GARCH
modelandtoconsidertheregimedependentequationforthe and the probability Pr[𝑠𝑡 = 𝑖 | 𝑌𝑡−1] is calculated through
conditionalvarianceinFro¨mmel[106]areacceptedthatThe iteration:
coefficients𝑤𝑠𝑡,𝛼𝑠𝑡,𝛽𝑠𝑡 correspondtorespectivecoefficients 𝜋𝑗𝑡 =Pr[𝑠𝑡 =𝑗|𝑌𝑡−1]
intheone-regimeGARCHmodel,butmaydifferdepending
onthepresentstate. 1 1
Klaassen [85] (Klassen [85] model is defined as 𝜎𝑡2,(𝑠) = =∑Pr[𝑠𝑡 =𝑗|𝑠𝑡−1 =𝑖]Pr[𝑠𝑡 =𝑗|𝑌𝑡−1]∑𝜂𝑗𝑖𝜋𝑖∗𝑡−1.
𝑤 +∑𝑞 𝛼 𝜀2 +∑𝑝 𝛽 ∑𝑚 𝑃(𝑆 =𝑠 |𝐼 ,𝑆𝑡 = 𝑖=0 𝑖=0
(𝑠𝑡) 𝑖=1 𝑖,(𝑠𝑡) 𝑡−𝑖 𝑗=1 𝑗,(𝑠𝑡) ̃𝑠=1 𝑡−𝑗 𝑡−𝑗 𝑡−1 𝑡 (5)
𝑠𝑡)𝜎𝑡2−𝑗,𝑠𝑡−𝑗) suggested to use the conditional expectation of
the lagged conditionalvariance with a broader information Accordingly,thetwomodels,theHennekeetal.[108]andthe
set than the model derived in Gray [92]. Accordingly, Francq et al. [78] approaches, could be easily differentiated
Klaassen [85] suggested modifying Gray’s [92] model by throughthedefinitionsof𝜀𝑡2−1and𝜎𝑡−1.Further,asymmetric
replacing𝑝(𝑠𝑡−𝑗 =𝑠𝑡−𝑗 |𝐼𝑡−𝑗−1) by𝑝(𝑠𝑡−𝑗 =𝑠𝑡−𝑗 |𝐼𝑡−1,𝑆𝑡 =𝑠𝑡) powertermsandfractionalintegrationwillbeintroducedto
whileevaluating 𝜎𝑡2,𝑠𝑡. thederivedmodelinthefollowingsections.
Another version of MS-GARCH model is developed by
Haas et al. [81–83]. According to this model, Markov chain 2.2.MS-ARMA-APGARCHModel. Liu[99]providedagen-
controlstheARCHparametersateachregime(𝑤𝑠,𝛼𝑖,𝑠)and eralizationoftheMarkovswitchingGARCHmodelofHaas
the autoregressive behavior in each regime is subject to the et al. [81–83] and derived the conditions for stationarity
assumptionthatthepastconditionalvariancesareinthesame and for the existence of moments. Liu [99] proposes a
regimeasthatofthecurrentconditionalvariance[100]. model which allowed for a nonlinear relation between past
In this study, models will be derived following the MS- shocksandfuturevolatilityaswellasfortheleverageeffects.
ARMA-GARCH specification in the spirit of Blazsek and The leverage effect is an outcome of the observation that
Downarowicz [107] where the properties of MS-ARMA- the reaction of stock market volatility differed significantly
GARCH processes were derived following Gray [92] and to the positive and the negative innovations. Haas et al.
Klaassen [85] framework. Henneke et al. [108] developed [109,110]complementsLiu’s[99]workintwoways.Firstly,
an approach to investigate the model derived in Francq the representation of the model developed by Haas [109]
et al. [78] for which the Bayesian framework was derived. allows computational ease for obtaining the unconditional
The stationarity of the model was evaluated by Francq and moments.Secondly,thedynamicautocorrelationstructureof
Zako¨ıan [96] and an algorithm to compute the Bayesian thepower-transformedabsolutereturns(residuals)wastaken
estimator of the regimes and parameters was developed. It asameasureofvolatility.
shouldbenotedthattheMS-ARMA-GARCHmodelsinthis Haas [109] model assumes that time series {𝜀𝑡,𝑡 ∈ Z}
paper were developed by following the models developed followsakregimeMS-APGARCHprocess,
in the spirit of Gray [92] and Klaassen [85] similar to the
𝜀 =𝜂𝜎 𝑡∈Z,
frameworkofBlazsekandDownarowicz[107]. 𝑡 𝑡 Δ𝑡,𝑡 (6)
TheMS-ARMA-GARCHmodelwithregimeswitchingin with {𝜂𝑡,𝑡 ∈ Z} being i.i.d. sequence and {Δ𝑡,𝑡 ∈ Z} is a
the conditional mean and variance are defined as a regime Markovchainwithfinitestatespace𝑆 = {1,...,𝑘}and𝑃is
switchingmodelwheretheregimeswitchesaregovernedby
the irreducible and aperiodic transition matrix with typical
anunobservedMarkovchainintheconditionalmeanandin element𝑝𝑖𝑗 =𝑝(Δ𝑡 =𝑗|Δ𝑡−1 =𝑖)sothat
theconditionalvarianceprocessesas
𝑃=[𝑝𝑖𝑗]=[𝑝(Δ𝑡 =𝑗|Δ𝑡−1 =𝑖)], 𝑖,𝑗=1,...,𝑘. (7)
𝑟 𝑚
𝑦 =𝑐 +∑𝜃 𝑦 +𝜀 +∑𝜑 𝜀 ,
𝑡 (𝑠) 𝑖,(𝑠) 𝑡−𝑖 𝑡,(𝑠) 𝑗,(𝑠) 𝑡−𝑗,(𝑠) The stationary distribution of Markov-chain is shown as
𝑡 𝑡 𝑡 𝑡 𝑡
𝑖=1 𝑗=1 𝜋∞ =(𝜋1,∞,...,𝜋𝑘,∞).
(2)
𝑝 𝑞 According to the Liu [99] notation of MS-APGARCH
𝜎2 =𝑤 +∑𝛼 𝜀2 +∑𝛽 𝜎 , model, the conditional variance 𝜎2 of jth regime follows a
𝑡,(𝑠) (𝑠) 𝑖,(𝑠) 𝑡−𝑖,(𝑠) (𝑠) 𝑡−𝑗,(𝑠) 𝑗𝑡
𝑡 𝑡 𝑡 𝑡 𝑡 𝑡
𝑖=1 𝑗=1 univariateAPGARCHprocessasfollows:
where, 𝜎𝑗𝛿𝑡 =𝑤𝑗+𝛼1𝑗𝜀𝑡+−1𝛿+𝛼2𝑗𝜀𝑡−−1𝛿+𝛽𝑗𝜎𝑗𝛿,𝑡−1, 𝛿>0, (8)
6 TheScientificWorldJournal
where,𝑤𝑗 > 0,𝛼1𝑗, 𝛼2𝑗,𝛽𝑗 ≥ 0,𝑗 = 1,...,𝑘.Forthepower MS-ARMA-FIAPGARCHderivedisafractionalintegra-
term𝛿=2andfor𝛼1𝑗 =𝛼2𝑗,themodelin(8)reducestoMS- tionaugmentedmodelasfollows:
GARCHmodel.SimilartotheDingetal.[19],theasymmetry,
Iwfhthicehpiassctanlelegdat“ilveevsehraogcekseffheacvte,”diesecpaeprtuimrepdabcty,p𝛼a1𝑗ra=m̸𝛼e2t𝑗e[r1s0a9r]e. (1−𝛽(𝑠𝑡)𝐿)𝜎𝑡𝛿,((𝑠𝑠𝑡)𝑡)
expectedtobe𝛼1𝑗 < 𝛼2𝑗 sothattheleverageeffectbecomes =𝜔+((1−𝛽(𝑠)𝐿)−(1−𝛼(𝑠)𝐿)(1−𝐿)𝑑(𝑠𝑡)) (11)
stronger. 𝑡 𝑡
theAHnaoatsh[e1r09a]ppmrooadcehl,twhhateries tshimeialasyrmtomLetiury[t9e9r]mmsohdaevleias ×(𝜀𝑡−1−𝛾(𝑠𝑡)𝜀𝑡−1)𝛿(𝑠𝑡),
differentiatedformas
wherethelagoperatorisdenotedby𝐿,autoregressiveparam-
etersare𝛽(𝑠),and𝛼(𝑠)showsthemovingaverageparameters,
𝜎𝑗𝛿𝑡 =𝑤𝑗+𝛼𝑗(𝜀𝑡−1−𝛾𝑗𝜀𝑡−1)𝛿+𝛽𝑗𝜎𝑗𝛿,𝑡−1, 𝛿>0, (9) 𝛿fr(a1)ctio>na0l dd𝑡ieffneorteenstita𝑡htieonopptiamraaml petoewrevratrrieasnsbfoertmweaetinon0, th≤e
𝑑(𝑠) ≤ 1 and allows long memory to be integrated to the
with the restrictions 0 < 𝑤𝑗, 𝛼𝑗, 𝛽𝑗 ≥ 0, 𝛾𝑗 ∈ [−1,1] with mo𝑡del. Regime states (𝑠𝑡) are defined with 𝑚 regimes as
regimes 𝑗 = 1,...,𝑘. The MS-APGARCH model of Haas 𝑖 = 1,...,𝑚. The asymmetry term |𝛾(𝑠)| < 1 ensures that
𝑡
[109] reduces to Ding et al. [19] single regime APGARCH positiveandnegativeinnovationsofthesamesizemayhave
model if 𝑗 = 1. Equation (9) reduces to Liu [99] MS- asymmetric effects on the conditional variance in different
APGARCH specification if 𝛼1𝑗 = 𝛼𝑗(1−𝛾𝑗)𝛿 and 𝛽𝑗 = regimes.
𝛼𝑗𝑠(1+𝛾𝑗)𝛿. ARMSiAm-iFlaIArPtoGAthReCMHSm-AoRdMelAn-eAstPsGseAvRerCaHlmmodoedlesl.,BtyheapMplSy--
The MS-ARMA-GARCH type model specification in ing 𝛿(𝑠) = 2, the model reduces to Markov switching
this study assumes that the conditional mean follows MS- 𝑡
fractionally integrated asymmetric GARCH (MS-ARMA-
ARMA process, whereas the conditional variance follows FIAGARCH); if 𝛿(𝑠) = 2 restriction is applied with 𝛾(𝑠) =
regime switching in the GARCH architecture. Accordingly, 0, themodelreduce𝑡stoMarkovswitchingFIGARCH (M𝑡 S-
MS-ARMA-APGARCHarchitecturenestsseveralmodelsby ARMA-FIGARCH).For𝑑(𝑠) =0,modelreducestotheshort
applying certain restrictions. The MS-ARMA-APGARCH 𝑡
memory version, the MS-ARMA-APGARCH model, if the
mthoedceolnidsitdioenriavlemdebaynmanodviMngS-AfrPomGAMRCS-HA(R𝑙,M𝑚A)cpornodcietisosnianl additionalconstraint𝛿(𝑠𝑡) = 2isapplied,themodelreduces
to MS-Asymmetric GARCH (MS-AGARCH). Lastly, for all
varianceprocessasfollows: the models mentioned above if 𝑖 = 1, all models reduce
to single regime versions of the relevant models, namely,
𝑟 the FIAPGARCH, FIAGARCH, FIGARCH, and AGARCH
𝜎𝑡𝛿,((𝑠𝑠𝑡)𝑡) =𝑤(𝑠𝑡)+∑𝑙=1𝛼𝑙,(𝑠𝑡)(𝜀𝑡−𝑙−𝛾𝑙,(𝑠𝑡)𝜀𝑡−𝑙)𝛿(𝑠𝑡) mwiothdetlhse, tchoenirstrrealienvtsan𝑖t=sin1galnedre𝛿g(𝑠im) e=v2a,r𝛾ia(𝑠n)ts=. F0o,rthaetympoicdaell,
(10) reducestosingleregimeFIGARCH𝑡 modelo𝑡fBaillieetal.[17].
𝑞
+ ∑𝛽 𝜎𝛿(𝑠𝑡) , 𝛿 >0, TodifferentiatebetweentheGARCHspecifications,forecast
𝑚=1 𝑚,(𝑠𝑡) 𝑡−𝑚,(𝑠𝑡) (𝑠𝑡) performancecriteriacomparisonsareassumed.
3.NeuralNetworkand
where the regime switches are governed by (𝑠𝑡) and the
parameters are restricted as 𝑤(𝑠) > 0, 𝛼𝑙,(𝑠), 𝛽𝑚,(𝑠) ≥ MS-ARMA-GARCHModels
0 with 𝛾𝑙,(𝑠𝑡) ∈ (−1,1), 𝑙 =𝑡 1,...,𝑟. One𝑡 impo𝑡rtant Inthissectionofthestudy,theMultiLayerPerceptron,Rad-
differenceisthatMS-ARMA-APGARCHmodelin(10)allows
ical Basis Function, and Recurrent Neural Network models
thepowerparameterstovaryacrossregimes.Further,ifthe
thatbelongtotheANNfamilywillbecombinedwithMarkov
followingrestrictionsareapplied,𝑙 = 1,𝑗 = 1,𝛿(𝑠𝑡) = 𝛿,the switching and GARCH models. In this respect, Spezia and
modelreducestothemodelofHaas[109]givenin(9).
Paroli[113]isanotherstudythatmergedtheNeuralNetwork
In applied economics literature, it is shown that many
andMS-ARCHmodels.
financialtimeseriespossesslongmemory,whichcanbefrac-
tionallyintegrated.Fractionalintegrationwillbeintroduced
totheMS-ARMA-APGARCHmodelgivenabove. 3.1.MultilayerPerceptron(MLP)Models
3.1.1. MS-ARMA-GARCH-MLP Model. Artificial Neural
2.3.MS-ARMA-FIAPGARCHModel. AndersenandBoller- Network models have many applications in modeling of
slev [111], Baillie et al. [17], Tse [112], and Ding et al. [19] functional forms in various fields. In economics literature,
providedinterestingapplicationsinwhichtheattentionhad the early studies such as Dutta and Shektar [114], Tom and
been directed on long memory. Long memory could be Kiang [115], Do and Grudinsky [116], Freisleben [51], and
incorporated to the model above by introducing fractional Refenes et al. [55] utilize ANN models to option pricing,
integration in the conditional mean and the conditional realestates,bondratings,andpredictionofbankingfailures
varianceprocesses. amongmany,whereasstudiessuchasKanas[40],Kanasand
TheScientificWorldJournal 7
Yannopoulos[65],andShively[117]appliedANNmodelsto if𝑛𝑗,𝑖transitionprobability𝑃(𝑠𝑡 =𝑖|𝑠𝑡−1 =𝑗)isaccepted;
stock return forecasting, and Donaldson and Kamstra [118]
[𝜀 −𝐸(𝜀)]
proposed hybrid modeling to combine GARCH, GJR, and 𝑧 = 𝑡−𝑑
EGARCHmodelswithANNarchitecture. 𝑡−𝑑 √𝐸(𝜀2) (18)
TheMLP,animportantclassofneuralnetworks,consists
of a set of sensory units that constitute the input layer, one 𝑠 → max{𝑝,𝑞} recursive procedure is started by con-
or more hidden layers, and an output layer. The additional structing 𝑃(𝑧𝑠 = 𝑖 | 𝑧𝑠−1), where 𝜓(𝑧𝑡𝜆ℎ) is of the form
linearinputwhichisconnectedtotheMLPnetworkiscalled 1/(1+exp(−𝑥)),atwice-differentiable,continuousfunction
the Hybrid MLP. Hamilton model can also be considered bounded between [0,1]. The weight vector 𝜉 = 𝑤; 𝜓 = 𝑔
as a nonlinear mixture of autoregressive functions, such as
logisticactivationfunctionandinputvariablesaredefinedas
the multilayer perceptron and thus, the Hamilton model is 𝑧𝑡𝜆ℎ =𝑥𝑖,where𝜆ℎisdefinedasin(16).
called Hybrid MLP-HMC models [119]. Accordingly, in the If 𝑛𝑗,𝑖 transition probability 𝑃(𝑧𝑡 = 𝑖 | 𝑧𝑡−1 = 𝑗) is
HMCmodel,theregimechangesaredominatedbyaMarkov
accepted,
chain without making a priori assumptions in light of the
number of regimes [119]. In fact, Hybrid MLP accepts the 𝑓(𝑦 |𝑥,𝑧 =𝑖)
𝑡 𝑡 𝑡
network inputs to be connected to the output nodes with
weightedconnectionstoformalinearmodelthatisparallel 1 {−(𝑦𝑡−𝑥𝑡𝜑−∑𝐻𝑗=1𝛽𝑗𝑝(𝑥𝑡𝛾𝑗))2} (19)
withnonlinearMultilayerPerceptron. = exp{ },
2ℎ
In the study, the MS-ARMA-GARCH-MLP model to √2𝜋ℎ𝑡(𝑖) { 𝑡(𝑗) }
be proposed allows Markov switching type regime changes
both in the conditional mean and conditional variance 𝑠 → max{𝑝,𝑞},recursiveprocedureisstartedbyconstruct-
processes augmented with MLP type neural networks to ing𝑃(𝑧𝑠 =𝑖|𝑧𝑠−1).
achieve improvement in terms of in-sample and out-of-
sampleforecastaccuracy. 3.1.2. MS-ARMA-APGARCH-MLP Model. Asymmetric
The MS-ARMA-GARCH-MLP model is defined of the powerGARCH(APGARCH)modelhasinterestingfeatures.
form: In the construction of the model, the APGARCH structure
𝑟 𝑛 of Ding et al. [19] is followed. The model given in (13)
𝑦 =𝑐 +∑𝜃 𝑦 +𝜀 +∑𝜑 𝜀 , is modified to obtain the Markov switching APGARCH
𝑡 (𝑠) 𝑖,(𝑠) 𝑡−𝑖 𝑡,(𝑠) 𝑗,(𝑠) 𝑡−𝑗,(𝑠) (12)
𝑡 𝑡 𝑡 𝑡 𝑡
𝑖=1 𝑗=1 Multilayer Perceptron (MS-ARMA-APGARCH-MLP)
modeloftheform,
𝑝 𝑞
𝜎𝑡2,(𝑠𝑡) =𝑤(𝑠𝑡)+𝑝∑=1𝛼𝑝,(𝑠𝑡)𝜀𝑡2−𝑝,(𝑠𝑡)+𝑞∑=1𝛽𝑞,(𝑠𝑡)𝜎𝑡−𝑞,(𝑠𝑡) 𝜎𝑡𝛿,(,(𝑠𝑠𝑡𝑡))
(13)
+∑ℎ 𝜉 𝜓(𝜏 ,𝑍 𝜆 ,𝜃 ), =𝑤(𝑠)+ ∑𝑝 𝛼𝑝,(𝑠)(𝜀𝑡−𝑝−𝛾𝑝,(𝑠)𝜀𝑡−𝑝,(𝑠))𝛿,(𝑠𝑡)
ℎ,(𝑠) ℎ,(𝑠) 𝑡,(𝑠) ℎ,(𝑠) ℎ,(𝑠) 𝑡 𝑡 𝑡 𝑡
𝑡 𝑡 𝑡 𝑡 𝑡 𝑝=1
ℎ=1
𝑞 ℎ
where, the regimes are governed by unobservable Markov +∑𝛽 𝜎𝛿,(𝑠𝑡) +∑𝜉 𝜓(𝜏 ,𝑍 𝜆 ,𝜃 ),
process: 𝑞=1 𝑞,(𝑠𝑡) 𝑡−𝑞,(𝑠𝑡) ℎ=1 ℎ,(𝑠𝑡) ℎ,(𝑠𝑡) 𝑡,(𝑠𝑡) ℎ,(𝑠𝑡) ℎ,(𝑠𝑡)
𝑚 (20)
∑𝜎2 𝑃(𝑆 =𝑖|𝑧 ), 𝑖=1,...𝑚.
𝑡(𝑖) 𝑡 𝑡−1 (14)
where, regimes are governed by unobservable Markov
𝑖=1
process. The model is closed as defining the conditional
In the MLP type neural network, the logistic type sigmoid mean as in (12) and conditional variance of the form
functionisdefinedas equation’s (14)–(19) and (20) to augment the MS-ARMA-
GARCH-MLP model with asymmetric power terms to
𝜓(𝜏 ,𝑍 𝜆 ,𝜃 )
ℎ,(𝑠) 𝑡,(𝑠) ℎ,(𝑠) ℎ,(𝑠) obtain MS-ARMA-APGARCH-MLP. Note that, model
𝑡 𝑡 𝑡 𝑡
nest several specifications. Equation (20) reduces to the
−1
𝑙 ℎ
=[1+exp(−𝜏ℎ,(𝑠𝑡)(∑[∑𝜆ℎ,𝑙,(𝑠𝑡)𝑧𝑡ℎ−𝑙,(𝑠𝑡)+𝜃ℎ,(𝑠𝑡)]))] MterSm-A𝛿RM=A-G2AaRnCdH𝛾-𝑝M,(𝑠L)P=m0o.deSlimiinlarl(y1,3)theifmthodeelpnowesetrs
𝑙=1 ℎ=1 (15) MS-GJR-MLP if 𝛿 = 2𝑡 and 0 ≤ 𝛾𝑝,(𝑠𝑡) ≤ 1 are imposed.
The model may be shown as MSTGARCH-MLP model
(12)𝜆ℎ,𝑑 ∼ uniform [−1,+1] (16) isfin𝛿gle=reg1imanedre0str≤ictio𝛾𝑝n,,(𝑠𝑡𝑠)𝑡 ≤= 𝑠1.=Sim1,ilathrley,qbuyotaepdpmlyiondgelas
reduce to their respective single regime variants, namely,
and𝑃(𝑆𝑡 =𝑖|𝑧𝑡−1),thefilteredprobabilitywiththefollowing the ARMA-APGARCH-MLP, ARMA-GARCH-MLP,
representation, ARMA-NGARCH-MLP, ARMA-GJRGARCH-MLP, and
ARMA-GARCH-MLP models (for further discussion in
(𝑃(𝑆𝑡 =𝑖|𝑧𝑡−1)𝛼𝑓(𝑃(𝜎𝑡−1 |𝑧𝑡−1,𝑠𝑡−1 =1))) (17) NN-GARCHfamilymodels,seeBildiriciandErsin[61]).
8 TheScientificWorldJournal
3.1.3. MS-ARMA-FIAPGARCH-MLP Model. Following the For a typical example, consider a MS-ARMA-
methodologydiscussedintheprevioussection,MS-ARMA- FIAPGARCH-MLPmodelrepresentationwithtworegimes:
APGARCH-MLP model is augmented with neuralnetwork
modelingarchitectureandthataccountsforfractionalinte- (1−𝛽 𝐿)𝜎𝛿(1)
(1) 𝑡,(1)
gration to achieve long memory characteristics to obtain
MS-ARMA-FIAPGARCH-MLP. Following the MS-ARMA- =𝑤 +((1−𝛽 𝐿)−(1−𝜙 𝐿)(1−𝐿)𝑑(1))
(1) (1) (1)
FIAPGARCH represented in (11), the MLP type neural
nreeptwreoserkntaautigomneisnatecdhiMevSed-A:RMA-FIAPGARCH-MLP model ×(𝜀𝑡−1−𝛾(1)𝜀𝑡−1)𝛿(1) +∑ℎ 𝜉ℎ,(1)𝜓(𝜏ℎ,(1),𝑍𝑡,(1)𝜆ℎ,(1),𝜃ℎ,(1)),
ℎ=1
(1−𝛽(𝑠𝑡)𝐿)𝜎𝑡𝛿,((𝑠𝑠𝑡𝑡)) (1−𝛽(2)𝐿)𝜎𝑡𝛿,((22))
=𝑤(𝑠)+((1−𝛽(𝑠)𝐿)−(1−𝜙(𝑠)𝐿)(1−𝐿)𝑑(𝑠𝑡)) =𝑤(2)+((1−𝛽(2)𝐿)−(1−𝜙(2)𝐿)(1−𝐿)𝑑(2))
𝑡 𝑡 𝑡
×(𝜀𝑡−1,(𝑠𝑡)−𝛾(𝑠𝑡)𝜀𝑡−1,(𝑠𝑡))𝛿(𝑠𝑡) (21) ×(𝜀𝑡−1−𝛾(2)𝜀𝑡−1)𝛿(2) +∑ℎ 𝜉ℎ,(2)𝜓(𝜏ℎ,(2),𝑍𝑡,(2)𝜆ℎ,(2),𝜃ℎ,(2)).
ℎ=1
ℎ
(22)
+∑𝜉 𝜓(𝜏 ,𝑍 𝜆 ,𝜃 ),
ℎ,(𝑠) ℎ,(𝑠) 𝑡,(𝑠) ℎ,(𝑠) ℎ,(𝑠)
𝑡 𝑡 𝑡 𝑡 𝑡
ℎ=1
Followingthedivisionofregressionspaceintotwosub-
spaceswithMarkovswitching,themodelallowstwodifferent
where, ℎ are neurons defined with sigmoid type logistic asymmetric power terms, 𝛿(1) and 𝛿(2), and two different
functions, 𝑖 = 1,...,𝑚 regime states governed by fractionaldifferentiationparameters,𝑑(1)and𝑑(2);asaresult,
unobservable variable following Markov process. Equation differentlongmemoryandasymmetricpowerstructuresare
(21) defines the MS-ARMA-FIAPGARCH-MLP model, allowedintwodistinguishedregimes.
the fractionally integration variant of the MSAGARCH- ItispossibletoshowthemodelasasingleregimeNN-
MLP model modified with the ANN, and the logistic FIAPGARCHmodelif𝑖=1:
activation function, 𝜓(𝜏ℎ,(𝑠),𝑍𝑡,(𝑠)𝜆ℎ,(𝑠),𝜃ℎ,(𝑠)) defined as
𝑡 𝑡 𝑡 𝑡 (1−𝛽𝐿)𝜎𝛿
in (15). Bildirici and Ersin [61] proposes a class of NN- 𝑡
GARCH models including the NN-APGARCH. Similarly,
=𝜔+((1−𝛽𝐿)−(1−𝜙𝐿)(1−𝐿)𝑑)
the MS-ARMA-FIAPGARCH-MLP model reduces to
(23)
the MS-FIGARCH-MLP model for restrictions on the
rpeodwuecrestetromM𝛿S(-𝑠𝑡F)IN=GA2RaCnHd-𝛾M(𝑠𝑡L)P=mo0d.eFlufortrh𝛾e(r𝑠,)th=e m0oadnedl ×(𝜀𝑡−1−𝛾𝑗𝜀𝑡−1)𝛿+∑ℎ 𝜉ℎ𝜓(𝜏ℎ,𝑍𝑡𝜆ℎ,𝜃ℎ).
to the MS-FI-GJRGARCH-MLP model if 𝛿(𝑠)𝑡 = 2 and ℎ=1
𝛾(𝑠) is restricted to be in the range of 0 ≤ 𝛾(𝑡𝑠) ≤ 1. The Further, the model reduces to NN-FIGARCH if 𝑖 = 1 and
mino𝑡addedlirteiodnucteos ttoheM0S-T≤GA𝛾R(𝑠C)H-≤MLP1 mreostdreicltii𝑡ofn𝛿.(𝑠O𝑡)n=th1e 𝛿(𝑠1) =𝛿=2inthefashionofBildiriciandErsin[61]:
𝑡
contrary, if single regime restriction is imposed, models (1−𝛽𝐿)𝜎2
𝑡
discussed above, namely, MS-ARMA-FIAPGARCH-MLP,
MSFIGARCH-NN, MSFIGARCH-NN, MSFINGARCH- =𝜔+((1−𝛽𝐿)−(1−𝜙𝐿)(1−𝐿)𝑑)
MLP, MSFIGJRGARCH-MLP, and MSFITGARCH-MLP (24)
models reduce to NN-FIAPGARCH, NN-FIGARCH, NN- ℎ
FIGARCH, NN-FINGARCH, NN-FIGJRGARCH, and ×(𝜀𝑡−1−𝛾𝑗𝜀𝑡−1)2+∑𝜉ℎ𝜓(𝜏ℎ,𝑍𝑡𝜆ℎ,𝜃ℎ).
NN-FITGARCH models, which are single regime neural ℎ=1
network augmented GARCH family models of the form
BildiriciandErsin[61]thatdonotpossessMarkovswitching 3.2.RadialBasisFunctionModel. RadialBasisFunctionsare
type asymmetry (Bildirici and Ersin [61]). The model also one of the most commonly applied neural network models
nests model variants that do not possess long memory that aim at solving the interpolation problem encountered
characteristics. By imposing 𝑑(𝑠) = 0 to the fractional in nonlinear curve fitting.Liu and Zhang [120] utilized the
𝑡
integrationparameterwhichmaytakedifferentvaluesunder Radial Basis Function Neural Networks (RBF) and Markov
𝑖 = 1,2,...,𝑚 different regimes, the model in (21) reduces regime-switching regressionsto divide the regression space
to MS-ARMA-APGARCH-MLP model, the short memory intotwosub-spacestoovercomethedifficultyinestimating
model variant. In addition to the restrictions applied the conditionalvolatility inherent in stock returns. Further,
above, application of 𝑑(𝑠) = 0 results in models without Santosetal.[121]developedaRBF-NN-GARCHmodelthat
𝑡
long memory characteristics: MS-ARMA-FIAPGARCH- benefit from the RBF type neural networks. Liu and Zhang
MLP, MS-ARMA-GARCH-MLP, MS-ARMA-GARCH- [120]combinedRBFneuralnetworkmodelswiththeMarkov
MLP, MSNGARCH-MLP, MS-GJR-GARCH-MLP, and SwitchingmodeltomergeMarkovswitchingNeuralNetwork
MSTGARCH-MLP. modelbasedonRBFmodels.RBFneuralnetworksintheir
TheScientificWorldJournal 9
models are trained to generate both time series forecasts inlightofmodelingaradialfunctionofthedistancebetween
and certainty factors. Accordingly, RBF neural network is theinputsandcalculatedvalueinthehiddenunit.Theoutput
represented as a composition of three layers of nodes; first, unitproducesalinearcombinationofthebasisfunctionsto
theinputlayerthatfeedstheinputdatatoeachofthenodes provideamappingbetweentheinputandoutputvectors:
in the second or hidden layer; the second layer that differs
fdraotmacoluthsteerrnwehuircahlinsectewnoterkresdinattahaptaretaicchulanropdoeinretparnedsehnatssaa 𝜎𝑡𝛿,(,(𝑠𝑠𝑡𝑡)) =𝑤(𝑠𝑡)+𝑝∑𝑝=1𝛼𝑝,(𝑠𝑡)(𝜀𝑡−𝑗−𝛾𝑝,(𝑠𝑡)𝜀𝑡−𝑝,(𝑠𝑡))𝛿,(𝑠𝑡)
givenradiusandinthethirdlayer,consistingofonenode.
𝑞
3.2.1. MS-ARMA-GARCH-RBF Model. MS-GARCH-RBF +𝑞∑=1𝛽𝑞,(𝑠𝑡)𝜎𝑡𝛿−,(𝑞𝑠,𝑡()𝑠𝑡) (30)
modelisdefinedas
𝜎𝑡2,(𝑠𝑡) = 𝑤(𝑠𝑡)+𝑝∑𝑝=1𝛼𝑝,(𝑠𝑡)𝜀𝑡2−𝑝,(𝑠𝑡) +ℎ∑=ℎ1𝜉ℎ,(𝑠𝑡)𝜙ℎ,(𝑠𝑡)(𝑍𝑡,(𝑠𝑡)−𝜇ℎ,(𝑠𝑡)),
𝑞 where,𝑖 = 1,...,𝑚regimemodelandregimesaregoverned
+∑𝛽 𝜎
𝑞,(𝑠) 𝑡−𝑞,(𝑠) (25) by unobservable Markov process. Equations (26)–(29) with
𝑡 𝑡
𝑞=1 (30)definetheMS-ARMA-APGARCH-RBFmodel.Similar
totheMS-ARMA-APGARCH-MLPmodel,theMS-ARMA-
+∑ℎ 𝜉ℎ,(𝑠)𝜙ℎ,(𝑠)(𝑍𝑡,(𝑠)−𝜇ℎ,(𝑠)), APGARCH-RBFmodelnestsseveralmodels.Equation(30)
𝑡 𝑡 𝑡 𝑡 reducestotheMS-ARMA-GARCH-RBFmodelifthepower
ℎ=1
term 𝛿 = 2 and 𝛾𝑝,(𝑠) = 0, to the MSGARCH-RBF model
where 𝑖 = 1,...,𝑚 regimes are governed by unobservable for 𝛾𝑝,(𝑠) = 0, and𝑡to the MSGJRGARCH-RBF model if
Markovprocess: 𝛿 = 2𝑡and 0 ≤ 𝛾𝑝,(𝑠) ≤ 1 restrictions are allowed. The
model may be shown𝑡as MSTGARCH-RBF model if 𝛿 =
𝑚
∑𝜎𝑡2,(𝑠𝑡)𝑃(𝑆𝑡 =𝑖|𝑧𝑡−1). (26) n1aamnedly,0NN≤-A𝛾P𝑝G,(𝑠A𝑡)RC≤H,1N. NFu-rGthAeRr,CsHin,gNleNr-eGgAimReCHm,oNdeNls-,
𝑖=1
NGARCH, NN-GJRGARCH, and NN-TGARCH models,
AGaussianbasisfunctionforthehiddenunitsgivenas𝜙(𝑥) may be obtained if 𝑡 = 1 (for further discussion in NN-
for𝑥=1,2,...,𝑋wheretheactivationfunctionisdefinedas, GARCHfamilymodels,seeBildiriciandErsin[61]).
−𝑍𝑡,(𝑠)−𝜇ℎ,(𝑠)2 3.2.3. MS-ARMA-FIAPGARCH-RBF Model. MS-
𝜙(ℎ,(𝑠𝑡),𝑍𝑡)=exp( 𝑡2𝜌2 𝑡 ). (27) FIAPGARCH-RBFmodelisdefinedas
𝛿
With 𝑝 defining the width of each function. 𝑍𝑡 is a vector (1−𝛽(𝑠𝑡)𝐿)𝜎𝑡,((𝑠𝑠𝑡)𝑡)
of lagged explanatory variables, 𝛼 + 𝛽 < 1 is essential to
ensure stationarity. Networks of this type can generate any =𝑤(𝑠)+((1−𝛽(𝑠)𝐿)−(1−𝜙(𝑠)𝐿)(1−𝐿)𝑑(𝑠𝑡))
𝑡 𝑡 𝑡
real-valuedoutput,butintheirapplicationswheretheyhave
a priori knowledge of the range of the desired outputs, it ×(𝜀𝑡−1,(𝑠)−𝛾(𝑠)𝜀𝑡−1,(𝑠))𝛿(𝑠𝑡) (31)
𝑡 𝑡 𝑡
is computationally more efficient to apply some nonlinear
tran𝑃sf(e𝑆r𝑡fu=nct𝑖ion|to𝑧𝑡t−h1e)oiustpthuetsfitolterreefldecptrtohbaatbkinliotywlweditghe.the +∑ℎ 𝜉ℎ,(𝑠𝑡)𝜙ℎ,(𝑠𝑡)(𝑍𝑡,(𝑠𝑡)−𝜇ℎ,(𝑠𝑡)),
followingrepresentation: ℎ=1
(𝑃(𝑆𝑡 =𝑖|𝑧𝑡−1)𝛼𝑓(𝑃(𝜎𝑡−1 |𝑧𝑡−1,𝑠𝑡−1 =1))). (28) where, ℎ are neurons defined with Gaussian functions.
The MS-ARMA-FIAPGARCH-RBF model is a variant of
If𝑛𝑗,𝑖transitionprobability𝑃(𝑠𝑡 =𝑖|𝑠𝑡−1 =𝑗)isaccepted, the MSAGARCH-RBF model with fractional integration
augmented with ANN architecture. Similarly, the MS-
[𝜀 −𝐸(𝜀)] ARMA-FIAPGARCH-RBFmodelreducestotheMS-ARMA-
𝑧 = 𝑡−𝑑
𝑡−𝑑 (29) FIGARCH-RBF model with restrictions on the power term
√𝐸(𝜀2) 𝛿(𝑠) = 2 and 𝛾(𝑠) = 0. The model nests MSFINGARCH-
RB𝑡Fmodelfor𝛾(𝑡𝑠) = 0,andMSFIGJRGARCH-RBFmodel
𝑠 → max{𝑝,𝑞}recursiveprocedureisstartedbyconstruct- if𝛿(𝑠) = 2and𝛾(𝑠𝑡)variesbetween0 ≤ 𝛾(𝑠) ≤ 1.Further,the
ing𝑃(𝑧𝑠 =𝑖|𝑧𝑠−1). mod𝑡elmaybesho𝑡wnasMSTGARCH-RB𝑡Fmodelif𝛿(𝑠) =1
and0 ≤ 𝛾(𝑠) ≤ 1.Withsingleregimerestriction𝑖 = 1𝑡,dis-
𝑡
3.2.2.MS-ARMA-APGARCH-RBFModel. Radialbasisfunc- cussedmodelsreducetoNN-FIAPGARCH,NN-FIGARCH,
tions are three-layer neural network models with linear NN-FIGARCH, NN-FINGARCH, NN-FIGJRGARCH, and
outputfunctionsandnonlinearactivationfunctionsdefined NN-FITGARCH models, which do not possess Markov
as Gaussian functionsin hidden layer utilizedto the inputs switching type asymmetry. To obtain the model with short
10 TheScientificWorldJournal
imnteemgroartyiocnhapraarcatmereistetircss,sh𝑑o(⋅u)ld=be0irmesptroiscetdionanodnthfreacmtioondaell 𝜎𝑡𝛿,(,(𝑠𝑠𝑡𝑡)) = 𝑤(𝑠𝑡)+𝑝∑𝑝=1𝛼𝑝,(𝑠𝑡)(𝜀𝑡−𝑗−𝛾𝑝,(𝑠𝑡)𝜀𝑡−𝑝,(𝑠𝑡))𝛿,(𝑠𝑡)
reduces to MSAPGARCH-RBF model, the short memory
model variant. Additionally, by applying 𝑑(⋅) = 0 with 𝑞
tohreyrcehsatrraiccttieornisstidciss:cMusSsFedIAaPbGovAeR,mCHod-RelBsFw,iMthSoGutAlRonCgHm-ReBmF-, +𝑞∑=1𝛽𝑞,(𝑠𝑡)𝜎𝑡𝛿−,(𝑞𝑠,𝑡()𝑠𝑡) (33)
MSGARCH-RBF, MSNGARCH-RBF, MSGJRGARCH-RBF,
ℎ
andMSTGARCH-RBFmodelscouldbeobtained. +∑𝜉 Π(𝜃 𝜒 +𝜃 )
ℎ,(𝑠) 𝑘,ℎ,(𝑠) 𝑡−𝑘,ℎ,(𝑠) 𝑘,ℎ,(𝑠)
𝑡 𝑡 𝑡 𝑡
ℎ=1
3.3. Recurrent Neural Network MS-GARCH Models. The 𝑖 = 1,...,𝑚regimesaregovernedbyunobservableMarkov
RNN model includes the feed-forward system; however, process. 𝜃𝑘,ℎ,(𝑠) is the weights of connection from pre to
it distinguishes itself from standard feed-forward network postsynapticn𝑡odes,Π(𝑥)isalogisticsigmoidfunctionofthe
models in the activation characteristics within the layers. formgivenin(15),𝜒𝑡−𝑘,ℎ,(𝑠)isavariablevectorcorresponding
𝑡
The activations are allowed to provide a feedback to units totheactivationsofpostsynapticnodes,theoutputvectorof
withinthesameorprecedinglayer(s).Thisformsaninternal the hidden units, and 𝜃𝑘,ℎ,(𝑠) are the bias parameters of the
𝑡
memory system that enables a RNN to construct sensitive presynaptic nodes and 𝜉𝑖,(𝑠) are the weights of each hidden
internal representations in response to temporal features unitforℎhiddenneurons,𝑡𝑖 = 1,...,ℎ.Theparametersare
foundwithinadataset. estimatedbyminimizingthesumofthesquared-errorloss:
The Jordan [122] and Elman’s [123] networks are simple min𝜆=∑𝑇𝑡−1[𝜎𝑡 − 𝜎̂𝑡]2.Themodelisestimatedbyrecurrent
recurrent networks to obtain forecasts: Jordan and Elman
back-propagation algorithm and by the recurrent Newton
networks extend the multilayer perceptron with context
algorithm.ByimposingseveralrestrictionssimilartotheMS-
units, which are processing elements (PEs) that remember
ARMA-APGARCH-RBF model, several representations are
pastactivity.Contextunitsprovidethenetworkwiththeabil-
shown under certain restrictions. Equation (33) reduces to
itytoextracttemporalinformationfromthedata.TheRNN MS-ARMA-GARCH-RNN model with 𝛿 = 2 and 𝛾𝑝,(𝑠) =
modelemploysbackpropagation-through-time,anefficient 0, to the MSGARCH-RNN model for 𝛾𝑝,(𝑠) = 0, 𝑡and
gradient-descentlearningalgorithmforrecurrentnetworks. to the MSGJRGARCH-RNN model if 𝛿 =𝑡 2 and 0 ≤
Itwasusedasastandardvariantofcross-validationreferred 𝛾𝑝,(𝑠) ≤ 1 restrictions are imposed. MSTGARCH-RNN
to as the leave-one-out method and as a stopping criterion mod𝑡elisobtainedif𝛿 = 1and0 ≤ 𝛾𝑝,(𝑠) ≤ 1.Inaddition
suitableforestimationproblemswithsparsedataandsoitis 𝑡
to the restrictions above, if the single regime restriction
identifiedtheonsetofoverfittingduringtraining.TheRNN 𝑖 = 1 is implied, the model given in Equation (33) reduces
wasfunctionallyequivalenttoanonlinearregressionmodel
to their single regime variants; namely, the APGARCH-
used for time-series forecasting (Zhang et al. [124]; Binner
RNN, GARCH-RNN, GJRGARCH-RNN, and TGARCH-
etal.[125]).Tinˇoetal.[126]mergedtheRNNandGARCH
RNNmodels,respectively.
models.
3.3.3. MS-ARMA-FIAPGARCH-RNN. Markov Switching
Fractionally Integrated APGARCH Recurrent Neural
3.3.1. MS-ARMA-GARCH-RNN Models. The model is
NetworkModelisdefinedas
definedas
𝛿
(1−𝛽 𝐿)𝜎 (𝑠𝑡)
(𝑠𝑡) 𝑡,(𝑠𝑡)
𝜎2 = 𝑤 + ∑𝑝 𝛼 𝜀2 +∑𝑞 𝛽 𝜎 =𝑤(𝑠)+((1−𝛽(𝑠)𝐿)−(1−𝜙(𝑠)𝐿)(1−𝐿)𝑑(𝑠𝑡))
𝑡,(𝑠𝑡) (𝑠𝑡) 𝑝=1 𝑝,(𝑠𝑡) 𝑡−𝑝,(𝑠𝑡) 𝑞=1 𝑞,(𝑠𝑡) 𝑡−𝑞,(𝑠𝑡) (32) ×(𝑡𝜀𝑡−1,(𝑠)−𝛾(𝑠𝑡)𝜀𝑡−1,(𝑠))𝛿(𝑠𝑡) 𝑡 (34)
ℎ 𝑡 𝑡 𝑡
+∑𝜉 𝜋 (𝑤 𝜃 +𝜃 ).
ℎ,(𝑠) ℎ,(𝑠) 𝑘,ℎ,(𝑠) 𝑡−𝑘 𝑘,ℎ,(𝑠) ℎ
ℎ=1 𝑡 𝑡 𝑡 𝑡 +∑𝜉ℎ,(𝑠)Π(𝜃𝑘,ℎ,(𝑠)𝜒𝑡−𝑘,ℎ,(𝑠)+𝜃𝑘,ℎ,(𝑠)),
𝑡 𝑡 𝑡 𝑡
ℎ=1
where, ℎ are neurons defined as sigmoid type logistic
Similartothemodelsabove,(32)isshownfor𝑖 = 1,...,𝑚 functions and 𝑖 = 1,...,𝑚 regime states the following
regimes which are governed by unobservable Markov pro- Markovprocess.TheMS-ARMA-FIAPGARCH-RNNmodel
cess.Activationfunctionistakenasthelogisticfunction. is the fractionally integrated variant of the MS-ARMA-
APGARCH-RNN model. The MS-ARMA-FIAPGARCH-
RNN model reduces to the MS-ARMA-FIGARCH-RNN
3.3.2. MS-ARMA-APGARCH-RNN. Markov switching modelwithrestrictionsonthepowerterm𝛿(𝑠) =2and𝛾(𝑠) =
APGARCHRecurrentNeuralNetworkModelisrepresented 0.Further,themodelreducestoMSFINGARC𝑡H-RNNmo𝑡del
as for𝛾(𝑠) =0,totheMSFIGJRGARCH-RNNmodelif𝛿(𝑠) =2
𝑡 𝑡
Description:literature as a result of the need of modelling uncertainty and risk in the discussion in. NN-GARCH family models, see Bildirici and Ersin [61]).