闪电代写 -代写CS作业_CS代写_Finance代写_Economic代写_Statistics代写_代码代做_IT代写_加急帮助

Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

ECMT2130 - 2021 semester 2 final exam 1

Author: Geoﬀ Shuetrim

1. (10 points) Briony’s ARMA model identiﬁcation

Briony has a realisation of a stochastic process, xt , for t = 1, 2, 3, . . . , 1000. The estimated autocorrelation function and partial autocorrelation functions for that realisation are as shown below (the dotted blue lines represent the 95% conﬁdence interval):

(a) (3 points) What order of ARMA model will provide the best approximation to the stochastic process that produced the sample of data?

(b) (2 points) Specify a reasonable point estimate for each of the AR and MA coeﬃcients in the ARMA model you speciﬁed in part A. Base your answer on inspection of the ACF and PACF.

(d) (3 points) The sample standard deviation for the realisation of the stochastic process is 1 .394. If you were to estimate the model by maximum-likelihood, assuming the shocks have a normal distribution,

and you wanted to choose initial parameter values to use in the numerical optimisation of the log- likelihood function, what would you use as an initial estimate of σ, the standard deviation of the white-noise shocks in the ARMA model? Use the point estimates for the ARMA model from part B in your answer.

2. (10 points) GARCH model

(a) (3 points) Explain why ARCH and GARCH models are able to explain fat tails in the distributions of high-frequency ﬁnancial rate-of-return data, even if the shocks impacting the system have normal distributions.

(b) (2 points) Write out the variance equation for a GARCH(1,2) model.

(c) (3 points) What is formula for the conditional variance forecast, 3 periods ahead, using the variance equation in the GARCH(1,2) model?

(d) (2 points) In an MA(1) model, a shock only inﬂuences the stochastic process for 2 periods; the period in which the shock occurs and the period afterward. Thereafter, the shock has no inﬂuence on the value of the stochastic process. This is not the case for an ARCH(1) model where the conditional variance is a function of a constant and the square of the ﬁrst lag of the shock. Explain why an ARCH(1) will tend to exhibit multiple periods of higher volatility for shocks following a particularly large shock even though the variance equation only includes one lag of the squared shock.

3. (10 points) Unit roots and interest bond yields

Michael has monthly data from 1995 on the Australian 2 and 10 year government bond yields (both are interest rates), expressed in percentages. Before undertaking analysis of the yield curve in Australia, he wants to check whether the 2 government bond yield time series is I(0). Over the sample, both series trends down from around 10% to close to 0%.

He tests for at least one unit root in the natural log of the total return index using an Augmented Dickey Fuller test, allowing for a drift but not a deterministic trend. Her test results are reported below, based

on estimation of the regression:

∆yt = α + βyt _ 1 + V∆yt _ 1 + et

Coeﬃcient

Point estimate

Standard error

0.0235

-0.0102

0.3464

0.0267

0.0057

0.0519

(a) (5 points) Write up the formal Augmented Dickey-Fuller test. Perform the test at the 5% level of signiﬁcance, using the corresponding critical value, 一3.42.

The ACF for the residuals from this ADF regression is shown below:

(b) (2 points) Describe the information conveyed by this ACF and its implications for the ADF test.

(c) (2 points) He observes that government bond yields for diﬀerent maturity bonds move closely to- gether so he runs a simple linear regression (with intercept) of the 10-year bond yield on the 2-year bond yield. The slope coeﬃcient has a t-ratio of 82 and the R-squared for the regression is 0.96. Given that both yields exhibit a clear downward trend over the sample, and given the ﬁndings from his unit root testing, what should be his primary concern for this regression?

(d) (1 point) Doing an ADF test with the residuals from the regression described in part C, the null hypothesis is rejected the 1% level of signiﬁcance. Explain the implications of this ﬁnding for the concerns raised in part C?

4. (10 points) Anthony’s damped Holt-Winters model

Anthony has data on yt for periods t = 1, 2, 3, . . . , T and wants to forecast yt several steps ahead. The ﬁrst model he considers is a Holt-Winters model without a seasonal component or a trend component. It just uses a level component, with updating equation:

lt = αyt + (1 一 α)(lt _ 1 ) (1)

(a) (3 points) In terms of the estimates of the level in the last available time period T, what is the formula for the forecast of yT +3?

Anthony notes that the data exhibits periods of upward trending followed by periods of decline so he adds a trend to the model so there are two updating equations.

The level updating equation becomes:

lt = αyt + (1 一 α)(lt _ 1 + bt _ 1 ) (2)

The slope updating equation is:

bt = β(lt 一 lt _ 1 ) + (1 一 β)bt _ 1 (3)

(b) (3 points) In terms of the estimates of the slope in the last available time period T, bT , and the level, lT , in the same period, what is the formula for the forecast of yT +3?

Anthony is concerned that the periods of upward trending and downward trending behaviour mean that forecasts based on a constant trend into the future will quickly become unrealistic as he projects further ahead. This causes him to alter the updating equations yet again, this time including a

trend damping parameter φ where 0 < φ < 1.

The level equation becomes:

lt = αyt + (1 一 α)(lt _ 1 + φbt _ 1 ) (4) The slope equation becomes (in question variant 2, the symbol for the slope is s rather than b:

bt = β(lt 一 lt _ 1 ) + (1 一 β)φbt _ 1 (5)

(c) (1 point) Using the model with the damping parameter, φ, derive the formula for the forecast of yT +3 in terms of the estimates of the slope, bT , and the level, lT , in the last available time period T.

(d) (1 point) If φ = 0.5, ﬁnd the lowest value of h such that yˆT +h 一 yˆT +h _ 1 < 0.1bT where yˆT +h is the forecast for period T + h.

(e) (2 points) If the true stochastic process for yt is a random walk,

yt = yt _ 1 + et

where et is white noise, which of the three forecasting models would be most appropriate to use for forecasting?