G13BAF (PDF version)
G13 Chapter Contents
G13 Chapter Introduction
NAG Library Manual

NAG Library Routine Document

G13BAF

Note:  before using this routine, please read the Users' Note for your implementation to check the interpretation of bold italicised terms and other implementation-dependent details.

+ Contents

    1  Purpose
    7  Accuracy

1  Purpose

G13BAF filters a time series by an ARIMA model.

2  Specification

SUBROUTINE G13BAF ( Y, NY, MR, NMR, PAR, NPAR, CY, WA, NWA, B, NB, IFAIL)
INTEGER  NY, MR(NMR), NMR, NPAR, NWA, NB, IFAIL
REAL (KIND=nag_wp)  Y(NY), PAR(NPAR), CY, WA(NWA), B(NB)

3  Description

From a given series y1,y2,,yn, a new series b1,b2,,bn is calculated using a supplied (filtering) ARIMA model. This model will be one which has previously been fitted to a series xt with residuals at. The equations defining bt in terms of yt are very similar to those by which at is obtained from xt. The only dissimilarity is that no constant correction is applied after differencing. This is because the series yt is generally distinct from the series xt with which the model is associated, though yt may be related to xt. Whilst it is appropriate to apply the ARIMA model to yt so as to preserve the same relationship between bt and at as exists between yt and xt, the constant term in the ARIMA model is inappropriate for yt. The consequence is that bt will not necessarily have zero mean.
The equations are precisely:
wt=dsDyt, (1)
the appropriate differencing of yt; both the seasonal and non-seasonal inverted autoregressive operations are then applied,
ut=wt-Φ1wt-s--ΦPwt-s×P (2)
vt=ut-ϕ1ut-1--ϕput-p (3)
followed by the inverted moving average operations
zt=vt+Θ1zt-s++ΘQzt-s×Q (4)
bt=zt+θ1bt-1++θqbt-q. (5)
Because the filtered series value bt depends on present and past values yt,yt-1,, there is a problem arising from ignorance of y0,y-1, which particularly affects calculation of the early values b1,b2,, causing ‘transient errors’. The routine allows two possibilities.
(i) The equations (1), (2) and (3) are applied from successively later time points so that all terms on their right-hand sides are known, with vt being defined for t=1+d+s×D+s×P,,n. Equations (4) and (5) are then applied over the same range, taking any values on the right-hand side associated with previous time points to be zero.
This procedure may still however result in unacceptably large transient errors in early values of bt.
(ii) The unknown values y0,y-1, are estimated by backforecasting. This requires that an ARIMA model distinct from that which has been supplied for filtering, should have been previously fitted to yt.
For efficiency, you are asked to supply both this ARIMA model for yt and a limited number of backforecasts which are prefixed to the known values of yt. Within the routine further backforecasts of yt, and the series wt, ut, vt in (1), (2) and (3) are then easily calculated, and a set of linear equations solved for backforecasts of zt,bt for use in (4) and (5) in the case that q+Q>0.
Even if the best model for yt is not available, a very approximate guess such as
yt=c+et
or
yt=et
can help to reduce the transients substantially.
The backforecasts which need to be prefixed to yt are of length Qy=qy+sy×Qy, where qy and Qy are the non-seasonal and seasonal moving average orders and sy the seasonal period for the ARIMA model of yt. Thus you need not carry out the backforecasting exercise if Qy=0. Otherwise, the series y1,y2,,yn should be reversed to obtain yn,yn-1,,y1 and G13AJF should be used to forecast Qy values, y^0,,y^1-Qy. The ARIMA model used is that fitted to yt (as a forward series) except that, if dy+Dy is odd, the constant should be changed in sign (to allow, for example, for the fact that a forward upward trend is a reversed downward trend). The ARIMA model for yt supplied to the filtering routine must however have the appropriate constant for the forward series.
The series y^1-Qy,,y^0,y1,,yn is then supplied to the routine, and a corresponding set of values returned for bt.

4  References

Box G E P and Jenkins G M (1976) Time Series Analysis: Forecasting and Control (Revised Edition) Holden–Day

5  Parameters

1:     Y(NY) – REAL (KIND=nag_wp) arrayInput
On entry: the Qy backforecasts, starting with backforecast at time 1-Qy to backforecast at time 0, followed by the time series starting at time 1, where Qy=MR10+MR13×MR14. If there are no backforecasts, either because the ARIMA model for the time series is not known, or because it is known but has no moving average terms, then the time series starts at the beginning of Y.
2:     NY – INTEGERInput
On entry: the total number of backforecasts and time series data points in array Y.
Constraint: NYmax1+Qy,NPAR.
3:     MR(NMR) – INTEGER arrayInput
On entry: the orders vector for the filtering model, followed by the orders vector for the ARIMA model for the time series if the latter is known. The orders appear in the standard sequence p,d,q,P,D,Q,s as given in the G13 Chapter Introduction. If the ARIMA model for the time series is supplied, then the routine will assume that the first Qy values of the array Y are backforecasts.
Constraints:
the filtering model is restricted in the following ways:
  • MR1+MR3+MR4+MR6>0, i.e., filtering by a model which contains only differencing terms is not permitted;
  • MRk0, for k=1,2,,7;
  • if MR7=0, MR4+MR5+MR6=0;
  • if MR70, MR4+MR5+MR60;
  • MR71.
the ARIMA model for the time series is restricted in the following ways:
  • MRk0, for k=8,9,,14;
  • if MR14=0, MR11+MR12+MR13=0;
  • if MR140, MR11+MR12+MR130;
  • MR141.
4:     NMR – INTEGERInput
On entry: the number of values specified in the array MR. It takes the value 7 if no ARIMA model for the time series is supplied but otherwise it takes the value 14. Thus NMR acts as an indicator as to whether backforecasting can be carried out.
Constraint: NMR=7 or 14.
5:     PAR(NPAR) – REAL (KIND=nag_wp) arrayInput
On entry: the parameters of the filtering model, followed by the parameters of the ARIMA model for the time series, if supplied. Within each model the parameters are in the standard order of non-seasonal AR and MA followed by seasonal AR and MA.
6:     NPAR – INTEGERInput
On entry: the total number of parameters held in array PAR.
Constraints:
  • if NMR=7, NPAR=MR1+MR3+MR4+MR6;
  • if NMR=14, NPAR=MR1+MR3+MR4+MR6+ MR8+MR10+MR11+MR13.
Note: the first constraint (i.e., MR1+MR3+MR4+MR6>0) on the orders of the filtering model, in parameter MR, ensures that NPAR>0.
7:     CY – REAL (KIND=nag_wp)Input
On entry: if the ARIMA model is known (i.e., NMR=14), CY must specify the constant term of the ARIMA model for the time series. If this model is not known (i.e., NMR=7), then CY is not used.
8:     WA(NWA) – REAL (KIND=nag_wp) arrayWorkspace
9:     NWA – INTEGERInput
On entry: the dimension of the array WA as declared in the (sub)program from which G13BAF is called. Workspace is only required if the ARIMA model for the time series is known.
Constraints:
let K=MR3+MR6×MR7+MR8+MR9+MR11+MR12×MR14,
then
  • if NMR=14, NWAK×K+2;
  • if NMR=7, NWA1.
10:   B(NB) – REAL (KIND=nag_wp) arrayOutput
On exit: the filtered output series. If the ARIMA model for the time series was known, and hence Qy backforecasts were supplied in Y, then B contains Qy ‘filtered’ backforecasts followed by the filtered series. Otherwise, the filtered series begins at the start of B just as the original series began at the start of Y. In either case, if the value of the series at time t is held in Yt, then the filtered value at time t is held in Bt.
11:   NB – INTEGERInput
On entry: the dimension of the array B as declared in the (sub)program from which G13BAF is called. In addition to holding the returned filtered series, B is also used as an intermediate work array if the ARIMA model for the time series was known.
Constraints:
  • if NMR=14, NBNY+maxK3,K1+K2;
  • if NMR=7, NBNY.
Where
  • K1=MR1+MR4×MR7;
  • K2=MR2+MR5×MR7;
  • K3=MR3+MR6×MR7.
12:   IFAIL – INTEGERInput/Output
On entry: IFAIL must be set to 0, -1​ or ​1. If you are unfamiliar with this parameter you should refer to Section 3.3 in the Essential Introduction for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value -1​ or ​1 is recommended. If the output of error messages is undesirable, then the value 1 is recommended. Otherwise, if you are not familiar with this parameter, the recommended value is 0. When the value -1​ or ​1 is used it is essential to test the value of IFAIL on exit.
On exit: IFAIL=0 unless the routine detects an error or a warning has been flagged (see Section 6).

6  Error Indicators and Warnings

If on entry IFAIL=0 or -1, explanatory error messages are output on the current error message unit (as defined by X04AAF).
Errors or warnings detected by the routine:
IFAIL=1
On entry,NMR7 and NMR14.
IFAIL=2
On entry,the orders vector MR does not satisfy the constraints given in Section 5.
IFAIL=3
On entry,NPAR is inconsistent with the contents of MR (see Section 5).
IFAIL=4
On entry,NY is too small to successfully carry out the requested filtering, (see Section 5).
IFAIL=5
On entry,the work array WA is too small.
IFAIL=6
On entry,the array B is too small.
IFAIL=7
The orders vector for the filtering model is invalid.
IFAIL=8
The orders vector for the ARIMA model is invalid. (Only occurs if NMR=14.)
IFAIL=9
The initial values of the filtered series are indeterminate for the given models.
IFAIL=-999
Internal memory allocation failed.

7  Accuracy

Accuracy and stability are high except when the MA parameters are close to the invertibility boundary.

8  Further Comments

If an ARIMA model is supplied, a local workspace array of fixed length is allocated internally by G13BAF. The total size of this array amounts to K integer elements, where K is the expression defined in the description of the parameter WA.
The time taken by G13BAF is approximately proportional to
NY×MR1+MR3+MR4+MR6,
with an appreciable fixed increase if an ARIMA model is supplied for the time series.

9  Example

This example reads a time series of length 296. It reads the univariate ARIMA 4,0,2,0,0,0,0 model and the ARIMA filtering 3,0,0,0,0,0,0 model for the series. Two initial backforecasts are required and these are calculated by a call to G13AJF . The backforecasts are inserted at the start of the series and G13BAF is called to perform the calculations.

9.1  Program Text

Program Text (g13bafe.f90)

9.2  Program Data

Program Data (g13bafe.d)

9.3  Program Results

Program Results (g13bafe.r)


G13BAF (PDF version)
G13 Chapter Contents
G13 Chapter Introduction
NAG Library Manual

© The Numerical Algorithms Group Ltd, Oxford, UK. 2012