g02bz:: Correlation and Regression Analysis (NAG Toolbox)

Description

Let

X

and

Y

denote two sets of data, each with

m

variables and

n_{x}

and

n_{y}

observations respectively. Let

μ_{x}

denote the (optionally weighted) vector of

m

means for the first dataset and

C_{x}

denote either the sums of squares and cross-products of deviations from

μ_{x}

C_{x} = {(X - e μ_{x}^{T})}^{T} D_{x} (X - e μ_{x}^{T})

or the sums of squares and cross-products, in which case

C_{x} = X^{T} D_{x} X

where

e

is a vector of

n_{x}

ones and

D_{x}

is a diagonal matrix of (optional) weights and

W_{x}

is defined as the sum of the diagonal elements of

D

. Similarly, let

μ_{y}

C_{y}

and

W_{y}

denote the same quantities for the second dataset.

Given

μ_{x}, μ_{y}, C_{x}, C_{y}, W_{x}

and

W_{y}

nag_correg_ssqmat_combine (g02bz) calculates

μ_{z}

C_{z}

and

W_{z}

as if a dataset

Z

, with

m

variables and

n_{x} + n_{y}

observations were supplied to nag_correg_ssqmat (g02bu), with

Z

constructed as

Z = (\begin{matrix} X \\ Y \end{matrix}) .

nag_correg_ssqmat_combine (g02bz) has been designed to combine the results from two calls to nag_correg_ssqmat (g02bu) allowing large datasets, or cases where all the data is not available at the same time, to be summarised.

References

Parameters

Compulsory Input Parameters

Optional Input Parameters

Output Parameters

Error Indicators and Warnings

Accuracy

Further Comments

Example

This example illustrates the use of nag_correg_ssqmat_combine (g02bz) by dividing a dataset into three blocks of

4

5

and

3

observations respectively. Each block of data is summarised using nag_correg_ssqmat (g02bu) and then the three summaries combined using nag_correg_ssqmat_combine (g02bz).

function g02bz_example


fprintf('g02bz example results\n\n');

x1 = [-1.10  4.06  -0.95  8.53 10.41;
       1.63 -3.22  -1.15 -1.30  3.78;
      -2.23 -8.19  -3.50  4.31 -1.11;
       0.92  0.33  -1.60  5.80 -1.15];

x2 = [ 2.12  5.00 -11.69 -1.22  2.86;
       4.82 -7.23  -4.67  0.83  3.46;
      -0.51 -1.12  -1.76  1.45  0.26;
      -4.32  4.89   1.34 -1.12 -2.49;
       0.02 -0.74   0.94 -0.99 -2.61];

wt = [ 2;    0.89;  0.32; 4.19; 4.33];

x3 = [ 1.37  0.00  -0.53 -7.98  3.32;
       4.15 -2.81  -4.09 -7.96 -2.13;
      13.09 -1.43   5.16 -1.83  1.58];

for b=1:3

  switch b
    case 1
      % first data block: summarise the data into xmean and xc
      [xsw, xmean, xc, ifail] = g02bu( ...
                                       x1);
    case 2
      [ysw, ymean, yc, ifail] = g02bu( ...
                                       x2, 'wt', wt);
    case 3
      [ysw, ymean, yc, ifail] = g02bu( ...
                                       x3);
  end

  if b ~= 1
    % Update the running summaries
    [xsw, xmean, xc, ifail] = g02bz( ...
                                     xsw, xmean, xc, ysw, ymean, yc);
  end
end

% Display results
fprintf('\nMeans\n');
disp(xmean');
mtitle = 'Sums of squares and cross-products';
uplo = 'Upper';
diag = 'Non-unit';
m = int64(5);
[ifail] = x04cc( ...
                 uplo, diag, m, xc, mtitle);

if xsw > 1
  % convert to covariance matrix
  fprintf('\n');
  mtitle = 'Covariance Matrix';
  [ifail] = x04cc( ...
                   uplo, diag, m, xc/(xsw-1), mtitle);
end

g02bz example results


Means
    0.4369    0.4929   -1.3387   -0.5684    0.0987

 Sums of squares and cross-products
             1          2          3          4          5
 1    304.5052  -123.7700   -27.1830   -60.7092    83.4830
 2               298.9148   -17.3196    -2.1710     5.2072
 3                          332.1639    -3.9445   -96.9299
 4                                     264.7684    79.6211
 5                                                225.5948

 Covariance Matrix
             1          2          3          4          5
 1     17.1746    -6.9808    -1.5332    -3.4241     4.7086
 2                16.8593    -0.9769    -0.1224     0.2937
 3                           18.7346    -0.2225    -5.4670
 4                                      14.9334     4.4908
 5                                                 12.7239

NAG Toolbox: nag_correg_ssqmat_combine (g02bz)

▸▿ Contents

Purpose

Syntax