NAG Library Routine Document

f11xaf  (real_gen_matvec)

 Contents

    1  Purpose
    7  Accuracy

1
Purpose

f11xaf computes a matrix-vector or transposed matrix-vector product involving a real sparse nonsymmetric matrix stored in coordinate storage format.

2
Specification

Fortran Interface
Subroutine f11xaf ( trans, n, nnz, a, irow, icol, check, x, y, ifail)
Integer, Intent (In):: n, nnz, irow(nnz), icol(nnz)
Integer, Intent (Inout):: ifail
Real (Kind=nag_wp), Intent (In):: a(nnz), x(n)
Real (Kind=nag_wp), Intent (Out):: y(n)
Character (1), Intent (In):: trans, check
C Header Interface
#include nagmk26.h
void  f11xaf_ ( const char *trans, const Integer *n, const Integer *nnz, const double a[], const Integer irow[], const Integer icol[], const char *check, const double x[], double y[], Integer *ifail, const Charlen length_trans, const Charlen length_check)

3
Description

f11xaf computes either the matrix-vector product y=Ax, or the transposed matrix-vector product y=ATx, according to the value of the argument trans, where A is an n by n sparse nonsymmetric matrix, of arbitrary sparsity pattern. The matrix A is stored in coordinate storage (CS) format (see Section 2.1.1 in the F11 Chapter Introduction). The array a stores all nonzero elements of A, while arrays irow and icol store the corresponding row and column indices respectively.
It is envisaged that a common use of f11xaf will be to compute the matrix-vector product required in the application of f11bef to sparse linear systems. An illustration of this usage appears in Section 10 in f11ddf.

4
References

None.

5
Arguments

1:     trans – Character(1)Input
On entry: specifies whether or not the matrix A is transposed.
trans='N'
y=Ax is computed.
trans='T'
y=ATx is computed.
Constraint: trans='N' or 'T'.
2:     n – IntegerInput
On entry: n, the order of the matrix A.
Constraint: n1.
3:     nnz – IntegerInput
On entry: the number of nonzero elements in the matrix A.
Constraint: 1nnzn2.
4:     annz – Real (Kind=nag_wp) arrayInput
On entry: the nonzero elements in the matrix A, ordered by increasing row index, and by increasing column index within each row. Multiple entries for the same row and column indices are not permitted. The routine f11zaf may be used to order the elements in this way.
5:     irownnz – Integer arrayInput
6:     icolnnz – Integer arrayInput
On entry: the row and column indices of the nonzero elements supplied in array a.
Constraints:
irow and icol must satisfy the following constraints (which may be imposed by a call to f11zaf):
  • 1irowin and 1icolin, for i=1,2,,nnz;
  • irowi-1<irowi or irowi-1=irowi and icoli-1<icoli, for i=2,3,,nnz.
7:     check – Character(1)Input
On entry: specifies whether or not the CS representation of the matrix A, values of n, nnz, irow and icol should be checked.
check='C'
Checks are carried on the values of n, nnz, irow and icol.
check='N'
None of these checks are carried out.
See also Section 9.2.
Constraint: check='C' or 'N'.
8:     xn – Real (Kind=nag_wp) arrayInput
On entry: the vector x.
9:     yn – Real (Kind=nag_wp) arrayOutput
On exit: the vector y.
10:   ifail – IntegerInput/Output
On entry: ifail must be set to 0, -1​ or ​1. If you are unfamiliar with this argument you should refer to Section 3.4 in How to Use the NAG Library and its Documentation for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value -1​ or ​1 is recommended. If the output of error messages is undesirable, then the value 1 is recommended. Otherwise, if you are not familiar with this argument, the recommended value is 0. When the value -1​ or ​1 is used it is essential to test the value of ifail on exit.
On exit: ifail=0 unless the routine detects an error or a warning has been flagged (see Section 6).

6
Error Indicators and Warnings

If on entry ifail=0 or -1, explanatory error messages are output on the current error message unit (as defined by x04aaf).
Errors or warnings detected by the routine:
ifail=1
On entry,trans'N' or 'T',
orcheck'C' or 'N'.
ifail=2
On entry,n<1,
ornnz<1,
ornnz>n2.
ifail=3
On entry, the arrays irow and icol fail to satisfy the following constraints:
  • 1irowin and 1icolin, for i=1,2,,nnz;
  • irowi-1<irowi, or irowi-1=irowi and icoli-1<icoli, for i=2,3,,nnz.
Therefore a nonzero element has been supplied which does not lie within the matrix A, is out of order, or has duplicate row and column indices. Call f11zaf to reorder and sum or remove duplicates.
ifail=-99
An unexpected error has been triggered by this routine. Please contact NAG.
See Section 3.9 in How to Use the NAG Library and its Documentation for further information.
ifail=-399
Your licence key may have expired or may not have been installed correctly.
See Section 3.8 in How to Use the NAG Library and its Documentation for further information.
ifail=-999
Dynamic memory allocation failed.
See Section 3.7 in How to Use the NAG Library and its Documentation for further information.

7
Accuracy

The computed vector y satisfies the error bound: where cn is a modest linear function of n, and ε is the machine precision.

8
Parallelism and Performance

f11xaf is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
f11xaf makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9
Further Comments

9.1
Timing

The time taken for a call to f11xaf is proportional to nnz.

9.2
Use of check

It is expected that a common use of f11xaf will be to compute the matrix-vector product required in the application of f11bef to sparse linear systems. In this situation f11xaf is likely to be called many times with the same matrix A. In the interests of both reliability and efficiency you are recommended to set check='C' for the first of such calls, and to set check='N' for all subsequent calls.

10
Example

This example reads in a sparse matrix A and a vector x. It then calls f11xaf to compute the matrix-vector product y=Ax and the transposed matrix-vector product y=ATx.

10.1
Program Text

Program Text (f11xafe.f90)

10.2
Program Data

Program Data (f11xafe.d)

10.3
Program Results

Program Results (f11xafe.r)

© The Numerical Algorithms Group Ltd, Oxford, UK. 2017