//$$ newmat.txt Documentation file Documentation for newmat03, an experimental matrix package in C++. ================================================================== MATRIX PACKAGE 25 November, 1991 Copyright (C) 1991: R B Davies and DSIR Permission is granted to use but not to sell. Contents ======== General description Is this the package you need? Changes Where you can get a copy of this package Compiler performance Detailed documentation Customising Constructors Elements of matrices Matrix copy Unary operators Binary operators Combination of a matrix and scalar Scalar functions of matrices Submatrix operations Change dimensions Change type Multiple matrix solve Memory management Output Accessing matrices of unspecified type Cholesky decomposition Householder triangularisation Singular Value Decomposition Eigenvalues Sorting Fast Fourier Transform Interface to Numerical Recipes in C List of files Notes on the design of the package What this is package for What size of matrices? Allow matrix expressions? Which matrix types? What element types? Naming convention Row and Column index ranges Structure of matrix objects Data storage - one block or several Data storage - by row or by column or other Storage of symmetric matrices Element access - method and checking Use iterators? Memory management - reference counting or status variable? Evaluation of expressions - use two stage method? How to overcome an explosion in number of operations Using const A calculus of matrix types Error handling Band and sparse matrices Problem report form --------------------------------------------------------------------------- General description =================== The package is intented for scientists and engineers who need to manipulate a variety of types of matrices using standard matrix operations. Emphasis is on the kind of operations needed in statistical calculations such as least squares, linear equation solve and eigenvalues. It supports matrix types Matrix (rectangular matrix) nricMatrix (variant of rectangular matrix) UpperTriangularMatrix LowerTriangularMatrix DiagonalMatrix SymmetricMatrix RowVector (derived from Matrix) ColumnVector (derived from Matrix). Only one element type (float or double) is supported. The package includes the operations *, +, -, inverse, transpose, conversion between types, submatrix, determinant, Cholesky decomposition, Householder triangularisation, singular value decomposition, eigenvalues of a symmetric matrix, sorting, fast fourier transform, printing and an interface with "Numerical Recipes in C". It is intended for matrices in the range 4 x 4 to about 90 x 90 (125 x 125 for triangular matrices). The upper limit is imposed by the maximum number of elements that can be contained in a single array (8192 doubles in some machines). A two-stage approach to evaluating matrix expressions is used to improve efficiency and reduce use of temporary storage. The package is designed for version 2 of C++. It works with Turbo C++, Borland C++, Glockenspiel C++ (2.00a) on a PC and AT&T C++ (2.0) and Gnu C++ on a Sun. It works with some problems with Zortech C++ (version 2). --------------------------------------------------------------------------- Is this the package you need? ============================= Do you 1. need matrix operators such as * and + defined as operators so you can write things like X = A * (B + C); 2. need a variety of types of matrices 3. need only one element type (float or double) 4. work with matrices in the range 4x4 to 90x90 5. tolerate a large and complex package Then maybe this is the right package for you. If you don't need (1) then there may be better options. Likewise if you don't need (2) there may be better options. If you require "not (5)" then this is not the package for you. If you need (2) and "not (3)" and have some spare money, then maybe you should look at M++ from Dyad or the Rogue Wave matrix package. --------------------------------------------------------------------------- Changes ======= Newmat03 - November 1991: Col and Cols become Column and Columns. Added Sort, SVD, Jacobi, Eigenvalues, FFT, real conversion of 1x1 matrix, "Numerical Recipes in C" interface, output operations, various scalar functions. Improved return from functions. Reorganised setting options in "include.hxx". Newmat02 - July 1991: Version with matrix row/column operations and numerous additional functions. Matrix - October 1990: Early version of package. --------------------------------------------------------------------------- How to get a copy of this package ================================= I am putting copies on Compuserve (Borland library, zip format), SIMTEL20 (MsDos library, zip format), comp.sources.misc on Internet (shar format), and on the MsDos program library at Victoria University, Wellington. --------------------------------------------------------------------------- Compiler performance ==================== I have tested this package on a number of compilers. Here are the levels of success with this package. In most cases I have chosen code that works under all the compilers I have access to, but I have had to include some specific work-arounds for some compilers. For the MsDos versions, I am using a 386/387sx computer running MsDos 5, except that Turbo is on an old XT. The unix versions are on a Sun Sparc station. A series of #defines at the beginning of "include.hxx" customises the package for the compiler you are using. Turbo, Borland and Zortech are recognised automatically, otherwise you have to set the appropriate #define statement. The compilers are looking a bit old now. I do intend to test the package against newer versions as they become available. Borland C++ 2.0: Recently, this has been my main development platform, so naturally almost everything works with this compiler. The library manager "tlib" fails but you can use "zorlib" from Zortech instead. Sometimes Borland crashes during a compiler or mis-compiles. You just have to reboot and continue the compile. Turbo C++ (? version): Almost works OK. My rather elderly version does show a problem. Probably not worth tracking down - buy a newer version. Haven't tried the linker. Zortech C++ 2.12: "const" doesn't work correctly with this compiler, so the package skips all of the statements Zortech can't handle. If you are using a later version of Zortech you could probably re-activate these statements. Zortech leaves rubbish on the heap. I don't know whether this is my programming error or a Zortech error. It works better when one doesn't optimise but there still are problems. The nric routines don't work. Zortech does not support IO manipulators. Glockenspiel C++ (2.00a for MsDos loading into Microsoft C 5.1): I haven't tested the latest version of my package with Glockenspiel. I had to #define the matrix names to shorter names to avoid ambiguities and had quite a bit of difficulty stopping the compiles from running out of space and not exceeding Microsoft's block nesting limit. A couple of my test statements produced statements too complex for Microsoft, but basically the package worked. This was my original development platform and I still use .cxx and .hxx as my file name extensions. However, Glockenspiel is no longer competitive for MsDos and I am not updating my copy of the compiler. Sun AT&T C++ 2.00: This works fine. Except aggregates are not supported. Gnu G++ 1.37.1: This mostly works. You don't seem to be able to use expressions like Matrix(X*Y) in the middle of an expression and (Matrix)(X*Y) is unreliable. Gnu does not support IO manipulators. Gnu leaves rubbish on the heap. This is from output statements and not my package and may not be an error. The previous version of the package did not work under Gnu 1.37 or 1.39. --------------------------------------------------------------------------- Detailed Documentation ====================== Copyright (C) 1989,1990,1991: R B Davies and DSIR Permission is granted to use but not to sell. -------------------------------------------------------------- | Please understand that this is a test version; there may | | still be bugs and errors. Use at your own risk. Neither I | | nor DSIR take any responsibility for any errors or omissions | | in this package or for any misfortune that may befall you or | | others as a result of its use. | -------------------------------------------------------------- Please report bugs to me at robert@am.dsir.govt.nz or Compuserve 72777,656 When reporting a bug please tell me which C++ compiler you are using (if known), and what version. Also give me details of your computer (if known). Tell me where you downloaded your version of my package from and its version number (eg newmat03 or newmat04). (There may be very minor differences between versions at different sites). Note any changes you have made to my code. If at all possible give me a piece of code illustrating the bug. Please do report bugs to me. The matrix inverse routine and the sort routines are adapted from "Numerical Recipes in C" by Press, Flannery, Teukolsky, Vetterling, published by the Cambridge University Press. Other code is adapted from routines in "Handbook for Automatic Computation, Vol II, Linear Algebra" by Wilkinson and Reinsch, published by Springer Verlag. Customising ----------- I use .hxx as the suffix of definition files and .cxx as the suffix of C++ source files. This does not cause any problems with the compilers I use except that Borland and Turbo need to be told to accept any suffix as meaning a C++ file rather than a C file. Use the large model when you are using a PC. Do not "outline" inline functions. Each file accessing the matrix package needs to have file newmat.hxx #included at the beginning. Files using matrix applications (Cholesky decomposition, Householder triangularisation) need newmatap.hxx instead (or as well). If you need the output functions you will also need newmatio.hxx. Glockenspiel also needs to have include.hxx #included before newmat.hxx. The file include.hxx sets the options for the compiler. If you are using a compiler different from one I have worked with you may have to set up a new section in include.hxx appropriate for your compiler. Borland, Turbo and Zortech are recognised automatically. If you using Glockenspiel on a PC, AT&T, or Gnu C++ activate the appropriate statement at the beginning of include.hxx. Activate the appropriate statement to make the element type float or double. The file (newmat9.cxx) containing the output routines can be used only with libraries that support the AT&T input/output routines including manipulators. It cannot be used with Zortech or Gnu. Constructors ------------ To construct an m x n matrix, A, (m and n are integers) use Matrix A(m,n); The UpperTriangularMatrix, LowerTriangularMatrix, SymmetricMatrix and DiagonalMatrix types are symmetric. To construct an n x n matrix use, for example UpperTriangularMatrix U(n); Likewise the RowVector and ColumnVector types take just one argument in their constructors: RowVector RV(n); You can also construct vectors and matrices without specifying the dimension. For example Matrix A; In this case the dimension must be set by an assignment statement or a re-dimension statement. You can also use a constructor to set a matrix equal to another matrix or matrix expression. Matrix A = U; Matrix A = U * L; Only conversions that don't lose information are supported - eg you cannot convert an upper triangular matrix into a diagonal matrix using =. Elements of matrices -------------------- Elements are accessed by expressions of the form A(i,j) where i and j run from 1 to the appropriate dimension. Access elements of vectors with just one argument. Diagonal matrices can accept one or two subscripts. This is different from the earlier version of the package in which the subscripts ran from 0 to one less than the appropriate dimension. Use A.element(i,j) if you want this earlier convention. Matrix copy ----------- The operator = is used for copying matrices, converting matrices, or evaluating expressions. For example A = B; A = L; A = L * U; Only conversions that don't lose information are supported. The dimensions of the matrix on the left hand side are adjusted to those of the matrix or expression on the right hand side. Elements on the right hand side which are not present on the left hand side are set to zero. The operator << can be used in place of = where it is permissible for information to be lost. For example SymmetricMatrix S; Matrix A; ...... S << A.t() * A; is acceptable whereas S = A.t() * A; // error will cause a runtime error since the package does not (yet) recognise A.t()*A as symmetric. Note that you can not use << with constructors. For example SymmetricMatrix S << A.t() * A; // error does not work. A third copy routine is used in a similar role to =. Use A.Inject(D); to copy the elements of D to the corresponding elements of A but leave the elements of A unchanged if there is no corresponding element of D (the = operator would set them to 0). This is useful, for example, for setting the diagonal elements of a matrix without disturbing the rest of the matrix. Unlike = and <<, Inject does not reset the dimensions of A, which must match those of D. Both << and Inject can be used with submatrix expressions on the left hand side. See the section on submatrices. To set the elements of a matrix to a scalar use operator = real r; Matrix A(m,n); ...... Matrix A(m,n); A = r; You can load the elements of a matrix from an array: Matrix A(3,2); real a[] = { 11,12,21,22,31,33 }; A << a; This construction cannot check that the numbers of elements match correctly. This version of << can be used with submatrices on the left hand side. Unary operators --------------- The package supports unary operations change sign of elements -A transpose A.t() inverse (of square matrix A) A.i() Binary operations ----------------- The package supports binary operations matrix addition A+B matrix subtraction A-B matrix multiplication A*B equation solve (square matrix A) A.i()*B In the last case the inverse is not calculated. Notes: If you are doing repeated multiplication. For example A*B*C, use brackets to force the order to minimize the number of operations. If C is a column vector and A is not a vector, then it will usually reduce the number of operations to use A*(B*C) . The package does not recognise B*A.i() as an equation solve. It is probably better to use (A.t().i()*B.t()).t() . Combination of a matrix and scalar ---------------------------------- The following expression multiplies the elements of a matrix A by a scalar f: A * f; Likewise one can divide the elements of a matrix A by a scalar f: A / f; The expressions A + f and A - f add or subtract a rectangular matrix of the same dimension as A with elements equal to f to or from the matrix A. In each case the matrix must be the first term in the expression. Expressions such f + A or f * A are not recognised. Scalar functions of matrices ---------------------------- int m = A.Nrows(); // number of rows int n = A.Ncols(); // number of columns real ssq = A.SumSquare(); // sum of squares of elements real sav = A.SumAbsoluteValue(); // sum of absolute values real mav = A.MaximumAbsoluteValue(); // maximum of absolute values real norm = A.Norm1(); // maximum of sum of absolute values of elements of a column real norm = A.NormInfinity(); // maximum of sum of absolute values of elements of a row real t = A.Trace(); // trace LogandSign ld = A.LogDeterminant(); // log of determinant BOOL z = A.IsZero(); // test all elements zero MatrixType mt = A.Type(); // type of matrix real* s = Store(); // pointer to array of elements int l = Storage(); // length of array of elements A.LogDeterminant() returns a value of type LogandSign. If ld is of type LogAndSign use ld.Value() to get the value of the determinant ld.Sign() to get the sign of the determinant (values 1, 0, -1) ld.LogValue() to get the log of the absolute value. A.IsZero() returns BOOL value TRUE if the matrix A has all elements equal to 0.0. MatrixType mt = A.Type() returns the type of a matrix. Use (char*)mt to get a string (UT, LT, Rect, Sym, Diag, RowV, ColV, Crout) showing the type. SumSquare(A), SumAbsoluteValue(A), MaximumAbsoluteValue(A), Trace(A), LogDeterminant(A), Norm1(A), NormInfinity(A) can be used in place of A.SumSquare(), A.SumAbsoluteValue(), A.MaximumAbsoluteValue(), A.Trace(), A.LogDeterminant(), A.Norm1(), A,NormInfinity(). Submatrix operations -------------------- A.SubMatrix(fr,lr,fc,lc) This selects a submatrix from A. the arguments fr,lr,fc,lc are the first row, last row, first column, last column of the submatrix with the numbering beginning at 1. This may be used in any matrix expression or on the left hand side of << or Inject. Inject does not check no information loss in this case. You can also use the construction real c; .... A.SubMatrix(fr,lr,fc,lc) << c; to set a submatrix equal to a constant. The follwing are variants of SubMatrix: A.SymSubMatrix(f,l) // This assumes fr=fc and lr=lc. A.Rows(f,l) // select rows A.Row(f) // select single row A.Columns(f,l) // select columns A.Column(f) // select single column In each case f and l mean the first and last row or column to be selected (starting at 1). If SubMatrix or its variant occurs on the right hand side of an = or << or within an expression its type is as follows A.Submatrix(fr,lr,fc,lc): If A is RowVector or ColumnVector then same type otherwise type Matrix A.SymSubMatrix(f,l): Same type as A A.Rows(f,l): Type Matrix A.Row(f): Type RowVector A.Columns(f,l): Type Matrix A.Column(f): Type ColumnVector If SubMatrix or its variant appears on the left hand side of << , think of its type being Matrix. Thus L.Row(1) where L is LowerTriangularMatrix expects L.Ncols() elements even though it will use only one of them. Change dimensions ----------------- The following operations change the dimensions of a matrix. The values of the elements are lost. A.ReDimension(nrows,ncols); // for type Matrix or nricMatrix A.ReDimension(n); // for all other types Change type ----------- The following functions interpret the elements of a matrix (stored row by row) to be a vector or matrix of a different type. Actual copying is usually avoided where these occur as part of a more complicated expression. A.CopyToRow() A.CopyToColumn() A.CopyToDiagonal() A.CopyToMatrix(nrows,ncols) A.c() real(A) The form .c() is used in matrix expressions when A is of a const type. The expression real(A) is used to convert a 1 x 1 matrix to a scalar. Multiple matrix solve --------------------- If A is a square or symmetric matrix use CroutMatrix X = A; // carries out LU decomposition Matrix AP = X.i()*P; Matrix AQ = X.i()*Q; LogAndSign ld = X.LogDeterminant(); rather than Matrix AP = A.i()*P; Matrix AQ = A.i()*Q; LogAndSign ld = A.LogDeterminant(); since each operation will repeat the LU decompostion. Memory management ----------------- The package does not support delayed copy. Several strategies are required to prevent unnecessary matrix copies. Where a matrix is called as a function argument use a constant reference. For example YourFunction(const Matrix& A) rather than YourFunction(Matrix A) Constant matrices cannot be used in matrix expressions so if you wish to use A in an expression within this function use A.c() rather than A. Skip the rest of this section on your first reading. A second place where it is desirable to avoid unnecessary copies is when a function is returning a matrix. Matrices can be returned from a function with the return command as you would expect. However these may incur one and possibly two copyings of the matrix. To avoid this use the following instructions. Make your function of type ReturnMatrix . Then precede the return statement with a Release statement (or a ReleaseAndDelete statement if the matrix was created with new). For example ReturnMatrix MakeAMatrix() { Matrix A; ...... A.Release(); return A; } or ReturnMatrix MakeAMatrix() { Matrix* m = new Matrix; ...... m->ReleaseAndDelete(); return *m; } Note that .c() cannot be applied to a matrix following application of .Release() or ->ReleaseAndDelete() . --------------------------------------------------------------------- | Do not forget to make the function of type ReturnMatrix; otherwise | | incomprehensible run-time errors will occur with some compilers. | --------------------------------------------------------------------- You can also use .Release() or ->ReleaseAndDelete() to allow a matrix expression to recycle space. Suppose you call A.Release(); just before A is used just once in an expression. Then the memory used by A is either returned to the system or reused in the expression. In either case, A's memory is destroyed. This procedure can be used to imporve efficiency and reduce the use of memory. Use ->ReleaseAndDelete for matrices created by new if you want to completely delete the matrix after it is accessed. Output ------ To print a matrix use an expression like Matrix A; ...... cout << setw(10) << setprecision(5) << A; This will work only with systems that support the AT&T input/output routines including manipulators. Accessing matrices of unspecified type -------------------------------------- Skip this section on your first reading. Suppose you wish to write a function which accesses a matrix of unknown type including expressions (eg A*B). Then use a layout similar to the following: void YourFunction(BaseMatrix& X) { GeneralMatrix* gm = X.Evaluate(); // evaluate an expression // if necessary ........ // operations on *gm gm->tDelete(); // delete *gm if a temporary } See, as an example, the definitions of operator<< in newmat9.cxx. Cholesky decomposition ---------------------- Suppose S is symmetric and positive definite. Then there exists a unique lower triangular matrix L such that L * L.t() = S. To calculate this use SymmetricMatrix S; ...... LowerTriangularMatrix L = Cholesky(S); Householder triangularisation ----------------------------- Start with matrix / X 0 \ s \ Y 0 / t n s The Householder triangularisation post multiplies by an orthogonal matrix Q such that the matrix becomes / 0 L \ s \ Z M / t n s where L is lower triangular. Note that X is the transpose of the matrix sometimes considered in this context. This is good for solving least squares problems: choose b (matrix or row vector) to minimize the sum of the squares of the elements of Y - b*X Then choose b = M * L.i(); Two routines are provided: HHDecompose(X, L); replaces X by orthogonal columns and forms L. HHDecompose(X, Y, M); uses X from the first routine, replaces Y by Z and forms M. Singular Value Decomposition ---------------------------- The singular value decomposition of an m x n matrix A ( where m >= n) is a decomposition A = U * D * V.t() where U is m x n with U.t() * U equalling the identity, D is an n x n diagonal matrix and V is an n x n orthogonal matrix. Singular value decompositions are useful for understanding the structure of ill-conditioned matrices, solving least squares problems, and for finding the eigenvalues of A.t() * A. To calculate the singular value decomposition of A (with m >= n) use one of SVD(A, D, U, V); // U (= A is OK) SVD(A, D); SVD(A, D, U); // U (= A is OK) SVD(A, D, U, FALSE); // U (can = A) for workspace only SVD(A, D, U, V, FALSE); // U (can = A) for workspace only The values of A are not changed unless A is also inserted as the third argument. Eigenvalues ----------- An eigenvalue decomposition of a symmetric matrix A is a decomposition A = V * D * V.t() where V is an orthogonal matrix and D is a diagonal matrix. Eigenvalue analyses are used in a wide variety of engineering, statistical and other mathematical analyses. The package includes two algorithms: Jacobi and Householder. The first is extremely reliable but much slower than the second. The code is adapted from routines in "Handbook for Automatic Computation, Vol II, Linear Algebra" by Wilkinson and Reinsch, published by Springer Verlag. Jacobi(A,D,S,V); // A, S symmetric is for workspace, // S = A is OK Jacobi(A,D); // A symmetric Jacobi(A,D,S); // A, S symmetric is for workspace, // S = A is OK Jacobi(A,D,V); // A symmetric EigenValues(A,D); // A symmetric EigenValues(A,D,S); // A, S symmetric is for back // transforming, S = A is OK EigenValues(A,D,V); // A symmetric Sorting ------- To sort the values in a matrix or vector, A, (in general this operation makes sense only for vectors and diagonal matrices) use SortAscending(A); or SortDescending(A); Fast Fourier Transform ---------------------- FFT(CV1, CV2, CV3, CV4); // CV3=CV1 and CV4=CV2 is OK where CV1, CV2, CV3, CV4 are column vectors. CV1 and CV2 are the real and imaginary input vectors; CV3 and CV4 are the real and imaginary output vectors. The lengths of CV1 and CV2 must be equal and should be the product of numbers less than about 10 for fast execution. Interface to Numerical Recipes in C ----------------------------------- This package can be used with the vectors and matrices defined in "Numerical Recipes in C". You need to edit the routines in Numerical Recipes so that the elements are of the same type as used in this package. Eg replace float by double, vector by dvector and matrix by dmatrix, etc. You will also need to edit the function definitions to use the version acceptable to your compiler. Then enclose the code from Numerical Recipes in extern "C" { ... }. You will also need to include the matrix and vector utility routines. Then any vector in Numerical Recipes with subscripts starting from 1 in a function call can be replaced by a RowVector, ColumnVector or DiagonalMatrix in the present package. Similarly any matrix with subscripts starting from 1 can be replaced by an nricMatrix in the present package. The class nricMatrix is derived from Matrix and can be used in place of Matrix. Numerical Recipes cannot change the dimensions of a matrix or vector. So matrices or vectors must be correctly dimensioned before a Numerical Recipes routine is called. For example SymmetricMatrix B(44); ..... // load values into B nricMatrix BX = B; // copy values to an nricMatrix DiagonalMatrix D(44); // Matrices for output nricMatrix V(44,44); // correctly dimensioned int nrot; jacobi(BX,44,D,V,&nrot); // jacobi from NRIC cout << D; // print eigenvalues --------------------------------------------------------------------------- List of files ============= NEWMAT TXT documentation file NEWMAT LIS list of files BOOLEAN HXX boolean class definition CONTROLW HXX control word definition file INCLUDE HXX details of include files and options NEWMAT HXX main matrix clss definition file NEWMATAP HXX applications definition file NEWMATIO HXX input/output definition file NEWMATRC HXX row/column functions definition files NEWMATRM HXX rectangular matrix access definition files PRECISIO HXX numerical precision constants CHOLESKY CXX Cholesky decomposition EVALUE CXX eigenvalues and eigenvector calculation FFT CXX fast Fourier transform HHOLDER CXX Householder triangularisation JACOBI CXX eigenvalues by the Jacobi method NEWMAT1 CXX type manipulation routines NEWMAT2 CXX row and column manipulation functions NEWMAT3 CXX row and column access functions NEWMAT4 CXX constructors, redimension, utilities NEWMAT5 CXX transpose, evaluate, matrix functions NEWMAT6 CXX operators, element access NEWMAT7 CXX invert, solve, binary operations NEWMAT8 CXX LU decomposition, scalar functions NEWMAT9 CXX output routines NEWMATRM CXX rectangular matrix access functions SORT CXX sorting functions SUBMAT CXX submatrix functions SVD CXX singular value decomposition EXAMPLE CXX example of use of package EXAMPLE TXT output from example EXAMPLE DEP dependency file for example --------------------------------------------------------------------------- Notes on the design of the package ================================== Copyright (C) 1991: R B Davies and DSIR Please treat this as an academic publication. You can use the ideas but please acknowledge in any publication. In this section, I describe some of the ideas behind this package, some of the decisions that I needed to make and give some details about the way it works. You don't need to read this section in order to use the package. I don't think it is obvious what is the best way of going about structuring a matrix package. And I don't think you can figure this out with "thought" experiments. Different people have to try out different approaches. And someone else may have to figure out which is best. Or, more likely, the ultimate packages will lift some ideas from each of a variety of trial packages. So, I don't claim my package is an "ultimate" package, but simply a trial of a number of ideas. But I do hope it will meet the immediate requirements of some people who need to carry out matrix calculations. What this is package for ------------------------ The package is used for the manipulation of matrices, including the standard operations such as multiplication as understood by numerical analysts, engineers and mathematicians. A matrix is a two dimensional array of numbers. However, very special operations such as matrix multiplication are defined specifically for matrices. This means that a "matrix" package tends to be different from a general "array" package. I see a matrix package as providing the following 1. Evaluation of matrix expressions in a form familiar to scientists and engineers. For example A = B * (C + D.t()); 2. Access to the elements of a matrix; 3. Access to submatrices; 4. Common elementary matrix functions such as determinant and trace; 5. A platform for developing advanced matrix functions such as eigen- value analysis; 6. Good efficiency and storage management; 7. Graceful exit from errors (I don't provide this yet). It may also provide 8. A variety of types of elements (eg real and complex); 9. A variety of types of matrices (eg rectangular and symmetric). In contrast an array package should provide 1'. Arrays can be copied with a single instruction; may have some arithmetic operations, say + - and scalar + - * /, it may provide matrix multiplication as a function; 2'. High speed access to elements directly and with iterators; 3'. Access to subarrays and rows (and columns?); 6'. Good efficiency and storage management; 7'. Graceful exit from errors; 8'. A variety of types of elements (eg real and complex); 9'. One, two and three dimensional arrays, at least, with starting points of the indices set by user; may have arrays with ragged rows. It may be possible to amalgamate these two sets of requirements to some extent. However my package is definitely oriented towards the first set. Even within the bounds set by the first set of requirements there is a substantial opportunity for variation between what different matrix packages might provide. It is not possible to build a matrix package that will meet everyone's requirements. In many cases if you put in one facility, you impose overheads on everyone using the package. This both is storage required for the program and in efficiency. Likewise a package that is optimised towards handling large matrices is likely to become less efficient for very small matrices where the administration time for the matrix may become significant compared with the time to carry out the operations. It is better to provide a variety of packages (hopefully compatible) so that most users can find one that meets their requirements. This package is intended to be one of these packages; but not all of them. Since my background is in statistical methods, this package is oriented towards the kinds things you need for statistical analyses. What size of matrices? ---------------------- A matrix package may target small matrices (say 3 x 3), or medium sized matrices, or very large matrices. A package targeting very small matrices will seek to minimise administration. A package for medium sized or very large matrices can spend more time on administration in order to conserve space or optimise the evaluation of expressions. A package for very large matrices will need to pay special attention to storage and numerical properties. This package is designed for medium sized matrices. This means it is worth introducing some optimisations, but I don't have to worry about the 64K limit that some compilers impose. Allow matrix expressions? ------------------------- I want to be able to write matrix expressions the way I would on paper. So if I want to multiply two matrices and then add the transpose of a third one I can write something like X = A * B + C.t(); I want this expression to be evaluated with close to the same efficiency as a hand-coded version. This is not so much of a problem with expressions including a multiply since the multiply will dominate the time. However, it is not so easy to achieve with expressions with just + and - . A second requirement is that temporary matrices generated during the evaluation of an expression are destroyed as quickly as possible. A desirable feature is that a certain amount of "intelligence" be displayed in the evaluation of an expression. For example, in the expression X = A.i() * B; where i() denotes inverse, it would be desirable if the inverse wasn't explicitly calculated. Which matrix types? ------------------- As well as the usual rectangular matrices, matrices occuring repeatedly in numerical calculations are upper and lower triangular matrices, symmetric matrices and diagonal matrices. This is particularly the case in calculations involving least squares and eigenvalue calculations. So as a first stage these were the types I decided to include. It is also necessary to have types row vector and column vector. In a "matrix" package, in contrast to an "array" package, it is necessary to have both these types since they behave differently in matrix expressions. The vector types can be derived for the rectangular matrix type, so having them does not greatly increase the complexity of the package. The problem with having several matrix types is the number of versions of the binary operators one needs. If one has 5 distinct matrix types then a simple package will need 25 versions of each of the binary operators. In fact, we can evade this problem, but at the cost of some complexity. What element types? ------------------- Ideally we would allow element types double, float, complex and int, at least. It would be reasonably easy, using templates or equivalent, to provide a package which could handle a variety of element types. However, as soon as one starts implementing the binary operators between matrices with different element types, again one gets an explosion in the number of operations one needs to consider. Hence I decided to implement only one element type. But the user can decide whether this is float or double. The package assumes elements are of type real. The user typedefs to float or double. In retrospect, it would not be too hard to include matrices with complex matrix type. It might also be worth including symmetric and triangular matrices with extra precision elements (double or long double) to be used for storage only and with a minimum of operations defined. These would be used for accumulating the results of sums of squares and product matrices or multistage Householder triangularisations. Naming convention ----------------- How are classes and public member functions to be named? As a general rule I have spelt identifiers out in full with individual words being capitalised. For example "UpperTriangularMatrix". If you don't like this you can #define or typedef shorter names. This convention means you can select an abbreviation scheme that makes sense to you. The convention causes problems for Glockenspiel C++ on a PC feeding into Microsoft C. The names Glockenspiel generates exceed the the 32 characters recognised by Microsoft C and ambiguities result. So it is necessary to #define shorter names. Exceptions to the general rule are the functions for transpose and inverse. To make matrix expressions more like the corresponding mathematical formulae, I have used the single letter abbreviations, t() and i() . Row and Column index ranges --------------------------- In mathematical work matrix subscripts usually start at one. In C, array subscripts start at zero. In Fortran, they start at one. Possibilities for this package were to make them start at 0 or 1 or be arbitrary. Alternatively one could specify an "index set" for indexing the rows and columns of a matrix. One would be able to add or multiply matrices only if the appropriate row and column index sets were identical. In fact, I adopted the simpler convention of making the rows and columns of a matrix be indexed by an integer starting at one, following the traditional convention. In an earlier version of the package I had them starting at zero, but even I was getting mixed up when trying to use this earlier package. So I reverted to the more usual notation. Structure of matrix objects --------------------------- Each matrix object contains the basic information such as the number of rows and columns and a status variable plus a pointer to the data array which is on the heap. Data storage - one block or several ----------------------------------- In this package the elements of the matrix are stored as a single array. Alternatives would have been to store each row as a separate array or a set of adjacent rows as a separate array. The present solution simplifies the program but limits the size of matrices in systems that have a 64k byte (or other) limit on the size of arrays. The large arrays may also cause problems for memory management in smaller machines. Data storage - by row or by column or other ------------------------------------------- In Fortran two dimensional arrays are stored by column. In most other systems they are stored by row. I have followed this later convention. This makes it easier to interface with other packages written in C but harder to interface with those written in Fortran. An alternative would be to store the elements by mid-sized rectangular blocks. This might impose less strain on memory management when one needs to access both rows and columns. Storage of symmetric matrices ----------------------------- Symmetric matrices are stored as lower triangular matrices. The decision was pretty arbitrary, but it does slightly simplify the Cholesky decomposition program. Element access - method and checking ------------------------------------ We want to be able to use the notation A(i,j) to specify the (i,j)-th element of a matrix. This is the way mathematicians expect to address the elements of matrices. I didn't even consider using the totally alien notation A[i][j]. There are two ways of working out the address of A(i,j). One is using a "dope" vector which contains the first address of each row. This is how C works when you use A[i][j]. Alternatively you can calculate the address using the formula appropriate for the structure of A. I use this second approach. It is probably slower, but saves worrying about an extra bit of storage. The other question is whether to check for i and j being in range. I do carry out this check following years of experience with both systems that do and systems that don't do this check. I would hope that the routines I supply with this package will reduce your need to access elements of matrices so speed of access is not a high priority. Use iterators? -------------- Iterators are an alternative way of providing fast access to the elements of an array or matrix when they are to be accessed sequentially. They need to be customised for each type of matrix. I have not implemented iterators in this package, although some iterator like functions are used for some row and column functions. Memory management - reference counting or status variable? ---------------------------------------------------------- Consider the instruction X = A + B + C; To evaluate this a simple program will add A to B putting the total in a temporary T1. Then it will add T1 to C creating another temporary T2 which will be copied into X. T1 and T2 will sit around till the end of the block. It would be faster if the program recognised that T1 was temporary and stored the sum of T1 and C back into T1 instead of creating T2 and then avoided the final copy by just assigning the contents of T1 to X rather than copying. In this case there will be no temporaries requiring deletion. (More precisely there will be a header to be deleted but no contents). For an instruction like X = (A * B) + (C * D); we can't avoid one temporary being left over, so we would like this temporary deleted as quickly as possible. I provide the functionality for doing this by attaching a status variable to each matrix. This indicates if the matrix is temporary so that its memory is available for recycling or deleting. Any matrix operation checks the status variables of the matrices it is working with and recycles or deletes any temporary memory. An alternative or additional approach would be to use delayed copying. If a program requests a matrix to be copied, the copy is delayed until an instruction is executed which modifies the memory of either the original matrix or the copy. This saves the unnecessary copying in the previous examples. However, it does not provide the additional functionality of my approach. It would be possible to have both delayed copy and tagging temporaries, but this seemed an unnecessary complexity. In particular delayed copy mechanisms seem to require two calls to the heap manager, using extra time and making building a memory compacting mechanism more difficult. Evaluation of expressions - use two stage method? ------------------------------------------------- Consider the instruction X = B - X; The simple program will subtract X from B, store the result in a temporary T1 and copy T1 into X. It would be faster if the program recognised that the result could be stored directly into X. This would happen automatically if the program could look at the instruction first and mark X as temporary. C programmers would expect to avoid the same problem with X = X - B; by using an operator -= (which I haven't provided, yet) X -= B; However this is an unnatural notation for non C users and it is much nicer to write X = X - B; and know that the program will carry out the simplification. Another example where this intelligent analysis of an instruction is helpful is in X = A.i() * B; where i() denotes inverse. Numerical analysts know it is inefficient to evaluate this expression by carrying out the inverse operation and then the multiply. Yet it is a convenient way of writing the instruction. It would be helpful if the program recognised this expression and carried out the more appropriate approach. To carry out this "intelligent" analysis of an instruction matrix expressions are evaluated in two stages. In the the first stage a tree representation of the expression is formed. For example (A+B)*C is represented by a tree * / \ + C / \ A B Rather than adding A and B the + operator yields an object of a class "AddedMatrix" which is just a pair of pointers to A and B. Then the * operator yields a "MultipliedMatrix" which is a pair of pointers to the "AddedMatrix" and C. The tree is examined for any simplifications and then evaluated recursively. Further possibilities not yet included are to recognise A.t()*A and A.t()+A as symmetric or to improve the efficiency of evaluation of expressions like A+B+C, A*B*C, A*B.t() [t() denotes transpose]. One of the disadvantages of the two-stage approach is that the types of matrix expressions are determined at run-time. So the compiler will not detect errors of the type Matrix M; DiagonalMatrix D; ....; D = M; We don't allow conversions using = when information would be lost. Such errors will be detected when the statement is executed. How to overcome an explosion in number of operations ---------------------------------------------------- The package attempts to solve the problem of the large number of versions of the binary operations required when one has a variety of types. With n types of matrices the binary operations will each require n-squared separate algorithms. Some reduction in the number may be possible by carrying out conversions. However the situation rapidly becomes impossible with more than 4 or 5 types. Doug Lea told me that it was possible to avoid this problem. I don't know what his solution is. Here's mine. Each matrix type includes routines for extracting individual rows or columns. I assume a row or column consists of a sequence of zeros, a sequence of stored values and then another sequence of zeros. Only a single algorithm is then required for each binary operation. The rows can be located very quickly since most of the matrices are stored row by row. Columns must be copied and so the access is somewhat slower. As far as possible my algorithms access the matrices by row. An alternative approach of using iterators will be slower since the iterators will involve a virtual function access at each step. In fact, I provide several algorithms for operations like + . If one is adding two matrices of the same type then there is no need to access the individual rows or columns and a faster general algorithm is appropriate. Generally the method works well. However symmetric matrices are not always handled very efficiently (yet) since complete rows are not stored explicitly. The original version of the package did not use this access by row or column method and provided the multitude of algorithms for the combination of different matrix types. The code file length turned out to be just a little longer than the present one when providing the same facilities with 5 distinct types of matrices. It would have been very difficult to increase the number of matrix types in the original version. Apparently 4 to 5 types is about the break even point for switching to the approach adopted in the present package. Using const ----------- The memory management scheme introduces a problem when a matrix is declared const. Because an operator may want to recycle the memory of its operands these operands cannot be declared const. It isn't reasonable for a temporary matrix to be declared const. However, I don't know how to tell this to the C++ compiler. One possibility is to provide alternative versions of the operators for operands declared const. But then one gets the explosion in the number of operators. My solution is to include versions of the initialisers for matrices declared const. Otherwise, you need to use A.c() in place of A if A is declared const and you wish to use it in an expression. A calculus of matrix types -------------------------- The program needs to be able to work out the class of the result of a matrix expression. This is to check that a conversion is legal or to determine the class of a temporary. To assist with this, a class MatrixType is defined. Operators +, -, *, >= are defined to calculate the types of the results of expressions or to check that conversions are legal. Error handling -------------- The package does not have graceful exit from errors. All errors are treated as fatal. Originally I thought I would wait until exceptions became available in C++. This now seems to have been delayed. In any case I don't think exceptions will solve all the problems. Some clean up of objects on the heap will often be required before one can exit via an exception. There are four categories of errors: Programming error - eg illegal conversion of a matrix, subscript out of bounds, matrix dimensions don't match; Illegal data error - eg Cholesky of a non-positive definite matrix; Out of space error - "new" returns a null pointer; Convergence failure - an iterative operation fails to converge. For the first two of these, it might be sensible to terminate a program. For the second two, one does want to return control to the user's program in a convenient manner. I don't know a good way of doing this, especially before exceptions are implemented. Band and sparse matrices ------------------------ The package does not yet support band or sparse types. At present the package assumes that the structure of a matrix is determined by its class and dimensions. This is not sufficient for band and sparse matrices. For band matrices one also needs to know the upper and lower band widths. For sparse matrices there is going to be some kind of structure vector. These are going to have to be calculated for the results of expressions in much the same way that types are calculated. In addition, a whole new set of row and column operations would have to be written for sparse matrices. However the present ones will be fine for band matrices. Band and sparse matrices are important for people solving large sets of differential equations. Sparse matrices are also important for statistical and operational research applications. ------------------------------------------------------------------------------- Matrix package problem report form ---------------------------------- Version: ...............newmat03 Date of release: .......Nov 25th, 1991 Primary site: ..........Simtel20 Downloaded from: ....... Your email address: .... Today's date: .......... Your machine: .......... Compiler & version: .... Describe the problem - attach examples if possible: Email to robert@am.dsir.govt.nz or Compuserve 72777,656 -------------------------------------------------------------------------------