Usage Notes
An ordinary linear eigensystem problem is represented by the equation Ax = λx where A denotes an n × n matrix. The value λ is an eigenvalue and x ≠ 0 is the corresponding eigenvector. The eigenvector is determined only up to a scalar factor. In all functions, we have chosen this factor so that x has Euclidean length one, and the component of x of largest magnitude is positive. The eigenvalues and corresponding eigenvectors are returned sorted in order of largest to smallest complex magnitude. If x is a complex vector, this component of largest magnitude is scaled to be real and positive. For eigenvectors in which the maximum magnitude is attained by more than one component, the entry where this component occurs can be arbitrary.
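As an illustration of this convention, the following sketch (plain C, independent of any CNL function; the function name is ours) normalizes a real eigenvector to Euclidean length one with its largest-magnitude component positive:

#include <math.h>

/* Normalize a real eigenvector x of length n so that ||x||_2 = 1 and
   the component of largest magnitude is positive, mirroring the
   normalization convention described above. */
void normalize_eigenvector(int n, double x[])
{
    double norm = 0.0, big = 0.0;
    int i, ibig = 0;

    for (i = 0; i < n; i++)
        norm += x[i] * x[i];
    norm = sqrt(norm);                /* Euclidean length of x */

    for (i = 0; i < n; i++) {
        x[i] /= norm;
        if (fabs(x[i]) > big) {       /* track component of largest magnitude */
            big = fabs(x[i]);
            ibig = i;
        }
    }
    if (x[ibig] < 0.0)                /* flip sign so that component is positive */
        for (i = 0; i < n; i++)
            x[i] = -x[i];
}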
A generalized linear eigensystem problem is represented by Ax = λBx where A and B are n × n matrices. The value λ is a generalized eigenvalue, and x is the corresponding generalized eigenvector. The generalized eigenvectors are normalized in the same manner as the ordinary eigensystem problem.
Error Analysis and Accuracy
The remarks in this section are for ordinary eigenvalue problems. Except in special cases, functions will not return the exact eigenvalue-eigenvector pair for the ordinary eigenvalue problem Ax = λx. Typically, the computed pair (λ̃, x̃) is an exact eigenvalue-eigenvector pair for a "nearby" matrix A + E. Information about E is known only in terms of bounds of the form ∥E∥₂ ≤ f(n)∥A∥₂ɛ. The value of f(n) depends on the algorithm, but is typically a small fractional power of n. The parameter ɛ is the machine precision. By a theorem due to Bauer and Fike (see Golub and Van Loan 1989, p. 342),

min{∣λ̃ − λ∣ : λ ∈ σ(A)} ≤ κ(X)∥E∥₂

where σ(A) is the set of all eigenvalues of A (called the spectrum of A), X is the matrix of eigenvectors, ∥⋅∥₂ denotes the Euclidean norm, and κ(X) is the condition number of X defined as κ(X) = ∥X∥₂∥X⁻¹∥₂. If A is a real symmetric or complex Hermitian matrix, then its eigenvector matrix X is respectively orthogonal or unitary. For these matrices, κ(X) = 1.
The accuracy of the computed eigenvalues λ̃ⱼ and eigenvectors x̃ⱼ can be checked by computing their performance index τ. The performance index is defined to be

τ = max_{1 ≤ j ≤ n} ∥Ax̃ⱼ − λ̃ⱼx̃ⱼ∥₂ / (nɛ∥A∥₂∥x̃ⱼ∥₂)

where ɛ is again the machine precision.
The performance index τ is related to the error analysis because (A + E)x̃ⱼ = λ̃ⱼx̃ⱼ implies

∥Ex̃ⱼ∥₂ = ∥Ax̃ⱼ − λ̃ⱼx̃ⱼ∥₂

where E is the "nearby" matrix discussed above.
While the exact value of τ is precision and data dependent, the performance of an eigensystem analysis function is defined as excellent if τ < 1, good if 1 ≤ τ ≤ 100, and poor if τ > 100. This is an arbitrary definition, but large values of τ can serve as a warning that there is a significant error in the calculation.
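The sketch below shows how τ could be computed for a real symmetric eigensystem, following the definition given above. It assumes A is stored row-major, the eigenvectors are the columns of evecs, and, because A is symmetric, ∥A∥₂ equals the largest computed eigenvalue magnitude; the function name is ours and the code is illustrative only:

#include <math.h>
#include <float.h>

/* Performance index tau for a computed real symmetric eigensystem.
   a is n x n row-major, evals holds the n computed eigenvalues, and
   evecs[i*n + j] is component i of eigenvector j. */
double performance_index(int n, const double a[], const double evals[],
                         const double evecs[])
{
    double anorm = 0.0, tau = 0.0;
    int i, j, k;

    /* For symmetric A, ||A||_2 is the largest eigenvalue magnitude. */
    for (j = 0; j < n; j++)
        if (fabs(evals[j]) > anorm)
            anorm = fabs(evals[j]);

    for (j = 0; j < n; j++) {
        double rnorm = 0.0, xnorm = 0.0, t;
        for (i = 0; i < n; i++) {
            double ri = -evals[j] * evecs[i*n + j];   /* residual A*x - lambda*x */
            for (k = 0; k < n; k++)
                ri += a[i*n + k] * evecs[k*n + j];
            rnorm += ri * ri;
            xnorm += evecs[i*n + j] * evecs[i*n + j];
        }
        /* DBL_EPSILON plays the role of the machine precision eps. */
        t = sqrt(rnorm) / (n * DBL_EPSILON * anorm * sqrt(xnorm));
        if (t > tau)
            tau = t;
    }
    return tau;
}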
If the condition number κ(X) of the eigenvector matrix X is large, there can be large errors in the eigenvalues even if τ is small. In particular, it is often difficult to recognize nearly multiple eigenvalues or unstable mathematical problems from numerical results. This facet of the eigenvalue problem is often difficult for users to understand. Suppose the accuracy of an individual eigenvalue λⱼ is desired. It can be estimated by computing the condition number of that eigenvalue (see Golub and Van Loan 1989, pp. 344-345). For matrices A such that the computed array of normalized eigenvectors X is invertible, the condition number of λⱼ is

κⱼ = ∥eⱼᵀX⁻¹∥₂

the Euclidean length of the j-th row of X⁻¹, where eⱼ denotes the j-th standard basis vector. Users can choose to compute the matrix X⁻¹ using function imsl_c_lin_sol_gen in Chapter 1, "Linear Systems." An approximate bound for the accuracy of a computed eigenvalue is then given by κⱼɛ∥A∥₂. To compute an approximate bound for the relative accuracy of an eigenvalue, divide this bound by ∣λⱼ∣.
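The following plain C sketch implements this estimate for the real case: it computes κⱼ as the Euclidean length of the j-th row of X⁻¹, which is assumed to be stored row-major (the function name is ours):

#include <math.h>
#include <float.h>

/* Condition number kappa_j of the j-th eigenvalue: the Euclidean
   length of row j of the inverse xinv of the n x n normalized
   eigenvector matrix X (row-major storage). */
double eigenvalue_condition(int n, const double xinv[], int j)
{
    double kappa = 0.0;
    int k;
    for (k = 0; k < n; k++)
        kappa += xinv[j*n + k] * xinv[j*n + k];
    return sqrt(kappa);
}

/* Example use, with a_norm2 = ||A||_2 and lambda[j] != 0:
     double kappa     = eigenvalue_condition(n, xinv, j);
     double abs_bound = kappa * DBL_EPSILON * a_norm2;
     double rel_bound = abs_bound / fabs(lambda[j]);
*/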
Reformulating Generalized Eigenvalue Problems
The eigenvalue problem Ax = λBx is often difficult for users to analyze because it is frequently ill-conditioned. Occasionally, changes of variables can be performed on the given problem to ease this ill-conditioning. Suppose that B is singular, but A is nonsingular. Define the reciprocal μ = λ⁻¹. Then the roles of A and B are interchanged so that the reformulated problem Bx = μAx is solved. Generalized eigenvalues μⱼ = 0 correspond to eigenvalues λⱼ = ∞; the remaining λⱼ = μⱼ⁻¹. The generalized eigenvectors for λⱼ correspond to those for μⱼ.
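A minimal sketch of this back-transformation, assuming the μⱼ have been computed (the function name is ours):

#include <math.h>

/* Map the computed generalized eigenvalues mu of B*x = mu*A*x back to
   the eigenvalues lambda of A*x = lambda*B*x.  A value mu_j = 0
   corresponds to lambda_j = infinity. */
void reciprocal_eigenvalues(int n, const double mu[], double lambda[])
{
    int j;
    for (j = 0; j < n; j++)
        lambda[j] = (mu[j] != 0.0) ? 1.0 / mu[j] : INFINITY;
}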
Now suppose that B is nonsingular. The user can solve the ordinary eigenvalue problem Cx = λx where C = B⁻¹A. The matrix C is subject to perturbations due to ill-conditioning and rounding errors when computing B⁻¹A. Computing the condition numbers of the eigenvalues for C may, however, be helpful for analyzing the accuracy of results for the generalized problem.
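One way to form C is column by column, solving Bcⱼ = aⱼ for each column aⱼ of A with the Chapter 1 solver imsl_d_lin_sol_gen. For efficiency one would factor B only once and reuse the factorization (see the optional arguments of imsl_d_lin_sol_gen); the sketch below refactors B on every call and is illustrative only:

#include <stdlib.h>
#include <imsl.h>

/* Form C = B^{-1} A (all matrices n x n, row-major) by solving
   B*c_j = a_j for each column a_j of A. */
void form_c(int n, double b[], const double a[], double c[])
{
    int i, j;
    double *rhs = (double *) malloc(n * sizeof(double));

    for (j = 0; j < n; j++) {
        double *col;
        for (i = 0; i < n; i++)
            rhs[i] = a[i*n + j];          /* j-th column of A */
        col = imsl_d_lin_sol_gen(n, b, rhs, 0);
        for (i = 0; i < n; i++)
            c[i*n + j] = col[i];          /* j-th column of C */
        imsl_free(col);
    }
    free(rhs);
}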
There is another method that users can consider to reduce the generalized problem to an alternate ordinary problem. This technique is based on first computing a matrix decomposition B = PQ where both P and Q are matrices that are "simple" to invert. Then, the given generalized problem is equivalent to the ordinary eigenvalue problem Fy = λy. The matrix F = P⁻¹AQ⁻¹ and the unnormalized eigenvectors of the generalized problem are given by x = Q⁻¹y. An example of this reformulation is used in the case where A and B are real and symmetric, with B positive definite. The function imsl_f_eig_symgen uses P = Rᵀ and Q = R where R is an upper-triangular matrix obtained from a Cholesky decomposition, B = RᵀR. The matrix F = R⁻ᵀAR⁻¹ is symmetric and real. Computation of the eigenvalue-eigenvector expansion for F is based on function imsl_f_eig_sym.
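A minimal usage sketch for imsl_f_eig_symgen follows, patterned on the standard CNL calling sequence; the IMSL_VECTORS optional argument is assumed here, so consult the function's own documentation for the authoritative argument list:

#include <imsl.h>

/* Solve A*x = lambda*B*x with A symmetric and B symmetric positive
   definite.  IMSL_VECTORS (assumed) returns the generalized
   eigenvectors along with the eigenvalues. */
int main(void)
{
    float a[] = {1.1f, 1.2f, 1.4f,
                 1.2f, 1.3f, 1.5f,
                 1.4f, 1.5f, 1.6f};
    float b[] = {2.0f, 1.0f, 0.0f,
                 1.0f, 2.0f, 1.0f,
                 0.0f, 1.0f, 2.0f};
    float *evec;
    float *eval = imsl_f_eig_symgen(3, a, b, IMSL_VECTORS, &evec, 0);

    imsl_f_write_matrix("eigenvalues", 1, 3, eval, 0);
    imsl_f_write_matrix("eigenvectors", 3, 3, evec, 0);
    return 0;
}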
Eigenvalue Computation With ARPACK-Based Functions
ARPACK consists of a set of Fortran 77 subroutines that use the implicitly restarted Arnoldi method, described in Sorensen (1992), to solve eigenvalue problems. ARPACK is well suited for large sparse or structured matrices A, where structured means that a matrix-vector product ω ← Av requires O(n) rather than the usual O(n²) floating-point operations.
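For example, a tridiagonal matrix is structured in this sense: the product ω ← Av touches only the three diagonals and never forms an n × n array, as the following sketch shows (the function name is ours):

/* O(n) matrix-vector product w = A*v for a tridiagonal matrix A,
   stored by its diagonals: sub[] (length n-1), diag[] (length n),
   and sup[] (length n-1). */
void tridiag_matvec(int n, const double sub[], const double diag[],
                    const double sup[], const double v[], double w[])
{
    int i;
    for (i = 0; i < n; i++) {
        w[i] = diag[i] * v[i];
        if (i > 0)     w[i] += sub[i-1] * v[i-1];
        if (i < n - 1) w[i] += sup[i]   * v[i+1];
    }
}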
A full description of the ARPACK features can be found in the ARPACK Users' Guide written by Lehoucq, Sorensen, and Yang (1998).
The original API for ARPACK uses a reverse communication interface. This interface can be used as illustrated in the ARPACK Users' Guide. To simplify the usage of the ARPACK algorithms, CNL instead applies a forward communication interface based on user-defined functions for the matrix-vector products or linear solving steps required by the algorithms in ARPACK. The linear operators need not be expressed as dense or sparse matrices; that is permitted, but for some problems the best approach is to form the product of the operator with a vector directly.
Function imsl_d_arpack_symmetric, based on ARPACK routines DSAUPD and DSEUPD, computes some of the eigenvalues and corresponding eigenvectors of generalized real symmetric eigenproblems of the form Ax = λBx. For B = I, the problem reduces to the standard eigenproblem. In the symmetric case, the Arnoldi method specializes to a variant of the Lanczos method.
Similarly, function imsl_d_arpack_general, which is based on ARPACK routines DNAUPD and DNEUPD, computes eigenvalues and eigenvectors of generalized real eigenproblems of the form Ax = λBx with a real general matrix A and a positive definite or positive semi-definite matrix B.
The Lanczos and Arnoldi methods usually work very well for the computation of non-clustered eigenvalues at the periphery of the spectrum. Therefore, the eigenvalues of largest or smallest magnitude can be determined by applying both methods directly to the problem Cx ≡ B⁻¹Ax = λx. The user has to provide the operator products ω = Ax, ω = Bx, and ω = B⁻¹x. Here, x is an input vector and ω is the result of applying the linear operator A, B, or B⁻¹ to x. Usually, matrix B is not directly inverted. Instead, a factorization of B is computed, and the linear system Bω = x is solved using the factored form of B. In the case of the standard problem, B = I, only the product ω = Ax has to be provided.
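The sketch below outlines the shape of such a forward-communication user function, dispatching on the requested operator product. The enum values, prototype, and helper names are illustrative placeholders only; the actual prototype and task codes are defined in the documentation of imsl_d_arpack_symmetric and imsl_d_arpack_general:

/* User-supplied building blocks (names hypothetical): */
void my_apply_a(int n, const double x[], double w[]);  /* w = A*x        */
void my_apply_b(int n, const double x[], double w[]);  /* w = B*x        */
void my_solve_b(int n, const double x[], double w[]);  /* solve B*w = x  */

typedef enum { TASK_A_X, TASK_B_X, TASK_INV_B_X } op_task;

void generalized_op(int n, const double x[], op_task task, double w[])
{
    switch (task) {
    case TASK_A_X:       /* w = A*x, e.g. a sparse or structured product */
        my_apply_a(n, x, w);
        break;
    case TASK_B_X:       /* w = B*x */
        my_apply_b(n, x, w);
        break;
    case TASK_INV_B_X:   /* w = B^{-1}*x: solve B*w = x using a
                            factorization of B computed once up front */
        my_solve_b(n, x, w);
        break;
    }
}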
In the special case that B is positive definite and well-conditioned, one may compute the Cholesky decomposition B = RᵀR and then solve the standard eigenvalue problem Cy ≡ R⁻ᵀAR⁻¹y = λy. The product operation required by the Lanczos or Arnoldi algorithm, ω = Cx, is performed in steps: solve Rz = x for z, compute y = Az, and solve Rᵀω = y for ω. The eigenvectors y of C are transformed to those of the generalized problem, x, by solving Rx = y for x.
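The three-step product can be coded directly with two triangular solves, as in the following sketch (R stored row-major; apply_a is a hypothetical user-supplied product y = Az):

void apply_a(int n, const double z[], double y[]);  /* y = A*z (user) */

/* w = C*x = R^{-T} A R^{-1} x for an upper-triangular Cholesky
   factor R; z and y are caller-provided work arrays of length n. */
void cholesky_op(int n, const double r[], const double x[],
                 double z[], double y[], double w[])
{
    int i, k;

    /* Step 1: solve R*z = x by back substitution (R upper triangular). */
    for (i = n - 1; i >= 0; i--) {
        double s = x[i];
        for (k = i + 1; k < n; k++)
            s -= r[i*n + k] * z[k];
        z[i] = s / r[i*n + i];
    }

    /* Step 2: y = A*z. */
    apply_a(n, z, y);

    /* Step 3: solve R^T*w = y by forward substitution. */
    for (i = 0; i < n; i++) {
        double s = y[i];
        for (k = 0; k < i; k++)
            s -= r[k*n + i] * w[k];
        w[i] = s / r[i*n + i];
    }
}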
For a generalized problem, if eigenvalues from a cluster or from the interior of the spectrum are sought, a shift-and-invert spectral transformation can often be applied efficiently. Here, for a given shift value σ, the problem is equivalently transformed into the standard eigenvalue problem Cx ≡ (A − σB)⁻¹Bx = νx. The matrix pencil A − σB is assumed to be nonsingular. The purpose of the user-defined function is to provide results for the operator products ω = Bx and ω = (A − σB)⁻¹x. Usually, the inverse matrix product will be computed by solving linear systems, where the matrix pencil is the coefficient matrix. The back-transformed eigenvalues λ of the original problem satisfy λⱼ = σ + νⱼ⁻¹. This relation shows that if the eigenvalues of largest magnitude of matrix C are computed, then the eigenvalues found for the original problem are those closest to the shift σ in absolute value.
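A minimal sketch of the back-transformation λⱼ = σ + νⱼ⁻¹ (the function name is ours; the νⱼ are assumed nonzero):

/* Back-transform the eigenvalues nu of C = (A - sigma*B)^{-1} B to
   the eigenvalues lambda of the original pencil.  Because the nu_j of
   largest magnitude map to the lambda_j closest to sigma, computing
   dominant eigenvalues of C finds the eigenvalues of interest. */
void shift_invert_back_transform(int n, double sigma,
                                 const double nu[], double lambda[])
{
    int j;
    for (j = 0; j < n; j++)
        lambda[j] = sigma + 1.0 / nu[j];
}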