com.imsl.math.NumericalDerivatives

All Implemented Interfaces:: Serializable, Cloneable

Direct Known Subclasses:: NumericalDerivativesEx4, NumericalDerivativesEx5

public class NumericalDerivatives extends Object implements Serializable, Cloneable

Compute the Jacobian matrix for a function $f(y)$ with m components in n independent variables.

NumericalDerivatives uses divided finite differences to compute the Jacobian. This class is designed for use in numerical methods for solving nonlinear problems where a Jacobian is evaluated repeatedly at neighboring arguments. For example this occurs in a Gauss-Newton method for solving non-linear least squares problems or a non-linear optimization method.

NumericalDerivatives is suited for applications where the Jacobian is a dense matrix. All cases $m \lt n$, $m = n$, or $m \gt n$ are allowed. Both one-sided and central divided differences can be used.

The design allows for computation of derivatives in a variety of contexts. Note that a gradient should be considered as the special case with $m = 1$, $n \ge 1$. A derivative of a single function of one variable is the case $m = 1$, $n = 1$. Any non-linear solving routine that optionally requests a Jacobian or gradient can use NumericalDerivatives. This should be considered if there are special properties or scaling issues associated with $f(y)$. Use the method setDifferencingMethods to specify different differencing options for numerical differentiation. These can be combined with some analytic subexpressions or other known relationships.

The divided differences are computed using values of the independent variables at the initial point $y_e = y$, and differenced points $y_e = y + del \times e_j$. Here the $e_j, j = 1, ..., n$, are the unit coordinate vectors. The value for each difference del depends on the variable j, the differencing method, and the scaling for that variable. This difference is computed internally. See setPercentageFactor for computational details. The evaluation of $f(y_e)$ is normally done by the user-provided method NumericalDerivatives.Function.f, using the values $y_e$. The index j and values $y_e$ are arguments to NumericalDerivatives.Function.f.

The computational kernel of evaluateJ performs the following steps:

evaluate the equations at the point y using NumericalDerivatives.Function.f.
compute the Jacobian.
compute the difference at $y_e$.

By default, evaluateJ uses NumericalDerivatives.Function.f in step 3. The user may choose to override the evaluateF method to extend the capability of the class beyond the default.

There are six examples provided which illustrate various ways to use NumericalDerivatives. A discussion of the expected errors for these difference methods is found in A First Course in Numerical Analysis, Anthony Ralston, McGraw-Hill, NY, (1965).

See Also:

Nested Class Summary

Nested Classes

Modifier and Type

Class

Description

static interface

NumericalDerivatives.Function

Public interface function.

static interface

NumericalDerivatives.Jacobian

Public interface for the user-supplied function to compute the Jacobian.
Field Summary

Fields

Modifier and Type

Field

Description

static final int

ACCUMULATE

Indicates the accumulation of the result from whatever type of differences have been specified previously into initial values of the Jacobian.

static final int

CENTRAL

Indicates central differences.

static final int

ONE_SIDED

Indicates one sided differences.

static final int

SKIP

Indicates a variable to be skipped.
Constructor Summary

Constructors

Constructor

Description

NumericalDerivatives(NumericalDerivatives.Function fcn)

Constructor for NumericalDerivatives.
Method Summary

Modifier and Type

Method

Description

protected double[]

evaluateF(int varIndex, double[] y)

This method is provided by the user to compute the function values at the current independent variable values y.

double[][]

evaluateJ(double[] y)

Evaluates the Jacobian for a system of (m) equations in (n) variables.

double[]

getPercentageFactor()

Returns the percentage factor for differencing.

double[]

getScalingFactors()

Returns the scaling factors for the y values.

int[]

getStatus()

Returns status information.

void

setDifferencingMethods(int[] options)

Sets the methods used to compute the derivatives

void

setInitialF(double[] valueF)

Set the initial function values.

void

setPercentageFactor(double[] factor)

Sets the percentage factor for differencing

void

setScalingFactors(double[] scale)

Sets the scaling factors for the y values.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Field Details
- ONE_SIDED
  
  public static final int ONE_SIDED
  
  Indicates one sided differences.
  See Also:
  
  Constant Field Values
- CENTRAL
  
  public static final int CENTRAL
  
  Indicates central differences.
  See Also:
  
  Constant Field Values
- ACCUMULATE
  
  public static final int ACCUMULATE
  
  Indicates the accumulation of the result from whatever type of differences have been specified previously into initial values of the Jacobian.
  See Also:
  
  Constant Field Values
- SKIP
  
  public static final int SKIP
  
  Indicates a variable to be skipped.
  See Also:
  
  Constant Field Values
Constructor Details
- NumericalDerivatives
  
  public NumericalDerivatives(NumericalDerivatives.Function fcn)
  
  Constructor for NumericalDerivatives.
  
  Parameters:
  
  fcn - a Function object which is a user-supplied function to evaluate the equations at the point y.

Method Details

evaluateJ

public double[][] evaluateJ(double[] y)

Evaluates the Jacobian for a system of (m) equations in (n) variables.

Parameters:

y - a double array of length n, the point at which the Jacobian is to be evaluated.

Returns:

a double matrix containing the Jacobian. Columns that are accumulated must have the additive term defined on entry or else be set to zero. Columns that are skipped can be defined either before or after the evaluateJ method is invoked.
evaluateF

protected double[] evaluateF(int varIndex, double[] y)

This method is provided by the user to compute the function values at the current independent variable values y. If the user does not override the evaluateF method, then NumericalDerivatives.Function.f is used to compute the function values.

Parameters:

varIndex - an int which indicates the index of the variable to perturb.

y - a double array of length n, the point at which the function is to be evaluated.

Returns:

a double array of length m. The equations evaluated at the point y.
setScalingFactors

public void setScalingFactors(double[] scale)

Sets the scaling factors for the y values. The user can also use scale to provide appropriate signs for the increments.

Parameters:

scale - a double array of length n containing the scaling factors. Default: all values are 1.0.
getScalingFactors

public double[] getScalingFactors()

Returns the scaling factors for the y values.

Returns:

a double array containing the scaling factors.

getStatus

public int[] getStatus()

Returns status information. This information might prove useful to the user wanting to gain better control over the differencing parameters. This information can often be ignored.

Returns:

an int array containing the ten diagnostic values described in the following table. These values can be used to monitor the progress or expense of the Jacobian computation.

*index*	Description
0	the number of times a function evaluation was computed.
1	the number of columns in which three attempts were made to increase a percentage factor for differencing (i.e. a component in the `factor` array) but the computed del remained unacceptably small relative to `y[j-1]` or `scale[j-1]`. In such cases the percentage factor is set to 1.4901161193847656e-8, which is the square root of machine precision
2	the number of columns in which the computed del was zero to machine precision because `y[j-1]` or `scale[j-1]` was zero. In such cases del is set to 1.4901161193847656e-8, which is the square root of machine precision
3	the number of Jacobian columns which had to be recomputed because the largest difference formed in the column was close to zero relative to scale, where $$scale = \max \left( {\left\| {f_i \left( y \right)} \right\|,\left\| {f_i (y + del \times e_{j} )} \right\|} \right)$$ and i denotes the row index of the largest difference in the column currently being processed. index = 9 gives the last column where this occurred.
4	the number of columns whose largest difference is close to zero relative to scale after the column has been recomputed.
5	the number of times scale information was not available for use in the roundoff and truncation error tests. This occurs when $$\min \left( {\left\| {f_i \left( y \right)} \right\|,\left\| {f_i (y + del \times e_{j} )} \right\|} \right) = 0$$ where i is the index of the largest difference for the column currently being processed.
6	the number of times the increment for differencing (del) was computed and had to be increased because (`scale[j-1]`+del) - `scale[j-1]`) was too small relative to `y[j-1]` or `scale[j-1]`.
7	the number of times a component of the `factor` array was reduced because changes in function values were large and excess truncation error was suspected. index = 8 gives the last column in which this occurred.
8	the index of the last column where the corresponding component of the `factor` array had to be reduced because excessive truncation error was suspected.
9	the index of the last column where the difference was small and the column had to be recomputed with an adjusted increment (see index = 3). The largest derivative in this column may be inaccurate due to excessive roundoff error.

setPercentageFactor

public void setPercentageFactor(double[] factor)

Sets the percentage factor for differencing
For each divided difference for variable j the increment used is del. The value of del is computed as follows: First define $\sigma = sign(\mbox{scale}[j - 1])$. If the user has set the elements of array scale to non-default values, then define $y_a = \left| {\mbox{scale}[j - 1]} \right|$. Otherwise $y_a = \left| {y[j - 1]} \right|$ and $\sigma = 1$. Finally compute $del=\sigma y{}_a\;\mbox{factor}[j-1]$. By changing the sign of scale[j-1], the difference del can have any desired orientation, such as staying within bounds on variable j. For central differences, a reduced factor is used for del that normally results in relative errors as small as machine precision to the 2/3 power.

Parameters:

factor - a double array of length n containing the percentage factor for differencing. Except for initialization, the factor array should not be altered in the evaluateF method. The elements of factor must be such that $$\mbox{1.8189894035458565e-12}\;\le\mbox{factor}[j-1]\le 0.1$$ where 1.8189894035458565e-12 is machine precision to the three-fourths power.
Default: all elements of factor are set to 1.4901161193847656e-8, which is the square root of machine precision.
getPercentageFactor

public double[] getPercentageFactor()

Returns the percentage factor for differencing.

Returns:

a double array containing the percentage factor for differencing. See setPercentageFactor for more detail.

setDifferencingMethods

public void setDifferencingMethods(int[] options)

Sets the methods used to compute the derivatives

Parameters:

options - an int array of length n, containing the methods used to compute the derivatives. options[i] is the method to be used for the i-th variable. options[i] can be one of the values in the table which follows. The default is to use ONE_SIDED differences for each variable.

Entry	Description
`ONE_SIDED`	Indicates one sided differences.
`CENTRAL`	Indicates central differences.
`ACCUMULATE`	Indicates the accumulation of the result from whatever type of differences have been specified previously into initial values of the Jacobian.
`SKIP`	Indicates a variable to be skipped.

setInitialF

public void setInitialF(double[] valueF)

Set the initial function values. Use the values $f(y_0)$, where $y_0$ is the initial value of the independent variables located in array y.

Parameters:

valueF - a double array of length m containing the initial function values, $y_0$. Default: all values are 0.0.

Class NumericalDerivatives

Nested Class Summary

Field Summary

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Field Details

ONE_SIDED

CENTRAL

ACCUMULATE

SKIP

Constructor Details

NumericalDerivatives

Method Details

evaluateJ

evaluateF

setScalingFactors

getScalingFactors

getStatus

setPercentageFactor

getPercentageFactor

setDifferencingMethods

setInitialF