CTWLS

FNLStat : Categorical and Discrete Data Analysis : CTWLS

CTWLS

Required Arguments

more...

Performs a generalized linear least‑squares analysis of transformed probabilities in a two‑dimensional contingency table.

Required Arguments

TABLE — NRESP by NPOP matrix containing the frequency count in each cell of each population. (Input)
The i‑th column of TABLE contains the counts for the i‑th population.

ITRAN — Vector of length NTRAN containing the transformation code for each of the NTRAN transformations to be applied. (Input)
ITRAN is not referenced and can be a vector of length 1 in the calling program if NTRAN = 0. Let a “response” denote a transformed cell probability. Then, ITRAN(1) contains the first transformation to be applied to the cell probabilities, ITRAN(2) contains the second transformation, which is to be applied to the responses obtained after ITRAN(1) is performed, etc. Note that the k‑th transformation takes the ISIZE(k ‑ 1) responses at step k into ISIZE(k) responses, where ISIZE(0) is taken to be NPOP * NRESP. Let y denote the vector result of a transformation, x denote the responses before the transformation is applied, A denote a matrix of constants, and v denote a vector of constants. Then, the possible transformations are

ITRAN	Transformation
1	Linear, defined over all populations (y = Ax)
2	Logarithmic (y(i, j) = ln(x(i, j))
3	Exponential (y(i, j) = exp(x(i, j)))
4	Additive (y(i, j) = y(i, j) + v(i, j))
5	Linear, defined for one population and, identically, applied over all populations (y(i) = Ax(i))

where y(i) and x(i) are the subvectors for the i‑th population, y(i, j) and x(i, j) denote the j‑th response in the i‑th population, and v(i, j) denotes the corresponding element of the vector “v”. Transformation type 5 is the same as transformation type 1 when the same linear transformation is applied in each population (i.e., the type 1 matrix is block diagonal with identical blocks). Because the size of the type 5 transformation matrix A is NPOP2 times smaller than the type 1 transformation matrix, the type 5 transformation is usually preferred where it can be used.

ISIZE — Vector of length NTRAN containing the number of response functions defined by the k‑th transformation. (Input)
Transformation types 2, 3, and 4 have the same number of output responses as are input, and elements of ISIZE corresponding to transformations of these types should reflect this fact. Transformation types 1 and 5 can either increase or, more commonly, decrease the number of responses. For transformation type 5, if m linear transformations are defined for each population, the corresponding element of ISIZE should be m * NPOP.

AMATS — Vector containing the transformation constants. (Input)
AMATS contains the transformation matrices and vectors needed in the type 1, 4 and 5 transformations. While AMATS is a vector, its elements may be treated as a number of matrices or vectors where the number of structures depends upon the transformation types as follows:

ITRAN	Type	Dimension	Length
1	Matrix	m by n	M * n
2, 3	Not referenced		0
4	Vector	M	M
5	Matrix	m/NPOP by n/NPOP	M * n/(NPOP * NPOP)

Here, m = ISIZE(i) and n = ISIZE(i ‑ 1), and ISIZE(0) is not input (in ISIZE) but is taken to be NPOP * NRESP. Matrices and vectors are stored consecutively in AMATS with column elements for matrices stored consecutively as is standard in FORTRAN. Thus, if ITRAN(1) = 5 and ITRAN(2) = 4, with NREP = 3, NPOP= 2, and ISIZE(1) = ISIZE(2) = 2, then the vector AMATS would contain in consecutive positions A(1, 1), A(2, 1), A(1, 2), A(2, 2), A(1, 3), A(2, 3), v(1), v(2), v(3), v(4) where A is the matrix for transformation type 5 and v is the vector for transformation type 4.

X — Design matrix of size ISIZE(NTRAN) by NCOEF. (Input, if NCOEF > 0)
X contains the design matrix for predicting the transformed cell probabilities F from the covariates stored in X. If NCOEF = 0, X is not referenced and can be a 1 by 1 matrix in the calling program.

NH — Vector of length NUMH. (Input, if NCOEF > 0)
NH(i) contains the number of consecutive rows in H used to specify hypothesis i. If NCOEF = 0, NH is not referenced and can be a vector of length 1 in the calling program.

H — Matrix of size m by NCOEF containing the constants to be used in the multivariate hypothesis tests. (Input, if NCOEF > 0)
Here, m is the sum of the elements in NH. Each hypothesis is of the form
H0 : C * COEF = 0, where C for the i‑th hypothesis is NH(i) rows of H, and COEF is estimated in the linear model. The first NH(1) rows of H make up the first hypothesis, the next NH(2) rows make up the second hypothesis, etc. If NCOEF = 0, H is not referenced and can be a 1 by 1 matrix in the calling program.

CHSQ — NUMH + 1 by 3 matrix containing the results of the hypothesis tests. (Output, if NCOEF > 0)
The first row of CHSQ contains the results for test 1, the next row contains the results for test 2, etc. The last row of CHSQ contains a test of the adequacy of the model. Within each row, the first column contains the chi‑squared statistic, the second column contains its degrees of freedom, and the last column contains the probability of a larger chi‑squared. If NCOEF = 0, CHSQ is not referenced and can be a 1 by 1 matrix in the calling program.

COEF — NCOEF by 4 matrix containing the coefficient estimates and related statistics. (Output, if NCOEF > 0)
The columns of coefficient are as follows:

Col.	Statistic
1	Coefficient estimate
2	Estimated standard error of the coefficient
3	z‑statistic for a test that the coefficient equals 0 versus the Two‑sided alternative
4	p‑value corresponding to the z‑statistic

If NCOEF = 0, COEF is not referenced and can be a 1 by 1 matrix in the calling program.

COVCF — NCOEF by NCOEF matrix containing the estimated variances and covariances of COEF. (Output, if NCOEF > 0)
If NCOEF = 0, COVCF is not referenced and can be a 1 by 1 matrix in the calling program.

F — Vector of length ISIZE(NTRAN) containing the transformed probabilities, the responses. (Output)

COVF — Matrix of size ISIZE(NTRAN) by ISIZE(NTRAN) containing the estimated variances and covariances of F. (Output)

RESID — ISIZE(NTRAN) by 4 matrix containing a case analysis for the transformed probabilities as estimated by the linear model. (Output, if NCOEF > 0)
The linear model gives F = X * BETA. The columns of RESID are as follows:

Col.	Description
1	Residual
2	Standard error
3	Leverage
4	Standardized residual

If NCOEF = 0, RESID is not referenced and can be a 1 by 1 matrix in the calling program.

Optional Arguments

NRESP — Number of cells in each population. (Input)
Default: NRESP = size (TABLE,1).

NPOP — Number of populations. (Input)
Default: NPOP = size (TABLE,2).

LDTABL — Leading dimension of TABLE exactly as specified in the dimension statement in the calling program. (Input)
Default: LDTABL = size (TABLE,1).

NTRAN — Number of transformations to be applied to the cell probabilities. (Input)
Cell probabilities are computed as the frequency count for the cell divided by the population sample size. Set NTRAN = 0 if a linear model predicting the cell probabilities is to be used.
Default: NTRAN = size (ITRAN,1).

NCOEF — Number of coefficients in the linear model relating the transformed probabilities F to the design matrix X. (Input)
Let F denote the vector result of applying the NTRAN transformations, and assume that the model gives F = X * COEF. Then, NCOEF is the length of COEF.
Default: NCOEF = size (X,2).

LDX — Leading dimension of X exactly as specified in the dimension statement in the calling program. (Input)
Default: LDX = size (X,1).

NUMH — Number of multivariate hypotheses to be tested on the coefficients in COEF. (Input, if NCOEF > 0)
If NCOEF = 0, NUMH is not referenced.
Default: NUMH = size (NH,1).

LDH — Leading dimension of H exactly as specified in the dimension statement in the calling program. (Input)
Default: LDH = size(H,1).

IPRINT — Printing option. (Input)
Default: IPRINT = 0.

IPRINT	Action
0	No printing is performed.
1	Print all output arrays and vectors.
2	Print all output arrays and vectors as well as the matrices and vectors in AMATS.

LDCHSQ — Leading dimension of CHSQ exactly as specified in the dimension statement in the calling program. (Input)
Default: LDCHSQ = size (CHSQ,1).

LDCOEF — Leading dimension of COEF exactly as specified in the dimension statement in the calling program. (Input)
Default: LDCOEF = size (COEF,1).

LDCOVC — Leading dimension of COVCF exactly as specified in the dimension statement in the calling program. (Input)
Default: LDCOVC = size (COVCF,1)

LDCOVF — Leading dimension of COVF exactly as specified in the dimension statement in the calling program. (Input)
Default: LDCOVF = size (COVF,1).

LDRESI — Leading dimension of RESID exactly as specified in the dimension statement in the calling program. (Input)
Default: LDRESI = size (RESID,1).

FORTRAN 90 Interface

Generic: CALL CTWLS (TABLE, ITRAN, ISIZE, AMATS, X, NH, H, CHSQ, COEF, COVCF, F, COVF, RESID [, …])

Specific: The specific interface names are S_CTWLS and D_CTWLS.

FORTRAN 77 Interface

Single: CALL CTWLS (NRESP, NPOP, TABLE, LDTABL, NTRAN, ITRAN, ISIZE, AMATS, NCOEF, X, LDX, NUMH, NH, H, LDH, IPRINT, CHSQ, LDCHSQ, COEF, LDCOEF, COVCF, LDCOVC, F, COVF, LDCOVF, RESID, LDRESI)

Double: The double precision name is DCTWLS.

Description

Routine CTWLS performs weighted least‑squares analysis of a general p = NPOP population by r = NRESP response categories per population contingency table. After division by the sample size, there are n = pr cell probabilities.

Define s = ISIZE(NTRAN) responses fi such that each response is obtained from the cell probabilities as fi = gi(p1, p2, …, pn), for i = 1, …, s. Call the functions gi the response functions. Then, if

is the asymptotic covariance matrix of the responses, and X is a design matrix for a linear model predicting f = X β with q = NCOEF coefficients β = COEF, then CTWLS performs a weighted least‑squares analysis of the model f = X β where the generalized weights are given by

Estimates obtained in this way are best asymptotic normal estimates of β.

Let

denote the estimated variance‑covariance matrix of the estimated cell probabilities, and let (∂gi/∂pj) denote the matrix of partial derivatives of gi with respect to pj. Then,

is given by

where the (i, j)-th element in

is computed as

pi(δij − pj)

Here, δij = 1 if i = j and is zero otherwise.

In CTWLS, the transformations gi are defined by successive application of one of five types of simpler transformations. Let pi = h0,j for j = 1, …, n denote the n cell probabilities, and let hi,j denote the ISIZE(i) responses obtained after i simple transformations have been performed with hi denoting the corresponding vector of estimates. Then, the simple transformations are defined by:

1. Linear: hi+1 = Aihi where Ai is a matrix of coefficients specified via the vector AMATS in CTWLS.

2. Logarithmic: hi+1,j = ln(hi,j) where j = 1, …, ISIZE(i). That is, take the logarithm of each of the responses.

3. Exponential: hi+1,j = exp(hi,j) where j = 1, …, ISIZE(i). That is, take the exponential of each of the responses.

4. Additive: hi+1,j = hi,j + vj , where j = 1, …, ISIZE(i), and vj is specified via the vector AMATS in CTWLS. Additive transformations are generally used to adjust for zero cells or to apply a continuity correction to the cell probabilities.

5. Linear (by population):

is the vector of responses at stage i in the j‑th population, and Ai is a matrix of coefficients specified via AMATS.

Given the responses fi and their covariances

estimates for β are computed via generalized least squares as

Let Σ β denote the asymptotic covariance matrix of β. Then, Σ β is estimated by

Hypothesis tests of the form Ho : Ci β = 0 are performed when requested. Here, Ci is a matrix of coefficients specified via a submatrix of the matrix H. Results are returned in the vector CHSQ. The asymptotic chi‑squared test for testing the null hypothesis is given by

This test has qi = rank(Ci) degrees of freedom. If zero degrees of freedom are returned, the hypothesis cannot be tested in the original parameterization.

A test of the model checks that the residuals obtained from the model f = X β are not too large. This test, which has s ‑ q degrees of freedom, is an asymptotic chi‑squared test and is computed as

Residuals from the generalized linear model are easily computed as

where xi is the row of the design matrix X corresponding to the i‑th observation. This residual has the asymptotic variance

where (A)ii denotes the i‑th diagonal element of matrix A. A standardized residual is then computed as

which has an asymptotic standard normal distribution if the model is correct.

The leverage of observation i, vi, is computed as

It is a measure of the importance of the observation in the predicted values. Values greater than 2q/s are large.

Because the tests performed by CTWLS are asymptotic ones, the user should treat the results with caution. The reported asymptotic p‑values are most likely to be exact when the number of counts in each cell is large (say 5 or more), and less exact for smaller cell counts. Care should also be taken to avoid illegal operations. For example, the routine returns an error message when the log of a negative or zero value is attempted. When this occurs, the user should either use a continuity correction (i.e. modify the transformations used by adding a constant to all cells or to the cell resulting in the illegal operation) or abandon the model.

Comments

1. Workspace may be explicitly provided, if desired, by use of C2WLS/DC2WLS. The reference is:

CALL C2WLS (NRESP, NPOP, TABLE, LDTABL, NTRAN, ITRAN, ISIZE, AMATS, NCOEF, X, LDX, NUMH, NH, H, LDH, IPRINT, CHSQ, LDCHSQ, COEF, LDCOEF, COVCF, LDCOVC, F, COVF, LDCOVF, RESID, LDRESI, PDER, FRQ, EST, XX, WK, IWK, WWK)

The additional arguments are as follows:

PDER — Work vector of length ISIZE(NTRAN) * max(NPOP * NRESP, ISIZE(i)) if NTRAN is greater than zero. PDER is not used and can be dimensioned of length 1 if NTRAN = 0.

FRQ — Work vector of length NPOP.

EST — Work vector of length NPOP * NRESP + ISIZE(1) + … + ISIZE(NTRAN).

XX — Work vector of length (NCOEF + 1) * ISIZE(NTRAN) if NCOEF is greater than zero. If NCOEF = 0, XX is not referenced and can be a vector of length 1 in the calling program.

WK — Work vector of length 3(max(NPOP * NRESP, ISIZE(i))) + NCOEF + 1.

IWK — Work vector of length max(NH(i)) if NUMH is greater than 0. If NCOEF = 0, IWK is not referenced and can be a vector of length 1 in the calling program.

WWK — Work vector of length max(NH(i)) * (4 + NCOEF + max(NCOEF, max(NH(i))) if NUMH is greater than 0. If NUMH = 0, WWK is not referenced and can be a vector of length 1 in the calling program.

2. Informational error

Type	Code	Description
4	1	A negative response occurred while performing a logarithmic transformation. The logarithm of a negative number is not allowed.

Examples

Example 1

This example is taken from Landis, Stanish, Freeman, and Koch (1976), pages 213‑217. Generalized kappa statistics are computed via vector functions of the form:

F(p) = exp(A4 ln(A3 exp(A2 ln(A1p))))

where p is the cell probabilities. The raw frequencies are given as two 4 × 4 contingency tables. These tables are reorganized as a single 16 × 2 table for input into CTWLS. The input tables are

Two generalized kappa statistics using two different sets of weights are computed for each population. Hypothesis tests are then performed on the four resulting generalized kappa statistics. In this example, the matrix of covariates is an identity matrix so that tests on the responses are performed.

USE CTWLS_INT

IMPLICIT NONE

INTEGER IPRINT, LDCHSQ, LDCOEF, LDCOVC, LDCOVF, LDH, LDRESI, &

LDTABL, LDX, NCOEF, NPOP, NTRAN

PARAMETER (IPRINT=2, LDCHSQ=10, LDCOEF=4, LDCOVC=4, LDCOVF=4, &

LDH=10, LDRESI=4, LDTABL=16, LDX=4, NCOEF=4, NPOP=2, &

NTRAN=8)

INTEGER ISIZE(NTRAN), ITRAN(NTRAN), NH(9)

REAL A1(10,16), A2(18,10), A3(4,18), A4(2,4), AMATS(420), &

CHSQ(LDCHSQ,3), COEF(LDCOEF,4), COVCF(LDCOVC,NCOEF), &

COVF(LDCOVF,LDCOVF), F(LDX), H(LDH,4), &

RESID(LDRESI,4), TABLE(LDTABL,NPOP), X(LDX,NCOEF)

EQUIVALENCE (A1, AMATS(1)), (A2, AMATS(161)), (A3, AMATS(341)), &

(A4, AMATS(413))

DATA TABLE/38, 5, 0, 1, 33, 11, 3, 0, 10, 14, 5, 6, 3, 7, 3, 10, &

5, 3, 0, 0, 3, 11, 4, 0, 2, 13, 3, 4, 1, 2, 4, 14/

DATA X/1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1/

DATA NH/1, 1, 1, 1, 1, 1, 2, 1, 1/

DATA H/1, 0, 1, 0, 0, 0, 1, 0, 1, 0, 0, 1, -1, 0, 0, 0, 0, 1, 0, &

1, 0, 0, 0, 1, 0, 1, -1, 0, -1, 0, 0, 0, 0, 0, 1, -1, 0, &

-1, 0, -1/

DATA ITRAN/5, 2, 5, 3, 5, 2, 5, 3/

DATA ISIZE/20, 20, 36, 36, 8, 8, 4, 4/

DATA A1/1, 0, 0, 0, 1, 0, 0, 0, 1, 1, 1, 0, 0, 0, 0, 1, 0, 0, 0, &

.5, 1, 0, 0, 0, 0, 0, 1, 0, 0, .25, 1, 0, 0, 0, 0, 0, 0, 1,&

0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 0, .5, 0, 1, 0, 0, 0, 1, 0, &

0, 1, 1, 0, 1, 0, 0, 0, 0, 1, 0, 0, .5, 0, 1, 0, 0, 0, 0, &

0, 1, 0, .25, 0, 0, 1, 0, 1, 0, 0, 0, 0, .25, 0, 0, 1, 0, &

0, 1, 0, 0, 0, .5, 0, 0, 1, 0, 0, 0, 1, 0, 1, 1, 0, 0, 1, &

0, 0, 0, 0, 1, 0, .5, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0, 0, &

0, 1, 0, 1, 0, 0, 0, .25, 0, 0, 0, 1, 0, 0, 1, 0, 0, .5, 0, &

0, 0, 1, 0, 0, 0, 1, 1, 1/

DATA A2/1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,&

0, 0, 0, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, &

0, 0, 0, 0, 0, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, &

0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 0, 0, 1, 0, 0, 0, 1, 0, 0, &

0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0, &

1, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0, 1, &

0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0, 1, 0, &

0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, &

0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, &

DATA A3/-1, -1, 0, 0, 0, -.5, 1, .5, 0, -.25, 1, .75, 0, 0, 1, &

1, 0, -.5, 1, .5, -1, -1, 0, 0, 0, -.5, 1, .5, 0, -.25, 1, &

.75, 0, -.25, 1, .75, 0, -.5, 1, .5, -1, -1, 0, 0, 0, -.5, &

1, .5, 0, 0, 1, 1, 0, -.25, 1, .75, 0, -.5, 1, .5, -1, -1, &

0, 0, 1, 0, 0, 0, 0, 1, 0, 0/

DATA A4/1, 0, 0, 1, -1, 0, 0, -1/

CALL CTWLS (TABLE, ITRAN, ISIZE, AMATS, X, NH, H, &

CHSQ, COEF, COVCF, F, COVF, RESID, IPRINT=IPRINT)

END

Output

Hypothesis Tests on Coefficients

H-1

1 0 0 0

H-2

0 1 0 0

H-3

1 -1 0 0

H-4

0 0 1 0

H-5

0 0 0 1

H-6

0 0 1 -1

H-7

1 0 -1 0

0 1 0 -1

H-8

1 0 -1 0

H-9

0 1 0 -1

Hypothesis Chi-Squared Statistics

Degrees of

Hypothesis Chi-Squared freedom p-value

1 16.99 1 0.0000

2 39.70 1 0.0000

3 39.54 1 0.0000

4 14.27 1 0.0002

5 30.07 1 0.0000

6 28.76 1 0.0000

7 1.07 2 0.5850

8 0.90 1 0.3425

9 1.06 1 0.3040

Degrees of

Chi-Squared freedom p-value

Model Test 0.00 0 NaN

Coefficient Statistics

Coefficient Standard Error Statistic p-value

1 0.2079 0.05 4.12 0.0000

2 0.3150 0.05 6.30 0.0000

3 0.2965 0.08 3.78 0.0002

4 0.4069 0.07 5.48 0.0000

Asymptotic Coefficient Covariance

1 2 3 4

1 2.5457E-03 2.3774E-03 0. 0.

2 2.4988E-03 0. 0.

3 6.1629E-03 5.6229E-03

4 5.5069E-03

Residual Analysis

Standard Standardized

Residual Error Leverage Residual

1 0.0000 0.0000 1.0000 NaN

2 0.0000 0.0000 1.0000 NaN

3 0.0000 0.0000 1.0000 NaN

4 0.0000 0.0000 1.0000 NaN

Transformed Probabilities

1 0.2079

2 0.3150

3 0.2965

4 0.4069

Asymptotic Covariance of the Transformed Probabilities

1 2 3 4

1 2.5457E-03 2.3774E-03 0. 0.

2 2.4988E-03 0. 0.

3 6.1629E-03 5.6229E-03

4 5.5069E-03

Linear transformation matrix, by population, for transformation 5

1 2 3 4 5 6 7 8 9

1 1.000 1.000 1.000 1.000 0.000 0.000 0.000 0.000 0.000

2 0.000 0.000 0.000 0.000 1.000 1.000 1.000 1.000 0.000

3 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 1.000

4 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000

5 1.000 0.000 0.000 0.000 1.000 0.000 0.000 0.000 1.000

6 0.000 1.000 0.000 0.000 0.000 1.000 0.000 0.000 0.000

7 0.000 0.000 1.000 0.000 0.000 0.000 1.000 0.000 0.000

8 0.000 0.000 0.000 1.000 0.000 0.000 0.000 1.000 0.000

9 1.000 0.000 0.000 0.000 0.000 1.000 0.000 0.000 0.000

10 1.000 0.500 0.250 0.000 0.500 1.000 0.500 0.250 0.250

10 11 12 13 14 15 16

1 0.000 0.000 0.000 0.000 0.000 0.000 0.000

2 0.000 0.000 0.000 0.000 0.000 0.000 0.000

3 1.000 1.000 1.000 0.000 0.000 0.000 0.000

4 0.000 0.000 0.000 1.000 1.000 1.000 1.000

5 0.000 0.000 0.000 1.000 0.000 0.000 0.000

6 1.000 0.000 0.000 0.000 1.000 0.000 0.000

7 0.000 1.000 0.000 0.000 0.000 1.000 0.000

8 0.000 0.000 1.000 0.000 0.000 0.000 1.000

9 0.000 1.000 0.000 0.000 0.000 0.000 1.000

10 0.500 1.000 0.500 0.000 0.250 0.500 1.000

Linear transformation matrix, by population, for transformation 5

1 2 3 4 5 6 7 8 9

1 1.000 0.000 0.000 0.000 1.000 0.000 0.000 0.000 0.000

2 1.000 0.000 0.000 0.000 0.000 1.000 0.000 0.000 0.000

3 1.000 0.000 0.000 0.000 0.000 0.000 1.000 0.000 0.000

4 1.000 0.000 0.000 0.000 0.000 0.000 0.000 1.000 0.000

5 0.000 1.000 0.000 0.000 1.000 0.000 0.000 0.000 0.000

6 0.000 1.000 0.000 0.000 0.000 1.000 0.000 0.000 0.000

7 0.000 1.000 0.000 0.000 0.000 0.000 1.000 0.000 0.000

8 0.000 1.000 0.000 0.000 0.000 0.000 0.000 1.000 0.000

9 0.000 0.000 1.000 0.000 1.000 0.000 0.000 0.000 0.000

10 0.000 0.000 1.000 0.000 0.000 1.000 0.000 0.000 0.000

11 0.000 0.000 1.000 0.000 0.000 0.000 1.000 0.000 0.000

12 0.000 0.000 1.000 0.000 0.000 0.000 0.000 1.000 0.000

13 0.000 0.000 0.000 1.000 1.000 0.000 0.000 0.000 0.000

14 0.000 0.000 0.000 1.000 0.000 1.000 0.000 0.000 0.000

15 0.000 0.000 0.000 1.000 0.000 0.000 1.000 0.000 0.000

16 0.000 0.000 0.000 1.000 0.000 0.000 0.000 1.000 0.000

17 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 1.000

18 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000

1 0.000

2 0.000

3 0.000

4 0.000

5 0.000

6 0.000

7 0.000

8 0.000

9 0.000

10 0.000

11 0.000

12 0.000

13 0.000

14 0.000

15 0.000

16 0.000

17 0.000

18 1.000

Linear transformation matrix, by population, for transformation 5

1 2 3 4 5 6 7 8 9

1 -1.000 0.000 0.000 0.000 0.000 -1.000 0.000 0.000 0.000

2 -1.000 -0.500 -0.250 0.000 -0.500 -1.000 -0.500 -0.250 -0.250

3 0.000 1.000 1.000 1.000 1.000 0.000 1.000 1.000 1.000

4 0.000 0.500 0.750 1.000 0.500 0.000 0.500 0.750 0.750

10 11 12 13 14 15 16 17 18

1 0.000 -1.000 0.000 0.000 0.000 0.000 -1.000 1.000 0.000

2 -0.500 -1.000 -0.500 0.000 -0.250 -0.500 -1.000 0.000 1.000

3 1.000 0.000 1.000 1.000 1.000 1.000 0.000 0.000 0.000

4 0.500 0.000 0.500 1.000 0.750 0.500 0.000 0.000 0.000

Linear transformation matrix, by population, for transformation 5

1 2 3 4

1 1.000 0.000 -1.000 0.000

2 0.000 1.000 0.000 -1.000

Example 2

The second example is taken from Prentice (1976) and involves a logistic fit to the mortality of beetles after exposure to various concentrations of carbon disulphide. Because one of the cells on input has a count of zero and it is not possible to take the logarithm of zero, a constant 0.5 is added to each cell prior to calling CTWLS. The model can be expressed as

where i indexes the 8 populations. The data is given as:

X	fi1	fi2
1.690	6	53
1.724	13	47
1.755	18	44
1.784	28	28
1.811	52	11
1.836	53	6
1.861	61	1
1.883	60	0

For comparison, a maximum fit yields

(see routine CTGLM).

USE CTWLS_INT

IMPLICIT NONE

INTEGER IPRINT, LDCHSQ, LDCOEF, LDCOVC, LDCOVF, LDH, LDRESI, &

LDTABL, LDX, NCOEF, NPOP, NRESP, NTRAN, NUMH

PARAMETER (IPRINT=2, LDCOVF=8, LDH=1, LDX=8, NCOEF=2, NPOP=8, &

NRESP=2, NTRAN=2, NUMH=0, LDCHSQ=NUMH+1, &

LDCOEF=NCOEF, LDCOVC=NCOEF, LDRESI=LDX, LDTABL=NRESP)

INTEGER ISIZE(NTRAN), ITRAN(NTRAN), NH(1)

REAL AMATS(2), CHSQ(LDCHSQ,3), COEF(LDCOEF,4), &

COVCF(LDCOVC,NCOEF), COVF(LDCOVF,LDCOVF), F(LDX), &

H(LDH,4), RESID(LDRESI,4), TABLE(LDTABL,NPOP), &

X(LDX,NCOEF)

DATA TABLE/6, 53, 13, 47, 18, 44, 28, 28, 52, 11, 53, 6, 61, 1, &

60, 0/, ITRAN/2, 5/, ISIZE/16, 8/, AMATS/1, -1/

DATA X/8*1, 1.690, 1.724, 1.755, 1.784, 1.811, 1.836, 1.861, &

1.883/

TABLE = TABLE + 0.5

CALL CTWLS (TABLE, ITRAN, ISIZE, AMATS, X, NH, H, &

CHSQ, COEF, COVCF, F, COVF, RESID, NUMH=NUMH, &

IPRINT=IPRINT)

END

Output

Test of the Model

Degrees of

Chi-Squared freedom p-value

8.43 6 0.2081

Coefficient Statistics

Coefficient Standard Error Statistic p-value

1 -55.6590 5.02 -11.10 0.0000

2 31.4177 2.83 11.09 0.0000

Asymptotic Coefficient Covariance

1 2

1 25.16 -14.20

2 8.024

Residual Analysis

Standard Standardized

Residual Error Leverage Residual

1 0.4552 0.3232 0.6052 1.4086

2 0.2368 0.2480 0.6468 0.9548

3 -0.3568 0.2413 0.7608 -1.4787

4 -0.3902 0.2285 0.7440 -1.7076

5 0.2800 0.2761 0.7192 1.0141

6 0.0840 0.3484 0.7036 0.2410

7 0.9042 0.7749 0.8791 1.1670

8 1.2953 1.3777 0.9413 0.9402

Transformed Probabilities

1 -2.108

2 -1.258

3 -0.878

4 0.000

5 1.518

6 2.108

7 3.714

8 4.796

Asymptotic Covariance of the Transformed Probabilities

1 2 3 4 5

1 0.1725 0. 0. 0. 0.

2 9.5127E-02 0. 0. 0.

3 7.6526E-02 0. 0.

4 7.0175E-02 0.

5 0.1060

6 7 8

1 0. 0. 0.

2 0. 0. 0.

3 0. 0. 0.

4 0. 0. 0.

5 0. 0. 0.

6 0.1725 0. 0.

7 0.6829 0.

8 2.017

Linear transformation matrix, by population, for transformation 5

1 2

1.000 -1.000