ANWAY
Analyzes a balanced n-way classification model with fixed effects.
Required Arguments
NF — Number of factors (number of subscripts) in the model including
error. (Input)
NL — Vector of length NF containing the number of levels for each of the factors. (Input)
Y — Vector of length NL(1) * NL(2) * … * NL(NF) containing the responses. (Input)
Y must not contain NaN (not a number) for any of its elements, i.e., missing values are not allowed.
INTERA — Interaction option. (Input)
The absolute value of INTERA is the number of factors to be included in the highest-way interaction in the model. The sign of INTERA indicates if factor NF is error.
INTERA |
Meaning |
< 0 |
Factor NF is not error. Only (‑INTERA + 1)-way and higher-way interactions are included in error. |
> 0 |
Factor NF is error. Its main effect and all its interaction effects are pooled into the error with the other (INTERA + 1)-way and higher-way |
AOV — Vector of length 15 containing statistics relating to the analysis of variance. (Output)
I |
AOV(I) |
1 |
Degrees of freedom for regression |
2 |
Degrees of freedom for error |
3 |
Total degrees of freedom |
4 |
Sum of squares for regression |
5 |
Sum of squares for error |
6 |
Total sum of squares |
7 |
Regression mean square |
8 |
Error mean square |
9 |
F-statistic |
10 |
p-value |
11 |
R2 (in percent) |
12 |
Adjusted R2 (in percent) |
13 |
Estimated standard deviation of the model error |
14 |
Mean of the response (dependent) variable |
15 |
Coefficient of variation (in percent) |
Optional Arguments
IPRINT — Printing option. (Input)
Default: IPRINT = 0.
IPRINT |
Action |
0 |
Printing is not performed. |
1 |
AOV and EFSS are printed. |
2, ‑2 |
Only marginal means are printed. If IPRINT = 2, then all of YMEANS is printed. If IPRINT = ‑2, then marginal means higher than (∣INTERA∣) -way are not printed. |
3, ‑3 |
AOV, EFSS, and all or some of YMEANS is printed. If IPRINT = 3, then all of YMEANS is printed. If IPRINT = ‑3, then marginal means higher than(∣INTERA∣) -way are not printed. |
EFSS — NEF by 4 matrix containing statistics relating to the sums of squares for the effects in the model. (Output)
Here, NEF= BINOM(n, 1) + BINOM(n, 2) + … + BINOM(n, ∣INTERA∣) where the IMSL subroutine BINOM (IMSL MATH/LIBRARY Special Functions) returns the binomial coefficient, and n is given by
Suppose the factors are A, B, C, and error. With INTERA = 3, rows 1 through NEF would correspond to A, B, C, AB, AC, BC, and ABC, respectively. The columns of EFSS are as follows:
Column |
Description |
1 |
Degrees of freedom |
2 |
Sum of squares |
3 |
F -statistic |
4 |
p-value |
LDEFSS — Leading dimension of EFSS exactly as specified in the dimension statement in the calling program. (Input)
Deafult: LDEFSS = size (EFSS , 1)
YMEANS — Vector of length (NL(1) + 1) * (NL(2) + 1) * … * (NL(n) + 1) containing subgroup means. (Output)
See argument EFSS for a definition of n. Suppose that the factors are A, B, C, and error. The ordering of the means is grand mean, A means, B means, C means, AB means, AC means, BC means, and ABC means.
FORTRAN 90 Interface
Generic: CALL ANWAY (NF, NL, Y, INTERA, AOV [, …])
Specific: The specific interface names are S_ANWAY and D_ANWAY.
FORTRAN 77 Interface
Single: CALL ANWAY (NF, NL, Y, INTERA, IPRINT, AOV, EFSS, LDEFSS, YMEANS)
Double: The double precision name is DANWAY.
Description
Routine ANWAY performs an analysis for an n-way classification design with balanced data. For balanced data, there must be an equal number of responses in each cell of the n-way layout. The effects are assumed to be fixed effects. The model is an extension of the twoway model to include n factors. The interactions (two-way, three-way, up to n-way) can be included in the model, or some of the higher-way interactions can be pooled into error. The argument INTERA specifies which interactions are to be pooled into error. For example, if three-way and higher-way interactions are to be pooled into error, set INTERA = ‑ 2 or INTERA = 2. A positive INTERA indicates there are repeated responses within the n-way cells, while a negative INTERA indicates otherwise.
Routine ANWAY requires the responses as input into a single vector Y in lexicographical order so that the response subscript associated with the first factor varies least rapidly, the subscript associated with the second factor varies next most rapidly, and so forth. Hemmerle (1967, Chapter 5) discusses the computational method.
Comments
Workspace may be explicitly provided, if desired, by use of A2WAY/DA2WAY. The reference is:
CALL A2WAY (NF, NL, Y, INTERA, IPRINT, AOV, EFSS, LDEFSS, YMEANS, WK, IWK)
The additional arguments are as follows:
WK — Work vector of length 5 * 2n + NMEANS + 4.
IWK — Work vector of length (NF + 2) * 2NF-1 + (n + 2) * 2n−1 + n ‑ 2.
Examples
Example 1
A two-way analysis of variance is performed with balanced data discussed by Snedecor and Cochran (1967, Table 12.5.1, page 347). The responses are the weight gains (in grams) of rats fed diets varying in two components—source of protein (A) and level of protein (B). Here, INTERA = 2 is used. The model is
yijk = μ + αi + βj + γij + ɛijk i = 1, 2; j = 1, 2, 3; k = 1, 2, …, 10
where
for j = 1, 2, 3; and
for i = 1, 2.
The first responses in each cell in the two-way layout are given in the following table:
Protein Source (A) |
|||
Protein Level (B) |
Beef |
Cereal |
Pork |
High |
73, 102, 118, 104, 81, 107, 100, 87, 117, 111 |
98, 74, 56, 111, 95, 88, 82, 77, 86, 92 |
94, 79, 96, 98, 102, 102, 108, 91, 120, 105 |
Low |
90, 76, 90, 64, 86, 51, 72, 90, 95, 78 |
107, 95, 97, 80, 98, 74, 74, 67, 89, 58 |
49, 82, 73, 86, 81, 97, 106, 70, 61, 82 |
USE ANWAY_INT
IMPLICIT NONE
INTEGER NF, NOBS
PARAMETER (NF=3, NOBS=60)
!
INTEGER INTERA, IPRINT, NL(NF)
REAL AOV(15), Y(NOBS)
!
DATA Y/73.0, 102.0, 118.0, 104.0, 81.0, 107.0, 100.0, 87.0, &
117.0, 111.0, 90.0, 76.0, 90.0, 64.0, 86.0, 51.0, 72.0, &
90.0, 95.0, 78.0, 98.0, 74.0, 56.0, 111.0, 95.0, 88.0, &
82.0, 77.0, 86.0, 92.0, 107.0, 95.0, 97.0, 80.0, 98.0, &
74.0, 74.0, 67.0, 89.0, 58.0, 94.0, 79.0, 96.0, 98.0, &
102.0, 102.0, 108.0, 91.0, 120.0, 105.0, 49.0, 82.0, 73.0, &
86.0, 81.0, 97.0, 106.0, 70.0, 61.0, 82.0/
DATA NL/3, 2, 10/
!
INTERA = 2
IPRINT = 3
CALL ANWAY (NF, NL, Y, INTERA, AOV, IPRINT=IPRINT)
END
Output
Dependent R-squared Adjusted Est. Std. Dev. Coefficient of
Variable (percent) R-squared of Model Error Mean Var. (percent)
Y 28.477 21.854 14.65 87.87 16.67
* * * Analysis of Variance * * *
Sum of Mean Prob. of
Source DF Squares Square Overall F Larger F
Model 5 4612.9 922.6 4.300 0.0023
Error 54 11586.0 214.6
Corrected Total 59 16198.9
* * * Variation Due to the Model * * *
Sum of Prob. of
Source DF Squares F Larger F
A 2 266.53 0.621 0.5411
B 1 3168.27 14.767 0.0003
A*B 2 1178.13 2.746 0.0732
* * * Subgroup Means * * *
A Means (N=20)
1 89.6000
2 84.9000
3 89.1000
B Means (N=30)
1 95.1333
2 80.6000
A*B Means (N=10)
1 1 100.0000
1 2 79.2000
2 1 85.9000
2 2 83.9000
3 1 99.5000
3 2 78.7000
Example 2
This example performs a three-way analysis of variance using data discussed by John (1971, pages 91‑92). The responses are weights (in grams) of roots of carrots grown with varying amounts of applied nitrogen (A), potassium (B), and phosphorus (C). There is one response within each cell of the three-way layout. INTERA is set to ‑2 in order to pool the ABC three-factor interaction into error. (Note that the ABC interaction sum of squares, which is 186, is given incorrectly by John [1971, Table 5.2].) IPRINT is set to ‑3 so that the ABC means will not be printed (since ∣INTERA∣ is equal to 2). The three-way layout is given in the following table:
|
A0 |
A1 |
A2 |
||||||
|
B0 |
B1 |
B2 |
B0 |
B1 |
B2 |
B0 |
B1 |
B2 |
C0 |
88.76 |
91.41 |
97.85 |
94.83 |
100.49 |
99.75 |
99.90 |
100.23 |
104.51 |
C1 |
87.45 |
98.27 |
95.85 |
84.57 |
97.20 |
112.30 |
92.98 |
107.77 |
110.94 |
C2 |
86.01 |
104.20 |
90.09 |
81.06 |
120.80 |
108.77 |
94.72 |
118.39 |
102.87 |
USE ANWAY_INT
IMPLICIT NONE
INTEGER NF, NOBS
PARAMETER (NF=3, NOBS=27)
!
INTEGER INTERA, IPRINT, NL(NF)
REAL AOV(15), Y(NOBS)
!
DATA Y/88.76, 87.45, 86.01, 91.41, 98.27, 104.20, 97.85, 95.85, &
90.09, 94.83, 84.57, 81.06, 100.49, 97.20, 120.8, 99.75, &
112.30, 108.77, 99.9, 92.98, 94.72, 100.23, 107.77, 118.39, &
104.51, 110.94, 102.87/
DATA NL/3, 3, 3/
!
INTERA = -2
IPRINT = -3
CALL ANWAY (NF, NL, Y, INTERA, AOV, IPRINT=IPRINT)
END
Output
Dependent R-squared Adjusted Est. Std. Dev. Coefficient of
Variable (percent) R-squared of Model Error Mean Var. (percent)
Y 92.804 76.612 4.819 98.96 4.869
* * * Analysis of Variance * * *
Sum of Mean Prob. of
Source DF Squares Square Overall F Larger F
Model 18 2395.7 133.1 5.731 0.0083
Error 8 185.8 23.2
Corrected Total 26 2581.5
* * * Variation Due to the Model * * *
Sum of Prob. of
Source DF Squares F Larger F
A 2 488.37 10.515 0.0058
B 2 1090.66 23.483 0.0004
C 2 49.15 1.058 0.3911
A*B 4 142.59 1.535 0.2804
A*C 4 32.35 0.348 0.8383
B*C 4 592.62 6.380 0.0131
* * * Subgroup Means * * *
A Means (N=9)
1 93.3211
2 99.9744
3 103.5900
B Means (N=9)
1 90.0311
2 104.3067
3 102.5478
C Means (N=9)
1 97.5256
2 98.5922
3 100.7678
A*B Means (N=3)
1 1 87.4067
1 2 97.9600
1 3 94.5967
2 1 86.8200
2 2 106.1633
2 3 106.9400
3 1 95.8667
3 2 108.7967
3 3 106.1067
A*C Means (N=3)
1 1 92.6733
1 2 93.8567
1 3 93.4333
2 1 98.3567
2 2 98.0233
2 3 103.5433
3 1 101.5467
3 2 103.8967
3 3 105.3267
B*C Means (N=3)
1 1 94.4967
1 2 88.3333
1 3 87.2633
2 1 97.3767
2 2 101.0800
2 3 114.4633
3 1 100.7033
3 2 106.3633
3 3 100.5767