GCSCP

Generates centered variables, squares, and crossproducts.

Required Arguments

X — NRX by NVAR matrix containing the data. (Input)

XMEAN — Vector of length NVAR containing the means of the variables. (Input)

CSCP — NRX by NVAR * (NVAR + 3)/2 matrix containing the centered variables, squares, and crossproducts. (Output)

Columns	Description
1 to NVAR	Centered variables
NVAR+ 1 to 2 * NVAR	Squared variables
2 * NVAR + 1 to NVAR * (NVAR + 3)/2	Crossproducts

If X is not needed, X and the first NVAR columns of CSCP may occupy the same storage locations.

Optional Arguments

IDO — Processing option. (Input)
Default: IDO = 0.

IDO	Action
0	This is the only invocation of GCSCP for this data set, and all the data are input at once.
1	This is the first invocation, and additional calls to GCSCP will be made. Initialization and updating for the data in X are performed.
2	This is an intermediate or final invocation of GCSCP and updating for the data in X is performed.

NRX — Number of rows of data in X. (Input)
Default: NRX = size (X,1).

NVAR — Number of variables. (Input)
Default: NVAR = size (X,2).

LDX — Leading dimension of X exactly as specified in the dimension statement in the calling program. (Input)
Default: LDX = size (X,1).

ICEN — Centering option. (Input)
If IDO = 1 or IDO = 2, ICEN must equal 0.
Default: ICEN = 0.

ISUB	Action
0	CSCP contains the centered variables in columns 1 through NVAR. Square and crossproduct variables are generated from these centered variables in the remaining columns of CSCP.
1	First, the action taken when ICEN = 0 is performed. Next, the means of the square and crossproduct variables are subtracted from the square and crossproduct variables.

SCPM — Vector of length NVAR * (NVAR + 1)/2 containing the means of the generated square and crossproduct variables. (Output, if IDO = 0 or 1; input/output, if IDO = 2)

Elements	Description
1 to NVAR	Squared variable means
NVAR+ 1 to NVAR * (NVAR + 1)/2	Crossproduct variable means

LDCSCP — Leading dimension of CSCP exactly as specified in the dimension statement in the calling program. (Input)
Default: LDCSCP = size (CSCP,1).

NRMISS — Number of rows of data encountered in calls to GCSCP that contain any missing values for the variables. (Output, if IDO = 0 or 1; Input/Output, if IDO = 2)
NaN (not a number) is used as the missing value code.
Default: NRMISS = 0.

NVOBS — Number of valid observations. (Output, if IDO = 0 or 1; Input/Output, if IDO = 2)
Number of rows of data encountered in calls to GCSCP that do not contain any missing values for the variables.

FORTRAN 90 Interface

Generic: CALL GCSCP (X, XMEAN, CSCP [, …])

Specific: The specific interface names are S_GCSCP and D_GCSCP.

FORTRAN 77 Interface

Single: CALL GCSCP (IDO, NRX, NVAR, X, LDX, ICEN, XMEAN, SCPM, CSCP, LDCSCP, NRMISS, NVOBS)

Double: The double precision name is DGCSCP.

Description

Routine GCSCP centers a data set consisting of independent variable settings and generates (using the centered variables) the settings for all possible squared and crossproduct variables in standard order. The routine GCSCP is designed so that you can partition a large data set into submatrices (requiring less space) and make multiple calls to GCSCP (with IDO = 1, 2, 2 …, 2). Alternatively, one invocation of GCSCP (with IDO = 0) can be made with the entire data set contained in X.

Let n be the number of rows in the entire data set, and let m (stored in NVAR) be the number of variables. Let xij be the i-th setting of the j-th variable (i = 1, 2, …, n; j = 1, 2, …, m). Denote the means (stored in XMEAN) by

The settings of the j-th centered variable (stored in the j-th column of CSCP) are given by

The settings of the j-th squared variable (stored in the (m + j)-th column of CSCP) are given by

where

(stored in the (m + j)-th column of SCPM) is the mean of the j-th squared variable. The settings of the jk crossproduct variable (stored in the

column of CSCP) are given by

where

(stored in the

location of SCPM) is the mean of the jk-th (j < k) crossproduct variable.

Comments

Crossproduct variables are ordered as follows: (1, 2), (1, 3), …, (1, NVAR), (2, 3), (2, 4), …, (2, NVAR), …, (NVAR ‑ 1, NVAR).

Examples

Example 1

With data containing 4 rows and 3 variables, GCSCP is used to center the variables and to generate (using the centered variables) the square and crossproduct variables. The data is input in one invocation (IDO = 0), and the generated squared and crossproduct variables are centered (ICEN = 1). On output, SCPM contains the means in standard order, i.e.,

Also, CSCP contains the variables in standard order, i.e.,

USE GCSCP_INT

USE UMACH_INT

USE WRRRN_INT

IMPLICIT NONE

INTEGER LDCSCP, LDX, NRX, NVAR, J, ICEN

PARAMETER (NRX=4, NVAR=3, LDCSCP=NRX, LDX=NRX)

INTEGER NOUT, NRMISS, NVOBS

REAL CSCP(LDCSCP,NVAR*(NVAR+3)/2), SCPM(NVAR*(NVAR+1)/2), &

X(LDX,NVAR), XMEAN(NVAR)

DATA (X(1,J),J=1,NVAR)/10.0, 8.0, 11.0/

DATA (X(2,J),J=1,NVAR)/ 5.0, 15.0, 1.0/

DATA (X(3,J),J=1,NVAR)/ 3.0, 2.0, 4.0/

DATA (X(4,J),J=1,NVAR)/ 6.0, 3.0, 4.0/

DATA XMEAN/6.0, 7.0, 5.0/

ICEN = 1

CALL GCSCP (X, XMEAN, CSCP, ICEN=ICEN, scpm=scpm, nrmiss=nrmiss, &

nvobs=nvobs)

CALL UMACH (2, NOUT)

WRITE (NOUT,*) ' NRMISS = ', NRMISS

CALL WRRRN ('SCPM', SCPM, 1, NVAR*(NVAR+1)/2, 1)

CALL WRRRN ('CSCP', CSCP)

END

Output

NRMISS = 0

SCPM

1 2 3 4 5

6.50 26.50 13.50 2.75 7.75 -4.25

CSCP

1 2 3 4 5 6 7 8 9

1 4.00 1.00 6.00 9.50 -25.50 22.50 1.25 16.25 10.25

2 -1.00 8.00 -4.00 -5.50 37.50 2.50 -10.75 -3.75 -27.75

3 -3.00 -5.00 -1.00 2.50 -1.50 -12.50 12.25 -4.75 9.25

4 0.00 -4.00 -1.00 -6.50 -10.50 -12.50 -2.75 -7.75 8.25

Example 2

With data containing 4 rows and 3 variables, GCSCP is used to center the variables and to generate (using the centered variables) the square and crossproduct variables. The data is input in multiple invocations (IDO = 1, 2, 2, 2). Here, the square and crossproduct variables, generated using the centered variables, cannot be centered (ICEN = 0).

USE GCSCP_INT

USE UMACH_INT

USE WRRRN_INT

IMPLICIT NONE

INTEGER LDCSCP, LDX, NRX, NVAR, J

PARAMETER (LDX=4, NRX=1, NVAR=3, LDCSCP=NRX)

INTEGER I, IDO, MISS, NOUT, NRMISS, NVOBS

REAL CSCP(LDCSCP,NVAR*(NVAR+3)/2), SCPM(NVAR*(NVAR+1)/2), &

X(LDX,NVAR), XMEAN(NVAR)

DATA (X(1,J),J=1,NVAR)/10.0, 8.0, 11.0/

DATA (X(2,J),J=1,NVAR)/ 5.0, 15.0, 1.0/

DATA (X(3,J),J=1,NVAR)/ 3.0, 2.0, 4.0/

DATA (X(4,J),J=1,NVAR)/ 6.0, 3.0, 4.0/

DATA XMEAN/6.0, 7.0, 5.0/

CALL UMACH (2, NOUT)

MISS = 0

DO 10 I=1, 4

IF (I .EQ. 1) THEN

IDO = 1

ELSE

IDO = 2

END IF

CALL GCSCP (X(I:,1:), XMEAN, CSCP, IDO=IDO, NRX=NRX, scpm=scpm, &

nrmiss=nrmiss, nvobs=nvobs)

MISS = MISS + NRMISS

CALL WRRRN ('CSCP', CSCP)

10 CONTINUE

CALL WRRRN ('SCPM', SCPM, 1, NVAR*(NVAR+1)/2, 1)

WRITE (NOUT,*) ' MISS = ', MISS

END

Output

CSCP

1 2 3 4 5 6 7 8 9

4.00 1.00 6.00 16.00 1.00 36.00 4.00 24.00 6.00

CSCP

1 2 3 4 5 6 7 8 9

-1.00 8.00 -4.00 1.00 64.00 16.00 -8.00 4.00 -32.00

CSCP

1 2 3 4 5 6 7 8 9

-3.00 -5.00 -1.00 9.00 25.00 1.00 15.00 3.00 5.00

CSCP

1 2 3 4 5 6 7 8 9

0.00 -4.00 -1.00 0.00 16.00 1.00 0.00 0.00 4.00

SCPM

1 2 3 4 5 6

6.50 26.50 13.50 2.75 7.75 -4.25

MISS = 0