dataSets

Retrieves a commonly analyzed data set.

Synopsis

dataSets (dataSetChoice)

Required Arguments

int dataSetChoice (Input)
Data set indicator. Set dataSetChoice = 0 to print a description of all fourteen data sets. In this case, any optional arguments are ignored.
dataSetChoice  nObservations  nVariables  Description of Data Set
            1             16           7  Longley
            2            176           2  Wolfer sunspot
            3            150           5  Fisher iris
            4            144           1  Box and Jenkins Series G
            5             13           5  Draper and Smith Appendix B
            6            197           1  Box and Jenkins Series A
            7            296           2  Box and Jenkins Series J
            8            100           4  Robinson Multichannel Time Series
            9            113          34  Afifi and Azen Data Set A
           10            958          10  Tic-Tac-Toe Endgame
           11           4601          58  Spambase Data Set
           12            690          16  Credit Approval
           13          20000          17  Letter Recognition Data
           14            366          35  Dermatology Database
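
The dimensions in the table above can be kept in a small lookup so that a retrieved data set can be sanity-checked. The following is a minimal sketch in plain Python, not part of the library: the names DATA_SET_SHAPES and shape_matches are hypothetical, and the helper assumes the data arrives as a sequence of equal-length rows.

```python
# Dimensions copied from the table above: choice -> (nObservations, nVariables).
DATA_SET_SHAPES = {
    1: (16, 7),
    2: (176, 2),
    3: (150, 5),
    4: (144, 1),
    5: (13, 5),
    6: (197, 1),
    7: (296, 2),
    8: (100, 4),
    9: (113, 34),
    10: (958, 10),
    11: (4601, 58),
    12: (690, 16),
    13: (20000, 17),
    14: (366, 35),
}


def shape_matches(choice, rows):
    """Return True if rows has the tabulated number of rows and columns.

    Hypothetical helper: assumes rows is a sequence of equal-length rows.
    """
    n_obs, n_vars = DATA_SET_SHAPES[choice]
    return len(rows) == n_obs and all(len(row) == n_vars for row in rows)


# Stand-in data with the dimensions of data set 5 (13 rows, 5 columns).
fake_rows = [[0.0] * 5 for _ in range(13)]
print(shape_matches(5, fake_rows))
```

The stand-in rows only exercise the check; in practice the second argument would be the value returned by dataSets.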

Return Value

If dataSetChoice ≠ 0, the requested data set is returned. If dataSetChoice = 0 or an error occurs, None is returned.
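
Because None is returned both for dataSetChoice = 0 and on error, calling code may want to check the result before using it. The following is a minimal sketch of such a guard. The names require_data_set and fake_loader are hypothetical; the loader is passed in as a parameter so the sketch runs without the library, and in practice it would be dataSets.

```python
def require_data_set(loader, choice):
    """Call loader(choice) and raise instead of passing None along.

    loader is any callable that, like dataSets, returns the data set
    or None. This wrapper is illustrative, not part of the library.
    """
    if choice == 0:
        # Choice 0 only prints descriptions; no data set is returned.
        raise ValueError("choice 0 prints descriptions and returns None")
    data = loader(choice)
    if data is None:
        raise RuntimeError(f"data set {choice} could not be retrieved")
    return data


def fake_loader(choice):
    # Hypothetical stand-in for dataSets: knows only one tiny data set.
    return [[1.0, 2.0]] if choice == 1 else None


print(require_data_set(fake_loader, 1))
```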

Optional Arguments

nObservations (Output)
Number of observations or rows in the output matrix.
nVariables (Output)
Number of variables or columns in the output matrix.
printNone
No printing is performed. This option is the default.
printBrief
Rows 1 through 10 of the data set are printed.
printAll
All rows of the data set are printed.
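
The three print options differ only in how many rows are displayed. The following sketch models that documented behavior in plain Python. It is not how the library implements printing, and the function name rows_for_print_option is hypothetical.

```python
def rows_for_print_option(rows, option="printNone"):
    """Return the rows that the named print option would display.

    Illustrative model of the documented behavior only.
    """
    if option == "printNone":
        return []                # default: nothing is printed
    if option == "printBrief":
        return list(rows[:10])   # rows 1 through 10
    if option == "printAll":
        return list(rows)        # every row
    raise ValueError(f"unknown print option: {option}")


# Stand-in data with 13 rows, the row count of data set 5.
sample = [[i] for i in range(1, 14)]
for option in ("printNone", "printBrief", "printAll"):
    print(option, len(rows_for_print_option(sample, option)))
```

For a data set with ten or fewer rows, printBrief and printAll would display the same rows under this model.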

Description

Function dataSets retrieves a standard data set frequently cited in statistics textbooks or in this manual. The following table gives the references for each data set:

dataSetChoice Reference
1 Longley (1967)
2 Anderson (1971, p. 660)
3 Fisher (1936); Mardia et al. (1979, Table 1.2.2)
4 Box and Jenkins (1976, p. 531)
5 Draper and Smith (1981, pp. 629-630)
6 Box and Jenkins (1976, p. 525)
7 Box and Jenkins (1976, pp. 532-533)
8 Robinson (1976, p. 204)
9 Afifi and Azen (1979, pp. 16-22)
10 Aha, D. W. (1991, pp. 117-121), and Asuncion, A. & Newman, D.J. (2007)
11 Asuncion, A. & Newman, D.J. (2007)
12 Quinlan (1987, pp. 221-234, 1997), and Asuncion, A. & Newman, D.J. (2007)
13 P. W. Frey and D. J. Slate (Machine Learning, Vol. 6, No. 2, March 1991), and Asuncion, A. & Newman, D.J. (2007)
14 G. Demiroz, H. A. Govenir, and N. Ilter (Artificial Intelligence in Medicine), and Asuncion, A. & Newman, D.J. (2007)

Example

In this example, dataSets is used to copy the Draper and Smith (1981, Appendix B) data set into x.

from pyimsl.stat.dataSets import dataSets
from pyimsl.stat.writeMatrix import writeMatrix

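# Retrieve data set 5 (Draper and Smith, Appendix B): 13 observations, 5 variables.
x = dataSets(5)
# Print the matrix under a descriptive title.
writeMatrix("Draper and Smith, Appendix B", x)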

Output

 
                   Draper and Smith, Appendix B
              1            2            3            4            5
 1          7.0         26.0          6.0         60.0         78.5
 2          1.0         29.0         15.0         52.0         74.3
 3         11.0         56.0          8.0         20.0        104.3
 4         11.0         31.0          8.0         47.0         87.6
 5          7.0         52.0          6.0         33.0         95.9
 6         11.0         55.0          9.0         22.0        109.2
 7          3.0         71.0         17.0          6.0        102.7
 8          1.0         31.0         22.0         44.0         72.5
 9          2.0         54.0         18.0         22.0         93.1
10         21.0         47.0          4.0         26.0        115.9
11          1.0         40.0         23.0         34.0         83.8
12         11.0         66.0          9.0         12.0        113.3
13         10.0         68.0          8.0         12.0        109.4