Date of Award:

5-1975

Document Type:

Thesis

Degree Name:

Master of Science (MS)

Department:

Computer Science

Department name when degree awarded

Applied Statistics and Computer Science

Committee Chair(s)

David White

Committee

David White

Committee

Rex Hurst

Committee

James Shaver

Abstract

Four statistics used for the analysis of categorical data were observed in the presence of many zero cell frequencies in two-way classification contingency tables. The purpose of this study was to determine the effect of many zero cell frequencies upon the distributional properties of each of the four statistics studied. It was found that Light and Margolin's C and Pearson's Chi-square statistic closely approximated the Chi-square distribution as long as fewer than one-third of the table cells were empty. It was found that the mean and variance of Kullback's 2I were larger than the expected values in the presence of few empty cells. The mean for 2I was found to become small in the presence of large numbers of empty cells. Ku's corrected 2I statistic was found, in the presence of many zero cell frequencies, to have a much larger mean value than would be expected in a Chi-square distribution. Kullback's 2I demonstrated a peculiar distribution change in the presence of large numbers of zero cell frequencies: its average value first increased, then decreased.
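
As a rough illustration of two of the statistics named in the abstract, the sketch below computes Pearson's Chi-square and Kullback's 2I for a two-way table that contains zero cells. It is not taken from the thesis. It assumes the standard textbook forms, with 2I written as 2 Σ n_ij ln(n_ij N / (n_i. n_.j)) and zero cells contributing nothing to the sum, and it assumes every row and column total is positive. The function names and the example table are hypothetical. Light and Margolin's C and Ku's correction are left out because their exact forms are not given here.

```python
import math


def margins(table):
    """Return row totals, column totals, and the grand total of a two-way table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    return row_totals, col_totals, sum(row_totals)


def pearson_chi_square(table):
    """Pearson's statistic: sum of (observed - expected)^2 / expected.

    Assumes all row and column totals are positive, so every expected
    count is nonzero even when the observed count is zero.
    """
    rows, cols, n = margins(table)
    total = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = rows[i] * cols[j] / n
            total += (observed - expected) ** 2 / expected
    return total


def kullback_2i(table):
    """Kullback's 2I under independence (assumed standard form).

    Zero cells are skipped, following the convention 0 * ln(0) = 0.
    """
    rows, cols, n = margins(table)
    total = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            if observed > 0:
                total += observed * math.log(observed * n / (rows[i] * cols[j]))
    return 2.0 * total


def empty_cell_fraction(table):
    """Fraction of cells whose observed frequency is zero."""
    cells = [count for row in table for count in row]
    return sum(1 for count in cells if count == 0) / len(cells)


if __name__ == "__main__":
    # Hypothetical 3x3 table with four empty cells.
    example = [[6, 0, 2], [0, 5, 0], [1, 0, 4]]
    print("empty fraction:", empty_cell_fraction(example))
    print("Pearson chi-square:", pearson_chi_square(example))
    print("Kullback 2I:", kullback_2i(example))
```

Both statistics would be compared against a Chi-square distribution with (r - 1)(c - 1) degrees of freedom. The empty-cell fraction is the quantity the abstract's one-third threshold refers to.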
