Date of Award:
5-1975
Document Type:
Thesis
Degree Name:
Master of Science (MS)
Department:
Computer Science
Department name when degree awarded
Applied Statistics and Computer Science
Committee Chair(s)
David White
Committee
David White
Committee
Rex Hurst
Committee
James Shaver
Abstract
Four statistics used for the analysis of categorical data were observed in the presence of many zero cell frequencies in two way classification contingency tables. The purpose of this study was to determine the effect of many zero cell frequencies upon the distribution properties of each of the four statistics studied. It was found that Light and Margolin's C and Pearson's Chi-square statistic closely approximated the Chi-square distribution as long as less than one-third of the table cells were empty. It was found that the mean and variance of Kullbach's 2I were larger than the expected values in the presence of few empty cells. The mean for 2I was found to become small in the presence of large numbers of empty cells. Ku's corrected 2I statistic was found, in the presence of many zero cell frequencies, to have a much larger mean value than would be expected in a Chi-square distribution. Kullback's 2I demonstrated a peculiar distribution change in the presence of large numbers of zero cell frequencies. 2I first enlarged, then decreased in average value.
Checksum
951af932e9fcbed6f871aa6e5be8f53a
Recommended Citation
Post, Jane R., "A Study of Four Statistics, Used in Analysis of Contingency Tables, in the Presence of Low Expected Frequencies" (1975). All Graduate Theses and Dissertations, Spring 1920 to Summer 2023. 6955.
https://digitalcommons.usu.edu/etd/6955
Included in
Copyright for this work is retained by the student. If you have any questions regarding the inclusion of this work in the Digital Commons, please email us at .