CLUSTAR and CLUSTID: Computer Programs for Hierarchical Cluster Analysis
The American Statistician
American Statistical Association
The computer program CLUSTAR performs hierarchical cluster analysis following the sequential paradigm described by Sneath and Sokal (1973): (a) A data matrix of measurements for each of t objects (items, individuals, etc.) over n attributes is read; (b) the data matrix is optionally standardized; (c) a (dis)similarity coefficient is used to compute a pairwise measure of (dis)similarity among the t(t + 1)/2 object pairs; (d) a clustering method is used to produce a dendrogram on a line printer showing the similarity/dissimilarity relationships among the t objects. Clusters, that is, groups of objects that are sufficiently similar to be treated as homogeneous units, are defined on the dendrogram by the user. The computer program CLUSTID can then be used to (a) identify which clusters new objects (i.e., those objects not used in the CLUSTAR run) belong to and (b) provide summary statistics for each cluster, that is, attribute means and standard deviations.
H. C. Romesburg and K. Marshall. 1980. CLUSTER and CLUSTID-Programs for Hierarchical Cluster Analysis. American Statistician, 34(3):186.