All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Clustering Educational Digital Library Usage Data: Comparisons of Latent Class Analysis and K-Means Algorithms

Beijie Xu, Utah State University

Date of Award:

5-2011

Document Type:

Thesis

Degree Name:

Master of Science (MS)

Department:

Computer Science

Committee Chair(s)

Xiaojun Qi

Committee

Xiaojun Qi

Committee

Nicholas Flann

Committee

Mimi Recker

Abstract

There are common pitfalls and neglected areas when using clustering approaches to solve educational problems. A clustering algorithm is often used without the choice being justified. Few comparisons between a selected algorithm and a competing algorithm are presented, and results are presented without validation. Lastly, few studies fully utilize data provided in an educational environment to evaluate their findings. In response to these problems, this thesis describes a rigorous study comparing two clustering algorithms in the context of an educational digital library service, called the Instructional Architect.

First, a detailed description of the chosen clustering algorithm, namely, latent class analysis (LCA), is presented. Second, three kinds of preprocessed data are separately applied to both the selected algorithm and a competing algorithm, namely, K-means algorithm. Third, a series of comprehensive evaluations on four aspects of each clustering result, i.e., intra-cluster and inter-cluster distances, Davies-Bouldin index, users' demographic profile, and cluster evolution, are conducted to compare the clustering results of LCA and K-means algorithms. Evaluation results show that LCA outperforms K-means in producing consistent clustering results at different settings, finding compact clusters, and finding connections between users' teaching experience and their effectiveness in using the IA. The implication, contributions, and limitation of this research are discussed.

Checksum

96fcc1a97c0b986c7d32522047201113

Comments

This work made publicly available electronically on June 10, 2011.

Recommended Citation

Xu, Beijie, "Clustering Educational Digital Library Usage Data: Comparisons of Latent Class Analysis and K-Means Algorithms" (2011). All Graduate Theses and Dissertations, Spring 1920 to Summer 2023. 954.
https://digitalcommons.usu.edu/etd/954

Download

Included in

Computer Engineering Commons

COinS

Copyright for this work is retained by the student. If you have any questions regarding the inclusion of this work in the Digital Commons, please email us at .

DOI

https://doi.org/10.26076/d2f7-4641

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Clustering Educational Digital Library Usage Data: Comparisons of Latent Class Analysis and K-Means Algorithms

Date of Award:

Document Type:

Degree Name:

Department:

Committee Chair(s)

Committee

Committee

Committee

Abstract

Checksum

Comments

Recommended Citation

Included in

DOI

Browse

For Authors

Scholarly Communication

Research Data

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Clustering Educational Digital Library Usage Data: Comparisons of Latent Class Analysis and K-Means Algorithms

Author

Date of Award:

Document Type:

Degree Name:

Department:

Committee Chair(s)

Committee

Committee

Committee

Abstract

Checksum

Comments

Recommended Citation

Included in

Share

DOI

Browse

For Authors

Scholarly Communication

Research Data