Date of Award:

5-2023

Document Type:

Thesis

Degree Name:

Master of Science (MS)

Department:

Mathematics and Statistics

Committee Chair(s)

Alan Wisler

Committee

Alan Wisler

Committee

Yan Sun

Committee

Kevin Moon

Abstract

Many real world problems require the prediction of ordinal variables where the values are a set of categories with an ordering to them. However, in many of these cases the categorical nature of the ordinal data is not a desirable outcome. As such, regression models treat ordinal variables as continuous and do not bind their predictions to discrete categories. Prior research has found that these models are capable of learning useful information between the discrete levels of the ordinal labels they are trained on, but complex models may learn ordinal labels too closely, missing the information between levels. In this study, we use several datasets where the outcome is continuous and generate ordinal labels from this variable. The performance of two types of models, namely ordinal classification and continuous regression, is examined to determine the effect of model complexity. The experiment confirms previous findings that regression models trained on the synthetic ordinal labels reach optimal performance on the continuous outcome with less complexity than compared with performance on the ordinal labels. Additionally, the former overfit more quickly as complexity increases. This suggests that for machine learning settings where we would like to form a continuous analog for ordinal training labels, models should be trained with less complexity than what appears optimal given the observed performance on the ordinal labels reaches optimization after the underlying ground truth continuous measure’s optimization occurs.

Checksum

ad2f9ec6498cd173330399d1c27da83b

Recommended Citation

Thomas, McKade S., "Examining Model Complexity's Effects When Predicting Continuous Measures From Ordinal Labels" (2023). All Graduate Theses and Dissertations, Spring 1920 to Summer 2023. 8784.
https://digitalcommons.usu.edu/etd/8784

Download

Included in

Mathematics Commons, Statistics and Probability Commons

COinS

Copyright for this work is retained by the student. If you have any questions regarding the inclusion of this work in the Digital Commons, please email us at .

DOI

https://doi.org/10.26076/9503-31c0

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Examining Model Complexity's Effects When Predicting Continuous Measures From Ordinal Labels

Date of Award:

Document Type:

Degree Name:

Department:

Committee Chair(s)

Committee

Committee

Committee

Abstract

Checksum

Recommended Citation

Included in

DOI

Browse

For Authors

Scholarly Communication

Research Data

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Examining Model Complexity's Effects When Predicting Continuous Measures From Ordinal Labels

Author

Date of Award:

Document Type:

Degree Name:

Department:

Committee Chair(s)

Committee

Committee

Committee

Abstract

Checksum

Recommended Citation

Included in

Share

DOI

Browse

For Authors

Scholarly Communication

Research Data