Date of Award:
8-2023
Document Type:
Thesis
Degree Name:
Master of Science (MS)
Department:
Mathematics and Statistics
Committee Chair(s)
Brennan Bean
Committee
Brennan Bean
Committee
Stephen Walsh
Committee
Brent Thomas
Abstract
Many discipline-specific researchers need a way to quickly compare the accuracy of their predictive models to other alternatives. However, many of these researchers are not experienced with multiple programming languages. Python has recently led in machine learning functionality, including the PyCaret library, which allows users to develop high-performing machine learning models with only a few lines of code. The goal of the stressor package is to help users of the R programming language access the advantages of PyCaret without having to learn Python. This lets users leverage R’s powerful data analysis workflows while drawing on Python’s powerful machine learning functionality. stressor also implements a series of functions that generate synthetic data sets on which users can test ideas with models they create or use. These data sets can be paired with various forms of accuracy comparison to stress-test the models’ predictive capacity. This thesis illustrates this stress-test workflow on both real and synthetic data, demonstrating stressor’s utility and ease of use.
Checksum
cd78a8943a3f7575da4ffda21ae42c20
Recommended Citation
Haycock, Samuel A., "Stressor: An R Package for Benchmarking Machine Learning Models" (2023). All Graduate Theses and Dissertations, Spring 1920 to Summer 2023. 8819.
https://digitalcommons.usu.edu/etd/8819
Copyright for this work is retained by the student.