Date of Award
12-2018
Degree Type
Creative Project
Degree Name
Master of Science (MS)
Department
Mathematics and Statistics
Committee Chair(s)
Adele Cutler
Committee
Adele Cutler
Committee
John R. Stevens
Committee
D. Richard Cutler
Abstract
Random forests are very popular tools for predictive analysis and data science. They work for both classification (where there is a categorical response variable) and regression (where the response is continuous). Random forests provide proximities, and both local and global measures of variable importance. However, these quantities require special tools to be effectively used to interpret the forest. Rfviz is a sophisticated interactive visualization package and toolkit in R, specially designed for interpreting the results of a random forest in a user-friendly way. Rfviz uses a recently developed R package (loon) from the Comprehensive R Archive Network (CRAN) to create parallel coordinate plots of the predictor variables, the local importance values, and the MDS plot of the proximities. The visualizations allow users to highlight or brush observations in one plot and have the same observations show up as highlighted in other plots. This allows users to explore unusual subsets of their data and to potentially discover previously-unknown relationships between the predictor variables and the response.
Recommended Citation
Beckett, Christopher, "Rfviz: An Interactive Visualization Package for Random Forests in R" (2018). All Graduate Plan B and other Reports, Spring 1920 to Spring 2023. 1335.
https://digitalcommons.usu.edu/gradreports/1335
Copyright for this work is retained by the student. If you have any questions regarding the inclusion of this work in the Digital Commons, please email us at .