Date of Award
Master of Science (MS)
Mathematics and Statistics
Dr. Adele Cutler
Random Forests is a useful data mining tool that is widely used to assess variable importance. However, many people do not explore Random Forests results with interactive graphs. This is partly because software packages that support interactive graphs often cannot handle large data sets, while those who use Random Forests typically have large data sets or many variables. iPlots eXtreme, a new R package that is still in development, makes it simple to explore large data sets interactively. I have created a function called irfplot (interactive random forests plot) that uses Random Forests to produce interactive graphs that are more informative than graphs of the raw values. I use irfplot to explore the nutrition data set from the Cache County Memory Study.
Quach, Anna T., "Interactive Random Forests Plots" (2012). All Graduate Plan B and other Reports. Paper 134.
Copyright for this work is retained by the student.