Date of Award

5-2012

Degree Type

Report

Degree Name

Master of Science (MS)

Department

Mathematics and Statistics

Committee Chair(s)

Adele Cutler

Committee

Adele Cutler

Committee

Chris Corcoran

Committee

Heidi Wengreen

Abstract

Random Forests is a data mining tool widely used to assess variable importance. However, few people explore Random Forests results in interactive graphs. This is partly because the software packages that support interactive graphics cannot handle large data sets, while Random Forests users typically have large data sets or many variables. A new R package that is still in development, iPlots eXtreme, makes it simple to explore large data sets interactively. I have created a function, irfplot (interactive random forests plot), that uses Random Forests results to produce interactive graphs that are more informative than those based on raw values. I use irfplot to explore the nutrition data set from the Cache County Memory Study.

Comments

This work made publicly available electronically on June 4, 2012.
