Date of Award

2009

Degree Type

Report

Degree Name

Master of Science (MS)

Department

Mathematics and Statistics

Committee Chair(s)

Adele Cutler

Committee

Adele Cutler

Abstract

Random forests are ensembles of trees that give accurate predictions for regression, classification, and clustering problems. The CART tree, the base learner employed by random forests, has been criticized for bias in the selection of splitting variables, and this criticism casts doubt on the performance of random forests. Cforest, a new implementation of random forests based on Ctree (an implementation of conditional inference trees), was developed and is claimed to outperform random forests in both predictive power and variable importance measures.

This report addresses the underlying mechanisms of random forests and Cforest and compares the two methods on simulated data. Our study shows that, except in some extreme situations, random forests with properly chosen tuning parameter values provide higher prediction accuracy and more reliable variable importance measures than Cforest.

Recommended Citation

Xia, Rong, "Comparison of Random Forests and Cforest: Variable Importance Measures and Prediction Accuracies" (2009). All Graduate Plan B and other Reports, Spring 1920 to Spring 2023. 1255.
https://digitalcommons.usu.edu/gradreports/1255

Included in

Applied Statistics Commons

Copyright for this work is retained by the student.

DOI

https://doi.org/10.26076/74bd-59e1

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Comparison of Random Forests and Cforest: Variable Importance Measures and Prediction Accuracies
