How many trees in a random forest?
Journal of the Korean Data & Information Science Society 2022;33:325-35
Published online March 31, 2022
Cheolyong Park1 · Fred W. Huffer2

1Major in Statistics, Keimyung University
2Department of Statistics, Florida State University
We propose diagnostic statistics that can assist in choosing the size of a random forest for classification. The statistics are applied sequentially as the forest is constructed. They are computed from out-of-bag or test-set votes and estimate the expected disagreement between the current forest and an infinite forest. Simulation studies illustrate the performance of these statistics and compare them with other methods for choosing the size of a random forest.
Keywords: Binary classification, diagnostic statistics, measure of disagreement, number of trees, random forest.
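
To make the idea of "disagreement between the current and infinite forests" concrete, the following is a minimal sketch for binary classification. It is not the statistic proposed in the paper: it is a simple plug-in heuristic that assumes every tree votes on every observation (as with a test set), treats each observation's vote share as a binomial proportion, and uses a normal approximation to estimate the chance that the infinite-forest majority falls on the other side of 1/2. The function names `normal_cdf` and `estimated_disagreement` are hypothetical.

```python
import math
from typing import Sequence


def normal_cdf(z: float) -> float:
    """Standard normal CDF, computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def estimated_disagreement(votes_for_one: Sequence[int], n_trees: int) -> float:
    """Heuristic estimate of how often an infinite forest would disagree
    with the current forest of n_trees trees (illustrative assumption,
    not the paper's statistic).

    votes_for_one[i] is the number of trees voting class 1 for observation i.
    Assumes n_trees > 0 and a non-empty vote list.
    """
    total = 0.0
    for v in votes_for_one:
        p_hat = v / n_trees                      # observed vote share for class 1
        se = math.sqrt(p_hat * (1.0 - p_hat) / n_trees)
        if se == 0.0:
            continue                             # unanimous vote: treated as no disagreement
        # Approximate probability that the limiting vote share lies on the
        # opposite side of 1/2 from the observed share.
        total += normal_cdf(-abs(p_hat - 0.5) / se)
    return total / len(votes_for_one)
```

Used sequentially, one would recompute this estimate as trees are added and stop growing the forest once it falls below a chosen tolerance. With the same vote share, a larger forest yields a smaller estimate, for example `estimated_disagreement([600], 1000)` is smaller than `estimated_disagreement([60], 100)`.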