Skip to Main Content
Mastering Machine Learning with R, Second Edition - Second Edition
book

Mastering Machine Learning with R, Second Edition - Second Edition

by Cory Lesmeister, Doug Ortiz, Vikram Dhillon, Miroslav Kopecky
April 2017
Beginner to intermediate content levelBeginner to intermediate
420 pages
9h 58m
English
Content preview from Mastering Machine Learning with R, Second Edition - Second Edition

Random forest

To greatly improve our model's predictive ability, we can produce numerous trees and combine the results. The random forest technique does this by applying two different tricks in model development. The first is the use of bootstrap aggregation or bagging, as it is called.

In bagging, an individual tree is built on a random sample of the dataset, roughly two-thirds of the total observations (note that the remaining one-third is referred to as out-of-bag (oob)). This is repeated dozens or hundreds of times and the results are averaged. Each of these trees is grown and not pruned based on any error measure, and this means that the variance of each of these individual trees is high. However, by averaging the results, you can reduce ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

Publisher Resources

ISBN: 9781787287471Supplemental Content