Random Forest performance

Ian Witten discusses how the performance of the Random Forest algorithm changes with the number of trees.

The performance of Random Forest does tend to improve with more trees, but only up to a point.

In the preceding quiz, we found that performance increases from 100 to 200 trees, stays the same at 300, and deteriorates at 400 and 500 trees. (In this small example, however, the differences are probably not statistically significant.)
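To make the caveat about statistical significance concrete, here is a rough sketch of an unpaired two-proportion z-test on two accuracy figures. The class name, method name and the accuracy values are hypothetical illustrations, not results from the quiz, and the test treats the two runs as independent samples, which is only an approximation. A paired test, such as the corrected paired t-test in Weka's Experimenter, would be more appropriate when both models are evaluated on the same data.

```java
/** Hypothetical helper for illustration; not part of Weka. */
final class AccuracyDifference {

    /**
     * z statistic for the difference between two accuracies (as fractions),
     * each measured on n test instances. Assumes independent samples and a
     * pooled accuracy strictly between 0 and 1.
     */
    static double zStatistic(double acc1, double acc2, int n) {
        if (n <= 0) {
            throw new IllegalArgumentException("n must be positive");
        }
        double pooled = (acc1 + acc2) / 2.0;
        // Standard error of the difference of two proportions with equal n.
        double stdErr = Math.sqrt(pooled * (1.0 - pooled) * 2.0 / n);
        return (acc1 - acc2) / stdErr;
    }

    public static void main(String[] args) {
        // Made-up accuracies: 85.6% vs 85.0% on 1000 instances each.
        double z = zStatistic(0.856, 0.850, 1000);
        // |z| > 1.96 corresponds to significance at the 5% level (two-sided).
        System.out.printf("z = %.3f, significant at 5%%: %b%n", z, Math.abs(z) > 1.96);
    }
}
```

With a test set of this size, a difference of a fraction of a percentage point gives a z value well below 1.96, which is why small changes between 200 and 500 trees should not be over-interpreted.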

The limiting factor tends to be the amount of data in each partition and, as we have seen, reducing the number of partitions increases it. In a practical “big data” problem, however, this is unlikely to be an issue.

For big data, set the number of partitions to match the available hardware – the number of nodes/cores in the cluster, along with the amount of memory available to each. Configure Weka so that each partition contains as much data as possible, consistent with it fitting into the available memory.
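The sizing rule above can be sketched as a small calculation. This is a minimal illustration under stated assumptions: `PartitionPlanner` and `suggestPartitions` are hypothetical names, not part of Weka, and `usableBytesPerCore` stands for whatever memory budget you judge a single partition can occupy once the in-memory overhead of the data is taken into account.

```java
/** Hypothetical helper for illustration; not part of Weka. */
final class PartitionPlanner {

    /**
     * Suggests a partition count: at least one partition per core, and more
     * if that is needed for each partition to fit in the memory budget.
     */
    static int suggestPartitions(long datasetBytes, int totalCores, long usableBytesPerCore) {
        if (datasetBytes <= 0 || totalCores <= 0 || usableBytesPerCore <= 0) {
            throw new IllegalArgumentException("all arguments must be positive");
        }
        // Ceiling division: the fewest partitions that still fit in memory,
        // so each partition holds as much data as possible.
        long neededForMemory = (datasetBytes + usableBytesPerCore - 1) / usableBytesPerCore;
        // Never use fewer partitions than cores, so no core sits idle.
        return Math.toIntExact(Math.max(totalCores, neededForMemory));
    }

    public static void main(String[] args) {
        long gib = 1L << 30;
        // Illustrative numbers only: 64 GiB of data, 16 cores, 2 GiB usable per core.
        System.out.println(suggestPartitions(64 * gib, 16, 2 * gib));
    }
}
```

When the data fits comfortably, the core count decides the answer. When it does not, memory does, and the partitions are processed in more than one wave.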

This content is taken from The University of Waikato online course, Advanced Data Mining with Weka.
