Prepare for the quiz

Before starting the quiz, we suggest you reproduce what Ian did in the video, using the LED24 data generator from the command-line interface.

Make a test file with 100,000 instances (you should probably precede all filenames below with an appropriate directory specification):

 java weka.datagenerators.classifiers.classification.LED24 -n 100000 -o test.arff

Make a training file with 10,000,000 instances:

 java weka.datagenerators.classifiers.classification.LED24 -n 10000000 -o train.arff
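
Before moving on, it can be worth checking that each generated file really contains the number of instances you asked for. The following is a minimal sketch, not part of Weka: count_arff_instances is a hypothetical helper that counts the lines after the @data marker, skipping blank lines and %-comments, and it assumes the files are named test.arff and train.arff in the current directory.

```shell
# Count the data rows in an ARFF file: every line after the @data marker
# that is neither blank nor a %-comment. (Hypothetical helper, not part of Weka.)
count_arff_instances() {
    awk '
        in_data && $0 !~ /^[ \t\r]*(%.*)?$/ { n++ }
        tolower($0) ~ /^[ \t]*@data/        { in_data = 1 }
        END { print n + 0 }
    ' "$1"
}

# Report the count for each generated file that exists in the current directory.
for f in test.arff train.arff; do
    if [ -f "$f" ]; then
        echo "$f: $(count_arff_instances "$f") instances"
    fi
done
```

If generation succeeded, the reported counts should match the values passed to -n. The file is read as a stream, so this check should not need much memory even for the larger file.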

Apply NaiveBayesUpdateable:

 java weka.classifiers.bayes.NaiveBayesUpdateable -t train.arff -T test.arff 

(this will take a few minutes)

Verify that Weka runs out of memory if cross-validation is attempted:

 java weka.classifiers.bayes.NaiveBayesUpdateable -t train.arff

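Cross-validation needs the whole training file loaded into memory, whereas with a separate test file NaiveBayesUpdateable is trained incrementally, one instance at a time. Weka will become unresponsive, although you will probably not get an error message.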
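
To get a feel for why memory is the bottleneck here, the sketch below makes a back-of-envelope estimate of the space needed just to hold the attribute values of a dataset in memory all at once. It rests on two assumptions that you should check against your own setup: that each instance has 25 values (24 LED24 attributes plus the class), and that each value takes 8 bytes. Object overhead is ignored, so the real footprint would be larger. estimate_dataset_bytes is a hypothetical helper, not a Weka command.

```shell
# Rough lower bound on the memory needed to hold a dataset entirely in RAM,
# assuming each attribute value occupies 8 bytes and ignoring per-object
# overhead. (Hypothetical helper for illustration only.)
estimate_dataset_bytes() {
    instances=$1
    attributes=$2
    echo $(( instances * attributes * 8 ))
}

# Assumed: 24 LED24 attributes plus the class attribute = 25 values per instance.
echo "10,000,000 instances:  $(estimate_dataset_bytes 10000000 25) bytes"
echo "100,000,000 instances: $(estimate_dataset_bytes 100000000 25) bytes"
```

Under these assumptions the 10,000,000-instance file comes to about 2 GB of raw values and the 100,000,000-instance file to about 20 GB, which you can compare with the heap available to your Java installation.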

If you feel brave, repeat the exercise with a 100,000,000-instance training file.

This article is from the free online course:

More Data Mining with Weka

The University of Waikato