CSCI 374: Homework Assignment #4
Decision Trees
Due: 11:59 PM on Friday, November 11
You can download the assignment instructions by clicking on this link.
Instructions for using GitHub for our assignments can be found on the Resources page of the class website, as well as at this link.
Example Accuracies
Here are the test set accuracies (and learning times) that my implementation achieved on the four data sets, using 75% of each data set for training and the random seed 12345. Your results might differ slightly because of implementation differences or a different random seed.
Data Set | Accuracy (Numeric = False) | Learning Time (Numeric = False) | Accuracy (Numeric = True) | Learning Time (Numeric = True) |
---|---|---|---|---|
monks1.csv | 0.9444 | 0.002 sec | N/A | N/A |
penguins.csv | 0.6744 | 0.005 sec | 0.9651 | 0.03 sec |
occupancy.csv | 0.9531 | 0.40 sec | 0.9911 | 39.05 sec |
opticalDigit.csv | 0.5680 | 0.44 sec | 0.9132 | 8.97 sec |
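
To compare your own numbers against the table, it can help to fix the split and the timing in one place. The following is a minimal sketch, not the instructor's implementation: the helper names `split_rows`, `accuracy`, and `timed` are hypothetical, and the shuffle-then-cut strategy is an assumption, so the same seed will not necessarily reproduce the exact split behind the numbers above.

```python
import random
import time


def split_rows(rows, train_fraction=0.75, seed=12345):
    """Shuffle a copy of rows with a fixed seed, then cut into train/test.

    Assumption: shuffle-then-cut. The assignment may prescribe another scheme.
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # local RNG, global state untouched
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def accuracy(predicted, actual):
    """Fraction of predictions that match the true labels."""
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


def timed(fn, *args):
    """Run fn(*args) and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start
```

For example, `train, test = split_rows(rows)` gives a repeatable 75/25 split, and `tree, seconds = timed(learn, train)` would report the learning time for a hypothetical `learn` function that builds the tree.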