Improving the Reliability of Decision Trees and Naïve Bayes Learners

ICDM Nov 2004 Improving the Reliability of Decision Trees and Naïve Bayes Learners David Lindsay and Siân Cox Computer Learning Research Centre, Royal Holloway University of London, Egham, Surrey, UK PhD Supervisors: Alex Gammerman and Volodya Vovk

Motivation and Outline Aim: To assess the improvement of the quality of probability forecasts by the VPM meta-learner applied to Naïve Bayes and C4.5 base-learners. • Define problem of probability forecasting. • Introduce resolution and reliability. • Detail methods we used for assessing reliability – focus on ERC plots. • Summarise results. • Discuss conclusions and future work.

Reliable Probability Forecasts • Pattern recognition  Probability Forecasting • Good quality forecasts should be: • Resolute: forecasts useful for ranking labels in order of likelihood of occurring • Reliable: do not lie, labels assigned with forecast ≈ p should occur with frequency ≈ p (a.k.a. ) calibrated Predicted Probability We focus on Reliability! We focus on Reliability! Empirical frequency

Methods for Assessing Quality of Probability Forecasts

Empirical Reliability Curves (ERC) ERC Plots Generated from Naïve Bayes learners’ forecasts on Abdominal Pain Medical Dataset Naïve Bayes VPM Naïve Bayes ERC Dev. Area = 0.153 ERC Dev. Area = 0.006

Learners Tested • Naïve Bayes and C4.5 Decision Tree • Meta-learners applied: • Binning • Venn Probability Machine (VPM) • Laplace (just to C4.5) • Tested on multi-class and binary data from UCI repository

Results • Notice that VPM C4.5 error rate slightly more than Binning C4.5 0.1% • But… VPM performs best in improving reliability! • And… VPM improves resolution  overall loss is improved

Conclusions • VPM is slow, but gives good improvement in reliability and resolution. • ERC nice visualisation and measure of reliability individually. • Reliability should not be overlooked, classification accuracy is not always useful to look at!

Current and Future Work • Have tested much larger set of meta-learners: • ERC re-calibration • Bagging • Boosting • Find Best Weights (FBW) • Tested other base learners: • SVM • Neural Networks • K-Nearest Neighbours • Bayesian Belief Networks • Developed extension to WEKA

Bibliography • A. P. Dawid. Calibration-based empirical probability (with discussion). Annals of Statistics, 13:1251-1285, 1985 • V. Vovk, G. Shafer, and I. Nouretdinov. Self-calibrating probability forecasting. In Advances in Neural Information Processing Systems 16, 2003 • D. Lindsay. Visualising and improving reliability – a machine learning perspective. CLRC-TR-04-01, Royal Holloway University, England, 2004 • A. H. Murphy. A new vector parturition of the probability score. Journal of Applied Meteorology, 12:595-600 1973

VPM Compared With Underlying Naïve Bayes Learner Key: Predcited = underlined, Actual class =

Improving the Reliability of Decision Trees and Naïve Bayes Learners