A comparison of ANN, Naïve Bayes, and Decision Tree for the purpose of spam filtering
Kaashyapee Jha
ECE/CS 539
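Naïve Bayes Classifier
Bayes Theorem:
$$P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)} \tag{1}$$
Method: with $S$ the event that an email is spam and $D$ the email, made up of words $w_1, \dots, w_n$,
$$P(S \mid D) = \frac{P(D \mid S)\,P(S)}{P(D)} \tag{2}$$
$$P(S \mid D) = \frac{P(S)\prod_{i} P(w_i \mid S)}{P(S)\prod_{i} P(w_i \mid S) + P(\neg S)\prod_{i} P(w_i \mid \neg S)} \tag{3}$$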
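As a rough illustration of how Bayes' theorem can be turned into a spam score, here is a minimal sketch in Python. It assumes each email is already a list of words, treats words as independent given the class, and adds Laplace (add-one) smoothing and log-space arithmetic, neither of which is stated on the slides. The names `train`, `spam_probability`, and the toy emails are hypothetical, and the sketch expects at least one spam and one non-spam training email.

```python
import math
from collections import Counter

def train(emails):
    """Count words per class. `emails` is a list of (words, is_spam) pairs."""
    word_counts = {True: Counter(), False: Counter()}
    class_counts = Counter()
    for words, is_spam in emails:
        class_counts[is_spam] += 1
        word_counts[is_spam].update(words)
    vocab = set(word_counts[True]) | set(word_counts[False])
    return {"word_counts": word_counts, "class_counts": class_counts, "vocab": vocab}

def spam_probability(model, words):
    """Estimate P(spam | words) under the naive independence assumption."""
    total = sum(model["class_counts"].values())
    log_joint = {}
    for label in (True, False):
        counts = model["word_counts"][label]
        # Laplace smoothing (an assumption): add 1 to every word count.
        denom = sum(counts.values()) + len(model["vocab"])
        lp = math.log(model["class_counts"][label] / total)  # log prior
        for w in words:
            if w in model["vocab"]:  # words never seen in training are skipped
                lp += math.log((counts[w] + 1) / denom)
        log_joint[label] = lp
    # Normalise the two joint scores in a numerically stable way.
    m = max(log_joint.values())
    spam = math.exp(log_joint[True] - m)
    ham = math.exp(log_joint[False] - m)
    return spam / (spam + ham)

# Hypothetical toy training set, for illustration only.
toy_emails = [
    (["win", "free", "prize"], True),
    (["free", "money", "now"], True),
    (["meeting", "at", "noon"], False),
    (["lunch", "at", "noon"], False),
]
model = train(toy_emails)
print(spam_probability(model, ["free", "prize"]))
```

Working in log space avoids underflow when an email contains many words, since the product of many small probabilities quickly rounds to zero.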
Preprocessing
• Stop list: ignore trivial words such as {or, and, but, a, an, the, is, in, for}
• Ignore words that are very uncommon
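The two preprocessing rules above could be sketched as follows. The stop list is the one given on the slide; the tokenizer, the function names, and the `min_count` threshold for "very uncommon" are assumptions made for illustration, since the slides do not say how rarity was measured.

```python
import re
from collections import Counter

# Stop list taken from the slide.
STOP_WORDS = {"or", "and", "but", "a", "an", "the", "is", "in", "for"}

def tokenize(text):
    """Lowercase the text and split it into simple word tokens (assumed scheme)."""
    return re.findall(r"[a-z0-9']+", text.lower())

def preprocess(corpus, min_count=2):
    """Drop stop words, then drop words seen fewer than `min_count` times overall.

    `min_count` is a hypothetical cutoff, not a value from the slides.
    """
    tokenized = [[w for w in tokenize(text) if w not in STOP_WORDS] for text in corpus]
    counts = Counter(w for doc in tokenized for w in doc)
    return [[w for w in doc if counts[w] >= min_count] for doc in tokenized]

corpus = ["The prize is free", "A free prize for you", "Lunch in an hour"]
print(preprocess(corpus))
```

Removing rare words keeps the vocabulary small and avoids estimating probabilities from one or two occurrences, at the cost of discarding some potentially informative tokens.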
Weakness of Naïve Bayes Classifier
• Example: "hey man are you interested in sports? then email me at imcool@gmail.com"
• Spammers can avoid the words that most often appear in spam, so a message like this one may not be flagged
Work Ahead
• Finish implementing and testing Decision Tree
• More preprocessing of the data
• Perform more trials with different ratios of training set to testing set
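One possible way to organise trials with different training/testing ratios is sketched below. Every name here (`split`, `accuracy`, `run_trials`, `fit_majority`) is hypothetical, and the majority-class classifier is only a placeholder standing in for whichever of the three models is being evaluated. Ratios are assumed to leave at least one example in the test set.

```python
import random

def split(data, train_ratio, seed=0):
    """Shuffle a copy of `data` reproducibly and cut it at `train_ratio`."""
    shuffled = data[:]
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_ratio)
    return shuffled[:cut], shuffled[cut:]

def accuracy(predict, test_set):
    """Fraction of (features, label) pairs that `predict` labels correctly."""
    correct = sum(1 for x, y in test_set if predict(x) == y)
    return correct / len(test_set)

def run_trials(data, ratios, fit, seed=0):
    """Train and test once per ratio; `fit` maps a training set to a predict function."""
    results = {}
    for ratio in ratios:
        train_set, test_set = split(data, ratio, seed)
        results[ratio] = accuracy(fit(train_set), test_set)
    return results

def fit_majority(train_set):
    """Placeholder classifier: always predicts the most common training label."""
    labels = [y for _, y in train_set]
    majority = max(set(labels), key=labels.count)
    return lambda x: majority

# Hypothetical data: (email_id, is_spam) pairs.
data = [(i, i % 2 == 0) for i in range(8)]
print(run_trials(data, [0.5, 0.75], fit_majority))
```

Fixing the seed keeps the split identical across classifiers, so differences in accuracy at a given ratio come from the model rather than from the shuffle.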