The e-mail's header session usually contains important attributes such as e-mail title, sender's name, sender's e-mail address, sending date, which are helpful to classication of e-mails. In this paper, we apply decision tree data mining technique to header's basic attributes to analyze the association rules of spam e-mails and propose an efficient spam ¯ltering method to accurately identify spam and legitimate e-mails. According to the experiment of applying numerous Chinese e-mails to our spam ¯ltering method, we obtain the following excellent datums: the Accuracy is 96.5%, the Precision is 96.67%, and the Re-call is 96.3%. Thus, the method proposed in this paper can e±ciently identify the spam e-mails by checking only the header sessions, which can reduce the cost for calculation.
International Journal of Network Security, 8(3), 334-343