Email classification often uses the Vector Space Model (VSM) to represent emails. This model relies only on the frequencies of the words that appear in an email and ignores the email's structure, so it cannot represent the email accurately. To overcome this shortcoming, this paper applies a glue measure to extract n-grams, uses the extracted n-grams to weight the words, and designs an email classification system on that basis. Because the glue measure takes the structure of the email into account, experiments show that the new method improves classification precision.
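As an illustration of the idea, the following is a minimal sketch of glue-based word weighting. The abstract does not say which glue measure is used, so the sketch assumes Symmetric Conditional Probability (SCP) over bigrams, one common choice. The function names `scp_bigram_glue` and `glue_weighted_tf`, the `threshold` parameter, and the additive boosting rule are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def scp_bigram_glue(tokens):
    """Return {(w1, w2): glue} for every adjacent word pair in tokens.

    Assumption: glue is SCP, p(w1, w2)^2 / (p(w1) * p(w2)), with all
    probabilities estimated as counts divided by the number of tokens.
    """
    n = len(tokens)
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    glue = {}
    for (w1, w2), count in bigrams.items():
        p_xy = count / n
        p_x = unigrams[w1] / n
        p_y = unigrams[w2] / n
        glue[(w1, w2)] = p_xy ** 2 / (p_x * p_y)
    return glue


def glue_weighted_tf(tokens, threshold=0.9):
    """Term frequencies, boosted for words inside high-glue bigrams.

    Hypothetical weighting rule: start from the plain VSM term frequency
    and add the glue of each bigram at or above the threshold to both
    of its words.
    """
    weights = {word: float(count) for word, count in Counter(tokens).items()}
    for (w1, w2), g in scp_bigram_glue(tokens).items():
        if g >= threshold:
            weights[w1] += g
            weights[w2] += g
    return weights


if __name__ == "__main__":
    email = "free money now free money today".split()
    print(scp_bigram_glue(email))
    print(glue_weighted_tf(email))
```

In this sketch, words that always occur together (such as "free money") receive a higher glue value than pairs that co-occur only by chance, so their weights rise above the plain term frequency that the VSM would assign.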
Computer Simulation