Skip to Main Content
Unwanted and malicious messages dominate email traffic and pose a great threat to the utility of email communications. Reputation systems have been getting momentum as the solution. Such systems extract email senders behavior data based on global sending distribution, analyze them and assign a value of trust to each IP address sending email messages. We build two models for the classification purpose. One is based on support vector machines (SVM) and the other is random forests(RF). Experimental results show that either classifier is effective. RF is slightly more accurate, but more expensive in terms of both time and space. SVM produces similar accuracy in a much faster manner if given modeling parameters. These classifiers can contribute to a reputation system as one source of analysis and increase its accuracy.
Date of Conference: Nov. 30 2008-Dec. 4 2008