Towards Eradication of SPAM: A Study on Intelligent Adaptive SPAM Filters



Similar documents
Towards Eradication of SPAM: A Study on Intelligent Adaptive SPAM Filters

Knowledge Management Strategic Alignment in the Banking Sector at the Gulf Cooperation Council (GCC) Countries

International Journal of Research in Advent Technology Available Online at:

About this documentation

Top 25 Marketing Terms You Should Know. Marketing from Constant Contact

Gordon State College. Spam Firewall. User Guide

Adjust Webmail Spam Settings

How To Extract Content From Thai Websites

OIS. Update on the anti spam system at CERN. Pawel Grzywaczewski, CERN IT/OIS HEPIX fall 2010

MINIMIZING THE TIME OF SPAM MAIL DETECTION BY RELOCATING FILTERING SYSTEM TO THE SENDER MAIL SERVER

Lan, Mingjun and Zhou, Wanlei 2005, Spam filtering based on preference ranking, in Fifth International Conference on Computer and Information

Detecting spam using social networking concepts Honours Project COMP4905 Carleton University Terrence Chiu

Savita Teli 1, Santoshkumar Biradar 2

Webmail Friends & Exceptions Guide

BARRACUDA. N e t w o r k s SPAM FIREWALL 600

escan Anti-Spam White Paper

Antispam Security Best Practices

The Guardian Digital Control and Policy Enforcement Center

Solutions IT Ltd Virus and Antispam filtering solutions

Adaptive Filtering of SPAM

Junk Filtering System. User Manual. Copyright Corvigo, Inc All Rights Reserved Rev. C

Top 40 Marketing Terms You Should Know

eprism Security Appliance 6.0 Intercept Anti-Spam Quick Start Guide

BrightVisions Spam Filter User Guide

Barracuda Spam Firewall Users Guide. How to Download, Review and Manage Spam

How to Use Red Condor Spam Filtering

AN EFFECTIVE SPAM FILTERING FOR DYNAMIC MAIL MANAGEMENT SYSTEM

Spam Detection using Adaptive Neural Networks: Adaptive Resonance Theory

UNIVERSITY OF CALIFORNIA RIVERSIDE. Fighting Spam, Phishing and Fraud


Western University Spam Firewall User s Guide

Groundbreaking Technology Redefines Spam Prevention. Analysis of a New High-Accuracy Method for Catching Spam

Marketing Glossary of Terms

EnterGroup offers multiple spam fighting technologies so that you can pick and choose one or more that are right for you.

HOW TO: Use the UWITC Barracuda Spam Filter System

The Network Box Anti-Spam Solution

HOSTED EXCHANGE ADVANCED SECURITY. Hosted Exchange Advanced Security Feb 2013

How To Stop Spam From Being A Problem

Groundbreaking Technology Redefines Spam Prevention. Analysis of a New High-Accuracy Method for Catching Spam

How to Use the Greymail Spam Filter

ModusMail Software Instructions.

Software Engineering 4C03 SPAM

WEB QUARANTINE USER GUIDE VERSION 4.3

LOGGING IN. Please take a moment to add this page to the favorites in your preferred Web Browser. (

Evaluation of Anti-spam Method Combining Bayesian Filtering and Strong Challenge and Response

Personal Spam Solution Overview

An Overview of Spam Blocking Techniques

Barracuda Spam Firewall User s Guide

Delivery Simplified White Paper

Journal of Information Technology Impact

Content Filters A WORD TO THE WISE WHITE PAPER BY LAURA ATKINS, CO- FOUNDER

Who will win the battle - Spammers or Service Providers?

Security and Spam Prevention. March 25, 2004 Tim Faltemier Saurabh Jain

Spam Filtering Methods for Filtering

Government of Canada Managed Security Service (GCMSS) Annex A-5: Statement of Work - Antispam

Spam Filter Message Center. User Guide

Smart E-Marketer s Guide

REVIEW AND ANALYSIS OF SPAM BLOCKING APPLICATIONS

EXPLANATION OF COMMON SPAM FILTERING TECHNIQUES WHITEPAPER

FILTERING FAQ

Opus One PAGE 1 1 COMPARING INDUSTRY-LEADING ANTI-SPAM SERVICES RESULTS FROM TWELVE MONTHS OF TESTING INTRODUCTION TEST METHODOLOGY

Anti Spamming Techniques

How To Report Spam On An Ipo For A County

Using the Barracuda Spam Blocker

Barracuda Spam Firewall User s Guide

Dealing with spam mail

SURVEY PAPER ON INTELLIGENT SYSTEM FOR TEXT AND IMAGE SPAM FILTERING Amol H. Malge 1, Dr. S. M. Chaware 2

- Spam Spam Firewall How Does the Spam Firewall Work? Getting Started username Create New Password

Migration Manual (For Outlook Express 6)

WildBlue Spam Filtering

DEFENDER SERVICES

Directions on How to Set up Your Spam Settings in your *for those individuals who use the district's web-based (webmail)

ESPC Best Practices Guide

Bayesian Learning Cleansing. In its original meaning, spam was associated with a canned meat from

Advice on Using Dada Mail

Managing Spam With Outlook Express

Bayesian Spam Filtering

CONFIGURING FUS ANTI-SPAM

Using the Barracuda Spam Firewall to Filter Your s

How To Filter Spam Image From A Picture By Color Or Color

FastNetSecurity SpamGuard Spam Filter How-To

Navigating Through SpamTitan

Understanding & Improving. Deliverability

Personalised package Details

Knowledge Base. Google Apps User Manual. 24 Pages. Zeumic Pty Ltd. PO Box 44 Kew, VIC Australia 3101

was made, the message will sit in the BoxTrapper queue for 15 days (a selectable number, by you) for your potential review in case a mistake was

Achieve more with less

Barracuda Spam Firewall User s Guide

HPL Barracuda Spam/Virus Firewall

Spam How Malaysian Users Deal with It?

Comparing Industry-Leading Anti-Spam Services

How To Manage Your Quarantine On A Blackberry.Com

Intelligent Word-Based Spam Filter Detection Using Multi-Neural Networks

KUMC Spam Firewall: Barracuda Instructions

Spam Filtering and Removing Spam Content from Massage by Using Naive Bayesian

Messaging Firewall. W h i t e p a p e r. w w w. c m s c o n n e c t. c o m

Protecting your business from spam

Evios. A Managed, Enterprise Appliance for Identifying and Eliminating Spam

Adaption of Statistical Filtering Techniques

China s Anti-Spam Works

Transcription:

Towards Eradication of SPAM: A Study on Intelligent Adaptive SPAM Filters Tarek Hassan (B.Sc. Computer Science, Egypt) This thesis is presented for the degree of Master of Computer Science Murdoch University, Western Australia 2006

I declare that this thesis is my own account of my research and contains as its main content work, which has not previously been submitted for a degree at any tertiary educational institution. Tarek Hassan

Abstract ABSTRACT As the massive increase of electronic mail (email) usage continues, SPAM (unsolicited bulk email), has continued to grow because it is a very inexpensive method of advertising. These unwanted emails can cause a serious problem by filling up the email inbox and thereby leaving no space for legitimate emails to pass through. Currently the only defense against SPAM is the use of SPAM filters. A novel SPAM filter GetEmail5 along with the design rationale, is described in this thesis. To test the efficacy of GetEmail5 SPAM filter, an experimental setup was created and a commercial bulk email program was used to send SPAM and non-spam emails to test the new SPAM filter. GetEmail5 s efficiency and ability to detect SPAM was compared against two highly ranked commercial SPAM filters on different sets of emails, these included all SPAM, non-spam, and mixed emails, also text and HTML emails. The results showed the superiority of GetEmail5 compared to the two commercial SPAM filters in detecting SPAM emails and reducing the user s involvement in categorizing the incoming emails. This thesis demonstrates the design rationale for GetEmail5 and also its greater effectiveness in comparison with the commercial SPAM filters tested. i

Table of Contents TABLE OF CONTENTS Page No. ABSTRACT TABLE OF CONTENTS ACKNOWLEDGEMENTS PUBLICATIONS i ii v vi CHAPTER 1. INTRODUCTION 1 1.1 Introduction 1 1.2 Definition of SPAM 2 1.3 Sources of SPAM 4 1.4 Growth of SPAM in Australia 5 1.5 Nuisance and costs of SPAM 6 1.6 Content of SPAM 6 1.7 Transmission of SPAM emails 7 1.8 Why is SPAM so common? 9 1.9 How to stop SPAM 11 1.10 Amis 12 1.11 Thesis Overview 12 CHAPTER 2. LITERATURE REVIEW 14 2.1 Introduction 14 2.2 Dealing with SPAM 15 2.3 SPAM filters 18 2.3.1 Black List Filter 20 2.3.2 White List Filter 21 2.3.3 Bayesian Filtering (Content Focus) 21 2.3.4 Fingerprints Filter 22 2.3.5 Password Filter 23 2.3.6 Challenge/Response Filter 24 2.3.7 Community-base Filter 24 ii

Table of Contents 2.3.8 Mobile Agent 25 2.3.9 Encryption and Trust 26 2.3.10 Copyright Tokens 26 2.4 Legal Action against SPAM 30 2.4.1 Complaining to Spammers ISPs 31 2.4.2 The use of an Opt-in Mechanism 31 2.4.3 SPAM Litigation 32 CHAPTER 3. DESIGN OF GETEMAIL5 SPAM FILTER 36 3.1 Introduction 36 3.2 History of Bayesian Probability 37 3.3 How a Bayesian Filter Works? 39 3.4 Advantages of the Bayesian Filtering 41 3.5 Design of the Proposed Filter 42 3.5.1 Whitelist Filter 46 3.5.2 Blacklist Filter 46 3.5.3 Bayesian Filter Subject to HAM or SPAM 47 3.6 Implementation of the GetEmail5 SPAM Filter 48 3.7 Operation of the GetEmail5 Filter 52 CHAPTER 4. RESEARCH METHODOLOGY 56 4.1 Introduction 56 4.2 Preparing the Filter Requirements 56 4.2.1 Creation of a new Email Account 56 4.2.2 Obtaining a Bulk Email Sender 58 4.2.3 Collecting SPAM Emails (Data set) 58 4.3 Sending SPAM Emails 59 4.4 Receive and Analyse the Emails 63 4.5 Evaluation 63 4.5.1 First Experiment (Mixed Emails) 67 4.5.2 Second Experiment (80% SPAM and 20% non-spam Emails) 67 iii

Table of Contents 4.5.3 Third Experiment (100% Plain Text SPAM Emails) 68 4.5.4 Fourth Experiment (100% HTML SPAM Emails) 68 CHAPTER 5. RESULTS AND DISCUSSION 69 5.1 Introduction 69 5.2 Initial Evaluation of the GetEmail5 SPAM Filter 69 5.3 First Experiment 71 5.4 Second Experiment (80% SPAM and 20% non-spam Emails) 75 5.5 Third Experiment (100% Plain Text SPAM Emails) 79 5.6 Fourth Experiment (100% HTML SPAM Emails) 82 5.7 Analysis of the five Key Performance Indicators (KPI) 85 5.7.1 SPAM Detection 85 5.7.2 Filtering Time 86 5.7.3 False Positive 87 5.7.4 False Negative 88 5.7.5 User Involvement 89 5.8 Limitations of this Research 91 CHAPTER 6. CONCLUSIONS AND FURTHER WORK 93 6.1 Conclusions 93 6.2 Recommendations for Further Work 94 REFERENCES 95 APPENDIX GETEMAIL5 DOCUMENTATION iv

Acknowledgements ACKNOWLEDGEMENTS The work presented in this thesis was carried out under the supervision of Mr. Peter Cole and Associate Professor Lance C.C. Fung, to whom I extend my gratitude for their valuable and friendly guidance, encouragement and helpful suggestions throughout the progress of the research and in the preparation of this thesis. I would like to extend my special thanks and indebtedness to Kashif Saleem for his help and support. My gratitude and heartfelt thanks to Dr. Karin Strehlow and Anthony Horton, for their support, friendship, help and encouragement, God Bless. Finally, my thanks and gratitude go to my wife, Eman and daughter, Amira who have been understanding, patient, supportive and making duaa(supplication) for the success of my study. Thank you also to my parents, sisters and brother for their encouragement and support during my stay in Australia. v

Publications PUBLICATIONS Hassan, T., Cole, P. and Fung C.C. (2006), An Intelligent SPAM filter GetEmail5, Proceedings of the 2 nd IEEE International Conference on Cybernetics and Intelligent Systems (CIS 2006), Bangkok Thailand, 7-9 June, 2006, (ISBN 1-4244-0023-6) pp. 86-90. Hassan T., Cole P. and Fung C.C. (2005), Development and evaluation of an Intelligent SPAM filter, Proceedings of the 6 th Postgraduate Electrical and Computing Symposium (PEECS 2005), (ISBN 0-7298-06090-X), Edith Cowan University, Perth, WA, pp. 142-145 Hassan T., Cole P., and Fung C.C., (2004) Towards Eradication of Spam: A Study on Intelligent Adaptive Spam Filter, Proceedings of the Fifth Postgraduate Electrical Engineering and Computing Symposium (PEECS 2004), 28 th September 2004, Perth, Western Australia, pp 203-206. vi