Spam filtering. Peter Likarish Based on slides by EJ Jung 11/03/10
|
|
|
- Marjory Hamilton
- 10 years ago
- Views:
Transcription
1 Spam filtering Peter Likarish Based on slides by EJ Jung 11/03/10
2 What is spam? An unsolicited equivalent to Direct Mail in postal service UCE (unsolicited commercial ) UBE (unsolicited bulk ) 82% of US in 2004 [MessageLabs 2004] Etymology Monty Python s spam episode ham = not spam
3 Historical events May 3 rd, 1978 Sent to Arpanet west coast users Invitation to see new models of DEC-20 Violated ARPA s user policy Vendors were chastised & ceased (for then) March 31 st, 1993 first UUCP spam Unix-to-Unix Copy April, 1994 Green Card Lottery spam Message posted to EVERY UUCP group
4 Who s the victim? Internet Service Provider (ISP) much higher bandwidth usage service providers need to support larger disk space need to deploy anti-spam measures Businesses time spent on spam damage by spams and viruses and worms Us?
5 Does spam bother you? Potentially malicious viruses, worms phishing s Fill up the disk space Does it bother you more than DMs? if so, why?
6 How much is spam? >97% filtered! Source: Microsoft Security Intelligence Report 09 ( 5b085dc0a4cd&displaylang=en)
7 Other numbers Spamhaus >90% Postini (Google owned) >90%-95% Consensus: >90% of all is spam
8 How many messages? Is (was) exponentially increasing
9 Cost of spam Very back of the envelope 100,000,000,000 messages/month Recipient $0.025 $2.5 billion/month for recipients Sender $ for sender to generate $1 million/month for sender Profit? 1:200,000 = 500,000 sales. $2 is break-even point
10 Legal actions CAN-SPAM Act of 2003 UBE must be labeled Must have opt-out option Some people went to jail Some ISPs have been shut down
11 Some ways to blocking spam Network-level DNS blackholes/blacklisting Edge filtering Content-level Rule-based checksum Machine learning Probabilistic SVM Cost-based approach
12 DNS filtering Blacklist = list of known spammers IP addresses/domains/senders Top five spamming ISPs account for > 80% of spam traffic! Whitelist = list of known good senders IP addresses/domains/senders Further reading: Zhang et al. Highly Predictive Blacklisting (
13 How much is spam? >97% filtered! Source: Microsoft Security Intelligence Report 09 ( 5b085dc0a4cd&displaylang=en)
14 Edge filtering Filtering at edges of network. Ingress: filtering traffic entering your network Egress: filtering traffic exiting your network IP Sender Attachments Mailstream Providers Content
15 Content filtering Rule-based regular expressions on headers, contents, blacklist Hash/checksum database Bayesian probability/networks probabilistic approach Support Vector Machine High dimension hyper-plane!
16 Goal Given new message do I deliver or trash? Spam I ve seen New? S?? S? S
17 Rule-based filtering Rules are expressed as regular expressions (or SNORT traces) If contains refinancing & mortgage, trash If contains non-english alphabets, trash If attachment type = executable, trash Limitations Difficult to write rules general enough to catch spam but not legit often miss obvious tricks misspelling on purpose
18 Checksums Fuzzy hashing Goal: resilient to spelling changes Does it look like spam I ve already seen?? S?? S? S
19 Fuzzy vs. non-fuzzy hashing Traditional hashing Same: Fixed length output Different: Unique input = (nearly) unique output 1 bit change in input = COMPLETELY different hash Fuzzy hashing Same: Fixed length output Different: mostly the same = SAME value
20 Fuzzy hashing example Input Viagra Vi4gr4 Fuzzy hash Output DEF8F32 DE28F3C V!aGra DGF8E32 V i a g r a DAF8W3B Output = fixed length. Look at distance between outputs. Shorter distance = more similar. Input = message
21 Bayesian Theorem Conditional probability P( A B) = P( A! B) P( B) Bayes Theorem P ( A B) = P( B A) P( A) P( B)
22 Bayesian in Spam [Graham 02, Sahami 98] Given a word stock in an , what is the probability of this being spam? we can compute these from the sample set of s P ( spam stock) = P( stock spam) P( spam) P( stock)
23 More words Does more words mean higher probability? related words such as stock, jackpot, blue chip, not-related words such as stock, diet, n words require O(2 n ) probabilities to compute Naïve Bayesian filter assumes the independence between words O(n) probabilities to compute P ( spam / stock, jackpot) = P( stock, jackpot P( stock, spam) P( spam) jackpot)
24 Support Vector Machine Given records, plot them in n-dimension spaces and find a plane that divides the plots the best.
25 Feature selection How do we choose which tokens to examine? Words alone are not sufficient phrases, punctuation, sender s address, IP address, domain name, URLs, images, tags, More sophisticated conditions attachment? contains images?
26 Hybrid approaches SpamAssassin open-source combination of rule-based and Bayesian commonly run at SMTP server can be applied for an individual user Vipul s Razor Collaborative, distributed, checksum (statistical & randomized signature-based) anti-spam solution Commercial version: Cloudmark, Inc
27 Charging for If you can t win the game, change the rules Would you pay $ to send an ? If you sent 50 s/day: ~$46/year Spammers: 100,000,000,000 * $.0025 = $250,000,000 dollars Assuming adoption rate of 1:200,000 Break even = $1,250
Spam Filtering Methods for Email Filtering
Spam Filtering Methods for Email Filtering Akshay P. Gulhane Final year B.E. (CSE) E-mail: [email protected] Sakshi Gudadhe Third year B.E. (CSE) E-mail: [email protected] Shraddha A.
How To Stop Spam From Being A Problem
Solutions to Spam simple analysis of solutions to spam Thesis Submitted to Prof. Dr. Eduard Heindl on E-business technology in partial fulfilment for the degree of Master of Science in Business Consulting
An Overview of Spam Blocking Techniques
An Overview of Spam Blocking Techniques Recent analyst estimates indicate that over 60 percent of the world s email is unsolicited email, or spam. Spam is no longer just a simple annoyance. Spam has now
eprism Email Security Appliance 6.0 Intercept Anti-Spam Quick Start Guide
eprism Email Security Appliance 6.0 Intercept Anti-Spam Quick Start Guide This guide is designed to help the administrator configure the eprism Intercept Anti-Spam engine to provide a strong spam protection
Groundbreaking Technology Redefines Spam Prevention. Analysis of a New High-Accuracy Method for Catching Spam
Groundbreaking Technology Redefines Spam Prevention Analysis of a New High-Accuracy Method for Catching Spam October 2007 Introduction Today, numerous companies offer anti-spam solutions. Most techniques
COMBATING SPAM. Best Practices OVERVIEW. White Paper. March 2007
COMBATING SPAM Best Practices March 2007 OVERVIEW Spam, Spam, More Spam and Now Spyware, Fraud and Forgery Spam used to be just annoying, but today its impact on an organization can be costly in many different
ETH Zürich - Mail Filtering Service
Eidgenössische Technische Hochschule Zürich Swiss Federal Institute of Technology Zurich Informatikdienste / IT-Services ETH Zürich - Mail Filtering Service (TERENA 2009) 09 Dec 2009 - D. McLaughlin ([email protected])
Adaptive Filtering of SPAM
Adaptive Filtering of SPAM L. Pelletier, J. Almhana, V. Choulakian GRETI, University of Moncton Moncton, N.B.,Canada E1A 3E9 {elp6880, almhanaj, choulav}@umoncton.ca Abstract In this paper, we present
Spam, Spam and More Spam. Spammers: Cost to send
Spam, Spam and More Spam cs5480/cs6480 Matthew J. Probst *with some slides/graphics adapted from J.F Kurose and K.W. Ross Spammers: Cost to send Assuming a $10/mo dialup account: 13.4 million messages
Evios. A Managed, Enterprise Appliance for Identifying and Eliminating Spam
Evios A Managed, Enterprise Appliance for Identifying and Eliminating Spam Introduction Electronic messaging (e-mail) is a mission-critical business tool that has been compromised by the proliferation
Do you need to... Do you need to...
TM Guards your Email. Kills Spam and Viruses. Do you need to... Do you need to... Scan your e-mail traffic for Viruses? Scan your e-mail traffic for Viruses? Reduce time wasted dealing with Spam? Reduce
Commtouch RPD Technology. Network Based Protection Against Email-Borne Threats
Network Based Protection Against Email-Borne Threats Fighting Spam, Phishing and Malware Spam, phishing and email-borne malware such as viruses and worms are most often released in large quantities in
A Monitor Tool for Anti-spam Mechanisms and Spammers Behavior
A Monitor Tool for Anti-spam Mechanisms and Spammers Behavior Danilo Michalczuk Taveira and Otto Carlos Muniz Bandeira Duarte UFRJ - PEE/COPPE/GTA - DEL/POLI P.O. Box 6854-2945-97, Rio de Janeiro, RJ,
Intercept Anti-Spam Quick Start Guide
Intercept Anti-Spam Quick Start Guide Software Version: 6.5.2 Date: 5/24/07 PREFACE...3 PRODUCT DOCUMENTATION...3 CONVENTIONS...3 CONTACTING TECHNICAL SUPPORT...4 COPYRIGHT INFORMATION...4 OVERVIEW...5
The spam economy: the convergent spam and virus threats
The spam economy: the convergent spam and virus threats A Sophos whitepaper May 2005 SUMMARY Spammers, virus writers and hackers were once distinct communities with distinct motivations. However, the success
Being labeled as a spammer will drive your customers way, ruin your business, and can even get you a big fine or a jail sentence!
Getting Your Email Delivered Beware: If you send out spam mail (unsolicited emails), whether it be knowingly or not, you will put your business and your reputation at risk. Consequences for spamming have
Government of Canada Managed Security Service (GCMSS) Annex A-5: Statement of Work - Antispam
Government of Canada Managed Security Service (GCMSS) Date: June 8, 2012 TABLE OF CONTENTS 1 ANTISPAM... 1 1.1 QUALITY OF SERVICE...1 1.2 DETECTION AND RESPONSE...1 1.3 MESSAGE HANDLING...2 1.4 CONFIGURATION...2
How To Prevent Spam From Being Filtered Out Of Your Email Program
Active Carrot - Avoiding Spam Filters Table of Contents What is Spam?... 3 How Spam Filters Work... 3 Avoid these common mistakes... 3 Preventing False Abuse Reports... 4 How Abuse Reports Work... 4 Reasons
Who will win the battle - Spammers or Service Providers?
Who will win the battle - Spammers or Service Providers? Pranaya Krishna. E* Spam Analyst and Digital Evidence Analyst, TATA Consultancy Services Ltd. ([email protected]) Abstract Spam is abuse
Bayesian Spam Filtering
Bayesian Spam Filtering Ahmed Obied Department of Computer Science University of Calgary [email protected] http://www.cpsc.ucalgary.ca/~amaobied Abstract. With the enormous amount of spam messages propagating
Savita Teli 1, Santoshkumar Biradar 2
Effective Spam Detection Method for Email Savita Teli 1, Santoshkumar Biradar 2 1 (Student, Dept of Computer Engg, Dr. D. Y. Patil College of Engg, Ambi, University of Pune, M.S, India) 2 (Asst. Proff,
How to keep spam off your network
What features to look for in anti-spam technology A buyers guide to anti-spam software, this white paper highlights the key features to look for in anti-spam software and why. GFI Software www.gfi.com
ModusMail Software Instructions.
ModusMail Software Instructions. Table of Contents Basic Quarantine Report Information. 2 Starting A WebMail Session. 3 WebMail Interface. 4 WebMail Setting overview (See Settings Interface).. 5 Account
Spamfilter Relay Mailserver
Spamfilter Relay Mailserver Mark McSweeney CentraLUG, February 1, 2010 Overview Scope Little bit about me Why I built the spamfilter Deployment environment Spamfilter details Tuning and maintainance Other
Anti Spam Best Practices
39 Anti Spam Best Practices Anti Spam Engine: Time-Tested Scanning An IceWarp White Paper October 2008 www.icewarp.com 40 Background The proliferation of spam will increase. That is a fact. Secure Computing
How to keep spam off your network
GFI White Paper How to keep spam off your network What features to look for in anti-spam technology A buyer s guide to anti-spam software, this white paper highlights the key features to look for in anti-spam
Ipswitch IMail Server with Integrated Technology
Ipswitch IMail Server with Integrated Technology As spammers grow in their cleverness, their means of inundating your life with spam continues to grow very ingeniously. The majority of spam messages these
Recurrent Patterns Detection Technology. White Paper
SeCure your Network Recurrent Patterns Detection Technology White Paper January, 2007 Powered by RPD Technology Network Based Protection against Email-Borne Threats Spam, Phishing and email-borne Malware
Setting up Microsoft Outlook to reject unsolicited email (UCE or Spam )
Reference : USER 191 Issue date : January 2004 Updated : January 2008 Classification : Staff Authors : Matt Vernon, Richard Rogers Setting up Microsoft Outlook to reject unsolicited email (UCE or Spam
Email Migration Project Plan for Cisco Cloud Email Security
Sales Tool Email Migration Project Plan for Cisco Cloud Email Security 2014 Cisco and/or its affiliates. All rights reserv ed. This document is Cisco Conf idential. For Channel Partner use only. Not f
Handling Unsolicited Commercial Email (UCE) or spam using Microsoft Outlook at Staffordshire University
Reference : USER 190 Issue date : January 2004 Revised : November 2007 Classification : Staff Originator : Richard Rogers Handling Unsolicited Commercial Email (UCE) or spam using Microsoft Outlook at
IMPROVING SPAM EMAIL FILTERING EFFICIENCY USING BAYESIAN BACKWARD APPROACH PROJECT
IMPROVING SPAM EMAIL FILTERING EFFICIENCY USING BAYESIAN BACKWARD APPROACH PROJECT M.SHESHIKALA Assistant Professor, SREC Engineering College,Warangal Email: [email protected], Abstract- Unethical
AntiSpam QuickStart Guide
IceWarp Server AntiSpam QuickStart Guide Version 10 Printed on 28 September, 2009 i Contents IceWarp Server AntiSpam Quick Start 3 Introduction... 3 How it works... 3 AntiSpam Templates... 4 General...
Email Marketing 201. How a SPAM Filter Works. Craig Stouffer Pinpointe On-Demand [email protected] (408) 834-7577 x125
Email Marketing 201 How a SPAM Filter Works Craig Stouffer Pinpointe On-Demand [email protected] (408) 834-7577 x125 Mark Feldman NetProspexVP Marketing [email protected] (781) 290-5714 www.twitter.com/pinpointe
Emails and anti-spam Page 1
Emails and anti-spam Page 1 As the spammers become increasing aggressive more and more legit emails get banned as spam. When you send emails from your webcrm system, we use the webcrm servers to send emails
Antispam Security Best Practices
Antispam Security Best Practices First, the bad news. In the war between spammers and legitimate mail users, spammers are winning, and will continue to do so for the foreseeable future. The cost for spammers
Email. Daniel Zappala. CS 460 Computer Networking Brigham Young University
Email Daniel Zappala CS 460 Computer Networking Brigham Young University How Email Works 3/25 Major Components user agents POP, IMAP, or HTTP to exchange mail mail transfer agents (MTAs) mailbox to hold
SCORECARD EMAIL MARKETING. Find Out How Much You Are Really Getting Out of Your Email Marketing
EMAIL MARKETING SCORECARD Find Out How Much You Are Really Getting Out of Your Email Marketing This guide is designed to help you self-assess your email sending activities. There are two ways to render
Achieve more with less
Energy reduction Bayesian Filtering: the essentials - A Must-take approach in any organization s Anti-Spam Strategy - Whitepaper Achieve more with less What is Bayesian Filtering How Bayesian Filtering
Spam Testing Methodology Opus One, Inc. March, 2007
Spam Testing Methodology Opus One, Inc. March, 2007 This document describes Opus One s testing methodology for anti-spam products. This methodology has been used, largely unchanged, for four tests published
The Network Box Anti-Spam Solution
NETWORK BOX TECHNICAL WHITE PAPER The Network Box Anti-Spam Solution Background More than 2,000 years ago, Sun Tzu wrote if you know yourself but not the enemy, for every victory gained you will also suffer
Spam Classification Using Machine Learning Techniques - Sinespam
Master in Artificial Intelligence (UPC-URV-UB) Master of Science Thesis Spam Classification Using Machine Learning Techniques - Sinespam José Norte Sosa Advisor: PhD. René Alquezar Mancho August 2010 Content
Panda Cloud Email Protection
Panda Cloud Email Protection 1. Introduction a) What is spam? Spam is the term used to describe unsolicited messages or messages sent from unknown senders. They are usually sent in large (even massive)
Fighting Spam in an ISP Environment:
White Paper Fighting Spam in an ISP Environment: Challenges, Solutions and Best Practices April, 2007 Summary An ISP presents one of the most complex environments for managing spam because of the high
Emerging Trends in Fighting Spam
An Osterman Research White Paper sponsored by Published June 2007 SPONSORED BY sponsored by Osterman Research, Inc. P.O. Box 1058 Black Diamond, Washington 98010-1058 Phone: +1 253 630 5839 Fax: +1 866
Spam DNA Filtering System
The Excedent Spam DNA Filtering System provides webmail.us customers with premium and effective junk email protection. Threats to email services are rising rapidly. A Growing Problem As of November 2002,
Email Marketing Glossary of Terms
Email Marketing Glossary of Terms A/B Testing: A method of testing in which a small, random sample of an email list is split in two. One email is sent to the list A and another modified email is sent to
T-61.3050 : Email Classification as Spam or Ham using Naive Bayes Classifier. Santosh Tirunagari : 245577
T-61.3050 : Email Classification as Spam or Ham using Naive Bayes Classifier Santosh Tirunagari : 245577 January 20, 2011 Abstract This term project gives a solution how to classify an email as spam or
Comprehensive Email Filtering. Whitepaper
Comprehensive Email Filtering Whitepaper Email has undoubtedly become a valued communications tool among organizations worldwide. With frequent virus attacks and the alarming influx of spam, email loses
Fighting Spam: Tools, Tips, and Techniques
Fighting Spam: Tools, Tips, and Techniques Brian Sebby Argonne National Laboratory NetSecure 08 IIT Center for Professional Development Part I: Introduction 2 Argonne National Laboratory IT Environment
Enhanced Spam Defence
Enhanced Spam Defence An approach to making SMTP connect time blocking a reliable method for e-mail filtering By John Jensen, Topsec Technology Ltd. As the spam problem keeps growing and the associated
Stop Spam Now! By John Buckman. John Buckman is President of Lyris Technologies, Inc. and programming architect behind Lyris list server.
Stop Spam Now! By John Buckman John Buckman is President of Lyris Technologies, Inc. and programming architect behind Lyris list server. Copyright 1999 Lyris Technologies, Inc. Stop Spam Now! 1 Introduction
How to Use Red Condor Spam Filtering
What is Spam Filtering? Spam filtering is used to detect unsolicited and unwanted email. By looking for certain words in an email's subject line, or through suspicious word patterns or word frequencies
A Case-Based Approach to Spam Filtering that Can Track Concept Drift
A Case-Based Approach to Spam Filtering that Can Track Concept Drift Pádraig Cunningham 1, Niamh Nowlan 1, Sarah Jane Delany 2, Mads Haahr 1 1 Department of Computer Science, Trinity College Dublin 2 School
Global Reputation Monitoring The FortiGuard Security Intelligence Database WHITE PAPER
Global Reputation Monitoring The FortiGuard Security Intelligence Database WHITE PAPER FORTINET Global Reputation Monitoring PAGE 2 Overview Fortinet s FortiGuard Security Services delivers two essential
Get Started Guide - PC Tools Internet Security
Get Started Guide - PC Tools Internet Security Table of Contents PC Tools Internet Security... 1 Getting Started with PC Tools Internet Security... 1 Installing... 1 Getting Started... 2 iii PC Tools
Why Content Filters Can t Eradicate spam
WHITEPAPER Why Content Filters Can t Eradicate spam About Mimecast Mimecast () delivers cloud-based email management for Microsoft Exchange, including archiving, continuity and security. By unifying disparate
How To Filter Email From A Spam Filter
Spam Filtering A WORD TO THE WISE WHITE PAPER BY LAURA ATKINS, CO- FOUNDER 2 Introduction Spam filtering is a catch- all term that describes the steps that happen to an email between a sender and a receiver
Anti-SPAM Solutions as a Component of Digital Communications Management
Anti-SPAM Solutions as a Component of Digital Communications Management Ron Shuck CISSP, GCIA, CCSE Agenda What is Spam & what can you do? What is the cost of Spam E-mail E to organizations? How do we
K7 Mail Security FOR MICROSOFT EXCHANGE SERVERS. v.109
K7 Mail Security FOR MICROSOFT EXCHANGE SERVERS v.109 1 The Exchange environment is an important entry point by which a threat or security risk can enter into a network. K7 Mail Security is a complete
How To Protect Your Email From Spam On A Barracuda Spam And Virus Firewall
Comprehensive Email Filtering: Barracuda Spam & Virus Firewall Safeguards Legitimate Email Email has undoubtedly become a valued communications tool among organizations worldwide. With frequent virus attacks
Combining Global and Personal Anti-Spam Filtering
Combining Global and Personal Anti-Spam Filtering Richard Segal IBM Research Hawthorne, NY 10532 Abstract Many of the first successful applications of statistical learning to anti-spam filtering were personalized
Intelligent Word-Based Spam Filter Detection Using Multi-Neural Networks
www.ijcsi.org 17 Intelligent Word-Based Spam Filter Detection Using Multi-Neural Networks Ann Nosseir 1, Khaled Nagati 1 and Islam Taj-Eddin 1 1 Faculty of Informatics and Computer Sciences British University
How To Configure Forefront Threat Management Gateway (Forefront) For An Email Server
Owner of the content within this article is www.isaserver.org Written by Marc Grote www.it-training-grote.de Configuring and using the E-Mail protection feature in Microsoft Forefront Threat Management
Improving the Performance of Heuristic Spam Detection using a Multi-Objective Genetic Algorithm. James Dudley
Improving the Performance of Heuristic Spam Detection using a Multi-Objective Genetic Algorithm James Dudley This report is submitted as partial fulfilment of the requirements for the Honours Programme
A White Paper. VerticalResponse, Email Delivery and You A Handy Guide. VerticalResponse,Inc. 501 2nd Street, Suite 700 San Francisco, CA 94107
A White Paper VerticalResponse, Email Delivery and You Delivering email seems pretty straightforward, right? You upload a mailing list, create an email, hit send, and then mighty wizards transport that
Tightening the Net: A Review of Current and Next Generation Spam Filtering Tools
Tightening the Net: A Review of Current and Next Generation Spam Filtering Tools Spam Track Wednesday 1 March, 2006 APRICOT Perth, Australia James Carpinter & Ray Hunt Dept. of Computer Science and Software
