On Entropy in Network Traffic Anomaly Detection
Jayro Santiago-Paz, Deni Torres-Roman. Cinvestav, Campus Guadalajara, Mexico. November 2015.
Outline: Introduction; Databases; Windowing in Network Traffic; Kullback-Leibler divergence; Mutual information; Entropy calculation; The Classifier; Metrics.
Introduction
Chandola et al. (2009) state that anomaly-based intrusion detection in networks refers to the problem of finding exceptional patterns in network traffic that do not conform to the expected normal behavior. Given a network and a set of selected traffic features $X = \{X_1, X_2, \ldots, X_p\}$, together with $N$ time instances of $X$, the normal and abnormal behaviors of the instances can be studied. The set of all instances of $X$ forms the feature space, which can be mapped to another space by a function such as entropy. In the literature, the Shannon entropy and the generalized Rényi and Tsallis entropies are used, together with probability estimators such as Balanced and Balanced II.
An anomaly-based network intrusion detection system (A-NIDS) usually consists of two stages: training and testing. In the training stage, a database of normal (anomaly-free) network traffic is passed through the feature extraction, windowing, and entropy calculation modules to build a normal profile. In the testing stage, the same modules are applied to the current network traffic, and anomalies are detected and classified against that profile.
Figure 1: General architecture of entropy-based A-NIDS.
Databases
Synthetic: Synthetic databases are generated artificially, e.g., the MIT-DARPA 1998, 1999, and 2000 databases, whose major attack categories include Denial of Service (DoS), User to Root (U2R), Remote to User (R2U), and probes.
Real: Some real public databases are CAIDA, which contains anonymized passive traffic traces from high-speed Internet backbone links, and the traffic data repository maintained by the MAWI Working Group of the WIDE Project. Other researchers have created their own databases at different universities, e.g., Carnegie Mellon University, Xi'an Jiaotong University, and Clemson University (GENI), or have used traffic collected from the SWITCH, Abilene, and Géant backbones.
According to Motoda and Liu (2002), feature selection is a process that chooses a subset of $M$ features from the original set of $N$ features ($M \leq N$) so that the feature space is optimally reduced according to a certain criterion. Feature extraction is a process that derives a set of new features from the original features through some functional mapping. Assuming that there are $N$ features $Z_1, Z_2, \ldots, Z_N$, another set of new features $X_1, X_2, \ldots, X_M$ ($M < N$) is obtained via the mapping functions $F_i$, i.e., $X_i = F_i(Z_1, Z_2, \ldots, Z_N)$.
Among the algorithms used to reduce the number of features in network traffic anomaly detection are PCA, mutual information and linear correlation, decision trees, and maximum entropy. In network traffic, the most commonly employed features are the source and destination IP addresses and the source and destination port numbers. Other features extracted from packet headers are the protocol field, number of bytes, service, flag, and country code. Zhang et al. (2009) divided packet sizes into seven types, and Gu et al. (2005) defined 587 packet classes based on the port number. At the flow level, the selected features were flow duration, flow size distribution (FSD), and average packet size per flow; an IP flow corresponds to the IP port-to-port traffic exchanged between two IP addresses during a period of time T. For KDD Cup 99, all 41 features or a subset of them were employed. Tellenbach et al. (2011), on the other hand, used source port, country code, and other features to construct the Traffic Entropy Spectrum (TES) as input data.
Windowing in Network Traffic
Window-based methods group consecutive packets or flows using a sliding window. The $i$th window of size $L$ packets is represented as $W_i(L, \tau) = \{pack_k, pack_{k+1}, \ldots, pack_{k+L-1}\}$, with $k = iL - i\tau$, where $\tau$ is the overlap and $\tau \in \{0, 1, \ldots, L-1\}$. When the window size is given as a time interval, $L$ can differ from window to window. Windowing is performed in two ways: with overlapping windows ($\tau \neq 0$) and with non-overlapping windows ($\tau = 0$). The most commonly used window sizes are 5 min, 30 min, 1 min, 100 s, 5 s, and 0.5 s. Some researchers use windows with a fixed length of $L = 4096$, $1000$, or $32$ packets.
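The windowing rule can be illustrated with a minimal Python sketch. It assumes packets are held in an in-memory sequence, that each window holds exactly L packets, and that incomplete trailing windows are dropped; the function name `sliding_windows` is hypothetical and not taken from the slides.

```python
from typing import List, Sequence


def sliding_windows(packets: Sequence, L: int, tau: int = 0) -> List[list]:
    """Group packets into windows of L items; consecutive windows share tau items."""
    if L <= 0 or not 0 <= tau < L:
        raise ValueError("need L > 0 and tau in {0, 1, ..., L - 1}")
    step = L - tau  # window i starts at k = i*L - i*tau = i*(L - tau)
    windows = []
    k = 0
    while k + L <= len(packets):  # assumption: keep only complete windows
        windows.append(list(packets[k:k + L]))
        k += step
    return windows


packets = list(range(10))  # stand-in for ten captured packets
print(sliding_windows(packets, L=4, tau=0))  # non-overlapping windows
print(sliding_windows(packets, L=4, tau=2))  # windows overlapping by two packets
```

With tau = 0 the windows are disjoint; a larger tau yields more windows from the same trace, at the cost of correlated entropy values between neighbouring windows.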
Entropy calculation
Let $X$ be a random variable that takes values in the finite set $\{x_1, x_2, \ldots, x_M\}$, let $p_i := P(X = x_i)$ be the probability of occurrence of $x_i$, and let $M$ be the cardinality of the set. The Shannon entropy is
$$H_S(X) = -\sum_{i=1}^{M} p_i \log(p_i). \tag{1}$$
The Rényi entropy is defined as
$$H_R(X, q) = \frac{1}{1-q} \log\left(\sum_{i=1}^{M} p_i^q\right), \tag{2}$$
and the Tsallis entropy is
$$H_T(X, q) = \frac{1}{q-1}\left(1 - \sum_{i=1}^{M} p_i^q\right). \tag{3}$$
When $q \to 1$, the generalized entropies reduce to the Shannon entropy. To compare entropy values at different times, the entropy is normalized:
$$\bar{H}(X) = \frac{H(X)}{H_{\max}(X)}. \tag{4}$$
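As a rough illustration of Equations (1) to (4), the following Python sketch computes the three entropies from relative frequencies. The relative-frequency estimator, the natural logarithm, and the choice of $H_{\max} = \log M$ with $M$ the number of distinct observed values are assumptions made for this sketch; all function names are hypothetical.

```python
import math
from collections import Counter


def relative_frequencies(values):
    """Estimate p_i as the relative frequency of each distinct value."""
    counts = Counter(values)
    n = sum(counts.values())
    return [c / n for c in counts.values()]


def shannon(p):
    """Shannon entropy, Eq. (1), using the natural logarithm."""
    return sum(-pi * math.log(pi) for pi in p if pi > 0)


def renyi(p, q):
    """Renyi entropy of order q, Eq. (2); falls back to Shannon at q = 1."""
    if q == 1:
        return shannon(p)
    return math.log(sum(pi ** q for pi in p if pi > 0)) / (1 - q)


def tsallis(p, q):
    """Tsallis entropy of order q, Eq. (3); falls back to Shannon at q = 1."""
    if q == 1:
        return shannon(p)
    return (1 - sum(pi ** q for pi in p if pi > 0)) / (q - 1)


def normalized_shannon(p):
    """Eq. (4) for Shannon entropy, assuming H_max = log(M)."""
    m = len(p)
    return shannon(p) / math.log(m) if m > 1 else 0.0


ports = [80, 80, 443, 53]  # hypothetical destination ports in one window
p = relative_frequencies(ports)
print(shannon(p), renyi(p, 2), tsallis(p, 2), normalized_shannon(p))
```

Evaluating `renyi` or `tsallis` at a value of q close to 1 gives a number close to `shannon`, which is a quick way to check the limit stated above.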
Kullback-Leibler divergence
Consider two complete discrete probability distributions $P = (p_1, p_2, \ldots, p_n)$ and $Q = (q_1, q_2, \ldots, q_n)$, with $\sum_{i=1}^{n} p_i = \sum_{i=1}^{n} q_i = 1$, $1 \geq p_i \geq 0$, $1 \geq q_i \geq 0$, $i = 1, 2, \ldots, n$. The information divergence is a measure of the divergence between $P$ and $Q$ and is defined by Rényi (1961) as
$$D_\rho(P \| Q) = \frac{1}{\rho - 1} \log\left(\sum_{i=1}^{n} p_i^{\rho} q_i^{1-\rho}\right), \quad \rho \geq 0,\ \rho \neq 1, \tag{5}$$
where $\rho$ is the order of the information divergence. The smaller $D_\rho(P \| Q)$ is, the closer the distributions $P$ and $Q$ are, and $D_\rho(P \| Q) = 0$ iff $P = Q$. When $\rho \to 1$, the Kullback-Leibler (KL) divergence is obtained:
$$D_1(P \| Q) = \sum_{i=1}^{n} p_i \log\left(\frac{p_i}{q_i}\right), \quad \rho \to 1. \tag{6}$$
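Equations (5) and (6) can be checked numerically with a short Python sketch. It assumes $\rho > 0$, natural logarithms, and $q_i > 0$ wherever $p_i > 0$; the function names are hypothetical.

```python
import math


def kl_divergence(p, q):
    """KL divergence D_1(P||Q), Eq. (6); assumes q_i > 0 wherever p_i > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def renyi_divergence(p, q, rho):
    """Information divergence of order rho, Eq. (5); assumes rho > 0."""
    if rho <= 0:
        raise ValueError("this sketch assumes rho > 0")
    if rho == 1:
        return kl_divergence(p, q)  # limiting case rho -> 1
    s = sum(pi ** rho * qi ** (1 - rho) for pi, qi in zip(p, q) if pi > 0)
    return math.log(s) / (rho - 1)


P = [0.5, 0.5]  # hypothetical distribution observed in the current window
Q = [0.9, 0.1]  # hypothetical reference (normal-traffic) distribution
print(kl_divergence(P, Q), renyi_divergence(P, Q, 0.5))
```

Both functions return 0 when the two distributions are identical, and the order-ρ value approaches the KL value as ρ approaches 1.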
Conditional entropy
The conditional entropy of a variable $Y$ given $X$, with alphabets $\mathcal{Y}$ and $\mathcal{X}$, respectively, is defined as
$$H(Y \mid X) = -\sum_{x \in \mathcal{X}} p(x) \sum_{y \in \mathcal{Y}} p(y \mid x) \log\left(p(y \mid x)\right) \tag{7}$$
$$= -\sum_{x \in \mathcal{X}} \sum_{y \in \mathcal{Y}} p(x, y) \log\left(p(y \mid x)\right). \tag{8}$$
Joint entropy
The joint entropy of $X$ and $Y$ is defined as
$$H(X; Y) = -\sum_{x \in \mathcal{X}} \sum_{y \in \mathcal{Y}} p(x, y) \log\left(p(x, y)\right), \tag{9}$$
where $p(x, y)$ is the joint probability mass function.
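A minimal Python sketch of Equations (8) and (9), assuming that $p(x, y)$ and $p(y \mid x)$ are estimated by relative frequencies from paired observations. The function name and the sample values are hypothetical.

```python
import math
from collections import Counter


def joint_and_conditional_entropy(xs, ys):
    """Return (H(X;Y), H(Y|X)) estimated from paired samples xs[i], ys[i]."""
    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts of each (x, y) pair
    marg_x = Counter(xs)          # counts of each x
    # Eq. (9): joint entropy with p(x, y) = count(x, y) / n
    h_joint = sum(-(c / n) * math.log(c / n) for c in joint.values())
    # Eq. (8): conditional entropy with p(y|x) = count(x, y) / count(x)
    h_cond = sum(-(c / n) * math.log(c / marg_x[x]) for (x, _), c in joint.items())
    return h_joint, h_cond


src = ["a", "a", "b", "b"]  # hypothetical source addresses
dst = ["x", "y", "x", "y"]  # hypothetical destination addresses
print(joint_and_conditional_entropy(src, dst))
```

The two returned values satisfy the chain rule $H(X; Y) = H(X) + H(Y \mid X)$, which gives a simple consistency check on any estimate.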
Mutual information
The mutual information (MI) between two random variables $X$ and $Y$ is a measure of the amount of knowledge of $Y$ supplied by $X$, or vice versa. If $X$ and $Y$ are independent, their mutual information is zero. The MI of two random variables $X$ and $Y$ is defined as
$$I(X; Y) = H(X) - H(X \mid Y) = H(Y) - H(Y \mid X) = H(X) + H(Y) - H(X; Y), \tag{10}$$
where $H(\cdot)$ is the entropy, $H(X \mid Y)$ and $H(Y \mid X)$ are conditional entropies, and $H(X; Y)$ is the joint entropy. The MI equation can also be written as
$$I(X; Y) = \sum_{x \in \mathcal{X}} \sum_{y \in \mathcal{Y}} p(x, y) \log\left(\frac{p(x, y)}{p(x)\,p(y)}\right), \tag{11}$$
where $p(x)$ and $p(y)$ are the marginal probability mass functions of $X$ and $Y$, respectively. To estimate the MI between $X$ and $Y$, it is necessary to estimate $p(x, y)$.
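Equation (11) can be sketched in Python as follows, again assuming a relative-frequency estimate of $p(x, y)$ from paired samples; the function name is hypothetical, and other estimators (such as the k-nearest-neighbor approach mentioned in the literature) would replace the counting step.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Estimate I(X;Y), Eq. (11), from paired samples using relative frequencies."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    return sum(
        (c / n) * math.log((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in joint.items()
    )


xs = [0, 0, 1, 1]
print(mutual_information(xs, [0, 1, 0, 1]))  # samples that look independent
print(mutual_information(xs, xs))            # Y fully determined by X
```

For the first call every joint frequency equals the product of the marginals, so each log term vanishes; for the second call the estimate equals the entropy of X.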
Different probability estimators are used to approximate the true probability distribution, e.g., relative frequency, Balanced, and Balanced II. The entropy is calculated from these estimates; the more accurate the estimators, the better the entropy estimates. Rahmani et al. (2009) noted that the time series of the number of IP flows and of the aggregate traffic size are strongly statistically dependent, and that an attack causes a rupture in the time series of joint entropy values. To calculate the joint entropy $H(X; Y)$, they obtained $p(x, y)$ for the time series $X$ and $Y$ using either the Gamma probability density function (when the number of connections was small) or the central limit theorem (when the number of connections was large enough). Liu et al. (2010) calculated the conditional entropy $H(Y \mid X)$, where $Y$ and $X$ are two of the most widely used traffic variables: source and destination IP addresses. Amiri et al. (2011) used an MI estimator developed by Kraskov et al. (2004), which employs entropy estimates from k-nearest-neighbor distances. Velarde-Alvarado et al. (2009) estimated entropy values using the Balanced II estimator as the probability estimator.
An anomaly in network traffic is a data pattern that does not conform to the patterns representing normal traffic behavior. Assume that 1) $X \in \mathbb{R}^p$ is a $p$-dimensional real-valued random variable with domain $S_X \subset \mathbb{R}^p$ representing traffic features, 2) $x_i$ are instances of $X$, i.e., $x_i \in S_X$, and 3) data patterns of normal behavior are represented by the subspace $S_N \subset S_X$. Anomaly detection then determines whether an instance $x_i$ belongs to $S_N$ or not. The space $S_X$ can be partitioned into classes with the help of decision functions, allowing further classification.
Specific Decision Functions
Zhang et al. (2009) and Gu et al. (2005) used the KL divergence $D_1(P \| Q)$; in addition, Zhang et al. (2009) classified the abnormal situations into different classes. Coluccia et al. (2013) employed both KL divergence and maximum entropy. Yan et al. (2008) used $D_{0.5}(P \| Q)$. In Santiago-Paz et al. (2015), one decision function is based on the Mahalanobis distance $d_M^2(x_i)$, and a second decision function is given by $f(x) = \sum_{i=1}^{N} \alpha_i k(x_i, x) - b$ for the One-Class Support Vector Machine (OC-SVM), where $k(x_i, x)$ is a kernel. Huang et al. (2006) computed the Rényi entropy ($q = 3$) of the Coiflet and Daubechies wavelets. Velarde-Alvarado et al. (2009) used the proportional uncertainty (PU) and the method of remaining elements (MRE) to detect anomalies. Tellenbach et al. (2011) used the Kalman filter, PCA, and KLE as anomaly detection methods. Ma et al. (2014) established a decision function based on the entropy of the source IP address $\hat{H}_s$ and the entropy of the destination IP address $\hat{H}_d$. In Bereziński et al. (2015) and Özçelik and Brooks (2015), a decision function based on entropy and a range of values was used to detect anomalies.
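The simplest of the decision functions listed above, a check of whether an entropy value falls inside a range learned from normal traffic, might look like the following Python sketch. The choice of mean plus or minus k standard deviations as the range, the default k = 3, and the function names are assumptions for illustration only and are not taken from the cited papers.

```python
import statistics


def fit_range(train_entropies, k=3.0):
    """Learn an acceptance range from entropy values of anomaly-free windows.

    Assumption: the range is mean +/- k standard deviations.
    """
    mu = statistics.mean(train_entropies)
    sigma = statistics.stdev(train_entropies)
    return mu - k * sigma, mu + k * sigma


def is_anomalous(h, bounds):
    """Flag a window whose entropy value h falls outside the learned range."""
    low, high = bounds
    return not (low <= h <= high)


# Hypothetical normalized entropies of source IP addresses in training windows
train = [0.80, 0.82, 0.79, 0.81, 0.80]
bounds = fit_range(train)
print(is_anomalous(0.81, bounds), is_anomalous(0.30, bounds))
```

A sharp drop in source-address entropy, as in the second call, is the kind of change such a rule would flag; the width parameter k trades false alarms against missed detections.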
Table 1: Results of network traffic anomaly detection using entropy.
Author | Information metric | Database | Anomaly | TNR [%]
Gu et al. (2005) | KL divergence | | Portscan | 91.0
Zhang et al. (2009) | KL divergence | MIT-DARPA | DoS, Probe, R2L, U2R |
Liu et al. (2010) | Conditional entropy | CAIDA | DDoS | 93.0
Ferreira et al. (2011) | Shannon, Rényi, Tsallis | KDD Cup 99 | DoS, Probe, R2L, U2R |
Amiri et al. (2011) | Mutual Information | KDD Cup 99 | DoS, Probe, R2L, U2R | 95.0
Santiago-Paz et al. (2015) | Shannon, Rényi, Tsallis | LAN, MIT-DARPA subset | Worms, DoS, Portscan |
The Classifier
Gupta et al. (2014) state that, given 1) a training data set of the form $\{(x_i, y_i)\}$, where $x_i \in S_X$ is a feature vector or data pattern and $y_i \subseteq \{1, \ldots, G\}$ is the subset of the $G$ class labels that are known to be correct labels for $x_i$, and 2) a discriminant function $f(x; \beta_g)$ with class-specific parameters $\beta_g$ for each class $g = 1, \ldots, G$, class discriminant functions are used to classify an instance $x$ as the class label that solves $\arg\max_g f(x; \beta_g)$. Lakhina et al. (2005) apply two clustering algorithms, k-means and hierarchical agglomeration, to the vector $h = [H(\mathrm{srcIP}), H(\mathrm{dstIP}), H(\mathrm{srcPort}), H(\mathrm{dstPort})]$. Xu et al. (2005) define three free feature dimensions and introduce an Entropy-based Significant Cluster Extraction Algorithm for clustering. Lima et al. (2011) use the WEKA Simple K-Means algorithm. SVM is applied by Tellenbach et al. (2011) to classify the anomalies. Yao et al. (2012) use random forests. Santiago-Paz et al. (2014) present the Entropy and Mahalanobis Distance (EMD) based algorithm to define elliptical regions in the feature space. In Santiago-Paz et al. (2015), OC-SVM and k-temporal nearest neighbors are used to improve classification accuracy.
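The entropy vector used as input to the clustering algorithms can be sketched in Python as below. The dictionary-based packet records, the field names, and the relative-frequency Shannon estimate are assumptions for illustration; the cited works may compute the entries differently.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy of the empirical distribution of values (natural log)."""
    n = len(values)
    return sum(-(c / n) * math.log(c / n) for c in Counter(values).values())


def entropy_vector(window):
    """Build h = [H(srcIP), H(dstIP), H(srcPort), H(dstPort)] for one window."""
    fields = ("src_ip", "dst_ip", "src_port", "dst_port")  # hypothetical field names
    return [shannon_entropy([pkt[f] for pkt in window]) for f in fields]


# Hypothetical window of four packets from one host to two destinations
window = [
    {"src_ip": "10.0.0.1", "dst_ip": "10.0.0.2", "src_port": 1001, "dst_port": 80},
    {"src_ip": "10.0.0.1", "dst_ip": "10.0.0.2", "src_port": 1002, "dst_port": 80},
    {"src_ip": "10.0.0.1", "dst_ip": "10.0.0.3", "src_port": 1003, "dst_port": 80},
    {"src_ip": "10.0.0.1", "dst_ip": "10.0.0.3", "src_port": 1004, "dst_port": 80},
]
print(entropy_vector(window))
```

One such vector per window gives the points that a clustering or classification algorithm then operates on.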
Metrics
Given a classifier and an instance, there are four possible outcomes: $TN$, $FP$, $FN$, and $TP$. $TN$ is the number of correct predictions that an instance is negative, $FP$ is the number of incorrect predictions that an instance is positive, $FN$ is the number of incorrect predictions that an instance is negative, and $TP$ is the number of correct predictions that an instance is positive. With these entries, the following statistics are computed.
Accuracy (AC) is the proportion of the total number of predictions that were correct: $AC = \frac{TN + TP}{TN + FP + FN + TP}$.
True Positive Rate (TPR) is the proportion of positive cases that were correctly identified: $TPR = \frac{TP}{FN + TP}$.
True Negative Rate (TNR) is the proportion of negative cases that were classified correctly: $TNR = \frac{TN}{TN + FP}$.
False Negative Rate (FNR) is the proportion of positive cases that were incorrectly classified as negative: $FNR = \frac{FN}{FN + TP}$.
The F-measure is a measure of a test's accuracy: $F\text{-measure} = \frac{2 \cdot TPR \cdot AC}{TPR + AC}$.
In addition, Receiver Operating Characteristic (ROC) graphs illustrate the performance of a classifier. ROC graphs are two-dimensional graphs in which each (FP rate, TP rate) pair corresponds to a single point in ROC space.
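The statistics above follow directly from the four counts, as in this small Python sketch. The function name and the example counts are hypothetical, and the F-measure is computed exactly as written on the slide (from TPR and AC), which differs from the more common precision/recall form.

```python
def classifier_metrics(tn, fp, fn, tp):
    """Compute AC, TPR, TNR, FNR and the F-measure from confusion-matrix counts."""
    ac = (tn + tp) / (tn + fp + fn + tp)
    tpr = tp / (fn + tp)
    tnr = tn / (tn + fp)
    fnr = fn / (fn + tp)
    # F-measure as defined in the slides: harmonic mean of TPR and AC
    f_measure = 2 * tpr * ac / (tpr + ac)
    return {"AC": ac, "TPR": tpr, "TNR": tnr, "FNR": fnr, "F": f_measure}


# Hypothetical counts: 100 normal windows and 50 anomalous windows
print(classifier_metrics(tn=90, fp=10, fn=5, tp=45))
```

Note that TPR and FNR always sum to 1, since both are taken over the same set of positive cases.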
At present, there is no public database large enough to test and compare different algorithms exhaustively and to draw significant conclusions about their performance and classification capabilities. A common database with real normal and anomalous traffic is therefore needed for the evaluation of A-NIDS. The value of the q parameter for generalized entropies is found experimentally; choosing it for the best anomaly detection is an open research problem. For different networks, the larger the slot size, the more the entropy behaviors differ. In the near future, this behavior should be studied with more, and more recent, traces to determine whether a model learned on one network can be used on a different network. Another open issue is the window size that reduces the data volume while still ensuring good entropy estimates and early detection of anomalies. The set of decision functions and classifiers should be extended with new entropy-based measures of closeness and farness.
Network Intrusion Detection Using a HNB Binary Classifier
2015 17th UKSIM-AMSS International Conference on Modelling and Simulation Network Intrusion Detection Using a HNB Binary Classifier Levent Koc and Alan D. Carswell Center for Security Studies, University
A survey on Data Mining based Intrusion Detection Systems
International Journal of Computer Networks and Communications Security VOL. 2, NO. 12, DECEMBER 2014, 485 490 Available online at: www.ijcncs.org ISSN 2308-9830 A survey on Data Mining based Intrusion
Monitoring of Internet traffic and applications
Monitoring of Internet traffic and applications Chadi BARAKAT INRIA Sophia Antipolis, France Planète research group ETH Zurich October 2009 Email: [email protected] WEB: http://www.inria.fr/planete/chadi
Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1
Data Mining 1 Introduction 2 Data Mining methods Alfred Holl Data Mining 1 1 Introduction 1.1 Motivation 1.2 Goals and problems 1.3 Definitions 1.4 Roots 1.5 Data Mining process 1.6 Epistemological constraints
PhD Proposal: Functional monitoring problem for distributed large-scale data streams
PhD Proposal: Functional monitoring problem for distributed large-scale data streams Emmanuelle Anceaume, Yann Busnel, Bruno Sericola IRISA / CNRS Rennes LINA / Université de Nantes INRIA Rennes Bretagne
A Survey on Intrusion Detection System with Data Mining Techniques
A Survey on Intrusion Detection System with Data Mining Techniques Ms. Ruth D 1, Mrs. Lovelin Ponn Felciah M 2 1 M.Phil Scholar, Department of Computer Science, Bishop Heber College (Autonomous), Trichirappalli,
Adaptive Flow Aggregation - A New Solution for Robust Flow Monitoring under Security Attacks
Adaptive Flow Aggregation - A New Solution for Robust Flow Monitoring under Security Attacks Yan Hu Dept. of Information Engineering Chinese University of Hong Kong Email: [email protected] D. M. Chiu
A Web-based Interactive Data Visualization System for Outlier Subspace Analysis
A Web-based Interactive Data Visualization System for Outlier Subspace Analysis Dong Liu, Qigang Gao Computer Science Dalhousie University Halifax, NS, B3H 1W5 Canada [email protected] [email protected] Hai
AS THE Internet continues to grow in size and complexity,
IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 16, NO. 6, DECEMBER 2008 1241 Internet Traffic Behavior Profiling for Network Security Monitoring Kuai Xu, Zhi-Li Zhang, Member, IEEE, and Supratik Bhattacharyya
Echidna: Efficient Clustering of Hierarchical Data for Network Traffic Analysis
Echidna: Efficient Clustering of Hierarchical Data for Network Traffic Analysis Abdun Mahmood, Christopher Leckie, Parampalli Udaya Department of Computer Science and Software Engineering University of
Intrusion Forecasting Framework for Early Warning System against Cyber Attack
Intrusion Forecasting Framework for Early Warning System against Cyber Attack Sehun Kim KAIST, Korea Honorary President of KIISC Contents 1 Recent Cyber Attacks 2 Early Warning System 3 Intrusion Forecasting
Supervised Feature Selection & Unsupervised Dimensionality Reduction
Supervised Feature Selection & Unsupervised Dimensionality Reduction Feature Subset Selection Supervised: class labels are given Select a subset of the problem features Why? Redundant features much or
A Dynamic Flooding Attack Detection System Based on Different Classification Techniques and Using SNMP MIB Data
International Journal of Computer Networks and Communications Security VOL. 2, NO. 9, SEPTEMBER 2014, 279 284 Available online at: www.ijcncs.org ISSN 2308-9830 C N C S A Dynamic Flooding Attack Detection
The Artificial Prediction Market
The Artificial Prediction Market Adrian Barbu Department of Statistics Florida State University Joint work with Nathan Lay, Siemens Corporate Research 1 Overview Main Contributions A mathematical theory
Categorical Data Visualization and Clustering Using Subjective Factors
Categorical Data Visualization and Clustering Using Subjective Factors Chia-Hui Chang and Zhi-Kai Ding Department of Computer Science and Information Engineering, National Central University, Chung-Li,
Chapter ML:XI (continued)
Chapter ML:XI (continued) XI. Cluster Analysis Data Mining Overview Cluster Analysis Basics Hierarchical Cluster Analysis Iterative Cluster Analysis Density-Based Cluster Analysis Cluster Evaluation Constrained
ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)
ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING) Gabriela Ochoa http://www.cs.stir.ac.uk/~goc/ OUTLINE Preliminaries Classification and Clustering Applications
Network Management & Monitoring
Network Management & Monitoring NetFlow Overview These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license (http://creativecommons.org/licenses/by-nc/3.0/)
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
Big Data Analytics CSCI 4030
High dim. data Graph data Infinite data Machine learning Apps Locality sensitive hashing PageRank, SimRank Filtering data streams SVM Recommen der systems Clustering Community Detection Web advertising
Hybrid Intrusion Detection System Model using Clustering, Classification and Decision Table
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 9, Issue 4 (Mar. - Apr. 2013), PP 103-107 Hybrid Intrusion Detection System Model using Clustering, Classification
Decompose Error Rate into components, some of which can be measured on unlabeled data
Bias-Variance Theory Decompose Error Rate into components, some of which can be measured on unlabeled data Bias-Variance Decomposition for Regression Bias-Variance Decomposition for Classification Bias-Variance
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
CYBER SCIENCE 2015 AN ANALYSIS OF NETWORK TRAFFIC CLASSIFICATION FOR BOTNET DETECTION
CYBER SCIENCE 2015 AN ANALYSIS OF NETWORK TRAFFIC CLASSIFICATION FOR BOTNET DETECTION MATIJA STEVANOVIC PhD Student JENS MYRUP PEDERSEN Associate Professor Department of Electronic Systems Aalborg University,
An Intelligent DDoS Attack Detection System Using Packet Analysis and Support Vector Machine
An Intelligent DDoS Attack Detection System Using Packet Analysis and Support Vector Machine Keisuke Kato, Vitaly Klyuev Department of Computer Science and Engineering The University of Aizu, Japan Abstract
A Novel Distributed Denial of Service (DDoS) Attacks Discriminating Detection in Flash Crowds
International Journal of Research Studies in Science, Engineering and Technology Volume 1, Issue 9, December 2014, PP 139-143 ISSN 2349-4751 (Print) & ISSN 2349-476X (Online) A Novel Distributed Denial
Linear Threshold Units
Linear Threshold Units w x hx (... w n x n w We assume that each feature x j and each weight w j is a real number (we will relax this later) We will study three different algorithms for learning linear
D-optimal plans in observational studies
D-optimal plans in observational studies Constanze Pumplün Stefan Rüping Katharina Morik Claus Weihs October 11, 2005 Abstract This paper investigates the use of Design of Experiments in observational
