Detecting Credit Card Fraud by Decision Trees and Support Vector Machines
|
|
|
- Jessie Brooks
- 10 years ago
- Views:
Transcription
1 Detecting Credit Card Fraud by Decision Trees and Support Vector Machines Y. Sahin and E. Duman Abstract With the developments in the Information Technology and improvements in the communication channels, fraud is spreading all over the world, resulting in huge financial losses. Though fraud prevention mechanisms such as CHIP&PIN are developed, these mechanisms do not prevent the most common fraud types such as fraudulent credit card usages over virtual POS terminals or mail orders. As a result, fraud detection is the essential tool and probably the best way to stop such fraud types. In this study, classification models based on decision trees and support vector machines (SVM) are developed and applied on credit card fraud detection problem. This study is one of the firsts to compare the performance of SVM and decision tree methods in credit card fraud detection with a real data set. Index Terms Credit card fraud detection, decision trees, Support Vector Machines, classification F I. INTRODUCTION raud can be defined as wrongful or criminal deception intended to result in financial or personal gain [1], or to damage another individual without necessarily leading to direct legal consequences. The two main mechanisms to avoid frauds and losses due to fraudulent activities are fraud prevention and fraud detection systems. Fraud prevention is the proactive mechanism with the goal of disabling the occurrence of fraud. Fraud detection systems come into play when the fraudsters surpass the fraud prevention systems and start a fraudulent transaction. Nobody can understand whether a fraudulent transaction has passed the prevention mechanisms. Accordingly, the goal of the fraud detection systems is to check every transaction for the possibility of being fraudulent regardless of the prevention mechanisms, and to identify fraudulent ones as quickly as possible after the fraudster has begun to perpetrate a fraudulent transaction. A review of the fraud detection can be found in [2-5]. The most well-known fraud types are fraudulent transactions in credit card systems and e-commerce systems, money laundering in financial systems, intrusions to computer systems, fraudulent calls or service usages in Manuscript received December 08, 2010; revised January 15, This work was supported in part by The Scientific and Research Council of Turkey (TUBITAK) under Grant (TUBITAK TEYDEB 1501 Program). Y. Sahin is with the Department of Electrical & Electronics Engineering, Marmara University, Kadikoy, 34722, Istanbul, Turkey (phone: ; fax: ; [email protected]). E. Duman is with the Department of Industrial Engineering, Dogus University, Acibadem, Istanbul Turkey (phone: ; fax: ; [email protected]). telecommunication systems, and fraudulent claims in health and auto insurance systems. With the developments in the Information Technology and improvements in the communication channels, fraud is spreading all over the world with results of huge financial losses. Though fraud can be perpetrated through many types of media, including mail, wire, phone and the Internet, online media such as Internet are the most popular ones. Because of the international availability of the web and ease with which users can hide their location and identity over Internet transactions, there is a rapid growth of committing fraudulent actions over this medium. Furthermore, with the improvements in the bandwidth of internetworking channels, fraudsters have the chance to form fraud networks among themselves through information change and collaboration all over the world. As a result, frauds committed over Internet such as online credit card frauds become the most popular ones because of their nature. Credit card frauds can be made in many ways such as simple theft, application fraud, counterfeit cards, never received issue (NRI) and online fraud (where the card holder is not present). In online fraud, the transaction is made remotely and only the card s details are needed. A manual signature, a PIN or a card imprint are not required at the time of purchase. Though prevention mechanisms like CHIP&PIN decrease the fraudulent activities through simple theft, counterfeit cards and NRI; online frauds (Internet and mail order frauds) are still increasing in both amount and number of transactions. There has been a growing amount of financial losses due to credit card frauds as the usage of the credit cards become more and more common. Many papers reported huge amounts of losses in different countries [2, 6-7]. According to Visa reports about European countries, about 50% of the whole credit card fraud losses in 2008 are due to online frauds. Credit card fraud detection is an extremely difficult, but also popular problem to solve. Firstly, there comes only a limited amount of data with the transaction being committed, such as transaction amount, date and time; address, Merchant Category Code (MCC) and acquirer number of the merchant. There are millions of possible places and e-commerce sites to use a credit card which makes it extremely difficult to match a pattern. Also, there can be past transactions made by fraudsters which also fit a pattern of normal (legitimate) behavior [8]. Also the problem has many constraints. First of all, the profile of normal and fraudulent behavior changes constantly. Secondly, the development of new fraud detection methods is made more difficult by the fact that the exchange of ideas
2 in fraud detection, especially in credit card fraud detection is severely limited due to security and privacy concerns. Thirdly, data sets are not made available and the results are often censored, making them difficult to assess. Because of this problem, there is no chance of benchmarking for the models built. Even, some of the studies are done using synthetically generated data [9-10]. None of the previous studies with real data in the literature give details about the data and the variables used in classifier models. Fourthly, credit card fraud data sets are highly skewed sets with a ratio of about legitimate transactions to a fraudulent one. Lastly, the data sets are also constantly evolving making the profiles of normal and fraudulent behaviors always changing [2-4]. Techniques used in fraud detection can be divided into two: Supervised techniques where past known legitimate/fraud cases are used to build a model which will produce a suspicion score for the new transactions; and unsupervised ones where there are no prior sets in which the state of the transactions are known to be fraud or legitimate. A brief discussion of supervised and unsupervised techniques can be found in [2]. Many of these techniques like artificial neural networks can be used in both manners, supervised or unsupervised. The most commonly used fraud detection methods are rule-induction techniques, decision trees, neural networks, Support Vector Machines (SVM), logistic regression, and meta-heuristics such as genetic algorithm, k-means clustering and nearest neighbor algorithms. These techniques can be used alone or in collaboration using ensemble or meta-learning techniques to build classifiers. From the well-known decision tree techniques, ID3, C4.5 and C&RT are used for credit card fraud detection in [11-14]. Also, SVM is used in [10, 15] for detecting credit card frauds. Fraud detection systems evaluate the transactions and produce a suspicion score (generally a probability between 0 and 1) which shows the possibility of that transaction to be fraudulent. Computational procedures of these scores are relevant to the techniques used to build the model/models in the fraud detection systems. These scores are used with a predefined threshold value to differentiate the fraudulent transactions from the legitimate ones. However, most of the time, these scores are not directly used; but help the observer staff with domain expertise who examine and try to identify the frauds. Because the organizations have limited staff for this process, the ability of the detection systems to produce accurate suspicion scores helps these staff in many ways. Nevertheless, the success of the detection systems lies in distinguishing the fraudulent transactions from legitimate ones through producing suspicion scores with high precisions. In this study, a credit card fraud detection system based on a number of decision tree and SVM methods is developed. In this system, each account is monitored separately using suitable descriptors, and the transactions are attempt to be identified and flagged as legitimate or normal. The identification will be based on the suspicion score produced by the classifier models developed. When a new transaction is going, the classifier can predict whether the transaction is normal or fraud. To the best of our knowledge, there is no prior work on the comparison of performance of decision tree and SVM based classifiers in the subject of credit card fraud detection with a real data set. The rest of this paper is organized as follows: Section 2 gives some insights to the structure of credit card data. Section 3 is a summary of the classification methods used to develop the classifier models of the credit card fraud detection system given in this paper. Section 4 is about the details of our approach. Section 5 gives the results and a short discussion about the results. Section 6 concludes the study and shows directions for future work. II. STRUCTURE OF THE CREDIT CARD DATA The credit card data used in this study are taken from a national bank s credit card data warehouses with the required permissions. The past data in the credit card data warehouses are used to form a data mart representing the card usage profiles of the customers. Though some of the customers may have more than one credit card, each card is taken as a unique profile because customers with more than one card generally use each card for a different purpose. Every card profile consists of variables each of which discloses a behavioral characteristic of the card usage. These variables may show the spending habits of the customers with respect to geographical locations, days of the month, hours of the day or Merchant Category Codes (MCC) which show the type of the merchant where the transaction takes place. Later on, these variables are used to build a model to be used in the fraud detection systems to distinguish fraudulent activities which show significant deviations from the card usage profile stored in the datamart. The number of transactions for each card differs from one to other; however, each transaction record is of the same fixed length and includes the same fields. Hand and Blunt gave a detailed description of the characteristics of credit card data [16]. These fields range from the date and hour of the transaction to the amount, transaction type, MCC code, address of the merchant where the transaction is done and etc. The date and hour of the transaction record shows when the transaction is made. Transaction type shows whether this transaction is a purchase or a cash-advance transaction. MCC code shows the type of the merchant store where the transaction takes place. These are fixed codes given by the members of the VISA International Service Association. However; however, many of these codes form natural groups. So, instead of working with hundreds of codes, we grouped them into 25 groups according to their nature and the risk of availability to commit a fraud. The goods or services bought from merchant stores in some MCC codes can be easily converted to cash. As a result, transactions belonging to these MCC codes are more open to fraud and more risky from the transactions belonging to others. The grouping of the MCC codes are done according to both the number of the fraudulent transactions made belonging to each MCC code and the interviews done with the personnel of the data supplier bank with domain expertise about the subject. The distribution of the data with respect to being normal
3 or fraudulent is highly imbalanced with a ratio of about normal transaction records to one fraudulent transaction record. So, to enable the models to learn both types of profiles, some under sampling or oversampling techniques should be used. Instead of oversampling the fraudulent records by making multiple copies or etc., we use stratified sampling to under sample the legitimate records to a meaningful number. III. CLASSIFICATION METHODS There are a lot of studies done on credit card fraud detection. The general background of the credit card systems and non-technical knowledge about this type of fraud can be learned from [17] and [18]. Most of the credit card fraud detection systems are using supervised algorithms such as neural networks and SVMs [7, 9, 11, 15, 19-24]. In this study, the performance of classifier models built by using the well-known decision tree methods C5.0, C&RT and CHAID and a number of different SVM methods (SVM with polynomial, sigmoid, linear and RBF kernel functions) are compared. A decision tree is a tree structure which attempts to separate the given records into mutually exclusive subgroups. To do this, starting from the root node, each node is split into child nodes in a binary or a multi split fashion related to the method used based on the value of the attribute (input variable) which separates the given records best. Records in a node is recursively separated into child nodes until there is no split that makes statistical difference on the distribution of the records in the node or the number of records in a node is too small. Each decision tree method uses its own splitting algorithms and splitting metrics. From the well-known decision tree algorithms, ID3, C5.0 and C&RT methods use impurity measures to choose the splitting attribute and the split value/s. ID3 [25] uses information gain while the successor, C5.0 uses gain ratio, and C&RT [26] uses Gini coefficient for impurity measurements. Unlikely, CHAID uses chi-square or F statistic to choose the splitting variable [26]. As the tree is grown, the resultant tree may overfit the training data containing possible errors or noise or some of the branches of the resultant tree may contain anomalies. So, the resultant tree should be checked whether removal of some nodes, starting from the leaf ones, make a significant effect on the tree s classification performance. This operation is called as pruning. After the tree is grown, a new observation or record is classified by tracing the route on the tree up to a leaf node according to the values of the attributes of the record. This is done by recursively checking the values of the splitting attribute of the record at each node and following the required branch of the tree until a leaf node is reached. The label of the leaf node reached gives the class which the new observation or record is classified in. Unlike the decision tree methods, SVM tries to find a hyperplane to separate the two classes while minimizing the classification error. SVM is a new and promising classification and regression technique proposed by Vapnik and his group at AT&T Bell Laboratories [27]. SVM learns a separating hyperplane which maximizes the margin and produces good generalization ability [28]. In prior literature, SVM has been successfully applied to many areas such as telecommunication fraud detection [29], pattern recognition [28], system intrusion detection [30] SVM s basic idea is to transform the attributes to a higher dimensional feature space and find the optimal hyperplane in that space that maximizes the margin between the classes. Briefly, SVM does this by using a polynomial, sigmoid, radial basis or a linear kernel function which satisfies the Mercer condition [31]. The kernel function reflects the geometric relationship between the input record and the support vector, or the similarities of the features of the classes. The details of the theory have been given in [32]. IV. APPROACH First of all, the collected data is pre-processed before starting the modeling phase. As mentioned earlier, the distribution of data with respect to the classes is highly imbalanced. The time period that is used to build our sample included 978 fraudulent records and 22 million normal ones with a ratio of about 1: So, stratified sampling is used to under sample the normal records so that the models have chance to learn the characteristics of both the normal and the fraudulent records profile. To do this, first of all, the variables that are most successful in discriminating the fraudulent and legitimate transactions are founded. Then, these variables are used to form stratified samples of the legitimate records. Later on, these stratified samples of the legitimate records are combined with the fraudulent ones to form three samples with different fraudulent to normal record ratios. The first sample set has a ratio of one fraudulent record to one normal record; the second one has a ratio of one fraudulent record to four normal ones; and the last one has the ratio of one fraudulent to nine normal ones. The variables which form the card usage profile and the methods used to build the model make the difference in the fraud detection systems. Our aim in defining the variables used to form the data-mart is to discriminate the profile of the fraudulent card usage by the fraudsters from the profile of legitimate (normal) card usage by the card holders. We will be content with mentioning about the type of variables used but regarding the privacy, confidentiality and security concerns, we are not allowed to talk on the full list of variables (a variable is a certain level of deviation from a personal average statistics). The variables are one of the five main types: all transactions statistics, regional statistics, sectorial statistics, daily amount statistics and daily number of transactions statistics. The chosen methods to build classifier models are C5.0, C&RT and CHAID from decision tree methods and SVMs with kernels of polynomial, sigmoid, linear and radial basis functions. All these methods are used to develop models using the three data sets. These methods and the parameters used with these methods are given in Table 1.
4 TABLE I INPUT PARAMETERS FOR CLASSIFIER MODELS data, but this can also lead to overfitting. Bias is enabled only if the kernel type is also set to Polynomial or Sigmoid. It sets the constant coefficient value in the kernel function. The default value 0 is suitable in most cases. Degree is enabled only if Kernel type is set to Polynomial. It controls the complexity (dimension) of the mapping space. TABLE II TRAINING AND TEST SET SIZES FOR THE SAMPLES C&RT method takes two parameters. The first one is the impurity measure used for deciding which split to use which is chosen as Gini coefficient, and the second one is the maximum number of surrogates. Surrogates are a method for dealing with missing values. For each split in the tree, decision tree method identifies the input fields that are most similar to the selected split field. Those fields are the surrogates for that split. When a record must be classified but has a missing value for a split field, its value on a surrogate field can be used to make the split. The maximum number of surrogates is chosen as 10. Unlike C&RT method, C5.0 uses gain ratio in the entropy as the impurity measure. CHAID uses Chi-Square distribution for splitting and merging operations. Accordingly, it takes two probabilities, the first one showing the significance level for splitting the node, and the second one showing the significance level for merging the nodes. SVM models built in this study use four different kernel functions: polynomial, sigmoid, radial basis and linear kernel functions. There are two parameters associated with RBF kernel: C and Ɣ. Regularization parameter (C) controls the trade-off between maximizing the margin and minimizing the training error term. Increasing the value improves the classification accuracy (or reduces the regression error) for the training data, but this can also lead to overfitting. RBF gamma should normally take a value between 3/k and 6/k, where k is the number of input fields. For example, if there are 12 input fields, values between 0.25 and 0.5 would be worth trying. Increasing the value improves the classification accuracy (or reduces the regression error) for the training data, but this can also lead to overfitting. Gamma is used in Polynomial or Sigmoid kernels. Increasing the value improves the classification accuracy (or reduces the regression error) for the training As mentioned above, three different sets are formed to compare the performances of the methods under these different conditions. In the first set, there is one normal transaction for each fraudulent one. In the second set, there are four normal transactions for each fraudulent one and there are nine normal ones for each fraudulent one in the third set. For each sample, 70% of the data, both 70% of the normal transactions and 70% of the fraudulent transactions, are taken as the training set for the models; and 30% of the data are taken as the test set to evaluate the performance of the models developed. The training set and test set of each sample is designed to have the same fraudulent records as the sets of other samples. The training and test set sizes are given in Table 2. In each sample, the number of fraudulent transactions is fixed to 973. The number of normal transactions changes according to the given ratios. For the training sets, 70% of the normal and 70% of the fraudulent transactions are chosen. The rest 30% of the normal and fraudulent transactions are left as test data set. Every model for a sample uses the same training and test data set so that a comparison of the performance of the models for each sample becomes possible. V. RESULTS AND DISCUSSIONS In this study, seven alternative models based on decision tree algorithms and SVM were built using the relevant training data set for each sample. To evaluate these models, we used the remaining transactions in the relevant test sets for each model. The fraudulent transactions in the test sets of the samples are identical. Accuracy rates were used to describe the usefulness of the models. Accuracy is probably the most commonly used metric to measure the performance of targeting models in classification applications. The performance among the predictive models is presented in Table 3. The left column shows the methods used to build the models, the columns named as Train show the prediction accuracy of the models on the training data set of the given samples, the columns named as Test show the prediction accuracy of the models on the testing
5 TABLE III PERFORMANCE OF CLASSIFIERS W.R.T. ACCURACY OVER TRAINING & TEST SETS data set of the given samples, the columns named as Build Time show the time elapsed for building the given model over the given sample, and columns Frauds show the number of fraudulent transactions in the test data sets assigned as fraud (True Positive) by the models over the given samples. From Table 3, it is clear that decision tree based models outperform the SVM based models when their performances are compared over the test data sets. Though the result is reverse when the performances are compared over the training data sets, it just shows that SVM based models overfit the training data. In this problem, assigning as many fraudulent transactions as fraudulent is the most important success factor. However, accuracy shows the rate of true assignments regardless of whether it a true fraud assignment or a true normal assignment. When the performances of the models are compared w.r.t. accuracy, it is seen that as the number of the training data increases, this overfitting behavior becomes less remarkable and the performances of the SVM based models become comparable to decision tree based models. But the number of frauds caught by SVM models are still far less than the decision tree models, especially C&RT model. Nevertheless, the number of actual fraudulent transactions assigned as fraudulent by the models are not strongly related with the accuracy of the assignments done by the models. Accordingly, accuracy is not a well matched performance metric for this problem domain. Though C5.0 model is the champion over the other models w.r.t. accuracy for each sample among the seven models, C&RT model catches the largest number of frauds for each sample except the first one. This provides a key factor in choosing the models and we choose the C&RT and C5.0 models as the final methods to build the prediction model. VI. CONCLUSIONS As usage of credit cards become more and more common in every field of the daily life, credit card fraud has become much more rampant. To improve security of the financial transaction systems in an automatic and effective way, building an accurate and efficient credit card fraud detection system is one of the key tasks for the financial institutions. In this study, seven classification methods were used to build fraud detecting models. The work demonstrates the advantages of applying the data mining techniques including decision trees and SVMs to the credit card fraud detection problem for the purpose of reducing the bank s risk. The results show that the proposed classifiers of C&RT and other decision tree approaches outperform SVM approaches in solving the problem under investigation. However, as the size of the training data sets become larger, the accuracy performance of SVM based models reach the performance of the decision tree based models, but the number of frauds caught by SVM models are still far less than the number of frauds caught by decision tree methods, especially C&RT model. Under this framework, financial institutions can utilize credit card fraud detection models to compare the transaction information with the historical profile patterns to predict the probability of being fraudulent for a new transaction, and provide a scientific basis for the authorization mechanisms. Furthermore, resources of the institutions can be focused on more suspicious transactions to decrease the fraud levels. As a future work, other data mining algorithms such as different versions of Artificial Neural Networks (ANN) and logistic regression will be used to build new classification models on the same real world dataset and the performance of the new models will be compared with the performance of the models given in this paper. Also, instead of making performance comparisons just over the prediction accuracy, these comparisons will be extended to include the comparisons over other performance metrics. REFERENCES [1] Hobson, A The Oxford Dictionary of Difficult Words. The Oxford University Press. New York. [2] Bolton, R. J. and Hand, D. J Statistical fraud detection: A review. Statistical Science 28(3): [3] Kou, Y. et. al Survey of fraud detection techniques. In Proceedings of the 2004 IEEE International Conference on Networking, Sensing and Control, Taipei, Taiwan, March
6 [4] Phua, C. et al A comprehensive survey of data mining-based fraud detection research. Artificial Intelligence Review. [5] Sahin, Y., Duman E. An overview of business domains where fraud can take place, and a survey of various fraud detection techniques. In Proceedings of the 1st International Symposium on Computing in Science and Engineering, Aydin, Turkey, June [6] Leonard, K. J Detecting credit card fraud using expert systems. Computers and Industrial Engineering, 25 [7] Ghosh, S. and Reilly D. L Credit card fraud detection with a neural network. In Proceedings of the 27th Hawaii International Conference on system Sciences, 3. [8] Mena, J Investigate Data Mining for Security and Criminal Detection. Amsterdam, Butterworth-Heinemann. [9] Aleskerov, E., Freisleben, B. and Rao, B CARDWATCH: A neural network based data mining system for credit card fraud detection. In Computational Intelligence for Financial Engineering. [10] Chen, R., Chiu, M., Huang, Y. and Chen, L Detecting credit card fraud by using questionnaire-responded transaction model based on SVMs. In Proceedings of IDEAL2004. [11] Brause, R., Langsdorf, T. and Hepp, M Neural data mining for credit card fraud detection. In Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence. [12] Stolfo, S. J., Fan, D. W., Lee, W., Prodromidis, A. L. and Chan, P. K Credit card fraud detection using meta-learning: Issues and initial results. In AAAI Workshop on AI Approaches to Fraud Detection and Risk Management. AAAI Press, Menlo Park, CA. [13] Stolfo, S., Fan, W., Lee, W., Prodromidis, A. L. and Chan, P Cost-based modeling for fraud and intrusion detection: Results from the JAM Project. In Proceedings of the DARPA Information Survivability Conference and Exposition. IEEE Computer Press, New York. [14] Prodromidis, A. L., Chan P. and Stolfo S. J Meta-learning in distributed data mining systems: issues and approaches. Advances of Distributed Data Mining, Editors Kargupta H. and Chan, P. AAAI Press. [15] Chen, R.-C., Luo, S.-T., Liang, X. and Lee, V. C. S Personalized approach based on SVM and ANN for detecting credit card fraud. In Proceedings of the IEEE International Conference on Neural Networks and Brain, Beijing, China [16] Hand, D. J. and Blunt G Prospecting gems in credit card data. IMA Journal of Management Mathematics, 12. [17] Dahl, J Card Fraud. In Credit Union Magazine. [18] Schindeler, S Fighting Card Fraud in the USA. In Credit Control, House of Words Ltd. [19] Dorronsoro, J. R., Ginel, F., Sanchez, C. and Cruz, C. S Neural fraud detection in credit card operations. IEEE Transactions on Neural Networks, 8 [20] Hanagandi, V., Dhar, A. and Buescher, K Density-Based Clustering and Radial Basis Function Modeling to Generate Credit Card Fraud Scores. In Proceedings of the IEEE/IAFE 1996 Conference. [21] Juszczak, P., Adams, N. M., Hand, D. J., Whitrow, C., and Weston, D. J Off-the-peg and bespoke classifiers for fraud detection. Computational Statistics & Data Analysis. 52(9). [22] Quah, J. T. and Sriganesh, M Real-time credit card fraud detection using computational intelligence. Expert Systems with Applications. 35(4). [23] Shen, A., Tong, R., Deng, Y Application of classification models on credit card fraud detection. In International Conference on Service Systems and Service Management, Chengdu, China, June [24] Syeda, M., Zhang, Y. and Pan, Y Parallel granular neural networks for fast credit card fraud detection. In Proceedings of the 2002 IEEE International Conference on Fuzzy Systems. [25] Han, J., & Camber, M. (2000). Data mining concepts and techniques. San Diego, USA: Morgan Kaufman [26] Koh, H. C., & Low, C. K. (2004). Going concern prediction using data mining techniques. Managerial Auditing Journal, 19(3), [27] Cortes, C., Vapnik, V Support vector network. Machine Learning, vol:20 pg [28] Burges, C.J.C A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery, Vol. 2, No.2, pp [29] Kim, H.-C., Pang, S., Je, H.-M., Kim, D., Bang, S. Y Constructing support vector machine ensemble. Pattern Recognition, Vol. 36, pg [30] Chen, R.C., Chen, J., Chen, T.S., Hsieh, C.H., Chen, T.Y. and K.Y. Wu Building an Intrusion Detection System Based on Support Vector Machine and Genetic Algorithm. Lecture Notes in Computer Science (LNCS), Vol. 3498, pp [31] Mercer, J. Function of positive and negative type and their connection with theory of integral equations, Philosophical Transactions of the Royal Society, London A 209 (1909) [32] Cristianini, N., Shawe-Taylor, J. An Introduction to Support Vector Machines and other Kernel-based Learning Methods, Cambridge University Press, Cambridge, UK, 2000.
Electronic Payment Fraud Detection Techniques
World of Computer Science and Information Technology Journal (WCSIT) ISSN: 2221-0741 Vol. 2, No. 4, 137-141, 2012 Electronic Payment Fraud Detection Techniques Adnan M. Al-Khatib CIS Dept. Faculty of Information
Meta Learning Algorithms for Credit Card Fraud Detection
International Journal of Engineering Research and Development e-issn: 2278-67X, p-issn: 2278-8X, www.ijerd.com Volume 6, Issue 6 (March 213), PP. 16-2 Meta Learning Algorithms for Credit Card Fraud Detection
Credit Card Fraud Detection Using Hidden Markov Model
International Journal of Soft Computing and Engineering (IJSCE) Credit Card Fraud Detection Using Hidden Markov Model SHAILESH S. DHOK Abstract The most accepted payment mode is credit card for both online
To improve the problems mentioned above, Chen et al. [2-5] proposed and employed a novel type of approach, i.e., PA, to prevent fraud.
Proceedings of the 5th WSEAS Int. Conference on Information Security and Privacy, Venice, Italy, November 20-22, 2006 46 Back Propagation Networks for Credit Card Fraud Prediction Using Stratified Personalized
Credit Card Fraud Detection Using Self Organised Map
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 13 (2014), pp. 1343-1348 International Research Publications House http://www. irphouse.com Credit Card Fraud
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM ABSTRACT Luis Alexandre Rodrigues and Nizam Omar Department of Electrical Engineering, Mackenzie Presbiterian University, Brazil, São Paulo [email protected],[email protected]
A Study of Detecting Credit Card Delinquencies with Data Mining using Decision Tree Model
A Study of Detecting Credit Card Delinquencies with Data Mining using Decision Tree Model ABSTRACT Mrs. Arpana Bharani* Mrs. Mohini Rao** Consumer credit is one of the necessary processes but lending bears
DATA MINING APPLICATION IN CREDIT CARD FRAUD DETECTION SYSTEM
Journal of Engineering Science and Technology Vol. 6, No. 3 (2011) 311-322 School of Engineering, Taylor s University DATA MINING APPLICATION IN CREDIT CARD FRAUD DETECTION SYSTEM FRANCISCA NONYELUM OGWUELEKA
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
A COMPARATIVE ASSESSMENT OF SUPERVISED DATA MINING TECHNIQUES FOR FRAUD PREVENTION
A COMPARATIVE ASSESSMENT OF SUPERVISED DATA MINING TECHNIQUES FOR FRAUD PREVENTION Sherly K.K Department of Information Technology, Toc H Institute of Science & Technology, Ernakulam, Kerala, India. [email protected]
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
Unsupervised Profiling Methods for Fraud Detection
Unsupervised Profiling Methods for Fraud Detection Richard J. Bolton and David J. Hand Department of Mathematics Imperial College London {r.bolton, d.j.hand}@ic.ac.uk Abstract Credit card fraud falls broadly
Decision Support Systems
Decision Support Systems 50 (2011) 602 613 Contents lists available at ScienceDirect Decision Support Systems journal homepage: www.elsevier.com/locate/dss Data mining for credit card fraud: A comparative
Support Vector Machines with Clustering for Training with Very Large Datasets
Support Vector Machines with Clustering for Training with Very Large Datasets Theodoros Evgeniou Technology Management INSEAD Bd de Constance, Fontainebleau 77300, France [email protected] Massimiliano
COMP3420: Advanced Databases and Data Mining. Classification and prediction: Introduction and Decision Tree Induction
COMP3420: Advanced Databases and Data Mining Classification and prediction: Introduction and Decision Tree Induction Lecture outline Classification versus prediction Classification A two step process Supervised
The Credit Card Fraud Detection Analysis With Neural Network Methods
The Credit Card Fraud Detection Analysis With Neural Network Methods 1 M.Jeevana Sujitha, 2 K. Rajini Kumari, 3 N.Anuragamayi 1,2,3 Dept. of CSE, A.S.R College of Engineering & Tech., Tetali, Tanuku, AP,
Statistics in Retail Finance. Chapter 7: Fraud Detection in Retail Credit
Statistics in Retail Finance Chapter 7: Fraud Detection in Retail Credit 1 Overview > Detection of fraud remains an important issue in retail credit. Methods similar to scorecard development may be employed,
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Scott Pion and Lutz Hamel Abstract This paper presents the results of a series of analyses performed on direct mail
Research Article FraudMiner: A Novel Credit Card Fraud Detection Model Based on Frequent Itemset Mining
e Scientific World Journal, Article ID 252797, 10 pages http://dx.doi.org/10.1155/2014/252797 Research Article FraudMiner: A Novel Credit Card Fraud Detection Model Based on Frequent Itemset Mining K.
Intrusion Detection via Machine Learning for SCADA System Protection
Intrusion Detection via Machine Learning for SCADA System Protection S.L.P. Yasakethu Department of Computing, University of Surrey, Guildford, GU2 7XH, UK. [email protected] J. Jiang Department
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier D.Nithya a, *, V.Suganya b,1, R.Saranya Irudaya Mary c,1 Abstract - This paper presents,
E-commerce Transaction Anomaly Classification
E-commerce Transaction Anomaly Classification Minyong Lee [email protected] Seunghee Ham [email protected] Qiyi Jiang [email protected] I. INTRODUCTION Due to the increasing popularity of e-commerce
ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION
ISSN 9 X INFORMATION TECHNOLOGY AND CONTROL, 00, Vol., No.A ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION Danuta Zakrzewska Institute of Computer Science, Technical
Index Contents Page No. Introduction . Data Mining & Knowledge Discovery
Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.
Survey on Credit Card Fraud Detection Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN: 2319-7242 Volume 4 Issue 11 Nov 2015, Page No. 15010-15015 Survey on Credit Card Fraud Detection Techniques Priya Ravindra Shimpi,
AN UPDATE RESEARCH ON CREDIT CARD ON-LINE TRANSACTIONS
AN UPDATE RESEARCH ON CREDIT CARD ON-LINE TRANSACTIONS Falaki S. O. Alese B. K. Department of Computer Science, Federal University of Technology, Akure, Ondo State, Nigeria. Ismaila W. O. Department of
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
Data Mining for Knowledge Management. Classification
1 Data Mining for Knowledge Management Classification Themis Palpanas University of Trento http://disi.unitn.eu/~themis Data Mining for Knowledge Management 1 Thanks for slides to: Jiawei Han Eamonn Keogh
Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal
Learning Example Chapter 18: Learning from Examples 22c:145 An emergency room in a hospital measures 17 variables (e.g., blood pressure, age, etc) of newly admitted patients. A decision is needed: whether
Classification of Bad Accounts in Credit Card Industry
Classification of Bad Accounts in Credit Card Industry Chengwei Yuan December 12, 2014 Introduction Risk management is critical for a credit card company to survive in such competing industry. In addition
How To Detect Credit Card Fraud
Card Fraud Howard Mizes December 3, 2013 2013 Xerox Corporation. All rights reserved. Xerox and Xerox Design are trademarks of Xerox Corporation in the United States and/or other countries. Outline of
HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION
HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION Chihli Hung 1, Jing Hong Chen 2, Stefan Wermter 3, 1,2 Department of Management Information Systems, Chung Yuan Christian University, Taiwan
Data Mining: Overview. What is Data Mining?
Data Mining: Overview What is Data Mining? Recently * coined term for confluence of ideas from statistics and computer science (machine learning and database methods) applied to large databases in science,
Fraud Detection in Credit Card Using DataMining Techniques Mr.P.Matheswaran 1,Mrs.E.Siva Sankari ME 2,Mr.R.Rajesh 3
Fraud Detection in Credit Card Using DataMining Techniques Mr.P.Matheswaran 1,Mrs.E.Siva Sankari ME 2,Mr.R.Rajesh 3 1 P.G. Student, Department of CSE, Govt.College of Engineering, Thirunelveli, India.
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Classification algorithm in Data mining: An Overview
Classification algorithm in Data mining: An Overview S.Neelamegam #1, Dr.E.Ramaraj *2 #1 M.phil Scholar, Department of Computer Science and Engineering, Alagappa University, Karaikudi. *2 Professor, Department
SURVIVABILITY OF COMPLEX SYSTEM SUPPORT VECTOR MACHINE BASED APPROACH
1 SURVIVABILITY OF COMPLEX SYSTEM SUPPORT VECTOR MACHINE BASED APPROACH Y, HONG, N. GAUTAM, S. R. T. KUMARA, A. SURANA, H. GUPTA, S. LEE, V. NARAYANAN, H. THADAKAMALLA The Dept. of Industrial Engineering,
A COMPARATIVE STUDY OF DECISION TREE ALGORITHMS FOR CLASS IMBALANCED LEARNING IN CREDIT CARD FRAUD DETECTION
International Journal of Economics, Commerce and Management United Kingdom Vol. III, Issue 12, December 2015 http://ijecm.co.uk/ ISSN 2348 0386 A COMPARATIVE STUDY OF DECISION TREE ALGORITHMS FOR CLASS
Chapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
A Novel Approach for Credit Card Fraud Detection Targeting the Indian Market
www.ijcsi.org 172 A Novel Approach for Credit Card Fraud Detection Targeting the Indian Market Jaba Suman Mishra 1, Soumyashree Panda 2, Ashis Kumar Mishra 3 1 Department Of Computer Science & Engineering,
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
A fast multi-class SVM learning method for huge databases
www.ijcsi.org 544 A fast multi-class SVM learning method for huge databases Djeffal Abdelhamid 1, Babahenini Mohamed Chaouki 2 and Taleb-Ahmed Abdelmalik 3 1,2 Computer science department, LESIA Laboratory,
International Journal of Computer Science and Network (IJCSN) Volume 1, Issue 4, August 2012 www.ijcsn.org ISSN 2277-5420.
Credit Card Fraud: The study of its impact and detection techniques 1 Khyati Chaudhary, 2 Bhawna Mallick 1 Dept. of Computer Science, GCET, Page 31 2 Dept. of Computer Science, GCET, Greater Noida Abstract
Techniques for Fraud Detection
Analysis on Credit Card Fraud Detection Methods 1 Renu HCE Sonepat 2 Suman HCE Sonepat Abstract Due to the theatrical increase of fraud which results in loss of dollars worldwide each year, several modern
IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES
IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES Bruno Carneiro da Rocha 1,2 and Rafael Timóteo de Sousa Júnior 2 1 Bank of Brazil, Brasília-DF, Brazil [email protected] 2 Network Engineering
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Application of Hidden Markov Model in Credit Card Fraud Detection
Application of Hidden Markov Model in Credit Card Fraud Detection V. Bhusari 1, S. Patil 1 1 Department of Computer Technology, College of Engineering, Bharati Vidyapeeth, Pune, India, 400011 Email: [email protected]
The Data Mining Process
Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data
Dan French Founder & CEO, Consider Solutions
Dan French Founder & CEO, Consider Solutions CONSIDER SOLUTIONS Mission Solutions for World Class Finance Footprint Financial Control & Compliance Risk Assurance Process Optimization CLIENTS CONTEXT The
Exploration of Data mining techniques in Fraud Detection: Credit Card Khyati Chaudhary
International Journal of Electronics and Computer Science Engineering 1765 Available Online at www.ijecse.org ISSN- 2277-1956 Exploration of Data mining techniques in Fraud Detection: Credit Card Khyati
A Survey on Outlier Detection Techniques for Credit Card Fraud Detection
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 16, Issue 2, Ver. VI (Mar-Apr. 2014), PP 44-48 A Survey on Outlier Detection Techniques for Credit Card Fraud
Azure Machine Learning, SQL Data Mining and R
Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:
Classification and Prediction
Classification and Prediction Slides for Data Mining: Concepts and Techniques Chapter 7 Jiawei Han and Micheline Kamber Intelligent Database Systems Research Lab School of Computing Science Simon Fraser
Predicting required bandwidth for educational institutes using prediction techniques in data mining (Case Study: Qom Payame Noor University)
260 IJCSNS International Journal of Computer Science and Network Security, VOL.11 No.6, June 2011 Predicting required bandwidth for educational institutes using prediction techniques in data mining (Case
Data quality in Accounting Information Systems
Data quality in Accounting Information Systems Comparing Several Data Mining Techniques Erjon Zoto Department of Statistics and Applied Informatics Faculty of Economy, University of Tirana Tirana, Albania
Data Mining Classification: Decision Trees
Data Mining Classification: Decision Trees Classification Decision Trees: what they are and how they work Hunt s (TDIDT) algorithm How to select the best split How to handle Inconsistent data Continuous
Scalable Developments for Big Data Analytics in Remote Sensing
Scalable Developments for Big Data Analytics in Remote Sensing Federated Systems and Data Division Research Group High Productivity Data Processing Dr.-Ing. Morris Riedel et al. Research Group Leader,
ClusterOSS: a new undersampling method for imbalanced learning
1 ClusterOSS: a new undersampling method for imbalanced learning Victor H Barella, Eduardo P Costa, and André C P L F Carvalho, Abstract A dataset is said to be imbalanced when its classes are disproportionately
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES
ISSN: 2229-6956(ONLINE) ICTACT JOURNAL ON SOFT COMPUTING, JULY 2012, VOLUME: 02, ISSUE: 04 BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES V. Dheepa 1 and R. Dhanapal 2 1 Research
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Support Vector Machines Explained
March 1, 2009 Support Vector Machines Explained Tristan Fletcher www.cs.ucl.ac.uk/staff/t.fletcher/ Introduction This document has been written in an attempt to make the Support Vector Machines (SVM),
Immune Support Vector Machine Approach for Credit Card Fraud Detection System. Isha Rajak 1, Dr. K. James Mathai 2
Immune Support Vector Machine Approach for Credit Card Fraud Detection System. Isha Rajak 1, Dr. K. James Mathai 2 1Department of Computer Engineering & Application, NITTTR, Shyamla Hills, Bhopal M.P.,
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE Kasra Madadipouya 1 1 Department of Computing and Science, Asia Pacific University of Technology & Innovation ABSTRACT Today, enormous amount of data
Gerry Hobbs, Department of Statistics, West Virginia University
Decision Trees as a Predictive Modeling Method Gerry Hobbs, Department of Statistics, West Virginia University Abstract Predictive modeling has become an important area of interest in tasks such as credit
KEITH LEHNERT AND ERIC FRIEDRICH
MACHINE LEARNING CLASSIFICATION OF MALICIOUS NETWORK TRAFFIC KEITH LEHNERT AND ERIC FRIEDRICH 1. Introduction 1.1. Intrusion Detection Systems. In our society, information systems are everywhere. They
International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015
RESEARCH ARTICLE OPEN ACCESS Data Mining Technology for Efficient Network Security Management Ankit Naik [1], S.W. Ahmad [2] Student [1], Assistant Professor [2] Department of Computer Science and Engineering
Towards applying Data Mining Techniques for Talent Mangement
2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Towards applying Data Mining Techniques for Talent Mangement Hamidah Jantan 1,
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES Vijayalakshmi Mahanra Rao 1, Yashwant Prasad Singh 2 Multimedia University, Cyberjaya, MALAYSIA 1 [email protected]
Credit Card Fraud Detection and Concept-Drift Adaptation with Delayed Supervised Information
Credit Card Fraud Detection and Concept-Drift Adaptation with Delayed Supervised Information Andrea Dal Pozzolo, Giacomo Boracchi, Olivier Caelen, Cesare Alippi, and Gianluca Bontempi 15/07/2015 IEEE IJCNN
SVM Ensemble Model for Investment Prediction
19 SVM Ensemble Model for Investment Prediction Chandra J, Assistant Professor, Department of Computer Science, Christ University, Bangalore Siji T. Mathew, Research Scholar, Christ University, Dept of
Automatic Bank Fraud Detection Using Support Vector Machines
Automatic Bank Fraud Detection Using Support Vector Machines Djeffal Abdelhamid 1, Soltani Khaoula 1, Ouassaf Atika 2 1 Computer science department, LESIA Laboratory, Biskra University, Algeria 2 Economic
Spam detection with data mining method:
Spam detection with data mining method: Ensemble learning with multiple SVM based classifiers to optimize generalization ability of email spam classification Keywords: ensemble learning, SVM classifier,
Data Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.1 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Classification vs. Numeric Prediction Prediction Process Data Preparation Comparing Prediction Methods References Classification
COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments
Contents List of Figures Foreword Preface xxv xxiii xv Acknowledgments xxix Chapter 1 Fraud: Detection, Prevention, and Analytics! 1 Introduction 2 Fraud! 2 Fraud Detection and Prevention 10 Big Data for
Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence
Artificial Neural Networks and Support Vector Machines CS 486/686: Introduction to Artificial Intelligence 1 Outline What is a Neural Network? - Perceptron learners - Multi-layer networks What is a Support
ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA
ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA D.Lavanya 1 and Dr.K.Usha Rani 2 1 Research Scholar, Department of Computer Science, Sree Padmavathi Mahila Visvavidyalayam, Tirupati, Andhra Pradesh,
Network Intrusion Detection using Semi Supervised Support Vector Machine
Network Intrusion Detection using Semi Supervised Support Vector Machine Jyoti Haweliya Department of Computer Engineering Institute of Engineering & Technology, Devi Ahilya University Indore, India ABSTRACT
A new Approach for Intrusion Detection in Computer Networks Using Data Mining Technique
A new Approach for Intrusion Detection in Computer Networks Using Data Mining Technique Aida Parbaleh 1, Dr. Heirsh Soltanpanah 2* 1 Department of Computer Engineering, Islamic Azad University, Sanandaj
Meta-learning. Synonyms. Definition. Characteristics
Meta-learning Włodzisław Duch, Department of Informatics, Nicolaus Copernicus University, Poland, School of Computer Engineering, Nanyang Technological University, Singapore [email protected] (or search
How To Make A Credit Risk Model For A Bank Account
TRANSACTIONAL DATA MINING AT LLOYDS BANKING GROUP Csaba Főző [email protected] 15 October 2015 CONTENTS Introduction 04 Random Forest Methodology 06 Transactional Data Mining Project 17 Conclusions
SURVEY OF TEXT CLASSIFICATION ALGORITHMS FOR SPAM FILTERING
I J I T E ISSN: 2229-7367 3(1-2), 2012, pp. 233-237 SURVEY OF TEXT CLASSIFICATION ALGORITHMS FOR SPAM FILTERING K. SARULADHA 1 AND L. SASIREKA 2 1 Assistant Professor, Department of Computer Science and
Data Mining Practical Machine Learning Tools and Techniques
Ensemble learning Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 8 of Data Mining by I. H. Witten, E. Frank and M. A. Hall Combining multiple models Bagging The basic idea
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be
A SYSTEM FOR DENIAL OF SERVICE ATTACK DETECTION BASED ON MULTIVARIATE CORRELATION ANALYSIS
Journal homepage: www.mjret.in ISSN:2348-6953 A SYSTEM FOR DENIAL OF SERVICE ATTACK DETECTION BASED ON MULTIVARIATE CORRELATION ANALYSIS P.V.Sawant 1, M.P.Sable 2, P.V.Kore 3, S.R.Bhosale 4 Department
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 123 CHAPTER 7 BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 7.1 Introduction Even though using SVM presents
Data Mining Techniques for Prognosis in Pancreatic Cancer
Data Mining Techniques for Prognosis in Pancreatic Cancer by Stuart Floyd A Thesis Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUE In partial fulfillment of the requirements for the Degree
Detecting Corporate Fraud: An Application of Machine Learning
Detecting Corporate Fraud: An Application of Machine Learning Ophir Gottlieb, Curt Salisbury, Howard Shek, Vishal Vaidyanathan December 15, 2006 ABSTRACT This paper explores the application of several
MS1b Statistical Data Mining
MS1b Statistical Data Mining Yee Whye Teh Department of Statistics Oxford http://www.stats.ox.ac.uk/~teh/datamining.html Outline Administrivia and Introduction Course Structure Syllabus Introduction to
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
How To Solve The Kd Cup 2010 Challenge
A Lightweight Solution to the Educational Data Mining Challenge Kun Liu Yan Xing Faculty of Automation Guangdong University of Technology Guangzhou, 510090, China [email protected] [email protected]
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
Data Mining Applications in Higher Education
Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2
A Secured Approach to Credit Card Fraud Detection Using Hidden Markov Model
A Secured Approach to Credit Card Fraud Detection Using Hidden Markov Model Twinkle Patel, Ms. Ompriya Kale Abstract: - As the usage of credit card has increased the credit card fraud has also increased
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
