Analyzing Call Center Performance: A Data Mining Approach
|
|
|
- Joella Palmer
- 10 years ago
- Views:
Transcription
1 Analyzing Call Center Performance: A Data Mining Approach Ruiyuan Guo, Ajith Abraham* and Marcin Paprzycki** Computer Science Department, Oklahoma State University, USA [email protected] * IITA Professorship Program, School of Computer Science and Engineering, Chung-Ang University, Korea, [email protected] ** Warsaw School of Social Psychology, ul. Chodakowska 19/31, Warsaw, Poland [email protected]. Abstract. Society is becoming more accustomed to toll-free numbers as an efficient way to request and receive services in all aspects of their lives. While a move can be observed to eliminate humans as handlers of most rudimentary customer requests, responding to telephone calls remains a top priority in customer service. Call centers are either managed in-house or contracted out and provide a variety of services. The performance of the call center depends on the performance of its customer service representatives and the call handling regulations. The research aims to apply some well-known data mining techniques such as neural networks, classification and regression trees, support vector machines and a hybrid decision tree neural network approach to the problem of predicting the quality of service in call centers; based on the performance data actually collected in call centers of a large insurance company. We also applied the apriori association rule mining algorithm to find interesting features among the variables. We first compared the performance of models built using the above-mentioned techniques and then we analyzed the characteristics of the input sensitivity in order to better understand the relationship between the performance evaluation process and the actual performance to help improve management and performance of call centers. 1 Introduction Recent research studies have linked information management as the most powerful tool to optimize the performance of call centers. Efficient call center management directly contributes to the success of the whole organization. The performance of call centers depends on the performance of its customer service representatives (CSRs) and the call handling regulations. Most existing large call centers collect data which is then used to assess and improve the performance of its representatives [1][2][5][13][15]. Typically, such data includes some form of quality assessment, time management representation, and business processing aspects [3][7]. While data mining has been applied to analyze the customer behavior with its main aim to improve the customer satisfaction, there is not much research on mining the data of performance of call center representatives. Therefore, the aim of our research is to fill this gap by applying data mining techniques to the combined performance evaluation results collected from four call cen-
2 ters of a large US-based insurance company [14]. The remaining parts of this paper are organized as follows. In section 2, we summarize the related research that we were able to locate, and follow it with a short description of different data mining techniques used in our research (Section 3). Section 4 introduces data used in our study and presents results of our experiments, including sensitivity analysis. We briefly summarize our findings in Section 5. 2 Summary of related research As indicated above, we were able to find only results related to mining customer-related data. Some vendors of (call center) monitoring systems such as etalk and Gartner Group have built data mining tools directly into their monitoring systems. These tools are intended primarily for non-experts, such as supervisors and managers, who can mine the available data by asking what if type questions [8]. In this way it was found, for instance, that transfers of calls between representatives tend to frustrate customers. Predictive modeling such as decision-tree or neural network based techniques can be used to predict customer behavior. Quaero LLC used such techniques to cluster customers according to their current and their potential value [10]. Text data mining has also been applied in the context of call centers. Busemann et al. classified request from customers based on shallow text processing and machine learning techniques. Their system was able to correctly respond to s with an accuracy of 73 [11]. Next, audio data mining has been experimented with. ScanSoft used context-free-grammar to parse the speech and followed it by Sequence Package Analysis to caption the text to which data mining is applied. This approach allowed capturing early warning signs of caller frustration [4]. Finally, web usage mining has been applied to web-based activities of call centers. Techniques utilized here were similar to these used in other cases of web mining [9]. 3 Data mining techniques used in our research Data mining is often defined as information extraction activity with a goal of discovering facts hidden in (large) datasets. Using a combination of machine learning, statistical analysis, modeling techniques and database technology, data mining finds patterns and/or relationships in data and infers rules that allow the prediction of future results. There exist a number of popular data mining techniques and in what follows we summarize these that were used in our work Multi-layer perceptron Multi-layer Perceptron (MLP) is the most popular neural network architecture. It consists of at least three layers, an input layer of source neurons, at least one hidden layer of computational neurons, and an output layer of computational neuron(s). The input layer accepts inputs and redistributes to all the neurons of the middle layer. The neurons in the middle layer detect features exiting in input data and pass the features to the output layer. The output layer uses the features to determine the output patterns.
3 3.2. Linear neural networks Linear neural networks (LNN) have just two layers: an input layer and an output layer. Linear models have good performance on linear problems. However, they cannot solve more complex problems. Linear networks can be trained to serve as a base comparison for non-linear problems. Linear model building is relatively simple and not many parameters need to be selected in the process. We used the standard pseudo-inverse (SVD) linear optimization algorithm Probabilistic neural networks Probabilistic neural networks (PNN) have been developed for classification problems and utilize kernelbased estimation. They usually have three layers: one input layer, one hidden layer and one output layer. The network embeds the training cases into the hidden layer, which has as many neurons as there are training cases. The output layer combines the estimates and produces the output Classification and regression trees Classification and regression trees (CART) are techniques based on the tree structured binary decisions. Each decision tree has internal and leaf nodes. Leaf nodes represent the final decision or prediction. CART labels each leaf node a unique increasing integer number from left to right starting from 1. All the records in the dataset are assigned an integer. CART creates decision trees to predict categorical dependencies by using both categorical and continuous predictors Support vector machine Support vector machine (SVM) is a binary learning method [12]. It conducts computational learning based on structural risk minimization that finds a hypothesis h for which the lowest true error is guaranteed. The true error of h is the probability that h will make an error for an unseen and randomly selected case. An upper bound of the true error can be used for h. Support vector machine finds the hypothesis h and minimizes the bound of the true error Hybrid Decision Tree Neural Network Finally, the above-described techniques can be combined using a hybrid decision tree neural network technique [17]as depicted in Figure 1. In this case, data is fed into the decision tree first and then the leave node information is obtained and added into the dataset used by the neural network as an additional variable (new attribute). The training data together with the node information were supplied for training ANN. Figure 2 illustrates a decision tree structure. For the neural network we have used the multi-layer perceptron with three layers and the backpropagation learning for training. Here, the same training parameters were used as for the CART and the MLP.
4 Fig. 1. Decision Tree-ANN Hybrid Model Fig. 2. Decision tree structure 4 Call center performance data The data used in this study is one year worth of actual data from the performance evaluation database of four call centers of a large nationwide (US-based) insurance company. In this company, each customer service representative (CSR) is being evaluated monthly. To this effect randomly selected calls are recorded (out of ten to sixty calls answered daily by each representative) and the monitoring system constantly keeps up to ten calls for each CSR available. Of these, six randomly selected calls are used by a group of (human) evaluators to assess the CSR s performance. In the insurance company from which the data was obtained, there are two main attributes against which the performance of its representatives is evaluated: (a) customer service satisfaction and (b) business need satisfaction. The customer service satisfaction score assigned to the CSR is an aggregate result of evaluation based on eleven features. Exactly the same features are used for all products serviced by the CSRs (e.g. life insurance, homeowners insurance, etc.) and in all call centers. Typical way of evaluating performance with regard to these features is by asking questions like: did the CSR thank the customer for calling the company? or did a CSR ask what else (s)he can help customer with? The result of the evaluation is an integer between 0 and 5. Here, 0 means that a given feature was not applicable to the call. A score of 1 indicates
5 that the CSR did not meet the expectation; a 2 signifies that the expectation was met to some degree (denoted met some ); a 3 indicates meeting the expectation; a 4 specifies exceeding the expectation, and a 5 represents the case when the CSR far exceeded the expectation. These results are then aggregated to a value representing the total level of meeting the customer service satisfaction. For example, let us assume that an evaluator reviewed the call and found that only three questions out of eleven were applicable to that particular call and graded the performance of the CSR as 3, 4 and 1 (according to how the CSR performed when she/he answered the call). The evaluator also marked the remaining eight questions as 0 (not applicable). The final score of customer service satisfaction was then calculated as the sum (8) divided by the number of applicable questions (3), resulting in the score equal to The monthly score is the total score of all applicable questions of all six evaluated calls divided by the total number of applicable questions. Business need satisfaction is scored exactly the same way as the customer service satisfaction. However, the number of evaluated features/questions varies between products. Typical questions are did a CSR provide correct information to customer or did a CSR access proper systems or documents. Depending on the product, the minimal number of questions is eight and the maximal number is sixteen. Although the final scores of customer service satisfaction and business need satisfaction are continuous numbers ranging from 1 to 5, in the call centers, which were the source of the data used in the research, these results were then converted to monthly evaluations according to the following rules: Quality Table 1. Rules for converting scores into final evaluation Not met score < 2 met some score >= 2 and score < 3 Met score >= 3 and score < 4 Exceeded score >= 4 and score < 4.75 far exceeded score >= 4.75 Table 2. Dataset Description Category Attribute Name Data Type Format Example Time management Agent ID Integer - 1, 201, etc Date of Data Collection Date mm/01/yyyy 09/01/2001 Training Boolean 0, 1 0 Product ID Integer - 226, 3927 Customer Service Satisfaction Category 1, 2, 3, 4 3 Business Needs Satisfaction Category 1, 2, 3, 4, 5 4 After Call Work Time Integer 1, 2, 3, 180 Adherence Float Percentage 96 Attendance Integer 1, 2, 3, 2 Auxiliary Float Percentage 4
6 In addition to the above described customer service satisfaction and business need satisfaction attributes, four additional attributes of time management are utilized to evaluate the performance of the CSR and they are: adherence, after call work time, auxiliary and attendance. The time management data is collected from phone switches on monthly basis. Adherence is the percentage of the length of time a CSR is logged into the phone switch to the length of time he/she is supposed to be logged in. After call work time is the average number of seconds that a CSR spends on post-processing data after calls (during a given month). Auxiliary is the percentage of the length of time a CSR is spending on personal activity to the length of time that a CSR is logged into the phone switch. Attendance is a CSR s monthly absence. Finally, in the available data, there is a Boolean attribute representing the fact that the CSR is / is not in a training period; each record has a time stamp; and there is an attribute representing which product a CSR is servicing. In summary, there are total ten attributes in the dataset utilized in our project and they are summarized in Table Data Cleaning and Preparation As follows from the above, values of customer service and business need satisfaction should fall between one and five. We have therefore removed from the available dataset all records with data outside of these bounds. The value of time management categories should all be equal to or above zero. The values below zero are not valid and were deleted. The records that had other missing values were also deleted from the dataset. Finally, when preparing the data, we have found that the distribution of the scores of the customer service satisfaction attribute was bad. Only six records fell into the not met and thirteen into the far exceeded categories. These records were therefore deleted since they were too few to meaningfully participate in training and testing. Furthermore, the majority of the records fell into the met class. This class was thus separated at 3.5 into two sub-classes. We have then utilized both the dataset with the big met class and the dataset with the sub-class division and compared the performance of models build in both cases. After the cleaning, a total of records were left in the customer service dataset (1469, 5965, 5841, 1396 in four subcategories, when the met class was separated) and records in the business need dataset (63, 3533, 5974, 3610, 1510 in each category). Different products have different expected values of after call work, adherence and auxiliary categories. For example, 150 seconds may be a short after call work time for one of the products but a long time for another. Thus the after call work time, adherence and auxiliary were normalized to real numbers from the interval (0, 1). Finally, all of the remaining attributes, except date, were scaled similarly. There are eight input attributes in the final dataset, which are agent ID, date, product ID, training, ACW, aux, adherence and attendance (see Table 2). There are two output attributes: customer service satisfaction and business needs satisfaction. To achieve the best performance, a separate model was built for each of the output attributes. There are four (or three depending if the met class was separated or not) possible
7 output values for the customer service satisfaction and five possible outputs values for the business needs satisfaction. All of the data mining algorithms used in our work utilize random sampling. Each experiment was repeated several times. In all cases the results from the same algorithm were very close so we could make the assumption that the results are representative. 4.2 Experiment setup For the MLP we used one hidden layer. After a trial and error approach by varying the number of neurons from fifty to a hundred-twenty, we finalized the architecture with 113 neurons. There are eight neurons in the input layer since there are eight input attributes. There is one neuron in each model for one output. We used both a single-phase backpropagation based training and a two-phase backpropagation (BP) combined with conjugate gradient (CG) training. Furthermore, we have applied a typical split of available data into 50 for training, 25 for testing, and the remaining 25 for cross validation. Same datasets were used for the different machine learning algorithms. We used 100 epochs for both the backpropagation and the conjugate gradient. In the PNN, in the hidden layer we used 7337 neurons for training the customer service need satisfaction attribute and 7346 for the business needs satisfaction attribute. In the CART algorithm, to achieve the best performance Gini was selected for goodness of fit measurement. We used a maximum tree height of 32 (that resulted in the best performance). A hybrid decision tree-neural network was constructed as described in Section 3. For SVM s we used several kernels and after a trial and error approach, we report data obtained with the third degree polynomial kernel, which resulted in its best performance. 4.3 Analysis of predictive performance We start reporting obtained results from the application of machine learning techniques as tools that are to allow prediction of CSR performance. The effectiveness of the technique is calculated on the basis of the classification accuracy of testing results. The final result is the sum of total the number correct prediction of the correct category and the correct prediction of the incorrect category divided by the total number of testing cases. The performance of a perfect model is 100 for both the correct category and the incorrect category. The models that have accuracy near 100 are good. A random classifier should exhibit a 50 accuracy. Table 3 shows the performance of each model for predicting the customer service satisfaction attribute. The results of the met class are shown both for the case of one large class and two subclasses. According to the overall results from the confusion matrix, the ranking of the performance of the trained models is CART, PNN, SVM, BP/CG, BP, Hybrid and the LNN. There are no apparent difference among the BP/CG, the BP and the hybrid. For example for the Met 1 class, there were 5969 records out of falling into correct category in the dataset and the remaining 8702 records fell into wrong category. CART predicted 4443 out of 5969 correctly, which was 74.43
8 shown as correct prediction of the correct class. CART predicted 6124 out of 8702 correctly, which was Since 25 of the records in the dataset were used for cross validation for the LNN, MLP, PNN, and SVM, which is different from CART (10 fold cross-validation), the base to calculate the accuracy was different from CART, which was For example for the met 1 class again, 1448 records out of 3668 fell into the correct category and the remaining 2220 records fell into the wrong category. 873 records out of 1448 were predicted as correct correctly, which is out of 2220 were predicted as wrong correctly, which is Table 3 also shows the accuracy details for customer service satisfaction. As far as the met class is considered, usually the prediction of one large class has higher accuracy. However, it is not true for the customer service satisfaction. The performance for one large class is very close to the performance of predicting sub-classes indicating that the big class has more noise. Our research reveals that the scale used for the customer service satisfaction attribute evaluation is incorrect and mixes data without good differentiation. The CSRs in sub-class 1 are more likely to be met-some performers. The CSRs in sub-class two are more likely to be exceeded performers. Table 3. Classification Accuracy of Customer Service Prediction Customer Service Skills Cross Validation Class Case # Linear BP CG BP/CG PNN CART Hybrid SVM Correct Met Some Wrong Overall Correct Met 1 Wrong Overall Correct Met 2 Wrong Overall Correct Met (1 and 2) Wrong Overall Correct Exceeded Wrong Overall Table 4 illustrates the performance of each model for the predicting business need satisfaction attribute. The way to calculate the performance of business need prediction is exactly the same as the way for customer service. The ranking of the performance is the same as the models for the customer service satisfaction. After looking into the performance accuracy of each correct/wrong class, the research found that the PNN based models are not valid for the dataset used. The performance of the BP/CG is slightly better than that of the BP. However, the results are very close and it would be unwarranted to make the
9 conclusion that the models trained by the BP/CG exhibit better performance than the ones trained by the BP alone. The performance of the hybrid model was at least the same as that of the CART. However, the overall accuracy of the hybrid and the CART is somewhat better than that of the BP and the BP/CG models. The LNN model serves as a comparison for other models. Models trained by other algorithms are supposed to have at least the performance that linear models can reach. CART models have the best performance in the research. They not only have the best overall performance, but also they have highest accuracy to predict correct (C1) and incorrect (C0) for all each class. Table 4. Classification Accuracy of Business Need Prediction Business Need Satisfaction - Cross Validation Class Case # Linear BP CG BP/CG PNN CART Hybrid SVM Not met Met some Met Exceeded Far exceeded Correct Wrong Overall Correct Wrong Overall Correct Wrong Overall Correct Wrong Overall Correct Wrong Overall Inputs sensitivity analysis We now proceed with the input sensitivity analysis. The sensitivity is calculated by the accumulated errors when a particular attribute is removed from the training. In this case, the higher the error is, the more important the attribute is and the smaller the error, the less the importance of a given attribute. The importance of individual inputs is ranked by the accumulated error. Tables 5 and 6 illustrate the ranking of the various attributes for customer service satisfaction and business needs satisfaction prediction. First, product is very important to predicting customer service satisfaction, which indicates that CSRs who are servicing some products have more opportunity to exceed and far exceed than the CSRs in other products. Adherence (how much time of the required time a CSR spends logged into the switch) reveals the commitment toward work and is found to be also important. Here, a good attitude or commitment toward work in general, may also lead to a good customer service performance. Another interesting
10 characteristic is that date is important when predicting customer service satisfaction. The reason why date is important may be that dates are interrelated with call types. One type of calls may be dominant of all types of calls during a certain period. After that period, calls of another type become the majority in the call volume in the next period. Since we cannot account for call types (unavailability of further data) we can only speculate that the affect of call types may materialize as the date parameter. Another way to explain the importance of the date may be the training or coaching delivery date. The customer service satisfaction may be improved right after the coaching or training session and may drop after a certain time afterwards. The ranking analysis from the LNN, BP, BP/CG and Hybrid model are pretty consistent in predicting business need satisfaction. The product becomes more important in predicting business needs satisfaction from not met class to the far-exceeded class. This can be interpreted that a CSR has more opportunity to be far exceeding if a CSR services a particular product and less opportunity if he/she services some other product. Agent is more important when predicting exceeded and far-exceeded classes. It means that the top performers are likely staying on the top most of the time. The performance of the CSRs whose performance falls into met or below met is not stable. However, they are more likely staying in met class or below. Table 5. Ranking of the Inputs (importance) for Predicting Customer Service Customer Service Satisfaction - Sensitivity Analysis Class Algorithm Agent Date Training Product ACW Adherence Aux Attendance Note Linear Met Some BP BP/CG Hybrid Linear Met 1 BP BP/CG Hybrid Linear Met 2 BP BP/CG Hybrid Linear Met (1 & 2) BP BP/CG Hybrid Linear Exceeded BP BP/CG Hybrid
11 4.5 Association rule mining Finally we have experimented with the association rule mining. Here, we have used the Apriori algorithm which uses the minimum support threshold to find frequent itemsets. The algorithm finds all frequent itemsets and generates strong association rules [16]. For the Apriori algorithm we used 0.9 as the minimum confidence level and the associations depicted in Table 7 were developed. Table 6. Ranking of the Inputs (importance) for Predicting Business Needs Business Need Requirements Sensitivity Analysis Class Not Met Met Some Met Exceeded Far exceeded Algorithms Agent Date Training Product ACW Adherence Aux Attendance Note Linear BP BP/CG Hybrid Linear BP BP/CG Hybrid Linear BP BP/CG Hybrid Linear BP BP/CG Hybrid Linear BP BP/CG Hybrid Table 7. Association rule mining/analysis using Apriori algorithm Input Output Confidence Product 3834 ==> Adherence Agent 8154 ==> Attendance Business 9653 ==> Attendance Service ==> Attendance Auxiliary 4438 ==> Attendance ACW 4201 ==> Attendance Adherence Auxiliary 2420 ==> Training Service Adherence 2593 ==> Training Business 4068 ==> Training Auxiliary 7117 ==> Training Adherence 7613 ==> Training Service 4698 ==> Training Table 7 reveals a very interesting association between product and adherence with a high confidence (0.92). The adherence should be the same across all the products since it indicates how much time the
12 CSR s commit to the phone. However, the results indicate that adherence can be significantly different across products. It is found that agents can predict attendance with a confidence level of Certain agents can constantly achieve good performance on attendance while some others constantly fail. The results also showed business requirement, customer service, auxiliary and ACW can also predict attendance accurately. We found that almost all the performance attributes except ACW can accurately predict if a CSR is in training or not. The results indicate that the performance of experienced CSR s and new CSR s who are still in training are significantly different. 5 Concluding remarks In our research, we have applied six AI-based models (LNN, MLP, PNN, CART, Decision tree-ann Hybrid model and the SVM) to predict the quality score of customer service satisfaction and business need satisfaction. The research compared the performance of the six models based on the confusion matrix results of cross validation. The performance was also analyzed by using the accuracy of the correct category prediction and the accuracy of the wrong category prediction. The overall accuracy from CART is on predicting customer service satisfaction and on predicting business need satisfaction. The accuracy of the correct category and the accuracy of the wrong category are very close. The trained models based on CART can be used for future prediction. MLP training using BP and CG did not have significant better performance than BP alone. The research also analyzed the sensitivity of inputs. The research found that products, agents and dates could affect the quality of performance more than time management. The CSRs serving in some products have more opportunity to exceed the expectation than the ones in some other products. The top performers constantly exceed or farexceed the expectation. The performance of CSRs whose evaluation results fall into met or below is not stable. The research suggest that call center management team should focus training and coaching the individuals and products that constantly have low quality instead of emphasizing balancing the length of times spent on calls (which happens to be currently the case). The CSR s working in different products may have different performance on adherence. This could indicate the leadership issue within products. The model we built in this research can be used to predict attendance when attendance related data is not available. This study also revealed that training and experience might be the key factors for the performance of call centers. Addition of telephone handling experience to the training process is also very important. Acknowledgements The second author was supported by the International Joint Research Grant of the IITA (Institute of Information Technology Assessment) foreign professor invitation program of the MIC (Ministry of Information and Communication), Korea.
13 References [1] A. Gilmore, Call Center Management: Is Service Quality a Priority, Managing Service Quality, vol. 11, no. 3 pp [2] A. Parasuraman, Service Quality and Productivity: a Synergistic Perspective, Managing Service Quality, vol. 12, no. 1, pp [3] Altitude Software, Improving Agent Performance While Maintaining High Level of Motivation, September 2002, access date: April 2003 [4] Amy Neustein, Building Natural Language Intelligence into Voice-Based Applications, July 2002, access date: April [5] David Sims, What Is CRM, , access date: April 2003 [6] HigherGroud, Discovering the Business Intelligence Hidden in Your Call Center, HigherGroud White Paper Document, June 2002, access date: April [7] James Brewton, Customer Value-Driven Competency Models: Powerful Tools for Maximizing Call Center Performance and Customer Loyalty, , access date: April [8] Linda Dilauro, What s Next in Monitoring Technology? Data Mining Finds a Calling in Call Centers, Dictaphone Corporation. [9] Pan-Ning Tan, Vipin Kumar, Mining Indirect Associations in Web Data, WebKDD 2001: Mining Log Data Across All Customer Touch Points. [10] Ro King, Data Mining and CRM, , access date: April 2002 [11] Stephan Busemann, Sven Schmeier, Roman G.Arens, Message Classification in the Call Center, Proceedings of the 6th Conference on Applied Natural Language Processing, Seattle, WA, 2000 [12] Vladimir Vapnik, The Nature of Statistical Learning Theory, Springer, New York, [13] Warren Staples, John Dalrymple, Rhonda Bryar, Accessing Call Center Quality using the SERVQUAL Model, , (access date: April 2003). [14] Marcin Paprzycki, Ajith Abraham and Ruiyuan Guo, Data Mining Approach for Analyzing Call Center Performance, The 17th International Conference on Industrial & Engineering Applications of Artificial Intelligence and Expert Systems, Innovations in Applied Artificial Intelligence, Robert Orchard, Chunsheng Yang, Moonis Ali (Eds.), Lecture Notes in Computer Science 3029, Springer Verlag, Germany, pp , [15] Call Centers: Learn The "Ins" and "Outs" of Your Call Center Needs. (accessed on December 31, 2004). [16] Srikant, R. and Agrawal, R., Mining Generalized Association Rules, Future Generation Computer Systems, 13(2-3), [17] Ajith Abraham, Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems, IEEE International Conference on Fuzzy Systems (IEEE FUZZ'02), 2002 IEEE World Congress on Computational Intelligence, Hawaii, ISBN , IEEE Press pp , 2002.
Data quality in Accounting Information Systems
Data quality in Accounting Information Systems Comparing Several Data Mining Techniques Erjon Zoto Department of Statistics and Applied Informatics Faculty of Economy, University of Tirana Tirana, Albania
Real Stock Trading Using Soft Computing Models
Real Stock Trading Using Soft Computing Models Brent Doeksen 1, Ajith Abraham 2, Johnson Thomas 1 and Marcin Paprzycki 1 1 Computer Science Department, Oklahoma State University, OK 74106, USA, 2 School
Traffic Accident Data Mining Using Machine Learning Paradigms
Traffic Accident Data Mining Using Machine Learning Paradigms Miao Chong, Ajith Abraham and Marcin Paprzycki Computer Science Department, Oklahoma State University, USA [email protected], [email protected]
Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing and Developing E-mail Classifier
International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878, Volume-1, Issue-6, January 2013 Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski [email protected]
Introduction to Machine Learning and Data Mining Prof. Dr. Igor Trakovski [email protected] Neural Networks 2 Neural Networks Analogy to biological neural systems, the most robust learning systems
Chapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT S ACADEMIC PERFORMANCE
EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT S ACADEMIC PERFORMANCE S. Anupama Kumar 1 and Dr. Vijayalakshmi M.N 2 1 Research Scholar, PRIST University, 1 Assistant Professor, Dept of M.C.A. 2 Associate
Towards better accuracy for Spam predictions
Towards better accuracy for Spam predictions Chengyan Zhao Department of Computer Science University of Toronto Toronto, Ontario, Canada M5S 2E4 [email protected] Abstract Spam identification is crucial
Impact of Feature Selection on the Performance of Wireless Intrusion Detection Systems
2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Impact of Feature Selection on the Performance of ireless Intrusion Detection Systems
Event driven trading new studies on innovative way. of trading in Forex market. Michał Osmoła INIME live 23 February 2016
Event driven trading new studies on innovative way of trading in Forex market Michał Osmoła INIME live 23 February 2016 Forex market From Wikipedia: The foreign exchange market (Forex, FX, or currency
Data Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
6.2.8 Neural networks for data mining
6.2.8 Neural networks for data mining Walter Kosters 1 In many application areas neural networks are known to be valuable tools. This also holds for data mining. In this chapter we discuss the use of neural
Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence
Artificial Neural Networks and Support Vector Machines CS 486/686: Introduction to Artificial Intelligence 1 Outline What is a Neural Network? - Perceptron learners - Multi-layer networks What is a Support
Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network
General Article International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347-5161 2014 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Impelling
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
Evaluation of Feature Selection Methods for Predictive Modeling Using Neural Networks in Credits Scoring
714 Evaluation of Feature election Methods for Predictive Modeling Using Neural Networks in Credits coring Raghavendra B. K. Dr. M.G.R. Educational and Research Institute, Chennai-95 Email: [email protected]
How can we discover stocks that will
Algorithmic Trading Strategy Based On Massive Data Mining Haoming Li, Zhijun Yang and Tianlun Li Stanford University Abstract We believe that there is useful information hiding behind the noisy and massive
A Property & Casualty Insurance Predictive Modeling Process in SAS
Paper AA-02-2015 A Property & Casualty Insurance Predictive Modeling Process in SAS 1.0 ABSTRACT Mei Najim, Sedgwick Claim Management Services, Chicago, Illinois Predictive analytics has been developing
A Multi-level Artificial Neural Network for Residential and Commercial Energy Demand Forecast: Iran Case Study
211 3rd International Conference on Information and Financial Engineering IPEDR vol.12 (211) (211) IACSIT Press, Singapore A Multi-level Artificial Neural Network for Residential and Commercial Energy
FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS
FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS Breno C. Costa, Bruno. L. A. Alberto, André M. Portela, W. Maduro, Esdras O. Eler PDITec, Belo Horizonte,
Beating the MLB Moneyline
Beating the MLB Moneyline Leland Chen [email protected] Andrew He [email protected] 1 Abstract Sports forecasting is a challenging task that has similarities to stock market prediction, requiring time-series
Learning is a very general term denoting the way in which agents:
What is learning? Learning is a very general term denoting the way in which agents: Acquire and organize knowledge (by building, modifying and organizing internal representations of some external reality);
Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100
Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 Erkan Er Abstract In this paper, a model for predicting students performance levels is proposed which employs three
Supply Chain Forecasting Model Using Computational Intelligence Techniques
CMU.J.Nat.Sci Special Issue on Manufacturing Technology (2011) Vol.10(1) 19 Supply Chain Forecasting Model Using Computational Intelligence Techniques Wimalin S. Laosiritaworn Department of Industrial
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
Employer Health Insurance Premium Prediction Elliott Lui
Employer Health Insurance Premium Prediction Elliott Lui 1 Introduction The US spends 15.2% of its GDP on health care, more than any other country, and the cost of health insurance is rising faster than
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College
Comparison of Data Mining Techniques used for Financial Data Analysis
Comparison of Data Mining Techniques used for Financial Data Analysis Abhijit A. Sawant 1, P. M. Chawan 2 1 Student, 2 Associate Professor, Department of Computer Technology, VJTI, Mumbai, INDIA Abstract
CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.
Lecture Machine Learning Milos Hauskrecht [email protected] 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht [email protected] 539 Sennott
A New Approach for Evaluation of Data Mining Techniques
181 A New Approach for Evaluation of Data Mining s Moawia Elfaki Yahia 1, Murtada El-mukashfi El-taher 2 1 College of Computer Science and IT King Faisal University Saudi Arabia, Alhasa 31982 2 Faculty
Neural Networks and Support Vector Machines
INF5390 - Kunstig intelligens Neural Networks and Support Vector Machines Roar Fjellheim INF5390-13 Neural Networks and SVM 1 Outline Neural networks Perceptrons Neural networks Support vector machines
Electroencephalography Analysis Using Neural Network and Support Vector Machine during Sleep
Engineering, 23, 5, 88-92 doi:.4236/eng.23.55b8 Published Online May 23 (http://www.scirp.org/journal/eng) Electroencephalography Analysis Using Neural Network and Support Vector Machine during Sleep JeeEun
Neural Network Add-in
Neural Network Add-in Version 1.5 Software User s Guide Contents Overview... 2 Getting Started... 2 Working with Datasets... 2 Open a Dataset... 3 Save a Dataset... 3 Data Pre-processing... 3 Lagging...
Numerical Algorithms Group
Title: Summary: Using the Component Approach to Craft Customized Data Mining Solutions One definition of data mining is the non-trivial extraction of implicit, previously unknown and potentially useful
Predictive Dynamix Inc
Predictive Modeling Technology Predictive modeling is concerned with analyzing patterns and trends in historical and operational data in order to transform data into actionable decisions. This is accomplished
Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05
Ensemble Methods Knowledge Discovery and Data Mining 2 (VU) (707004) Roman Kern KTI, TU Graz 2015-03-05 Roman Kern (KTI, TU Graz) Ensemble Methods 2015-03-05 1 / 38 Outline 1 Introduction 2 Classification
How To Solve The Kd Cup 2010 Challenge
A Lightweight Solution to the Educational Data Mining Challenge Kun Liu Yan Xing Faculty of Automation Guangdong University of Technology Guangzhou, 510090, China [email protected] [email protected]
Data Mining Techniques Chapter 6: Decision Trees
Data Mining Techniques Chapter 6: Decision Trees What is a classification decision tree?.......................................... 2 Visualizing decision trees...................................................
A Secured Approach to Credit Card Fraud Detection Using Hidden Markov Model
A Secured Approach to Credit Card Fraud Detection Using Hidden Markov Model Twinkle Patel, Ms. Ompriya Kale Abstract: - As the usage of credit card has increased the credit card fraud has also increased
In this presentation, you will be introduced to data mining and the relationship with meaningful use.
In this presentation, you will be introduced to data mining and the relationship with meaningful use. Data mining refers to the art and science of intelligent data analysis. It is the application of machine
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
Towards applying Data Mining Techniques for Talent Mangement
2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Towards applying Data Mining Techniques for Talent Mangement Hamidah Jantan 1,
Predict Influencers in the Social Network
Predict Influencers in the Social Network Ruishan Liu, Yang Zhao and Liuyu Zhou Email: rliu2, yzhao2, [email protected] Department of Electrical Engineering, Stanford University Abstract Given two persons
IBM SPSS Neural Networks 22
IBM SPSS Neural Networks 22 Note Before using this information and the product it supports, read the information in Notices on page 21. Product Information This edition applies to version 22, release 0,
Chapter 20: Data Analysis
Chapter 20: Data Analysis Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 20: Data Analysis Decision Support Systems Data Warehousing Data Mining Classification
Prediction of Stock Performance Using Analytical Techniques
136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University
REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES
REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES R. Chitra 1 and V. Seenivasagam 2 1 Department of Computer Science and Engineering, Noorul Islam Centre for
Programming Exercise 3: Multi-class Classification and Neural Networks
Programming Exercise 3: Multi-class Classification and Neural Networks Machine Learning November 4, 2011 Introduction In this exercise, you will implement one-vs-all logistic regression and neural networks
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
Customer Relationship Management using Adaptive Resonance Theory
Customer Relationship Management using Adaptive Resonance Theory Manjari Anand M.Tech.Scholar Zubair Khan Associate Professor Ravi S. Shukla Associate Professor ABSTRACT CRM is a kind of implemented model
Making Sense of the Mayhem: Machine Learning and March Madness
Making Sense of the Mayhem: Machine Learning and March Madness Alex Tran and Adam Ginzberg Stanford University [email protected] [email protected] I. Introduction III. Model The goal of our research
Predicting required bandwidth for educational institutes using prediction techniques in data mining (Case Study: Qom Payame Noor University)
260 IJCSNS International Journal of Computer Science and Network Security, VOL.11 No.6, June 2011 Predicting required bandwidth for educational institutes using prediction techniques in data mining (Case
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati [email protected], [email protected]
Back Propagation Neural Networks User Manual
Back Propagation Neural Networks User Manual Author: Lukáš Civín Library: BP_network.dll Runnable class: NeuralNetStart Document: Back Propagation Neural Networks Page 1/28 Content: 1 INTRODUCTION TO BACK-PROPAGATION
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Lecture 10: Regression Trees
Lecture 10: Regression Trees 36-350: Data Mining October 11, 2006 Reading: Textbook, sections 5.2 and 10.5. The next three lectures are going to be about a particular kind of nonlinear predictive model,
ENHANCED CONFIDENCE INTERPRETATIONS OF GP BASED ENSEMBLE MODELING RESULTS
ENHANCED CONFIDENCE INTERPRETATIONS OF GP BASED ENSEMBLE MODELING RESULTS Michael Affenzeller (a), Stephan M. Winkler (b), Stefan Forstenlechner (c), Gabriel Kronberger (d), Michael Kommenda (e), Stefan
The Data Mining Process
Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier D.Nithya a, *, V.Suganya b,1, R.Saranya Irudaya Mary c,1 Abstract - This paper presents,
Lecture 6. Artificial Neural Networks
Lecture 6 Artificial Neural Networks 1 1 Artificial Neural Networks In this note we provide an overview of the key concepts that have led to the emergence of Artificial Neural Networks as a major paradigm
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
Keywords data mining, prediction techniques, decision making.
Volume 5, Issue 4, April 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analysis of Datamining
Chapter 4: Artificial Neural Networks
Chapter 4: Artificial Neural Networks CS 536: Machine Learning Littman (Wu, TA) Administration icml-03: instructional Conference on Machine Learning http://www.cs.rutgers.edu/~mlittman/courses/ml03/icml03/
Flexible Neural Trees Ensemble for Stock Index Modeling
Flexible Neural Trees Ensemble for Stock Index Modeling Yuehui Chen 1, Ju Yang 1, Bo Yang 1 and Ajith Abraham 2 1 School of Information Science and Engineering Jinan University, Jinan 250022, P.R.China
Impact of Boolean factorization as preprocessing methods for classification of Boolean data
Impact of Boolean factorization as preprocessing methods for classification of Boolean data Radim Belohlavek, Jan Outrata, Martin Trnecka Data Analysis and Modeling Lab (DAMOL) Dept. Computer Science,
Neural Networks for Machine Learning. Lecture 13a The ups and downs of backpropagation
Neural Networks for Machine Learning Lecture 13a The ups and downs of backpropagation Geoffrey Hinton Nitish Srivastava, Kevin Swersky Tijmen Tieleman Abdel-rahman Mohamed A brief history of backpropagation
COMP3420: Advanced Databases and Data Mining. Classification and prediction: Introduction and Decision Tree Induction
COMP3420: Advanced Databases and Data Mining Classification and prediction: Introduction and Decision Tree Induction Lecture outline Classification versus prediction Classification A two step process Supervised
A Hybrid Artificial Intelligence System for Assistance in Remote Monitoring of Heart Patients
A Hybrid Artificial Intelligence System for Assistance in Remote Monitoring of Heart Patients Theodor Heinze, Robert Wierschke, Alexander Schacht and Martin v. Löwis Hasso-Plattner-Institut, Prof.-Dr.-Helmert-Str.
Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence
Government of Russian Federation Federal State Autonomous Educational Institution of High Professional Education National Research University «Higher School of Economics» Faculty of Computer Science School
Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network
Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network Dušan Marček 1 Abstract Most models for the time series of stock prices have centered on autoregressive (AR)
1. Classification problems
Neural and Evolutionary Computing. Lab 1: Classification problems Machine Learning test data repository Weka data mining platform Introduction Scilab 1. Classification problems The main aim of a classification
Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research [email protected]
Introduction to Machine Learning Lecture 1 Mehryar Mohri Courant Institute and Google Research [email protected] Introduction Logistics Prerequisites: basics concepts needed in probability and statistics
Application of Event Based Decision Tree and Ensemble of Data Driven Methods for Maintenance Action Recommendation
Application of Event Based Decision Tree and Ensemble of Data Driven Methods for Maintenance Action Recommendation James K. Kimotho, Christoph Sondermann-Woelke, Tobias Meyer, and Walter Sextro Department
Identification of User Patterns in Social Networks by Data Mining Techniques: Facebook Case
Identification of User Patterns in Social Networks by Data Mining Techniques: Facebook Case A. Selman Bozkır 1, S. Güzin Mazman 2, and Ebru Akçapınar Sezer 1 1 Hacettepe University, Department of Computer
Application of Neural Network in User Authentication for Smart Home System
Application of Neural Network in User Authentication for Smart Home System A. Joseph, D.B.L. Bong, D.A.A. Mat Abstract Security has been an important issue and concern in the smart home systems. Smart
CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES
CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES Claus Gwiggner, Ecole Polytechnique, LIX, Palaiseau, France Gert Lanckriet, University of Berkeley, EECS,
Scalable Developments for Big Data Analytics in Remote Sensing
Scalable Developments for Big Data Analytics in Remote Sensing Federated Systems and Data Division Research Group High Productivity Data Processing Dr.-Ing. Morris Riedel et al. Research Group Leader,
Decision Trees from large Databases: SLIQ
Decision Trees from large Databases: SLIQ C4.5 often iterates over the training set How often? If the training set does not fit into main memory, swapping makes C4.5 unpractical! SLIQ: Sort the values
Data Mining for Knowledge Management. Classification
1 Data Mining for Knowledge Management Classification Themis Palpanas University of Trento http://disi.unitn.eu/~themis Data Mining for Knowledge Management 1 Thanks for slides to: Jiawei Han Eamonn Keogh
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
Course Syllabus. Purposes of Course:
Course Syllabus Eco 5385.701 Predictive Analytics for Economists Summer 2014 TTh 6:00 8:50 pm and Sat. 12:00 2:50 pm First Day of Class: Tuesday, June 3 Last Day of Class: Tuesday, July 1 251 Maguire Building
An Introduction to Data Mining
An Introduction to Intel Beijing [email protected] January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail
Face Recognition For Remote Database Backup System
Face Recognition For Remote Database Backup System Aniza Mohamed Din, Faudziah Ahmad, Mohamad Farhan Mohamad Mohsin, Ku Ruhana Ku-Mahamud, Mustafa Mufawak Theab 2 Graduate Department of Computer Science,UUM
Blog Post Extraction Using Title Finding
Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School
Question 2 Naïve Bayes (16 points)
Question 2 Naïve Bayes (16 points) About 2/3 of your email is spam so you downloaded an open source spam filter based on word occurrences that uses the Naive Bayes classifier. Assume you collected the
NEURAL NETWORKS IN DATA MINING
NEURAL NETWORKS IN DATA MINING 1 DR. YASHPAL SINGH, 2 ALOK SINGH CHAUHAN 1 Reader, Bundelkhand Institute of Engineering & Technology, Jhansi, India 2 Lecturer, United Institute of Management, Allahabad,
Enhancing Quality of Data using Data Mining Method
JOURNAL OF COMPUTING, VOLUME 2, ISSUE 9, SEPTEMBER 2, ISSN 25-967 WWW.JOURNALOFCOMPUTING.ORG 9 Enhancing Quality of Data using Data Mining Method Fatemeh Ghorbanpour A., Mir M. Pedram, Kambiz Badie, Mohammad
Server Load Prediction
Server Load Prediction Suthee Chaidaroon ([email protected]) Joon Yeong Kim ([email protected]) Jonghan Seo ([email protected]) Abstract Estimating server load average is one of the methods that
HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION
HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION Chihli Hung 1, Jing Hong Chen 2, Stefan Wermter 3, 1,2 Department of Management Information Systems, Chung Yuan Christian University, Taiwan
A Health Degree Evaluation Algorithm for Equipment Based on Fuzzy Sets and the Improved SVM
Journal of Computational Information Systems 10: 17 (2014) 7629 7635 Available at http://www.jofcis.com A Health Degree Evaluation Algorithm for Equipment Based on Fuzzy Sets and the Improved SVM Tian
Ensembles and PMML in KNIME
Ensembles and PMML in KNIME Alexander Fillbrunn 1, Iris Adä 1, Thomas R. Gabriel 2 and Michael R. Berthold 1,2 1 Department of Computer and Information Science Universität Konstanz Konstanz, Germany [email protected]
Learning to Process Natural Language in Big Data Environment
CCF ADL 2015 Nanchang Oct 11, 2015 Learning to Process Natural Language in Big Data Environment Hang Li Noah s Ark Lab Huawei Technologies Part 1: Deep Learning - Present and Future Talk Outline Overview
Customer Classification And Prediction Based On Data Mining Technique
Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor
