A Decision Tree- Rough Set Hybrid System for Stock Market Trend Prediction
|
|
|
- Horatio Brooks
- 10 years ago
- Views:
Transcription
1 A Decision Tree- Rough Set Hybrid System for Stock Market Trend Prediction Binoy.B.Nair Department Electronics and Communication Engineering, Amrita Vishwa Vidaypeetham, Ettimadai, Coimbatore, , India. V.P Mohandas Department Electronics and Communication Engineering, Amrita Vishwa Vidaypeetham, Ettimadai, Coimbatore, , India. N. R. Sakthivel Department of Mechanical Engineering, Amrita Vishwa Vidaypeetham, Ettimadai, Coimbatore, , India. ABSTRACT Prediction of stock market trends has been an area of great interest both to those who wish to profit by trading stocks in the stock market and for researchers attempting to uncover the information hidden in the stock market data. Applications of data mining techniques for stock market prediction, is an area of research which has been receiving a lot of attention recently. This work presents the design and performance evaluation of a hybrid decision tree- rough set based system for predicting the next days trend in the Bombay Stock Exchange (BSE- SENSEX). Technical indicators are used in the present study to extract features from the historical SENSEX data. C4.5 decision tree is then used to select the relevant features and a rough set based system is then used to induce rules from the extracted features. Performance of the hybrid rough set based system is compared to that of an artificial neural network based trend prediction system and a naive bayes based trend predictor. It is observed from the results that the proposed system outperforms both the neural network based system and the naive bayes based trend prediction system. General Terms Data mining, stock market, trend prediction Keywords Stock market, Rough set, decision tree, artificial neural networks, Technical indicators, Rules. 1. INTRODUCTION Stock market prediction has been an area of intense interest due to the potential of obtaining a very high return on the invested money in a very short time. However, according to the efficient market hypothesis[1], all such attempts at prediction are futile as all the information that could affect the behaviour of stock price or the market index must have been already incorporated into the current market quotation. There have been several studies, for example, in [2], which question the efficient market hypothesis by showing that it is, in fact, possible to predict, with some degree of accuracy, the future behaviour of the stock markets. Technical analysis has been used since a very long time for predicting the future behaviour of the stock price. A total of 21 different technical indicators are used in the present study to extract information from the financial time series and hence, they act as the features that are used for stock market trend prediction. A C 4.5 decision tree is used to select the relevant features (technical indicators) from the extracted feature set. The selected features are then applied to a rough set based system for predicting one-day-ahead trends in the stock market. An artificial neural network based trend predictor and a naive bayes based trend prediction system are also implemented. The performance of the rough set based system is compared to that of the artificial neural network based and the naϊve bayes based predictors. The financial time series considered for the purpose is the Bombay Stock Exchange sensitive index (BSE-SENSEX). The period under consideration is from September 3, 2003 to March 7,2010. The rest of the paper is organised as follows: Section 2 presents the literature survey, section 3 presents the design of the proposed hybrid system, the comparison of the proposed system with an artificial neural network and a naive bayes based system is presented in section 4 and conclusions are presented in section LITERATURE SURVEY Artificial neural networks have been widely used prediction of financial time series. In [3], the effectiveness of time delay, recurrent and probabilistic neural networks for prediction of stock trends based on historical data of the daily closing price was attempted. In [4] technical indicators were used as inputs to a feedforward neural network for prediction of Taiwan Stock Exchange and NASDAQ. In [5], technical indicators and a backpropagation neural network was used to create a decision support system for exchange traded funds trading. Technical indicators and neural networks were used in [6] to predict the US Dollar Vs British Pound exchange rates. In [7] a framework for intelligent interaction of automatic trading algorithms with the user was presented. In [8] a back propagation neural network was employed to predict the buy/sell points for a stock and then applied a case based dynamic window to further improve the forecast accuracy. In [2] a survey of more than hundred articles which used neural networks and neuro-fuzzy models for predicting stock markets was presented. It was observed that soft computing techniques outperform conventional models in most cases. Defining the structure of the model is however, a major issue and is a matter of trial and error. In [9], review of data mining applications in stock markets was presented. [10], used a two-layer bias decision tree with technical indicators feature to create a decision rule that can make recommendations when to buy a stock and when not to buy it. [11] combined the filter rule and the decision tree technique for stock trading. [12] presented a hybrid model for stock market forecasting and portfolio selection. In[13] a hybrid fuzzy time series model with cumulative probability distribution approach and rough set rule induction was proposed, to forecast stock markets. Cumulative probability distribution approach was used to discretize the observations in training datasets based on the characteristics of data distribution and rules were generated using rough set 1
2 algorithm. Forecasting was then done based on rule support values from rough set algorithm. 3. DESIGN OF THE HYBRID SYSTEM The hybrid trend prediction system proposed in this paper works in the following way: First the features are extracted from the daily stock market data. Then the relevant features are selected using decision tree. A rough set based classifier is then used to predict the next day s trend using the selected features. In the present study, the prediction accuracy of the proposed hybrid system is validated using the Bombay Stock Exchange Sensitive Index (BSE-SENSEX or commonly, SENSEX) data. The daily SENSEX data (open, high, low, close and volume) from September 3, 2003 to March 7, 2010 (a total of 1625 BSE working days) is considered for the purpose. The performance of trend prediction systems are evaluated using the 10-folds cross validation method. 3.1 Bombay Stock Exchange Sensitive Index (BSE-SENSEX) The BSE-SENSEX was selected for empirical validation of the proposed trend prediction system. BSE is the world's largest exchange in terms of the number of listed companies (over 4900). It is in the top ten of global exchanges in terms of the market capitalization of its listed companies (as of December 31, 2009). The companies listed on BSE command a total market capitalization of 1.36 trillion US Dollars as of 31st March, The BSE Index, SENSEX, is India's first and most popular Stock Market benchmark index. The stocks that constitute the BSE-SENSEX are drawn from almost all the market sectors and hence the BSE-SENSEX is capable of accurately tracking the movements of the market. Hence, BSE- SENSEX is taken as the representative of the Indian stock markets. Being such a large and complex system, prediction of trends in the BSE-SENSEX is especially challenging. 3.2 Feature Extraction The inputs of the proposed system are the daily open, high, low, close, trading volumes of the SENSEX and technical indexes. Twenty one commonly used technical indicators (comprising of twenty five indexes) are considered initially. The Technical indexes considered, are presented in Table 1. Three types of indicators, namely, volume based, price based and overlay indicators were chosen. Since the proposed system is capable of choosing the relevant features automatically, the exact number of indicators is not of much importance, only those indicators which contain any relevant information will be selected by the system.literature on the technical indices considered above, is widely available, for example in [14]. Classification of the trend into up, down and no trend was done according to the definition in [15] in the following way: The market is formally classified as being in an uptrend (downtrend) when all the following conditions are satisfied: 1. The closing value must lead (lag) its 25 day moving average 2. The 25 day moving average must lead (lag) 65 day moving average. 3. The 25 day moving average must have been rising (falling) for at least 5 days. 4. The 65 day moving average must have been rising (falling) for at least 1 day. Table 1. Technical Indicators for feature extraction Indicator Type Positive volume index Volumebased Negative volume index On balance volume indicators Price volume trend Relative strength Indicator (RSI) Highest high Lowest low Moving Average Nine period Convergence Divergence consisting of two indexes moving average MACD line Momentum Acceleration stochastic oscillator %K consisting of two %D indexes Chaikin s volatility William s %R Typical price price rate of change Median price Weighted close William s accumulation/ distribution Bollinger bands Bollinger Upper consisting of three Bollinger Mid indexes Bollinger Lower Pricebased indicators Overlays 25-day moving average 65-day moving average If the movement of the market cannot be classified as either an uptrend or a downtrend, it is assumed that there is no trend in the market movement. The above information gives the present day s trend and is also used to predict the next day s trend. The aim of all the trend prediction techniques considered in the present study, is to identify the relationship between the information variables and the decision variable. All the technical indexes considered, may or may not be necessary to arrive at the decision. It is also possible that some features may have a negative impact on the accuracy of the system. Hence, feature selection is used to identify those features (technical indexes) which are necessary for trend prediction and help in maintaining or improving the prediction accuracy. Features which are not selected are deemed unnecessary and are discarded. 3.3 Decision Tree based Feature Selection The Decision tree C4.5 algorithm[15] is widely used to construct decision trees, and is used in the present study to reduce the number of features required for rule generation. A decision tree is a flowchart-like tree structure, where each internal node (non-leaf node) denotes a test on an attribute, each branch represents an outcome of the test, and each leaf node (or terminal node) holds a class label. The topmost node in a tree is the root node. Given a tuple, for which the associated class label is unknown, the attribute values of the tuple are tested against the decision tree. A path is traced from the root to a leaf node, which holds the class prediction for that tuple. The construction of decision tree classifiers does not require any domain knowledge or parameter setting, and 2
3 therefore is appropriate for exploratory knowledge discovery. Decision trees can handle high dimensional data. During tree construction, attribute selection measures are used to select the attribute that best partitions the tuples into distinct classes. When decision trees are built, many of the branches may reflect noise or outliers in the training data. Tree pruning attempts to identify and remove such branches, with the goal of improving classification accuracy on unseen data Attribute selection measure for decision treethe Gain Ratio An attribute selection measure (or a splitting rule) is a heuristic for selecting the splitting criterion that best separates a given data partition, D, of class-labeled training tuples into individual classes. It provides a ranking for each attribute describing the given training tuples. The attribute having the best score for the measure is chosen as the splitting attribute for the given tuples. The tree node created for partition D is labelled with the splitting criterion, branches are grown for each outcome of the criterion, and the tuples are partitioned accordingly. C4.5,uses gain ratio as its attribute selection measure. It is defined as: Where: A is the attribute under consideration. (1) - (2) (3) (4) and (5) here p i is the probability that an arbitrary tuple in D belongs to class C i and is estimated by C i,d / D. The SplitInfo A (D) represents the potential information generated by splitting the training data set, D, into v partitions, corresponding to the v outcomes of a test on attribute A. The attribute with the maximum gain ratio is selected as the splitting attribute. The Info (D), also known as the entropy of D, is the average amount of information needed to identify the class label of a tuple in D. Info A (D) is the expected information required to classify a tuple from D based on the partitioning by A. The smaller the expected information still required, the greater the purity of the partitions. Gain (A), ie. the information gain, is defined as the difference between the original information requirement (i.e., based on just the proportion of classes) and the new requirement (i.e., obtained after partitioning on A). The attribute A with the highest information gain, Gain(A), is chosen as the splitting attribute at node N Decision tree pruning When a decision tree is built, some of the branches will reflect anomalies in the training data due to noise or outliers. Pruning addresses this problem of overfitting the data. Pruned trees tend to be smaller and less complex. C4.5 uses pruning method in which subtrees are removed from a fully grown tree. A subtree at a given node is pruned by removing its branches and replacing it with a leaf. The leaf is labelled with the most frequent class among the subtree being replaced. C4.5 uses pessimistic pruning, which makes use of error rate (percentage of tuples misclassified by the tree) estimates to make decisions regarding subtree pruning. The training set is used to estimate error rates. The pruning starts from the bottom of the tree. For each internal node, N, it computes the cost complexity of the subtree at N, and the cost complexity of the subtree at N if it were to be pruned (i.e., replaced by a leaf node). The two values are compared. If pruning the subtree at node N would result in a smaller cost complexity, then the subtree is pruned. Otherwise, it is kept. In the present study, to find the optimum parameters, an exhaustive search is carried out. Two parameters of the decision tree that need to be optimized for best accuracy are: the confidence level and minimum number of examples in the leaves. Table 2 presents the results of decision tree pruning for different confidence levels and minimal number of examples (for dataset with market data upto March 7, 2010). For other confidence levels, not displayed in the table, the results were similar, with the best accuracy is achieved when the minimum number of examples in leaves is 2 at confidence level of 25%, (from table 2). The features that form the most accurate decision tree are the selected features and are used for predicting the future trend. Other features which are not selected are discarded and not considered further. Table 2. Decision tree pruning results Min. Confidence Level (%) Ex Rough Set based Trend Prediction Rough sets are extremely useful in dealing with incomplete or imperfect knowledge. This section presents a brief overview of the rough set concepts [17] used in the present study and also presents their application for stock market trend prediction. A rough set uses the concepts of lower and upper approximations to classify the domain of interest into disjoint categories. The lower approximation is a description of the domain objects that are known with certainty to belong to the subset of interest, whereas the upper approximation is a description of the objects that possibly belong to the subset. 3
4 3.4.1 Information and decision system In rough set theory knowledge is represented in information tables. Each row in the table represents an object. Each column in the table represents an attribute (also referred to as feature, in this study), in the present case, each attribute is a technical index obtained from the historical SENSEX data. An information system, is defined as: = (U, A) (6) U= Non empty finite set of objects called the universe A= Non empty finite set of features such that : U V x A (7) And V x is the value set for Inclusion of decision features (attributes) along with the features (attributes) of the information system results in a decision system. If Q is the set of decision attributes, a decision system S can be represented as S= ( U, A Q ) (8) The elements of A are called conditional attributes or conditions and U is the universal set as defined in equation (6).The decision attribute for the present study is the trend of SENSEX (up trend, down trend and no trend) Indiscernibility The concept of indiscernibility is used to define equivalence classes for the objects. Given a subset of attributes P A, each such subset defines an equivalence relation IND(P) called an indiscernibility relation.this indiscernibility relation is defined as IND(P) = { (x,y) U 2 P, (x) = (y) } (9) Equation (9) means that the subset of features, P, will define the partition of the universe into sets such that each object in a set cannot be distinguished from other objects in the set using only the features in P. The sets into which the objects are divided, are called the equivalence classes. The equivalence classes of the indiscernibility relation with respect to P are denoted [x] P,where x U.The partition of U, determined by IND(P), is denoted U/IND(P) or U/P, which is simply the set of equivalence classes generated by IND(P) Set approximation Given an information system, = (U, A), the set X U, can be approximated using only the information in the set P using the P-upper and P-lower approximations of X. P-Lower approximation of X: PX x x P X (10) The lower approximation is the set containing all objects for which the equivalence class corresponding to the object is a subset of the set. This set contains all objects which certainly belong to the set X. P-Upper approximation of X: PX x x P X 0 (11) This set contains all objects which possibly belong to the set X. Table 3 presents the number of objects in the upper and lower approximations of each class. Table 3. Approximation results Class No. of Lower Upper objects Approx. Approx. Up Down Notrend Reducts A reduct is a minimal representation of the original dataset. The concepts of positive region, feature dependency and significance are used for computation of reducts. If P and Q are sets of features inducing equivalence relations over U, then the positive region comprises all objects of U that can be classified to classes of U/Q using the information contained within features P. The positive region is denoted by POS P (Q). For P,Q A, the dependence of Q on P is given by k= P (Q) = POS P (Q) / U (12) The dependency k can take values from 0 (meaning Q is independent of P ) to 1 (meaning that Q is completely dependent on P.The significance of feature can be determined by calculating the change in dependency when a feature is removed from the set of considered features. The higher the change in dependency, the more significant is the attribute. Considering the sets P and Q and an attribute x P, the significance of feature x upon Q is defined by P (Q, x ) = P (Q) - P-{x} (Q) (13) A reduct is a minimal subset of the initial feature set such that no features can be removed from the subset without affecting the dependency degree. A given dataset may have many reduct sets, and the collection of all reducts is denoted by R=X X P, X(D)= P (D); X-{a} (D) X (D), a X (14) The core is a subset of the reduct set R. The elements of core are those features that cannot be eliminated without introducing more contradictions to the dataset. The following five reducts were obtained: 1.Volume,MACD Line, Nine-period moving average, Fast stochastic %K, Fast stochastic %D, Acceleration,Momentum,William s %R, Negative volume index, RSI, Bollinger-mid, Highest High, On balance volume, Price rate of change, Pricevolume trend, Typical Price, William s accumulation/ 2.Volume, MACD Line, Nine-period moving average, Fast stochastic %K, Fast stochastic %D, Acceleration,Momentum, William s %R, Negative volume index, RSI, Bollinger-mid, Highest High, Lowest Low, On balance volume, Price rate of change, Price-volume trend, William s accumulation/ 3.Volume, MACD Line, Nine-period moving average, Fast stochastic %K, Fast stochastic %D, Acceleration, Momentum, William s %R, Negative volume index, Positive volume index, RSI, Bollinger-mid, Highest high, On balance volume, Price 4
5 rate of change, Price-volume trend, William s accumulation/ 4.Open,Volume, MACD Line, Nine-period moving average, Fast stochastic %K, Fast stochastic %D, Acceleration, Momentum, William's %R, Negative volume index, RSI, Bollinger mid, Highest high, on balance volume, Price rate of change, Price-volume trend, William s accumulation/ 5.Volume, MACD Line, Nine-period moving average, Fast stochastic %K, Fast stochastic %D, Acceleration, Momentum, William s %R, Negative volume index, Positive volume index, RSI, Bollinger-mid, Highest high, Lowest Low, Price rate of change, Price-volume trend, William s accumulation/ There are fifteen attributes in the core, namely, Volume, MACD Line, nine period moving average,fast stochastic %K, fast stochastic %D, acceleration, momentum, william s %R, negative volume index, RSI, bollinger-mid, highest high, price rate of change, price-volume trend, William s A/D Discernibility matrices A discernibility matrix of is a symmetric n x n matrix with entries C ij = A (x i ) (x j ) for i, j = 1,2,, n. (15) The entries for each object are thus the attributes that are needed in order to discern object i from object j Decision rules An approximation of decision Q, for decision system S= (U, A Q) (16) can be done by constructing the decision relative discernibility matrix of S. The decision relative discernibility matrix of S gives information about how to discern an object from objects belonging to another decision class. If G(A)= c ij is the discernibility matrix of a decision system S, the decision relative discernibility matrix of S is defined as G (A)= ( c ij ) (17) Assuming c ij = 0 if q(x i ) = q (x j ) and c ij = c ij - q otherwise. From the reducts computed from the discernibility matrix, the decision rules are generated for classification of the objects. In the present study, a total of 199 rules were generated. The classification accuracy was evaluated using a 10-folds crossvalidation method and was found to be 90.22%. The confusion matrix is given in table 4. Table 4. Hybrid decision tree-roughset confusion matrix No trend Up trend Down trend No trend Uptrend Down trend A stand-alone rough set based trend prediction system without feature selection was also designed and evaluated. A total of 202 decision rules were obtained and the classification accuracy obtained was %. 4. ARTIFICIAL NEURAL NETWORK AND NAIVE BAYES BASED PREDICTORS 4.1Artificial Neural Network based Trend Predictor Artificial neural networks (ANNs) have been used widely for predicting stock markets. In the present study, a simple multilayer feedforward neural network trained using backpropagation learning is used for predicting the next days trend of the SENSEX. All the features considered for the study initially, are used as inputs with each feature forming one input neuron. The inputs are normalized to -1 to +1 range for better performance. Training parameters which gave the best performance are : Learning rate = 0.3 Momentum = 0.2 Number of Hidden Layers = 1 Number of neurons in the input layer = 30 Number of neurons in the hidden layer = 30 Number of neurons in the output layer = 3 (one each for Up, Down and No trend) A classification accuracy of 77.66% was obtained. The prediction results are presented in the form of a confusion matrix in table 5. Table 5: ANN trend predictor confusion matrix No trend Uptrend Downtrend No trend Uptrend Downtrend Naive Bayes based Trend Predictor A naive Bayes based trend prediction system is also designed and evaluated. A naive Bayes based classifier/prediction system is much easier to design when compared to neural network based or rough set based classifier. Hence, the naive Bayes based prediction system is designed and evaluated to showcase the improvement in accuracy that is brought about by the use of neural network based and the rough set based prediction systems. Bayesian classifiers are statistical classifiers. They can predict class membership probabilities, such as the probability that a given tuple belongs to a particular class. Naïve Bayesian classifiers assume class conditional independence, that is, the effect of an attribute value on a given class is considered independent of the values of the other attributes. It is made to simplify the computations involved and, in this sense, is considered naïve. Let X be a data tuple (referred to as evidence ), described by a set of n attributes such that X = (x 1, x 2, x 3, x 4..., x n ). Assuming that there are m classes, C 1, C 2,..., C m, the classifier will predict that X belongs to the class having the highest posterior probability, conditioned on X. By Bayes theorem (18) As P(X) is constant for all classes, only P(X C i )P(C i ) need be maximized. The naive assumption of class conditional independence implies (19) In the present study, the attributes are continuous and hence, are assumed to have a Gaussian distribution with a mean μ and standard deviation σ, 5
6 P (x k C i ) = g(x k, μ Ci, σ Ci ) (20) Where In order to predict the class label of X, P(X C i )P(C i ) is evaluated for each class C i. The naïve Bayesian classifier predicts that tuple X belongs to the class C i if and only if P(C i X) > P(C j X) for 1 j m, j i. In the present study, three classes are considered: C 1 = Up, C 2 = down and C 3 = no trend. Each X is the set of all the attributes for one day. On application of the Naive Bayes classification technique, the accuracy obtained was 72.36% and the confusion matrix is given in table 6. Table 6: Naive Bayes trend predictor confusion matrix No trend Up trend Down trend No trend Uptrend Down trend CONCLUSIONS The design of rough set based stock market trend prediction system for predicting the one-day-ahead trend of the SENSEX is presented in this paper. Features are extracted from the historical SENSEX data. Extracted features are used to generate the prediction rules using rough set theory. It is observed that the proposed hybrid decision tree-rough set based trend prediction system produces an accuracy of 90.22%. The stand-alone rough set based trend prediction system, without any feature selection, produced an accuracy of %. The artificial neural network based prediction system, without feature selection, yielded an accuracy of 77.66%. Both these systems are found to offer better performance when compared to a simple Naïve bayes based trend prediction system which provided a trend prediction accuracy of 72.36%. It is seen that automated feature selection employed in the present study significantly improves the trend prediction accuracy. It can be concluded from the present study that soft computing based techniques, particularly the hybrid decision tree-rough set based trend prediction system is well suited for stock market trend prediction. 6. REFERENCES [1] Fama, E. F.,(1970), Efficient capital markets: A review of theory and empirical work, Journal of Finance, 25, pp [2] Atsalakis G. S., and Valavanis K. P., (2009), Surveying stock market forecasting techniques part II: soft computing methods, Expert Systems with Applications, vol.36, pp [3] Saad E. W., Prokhorov D.V., and Wunsch, D.C., (1998), Comparative study of stock trend prediction using time delay, recurrent and probabilistic neural networks, IEEE Transactions on Neural Networks, Vol. 9, No. 6, pp [4] Lee, C-T., and Chen,Y-P The efficacy of neural networks and simple technical indicators in predicting stock markets. In Proceedings of the International Conference on Convergence Information Technology, pp [5] Kuo, M-H., and Chen, C-L An ETF trading decision support system by using neural network and technical indicators. In Proceedings of the International Joint Conference on Neural Networks, pp [6] Nagarajan,V., Wu,Y., Liu,M. and Wang Q-G Forecast Studies for Financial Markets using Technical Analysis. In Proceedings of the International Conference on Control and Automation (ICCA2005), pp [7] Bansal, Archit, Mishra,Kaushik, Pachouri, Anshul, (2010), Algorithmic Trading (AT) - Framework for Futuristic Intelligent Human Interaction with Small Investors,, International Journal of Computer Applications, vol. 1, no. 21, pp [8] Chang, P.-C., Liu, C.-H., Lin, J.-L., Fan, C.-Y., & Celeste, S.P. Ng., (2009), A neural network with a case based dynamic window for stock trading prediction, Expert Systems with Applications, vol.36, pp [9] Setty, Venugopal D., Rangaswamy, T.M. and Subramanya, K.N.,(2010), A Review on Data Mining Applications to the Performance of Stock Marketing, International Journal of Computer Applications, vol. 1, no. 3,pp [10] Wang, J-L., and Chan, S-H.,(2006), Stock market trading rule discovery using two-layer bias decision tree, Expert Systems with Applications, 30, pp [11] Wu, M-C., Lin, S-Y., & Lin, C-H.,(2006), An effective application of decision tree to stock trading, Expert Systems with Applications, 31, pp [12] Huang, K.Y, and Jane, C.-J., (2009), A hybrid model for stock market forecasting and portfolio selection based on ARX, grey system and RS theories, Expert Systems with Applications, 36, pp [13] Teoh, H. J., Cheng,C-H., Chu, H-H., and Chen, J-S., (2008), Fuzzy time series model based on probabilistic approach and rough set rule induction for empirical research in stock markets, Data & Knowledge Engineering, 67, pp [14] Eng,W.F The Technical Analysis of Stocks, Options & Futures- Advanced Trading Systems and Techniques. Vision Books,India. [15] [16] Han, Jiawei and Kamber, Micheline 2006 Data Mining:Concepts and Techniques-Second Edition, Morgan Kaufmann,USA, pp [17] Jensen,R., & Shen, Q Computational Intelligence And Feature Selection-Rough and Fuzzy Approaches, John Wiley & Sons. 6
PREDICTING STOCK PRICES USING DATA MINING TECHNIQUES
The International Arab Conference on Information Technology (ACIT 2013) PREDICTING STOCK PRICES USING DATA MINING TECHNIQUES 1 QASEM A. AL-RADAIDEH, 2 ADEL ABU ASSAF 3 EMAN ALNAGI 1 Department of Computer
COMP3420: Advanced Databases and Data Mining. Classification and prediction: Introduction and Decision Tree Induction
COMP3420: Advanced Databases and Data Mining Classification and prediction: Introduction and Decision Tree Induction Lecture outline Classification versus prediction Classification A two step process Supervised
Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing and Developing E-mail Classifier
International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878, Volume-1, Issue-6, January 2013 Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
Classification and Prediction
Classification and Prediction Slides for Data Mining: Concepts and Techniques Chapter 7 Jiawei Han and Micheline Kamber Intelligent Database Systems Research Lab School of Computing Science Simon Fraser
Data quality in Accounting Information Systems
Data quality in Accounting Information Systems Comparing Several Data Mining Techniques Erjon Zoto Department of Statistics and Applied Informatics Faculty of Economy, University of Tirana Tirana, Albania
Data Mining for Knowledge Management. Classification
1 Data Mining for Knowledge Management Classification Themis Palpanas University of Trento http://disi.unitn.eu/~themis Data Mining for Knowledge Management 1 Thanks for slides to: Jiawei Han Eamonn Keogh
Customer Classification And Prediction Based On Data Mining Technique
Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor
Implementation of Data Mining Techniques to Perform Market Analysis
Implementation of Data Mining Techniques to Perform Market Analysis B.Sabitha 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, P.Balasubramanian 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
Prediction of Stock Performance Using Analytical Techniques
136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
Review on Financial Forecasting using Neural Network and Data Mining Technique
ORIENTAL JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY An International Open Free Access, Peer Reviewed Research Journal Published By: Oriental Scientific Publishing Co., India. www.computerscijournal.org ISSN:
A Prediction Model for Taiwan Tourism Industry Stock Index
A Prediction Model for Taiwan Tourism Industry Stock Index ABSTRACT Han-Chen Huang and Fang-Wei Chang Yu Da University of Science and Technology, Taiwan Investors and scholars pay continuous attention
DATA MINING USING INTEGRATION OF CLUSTERING AND DECISION TREE
DATA MINING USING INTEGRATION OF CLUSTERING AND DECISION TREE 1 K.Murugan, 2 P.Varalakshmi, 3 R.Nandha Kumar, 4 S.Boobalan 1 Teaching Fellow, Department of Computer Technology, Anna University 2 Assistant
Analyzing and Predicting Stock Market Using Data Mining Techniques A Review
Analyzing and Predicting Stock Market Using Data Mining Techniques A Review Suraj Baviskar 1, Nitin Namdev 2 1 ME Scholar, Department of CSE, JIT Borwan, Khargoan, (M.P), India. 2 Assit.Prof. Department
Data Mining Classification: Decision Trees
Data Mining Classification: Decision Trees Classification Decision Trees: what they are and how they work Hunt s (TDIDT) algorithm How to select the best split How to handle Inconsistent data Continuous
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE Kasra Madadipouya 1 1 Department of Computing and Science, Asia Pacific University of Technology & Innovation ABSTRACT Today, enormous amount of data
Decision Tree Learning on Very Large Data Sets
Decision Tree Learning on Very Large Data Sets Lawrence O. Hall Nitesh Chawla and Kevin W. Bowyer Department of Computer Science and Engineering ENB 8 University of South Florida 4202 E. Fowler Ave. Tampa
EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH
EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH SANGITA GUPTA 1, SUMA. V. 2 1 Jain University, Bangalore 2 Dayanada Sagar Institute, Bangalore, India Abstract- One
Neural Network and Genetic Algorithm Based Trading Systems. Donn S. Fishbein, MD, PhD Neuroquant.com
Neural Network and Genetic Algorithm Based Trading Systems Donn S. Fishbein, MD, PhD Neuroquant.com Consider the challenge of constructing a financial market trading system using commonly available technical
NEURAL networks [5] are universal approximators [6]. It
Proceedings of the 2013 Federated Conference on Computer Science and Information Systems pp. 183 190 An Investment Strategy for the Stock Exchange Using Neural Networks Antoni Wysocki and Maciej Ławryńczuk
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES Vijayalakshmi Mahanra Rao 1, Yashwant Prasad Singh 2 Multimedia University, Cyberjaya, MALAYSIA 1 [email protected]
Stock Portfolio Selection using Data Mining Approach
IOSR Journal of Engineering (IOSRJEN) e-issn: 2250-3021, p-issn: 2278-8719 Vol. 3, Issue 11 (November. 2013), V1 PP 42-48 Stock Portfolio Selection using Data Mining Approach Carol Anne Hargreaves, Prateek
Professor Anita Wasilewska. Classification Lecture Notes
Professor Anita Wasilewska Classification Lecture Notes Classification (Data Mining Book Chapters 5 and 7) PART ONE: Supervised learning and Classification Data format: training and test data Concept,
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College
Data Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
Classification algorithm in Data mining: An Overview
Classification algorithm in Data mining: An Overview S.Neelamegam #1, Dr.E.Ramaraj *2 #1 M.phil Scholar, Department of Computer Science and Engineering, Alagappa University, Karaikudi. *2 Professor, Department
HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION
HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION Chihli Hung 1, Jing Hong Chen 2, Stefan Wermter 3, 1,2 Department of Management Information Systems, Chung Yuan Christian University, Taiwan
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA
ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA D.Lavanya 1 and Dr.K.Usha Rani 2 1 Research Scholar, Department of Computer Science, Sree Padmavathi Mahila Visvavidyalayam, Tirupati, Andhra Pradesh,
Evaluation of Feature Selection Methods for Predictive Modeling Using Neural Networks in Credits Scoring
714 Evaluation of Feature election Methods for Predictive Modeling Using Neural Networks in Credits coring Raghavendra B. K. Dr. M.G.R. Educational and Research Institute, Chennai-95 Email: [email protected]
Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal
Learning Example Chapter 18: Learning from Examples 22c:145 An emergency room in a hospital measures 17 variables (e.g., blood pressure, age, etc) of newly admitted patients. A decision is needed: whether
Optimization of C4.5 Decision Tree Algorithm for Data Mining Application
Optimization of C4.5 Decision Tree Algorithm for Data Mining Application Gaurav L. Agrawal 1, Prof. Hitesh Gupta 2 1 PG Student, Department of CSE, PCST, Bhopal, India 2 Head of Department CSE, PCST, Bhopal,
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati [email protected], [email protected]
Stock Market Prediction Model by Combining Numeric and News Textual Mining
Stock Market Prediction Model by Combining Numeric and News Textual Mining Kranti M. Jaybhay ME (CSE) - II Computer Science Department Walchand Institute of Technology, Solapur University, India Rajesh
Machine Learning in FX Carry Basket Prediction
Machine Learning in FX Carry Basket Prediction Tristan Fletcher, Fabian Redpath and Joe D Alessandro Abstract Artificial Neural Networks ANN), Support Vector Machines SVM) and Relevance Vector Machines
Statistical Data Mining. Practical Assignment 3 Discriminant Analysis and Decision Trees
Statistical Data Mining Practical Assignment 3 Discriminant Analysis and Decision Trees In this practical we discuss linear and quadratic discriminant analysis and tree-based classification techniques.
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 11 Sajjad Haider Fall 2013 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right
A Study of Detecting Credit Card Delinquencies with Data Mining using Decision Tree Model
A Study of Detecting Credit Card Delinquencies with Data Mining using Decision Tree Model ABSTRACT Mrs. Arpana Bharani* Mrs. Mohini Rao** Consumer credit is one of the necessary processes but lending bears
SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING
AAS 07-228 SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING INTRODUCTION James G. Miller * Two historical uncorrelated track (UCT) processing approaches have been employed using general perturbations
FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM
International Journal of Innovative Computing, Information and Control ICIC International c 0 ISSN 34-48 Volume 8, Number 8, August 0 pp. 4 FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT
Data Mining Applications in Fund Raising
Data Mining Applications in Fund Raising Nafisseh Heiat Data mining tools make it possible to apply mathematical models to the historical data to manipulate and discover new information. In this study,
Review on Financial Forecasting using Neural Network and Data Mining Technique
Global Journal of Computer Science and Technology Neural & Artificial Intelligence Volume 12 Issue 11 Version 1.0 Year 2012 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global
Prediction of Heart Disease Using Naïve Bayes Algorithm
Prediction of Heart Disease Using Naïve Bayes Algorithm R.Karthiyayini 1, S.Chithaara 2 Assistant Professor, Department of computer Applications, Anna University, BIT campus, Tiruchirapalli, Tamilnadu,
Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network
Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network Dušan Marček 1 Abstract Most models for the time series of stock prices have centered on autoregressive (AR)
Feature Selection for Stock Market Analysis
Feature Selection for Stock Market Analysis Yuqinq He, Kamaladdin Fataliyev, and Lipo Wang School of Electrical and Electronic Engineering Nanyang Technological University Singapore Abstract. The analysis
Weather forecast prediction: a Data Mining application
Weather forecast prediction: a Data Mining application Ms. Ashwini Mandale, Mrs. Jadhawar B.A. Assistant professor, Dr.Daulatrao Aher College of engg,karad,[email protected],8407974457 Abstract
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
Equity forecast: Predicting long term stock price movement using machine learning
Equity forecast: Predicting long term stock price movement using machine learning Nikola Milosevic School of Computer Science, University of Manchester, UK [email protected] Abstract Long
A COMPARATIVE ASSESSMENT OF SUPERVISED DATA MINING TECHNIQUES FOR FRAUD PREVENTION
A COMPARATIVE ASSESSMENT OF SUPERVISED DATA MINING TECHNIQUES FOR FRAUD PREVENTION Sherly K.K Department of Information Technology, Toc H Institute of Science & Technology, Ernakulam, Kerala, India. [email protected]
Credit Card Fraud Detection Using Self Organised Map
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 13 (2014), pp. 1343-1348 International Research Publications House http://www. irphouse.com Credit Card Fraud
Experiments in Web Page Classification for Semantic Web
Experiments in Web Page Classification for Semantic Web Asad Satti, Nick Cercone, Vlado Kešelj Faculty of Computer Science, Dalhousie University E-mail: {rashid,nick,vlado}@cs.dal.ca Abstract We address
Comparison of Data Mining Techniques used for Financial Data Analysis
Comparison of Data Mining Techniques used for Financial Data Analysis Abhijit A. Sawant 1, P. M. Chawan 2 1 Student, 2 Associate Professor, Department of Computer Technology, VJTI, Mumbai, INDIA Abstract
Rule based Classification of BSE Stock Data with Data Mining
International Journal of Information Sciences and Application. ISSN 0974-2255 Volume 4, Number 1 (2012), pp. 1-9 International Research Publication House http://www.irphouse.com Rule based Classification
Performance Analysis of Decision Trees
Performance Analysis of Decision Trees Manpreet Singh Department of Information Technology, Guru Nanak Dev Engineering College, Ludhiana, Punjab, India Sonam Sharma CBS Group of Institutions, New Delhi,India
!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"
!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"!"#"$%&#'()*+',$$-.&#',/"-0%.12'32./4'5,5'6/%&)$).2&'7./&)8'5,5'9/2%.%3%&8':")08';:
Artificial Neural Network Approach for Classification of Heart Disease Dataset
Artificial Neural Network Approach for Classification of Heart Disease Dataset Manjusha B. Wadhonkar 1, Prof. P.A. Tijare 2 and Prof. S.N.Sawalkar 3 1 M.E Computer Engineering (Second Year)., Computer
REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES
REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES R. Chitra 1 and V. Seenivasagam 2 1 Department of Computer Science and Engineering, Noorul Islam Centre for
Effective Analysis and Predictive Model of Stroke Disease using Classification Methods
Effective Analysis and Predictive Model of Stroke Disease using Classification Methods A.Sudha Student, M.Tech (CSE) VIT University Vellore, India P.Gayathri Assistant Professor VIT University Vellore,
Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing
www.ijcsi.org 198 Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing Lilian Sing oei 1 and Jiayang Wang 2 1 School of Information Science and Engineering, Central South University
Healthcare Data Mining: Prediction Inpatient Length of Stay
3rd International IEEE Conference Intelligent Systems, September 2006 Healthcare Data Mining: Prediction Inpatient Length of Peng Liu, Lei Lei, Junjie Yin, Wei Zhang, Wu Naijun, Elia El-Darzi 1 Abstract
AnalysisofData MiningClassificationwithDecisiontreeTechnique
Global Journal of omputer Science and Technology Software & Data Engineering Volume 13 Issue 13 Version 1.0 Year 2013 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals
A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier
A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier G.T. Prasanna Kumari Associate Professor, Dept of Computer Science and Engineering, Gokula Krishna College of Engg, Sullurpet-524121,
FOREX TRADING PREDICTION USING LINEAR REGRESSION LINE, ARTIFICIAL NEURAL NETWORK AND DYNAMIC TIME WARPING ALGORITHMS
FOREX TRADING PREDICTION USING LINEAR REGRESSION LINE, ARTIFICIAL NEURAL NETWORK AND DYNAMIC TIME WARPING ALGORITHMS Leslie C.O. Tiong 1, David C.L. Ngo 2, and Yunli Lee 3 1 Sunway University, Malaysia,
The Optimality of Naive Bayes
The Optimality of Naive Bayes Harry Zhang Faculty of Computer Science University of New Brunswick Fredericton, New Brunswick, Canada email: hzhang@unbca E3B 5A3 Abstract Naive Bayes is one of the most
Neural Networks for Sentiment Detection in Financial Text
Neural Networks for Sentiment Detection in Financial Text Caslav Bozic* and Detlef Seese* With a rise of algorithmic trading volume in recent years, the need for automatic analysis of financial news emerged.
Data Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
American International Journal of Research in Science, Technology, Engineering & Mathematics
American International Journal of Research in Science, Technology, Engineering & Mathematics Available online at http://www.iasir.net ISSN (Print): 2328-349, ISSN (Online): 2328-3580, ISSN (CD-ROM): 2328-3629
JetBlue Airways Stock Price Analysis and Prediction
JetBlue Airways Stock Price Analysis and Prediction Team Member: Lulu Liu, Jiaojiao Liu DSO530 Final Project JETBLUE AIRWAYS STOCK PRICE ANALYSIS AND PREDICTION 1 Motivation Started in February 2000, JetBlue
How To Solve The Kd Cup 2010 Challenge
A Lightweight Solution to the Educational Data Mining Challenge Kun Liu Yan Xing Faculty of Automation Guangdong University of Technology Guangzhou, 510090, China [email protected] [email protected]
Data Mining based on Rough Set and Decision Tree Optimization
Data Mining based on Rough Set and Decision Tree Optimization College of Information Engineering, North China University of Water Resources and Electric Power, China, [email protected] Abstract This paper
COMBINED METHODOLOGY of the CLASSIFICATION RULES for MEDICAL DATA-SETS
COMBINED METHODOLOGY of the CLASSIFICATION RULES for MEDICAL DATA-SETS V.Sneha Latha#, P.Y.L.Swetha#, M.Bhavya#, G. Geetha#, D. K.Suhasini# # Dept. of Computer Science& Engineering K.L.C.E, GreenFields-522502,
Neural Network Add-in
Neural Network Add-in Version 1.5 Software User s Guide Contents Overview... 2 Getting Started... 2 Working with Datasets... 2 Open a Dataset... 3 Save a Dataset... 3 Data Pre-processing... 3 Lagging...
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
ACCURACY DRIVEN ARTIFICIAL NEURAL NETWORKS IN STOCK MARKET PREDICTION
ACCURACY DRIVEN ARTIFICIAL NEURAL NETWORKS IN STOCK MARKET PREDICTION Selvan Simon 1 and Arun Raoot 2 1 Asst. Professor, P. G. Dept. of Computer Sci., SNDT Women s University, Mumbai [email protected]
A Data Mining Study of Weld Quality Models Constructed with MLP Neural Networks from Stratified Sampled Data
A Data Mining Study of Weld Quality Models Constructed with MLP Neural Networks from Stratified Sampled Data T. W. Liao, G. Wang, and E. Triantaphyllou Department of Industrial and Manufacturing Systems
Question 2 Naïve Bayes (16 points)
Question 2 Naïve Bayes (16 points) About 2/3 of your email is spam so you downloaded an open source spam filter based on word occurrences that uses the Naive Bayes classifier. Assume you collected the
Chapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
Real Stock Trading Using Soft Computing Models
Real Stock Trading Using Soft Computing Models Brent Doeksen 1, Ajith Abraham 2, Johnson Thomas 1 and Marcin Paprzycki 1 1 Computer Science Department, Oklahoma State University, OK 74106, USA, 2 School
Data Mining using Artificial Neural Network Rules
Data Mining using Artificial Neural Network Rules Pushkar Shinde MCOERC, Nasik Abstract - Diabetes patients are increasing in number so it is necessary to predict, treat and diagnose the disease. Data
Data Mining with R. Decision Trees and Random Forests. Hugh Murrell
Data Mining with R Decision Trees and Random Forests Hugh Murrell reference books These slides are based on a book by Graham Williams: Data Mining with Rattle and R, The Art of Excavating Data for Knowledge
Email Spam Detection A Machine Learning Approach
Email Spam Detection A Machine Learning Approach Ge Song, Lauren Steimle ABSTRACT Machine learning is a branch of artificial intelligence concerned with the creation and study of systems that can learn
Summary Data Mining & Process Mining (1BM46) Content. Made by S.P.T. Ariesen
Summary Data Mining & Process Mining (1BM46) Made by S.P.T. Ariesen Content Data Mining part... 2 Lecture 1... 2 Lecture 2:... 4 Lecture 3... 7 Lecture 4... 9 Process mining part... 13 Lecture 5... 13
Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski [email protected]
Introduction to Machine Learning and Data Mining Prof. Dr. Igor Trakovski [email protected] Neural Networks 2 Neural Networks Analogy to biological neural systems, the most robust learning systems
First Semester Computer Science Students Academic Performances Analysis by Using Data Mining Classification Algorithms
First Semester Computer Science Students Academic Performances Analysis by Using Data Mining Classification Algorithms Azwa Abdul Aziz, Nor Hafieza IsmailandFadhilah Ahmad Faculty Informatics & Computing
Comparing Artificial Intelligence Systems for Stock Portfolio Selection
Abstract Comparing Artificial Intelligence Systems for Stock Portfolio Selection Extended Abstract Chiu-Che Tseng Texas A&M University Commerce P.O. BOX 3011 Commerce, Texas 75429 Tel: (903) 886-5497 Email:
ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION
ISSN 9 X INFORMATION TECHNOLOGY AND CONTROL, 00, Vol., No.A ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION Danuta Zakrzewska Institute of Computer Science, Technical
International Journal of Advance Research in Computer Science and Management Studies
Volume 2, Issue 12, December 2014 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
Predict Influencers in the Social Network
Predict Influencers in the Social Network Ruishan Liu, Yang Zhao and Liuyu Zhou Email: rliu2, yzhao2, [email protected] Department of Electrical Engineering, Stanford University Abstract Given two persons
Dr. U. Devi Prasad Associate Professor Hyderabad Business School GITAM University, Hyderabad Email: [email protected]
96 Business Intelligence Journal January PREDICTION OF CHURN BEHAVIOR OF BANK CUSTOMERS USING DATA MINING TOOLS Dr. U. Devi Prasad Associate Professor Hyderabad Business School GITAM University, Hyderabad
Financial Time Series Forecasting with Machine Learning Techniques: A Survey
Financial Time Series Forecasting with Machine Learning Techniques: A Survey Bjoern Krollner, Bruce Vanstone, Gavin Finnie School of Information Technology, Bond University Gold Coast, Queensland, Australia
Random forest algorithm in big data environment
Random forest algorithm in big data environment Yingchun Liu * School of Economics and Management, Beihang University, Beijing 100191, China Received 1 September 2014, www.cmnt.lv Abstract Random forest
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Evaluating the Accuracy of a Classifier Holdout, random subsampling, crossvalidation, and the bootstrap are common techniques for
arxiv:1301.4944v1 [stat.ml] 21 Jan 2013
Evaluation of a Supervised Learning Approach for Stock Market Operations Marcelo S. Lauretto 1, Bárbara B. C. Silva 1 and Pablo M. Andrade 2 1 EACH USP, 2 IME USP. 1 Introduction arxiv:1301.4944v1 [stat.ml]
A New Approach for Evaluation of Data Mining Techniques
181 A New Approach for Evaluation of Data Mining s Moawia Elfaki Yahia 1, Murtada El-mukashfi El-taher 2 1 College of Computer Science and IT King Faisal University Saudi Arabia, Alhasa 31982 2 Faculty
