Decision Support System for predicting Football Game result
|
|
|
- Victoria Nichols
- 10 years ago
- Views:
Transcription
1 Decision Support System for predicting Football Game result João Gomes, Filipe Portela, Manuel Filipe Santos Abstract there is an increase of bookmaker s number over the last decade, leading to the conclusion that the bet houses have obtained profitability in the detriment of its users. Based in this principle arises an opportunity to explore a set of artificial intelligence techniques in order to support the user betting decision. The development of this project aims to support bookmaker's users to increase their profits on bets related to football matches, suggesting to them which bet that they should carry out (home win, draw or away win). To this, it was collected several statistical information related to football games from the Premier League. It was developed a dataset and applied data mining techniques to create a model with good predictive capability. This model was then integrated in a decision support system which allows complement the machine intelligence with human perception. The model developed allowed to have profits of 20% in relation to an initial bankroll. Keywords Decision Support Systems, Data Mining, Football Games Prediction, Decision Support Systems for Football Betting, Knowledge Discovery in Database, Football Bets T I. INTRODUCTION HIS paper presents the first step of a project that is being developed in the area of Decision Support Systems using Data Mining applied to football betting. In the last few years a growing number of bookmakers is been observed in this industry, leading to the conclusion that this area presents itself as a profitable activity for the bookmakers themselves and consequently unfavorable for its users. The project consists in analyzing football games statistics, trying to identify patterns used in these data and then leading to a suggestion for which bet should be held in a given game. The goals of this project are divided in two groups: Get models with good predictive capabilities, aimed to support the users decision in a way that they can perform the best bet in a particular football game; Develop a prototype of an Decision Support System that incorporates the developed models. This work has been supported by FCT - Fundação para a Ciência e Tecnologia within the Project Scope UID/CEC/00319/2013. João Gomes is with Information System Department, University of Minho, Portugal. Filipe Portela is with Algoritmi Research Centre, University of Minho, Guimarães, Portugal. (Corresponding author to provide phone: ; fax: ; [email protected]). Manuel Filipe Santos, is with Algoritmi Research Centre, University of Minho, Guimarães, Portugal. ( [email protected]). The first one can be divided into the following steps: Collect statistical data of football games (number of goals, number of shots, etc...); Make a treatment of data collected; Create predicting data mining models; Make an assessment of the metrics models; Choose the model that satisfies the work requirements. The second objective can be divided into: Create a prototype of a Decision Support System; Integrating the forecasting models previously created in the prototype. After this work it was possible to find a path in order to achieve the defined goals. In this first step logical blocks were developed combined with data mining models in order to predict the better bet. In terms of model assessment, the results were not totally satisfactory being notorious the need of adding other variables and continuing the work. Although the obtained results were not the expected, it was possible to obtain models that are able to achieve good profits. By making a model evaluation, it was also possible to obtain upper than 50% correct results (more than 1/3 of the possibilities (home win, draw or away win) and profits of 20%. The achieved results gave enormous confidence to proceed with this type of research. This paper is divided into seven topics. The first topic presents itself as an introduction to the project made. The second is called "Background" and focuses on the structuration of the theoretical context, in such way that we can ease the understanding of the project. The third topic is called "Methodologies" and in this chapter is made a description of the methodological environment concerning the development of the carried out project. The following topic is "Intelligent Decision Support System" and describes the practical work developed. The fifth topic is titled "Discussion" and it shows the results obtained through tests made evaluating the performance of the system. In the sixth topic is stated a conclusion of the theme. Finally, the last topic discusses the work that is needed to carry out in the future. II. BACKGROUND A. Knowledge Discovery in Data Knowledge Discovery in Database (KDD) is described as the process of identifying and understanding incomprehensible patterns in datasets, being these patterns valuable novelties, and potentially useful [1]. This is an organized process. It is ISBN:
2 performed automatically with an easy exploratory analysis and modeling, occurring in a given data repository [2]. KDD is an iterative process, due the fact that it may be necessary to go back one or more steps to get the results that best suit project requirements [1]. The process is composed by nine steps that can be compressed in the next five [3]: Selection: In this first phase a dataset selection is made to be used and sometimes it is still necessary to create a new dataset. In this phase the objectives to be achieved are also defined and it is chosen what data will be used throughout the process; Pre-processing: this is the phase in which is performed a cleaning and pre-processing of data; Transformation: At this stage the data transformation is performed in order to improve the dataset quality in order to facilitate finding data dependencies; Data Mining: This topic will be addressed in the topic II.B; Evaluation: this is the phase in which is performed an interpretation of the found patterns at the previous phase, the models are evaluated and then it is found a model that meets the objectives defined in the first phase [2]. B. Data Mining Turban et. al [4] stated that the use of Data Mining (DM) from databases would become an activity that would be fundamental for organizations in the near future. This will be an important technique, so the organizations cannot waste any information regarding to their business and customers, having the risk of being overcome by its competitors. DM is a process aiming to find patterns and interesting information in a given dataset [5]. This process was initially defined by Turban et. al [4] as a pattern discovery process, later they improved the concept and define DM as a process that can also be used as a way to analyze the data with the objective of increasing the efficiency and effectiveness of organizations. The DM contains activities that can be divided into two dimensions, forecasting activities and interpretation activities [6]. In the case of this project the goal is to predict the outcome in a football game, so it will be using classification techniques. The objective of this task was to predict the value that any random variable will assume or else estimate the probability of a future event occurs. Estimating therefore the value of a variable called 'dependent', 'target', 'response' [7]. Depending on the target class it is intended to provide the prediction of an activity. It can divided them into two specific groups: classification - if what you want is to predict the label of a class, for example "Victory", "Draw" or "Defeat", or regression if you what to want is to predict a real number "0", "1" or "2" [8]. C. Decision Support Systems Decision Support Systems (DSS) are like an interactive computer system able to supports managers in the decision process by connecting attributes, goals and objectives, in order to solve the semi-structured and unstructured problems [9]. These are, therefore, systems aiming to support decision makers by providing alternative analysis, studying previously made decisions and what influences these decisions had in an organizational context in order to better support the decision [10]. DSS based their development in the phases of the decisionmaking process. Herbert Simon [11] is the author of the methodology that gathers a greater consensus among the community. Initially he defines the decision-making process as having only three phases: Intelligence, Design and Choice. Later, together with other authors and gathering consensus they defined a fourth phase Implementation, therefore, argued that the implementation of what had been previously decided was important enough to create an individual stage for the several authors [12]. D. Similar Systems After making a research on systems that have the objective to predict football matches results, it was possible to verify that there are platforms aiming to support the gambler decision in what will be the best bet to make in a football matches. The DSS in development by this work has a particular feature not found in any of the existing platforms. The operation of these platforms is based on some mathematical calculations. From the study made, the following web platforms were found: There is also applications for smartphones available in the AppStore or PlayStore which have the same purpose of previously described web platforms. KickOff Smart Betting Made Simple Smart BET Prediction FootWin Sports Prediction The system that is being developed will be distinguished from these by the use of DM techniques. III. METHODOLOGIES, MATERIAL AND METHODS The main project follows the Design Science Research (DSR) Methodology. DSR should lead to the production of a viable artefact in the form of a building, a model, a method, or an instantiation. The main goal is to develop a technologybased solutions able to solve important and relevant business problems [13]. To complement this methodology it was used a combination of two other methodologies, the CRISP-DM and the Simon phases of decision making. For example, the first phase is composed by a combination of the Intelligence phase of decision-making and Business Understanding phase of CRISP-DM methodology. The complete methodology was presented in Table 1. ISBN:
3 Table 1 - Combined Methodology CRISP-DM Decisionmaking Business Understanding Data Understanding Data Transformation Modelling Evaluation Deployment Intelligence Design Choice Implementation Combined Methodology - DSR Phase 1 Phase 2 Phase 3 Phase 4 IV. DECISION SUPPORT SYSTEM As previously mentioned this project was developed using a combination of three methodologies, CRISP-DM, the phases of decision-making and the DSR. At this topic is present the work developed in each phase which has been generated by the combination of the three methodologies. A. Phase 1 In the first step was performed the identification of the problem. After making an analysis of the bookmakers market it was verified an increase of them in recent years. This situation leads to the conclusion that the profit made by them is lucrative, on the opposite only a small percentage of their users get in long-term considerable profit. Then came the idea of creating a Decision Support System (DSS) that has the ability of help its user to make the best decision at the time it will make a bet on a certain football game, suggesting which bet must be made. This problem corresponds to a semi-structured decision because it is necessary to complement the existing data (already collected) in the dataset with information possessed by users. B. Phase 2 In this second phase, the project goals were defined. An analysis was made and the risks and possible restrictions that could exist for its development were determined. The development of a DSS was made through the Exsys Corvid tool that would integrate data mining models developed from the WEKA tool, creating a system that combines intelligence machine with human perception. Also in this phase it was carried out a dataset considered interesting and valuable for the development of the project. After making the dataset collection a detailed analysis (statistics) of it was performed in order to better understand the data that it contains and their distribution. The data collected are detailed in Table 2. The data are from 13 seasons of the English Premier League until the season 2012/2013, which corresponds to statistical information of 4940 games. Table 2 - Original Variables Original Variables Date HomeTeam AwayTeam FTHG FTAG FTR HTHG HTAG HTR Attendance Referee HS AS HST AST HHW AHW HC AC HF AF HO AO HY AY HR AR ODDS Description Match Date (dd/mm/yy) Home Team Away Team Full Time Home Team Goals Full Time Away Team Goals Full Time Result (H=Home Win, D=Draw, A=Away Win) Half Time Home Team Goals Half Time Away Team Goals Half Time Result (H=Home Win, D=Draw, A=Away Win) Crowd Attendance Name of Match Referee Home Team Shots Away Team Shots Home Team Shots on Target Away Team Shots on Target Home Team Hit Woodwork Away Team Hit Woodwork Home Team Corners Away Team Corners Home Team Fouls Committed Away Team Fouls Committed Home Team Offsides Away Team Offsides Home Team Yellow Cards Away Team Yellow Cards Home Team Red Cards Away Team Red Cards Betting odds data from several bookmakers After collecting the data, it was necessary to make the respective analysis. For it, an extract, transform and loading (ETL) process was designed. This process is presented in the Figure 1. In this figure is possible observe all the process since database creation, passing by the prediction phase and concluding with the dataset preparation which will allow its use in Exsys Corvid tool. Figure 1 - ETL Process This ETL is a simple process that begins with the creation of two databases, one temporary database and another one that will be later filled with all the data processed. The original dataset was uploaded to the temporary database. Then the field Date" was updated. This field was separated into three other fields, Day, Month and Year in order to understand for example the weekday game. The data presented in the dataset are mostly information that can only be obtained at the end of each game. Therefore it was necessary to identify other variables that could be known before the game starts. In this step several indicators for the home team ISBN:
4 and for the away team were created. These indicators (table 3) are averages of the variables presented in the original dataset. Finally and before loading the final database the existing null fields were removed. Table 3 - New Variables New Variables Description goal average of the home AverageGoalsHomeTeam disputed at home. goal average of the away AverageGoalsAwayTeam disputed out. average shots of the home AverageShotsHomeTeam disputed at home. average shots of away AverageShotsAwayTeam disputed out. average shots on goal the AverageShotsTargetHomeTeam home disputed at home. average shots on goal of AverageShotsTargetAwayTeam away disputed out. HomeWinLastFive last five games of the home team in home games. AwayWinLastFive last 5 games of away team in matches disputed outside. HomeWinLastFiveConfrontation last 5 home team's games in confrontation with the away team in home games. last 5 games of away team AwayWinLastFiveConfrontation matched up against the home team in matches disputed outside. Predict result predicted by the data mining model. The next step, after loading the final database, was focused in the induction of classification models with the data obtained after them been processed and transformed. The DM models were induced using three different DM techniques: Naive Bayes (NB), Decision Trees (J48) and Support Vector Machine (LibSVM) and two sampling methods, 10-Folds Cross Validation (CV) and Percentage Split where 66% of data was used to carry out training and 33 for testing the models The data mining models was created and tested using a scenario. The variables used in this scenario were Home Team; Away Team; AverageGoalsHomeTeam; AverageGoalsAwayTeam; AverageShotsHomeTeam; AverageShotsAwayTeam; AverageShotsTargetHomeTeam; AverageShotsTargetAwayTeam; HomeWinLastFive; AwayWinLastFive; HomeWinLastFiveConfrontation; AwayWinLastFiveConfrontation. The target variable was: FTR (Final Time Result). Then a transformation was made to the field predicted by the models, making it a field containing only numeric data. This transformation was important because only in this way the field was recognized by Exsys Corvid tool. Finally the database was extracted to a text file, in order to be imported in Exsys Corvid. C. Phase 3 As result is possible to get three distinct predictions (Home Win- 1, Draw- 2 and AwayWin- 3 ). To analyze the models, the accuracy metric provided by Confusion Matrix was used. The six models (1 scenario x 3 techniques x 2 sampling methods) were created using all the data available. In this first phase of the project (exploration phase) it was not defined any extra scenario (combination of different variables). The obtained confusion matrix through these models has a size of 3x3 as can be seen in the following tables (table 4 to 9). Each table is corresponding to each created model. Table 4 - Model 1 Table 5 - Model 2 NB Accuracy , , , Accuracy Average 0, J Accuracy , , , Accuracy Average 0, ISBN:
5 Table 6 - Model 3 LibSVM Accuracy , , , Accuracy Average 0, Table 7 - Model 4 NB Accuracy , , , Accuracy Average 0,49221 Table 8 - Model 5 J Accuracy , , , Accuracy Average 0, Table 9 - Model 6 LibSVM Accuracy , , , Accuracy Average 0, D. Phase 4 The goal of this phase was to ensure that what it was defined in the previous phases it is applied. In this phase was performed all the development process from the data collection, data treatment and data transformation. Then the decision support system was developed. In this task the first prototype of the project turned up. The tool Exsys Corvid was used to develop the prototype. In this phase several system development designs were tested. Below it is presented the "models" which allowed to achieve best results (user profits). The first step was used to import the data obtained in the ETL process. The variables that the system would use were also defined. Besides the already existing variables, four new variables were defined. Two for each team, because in the moment of decision-making, there are not game data. These data are normally only known a few moments before the games start. For that the system did the following questions to the user: Which is the present classification of the home team? Which is the present classification of the away team? How many holders, from the user's opinion, are unavailable of home team? How many holders, from the user's opinion, are unavailable of away team? Moreover, four logic blocks were created, one was responsible for setting the value of the variables associated with the home team (Figure 2), and the other was responsible for defining the variables associated with away team. Another block was designed to do the calculation of each team score. Finally the last block makes a relationship between the scores of each team in order to suggest the best bet for the user, in this phase the data mining models are induced. It was also created one command block to the system knowing the sequence that must take (Figure 3). Table 10 presents an overview of the achieved results for each model, the sampling method and algorithm used. Table 10 - Models Evaluation Model Sampling Method Algorithm Accuracy Model 1 Percentage Split NaiveBayes Model 2 Percentage Split J Model 3 Percentage Split LibSVM Model 4 10-Folds CV NaiveBayes Model 5 10-Folds CV J Model 6 10-Folds CV LibSVM The accuracy obtained was identical in all models, being model 3 the best. So model 3 was chosen with an accuracy of 50.8%. Even though this value be not high, it is better than the 33% (random probability). This work represents an early stage of the project which leads to believe that it is a great starting point. Figure 2 - Logic Block Home Figure 3 - Command Block ISBN:
6 V. DISCUSSION After the system be created, it was necessary to make an assessment of it. Regarding this fact, tests were performed at seven rounds of the English Premier League matches. They were simulated betting of 100 in each of the 10 games by each round of matches. The obtained results are shown in Table 11. Table 11 - System Performed Tests Round Percentage of Return Correct Bets (Bets of 100 by game) Round 5 80 % 689 Round % -418 Round % 11 Round % 713 Round % 480 Round % -245 Round % 179 Total Average = 54,29% 1409 These results shown that even though the model accuracy results are around 50% it was possible to have good profits. In total in this simulation was bet 7000 and obtained a return of 1409, about 20%, which is certainly a value to be taken into account. In the Table 12 is presented in more detail the results obtained in the fifth round of the English Premier League in season 2013/2014. Table 12 - Return in fifth round Round 5 Game Result (1,2,3) Corvid Output (1,2,3) Return (Bets of 100 ) Norwich x A. Villa Liverpool x Southampton Newcastle x Hull City West Brom x Sunderland West Ham x Everton Chelsea x Fulham Arsenal x Stoke C. Palace x Swansea Cardiff x Tottenham Man City x Man Utd Total 689 VI. CONCLUSION This work is a first step in order to develop an Intelligent Decision Support System to predict football game results. Although the achieved results are not totally satisfactory it was possible to prove the possibility of increasing the profits on football games betting through the use of data mining classification models. The system used data of fourteen seasons of English Premier League. Although the accuracy obtained was not very good, it was upper to 33% (probability if it was a matter of luck). The obtained profit proves that the system has enough value to continue their research and development. Scientifically this type of models has a high level of evolution and shown to be a good option to the researchers which wants to explore this area. In conclusion this is a project that has the potential to make the test of creating different data mining models with different types of target class to be able to make the decision support with the highest accuracy possible. VII. FUTURE WORK In the future the objective is to improve data mining models accuracy by increasing system reliability and consequently obtaining a higher profit. For that it will be considered the following aspects: To explore different types of data mining techniques; To create a new result variable, instead of there being three possibilities (home team win, draw or away team win), only have a chance to bet in favor of a team or against it, for example; To create new variables, as the weather, the players rest time; To use other variables which are in original dataset and which have not been used; Develop an Intelligent Decision Support System combining rules with the data mining engine. REFERENCES [1] U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, From data mining to knowledge discovery in databases, AI Mag., vol. 17, no. 3, pp , [2] L. Maimon, Oded; Rokach, Data Mining and Knowledge Discovery Handbook, 2nd ed [3] U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, Knowledge Discovery and Data Mining : Towards a Unifying Framework, Kdd, [4] E. Turban, R. Sharda, and J. Aronson, Business intelligence: a managerial approach, Tamu-Commerce.Edu [5] J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques [6] C. Vercellis, Business Intelligence: Data Mining and Optimization for Decision Making [7] S. Tuffery, Data Mining and Statistics for Decision-Making [8] E. Turban, Decision Support and Business Intelligence, vol [9] H. R. Nemati, D. M. Steiger, L. S. Iyer, and R. T. Herschel, Knowledge warehouse: An architectural integration of knowledge management, decision support, artificial intelligence and data warehousing, Decis. Support Syst., vol. 33, pp , [10] J. P. Shim, M. Warkentin, J. F. Courtney, D. J. Power, R. Sharda, and C. Carlsson, Past, present, and future of decision support technology, Decis. Support Syst., vol. 33, pp , [11] H. A. Simon, The New Science of Management Decision [12] H. a. Simon, The new science of management [13] A. R. Hevner, S. T. March, J. Park, and S. Ram, Design Science in Information Systems Research, MIS Q., vol. 28, no. 1, pp , ISBN:
How to cash-in on ALL football matches without ever picking a winning team
Football Bets Explained: How to cash-in on ALL football matches without ever picking a winning team Football betting has been one of the growth areas of the last five years. A staggering 20 billion will
The most reliable cards bets in the Premier League. bets that paid out in 84%, 89% and 95% of games last season
The most reliable cards bets in the Premier League bets that paid out in 84%, 89% and 95% of games last season www.footballbettingdata.com Inside this special report from Matt Nesbitt at Football Betting
The 7 Premier League trends the bookies don t want YOU to know...
If you knew something happened 7, 8 or even 9 times out of 10 you d bet on it happening again wouldn t you? The 7 Premier League trends the bookies don t want YOU to know... Inside this special report
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
Prediction on Soccer Matches using Multi- Layer Perceptron
Prediction on Soccer Matches using Multi- Layer Perceptron ECE 539 Project Report Author: Yue Weng Mak ID: 903-051-7735 Contents Introduction... 3 Motivation... 4 Experiment... 4 Results... 5 Improvements...
Football Match Winner Prediction
Football Match Winner Prediction Kushal Gevaria 1, Harshal Sanghavi 2, Saurabh Vaidya 3, Prof. Khushali Deulkar 4 Department of Computer Engineering, Dwarkadas J. Sanghvi College of Engineering, Mumbai,
Modelling the Scores of Premier League Football Matches
Modelling the Scores of Premier League Football Matches by: Daan van Gemert The aim of this thesis is to develop a model for estimating the probabilities of premier league football outcomes, with the potential
Numerical Algorithms for Predicting Sports Results
Numerical Algorithms for Predicting Sports Results by Jack David Blundell, 1 School of Computing, Faculty of Engineering ABSTRACT Numerical models can help predict the outcome of sporting events. The features
How to Make s Using my Football Betting system. Chris Clark
How to Make s Using my Football Betting system By Chris Clark Copyright 2004 C.S. Clark All rights reserved. No part of this book may be duplicated, stored in a retrieval system, resold or used as part
Predicting outcome of soccer matches using machine learning
Saint-Petersburg State University Mathematics and Mechanics Faculty Albina Yezus Predicting outcome of soccer matches using machine learning Term paper Scientific adviser: Alexander Igoshkin, Yandex Mobile
Guide to Spread Betting
www.footballbettingdata.co.uk Guide to Spread Betting Part One: How to get a 9/2 payout from a 5/6 shot Are you interested in boosting your profits? Of course you are. Who isn t? There is more to making
Relational Learning for Football-Related Predictions
Relational Learning for Football-Related Predictions Jan Van Haaren and Guy Van den Broeck [email protected], [email protected] Department of Computer Science Katholieke Universiteit
澳 門 彩 票 有 限 公 司 SLOT Sociedade de Lotarias e Apostas Mútuas de Macau, Lda. Soccer Bet Types
Soccer Bet Types 1. Asian Handicap Bet on a team to win in a designated match. Bets will be fully refunded in the case of a draw result after calculating handicap-goal*. *Handicap-goal Handicap-goal applies
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
Soccer Bet Types Content
Soccer Bet Types Content Content 1 1. Asian Handicap 2 Examples on Asian Handicap All-Up Win bets 3 2. Win/Draw/Win 5 3. All-Up Win 6 Banker Combo 9 Multiple Combo 12 4. Over/Under 14 5. Correct Scores
You re keen to start winning, and you re keen to start showing everyone that you re the great undiscovered tactical mastermind.
Bet4theBest Rulebook You re keen to start winning, and you re keen to start showing everyone that you re the great undiscovered tactical mastermind. But first, you want to know how it all works; the terminology
Dynamic Data in terms of Data Mining Streams
International Journal of Computer Science and Software Engineering Volume 2, Number 1 (2015), pp. 1-6 International Research Publication House http://www.irphouse.com Dynamic Data in terms of Data Mining
Predicting sports events from past results
Predicting sports events from past results Towards effective betting on football Douwe Buursma University of Twente P.O. Box 217, 7500AE Enschede The Netherlands [email protected] ABSTRACT
Evidence from the English Premier League and the German Bundesliga. September 21, 2013. NESSIS 2013, Harvard University
Evidence from the English Premier League and the German Bundesliga NESSIS 2013, Harvard University September 21, 2013 Abstract soccer plays a significant role in scoring, about 15% of all goals scored
Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control
Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control Andre BERGMANN Salzgitter Mannesmann Forschung GmbH; Duisburg, Germany Phone: +49 203 9993154, Fax: +49 203 9993234;
Glossary of bet types
Glossary of bet types The description is meant as a guide the online rules over-ride any possible differences given here. All bets are 90mins only no extra time or penalties included Kokusbets Simple Guide
Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction
Data Mining and Exploration Data Mining and Exploration: Introduction Amos Storkey, School of Informatics January 10, 2006 http://www.inf.ed.ac.uk/teaching/courses/dme/ Course Introduction Welcome Administration
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Sports betting: A source for empirical Bayes
Research practice 1: Progress report Mateo Graciano-Londoño Mathematical Engineering Student Andrés Ramírez-Hassan Department of Economics Tutor EAFIT University, Medelĺın Colombia October 2 nd, 2015 Previous
THE INTELLIGENT BUSINESS INTELLIGENCE SOLUTIONS
THE INTELLIGENT BUSINESS INTELLIGENCE SOLUTIONS ADRIAN COJOCARIU, CRISTINA OFELIA STANCIU TIBISCUS UNIVERSITY OF TIMIŞOARA, FACULTY OF ECONOMIC SCIENCE, DALIEI STR, 1/A, TIMIŞOARA, 300558, ROMANIA [email protected],
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
Football Match Result Predictor Website
Electronics and Computer Science Faculty of Physical and Applied Sciences University of Southampton Matthew Tucker Tuesday 13th December 2011 Football Match Result Predictor Website Project supervisor:
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
SECRET BETTING CLUB FINK TANK FREE SYSTEM GUIDE
SECRET BETTING CLUB FINK TANK FREE SYSTEM GUIDE WELCOME TO THE FINK TANK FOOTBALL VALUE SYSTEM Welcome to this free download of the Fink Tank Football Value System - as devised by the Secret Betting Club
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support
DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support Rok Rupnik, Matjaž Kukar, Marko Bajec, Marjan Krisper University of Ljubljana, Faculty of Computer and Information
Data quality in Accounting Information Systems
Data quality in Accounting Information Systems Comparing Several Data Mining Techniques Erjon Zoto Department of Statistics and Applied Informatics Faculty of Economy, University of Tirana Tirana, Albania
Information Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli ([email protected])
In this presentation, you will be introduced to data mining and the relationship with meaningful use.
In this presentation, you will be introduced to data mining and the relationship with meaningful use. Data mining refers to the art and science of intelligent data analysis. It is the application of machine
Big Data solutions to support Intelligent Systems and Applications
Big solutions to support Intelligent Systems and Applications Luciana Lima, Filipe Portela, Manuel Filipe Santos, António Abelha and José Machado. Abstract in the last years the number of data available
Sports betting odds: A source for empirical Bayes
Sports betting odds: A source for empirical Bayes Mateo Graciano Londoño Andrés Ramírez Hassan, EAFIT University Medellín, Colombia November 25, 2015 Abstract This paper proposes a new betting strategy
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Betting Terms Explained www.sportsbettingxtra.com
Betting Terms Explained www.sportsbettingxtra.com To most people betting has a language of its own, so to help, we have explained the main terms you will come across when betting. STAKE The stake is the
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
Neural Networks in Data Mining
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V6 PP 01-06 www.iosrjen.org Neural Networks in Data Mining Ripundeep Singh Gill, Ashima Department
Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing
www.ijcsi.org 198 Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing Lilian Sing oei 1 and Jiayang Wang 2 1 School of Information Science and Engineering, Central South University
Beating the NCAA Football Point Spread
Beating the NCAA Football Point Spread Brian Liu Mathematical & Computational Sciences Stanford University Patrick Lai Computer Science Department Stanford University December 10, 2010 1 Introduction Over
Data Mining Applications in Manufacturing
Data Mining Applications in Manufacturing Dr Jenny Harding Senior Lecturer Wolfson School of Mechanical & Manufacturing Engineering, Loughborough University Identification of Knowledge - Context Intelligent
Enhanced Boosted Trees Technique for Customer Churn Prediction Model
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V5 PP 41-45 www.iosrjen.org Enhanced Boosted Trees Technique for Customer Churn Prediction
FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS
FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS Breno C. Costa, Bruno. L. A. Alberto, André M. Portela, W. Maduro, Esdras O. Eler PDITec, Belo Horizonte,
Guide to Spread Betting
www.footballbettingdata.co.uk Guide to Spread Betting Part Two: The more right you are, the more you win In Part One of your exclusive guide we looked at how Spread Betting works. All football markets
A Knowledge Management Framework Using Business Intelligence Solutions
www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For
Data Mining Application in Direct Marketing: Identifying Hot Prospects for Banking Product
Data Mining Application in Direct Marketing: Identifying Hot Prospects for Banking Product Sagarika Prusty Web Data Mining (ECT 584),Spring 2013 DePaul University,Chicago [email protected] Keywords:
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
Towards applying Data Mining Techniques for Talent Mangement
2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Towards applying Data Mining Techniques for Talent Mangement Hamidah Jantan 1,
Cost Drivers of a Parametric Cost Estimation Model for Data Mining Projects (DMCOMO)
Cost Drivers of a Parametric Cost Estimation Model for Mining Projects (DMCOMO) Oscar Marbán, Antonio de Amescua, Juan J. Cuadrado, Luis García Universidad Carlos III de Madrid (UC3M) Abstract Mining is
An opinion mining Partial Least Square Path Modeling for football betting
An opinion mining Partial Least Square Path Modeling for football betting Mohamed El Hamdaoui, Jean-Valère Cossu LIA/Université d Avignon et des Pays de Vaucluse 39 chemin des Meinajaries, Agroparc BP
Financial Trading System using Combination of Textual and Numerical Data
Financial Trading System using Combination of Textual and Numerical Data Shital N. Dange Computer Science Department, Walchand Institute of Rajesh V. Argiddi Assistant Prof. Computer Science Department,
Data Warehousing and Data Mining in Business Applications
133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business
Premier League Guide 08/09 EX CITEMENT. add even more. this season
add even more EX CITEMENT this season www.spreadex.com Fixtures August: 9 Start of Football League season 16 Start of Premier League season 20 England v Czech Republic, friendly 29 Man Utd v Zenit St Petersburg,
Chapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
Comparison of Data Mining Techniques used for Financial Data Analysis
Comparison of Data Mining Techniques used for Financial Data Analysis Abhijit A. Sawant 1, P. M. Chawan 2 1 Student, 2 Associate Professor, Department of Computer Technology, VJTI, Mumbai, INDIA Abstract
Rating Systems for Fixed Odds Football Match Prediction
Football-Data 2003 1 Rating Systems for Fixed Odds Football Match Prediction What is a Rating System? A rating system provides a quantitative measure of the superiority of one football team over their
ONLINE SPORTS GAMBLING: A LOOK INTO THE EFFICIENCY OF BOOKMAKERS ODDS AS FORECASTS IN THE CASE OF ENGLISH PREMIER LEAGUE
ONLINE SPORTS GAMBLING: A LOOK INTO THE EFFICIENCY OF BOOKMAKERS ODDS AS FORECASTS IN THE CASE OF ENGLISH PREMIER LEAGUE By: Jasmine Siwei Xu Thesis Advisor: Professor Roger Craine Undergraduate Economics
Grid Density Clustering Algorithm
Grid Density Clustering Algorithm Amandeep Kaur Mann 1, Navneet Kaur 2, Scholar, M.Tech (CSE), RIMT, Mandi Gobindgarh, Punjab, India 1 Assistant Professor (CSE), RIMT, Mandi Gobindgarh, Punjab, India 2
Chapter Managing Knowledge in the Digital Firm
Chapter Managing Knowledge in the Digital Firm Essay Questions: 1. What is knowledge management? Briefly outline the knowledge management chain. 2. Identify the three major types of knowledge management
Chapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam
ECLT 5810 E-Commerce Data Mining Techniques - Introduction Prof. Wai Lam Data Opportunities Business infrastructure have improved the ability to collect data Virtually every aspect of business is now open
Business Intelligence. Data Mining and Optimization for Decision Making
Brochure More information from http://www.researchandmarkets.com/reports/2325743/ Business Intelligence. Data Mining and Optimization for Decision Making Description: Business intelligence is a broad category
Data Mining Applications in Fund Raising
Data Mining Applications in Fund Raising Nafisseh Heiat Data mining tools make it possible to apply mathematical models to the historical data to manipulate and discover new information. In this study,
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
How To Use Data Mining For Knowledge Management In Technology Enhanced Learning
Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanbul, Turkey, May 27-29, 2007 115 Data Mining for Knowledge Management in Technology Enhanced Learning
DATA ANALYSIS USING BUSINESS INTELLIGENCE TOOL. A Thesis. Presented to the. Faculty of. San Diego State University. In Partial Fulfillment
DATA ANALYSIS USING BUSINESS INTELLIGENCE TOOL A Thesis Presented to the Faculty of San Diego State University In Partial Fulfillment of the Requirements for the Degree Master of Science in Computer Science
STATISTICA. Financial Institutions. Case Study: Credit Scoring. and
Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT
Data mining and football results prediction
Saint Petersburg State University Mathematics and Mechanics Faculty Department of Analytical Information Systems Malysh Konstantin Data mining and football results prediction Term paper Scientific Adviser:
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
Does NFL Spread Betting Obey the E cient Market Hypothesis?
Does NFL Spread Betting Obey the E cient Market Hypothesis? Johannes Harkins May 2013 Abstract In this paper I examine the possibility that NFL spread betting does not obey the strong form of the E cient
Making Sense of the Mayhem: Machine Learning and March Madness
Making Sense of the Mayhem: Machine Learning and March Madness Alex Tran and Adam Ginzberg Stanford University [email protected] [email protected] I. Introduction III. Model The goal of our research
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
Adaptive Business Intelligence (ABI): Presentation of the Unit
Adaptive Business Intelligence (ABI): Presentation of the Unit MAP-i PhD (Edition 2015/16) Lecture Team: Manuel Filipe Santos (University of Minho); Paulo Cortez (University of Minho); Rui Camacho (University
Predicting Soccer Match Results in the English Premier League
Predicting Soccer Match Results in the English Premier League Ben Ulmer School of Computer Science Stanford University Email: [email protected] Matthew Fernandez School of Computer Science Stanford University
Index Contents Page No. Introduction . Data Mining & Knowledge Discovery
Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.
Man Vs Bookie. The 3 ways to make profit betting on Football. Man Vs Bookie Sport Betting
Man Vs Bookie The 3 ways to make profit betting on Football Sports Betting can be one of the most exciting and rewarding forms of entertainment and is enjoyed by millions of people around the world. In
EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH
EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH SANGITA GUPTA 1, SUMA. V. 2 1 Jain University, Bangalore 2 Dayanada Sagar Institute, Bangalore, India Abstract- One
COURSE SYLLABUS. Enterprise Information Systems and Business Intelligence
MASTER PROGRAMS Autumn Semester 2008/2009 COURSE SYLLABUS Enterprise Information Systems and Business Intelligence Instructor: Malov Andrew, Master of Computer Sciences, Assistant,[email protected] Organization
pi-football: A Bayesian network model for forecasting Association Football match outcomes
Working Paper, draft 5, February 2012 pi-football: A Bayesian network model for forecasting Association Football match outcomes ANTHONY C. CONSTANTINOU *, NORMAN E. FENTON AND MARTIN NEIL Risk Assessment
Information Visualization WS 2013/14 11 Visual Analytics
1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
How to Earn Extra N 50,000 Weekly From NAIRABET Rebranded
How to Earn Extra N 50,000 Weekly From NAIRABET Rebranded Copiled by: Omani Chukwunalu www.9jahub.com More Updates At www.9jahub.com Now Page 1 Disclaimer and Legal Notices. While every effort has been
Analytics on Big Data
Analytics on Big Data Riccardo Torlone Università Roma Tre Credits: Mohamed Eltabakh (WPI) Analytics The discovery and communication of meaningful patterns in data (Wikipedia) It relies on data analysis
SUNDERLAND PROGRAMMES. 1946-47 Aways 30.00 unless states Aston Villa, Brentford, Charlton, Chelsea, Derby, Portsmouth, Sheffield United
1946-47 Homes 30.00 unless stated SUNDERLAND PROGRAMMES 1946-47 Aways 30.00 unless states Aston Villa, Brentford, Charlton, Chelsea, Derby, Portsmouth, Sheffield United 1947-48 Homes 30.00 unless stated
Beating the MLB Moneyline
Beating the MLB Moneyline Leland Chen [email protected] Andrew He [email protected] 1 Abstract Sports forecasting is a challenging task that has similarities to stock market prediction, requiring time-series
Lotto Master Formula (v1.3) The Formula Used By Lottery Winners
Lotto Master Formula (v.) The Formula Used By Lottery Winners I. Introduction This book is designed to provide you with all of the knowledge that you will need to be a consistent winner in your local lottery
Decision Support and Business Intelligence Systems. Chapter 1: Decision Support Systems and Business Intelligence
Decision Support and Business Intelligence Systems Chapter 1: Decision Support Systems and Business Intelligence Types of DSS Two major types: Model-oriented DSS Data-oriented DSS Evolution of DSS into
Evaluating an Integrated Time-Series Data Mining Environment - A Case Study on a Chronic Hepatitis Data Mining -
Evaluating an Integrated Time-Series Data Mining Environment - A Case Study on a Chronic Hepatitis Data Mining - Hidenao Abe, Miho Ohsaki, Hideto Yokoi, and Takahira Yamaguchi Department of Medical Informatics,
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati [email protected], [email protected]
Software project cost estimation using AI techniques
Software project cost estimation using AI techniques Rodríguez Montequín, V.; Villanueva Balsera, J.; Alba González, C.; Martínez Huerta, G. Project Management Area University of Oviedo C/Independencia
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
Business Intelligence and Decision Support Systems
Chapter 12 Business Intelligence and Decision Support Systems Information Technology For Management 7 th Edition Turban & Volonino Based on lecture slides by L. Beaubien, Providence College John Wiley
