Discovery of students academic patterns using data mining techniques

Size: px
Start display at page:

Download "Discovery of students academic patterns using data mining techniques"

Transcription

1 Discovery of students academic patterns using data mining techniques Mr. Shreenath Acharya Information Science & Engg department St. Joseph Engineering College Mangalore, India Ms. Madhu N Information Science & Engg department St. Joseph Engineering College Mangalore, India nmadhu.nayak@gmail.com Abstract - Knowledge discovery is an emerging field which combines the techniques from mathematics, statistics, algorithms and Artificial Intelligence to extract the knowledge. Data mining is a main phase of Knowledge Discovery in Databases (KDD) for extracting the knowledge based on the patterns and their correlation by the application of appropriate association rules to the informations available from the data set. The outcome of the KDD is used to analyse or predict on the future aspects in any area of considerations. In this paper we propose an analysis and prediction of students placements based on the historical informations from the database by considering the students information at different confident levels and support counts to generate the association rules. The widely used algorithm in data mining ie, apriori algorithm is specifically considered for the extraction of the knowledge. Keywords: KDD, Data mining, analysis, rules 1. Introduction The explosive growth in stored or transient data has generated an urgent need for new techniques and automated tools that can intelligently assist us in transforming the vast amounts of data into useful information and knowledge [6]. Today, techniques for the discovery of patterns hidden in large data sets, focusing on issues relating to their feasibility, usefulness, effectiveness and scalability are present. Hence, the proposed system aims at discovery of knowledge from a huge database. Patterns of interest will be mined which will prove useful in near future using available techniques. The proposed model is intended to mine the interesting patterns from the student academic records and placement database. Nowadays, organizations as well as students need to take decisions appropriately to satisfy their academic requirements. An educational organization should be aware of the trends of placements or recruitments in their institution. Even the students, who would like to pursue any course in particular institution, would want to know the placement trends of that institution. Therefore, the useful patterns that are generated would guide the academicians and the students to make decisions more confidently. Moreover, the patterns generated from student academic records will help the institution to analyse the academic performance of the students. Hence, this work would behave as a decision support tool for the management system to improve their policy-making, setting new strategies and having more advanced decision making procedures. The remaining section of the paper is organized as follows. Section II explores the various development and their reviews, section III describes the model architecture of the proposed system, section IV describes the methodology & implementation, section V provides an analysis of the obtained results and section VI draws the conclusion and the future scope. 2. Literature Review The developments in the computing along with their specific requirements has led to increased requirements of large volumes of the complex data. It will be cumbersome to analyse those datas manually as per their need in ISSN : Vol. 4 No. 06 June

2 an application. Thus data mining plays a vital role in extracting the hidden information from the large data sets (databases). Faouzi Mhamdi et al [1] described the Knowledge discovery saying that it provides new concepts or concept relationships hidden in large volumes of raw data which has the capacity to automate complex search and data analysis tasks. They also specified that data mining extracts nuggets of knowledge to be used in verification of hypothesis or the prediction and knowledge explanation and its different phases are data preprocessing, data processing or data mining and data post processing. Usama Fayyad [2] presents an overview of KDD and Data mining in the applications depicting that they are at the intersection of several disciplines including statistics, databases, pattern recognition/ai, visualization, high performance and parallel computing. He also describes the significance of Data Mining which provides techniques that allow managers (higher officials) to identify the valid, useful, understandable correlation and patterns of data. Mykhaylo Lobur et al [3] provided an insight into the different repositories of information like data warehouses, transactional databases; relational databases etc. leading to varied approaches for the data mining. They also discussed about the four basic mining approaches supported by different mining technologies: predictive model creation supported by supervised induction techniques, link analysis by association & sequence discovery techniques, DB segmentation supported by clustering techniques, and deviation detection supported by statistical techniques. They described Knowledge Discovery as a process to extract the specific patterns of interest from the data using varied data mining algorithms. Timo Horeis et al [4] described the capability of the conventional systems for knowledge discovery and data mining revealing their ability to extract valid rules from huge data sets. These extracted rules describe the dependencies between attributes and classes in a quantitative way. They also discussed the effect of fusion of this knowledge combined with the qualitative knowledge from several experts resulting in more comprehensive knowledge about an application area. M. Lobur et al [5] defined KDD as a process of discovering useful knowledge from the data and data mining as a step in it. They gave an insight into the main goal of KDD as extracting high-level knowledge from low-level data in the large data set context. They also revealed the structure of KDD software systems often embedding statistical procedures for sampling and modeling data, evaluating hypothesis and handling noise within an overall knowledge discovery framework. Qian Wan et al [7] discussed about the practical usefulness of data mining techniques necessitating an approach which would hope to bring a promising avenue to look at the data from a new angle in order to allow us find new, useful and actionable patterns. Li zhu et al [8] described the association rules in the context of data mining concentrating only on improving the efficiency of the algorithm neglecting the users understanding and participation. They revealed the fact that students historical records stored in the university databases could be a data source for mining the students subjective interest and interest degree so that their performance may be improved through teaching and personal trainings. The proposed data mining model would eventually help educational institutions to improve their decisionmaking approach. 3. System Architecture Institutions do not rely on any system to predict the performance of their students placement. This may result in students becoming unproductive professionals as they are unable to identify their field of interest. So it would be better if we have a system to help educational institutions to produce qualitative students by providing them appropriate guidance. The discovery of the student behaviour patterns assists not only the institution but also the students to help them take right decisions. To justify, if we consider the present educational institutions, we do not find any system which finds patterns of various kinds on the performance of the previous students. Manually doing such a task of finding patterns would be erroneous as large amount of data has to be surfed in order to come to a conclusion. Making the system think would help to take decisions at a faster pace. Thus, the knowledge discovery process with its different phases will prove to be an effective and efficient mechanism to extract the patterns of interest from a database which could lead to right decisions by the concerned persons on any area of their considerations. ISSN : Vol. 4 No. 06 June

3 Figure 1 describes the process of knowledge discovery as an iterative sequence of the following steps: 5 Practical application of the Results A step to produce useful patterns/models from data, under some acceptable computational efficiency limitations 3 4 Data Mining: Extract Patterns/Models Interpret & Evaluate discovered knowledge 2 Collect & Preprocess data 1 Understand domain Define problems Figure 1. The Knowledge discovery process 1. Understanding the domain and problem definition: This is the phase where the problems will be identified and defined to be applied to the KDD process. 2. Collection and preprocessing of the data: This phase collects the informations from the database or dataset and converts it into a standard format to be able to perform the processing. 3. Data Mining: Extraction of the patterns/models: This phase does the main function of KDD ie, it extracts the specific patterns of interest by the application of suitable algorithms (like apriori here). 4. Interpretation & evaluation of the discovered knowledge: This phase uses a domain expert to analyse and interpret the generated patterns. 5. Putting the results in Practical applications: This phase puts forth the knowledge from the previous phase into practical use in applications. Our system aims at mining attractive, unnoticed, and useful patterns from a database which has been gathered from the placement section of a particular institution. Once done with the identification of patterns, it can be applied to future set of data in order to provide appropriate guidance and counselling to the students. Similar idea can be applied to any other databases to arrive at conclusions that would be beneficial to the institution or organisation. The main intention is to make the system think. The users of the system are the executives who are having the authority to take decisions on behalf of the students. Even the students can take necessary guidance from the system with the help of their in-charge. For e.g.: The HOD of the departments can make use of the mined patterns to send students to companies suiting their status. Among the different phases considered for the Knowledge discovery in databases, data mining is the main phase which considers the preprocessed data as the data source. The data source is mined using the apriori algorithm to extract the specific pattern of interest so that it would be analysed by a domain expert to provide suggestions towards the decision making. A typical data mining system contains the following components [9]. ISSN : Vol. 4 No. 06 June

4 1. Database, Data warehouse: There can be one or a set of databases, data warehouses. Data cleaning, data integration and filtration techniques may be performed on the data. 2. Database or data warehouse server: It is responsible for fetching relevant data based on the user s data mining request. 3. Knowledge base: This is the domain knowledge that is used to guide the search or evaluate the interestingness of resulting patterns. Knowledge can include concept hierarchies, used to organize attributes or attribute values into different levels of abstraction. Knowledge such as user beliefs, which can be used to assess a pattern s interestingness based on its unexpectedness, may also be included. 4. Data mining engine: This is an essential part of the data mining system which consists of functional modules for tasks. Here, it is association analysis. 5. Pattern evaluation module: This component typically employs interestingness measures and interacts with the data mining modules so as to focus the search toward interesting patterns. It may use interestingness thresholds to filter out discovered patterns. 6. User interface: This module communicates between users and the data mining system, allowing the user to interact with the system by specifying a data mining query or task. In addition, this component allows the user to browse database and also query it. 4. Methodology The different tasks of the data mining has been divided into pre-processing, processing and post-processing. 4.1 Pre-processing The data will be pre-processed before applying the apriori algorithm to mine patterns. The pre-processing stage includes: Filtration/Cleaning, Homogenization, Generalization and Integration Filteration The database collected has 21 tables. The drop table query has been applied to filter the database contents so that it only contains the tables 5 tables named comp_info, course, enrolled_in, is_placed, stu_info. Each table is analysed individually to remove unwanted attributes by the application of specific queries to the contents of the table so that it suits our requirement in the proposed application Homogenization The data in the table has to be brought to a standard form to make the mining process easy. This process is called as homogenization or the modularization of data. The 5 tables taken into consideration have been brought into a standard form by eliminating the spelling mistakes, different notations for the same place etc. by applying specific queries so that the mining process becomes easier Generalization The queries have been executed appropriately at both UG and PG level to generalize it for the specific range of salaries Integration All the tables that have been filtered, standardized and generalized are merged by the application of specific queries so that it results in a single table to be fed as the data source for the algorithm. This merged table is converted into a market basket type of data. ISSN : Vol. 4 No. 06 June

5 4.2 Processing The processing stage involves the application of the apriori algorithm for the given support and confidence in order to extract the frequent item set thereby generating the rules of our interest to be analysed for future applications. Support determines how often a rule is applicable to a given data set. A low support rule is likely to be uninteresting from a business perspective. So, support is often used to eliminate uninteresting rules. Support(x) = Number of transactions containing x (1) Total number of transactions Confidence determines how frequently items in Y appear in transactions that contain X, when the association is of the form X Y. It measures the reliability of the inference made by a rule. For a given rule X Y, the higher the confidence, the more likely it is for Y to be present in transactions that contain X. Confidence also provides an estimate of the conditional probability of Y, given X. Confidence(x y) = Number of transactions containing x and y (2) Number of transactions containing x Algorithm to calculate FrequentItemsets: set count = candidates size read transaction file to a FileInputStream and put it in a BufferedReader for(i=0 to number of transactions) read i th item from transaction file, tokenize it with respect to itemseparator for(j=0 to number of items) set Boolean array = true after comparison with oneval[j] where transaction is present and false otherwise for(c=0 to Candidates size) set match = false tokenize candidates (c), store in tokenizer st while(st has more tokens) store the value in trans[next token of st -1] if(match) increment count[c] for(i=0 to Candidate size) if((count[i]/number of transactions ) >= minimum support) add candidates at i to frequentcandidates reinitialize Candidates with frequentcandidates clear frequentcandidates Algorithm to generate rules for all the frequentitemsets formed if( frequentitemset!=1) find the combinations formed from that frequent itemsets for each combination generate antecedent, consequent of the rule confidence = support(antecedent and consequent)/support(confidence) display rules with (confidence > minimum confidence) ISSN : Vol. 4 No. 06 June

6 4.3 Post-Processing In this stage of post processing, the domain expert analyzes the rules according to the interestingness of the rule. The knowledge base of the user is applied here to filter out the rules which are not interesting. Association analysis algorithms have the potential to generate a large number of patterns. As the size and dimensionality of real, commercial databases can be very large, we could easily end up with thousands or even millions of patterns, many of which might not be interesting. Sifting through the patterns to identify the most interesting ones is not a trivial task because One person s trash might be another person s treasure. It is therefore important to establish a set of well-accepted criteria for evaluating the quality of association patterns. The first set of criteria can be established through statistical arguments. Patterns that involve a set of mutually independent items or cover very few transactions are considered uninteresting because they may capture spurious relationships in the data. Such patterns can be eliminated by applying an objective interestingness measure that uses statistics derived from data to determine whether a pattern is interesting. The second set of criteria can be established through subjective arguments. A pattern is considered subjectively uninteresting unless it reveals unexpected information about the data or provides useful knowledge that can lead to profitable actions. For example, the rule {BTECH} {Male} may not be interesting despite having high support and confidence values, because the relationship represented by the rule may seem rather obvious. On the other hand, the rule {cgpa_6} {comp_add_up} is interesting because the relationship is quite unexpected and may suggest a new trend in placement. Incorporating subjective knowledge into pattern evaluation is a difficult task because it requires a considerable amount of prior information from the domain experts. Pattern evaluation is done by incorporating subjective knowledge. The approach used is visualization where it requires a user-friendly environment to keep the human user in the loop. It also allows the domain expert to interact with the data mining system by interpreting and verifying the discovered patterns. 5. Implementation The implementation of the algorithm and the user interface for the human interaction has been carried out using Java programming language and Netbeans IDE 6.5 is used as an Integrated Development Kit. The NetBeans facilitates the execution of the application in any platform like Windows, Mac OS, Linux and Solaris. We have considered Windows platform and Microsoft SQL Server 2005 as the back end database. The XAMPP which is an open source cross-platform web server is also considered to support the capability of serving dynamic web pages. 6. Results and Analysis The pre-processing of the database is done manually by applying our domain knowledge with the help of SQL queries. The results obtained by the queries were tested during the process itself by referring back to the original database. So, separate test cases are not used to test the task of pre-processing. The Apriori Algorithm used in the system to discover student academic behaviour is a well tested algorithm in itself. But, testing of individual phases of the algorithm during the frequent item set generation and rule generation module have been successfully conducted so that it satisfies all the requirements for the analysis in pattern evaluation to be presented to the user. 6.1 Analysis of the rules Discovered Student Behaviour from the Placement Database Figure 2 and Figure 3 describe the generation of frequent item set and rule for the sample input shown with support count =15 and the confidence = 8. ISSN : Vol. 4 No. 06 June

7 Figure 2. Frequent Itemset Generation Figure 3. Rule Generation It is observed that the rules generated when support is low and confidence is high are more interesting. The support values below 10 and above 40 are giving spurious rules. The confidence values below 30 and above 90 are giving rules which are not very interesting. So the favourable range for support is between 10 and 20 and that of confidence is from 70 to 90. Some of the generalizations considered before analysing the results are: quota_1 : The students coming from general merit quota_2 : The students belonging to certain minorities quota_3 : SC/ST students quota_4: Physically challenged students sal_one: The salary range between 0 to 3 lakhs ISSN : Vol. 4 No. 06 June

8 sal_two: The salary range between 3 to 6 lakhs sal_three: The salary range between 6 to 11 lakhs cgpa_6: The cumulative grade point average is greater than or equal to 6 and less than 7 cgpa_7: The cumulative grade point average is greater than or equal to 7 and less than 8 comp_address_others: Faridabad, Goa, Gujarat, Jamshedpur, Kolkata, Rajasthan, Jaipur The rules that are obtained from the apriori algorithm are analysed as follows for various support and confidence values. The rules having single element in the antecedent and consequent are pruned. The test results obtained for some values of support and confidence are as in the table 1 below. Table 1. Sample test results for different support and confidence values Sl. No Support Confidence Number of rules generated Similarly the test results are obtained for different support and confidence values and the rules that are unrealistic and uninteresting are eliminated. A sample of the suggestions to the institution from the obtained patterns: 1. Send male students doing BTECH, and who want to get placed in places like Faridabad, Goa, Gujarat, Jamshedpur, Kolkata, Rajasthan and Jaipur to core companies for placement. 2. Send students doing BTECH, from general merit quota, staying in the hostel and want to get placed in Karnataka to core companies for placement. 3. The male students from Karnataka doing BTECH get placed in a company in Karnataka. 4. Send male students doing BTECH who are from the general merit quota, staying in the hostel and want to get placed in a company in Delhi, to core companies for placement. 5. Male Civil students staying in hostel should be sent to core companies. 6. Students expecting a job in UP with a salary in the range 0 to 3 lakhs, should try for normal companies. 7. Male students doing BTECH, from the general merit quota and staying in the hostel, from the CS branch and like to get in a company in Karnataka, should try for normal companies. 8. MTECH students who are male and hostelites or from quota_1 should be sent to companies paying within 3 lakhs. Similarly, many more conclusions can be drawn and actions taken for different combinations of support and confidence. 7. Conclusions and Future Scope Successful Knowledge discovery and Data mining applications play a vital role in extracting unknown knowledge from vast data sets. In this paper we have developed a model for the placement database so that institutions can use it to discover some interesting patterns that could be analyzed to plan their future activities. It has been found to be very useful to the higher authorities like principal, head of the department or the ISSN : Vol. 4 No. 06 June

9 placement officer for taking decisions to sort out the students based on their educational stream, location, interest for proper management and training leading to their successful career. The issues regarding the outcome of the research is that, it depends on the completeness and the accuracy of the data being analyzed and it requires a domain expert to evaluate the generated rules. The future scope could be dynamic updation of the database by the user before applying the algorithm to generate the patterns of interest rather than performing it on statistical historical data. Moreover the data mining can be made as constraint-based mining wherein the rules can be generated based on the constraints provided by the user. References [1] Faouzi Mhamdi, Mourad Elloumi, A New Survey On knowledge Discovery And Data Mining December [2] Usama Fayyad, Data Mining and Knowledge Discovery in Databases: Implications for Scientific Databases Proceedings of the Ninth International Conference on Scientific and Statistical Database Management IEEE Computer Society Washington, DC, USA, [3] Mykhaylo Lobur, Yuri Stekh, Vitalij Artsibasov, Challenges in knowledge discovery and data mining in data MEMSTECH 2011, May 2011, Polyana-Svalyava (Zakarpattya), UKRAINE. [4] Timo Horeis, Bernhard Sick, Collaborative Knowledge Discovery & Data Mining: From Knowledge to Experience Proceedings of the 2007 IEEE Symposium on Computational Intelligence and Data Mining (CIDM 2007). [5] M. Lobur, Yu. Stekh, A. Kernytskyy, Faisal M.E. Sardieh, Some Trends in Knowledge Discovery and Data Mining MEMSTECH 2008, May 21-24, 2008, Polyana, UKRAINE. [6] Han J, Kamber M, Data Mining: Concepts and Techniques, Simon Fraser University, Morgan Koufmann Publishers, Second Edition, [7] Qian Wan, Aijun An, Transitional Patterns and their significant milestones 7 th IEEE International Conference on Data Mining, pp , [8] Li zhu, Yanli Li, Xiang Li, Research on Early Warning Model of students academic records based on association rules, Proceedings of World Congress on Computer Science and Information Engineering, pp , [9] Pang-Ning Tan, Michael Steinbach, Vipin Kumar, Introduction to Data Mining, Pearson Education Inc., Fourth Edition, ISSN : Vol. 4 No. 06 June

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518 International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 Over viewing issues of data mining with highlights of data warehousing Rushabh H. Baldaniya, Prof H.J.Baldaniya,

More information

SPATIAL DATA CLASSIFICATION AND DATA MINING

SPATIAL DATA CLASSIFICATION AND DATA MINING , pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal

More information

Data Mining Solutions for the Business Environment

Data Mining Solutions for the Business Environment Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania ruxandra_stefania.petre@yahoo.com Over

More information

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL International Journal Of Advanced Technology In Engineering And Science Www.Ijates.Com Volume No 03, Special Issue No. 01, February 2015 ISSN (Online): 2348 7550 ASSOCIATION RULE MINING ON WEB LOGS FOR

More information

Introduction. A. Bellaachia Page: 1

Introduction. A. Bellaachia Page: 1 Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.

More information

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10 1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom

More information

Healthcare Measurement Analysis Using Data mining Techniques

Healthcare Measurement Analysis Using Data mining Techniques www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik

More information

Selection of Optimal Discount of Retail Assortments with Data Mining Approach

Selection of Optimal Discount of Retail Assortments with Data Mining Approach Available online at www.interscience.in Selection of Optimal Discount of Retail Assortments with Data Mining Approach Padmalatha Eddla, Ravinder Reddy, Mamatha Computer Science Department,CBIT, Gandipet,Hyderabad,A.P,India.

More information

Static Data Mining Algorithm with Progressive Approach for Mining Knowledge

Static Data Mining Algorithm with Progressive Approach for Mining Knowledge Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 85-93 Research India Publications http://www.ripublication.com Static Data Mining Algorithm with Progressive

More information

Prediction of Heart Disease Using Naïve Bayes Algorithm

Prediction of Heart Disease Using Naïve Bayes Algorithm Prediction of Heart Disease Using Naïve Bayes Algorithm R.Karthiyayini 1, S.Chithaara 2 Assistant Professor, Department of computer Applications, Anna University, BIT campus, Tiruchirapalli, Tamilnadu,

More information

An Overview of Knowledge Discovery Database and Data mining Techniques

An Overview of Knowledge Discovery Database and Data mining Techniques An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,

More information

Application of Data Mining Techniques in Intrusion Detection

Application of Data Mining Techniques in Intrusion Detection Application of Data Mining Techniques in Intrusion Detection LI Min An Yang Institute of Technology leiminxuan@sohu.com Abstract: The article introduced the importance of intrusion detection, as well as

More information

DATA MINING AND WAREHOUSING CONCEPTS

DATA MINING AND WAREHOUSING CONCEPTS CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation

More information

A Data Warehouse Design for A Typical University Information System

A Data Warehouse Design for A Typical University Information System (JCSCR) - ISSN 2227-328X A Data Warehouse Design for A Typical University Information System Youssef Bassil LACSC Lebanese Association for Computational Sciences Registered under No. 957, 2011, Beirut,

More information

Database Marketing, Business Intelligence and Knowledge Discovery

Database Marketing, Business Intelligence and Knowledge Discovery Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014 RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer

More information

Dynamic Data in terms of Data Mining Streams

Dynamic Data in terms of Data Mining Streams International Journal of Computer Science and Software Engineering Volume 2, Number 1 (2015), pp. 1-6 International Research Publication House http://www.irphouse.com Dynamic Data in terms of Data Mining

More information

New Matrix Approach to Improve Apriori Algorithm

New Matrix Approach to Improve Apriori Algorithm New Matrix Approach to Improve Apriori Algorithm A. Rehab H. Alwa, B. Anasuya V Patil Associate Prof., IT Faculty, Majan College-University College Muscat, Oman, rehab.alwan@majancolleg.edu.om Associate

More information

DEVELOPMENT OF HASH TABLE BASED WEB-READY DATA MINING ENGINE

DEVELOPMENT OF HASH TABLE BASED WEB-READY DATA MINING ENGINE DEVELOPMENT OF HASH TABLE BASED WEB-READY DATA MINING ENGINE SK MD OBAIDULLAH Department of Computer Science & Engineering, Aliah University, Saltlake, Sector-V, Kol-900091, West Bengal, India sk.obaidullah@gmail.com

More information

Overview Applications of Data Mining In Health Care: The Case Study of Arusha Region

Overview Applications of Data Mining In Health Care: The Case Study of Arusha Region International Journal of Computational Engineering Research Vol, 03 Issue, 8 Overview Applications of Data Mining In Health Care: The Case Study of Arusha Region 1, Salim Diwani, 2, Suzan Mishol, 3, Daniel

More information

Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier

Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier Data Mining: Concepts and Techniques Jiawei Han Micheline Kamber Simon Fräser University К MORGAN KAUFMANN PUBLISHERS AN IMPRINT OF Elsevier Contents Foreword Preface xix vii Chapter I Introduction I I.

More information

How To Solve The Kd Cup 2010 Challenge

How To Solve The Kd Cup 2010 Challenge A Lightweight Solution to the Educational Data Mining Challenge Kun Liu Yan Xing Faculty of Automation Guangdong University of Technology Guangzhou, 510090, China catch0327@yahoo.com yanxing@gdut.edu.cn

More information

Data Mining Analytics for Business Intelligence and Decision Support

Data Mining Analytics for Business Intelligence and Decision Support Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing

More information

Finding Frequent Patterns Based On Quantitative Binary Attributes Using FP-Growth Algorithm

Finding Frequent Patterns Based On Quantitative Binary Attributes Using FP-Growth Algorithm R. Sridevi et al Int. Journal of Engineering Research and Applications RESEARCH ARTICLE OPEN ACCESS Finding Frequent Patterns Based On Quantitative Binary Attributes Using FP-Growth Algorithm R. Sridevi,*

More information

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.

More information

Understanding Web personalization with Web Usage Mining and its Application: Recommender System

Understanding Web personalization with Web Usage Mining and its Application: Recommender System Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,

More information

A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH

A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH 205 A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH ABSTRACT MR. HEMANT KUMAR*; DR. SARMISTHA SARMA** *Assistant Professor, Department of Information Technology (IT), Institute of Innovation in Technology

More information

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,

More information

Web Mining Patterns Discovery and Analysis Using Custom-Built Apriori Algorithm

Web Mining Patterns Discovery and Analysis Using Custom-Built Apriori Algorithm International Journal of Engineering Inventions e-issn: 2278-7461, p-issn: 2319-6491 Volume 2, Issue 5 (March 2013) PP: 16-21 Web Mining Patterns Discovery and Analysis Using Custom-Built Apriori Algorithm

More information

Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control

Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control Andre BERGMANN Salzgitter Mannesmann Forschung GmbH; Duisburg, Germany Phone: +49 203 9993154, Fax: +49 203 9993234;

More information

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM. DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,

More information

Data Mining Applications in Higher Education

Data Mining Applications in Higher Education Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2

More information

Visual Data Mining in Indian Election System

Visual Data Mining in Indian Election System Visual Data Mining in Indian Election System Prof. T. M. Kodinariya Asst. Professor, Department of Computer Engineering, Atmiya Institute of Technology & Science, Rajkot Gujarat, India trupti.kodinariya@gmail.com

More information

Neural Networks in Data Mining

Neural Networks in Data Mining IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V6 PP 01-06 www.iosrjen.org Neural Networks in Data Mining Ripundeep Singh Gill, Ashima Department

More information

RULE-BASE DATA MINING SYSTEMS FOR

RULE-BASE DATA MINING SYSTEMS FOR RULE-BASE DATA MINING SYSTEMS FOR CUSTOMER QUERIES A.Kaleeswaran 1, V.Ramasamy 2 Assistant Professor 1&2 Park College of Engineering and Technology, Coimbatore, Tamil Nadu, India. 1&2 Abstract: The main

More information

A Framework for Data Warehouse Using Data Mining and Knowledge Discovery for a Network of Hospitals in Pakistan

A Framework for Data Warehouse Using Data Mining and Knowledge Discovery for a Network of Hospitals in Pakistan , pp.217-222 http://dx.doi.org/10.14257/ijbsbt.2015.7.3.23 A Framework for Data Warehouse Using Data Mining and Knowledge Discovery for a Network of Hospitals in Pakistan Muhammad Arif 1,2, Asad Khatak

More information

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland Data Mining and Knowledge Discovery in Databases (KDD) State of the Art Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland 1 Conference overview 1. Overview of KDD and data mining 2. Data

More information

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining Lluis Belanche + Alfredo Vellido Intelligent Data Analysis and Data Mining a.k.a. Data Mining II Office 319, Omega, BCN EET, office 107, TR 2, Terrassa avellido@lsi.upc.edu skype, gtalk: avellido Tels.:

More information

A Survey on Web Mining From Web Server Log

A Survey on Web Mining From Web Server Log A Survey on Web Mining From Web Server Log Ripal Patel 1, Mr. Krunal Panchal 2, Mr. Dushyantsinh Rathod 3 1 M.E., 2,3 Assistant Professor, 1,2,3 computer Engineering Department, 1,2 L J Institute of Engineering

More information

Introduction to Data Mining

Introduction to Data Mining Bioinformatics Ying Liu, Ph.D. Laboratory for Bioinformatics University of Texas at Dallas Spring 2008 Introduction to Data Mining 1 Motivation: Why data mining? What is data mining? Data Mining: On what

More information

COURSE RECOMMENDER SYSTEM IN E-LEARNING

COURSE RECOMMENDER SYSTEM IN E-LEARNING International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand

More information

A Survey on Intrusion Detection System with Data Mining Techniques

A Survey on Intrusion Detection System with Data Mining Techniques A Survey on Intrusion Detection System with Data Mining Techniques Ms. Ruth D 1, Mrs. Lovelin Ponn Felciah M 2 1 M.Phil Scholar, Department of Computer Science, Bishop Heber College (Autonomous), Trichirappalli,

More information

FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT

FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT ANURADHA.T Assoc.prof, atadiparty@yahoo.co.in SRI SAI KRISHNA.A saikrishna.gjc@gmail.com SATYATEJ.K satyatej.koganti@gmail.com NAGA ANIL KUMAR.G

More information

INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR. ankitanandurkar2394@gmail.com

INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR. ankitanandurkar2394@gmail.com IJFEAT INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR Bharti S. Takey 1, Ankita N. Nandurkar 2,Ashwini A. Khobragade 3,Pooja G. Jaiswal 4,Swapnil R.

More information

Chapter 3: Data Mining Driven Learning Apprentice System for Medical Billing Compliance

Chapter 3: Data Mining Driven Learning Apprentice System for Medical Billing Compliance Chapter 3: Data Mining Driven Learning Apprentice System for Medical Billing Compliance 3.1 Introduction This research has been conducted at back office of a medical billing company situated in a custom

More information

Data Mining Introduction

Data Mining Introduction Data Mining Introduction Organization Lectures Mondays and Thursdays from 10:30 to 12:30 Lecturer: Mouna Kacimi Office hours: appointment by email Labs Thursdays from 14:00 to 16:00 Teaching Assistant:

More information

An Effective Analysis of Weblog Files to improve Website Performance

An Effective Analysis of Weblog Files to improve Website Performance An Effective Analysis of Weblog Files to improve Website Performance 1 T.Revathi, 2 M.Praveen Kumar, 3 R.Ravindra Babu, 4 Md.Khaleelur Rahaman, 5 B.Aditya Reddy Department of Information Technology, KL

More information

II. OLAP(ONLINE ANALYTICAL PROCESSING)

II. OLAP(ONLINE ANALYTICAL PROCESSING) Association Rule Mining Method On OLAP Cube Jigna J. Jadav*, Mahesh Panchal** *( PG-CSE Student, Department of Computer Engineering, Kalol Institute of Technology & Research Centre, Gujarat, India) **

More information

A Time Efficient Algorithm for Web Log Analysis

A Time Efficient Algorithm for Web Log Analysis A Time Efficient Algorithm for Web Log Analysis Santosh Shakya Anju Singh Divakar Singh Student [M.Tech.6 th sem (CSE)] Asst.Proff, Dept. of CSE BU HOD (CSE), BUIT, BUIT,BU Bhopal Barkatullah University,

More information

Data Warehousing and Data Mining

Data Warehousing and Data Mining Data Warehousing and Data Mining Winter Semester 2010/2011 Free University of Bozen, Bolzano DW Lecturer: Johann Gamper gamper@inf.unibz.it DM Lecturer: Mouna Kacimi mouna.kacimi@unibz.it http://www.inf.unibz.it/dis/teaching/dwdm/index.html

More information

Use of Data Mining in the field of Library and Information Science : An Overview

Use of Data Mining in the field of Library and Information Science : An Overview 512 Use of Data Mining in the field of Library and Information Science : An Overview Roopesh K Dwivedi R P Bajpai Abstract Data Mining refers to the extraction or Mining knowledge from large amount of

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)

More information

Edifice an Educational Framework using Educational Data Mining and Visual Analytics

Edifice an Educational Framework using Educational Data Mining and Visual Analytics I.J. Education and Management Engineering, 2016, 2, 24-30 Published Online March 2016 in MECS (http://www.mecs-press.net) DOI: 10.5815/ijeme.2016.02.03 Available online at http://www.mecs-press.net/ijeme

More information

Data Mining System, Functionalities and Applications: A Radical Review

Data Mining System, Functionalities and Applications: A Radical Review Data Mining System, Functionalities and Applications: A Radical Review Dr. Poonam Chaudhary System Programmer, Kurukshetra University, Kurukshetra Abstract: Data Mining is the process of locating potentially

More information

Mining an Online Auctions Data Warehouse

Mining an Online Auctions Data Warehouse Proceedings of MASPLAS'02 The Mid-Atlantic Student Workshop on Programming Languages and Systems Pace University, April 19, 2002 Mining an Online Auctions Data Warehouse David Ulmer Under the guidance

More information

Association Technique on Prediction of Chronic Diseases Using Apriori Algorithm

Association Technique on Prediction of Chronic Diseases Using Apriori Algorithm Association Technique on Prediction of Chronic Diseases Using Apriori Algorithm R.Karthiyayini 1, J.Jayaprakash 2 Assistant Professor, Department of Computer Applications, Anna University (BIT Campus),

More information

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanbul, Turkey, May 27-29, 2007 115 Data Mining for Knowledge Management in Technology Enhanced Learning

More information

Grid Density Clustering Algorithm

Grid Density Clustering Algorithm Grid Density Clustering Algorithm Amandeep Kaur Mann 1, Navneet Kaur 2, Scholar, M.Tech (CSE), RIMT, Mandi Gobindgarh, Punjab, India 1 Assistant Professor (CSE), RIMT, Mandi Gobindgarh, Punjab, India 2

More information

International Journal of World Research, Vol: I Issue XIII, December 2008, Print ISSN: 2347-937X DATA MINING TECHNIQUES AND STOCK MARKET

International Journal of World Research, Vol: I Issue XIII, December 2008, Print ISSN: 2347-937X DATA MINING TECHNIQUES AND STOCK MARKET DATA MINING TECHNIQUES AND STOCK MARKET Mr. Rahul Thakkar, Lecturer and HOD, Naran Lala College of Professional & Applied Sciences, Navsari ABSTRACT Without trading in a stock market we can t understand

More information

Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report

Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report G. Banos 1, P.A. Mitkas 2, Z. Abas 3, A.L. Symeonidis 2, G. Milis 2 and U. Emanuelson 4 1 Faculty

More information

A Review of Data Mining Techniques

A Review of Data Mining Techniques Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,

More information

Business Lead Generation for Online Real Estate Services: A Case Study

Business Lead Generation for Online Real Estate Services: A Case Study Business Lead Generation for Online Real Estate Services: A Case Study Md. Abdur Rahman, Xinghui Zhao, Maria Gabriella Mosquera, Qigang Gao and Vlado Keselj Faculty Of Computer Science Dalhousie University

More information

Data Warehousing and OLAP Technology for Knowledge Discovery

Data Warehousing and OLAP Technology for Knowledge Discovery 542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories

More information

ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION

ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION K.Vinodkumar 1, Kathiresan.V 2, Divya.K 3 1 MPhil scholar, RVS College of Arts and Science, Coimbatore, India. 2 HOD, Dr.SNS

More information

An Empirical Study of Application of Data Mining Techniques in Library System

An Empirical Study of Application of Data Mining Techniques in Library System An Empirical Study of Application of Data Mining Techniques in Library System Veepu Uppal Department of Computer Science and Engineering, Manav Rachna College of Engineering, Faridabad, India Gunjan Chindwani

More information

Customer Classification And Prediction Based On Data Mining Technique

Customer Classification And Prediction Based On Data Mining Technique Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor

More information

Binary Coded Web Access Pattern Tree in Education Domain

Binary Coded Web Access Pattern Tree in Education Domain Binary Coded Web Access Pattern Tree in Education Domain C. Gomathi P.G. Department of Computer Science Kongu Arts and Science College Erode-638-107, Tamil Nadu, India E-mail: kc.gomathi@gmail.com M. Moorthi

More information

A Survey on Association Rule Mining in Market Basket Analysis

A Survey on Association Rule Mining in Market Basket Analysis International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 4, Number 4 (2014), pp. 409-414 International Research Publications House http://www. irphouse.com /ijict.htm A Survey

More information

A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING

A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING Ahmet Selman BOZKIR Hacettepe University Computer Engineering Department, Ankara, Turkey selman@cs.hacettepe.edu.tr Ebru Akcapinar

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:

More information

Improving Apriori Algorithm to get better performance with Cloud Computing

Improving Apriori Algorithm to get better performance with Cloud Computing Improving Apriori Algorithm to get better performance with Cloud Computing Zeba Qureshi 1 ; Sanjay Bansal 2 Affiliation: A.I.T.R, RGPV, India 1, A.I.T.R, RGPV, India 2 ABSTRACT Cloud computing has become

More information

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College

More information

How To Use Neural Networks In Data Mining

How To Use Neural Networks In Data Mining International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and

More information

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

Enhanced Boosted Trees Technique for Customer Churn Prediction Model IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V5 PP 41-45 www.iosrjen.org Enhanced Boosted Trees Technique for Customer Churn Prediction

More information

Assessing Learners Behavior by Monitoring Online Tests through Data Visualization

Assessing Learners Behavior by Monitoring Online Tests through Data Visualization International Journal of Electronics and Computer Science Engineering 2068 Available Online at www.ijecse.org ISSN : 2277-1956 Assessing Learners Behavior by Monitoring Online Tests through Data Visualization

More information

Indirect Positive and Negative Association Rules in Web Usage Mining

Indirect Positive and Negative Association Rules in Web Usage Mining Indirect Positive and Negative Association Rules in Web Usage Mining Dhaval Patel Department of Computer Engineering, Dharamsinh Desai University Nadiad, Gujarat, India Malay Bhatt Department of Computer

More information

Optimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2

Optimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2 Optimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2 Department of Computer Engineering, YMCA University of Science & Technology, Faridabad,

More information

SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL

SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL Krishna Kiran Kattamuri 1 and Rupa Chiramdasu 2 Department of Computer Science Engineering, VVIT, Guntur, India

More information

Association rules for improving website effectiveness: case analysis

Association rules for improving website effectiveness: case analysis Association rules for improving website effectiveness: case analysis Maja Dimitrijević, The Higher Technical School of Professional Studies, Novi Sad, Serbia, dimitrijevic@vtsns.edu.rs Tanja Krunić, The

More information

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

More information

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics

More information

Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1

Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1 Data Mining 1 Introduction 2 Data Mining methods Alfred Holl Data Mining 1 1 Introduction 1.1 Motivation 1.2 Goals and problems 1.3 Definitions 1.4 Roots 1.5 Data Mining process 1.6 Epistemological constraints

More information

CONCEPTUAL FRAMEWORK OF BUSINESS INTELLIGENCE ANALYSIS IN ACADEMIC ENVIRONMENT USING BIRT

CONCEPTUAL FRAMEWORK OF BUSINESS INTELLIGENCE ANALYSIS IN ACADEMIC ENVIRONMENT USING BIRT CONCEPTUAL FRAMEWORK OF BUSINESS INTELLIGENCE ANALYSIS IN ACADEMIC ENVIRONMENT USING BIRT Julaily Aida Jusoh, Norhakimah Endot, Nazirah Abd. Hamid, Raja Hasyifah Raja Bongsu, Roslinda Muda Faculty of Informatics,

More information

DATA MINING CONCEPTS AND TECHNIQUES. Marek Maurizio E-commerce, winter 2011

DATA MINING CONCEPTS AND TECHNIQUES. Marek Maurizio E-commerce, winter 2011 DATA MINING CONCEPTS AND TECHNIQUES Marek Maurizio E-commerce, winter 2011 INTRODUCTION Overview of data mining Emphasis is placed on basic data mining concepts Techniques for uncovering interesting data

More information

PREDICTING STUDENTS PERFORMANCE USING ID3 AND C4.5 CLASSIFICATION ALGORITHMS

PREDICTING STUDENTS PERFORMANCE USING ID3 AND C4.5 CLASSIFICATION ALGORITHMS PREDICTING STUDENTS PERFORMANCE USING ID3 AND C4.5 CLASSIFICATION ALGORITHMS Kalpesh Adhatrao, Aditya Gaykar, Amiraj Dhawan, Rohit Jha and Vipul Honrao ABSTRACT Department of Computer Engineering, Fr.

More information

Data Warehousing and Data Mining

Data Warehousing and Data Mining Data Warehousing and Data Mining Winter Semester 2012/2013 Free University of Bozen, Bolzano DM Lecturer: Mouna Kacimi mouna.kacimi@unibz.it http://www.inf.unibz.it/dis/teaching/dwdm/index.html Organization

More information

Mining changes in customer behavior in retail marketing

Mining changes in customer behavior in retail marketing Expert Systems with Applications 28 (2005) 773 781 www.elsevier.com/locate/eswa Mining changes in customer behavior in retail marketing Mu-Chen Chen a, *, Ai-Lun Chiu b, Hsu-Hwa Chang c a Department of

More information

Evolutionary Hot Spots Data Mining. An Architecture for Exploring for Interesting Discoveries

Evolutionary Hot Spots Data Mining. An Architecture for Exploring for Interesting Discoveries Evolutionary Hot Spots Data Mining An Architecture for Exploring for Interesting Discoveries Graham J Williams CRC for Advanced Computational Systems, CSIRO Mathematical and Information Sciences, GPO Box

More information

Big Data with Rough Set Using Map- Reduce

Big Data with Rough Set Using Map- Reduce Big Data with Rough Set Using Map- Reduce Mr.G.Lenin 1, Mr. A. Raj Ganesh 2, Mr. S. Vanarasan 3 Assistant Professor, Department of CSE, Podhigai College of Engineering & Technology, Tirupattur, Tamilnadu,

More information

Whitepaper. Data Warehouse/BI Testing Offering YOUR SUCCESS IS OUR FOCUS. Published on: January 2009 Author: BIBA PRACTICE

Whitepaper. Data Warehouse/BI Testing Offering YOUR SUCCESS IS OUR FOCUS. Published on: January 2009 Author: BIBA PRACTICE YOUR SUCCESS IS OUR FOCUS Whitepaper Published on: January 2009 Author: BIBA PRACTICE 2009 Hexaware Technologies. All rights reserved. Table of Contents 1. 2. Data Warehouse - Typical pain points 3. Hexaware

More information

Data Mining Approach in Security Information and Event Management

Data Mining Approach in Security Information and Event Management Data Mining Approach in Security Information and Event Management Anita Rajendra Zope, Amarsinh Vidhate, and Naresh Harale Abstract This paper gives an overview of data mining field & security information

More information

A New Approach for Evaluation of Data Mining Techniques

A New Approach for Evaluation of Data Mining Techniques 181 A New Approach for Evaluation of Data Mining s Moawia Elfaki Yahia 1, Murtada El-mukashfi El-taher 2 1 College of Computer Science and IT King Faisal University Saudi Arabia, Alhasa 31982 2 Faculty

More information

not possible or was possible at a high cost for collecting the data.

not possible or was possible at a high cost for collecting the data. Data Mining and Knowledge Discovery Generating knowledge from data Knowledge Discovery Data Mining White Paper Organizations collect a vast amount of data in the process of carrying out their day-to-day

More information

Knowledge-Driven Decision Support System Based on Knowledge Warehouse and Data Mining for Market Management

Knowledge-Driven Decision Support System Based on Knowledge Warehouse and Data Mining for Market Management Knowledge-Driven Decision Support System Based on Knowledge Warehouse and Data Mining for Market Management Dr. Murtadha M. Hamad 1 and Banaz Anwer Qader 2 1,2 College of Computer - Anbar University Anbar

More information

A Framework for Dynamic Faculty Support System to Analyze Student Course Data

A Framework for Dynamic Faculty Support System to Analyze Student Course Data A Framework for Dynamic Faculty Support System to Analyze Student Course Data J. Shana 1, T. Venkatachalam 2 1 Department of MCA, Coimbatore Institute of Technology, Affiliated to Anna University of Chennai,

More information

A COGNITIVE APPROACH IN PATTERN ANALYSIS TOOLS AND TECHNIQUES USING WEB USAGE MINING

A COGNITIVE APPROACH IN PATTERN ANALYSIS TOOLS AND TECHNIQUES USING WEB USAGE MINING A COGNITIVE APPROACH IN PATTERN ANALYSIS TOOLS AND TECHNIQUES USING WEB USAGE MINING M.Gnanavel 1 & Dr.E.R.Naganathan 2 1. Research Scholar, SCSVMV University, Kanchipuram,Tamil Nadu,India. 2. Professor

More information

A Business Intelligence Training Document Using the Walton College Enterprise Systems Platform and Teradata University Network Tools Abstract

A Business Intelligence Training Document Using the Walton College Enterprise Systems Platform and Teradata University Network Tools Abstract A Business Intelligence Training Document Using the Walton College Enterprise Systems Platform and Teradata University Network Tools Jeffrey M. Stewart College of Business University of Cincinnati stewajw@mail.uc.edu

More information

Research of Smart Space based on Business Intelligence

Research of Smart Space based on Business Intelligence Research of Smart Space based on Business Intelligence 1 Jia-yi YAO, 2 Tian-tian MA 1 School of Economics and Management, Beijing Jiaotong University, jyyao@bjtu.edu.cn 2 School of Economics and Management,

More information

ALIAS: A Tool for Disambiguating Authors in Microsoft Academic Search

ALIAS: A Tool for Disambiguating Authors in Microsoft Academic Search Project for Michael Pitts Course TCSS 702A University of Washington Tacoma Institute of Technology ALIAS: A Tool for Disambiguating Authors in Microsoft Academic Search Under supervision of : Dr. Senjuti

More information

AN EFFICIENT APPROACH TO PERFORM PRE-PROCESSING

AN EFFICIENT APPROACH TO PERFORM PRE-PROCESSING AN EFFIIENT APPROAH TO PERFORM PRE-PROESSING S. Prince Mary Research Scholar, Sathyabama University, hennai- 119 princemary26@gmail.com E. Baburaj Department of omputer Science & Engineering, Sun Engineering

More information