Decision Support System for Inventory Management using Data Mining Techniques
|
|
|
- Evelyn Hudson
- 10 years ago
- Views:
Transcription
1 International Journal of Engineering and Advanced Technology (IJEAT) Decision Support System for Inventory Management using Data Mining Techniques Vivek Ware, Bharathi H. N Abstract Timely identification of newly emerging trends is needed in business process. Data mining techniques are best suited for the classification, useful patterns extraction and predications which are very important for business support and decision making. Patterns from inventory data indicate market trends and can be used in forecasting which has great potential for decision making, strategic planning. Our objectives is to get better decision making for improving sale, services and quality, which is useful mechanism for business support, investment and surveillance. An approach is implemented for mining patterns of huge stock data to predict factors affecting the sale of products. For this divide the stock data in three different clusters on the basis of sold quantities i.e. Dead-Stock (DS), Slow-Moving (SM) and Fast- Moving (FM) using K-means algorithm or Hierarchical agglomerative algorithm. After that Most Frequent Pattern (MFP) algorithm is implemented to find frequencies of property values of the corresponding items. MFP provides frequent patterns of item attributes and also gives sales trend in a compact form. Clustering and MFP algorithm can generate more useful pattern from large stock data which is helpful to get item information for inventory. Keywords: Most Frequent Patterns, Clustering, Decision Making. I. INTRODUCTION Sale data classification has different market trends. Some clusters or segments of sale may be growing, while others are declining. The information produced is very useful for business decision making. [1][2]It is easy to turn cash into inventory, but the challenge is to turn inventory into cash. Effective inventory management enables an organization to meet or exceed customer s expectations of product availability while maximizing net profits and minimizing costs [6]. Only through data mining techniques, it is possible to extract useful pattern and association from the stock data [7]. Data mining techniques like clustering and associations can be used to find meaningful patterns for future predictions. Clustering is used to generate groups of related patterns, while association provides a way to get generalized rules of dependent variables. Patterns from a huge stock data on the basis of these rules can be obtained. The behavior in terms of sales transaction is significant. The general term used for such type of analysis is called Market Basket Analysis [8]. Typically there is lot of different items, placed in a market for selling, in which some of the product will be fast selling items, some will be slow selling items and some will be dead stock. Decision making in business sector is considered as one of the critical tasks. There is study for data mining for inventory item selection with cross selling considerations which is used for maximal-profit selling items [1]. Manuscript Received on August Vivek Ware, Department of Computer Engineering, K J Somaiya College of Engineering Vidhyavihar, Mumbai, India. Bharathi H. N, Department of Computer Engineering, K J Somaiya College of Engineering Vidhyavihar, Mumbai, India. But our problem is finding out the selling power of the products in the market. This is a useful approach to distinguish the selling frequency of items on the basis of the known attributes, e.g. we can examine that a Sinthetic Surat sadi of red color of type nylon in marriage season has high ratio of sale, here we have basic property related to this example, i.e. color, type, season, and Design. So it can be predict that what products of certain properties have what type of sale trends in different locations. Thus on the basis of this scenario it can predict the reason of dead-stock, slow moving and fast moving items. Data mining techniques are best suited for the analysis of such type of classification, useful patterns extraction and predictions. [1] II. BACKGROUND Researchers in the field of data mining always try to find innovative techniques so as to improve the performance of the extraction methods used in data mining as they usually use history of the different transactions done in finding the data as it will be useful for future use. This data collection can be used by them to predict the customer behavior and their interests L.K.Soon et al [27], compared the execution performance of numerical and symbolic representation of using data in term of similar search. M. C. Lo [8] considered a model for inventory decision support system [IDSS] in which ordering quantity, ordering cost, safety factor, lead time and backorder discounts are decision variables, the algorithm is applied to fine the optimal solution for the case where the lead time demands to follow a general distribution. J. ting et al [2] he proposed a technique based trading data mining approach for intra-stock mining which usually perform concentrates on finding most appearing items for the stock time series data and inter trading mining which used to discover the different strong relationship among the several stocks. L. K. Soon et al [2] generated a list of stocks which are influential to Kuala Lumpur Composite Index (KLCI), and then produce classification rules, which he denotes the inter-relationships among the stocks in terms of their trading performance with respect to KLCI. [2][1]. In the current years of development in the field of data mining, it is considered that the partitioned clustering technique is well suited for clustering a large document dataset due to their relatively low computational requirements and increase in the gradual performance of the system. The time factor complexity of the partitioning technique is almost linear, because of which it is widely used. The best known partitioning clustering algorithm is the K-means algorithm and its variants [25]. As this algorithm is simple, straightforward and is based on the firm foundation of analysis of variances. In addition to the K-means algorithm, several algorithms, other algorithms such as Particle Swarm Optimization (PSO) are another computational intelligence method that has already been applied to image clustering and 164
2 Decision Support System for Inventory Management using Data Mining Techniques other low dimensional datasets. [1]. III. ARCHITECTURE In this work an algorithm used for mining patterns of huge stock data to predict factors affecting the sale of products. In the first phase, it divides the stock data in three different clusters on the basis of sold quantities i.e. Dead Stock (DS), Small Growth (SG) and Fast-Growth (FG) using K-means algorithm or Hierarchical Agglomerative. In the second phase Most Frequent Pattern (MFP) algorithm is used to find frequencies of property values of the corresponding items. [1] Figure 1 Block Diagram of Architecture Step-1: to collect database (cash memo) form storekeeper and put it into proper format (excel sheet). Also collect inventory and put in separate excel sheet. Step-2: In this data preprocessing is done by filling missing value either by global constant or by average. Step-3: After that aggregation has to perform. Season-wise aggregation (collection) is performed. Step-4: clustering algorithms has to be performed. 1. K-mean Algorithm or 2.Hierachical Agglomerative. Any one of them has to be performed. Step-5: after any one clustering algorithm performed. 3 clusters are generated as small growth cluster, fast growth clusters and dead stock clusters. Step-6: perform Most Frequent Pattern algorithm on 1. Fast growth cluster and 2. Slow growth clusters. Show the result in matrix format. A. K-MEANS K-means [11] is a typical clustering algorithm and has used for classification of data for decades. Proximity is usually measured by some sort of distance; the most commonly being used is the Euclidean distance [1] l Dist (i, j) = sqrt ( (x it - x jk ) ^2)... (1) k = 1 The main idea is to define k centroids, one for each cluster. These centroids should be placed in a cunning way because of different location causes different result. This algorithm aims at minimizing an objective function, in this case a squared error function. The objective function is k k j J = x j - c j 2... (2) j = 1 i = 1 Where x j j - c j 2 is a chosen distance measure between a data point (j) i x and the cluster centre j. c, is an indicator of the distance of the n data points from their respective cluster centers. The steps of the K-mean algorithm [8] are as described below: Step 1: Place K points into the space represented by the objects that are being clustered; these points represent initial group centroids. Step 2: Assign each object to the group that has the closest centroid. Step 3: When all objects have been assigned, recalculate the positions of the K centroids. Step 4: Repeat Steps 2 and 3 until the centroids no longer move. This produces a separation of the objects into groups from which the metric to be minimized can be calculated. B. Hierarchical Agglomerative Hierarchical clustering [8] is an agglomerative (top down) clustering method. As its name suggests, the idea of this method is to build a hierarchy of clusters, showing relations between the individual members and merging clusters of data based on similarity. In the first step of clustering, the algorithm will look for the two most similar data points and merge them to create a new "pseudo-datapoint", which represents the average of the two merged datapoints. Each iterative step takes the next two closest datapoints (or pseduo-datapoints) and merges them. This process is generally continued until there is one large cluster containing all the original datapoints. Hierarchical clustering results in a "tree", showing the relationship of all of the original points. C. Most Frequent Pattern (MFP) Association rule mining is one of the most important and well defines technique for extract correlations, frequent patterns, associations or causal structures among sets of items in the transaction databases or other repositories. Association rules are widely used in various areas such as risk management, telecomm, market analysis, inventory control, and stock data. [1] Apriori algorithm [8] for strong association among the patterns is highly recommended. A new algorithm MFP that is more efficiently generates frequent patterns and strong association between them. For this purpose a property matrix containing counted values of corresponding properties of each product has been used. MFP Algorithm: Let we have set X of N items in a Dataset having set Y of 165
3 International Journal of Engineering and Advanced Technology (IJEAT) attributes. This algorithm counts maximum of each attribute values yij for each item in the dataset. Input: Datasets (DS) Output: Matrix Frequent Property Pattern (FPP): FPP (DS) Begin For each item Xi in DS A. for each attribute i. count occurrences for Xi C=Count (Xi) ii. Find attribute name of C Mi=Attribute (Ci) Next [End of inner loop] b. Find Most Frequent Pattern i. MFP=Combine (Mi) Next [End of outer loop] End IV. IMPLEMENTATION Data in its original format never confirm to the required shape for data mining. It needs to be transformed, integrated, and aggregated so that the mining process can effectively perform on it. There is a need to process the data before it used in the knowledge discovery (KDD) process. Being data quality a key issue with data mining as 50% to 80% of mining experts often spend their time on data quality, the pre-processing in data mining have a key importance.[1] Customer buying details (Cash Memo) are stored in the excel sheet as shown in the figure. Here filed taken as Product Id (PID), Product Name (PNAME), Product Color (Color), Type of design product has (Design), Product Prize Range (Prize Range), How many products are sold on that day (Volume), total bill number of a day (Bill numbers), Total cash of day (Cash Total), date, Month and season. Where 6 Season are considered as summer, winter, Mansoon, Marriage, Cell, and Festival. An Inventory is also recorded when stock is came to shop for future use. A. Data Cleaning and Aggregation Missing values can be replaced by either average value or global constant as per preprocessing techniques. Select appropriate attribute on which preprocessing going has to done. Here volume attribute is selected (figure 4) and missing values are replaced by global constant zero. Aggregation will here collect data by season wise attribute. Select the attribute on which aggregation has to do. Figure 3 Snapshot of Items in Fast Growing Cluster Figure 4 Snapshot of Items in Slow Growing Cluster Figure 5 Snapshot of Items in Dead Stock Cluster Figure 2 Snapshot of Proper Data in Excel Sheet Figure 6 Snapshot of MFP on Fast Growing Cluster 166
4 Decision Support System for Inventory Management using Data Mining Techniques Figure 7 Snapshot of MFP on Small Growing Cluster Now form this we can tell for example consider the summer season in that Sinthetic Surat sadi of type Naylon with color red and having cross design is frequently taken by customer so u can purchase this type in summer so that storekeeper can get maximum profit. V.EXPERIMENTAL RESULT Table 1 Number of Items Present in the Cluster Algorithm Total Fast Slow Dead Records Moving Moving Stock K-Mean Hierarchical Agglomerative Table 1 will show records in the clusters. Reason behind this is K-mean initially itself choose 3 central points to create cluster. Hierarchical Agglomerative choose cluster on basis of similarity. So we can t predict number of records in the any cluster in case of Hierarchical Agglomerative. Meaning of dead stock clusters is that not sold items, Fast moving clusters frequently large number going items and slow moving clusters are item which are sold more but not large in number. Now these clusters will help to shop owner get knowledge about items. Now storekeeper can make idea like using cell to sold dead stock or slow moving items. On cluster generation it gets item selling status. Now it will find pattern. Most frequent pattern will help now. It will tell which attribute of item has more frequency in selling. This gives more information about that item. Using present inventory in shop, owner can decide about inventory. Table 2 is showing pattern of fast moving items using K-mean algorithm. Now form this we can tell for example consider the summer season in that Sinthetic Surat sadi of type Naylon with color red and having cross design and prize range is frequently taken by customer so he can purchase this type in summer so that storekeeper can get maximum profit. Table 2 MFP on Fast Growing Item using K-mean Season Product Name Color Type Design Prize Range Winter Summer SintheticSurat Red Naylon Cross Mansoon SintheticSurat Red Naylon Cross VI.CONCLUSION The problem of pattern discovery from stock data mining is addressed in this project. Hybrid clustering association mining approach is implemented to classify stock data and find compact form of associated patterns of sale. After implementation on current database it is shown that clustering and most frequent Pattern algorithm is very efficient for mining patterns of huge stock data and predicting the factors affecting the sale of products. It formulate most frequent pattern of products using their known properties in inventory system. It identified the trends of selling products through their known attributes. This technique is simple by using matrix and counting of attribute values. Hierarchical (Hierarchical Agglomerative) and Partitional (K-Mean) Clustering have key differences in running time, assumptions, input parameters and resultant clusters. Typically, partitional clustering is faster than hierarchical clustering. Hierarchical clustering requires only a similarity measure, while partitional clustering requires stronger assumptions such as number of clusters and the initial centers. Hierarchical clustering does not require any input parameters, while partitional clustering algorithms require the number of clusters to start running. Hierarchical clustering returns a much more meaningful and subjective division of clusters but partitioned clustering results in exactly k clusters. As hierarchical didn t required number of cluster but we can control its limitations of generation of clusters. The hierarchical clustering method, though simple, often encounters difficulties regarding the selection of merge or split points. Such a decision is critical because once a group of objects is merged or split, the process at the next step will operate on the newly generated clusters. It will neither undo what was done previously nor perform object swapping between clusters. Thus merge or split decisions, if not well chosen at some step, may lead to low-quality clusters. Moreover, the method does not scale well, because each decision to merge or split requires the examination and evaluation of a good number of objects or clusters. Both of these choose their initial points randomly but in case of hierarchical it is seen that he decide the points itself because splitting and merging is dynamic at that points. So while implementing various result got when hierarchical agglomerative algorithms is used. Sometimes it gives 46 items in fast growth, sometimes 78 items in fast growth. Sometimes it gives same result as such given by K-mean. So according to items present in the cluster pattern are also changed. Here result shown in the table is taken which occur maximum time. A new algorithm MFP that is more efficiently generates frequent patterns and strong association between them. It does just calculate the frequency count which is easy to understand than apriori algorithm. The limitation of the project is that it works only on numerical data in later it can be implemented using Image data or 2-dimentional data. Most frequent pattern didn t define specific attributes so in future we can specify exact attributes. In future it will extend implement in sentiment analysis process and decision making from online customer reviews and blogs data. Artificial Intelligence can be added to that to get knowledge.effectiveness of the system can be further improved if trained on larger database. Festival SintheticSurat Red Naylon Handwork Cell SintheticSurat Red Naylon Neckless Marriage SintheticSurat Red Naylon Batta
5 REFERENCE [1] A Khan, B. Baharudin, K. A. Khan, Mining Customer Data for Decision Making using new Hybrid Classification algorithm in journal of theoretical and applied Information Technology Vol 27 no.1,15th May 2011 [2] Dattatray Gandhmal, Ranjeetsingh Parihar and Rajesh Argiddi, An Optimized Approach to Analyze Stock market using Data Mining Technique in International Conference on Emerging Technology Trends (ICETT) 2011 [3] Mrs. Tejaswini Hilage and R. V. Kulkarni, Review of Literature on Data Mining IJRRAS 10 (1), January [4] Chidanad Apte, Bing Liu,Edwin P.D, Pednault and Padhraic Smyth Business Application of Data Mining,Communication of the ACM August 2002/Vol. 45, No. 8 [5] [6] Abubakar, Felix, Customer satisfaction with supermarket retail shopping, [7] Sung-Ju Kim, Dong-Sik Yun and Byung-Soo chang, Association Analysis of Customer Services from the Enterprise Customer Management System,ICDM [8] Jiawan Han, Micheline Kamber Data Mining Concepts and Techniques 2nd edition 2004 [9] Neelamadhab Padhy, Dr. Pragnyaban Mishra, and Rasmita Panigrahi, The Survey of Data Mining Applications And Feature Scope, International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), Vol.2, No.3, June 2012 [10] L.K. Soon and Sang Ho Lee, Explorative Data Mining on Stock Data Experimental Results and Findings, pringer- ADMA 2007, LNAI 4632, pp , [11] Darken, C. Moody, J. Yale Comput. Sci., New Haven, Fast adaptive k-means clustering IEEE [12] Berry and Linoff, data mining techniques: for marketing, sales and customer support, John Eilry #Sons, inc, 1997 [13] Usama Fayyad, Gregory Piatetsky-Shapiro, and Padhraic Smyth, From Data Mining to Knowledge Discovery in Databases, AI Magazine Volume 17, Number 3, 1996 [14] Shelly Gupta, Dharminder Kumar and Anand Sharma, Performance Analysis of Various Data Mining Classification Techniques on Healthcare Data, International Journal of Computer Science & Information Technology (IJCSIT) Vol 3, No 4, August 2011 [15] Er. Mamta Juneja and Er.Nikita Phulll, Data Mining and its Scope [16] [17] E Balagurusamy, Programming in C#, Second Edition, Tata Mcgraw Hill [18] Kumar Sanjeev and Shibi Panikkar, Magic of ASP.Net with C#, Firewall Media [19] Emin Aleskerov, Bernd fieisleben and Bharat Rao, Neural network based database mining system for credit card fraud detection, Department of Electrical Engineering and Computer Science, University of Siegen [20] [21] Matt Hartely Using data mining to predict inventory levels, IEEE,2005 [22] [23] Shu-Hsien Liao, Hsu-hui Ho, Hui-wen Lin, Mining stock category, association and cluster on Taiwan stock market, Expert Systems with Applications Volume 35, Issue 1-2 July [24] P.Thomas, Macredie Knowledge Discovery and Data Mining [25] Artigan, J. A. Clustering Algorithms. Ohn Wiley and Sons, Inc., New York, NY [26] visited [27] M. Al-Noukari, and W. Al-Hussan, Using Data Mining Techniques for Predicting Future Car market Demand IEEE, 2008 International Journal of Engineering and Advanced Technology (IJEAT) 168
Use of Data Mining Techniques to Improve the Effectiveness of Sales and Marketing
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 4, April 2015,
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
Data Mining Project Report. Document Clustering. Meryem Uzun-Per
Data Mining Project Report Document Clustering Meryem Uzun-Per 504112506 Table of Content Table of Content... 2 1. Project Definition... 3 2. Literature Survey... 3 3. Methods... 4 3.1. K-means algorithm...
Data Mining Analytics for Business Intelligence and Decision Support
Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
Enhanced Boosted Trees Technique for Customer Churn Prediction Model
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V5 PP 41-45 www.iosrjen.org Enhanced Boosted Trees Technique for Customer Churn Prediction
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
Comparison and Analysis of Various Clustering Methods in Data mining On Education data set Using the weak tool
Comparison and Analysis of Various Clustering Metho in Data mining On Education data set Using the weak tool Abstract:- Data mining is used to find the hidden information pattern and relationship between
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
Data Mining System, Functionalities and Applications: A Radical Review
Data Mining System, Functionalities and Applications: A Radical Review Dr. Poonam Chaudhary System Programmer, Kurukshetra University, Kurukshetra Abstract: Data Mining is the process of locating potentially
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Analysis of Stock Market Trend using Integrated Clustering and Weighted Rule Mining Technique
Analysis of Stock Market Trend using Integrated Clustering and Weighted Rule Mining Technique S.Karthik #, K.K.Sureshkumar * # M.Phil - Computer Science (Part Time), Research Scholar, Kongu Arts and Science
ISSN: 2321-7782 (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies
ISSN: 2321-7782 (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS
DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS 1 AND ALGORITHMS Chiara Renso KDD-LAB ISTI- CNR, Pisa, Italy WHAT IS CLUSTER ANALYSIS? Finding groups of objects such that the objects in a group will be similar
Static Data Mining Algorithm with Progressive Approach for Mining Knowledge
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 85-93 Research India Publications http://www.ripublication.com Static Data Mining Algorithm with Progressive
International Journal of Advance Research in Computer Science and Management Studies
Volume 2, Issue 12, December 2014 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
International Journal of World Research, Vol: I Issue XIII, December 2008, Print ISSN: 2347-937X DATA MINING TECHNIQUES AND STOCK MARKET
DATA MINING TECHNIQUES AND STOCK MARKET Mr. Rahul Thakkar, Lecturer and HOD, Naran Lala College of Professional & Applied Sciences, Navsari ABSTRACT Without trading in a stock market we can t understand
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Grid Density Clustering Algorithm
Grid Density Clustering Algorithm Amandeep Kaur Mann 1, Navneet Kaur 2, Scholar, M.Tech (CSE), RIMT, Mandi Gobindgarh, Punjab, India 1 Assistant Professor (CSE), RIMT, Mandi Gobindgarh, Punjab, India 2
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data K. Mumtaz 1 Assistant Professor, Department of MCA Vivekanandha Institute of Information and Management Studies, Tiruchengode,
Clustering. Adrian Groza. Department of Computer Science Technical University of Cluj-Napoca
Clustering Adrian Groza Department of Computer Science Technical University of Cluj-Napoca Outline 1 Cluster Analysis What is Datamining? Cluster Analysis 2 K-means 3 Hierarchical Clustering What is Datamining?
Clustering UE 141 Spring 2013
Clustering UE 141 Spring 013 Jing Gao SUNY Buffalo 1 Definition of Clustering Finding groups of obects such that the obects in a group will be similar (or related) to one another and different from (or
- Clustering Taiwan s Real Estate Data for Market Structure Analysis
Unlock the Value of Open Data - Clustering Taiwan s Real Estate Data for Market Structure Analysis 1 Sheng-Chi Chen, 2 Chien-hung Liu 1,2 Department of Management Information Systems, National Chengchi
A Comparative Study of clustering algorithms Using weka tools
A Comparative Study of clustering algorithms Using weka tools Bharat Chaudhari 1, Manan Parikh 2 1,2 MECSE, KITRC KALOL ABSTRACT Data clustering is a process of putting similar data into groups. A clustering
Machine Learning using MapReduce
Machine Learning using MapReduce What is Machine Learning Machine learning is a subfield of artificial intelligence concerned with techniques that allow computers to improve their outputs based on previous
Using Data Mining for Mobile Communication Clustering and Characterization
Using Data Mining for Mobile Communication Clustering and Characterization A. Bascacov *, C. Cernazanu ** and M. Marcu ** * Lasting Software, Timisoara, Romania ** Politehnica University of Timisoara/Computer
Comparison of Data Mining Techniques used for Financial Data Analysis
Comparison of Data Mining Techniques used for Financial Data Analysis Abhijit A. Sawant 1, P. M. Chawan 2 1 Student, 2 Associate Professor, Department of Computer Technology, VJTI, Mumbai, INDIA Abstract
Market Basket Analysis for a Supermarket based on Frequent Itemset Mining
www.ijcsi.org 257 Market Basket Analysis for a Supermarket based on Frequent Itemset Mining Loraine Charlet Annie M.C. 1 and Ashok Kumar D 2 1 Department of Computer Science, Government Arts College Tchy,
Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing
www.ijcsi.org 198 Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing Lilian Sing oei 1 and Jiayang Wang 2 1 School of Information Science and Engineering, Central South University
Data Mining Clustering (2) Sheets are based on the those provided by Tan, Steinbach, and Kumar. Introduction to Data Mining
Data Mining Clustering (2) Toon Calders Sheets are based on the those provided by Tan, Steinbach, and Kumar. Introduction to Data Mining Outline Partitional Clustering Distance-based K-means, K-medoids,
Predict Influencers in the Social Network
Predict Influencers in the Social Network Ruishan Liu, Yang Zhao and Liuyu Zhou Email: rliu2, yzhao2, [email protected] Department of Electrical Engineering, Stanford University Abstract Given two persons
Rule based Classification of BSE Stock Data with Data Mining
International Journal of Information Sciences and Application. ISSN 0974-2255 Volume 4, Number 1 (2012), pp. 1-9 International Research Publication House http://www.irphouse.com Rule based Classification
Mining an Online Auctions Data Warehouse
Proceedings of MASPLAS'02 The Mid-Atlantic Student Workshop on Programming Languages and Systems Pace University, April 19, 2002 Mining an Online Auctions Data Warehouse David Ulmer Under the guidance
Cluster Analysis. Alison Merikangas Data Analysis Seminar 18 November 2009
Cluster Analysis Alison Merikangas Data Analysis Seminar 18 November 2009 Overview What is cluster analysis? Types of cluster Distance functions Clustering methods Agglomerative K-means Density-based Interpretation
Adaptive Framework for Network Traffic Classification using Dimensionality Reduction and Clustering
IV International Congress on Ultra Modern Telecommunications and Control Systems 22 Adaptive Framework for Network Traffic Classification using Dimensionality Reduction and Clustering Antti Juvonen, Tuomo
Chapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH
EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH SANGITA GUPTA 1, SUMA. V. 2 1 Jain University, Bangalore 2 Dayanada Sagar Institute, Bangalore, India Abstract- One
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
Quality Assessment in Spatial Clustering of Data Mining
Quality Assessment in Spatial Clustering of Data Mining Azimi, A. and M.R. Delavar Centre of Excellence in Geomatics Engineering and Disaster Management, Dept. of Surveying and Geomatics Engineering, Engineering
Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control
Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control Andre BERGMANN Salzgitter Mannesmann Forschung GmbH; Duisburg, Germany Phone: +49 203 9993154, Fax: +49 203 9993234;
Robust Outlier Detection Technique in Data Mining: A Univariate Approach
Robust Outlier Detection Technique in Data Mining: A Univariate Approach Singh Vijendra and Pathak Shivani Faculty of Engineering and Technology Mody Institute of Technology and Science Lakshmangarh, Sikar,
2.1. Data Mining for Biomedical and DNA data analysis
Applications of Data Mining Simmi Bagga Assistant Professor Sant Hira Dass Kanya Maha Vidyalaya, Kala Sanghian, Distt Kpt, India (Email: [email protected]) Dr. G.N. Singh Department of Physics and
Analytics on Big Data
Analytics on Big Data Riccardo Torlone Università Roma Tre Credits: Mohamed Eltabakh (WPI) Analytics The discovery and communication of meaningful patterns in data (Wikipedia) It relies on data analysis
Role of Social Networking in Marketing using Data Mining
Role of Social Networking in Marketing using Data Mining Mrs. Saroj Junghare Astt. Professor, Department of Computer Science and Application St. Aloysius College, Jabalpur, Madhya Pradesh, India Abstract:
Clustering. Danilo Croce Web Mining & Retrieval a.a. 2015/201 16/03/2016
Clustering Danilo Croce Web Mining & Retrieval a.a. 2015/201 16/03/2016 1 Supervised learning vs. unsupervised learning Supervised learning: discover patterns in the data that relate data attributes with
ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS
ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS Abstract D.Lavanya * Department of Computer Science, Sri Padmavathi Mahila University Tirupati, Andhra Pradesh, 517501, India [email protected]
Keywords: Mobility Prediction, Location Prediction, Data Mining etc
Volume 4, Issue 4, April 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Data Mining Approach
Web Usage Mining: Identification of Trends Followed by the user through Neural Network
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 617-624 International Research Publications House http://www. irphouse.com /ijict.htm Web
Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining
Data Mining Cluster Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 8 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 4/8/2004 Hierarchical
Future Trend Prediction of Indian IT Stock Market using Association Rule Mining of Transaction data
Volume 39 No10, February 2012 Future Trend Prediction of Indian IT Stock Market using Association Rule Mining of Transaction data Rajesh V Argiddi Assit Prof Department Of Computer Science and Engineering,
IT services for analyses of various data samples
IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical
Clustering. 15-381 Artificial Intelligence Henry Lin. Organizing data into clusters such that there is
Clustering 15-381 Artificial Intelligence Henry Lin Modified from excellent slides of Eamonn Keogh, Ziv Bar-Joseph, and Andrew Moore What is Clustering? Organizing data into clusters such that there is
Standardization and Its Effects on K-Means Clustering Algorithm
Research Journal of Applied Sciences, Engineering and Technology 6(7): 399-3303, 03 ISSN: 040-7459; e-issn: 040-7467 Maxwell Scientific Organization, 03 Submitted: January 3, 03 Accepted: February 5, 03
Chapter 20: Data Analysis
Chapter 20: Data Analysis Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 20: Data Analysis Decision Support Systems Data Warehousing Data Mining Classification
Bisecting K-Means for Clustering Web Log data
Bisecting K-Means for Clustering Web Log data Ruchika R. Patil Department of Computer Technology YCCE Nagpur, India Amreen Khan Department of Computer Technology YCCE Nagpur, India ABSTRACT Web usage mining
Customer Classification And Prediction Based On Data Mining Technique
Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
Implementation of Data Mining Techniques to Perform Market Analysis
Implementation of Data Mining Techniques to Perform Market Analysis B.Sabitha 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, P.Balasubramanian 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin
Chapter ML:XI (continued)
Chapter ML:XI (continued) XI. Cluster Analysis Data Mining Overview Cluster Analysis Basics Hierarchical Cluster Analysis Iterative Cluster Analysis Density-Based Cluster Analysis Cluster Evaluation Constrained
Credit Card Fraud Detection Using Self Organised Map
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 13 (2014), pp. 1343-1348 International Research Publications House http://www. irphouse.com Credit Card Fraud
Neural Networks in Data Mining
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V6 PP 01-06 www.iosrjen.org Neural Networks in Data Mining Ripundeep Singh Gill, Ashima Department
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
Data Warehousing and Data Mining for improvement of Customs Administration in India. Lessons learnt overseas for implementation in India
Data Warehousing and Data Mining for improvement of Customs Administration in India Lessons learnt overseas for implementation in India Participants Shailesh Kumar (Group Leader) Sameer Chitkara (Asst.
Selection of Optimal Discount of Retail Assortments with Data Mining Approach
Available online at www.interscience.in Selection of Optimal Discount of Retail Assortments with Data Mining Approach Padmalatha Eddla, Ravinder Reddy, Mamatha Computer Science Department,CBIT, Gandipet,Hyderabad,A.P,India.
SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING
AAS 07-228 SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING INTRODUCTION James G. Miller * Two historical uncorrelated track (UCT) processing approaches have been employed using general perturbations
The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon
The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon ABSTRACT Effective business development strategies often begin with market segmentation,
A Survey on Intrusion Detection System with Data Mining Techniques
A Survey on Intrusion Detection System with Data Mining Techniques Ms. Ruth D 1, Mrs. Lovelin Ponn Felciah M 2 1 M.Phil Scholar, Department of Computer Science, Bishop Heber College (Autonomous), Trichirappalli,
Strategic Online Advertising: Modeling Internet User Behavior with
2 Strategic Online Advertising: Modeling Internet User Behavior with Patrick Johnston, Nicholas Kristoff, Heather McGinness, Phuong Vu, Nathaniel Wong, Jason Wright with William T. Scherer and Matthew
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING Sumit Goswami 1 and Mayank Singh Shishodia 2 1 Indian Institute of Technology-Kharagpur, Kharagpur, India [email protected] 2 School of Computer
Research on Clustering Analysis of Big Data Yuan Yuanming 1, 2, a, Wu Chanle 1, 2
Advanced Engineering Forum Vols. 6-7 (2012) pp 82-87 Online: 2012-09-26 (2012) Trans Tech Publications, Switzerland doi:10.4028/www.scientific.net/aef.6-7.82 Research on Clustering Analysis of Big Data
Unsupervised learning: Clustering
Unsupervised learning: Clustering Salissou Moutari Centre for Statistical Science and Operational Research CenSSOR 17 th September 2013 Unsupervised learning: Clustering 1/52 Outline 1 Introduction What
Dr. U. Devi Prasad Associate Professor Hyderabad Business School GITAM University, Hyderabad Email: [email protected]
96 Business Intelligence Journal January PREDICTION OF CHURN BEHAVIOR OF BANK CUSTOMERS USING DATA MINING TOOLS Dr. U. Devi Prasad Associate Professor Hyderabad Business School GITAM University, Hyderabad
Tutorial Segmentation and Classification
MARKETING ENGINEERING FOR EXCEL TUTORIAL VERSION 1.0.8 Tutorial Segmentation and Classification Marketing Engineering for Excel is a Microsoft Excel add-in. The software runs from within Microsoft Excel
Advanced Ensemble Strategies for Polynomial Models
Advanced Ensemble Strategies for Polynomial Models Pavel Kordík 1, Jan Černý 2 1 Dept. of Computer Science, Faculty of Information Technology, Czech Technical University in Prague, 2 Dept. of Computer
Clustering on Large Numeric Data Sets Using Hierarchical Approach Birch
Global Journal of Computer Science and Technology Software & Data Engineering Volume 12 Issue 12 Version 1.0 Year 2012 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global
Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health
Lecture 1: Data Mining Overview and Process What is data mining? Example applications Definitions Multi disciplinary Techniques Major challenges The data mining process History of data mining Data mining
Data Mining + Business Intelligence. Integration, Design and Implementation
Data Mining + Business Intelligence Integration, Design and Implementation ABOUT ME Vijay Kotu Data, Business, Technology, Statistics BUSINESS INTELLIGENCE - Result Making data accessible Wider distribution
Application of Data Mining Techniques in Intrusion Detection
Application of Data Mining Techniques in Intrusion Detection LI Min An Yang Institute of Technology [email protected] Abstract: The article introduced the importance of intrusion detection, as well as
What is Customer Relationship Management? Customer Relationship Management Analytics. Customer Life Cycle. Objectives of CRM. Three Types of CRM
Relationship Management Analytics What is Relationship Management? CRM is a strategy which utilises a combination of Week 13: Summary information technology policies processes, employees to develop profitable
DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM
INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM M. Mayilvaganan 1, S. Aparna 2 1 Associate
Practical Applications of DATA MINING. Sang C Suh Texas A&M University Commerce JONES & BARTLETT LEARNING
Practical Applications of DATA MINING Sang C Suh Texas A&M University Commerce r 3 JONES & BARTLETT LEARNING Contents Preface xi Foreword by Murat M.Tanik xvii Foreword by John Kocur xix Chapter 1 Introduction
New Matrix Approach to Improve Apriori Algorithm
New Matrix Approach to Improve Apriori Algorithm A. Rehab H. Alwa, B. Anasuya V Patil Associate Prof., IT Faculty, Majan College-University College Muscat, Oman, [email protected] Associate
A Big Data Analytical Framework For Portfolio Optimization Abstract. Keywords. 1. Introduction
A Big Data Analytical Framework For Portfolio Optimization Dhanya Jothimani, Ravi Shankar and Surendra S. Yadav Department of Management Studies, Indian Institute of Technology Delhi {dhanya.jothimani,
Medical Information Management & Mining. You Chen Jan,15, 2013 [email protected]
Medical Information Management & Mining You Chen Jan,15, 2013 [email protected] 1 Trees Building Materials Trees cannot be used to build a house directly. How can we transform trees to building materials?
An Empirical Study of Application of Data Mining Techniques in Library System
An Empirical Study of Application of Data Mining Techniques in Library System Veepu Uppal Department of Computer Science and Engineering, Manav Rachna College of Engineering, Faridabad, India Gunjan Chindwani
Index Contents Page No. Introduction . Data Mining & Knowledge Discovery
Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.
A Novel Fuzzy Clustering Method for Outlier Detection in Data Mining
A Novel Fuzzy Clustering Method for Outlier Detection in Data Mining Binu Thomas and Rau G 2, Research Scholar, Mahatma Gandhi University,Kerala, India. [email protected] 2 SCMS School of Technology
Mining Association Rules: A Database Perspective
IJCSNS International Journal of Computer Science and Network Security, VOL.8 No.12, December 2008 69 Mining Association Rules: A Database Perspective Dr. Abdallah Alashqur Faculty of Information Technology
SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL
SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL Krishna Kiran Kattamuri 1 and Rupa Chiramdasu 2 Department of Computer Science Engineering, VVIT, Guntur, India
. Learn the number of classes and the structure of each class using similarity between unlabeled training patterns
Outline Part 1: of data clustering Non-Supervised Learning and Clustering : Problem formulation cluster analysis : Taxonomies of Clustering Techniques : Data types and Proximity Measures : Difficulties
Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms
Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms Y.Y. Yao, Y. Zhao, R.B. Maguire Department of Computer Science, University of Regina Regina,
Data Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
