Standardization of Components, Products and Processes with Data Mining
|
|
|
- Andrea Shepherd
- 10 years ago
- Views:
Transcription
1 B. Agard and A. Kusiak, Standardization of Components, Products and Processes with Data Mining, International Conference on Production Research Americas 2004, Santiago, Chile, August 1-4, Standardization of Components, Products and Processes with Data Mining Bruno AGARD Département de Mathématiques et de Génie Industriel, École Polytechnique de Montréal, C.P. 6079, succ. Centre-ville, Montréal (Québec), H3C3A7, Canada and Andrew KUSIAK Intelligent Systems Laboratory, 2139 Seamans Center, Department of Mechanical and Industrial Engineering, The University of Iowa, IA , USA ABSTRACT Data mining offers tools for extracting knowledge from databases. This paper discusses applications of data mining in standardization of components, products, and processes. Standardization of components is accomplished using association rules derived from customers requirements. A design process is proposed for the standardization of products. The design of a unique standardized product, different standardized products, and a standardized product for a selected group of customers are considered. A methodology for process standardization is proposed. Each approach discussed in the paper is illustrated with an industrial scenario. Keywords: Data mining, standardization, component, product, process. 1. INTRODUCTION Engineering design, manufacturing, and marketing activities generate large volumes of data. The data collection efforts are often driven by productivity improvement efforts. This large quantity of data makes it almost impossible for a person to develop a complete understanding of it without using any tools. Examples of data sources growing in size include (see Figure 1): Data from marketing studies describing customers expectations. Data describing actual designs. Data characterizing manufacturing processes used in a company. Manufacturing Figure 1. Typical databases Researches have demonstrated that standardization may result in improved efficiency by taking advantage of the economy of scale and simplifying manufacturing processes. Different levels of standardization are considered (see Figure 2): Use of common components in different products. Design of standardized products for comparable requirements. Standardization of a manufacturing process for similar products. Processes Products Marketing Design Components Figure 2. Different levels of standardization 1
2 The focus of this paper is on extracting knowledge from industrial databases in support of standardisation. Section 2 discusses advantages of standardization of components, products, and processes. Some of the datamining applications of interest to the research presented in this paper are reviewed in Section 3. Section 4 presents opportunities for data mining in standardization. Section 5 concludes the paper. 2. STANDARDIZATION Standardization is concerned with the use of common components, products, or processes to satisfy heterogeneous needs. It necessitates designing an overly robust product or the use of a robust process (often a more flexible process). Different aspects of standardization have been discussed in the literature. Tarondeau [18] argued that standardization results in higher productivity, larger lot sizes, decrease in the number of reference points to be managed, decrease in the stock level, and the reduction of complexity of a manufacturing system. The unnecessary cost of robustness may be balanced by increased productivity and decrease in product and process control. Lee and Tang [15] developed a mathematical model to determine the best compromise between the investment necessary for the standardization and the profit resulting from the economy of scale. They considered the standardization of products and processes. Erol [6] proposed a mathematical model for standardization of low value components that was solved by Dupont et al. [5]. Fouque [8] identified favourable scenarios for the standardization of two components into one. The uncertainty of the components demand increases, the service level of the components increases, the components have similar costs, and the demand for the two components is small. Standardization aggregates the risk and the uncertainty of the standardized component is smaller than the uncertainty of each individual component. Then the buffer sizes may be reduced and the productivity and service level may increase [4]. Thoteman and Brandeau [19] determined the optimal level of internal standardization of products with characteristics that do not differentiate models from the customer s point of view. They concluded that an optimal design (for cost) may be also obtained. The group technology [11] approach involves grouping parts, products, and processes that are similar, and therefore it can be applied to standardization. Numerous algorithms are available to define these groups according to various criteria, e.g., production cost, setup time, and in-process inventory. 3. DATA MINING IN ENGINEERING Anand and Büchner [1] defined data mining as the discovery of non-trivial, implicit, previously unknown, and potentially useful and understandable patterns from large data sets. Available volume of data in engineering provides many opportunities for knowledge extraction with data mining. Fayyad et al. described the iterative process of extracting knowledge from data (data mining). The first step deals with understanding of the application domain and of the goal of the end user. The selected data is preprocessed, reduced, and presented in a suitable format for the mining. According to the goal of the analysis, an appropriate data-mining algorithm is selected. Finally, knowledge is extracted. The end user has to interpret and validate the knowledge for using it or for better specifying the problem for iteration. Westphal and Blaxton [20] categorized data mining functions as classification, segmentation, and description. Classification involves assigning labels to new data based on the knowledge extracted from historical data. Segmentation (called also clustering) divides a population into smaller sub-populations with similar behaviour according to a predefined metric. It maximizes homogeneity within a group and maximizes heterogeneity between the groups. Description and visualization are used to explain the relationships among the data. Frequent patterns may be extracted in the form of A => B rules with two measures of quality: the support which represents the number of times A occurs as a fraction of the total number of examples and confidence which expresses the number of times B exists in the data when A is present. Berry and Linoff [2] presented numerous examples and applications of data mining in marketing, sales, and customer support. Büchner et al. [3] focused on the applications of data mining in manufacturing. Leu et al. [16] applied data mining in support of the tunnel design. Prediction of the quality products in the semiconductor industry was discussed in Kusiak [13]. Gertosio and Dussauchoy [9] applied data mining to reduce the processing time for checking and adjusting electronically controlled truck diesel engines. Huang et al. [10] acquired knowledge from data automatically to generate IF-THEN rules for a knowledge-based expert system in the design and manufacturing of micromachined atomizer. Many algorithms are available for extracting knowledge from databases [7]. The efficiency of these algorithms may be improved with decomposition methods [12] and feature transformation methods [14]. 2
3 4. DATA MINING AND STANDARDIZATION Data mining techniques are useful in providing answers to the following questions: Which components could be standardized? Which products could be standardized? Which processes could be standardized? Different application scenarios for standardization of components, products, and processes are presented in Section 4.1, Section 4.2, and Section 4.3, respectively. 4.1 Data mining in standardization of components Consider a company manufacturing personalized products with numerous options and alternatives. Each personalized product meets the requirements of a specific customer. To simplify manufacturing of personalized products and meeting an acceptable lead time, standardization of the components is considered. Assume the availably of historical data about the features selected by the customers. Table 1 represents the data relating six options and alternatives selected by eight customers. Table 1. The relationship between customers, options, and alternatives F1 F2 F3 F4 F5 F6 Customer Customer Customer Customer Customer Customer 6 1 Customer Customer The data in Table 1 show that Customer 1 has selected features F1, F2, F3, F4 and F6. Customer 2 has selected features F2, F4 and F6, and so on. Information contained in Table 1 may lead to useful patterns for the company. The similarities in customers requirements may imply relationship between the corresponding features. The data in Table 1 could be mined to derive associations between customers requirements (see Figure 3). Customer 1 Customer 2 Customer 3 Customer 4 F1 Figure 3. Associations between customers requirements Example rules extracted from the data in Table 1 are presented next: Rule 1. F1 => F3, with support = 62.5% (= 5/8) and confidence = 80 % (= 4/5) Rule 2. (F2 & F4) => F6, with support = 50 % (= 4/8) and confidence = 100% (= 4/4) Rule 3. F5 => NOT F2, with support = 37.5 % (= 3/8) and confidence = 100% (= 4/4) Associations among requirements with high support and confidence suggest standardization, for example: Rule 1. Suggests a standardized component supporting features F1 and F3. The standardized component {F1, F3} would meet the requirements {F1}, {F3} and {F1, F3}. Rule 2. This rule suggests the design of a Rule 3. F2 F3 standardized component {F2, F4, F6}. This rule implies that whenever F5 is present, F2 is not included, i.e., F2 and F5 should not be realized as one component. The associations among customers requirements provide indications for component standardization. The design team needs to evaluate the value of these indications. 4.2 Data mining in standardization of products Consider a company performing a marketing study preceding the design of a new product. The goal of the study is to obtain information about customers expectations. Sample data from a marketing study is shown in Table 2. F4 F5 F6 3
4 Table 2. Description of customers and their requirements Customer Description Product Requirement Age Gender Income Insurance F1 F2 F3 F4 F5 F6 Customer 1 35 F L Yes Customer 2 55 M H Yes Customer 3 40 M M Yes Customer 4 28 F H No Customer 5 29 F L Yes Customer 6 50 M H Yes Customer 7 32 F M No Customer 8 37 M L No Customer 9 48 M M No Customer M H Yes Table 2 contains data about customers, e.g., Customer 1 is 35 years old, a female with low income and insurance. Moreover, Customer 1 expects the features F1, F3 and F4 to be present in the product. The similarity among customers requirements may be analyzed. Any cluster identified in Table 1 may indicate opportunities for product standardization. Thus analyzing the customers data may emphasize customers of special interest. The following alternatives are considered in this section: The design of a standardized product for a large population of customers. The design of different standardized products (How many? Which ones?). The design of a standardized product for selected customers Neural network The three alternatives are discussed next. Design of a standardized product for a large population of customers To design a standardized product, the functions to be integrated into the product need to be specified. A standardized product does not need to support all the functions. Some functions may be integrated within the design while others can be excluded. A representative (target) customer model can be constructed using an averaging technique. However, such a customer could not represent any actual customers. A neural network is able to learn customers preferences to construct a target customer that could be used in the design of the standardized product (Figure 4). List of requirements Target customer Design Standardized product based on the target customer Figure 4. Design process for a standardized product 4
5 Once the requirements for the target customer are identified, economic evaluation of each feature may be performed. To determine its inclusion or exclusion in the standardized product two questions have to be answered: How much does it cost to design such an option/alternative? What is the loss of NOT designing it? For more details on the design of standardized products see [17]. Design of different standardized products To better match the customers requirements, it may be judicious to design different standardized products rather than a unique one. Each standardized product could be designed for a specific subset of customers. The latter calls for determining the number of products to be designed. Clustering is useful here as it defines groups of data objects that are similar according to the predefined metrics. In this paper, clustering is used to identify similar customers that share the same or highly similar behaviours (see Figure 5). The following process is suggested: 1. Define the expected number of clusters. The initial value could be the number of expected standardized products. 2. Form clusters of customers. 3. Analyze each cluster for suitability for standardization. Some clusters may not be relevant (not enough customers to ensure benefits). 4. If some clusters are disqualified, customers from these clusters could be considered or not in the next iteration. Elements from the same cluster have close requirements; it implies the design of a dedicated product for each cluster (Figure 5). Clustering Requirements Cluster 1 Cluster 2 Cluster 3 Neural network Neural network Not relevant for design Target customer 1 Design 1 Target customer 2 Design 2 Standardized product 1 based on Standardized product 2 based on cluster 1 cluster 2 Figure 5. Design process for different standardized products 5
6 Applying clustering to Table 2 based on features F1,, F6 for extracting three clusters; the following result is obtained (see Table 3): Table 3. Three clusters of customers Customer Requirement Cluster 1 (50%) Cluster 2 (30%) F1 F2 F3 F4 F5 F6 Customer Customer Customer Customer Customer Customer Customer Customer Cluster 3 Customer (20%) Customer The data in Table 3 shows that customers 1, 3, 4, 7 and 8 share similar behaviours expressed with features F1,..., F6. In the same way customers 2, 6 and 10 are similar. The opportunities for standardization to be considered here are: Cluster 1: The design of a product with features F1, F3 and F List of requirements Classification Cluster 2: The design of a product with features F2 and F5. Cluster 3: The design of a product with features F1, F2, F3, F4, F5 and F6. Table 3 also provides information about the size of each cluster, which may be useful in a cost-based evaluation. If Cluster 3 was too small to provide enough benefits, it would not be considered for the design of a product. The company may concentrate on the two other clusters or alternatively another iteration of the clustering algorithm could be considered. Some customers could be deleted from the clustering process. Different clusters could be tested and the resulting designs evaluated to select the best alternative. Design a unique standardized product for selected customers For a company that focuses on specific categories of customers for strategic reasons (fashion, reputation, test of new markets), it may be helpful to identify relevant sets of customers. Using the customers data in Table 2, classification separates the customers in different categories based on the relevant characteristics identified by the company. From the selected set of customers the company may learn their preferences and design a dedicated product. For example, it is possible to differentiate the preferences of different categories of customers (young people, female, old men) and design specific products or adapt specific services (Figure 6) Set of selected customers Neural network Design Standardized product based on Target customer the target customer Figure 6. Design process of a standardized product for selected customers 6
7 4.3 Data mining in process standardization A company manufacturing diverse parts could establish a database describing all previously manufactured parts, their characteristics, and the processes used for their manufacturing (see Table 4). Table 4. Description of parts and the corresponding manufacturing process Part Description Manufacturing Process C1 C2 C3 C4 C5 C6 P1 P2 P3 P4 P5 P6 Part M1 M4 M7 M2 Part M5 M2 M1 M4 Part M4 M3 M8 Part M2 M7 Part M5 M2 M1 M4 Part M4 M7 M2 Part M1 M4 M7 M5 M2 The data in Table 4 shows that Part 1 has characteristics C1, C2, C4 and C6. The manufacturing process of this part is Machine 1, Machine 4, Machine 7 and Machine 2. Part 2 shares characteristics C2, C3, C5 and C6 and involves Machine 5, Machine 2, Machine 1 and Machine 4. A data mining algorithm may link the description of parts to the manufacturing processes. The goal is to generate standardized processes for manufacturing new parts. The following methodology is proposed (see Figure 7): 1. Similar parts and processes are grouped into clusters. 2. Standardized processes are affected to each cluster. 3. When a new part is to be manufactured according to the identified cluster it impacts the standardized process. 4. The standardized process is adapted to the new part and the new data is incorporated into the database to enrich it. Step 1 and 2 represent the learning stage. Step 3 exploits the knowledge extracted. Step 4 adjusts the knowledge provided for a new item and provides one more element in the database (Figure 7). 7
8 Data similar to that of in Table 4 1 Clustering Learning stage 4 Cluster 1: Similar parts and processes Cluster 2 Cluster 2 Neural network Standardized process for cluster 1 New part to be manufactured Classification Part included in a cluster 3 Manufacturing process for the new part Standardized process for the cluster Figure 7. Process standardization with a data mining approach The more complete the database is, the more exhaustive is the standardized process suggested in Step 3. For example, the database may contain the sequence of machines as well as manufacturing parameters (speed, temperatures, times, settings). 5. CONCLUSION This paper discussed the application of data mining in standardization of components, products and processes. The standardization of components is realized through the extraction of association rules from the historical data representing customers requirements. Various options for the application of data mining in standardization of products were discussed. The design of a standardized product for a large population of customers was presented as well as the design of different standardized products. A data mining approach was used to select and prioritize a set of customers, e.g., a top-of-the-range product for high income customers. Finally, the standardization of processes was considered for improving the efficiency of the manufacturing process to produce new parts. This paper has illustrated that data mining is a useful tool for standardization. The existing industrial 8
9 databases should be explored more systematically to extract new knowledge. The latter is key to the improvement of products, processes, and systems. 6. REFERENCES 1. Anand, S.S. and Büchner, A.G., Decision Support Using Data Mining, Financial Times Pitman Publishers, London, UK, Berry, M.J.A. and Linoff, G., Data Mining Techniques: For Marketing, Sales, and Customer Support, New York: Wiley, Büchner, A.G., Anand, S.S. and Hugues, J.G., Data Mining in Manufacturing Environments: Goals, Techniques and Applications, in Studies in Informatics and Control, Vol. 6, No 4, 1997, pp Dupont, L., La Gestion Industrielle : Concepts et Outils, Hermès, Paris, Dupont, L., Erol, M., Cormier, G. and Turkkan, N., La standardisation des composants: modèles et algorithmes, 3ème Congrès International de Génie Industriel, Montréal, Canada, May, 1999, pp Erol, M., Prise en compte de la flexibilité dans la planification dynamique, PhD Thesis, Institut National Polytechnique de Grenoble, France, Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P. and Uthurusamy, R., Advances in Knowledge Discovery and Data Mining, AAAI Press/The MIT Press, Cambridge, MA, Fouque, T., A la recherche des produits flexibles, Revue Française de Gestion, mars-avril-mai, 1999, pp Gertosio, C. and Dussauchoy, A., Knowledge discovery from industrial databases, Journal of Intelligent Manufacturing, Vol. 15, 2004, pp Huang, S.H., Hao, X. and Benjamin, M., Automated knowledge acquisition for design and manufacturing: The case of micromachined atomizer, Journal of Intelligent Manufacturing, Vol. 12, 2001, pp Hyer, N. and Wemmerlöv, U., Group technology and productivity, Harvard Business Review, July- August, 1984, pp Kusiak, A., Decomposition in data mining: An industrial case study, IEEE Transactions on Electronics Packaging Manufacturing, Vol. 23, No. 4, 2000, pp Kusiak, A., Rough set theory: A data mining tool for semiconductor manufacturing, IEEE Transactions on Electronics Packaging Manufacturing, Vol. 24, No. 1, 2001, pp Kusiak, A., Feature Transformation Methods in Data Mining, IEEE Transactions on Electronics Packaging Manufacturing, Vol. 24, No. 3, 2001, pp Lee, H.L. and Tang, C.S., Modeling the costs and benefits of delayed product differentiation, Management Science, Vol. 43, No. 1, 1997, pp Leu, S.S., Chen, C.N. and Chang, S.L., Data mining for tunnel support stability: neural network approach, Automation in Construction, Vol. 10, No. 4, 2001, pp Pahl, G. and Beitz, W., Engineering Design, Berlin, Springer-Verlag, Tarondeau, J.C., Stratégie Industrielle: Second Edition, Collection Gestion, Vuibert, Thonemann, U.W. and Brandeau, M., Optimal commonality in component design, Operations Research, Vol. 48, No. 1, 2000, pp Westphal, C. and Blaxton, T., Data Mining Solutions, John Wiley, New York,
Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control
Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control Andre BERGMANN Salzgitter Mannesmann Forschung GmbH; Duisburg, Germany Phone: +49 203 9993154, Fax: +49 203 9993234;
How To Use Data Mining For Knowledge Management In Technology Enhanced Learning
Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanbul, Turkey, May 27-29, 2007 115 Data Mining for Knowledge Management in Technology Enhanced Learning
Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms
Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms Y.Y. Yao, Y. Zhao, R.B. Maguire Department of Computer Science, University of Regina Regina,
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
Use of Data Mining in the field of Library and Information Science : An Overview
512 Use of Data Mining in the field of Library and Information Science : An Overview Roopesh K Dwivedi R P Bajpai Abstract Data Mining refers to the extraction or Mining knowledge from large amount of
ISSN: 2321-7782 (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies
ISSN: 2321-7782 (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
International Journal of World Research, Vol: I Issue XIII, December 2008, Print ISSN: 2347-937X DATA MINING TECHNIQUES AND STOCK MARKET
DATA MINING TECHNIQUES AND STOCK MARKET Mr. Rahul Thakkar, Lecturer and HOD, Naran Lala College of Professional & Applied Sciences, Navsari ABSTRACT Without trading in a stock market we can t understand
Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing
www.ijcsi.org 198 Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing Lilian Sing oei 1 and Jiayang Wang 2 1 School of Information Science and Engineering, Central South University
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM
INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM M. Mayilvaganan 1, S. Aparna 2 1 Associate
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Mining Public Transport User Behaviour from Smart Card Data
Mining Public Transport User Behaviour from Smart Card Data Bruno Agard Catherine Morency Martin Trépanier October 2007 CIRRELT-2007-42 Bruno Agard 1,2, Catherine Morency 1,3, Martin Trépanier 1,2,3,*
Data Mining and KDD: A Shifting Mosaic. Joseph M. Firestone, Ph.D. White Paper No. Two. March 12, 1997
1 of 11 5/24/02 3:50 PM Data Mining and KDD: A Shifting Mosaic By Joseph M. Firestone, Ph.D. White Paper No. Two March 12, 1997 The Idea of Data Mining Data Mining is an idea based on a simple analogy.
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
Rule based Classification of BSE Stock Data with Data Mining
International Journal of Information Sciences and Application. ISSN 0974-2255 Volume 4, Number 1 (2012), pp. 1-9 International Research Publication House http://www.irphouse.com Rule based Classification
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
Data Mining Analytics for Business Intelligence and Decision Support
Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing
Introduction to Data Mining
Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:
Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction
Data Mining and Exploration Data Mining and Exploration: Introduction Amos Storkey, School of Informatics January 10, 2006 http://www.inf.ed.ac.uk/teaching/courses/dme/ Course Introduction Welcome Administration
FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT
FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT ANURADHA.T Assoc.prof, [email protected] SRI SAI KRISHNA.A [email protected] SATYATEJ.K [email protected] NAGA ANIL KUMAR.G
A Spatial Decision Support System for Property Valuation
A Spatial Decision Support System for Property Valuation Katerina Christopoulou, Muki Haklay Department of Geomatic Engineering, University College London, Gower Street, London WC1E 6BT Tel. +44 (0)20
Data Mining Applications in Fund Raising
Data Mining Applications in Fund Raising Nafisseh Heiat Data mining tools make it possible to apply mathematical models to the historical data to manipulate and discover new information. In this study,
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Keywords Data Mining, Knowledge Discovery, Direct Marketing, Classification Techniques, Customer Relationship Management
Volume 4, Issue 6, June 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Simplified Data
BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL
The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University
Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement
Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement Toshio Sugihara Abstract In this study, an adaptive
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
Data Mining Algorithms and Techniques Research in CRM Systems
Data Mining Algorithms and Techniques Research in CRM Systems ADELA TUDOR, ADELA BARA, IULIANA BOTHA The Bucharest Academy of Economic Studies Bucharest ROMANIA {Adela_Lungu}@yahoo.com {Bara.Adela, Iuliana.Botha}@ie.ase.ro
Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina
Graduate Co-op Students Information Manual Department of Computer Science Faculty of Science University of Regina 2014 1 Table of Contents 1. Department Description..3 2. Program Requirements and Procedures
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
Introduction to Data Mining
Introduction to Data Mining José Hernández ndez-orallo Dpto.. de Systems Informáticos y Computación Universidad Politécnica de Valencia, Spain [email protected] Horsens, Denmark, 26th September 2005
IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES
IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES Bruno Carneiro da Rocha 1,2 and Rafael Timóteo de Sousa Júnior 2 1 Bank of Brazil, Brasília-DF, Brazil [email protected] 2 Network Engineering
Static Data Mining Algorithm with Progressive Approach for Mining Knowledge
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 85-93 Research India Publications http://www.ripublication.com Static Data Mining Algorithm with Progressive
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
Software project cost estimation using AI techniques
Software project cost estimation using AI techniques Rodríguez Montequín, V.; Villanueva Balsera, J.; Alba González, C.; Martínez Huerta, G. Project Management Area University of Oviedo C/Independencia
Using Semantic Data Mining for Classification Improvement and Knowledge Extraction
Using Semantic Data Mining for Classification Improvement and Knowledge Extraction Fernando Benites and Elena Sapozhnikova University of Konstanz, 78464 Konstanz, Germany. Abstract. The objective of this
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
Equational Reasoning as a Tool for Data Analysis
AUSTRIAN JOURNAL OF STATISTICS Volume 31 (2002), Number 2&3, 231-239 Equational Reasoning as a Tool for Data Analysis Michael Bulmer University of Queensland, Brisbane, Australia Abstract: A combination
Data Mining System, Functionalities and Applications: A Radical Review
Data Mining System, Functionalities and Applications: A Radical Review Dr. Poonam Chaudhary System Programmer, Kurukshetra University, Kurukshetra Abstract: Data Mining is the process of locating potentially
Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland
Data Mining and Knowledge Discovery in Databases (KDD) State of the Art Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland 1 Conference overview 1. Overview of KDD and data mining 2. Data
What is Data Mining? Data Mining (Knowledge discovery in database) Data mining: Basic steps. Mining tasks. Classification: YES, NO
What is Data Mining? Data Mining (Knowledge discovery in database) Data Mining: "The non trivial extraction of implicit, previously unknown, and potentially useful information from data" William J Frawley,
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
Sensitivity Analysis for Data Mining
Sensitivity Analysis for Data Mining J. T. Yao Department of Computer Science University of Regina Regina, Saskatchewan Canada S4S 0A2 E-mail: [email protected] Abstract An important issue of data mining
72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD
72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD Paulo Gottgtroy Auckland University of Technology [email protected] Abstract This paper is
Prediction of Heart Disease Using Naïve Bayes Algorithm
Prediction of Heart Disease Using Naïve Bayes Algorithm R.Karthiyayini 1, S.Chithaara 2 Assistant Professor, Department of computer Applications, Anna University, BIT campus, Tiruchirapalli, Tamilnadu,
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM ABSTRACT Luis Alexandre Rodrigues and Nizam Omar Department of Electrical Engineering, Mackenzie Presbiterian University, Brazil, São Paulo [email protected],[email protected]
A Performance Study of Load Balancing Strategies for Approximate String Matching on an MPI Heterogeneous System Environment
A Performance Study of Load Balancing Strategies for Approximate String Matching on an MPI Heterogeneous System Environment Panagiotis D. Michailidis and Konstantinos G. Margaritis Parallel and Distributed
Evaluating an Integrated Time-Series Data Mining Environment - A Case Study on a Chronic Hepatitis Data Mining -
Evaluating an Integrated Time-Series Data Mining Environment - A Case Study on a Chronic Hepatitis Data Mining - Hidenao Abe, Miho Ohsaki, Hideto Yokoi, and Takahira Yamaguchi Department of Medical Informatics,
Data Mining. Knowledge Discovery, Data Warehousing and Machine Learning Final remarks. Lecturer: JERZY STEFANOWSKI
Data Mining Knowledge Discovery, Data Warehousing and Machine Learning Final remarks Lecturer: JERZY STEFANOWSKI Email: [email protected] Data Mining a step in A KDD Process Data mining:
TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY
TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY Danubianu Mirela Stefan cel Mare University of Suceava Faculty of Electrical Engineering andcomputer Science 13 Universitatii Street, Suceava
Distributed forests for MapReduce-based machine learning
Distributed forests for MapReduce-based machine learning Ryoji Wakayama, Ryuei Murata, Akisato Kimura, Takayoshi Yamashita, Yuji Yamauchi, Hironobu Fujiyoshi Chubu University, Japan. NTT Communication
Support Vector Machines with Clustering for Training with Very Large Datasets
Support Vector Machines with Clustering for Training with Very Large Datasets Theodoros Evgeniou Technology Management INSEAD Bd de Constance, Fontainebleau 77300, France [email protected] Massimiliano
Chapter 2 Literature Review
Chapter 2 Literature Review 2.1 Data Mining The amount of data continues to grow at an enormous rate even though the data stores are already vast. The primary challenge is how to make the database a competitive
CHALLENGES AND APPROACHES FOR KNOWLEDGE MANAGEMENT. Jean-Louis ERMINE CEA/UTT
CHALLENGES AND APPROACHES FOR KNOWLEDGE MANAGEMENT Jean-Louis ERMINE CEA/UTT Abstract: Knowledge Management is now a crucial issue in companies: Knowledge is a major economic challenge for the future.
Process Mining in Big Data Scenario
Process Mining in Big Data Scenario Antonia Azzini, Ernesto Damiani SESAR Lab - Dipartimento di Informatica Università degli Studi di Milano, Italy antonia.azzini,[email protected] Abstract. In
ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION
ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION K.Vinodkumar 1, Kathiresan.V 2, Divya.K 3 1 MPhil scholar, RVS College of Arts and Science, Coimbatore, India. 2 HOD, Dr.SNS
IMPLEMENTING A SPC INTEGRATED SYSTEM TO IMPROVE MANUFACTURING PROCESSES IN AUTOMOTIVE INDUSTRY
IMPLEMENTING A SPC INTEGRATED SYSTEM TO IMPROVE MANUFACTURING PROCESSES IN AUTOMOTIVE INDUSTRY L., Lobon 1, C. V., Kifor 2, C., Oprean 3 and O., Suciu 4 1 Lucian Blaga University of Sibiu, [email protected]
DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support
DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support Rok Rupnik, Matjaž Kukar, Marko Bajec, Marjan Krisper University of Ljubljana, Faculty of Computer and Information
Quality Assessment in Spatial Clustering of Data Mining
Quality Assessment in Spatial Clustering of Data Mining Azimi, A. and M.R. Delavar Centre of Excellence in Geomatics Engineering and Disaster Management, Dept. of Surveying and Geomatics Engineering, Engineering
Text Mining: The state of the art and the challenges
Text Mining: The state of the art and the challenges Ah-Hwee Tan Kent Ridge Digital Labs 21 Heng Mui Keng Terrace Singapore 119613 Email: [email protected] Abstract Text mining, also known as text data
Visualization of Breast Cancer Data by SOM Component Planes
International Journal of Science and Technology Volume 3 No. 2, February, 2014 Visualization of Breast Cancer Data by SOM Component Planes P.Venkatesan. 1, M.Mullai 2 1 Department of Statistics,NIRT(Indian
An Empirical Study of Application of Data Mining Techniques in Library System
An Empirical Study of Application of Data Mining Techniques in Library System Veepu Uppal Department of Computer Science and Engineering, Manav Rachna College of Engineering, Faridabad, India Gunjan Chindwani
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.
DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,
College information system research based on data mining
2009 International Conference on Machine Learning and Computing IPCSIT vol.3 (2011) (2011) IACSIT Press, Singapore College information system research based on data mining An-yi Lan 1, Jie Li 2 1 Hebei
DATA MINING FOR THE MANAGEMENT OF SOFTWARE DEVELOPMENT PROCESS
International Journal of Software Engineering and Knowledge Engineering Vol. 0, No. 0 (1994) 000 000 c World Scientific Publishing Company DATA MINING FOR THE MANAGEMENT OF SOFTWARE DEVELOPMENT PROCESS
Praseeda Manoj Department of Computer Science Muscat College, Sultanate of Oman
International Journal of Electronics and Computer Science Engineering 290 Available Online at www.ijecse.org ISSN- 2277-1956 Analysis of Grid Based Distributed Data Mining System for Service Oriented Frameworks
Dynamic Data in terms of Data Mining Streams
International Journal of Computer Science and Software Engineering Volume 2, Number 1 (2015), pp. 1-6 International Research Publication House http://www.irphouse.com Dynamic Data in terms of Data Mining
What is Customer Relationship Management? Customer Relationship Management Analytics. Customer Life Cycle. Objectives of CRM. Three Types of CRM
Relationship Management Analytics What is Relationship Management? CRM is a strategy which utilises a combination of Week 13: Summary information technology policies processes, employees to develop profitable
Data Mining and Machine Learning in Bioinformatics
Data Mining and Machine Learning in Bioinformatics PRINCIPAL METHODS AND SUCCESSFUL APPLICATIONS Ruben Armañanzas http://mason.gmu.edu/~rarmanan Adapted from Iñaki Inza slides http://www.sc.ehu.es/isg
Mercy Health System. St. Louis, MO. Process Mining of Clinical Workflows for Quality and Process Improvement
Mercy Health System St. Louis, MO Process Mining of Clinical Workflows for Quality and Process Improvement Paul Helmering, Executive Director, Enterprise Architecture Pete Harrison, Data Analyst, Mercy
Introduction to Data Mining
Introduction to Data Mining a.j.m.m. (ton) weijters (slides are partially based on an introduction of Gregory Piatetsky-Shapiro) Overview Why data mining (data cascade) Application examples Data Mining
ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam
ECLT 5810 E-Commerce Data Mining Techniques - Introduction Prof. Wai Lam Data Opportunities Business infrastructure have improved the ability to collect data Virtually every aspect of business is now open
DATA MINING FOR BUSINESS INTELLIGENCE. Data Mining For Business Intelligence: MIS 382N.9/MKT 382 Professor Maytal Saar-Tsechansky
DATA MINING FOR BUSINESS INTELLIGENCE PROFESSOR MAYTAL SAAR-TSECHANSKY Data Mining For Business Intelligence: MIS 382N.9/MKT 382 Professor Maytal Saar-Tsechansky This course provides a comprehensive introduction
Role of Social Networking in Marketing using Data Mining
Role of Social Networking in Marketing using Data Mining Mrs. Saroj Junghare Astt. Professor, Department of Computer Science and Application St. Aloysius College, Jabalpur, Madhya Pradesh, India Abstract:
APPLICATION OF INTELLIGENT METHODS IN COMMERCIAL WEBSITE MARKETING STRATEGIES DEVELOPMENT
ISSN 1392 124X INFORMATION TECHNOLOGY AND CONTROL, 2005, Vol.34, No.2 APPLICATION OF INTELLIGENT METHODS IN COMMERCIAL WEBSITE MARKETING STRATEGIES DEVELOPMENT Algirdas Noreika Department of Practical
CHAPTER 1 INTRODUCTION
1 CHAPTER 1 INTRODUCTION Exploration is a process of discovery. In the database exploration process, an analyst executes a sequence of transformations over a collection of data structures to discover useful
Statistics 215b 11/20/03 D.R. Brillinger. A field in search of a definition a vague concept
Statistics 215b 11/20/03 D.R. Brillinger Data mining A field in search of a definition a vague concept D. Hand, H. Mannila and P. Smyth (2001). Principles of Data Mining. MIT Press, Cambridge. Some definitions/descriptions
Chapter ML:XI. XI. Cluster Analysis
Chapter ML:XI XI. Cluster Analysis Data Mining Overview Cluster Analysis Basics Hierarchical Cluster Analysis Iterative Cluster Analysis Density-Based Cluster Analysis Cluster Evaluation Constrained Cluster
Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies
Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com
Data Mining: Introduction. Lecture Notes for Chapter 1. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler
Data Mining: Introduction Lecture Notes for Chapter 1 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Why Mine Data? Commercial Viewpoint Lots of data is being collected and warehoused - Web
Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results
, pp.33-40 http://dx.doi.org/10.14257/ijgdc.2014.7.4.04 Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results Muzammil Khan, Fida Hussain and Imran Khan Department
