Combining SOM and GA-CBR for Flow Time Prediction in Semiconductor Manufacturing Factory



Similar documents
The development of a weighted evolving fuzzy neural network for PCB sales forecasting

Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

Applied Mathematical Sciences, Vol. 7, 2013, no. 112, HIKARI Ltd,

Software Development Cost and Time Forecasting Using a High Performance Artificial Neural Network Model

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

A Data Mining Study of Weld Quality Models Constructed with MLP Neural Networks from Stratified Sampled Data

The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network

Pattern Recognition Using Feature Based Die-Map Clusteringin the Semiconductor Manufacturing Process

A Case Retrieval Method for Knowledge-Based Software Process Tailoring Using Structural Similarity

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION

Data Mining Part 5. Prediction

A Review of Data Mining Techniques

Data Mining: A Preprocessing Engine

Statistical Learning for Short-Term Photovoltaic Power Predictions

Visualization of large data sets using MDS combined with LVQ.

To improve the problems mentioned above, Chen et al. [2-5] proposed and employed a novel type of approach, i.e., PA, to prevent fraud.

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Flexible Neural Trees Ensemble for Stock Index Modeling

Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier

Numerical Research on Distributed Genetic Algorithm with Redundant

Towards applying Data Mining Techniques for Talent Mangement

DATA MINING TECHNIQUES AND APPLICATIONS

THE INTEGRATION OF SUPPLY CHAIN MANAGEMENT AND SIMULATION SYSTEM WITH APPLICATION TO RETAILING MODEL. Pei-Chann Chang, Chen-Hao Liu and Chih-Yuan Wang

Advanced analytics at your hands

Stock price prediction using genetic algorithms and evolution strategies

Sales Forecast for Pickup Truck Parts:

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

DATA MINING-BASED PREDICTIVE MODEL TO DETERMINE PROJECT FINANCIAL SUCCESS USING PROJECT DEFINITION PARAMETERS

Comparison of K-means and Backpropagation Data Mining Algorithms

Big Data with Rough Set Using Map- Reduce

Neural Network Applications in Stock Market Predictions - A Methodology Analysis

Chapter 2 The Research on Fault Diagnosis of Building Electrical System Based on RBF Neural Network

An Analysis of Missing Data Treatment Methods and Their Application to Health Care Dataset

Genetic Algorithm Evolution of Cellular Automata Rules for Complex Binary Sequence Prediction

Soft-Computing Models for Building Applications - A Feasibility Study (EPSRC Ref: GR/L84513)

Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network

A survey on Data Mining based Intrusion Detection Systems

Efficient Security Alert Management System

Computing the Carbon Footprint Supply Chain for the Semiconductor Industry: A Learning Tool.

A Proposed Scheme for Software Project Scheduling and Allocation with Event Based Scheduler using Ant Colony Optimization

An Analysis on Density Based Clustering of Multi Dimensional Spatial Data

DATA MINING IN FINANCE AND ACCOUNTING: A REVIEW OF CURRENT RESEARCH TRENDS

OPTIMIZED SENSOR NODES BY FAULT NODE RECOVERY ALGORITHM

Design call center management system of e-commerce based on BP neural network and multifractal

Towards Cloud Factory Simulation. Abstract

Automatic Detection of PCB Defects

Planning Workforce Management for Bank Operation Centers with Neural Networks

A Hybrid Tabu Search Method for Assembly Line Balancing

Prediction Model for Crude Oil Price Using Artificial Neural Networks

Application of Genetic Algorithm to Scheduling of Tour Guides for Tourism and Leisure Industry

ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION

BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL

Programming Risk Assessment Models for Online Security Evaluation Systems

An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network

An Overview of Knowledge Discovery Database and Data mining Techniques

Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control

A Comparative Study of the Pickup Method and its Variations Using a Simulated Hotel Reservation Data

Segmentation of stock trading customers according to potential value

A New Approach For Estimating Software Effort Using RBFN Network

A New Method for Traffic Forecasting Based on the Data Mining Technology with Artificial Intelligent Algorithms

Applications of improved grey prediction model for power demand forecasting

GENETIC ALGORITHM FORECASTING FOR TELECOMMUNICATIONS PRODUCTS

Comparison of Request Admission Based Performance Isolation Approaches in Multi-tenant SaaS Applications

An Improved Ant Colony Optimization Algorithm for Software Project Planning and Scheduling

Predictive Dynamix Inc

Occupation Cluster: Manufacturing Technician Function or Job Duty: A Establish Internal/External Customer Needs

DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES

Use of Artificial Neural Network in Data Mining For Weather Forecasting

Analecta Vol. 8, No. 2 ISSN

A Method of Cloud Resource Load Balancing Scheduling Based on Improved Adaptive Genetic Algorithm

A Parallel Processor for Distributed Genetic Algorithm with Redundant Binary Number

Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms

Price Prediction of Share Market using Artificial Neural Network (ANN)

Early defect identification of semiconductor processes using machine learning

Prediction of Stock Performance Using Analytical Techniques

Comparison of Supervised and Unsupervised Learning Classifiers for Travel Recommendations

Evaluation of Feature Selection Methods for Predictive Modeling Using Neural Networks in Credits Scoring

How To Use Neural Networks In Data Mining

A Study of Crossover Operators for Genetic Algorithm and Proposal of a New Crossover Operator to Solve Open Shop Scheduling Problem

Up/Down Analysis of Stock Index by Using Bayesian Network

A Review And Evaluations Of Shortest Path Algorithms

Data Mining Part 5. Prediction

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition

Performance Based Evaluation of New Software Testing Using Artificial Neural Network

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

DATA PREPARATION FOR DATA MINING

A Binary Model on the Basis of Imperialist Competitive Algorithm in Order to Solve the Problem of Knapsack 1-0

How to Learn Good Cue Orders: When Social Learning Benefits Simple Heuristics

International Journal of Software and Web Sciences (IJSWS)

Transcription:

Combining SOM and GA-CBR for Flow Time Prediction in Semiconductor Manufacturing Factory Pei-Chann Chang 12, Yen-Wen Wang 3, Chen-Hao Liu 2 1 Department of Information Management, Yuan-Ze University, 2 Department of Industrial Engineering and Management, Yuan-Ze University, 135 Yuan-Dong Rd., Taoyuan 32026, Taiwan, R.O.C. 3 Department of Industrial Engineering and Management, Ching-Yun University, 229 Chien-Hsin Rd., Taoyuan 320, Taiwan, R.O.C. iepchang@saturn.yzu.edu.tw Abstract. of semiconductor manufacturing factory is highly related to the shop floor status; however, the processes are highly complicated and involve more than hundred of production steps. Therefore, a simulation model with the production process of a real wafer fab located in Hsin-Chu Sciencebased Park of Taiwan is built. In this research, a hybrid approach by combining Self-Organizing Map (SOM) and Case-Based Reasoning (CBR) for flow time prediction in semiconductor manufacturing factory is proposed. And Genetic Algorithm (GA) is applied to fine-tune the weights of features in the CBR model. The flow time and related shop floor status are collected and fed into the SOM for classification. Then, corresponding GA-CBR is selected and applied for flow time prediction. Finally, using the simulated data, the effectiveness of the proposed method (SGA-CBR) is shown by comparing with other approaches. Keywords: prediction, Case-Based Reasoning, Genetic Algorithms, Self-Organizing Map 1 Introduction prediction is an important feature of a semiconductor manufacturing problem, which is the basis used to estimate the due date of a new order under current shop floor status. Traditionally, assigning due date for each order is accomplished by the production planning and control staffs based on their knowledge of the manufacturing processes and shop floor status. The production planning and scheduling staffs usually estimate the flow time of each order based on products manufactured before and schedule its release to the shop floor for production. Even if the product specification is exactly the same, the status of the shop floor such as jobs in the system, shop loading and jobs in the bottleneck machine may not be identical to the previous production. As a result, due date estimated by the production planning and scheduling staffs might subject to errors.

As the advance in artificial intelligence (AI), tools in soft computing have been widely applied in manufacturing planning and scheduling problems. Ref. [2] reported that back-propagation neural networks (BPN) could be more effective than some traditional direct procedures for due date assignment since neural network can obtain a probable result even if the input data are incomplete or noisy. Using a k-nearestneighbors (KNN) based case-based reasoning (CBR) approach with dynamic feature weights and non-linear similarity functions; ref. [6] found that further performance improvement could be made. This paper constructs a case-based prediction system with the aid of a Self-Organizing Map (SOM), Genetic Algorithm (GA) and CBR, and we call it SGA-CBR in the rest of the article. The SOM is first used to classify the data, and after the classification GA is used to construct the CBR prediction method by searching the best weights combination. The rest of the paper is organized as follows: Section 2 reviews some related literatures. Section 3 briefly describes the case that will be discussed in this research. Section 4 presents the framework of the methodology applied in the flow time prediction method. Section 5 presents some experimental results of various models including other compared methods. Section 6 discusses the simulated results from these different models and then the conclusion is made. 2 Literature Review CBR is a general problem solving method with a simple and appealing definition [10] that emphasizes finding appropriate past experience to the solution of new problems. It solves problems using or adapting solutions from previous experiences. CBR is a problem-solving approach that takes advantage of the knowledge gained from previous attempts to solve a particular problem. Ref. [7] applied the CBR technique to the software estimation problem and found that CBR performs somewhat superior to regression modeling based on the same data. The successful applications of the CBR system in the prediction problem can refer to ref. [8], [12], [17], and [18]. For a CBR system, the retrieval of appropriate cases relies on a similarity metric which takes into account the distance between pairs of cases in their state space of variables, also commonly called features. Similarity measurements between pairs of features play a central role in CBR [11]. Many CBR systems represent cases using features and employ a similarity function to measure the similarities between new and prior cases [15]. A CBR system may perform ineffectively in retrieving cases when the features are irrelevant for cases matching. Therefore, to minimize the bias associated with the features, it is crucial to identify the most salient features leading to effective case retrieval. Generally, the performance of the similarity metric and the weighting of features are keys to this reasoning process [10]. In general, feature weights are used to denote the relevance of features. They allow similarity functions to emphasize features according to their relevance. Several research works attempted to determine feature weight settings with the aid of GA. Ref. [16] proposed methods for feature subset selection using genetic algorithms. Ref. [1] developed a GA-based, weighted K-NN approach to construct CBR. They suggested that the types of similarity functions, feature weights, and the indexing method could

affect the retrieval performance of CBR. To the best of our knowledge, none of the above studies considered the non-linear feature value distance between an old case and a new case. Therefore, this paper aims to investigate the effect of GA-based feature weighting together with a number of non-linear similarity functions. 3 Problem Description The basic configuration of the wafer fabrication factory is same as a real-world one located in the Science-Based Park in Hsin-Chu, Taiwan, R.O.C. There are 66 singleserver or multiple-server workstations in the shop floor. The major wafer manufacturing processes are divided into two sections, i.e., the front-end process and the back-end process. A flowchart of the basic front-end processes is described in Figure 1. The production steps are just a step-by-step process. Real floor shop manufacturing processes are more complicated with many detailed processing procedures. Photolithography Thermal Process Implantation Chemical Vapor Deposition Etching Cleaning Metrology Chemical Mechanical Polishing Physical Vapor Deposition Fig. 1. Basic Front-End Processes After the front-end processes, wafers are fed into the back-end processes. A simple flowchart of the back-end processes is also shown in Figure 2. Test Wafer Dicing Die Attach Wire Bondong Encapsulation Fig. 2. Basic Back-End Processes The time series plot of 300 flow time data is depicted in Figure 3. The pattern of the flow time is not stable in this plot. The traditional approach by human decision is very inaccurate and very prone to fail when the shop status is totally different even for the same product. This is the motive for this research to develop an approach to cut down the forecasting error based on such non-stationary situation.

5500 4500 3500 2500 1500 1 31 61 91 121 151 181 211 241 271 Fig. 3. Time Series Plot of Flow Time 4 A Hybrid System Combining SOM and GA-CBR This research first uses SOM to cluster past cases to the different groups, and the training cases in each sub-group are used to train the best weights between features by GA. In the testing process, the most similar sub-group to the new case then could be retrieved by CBR from past case. New case is compared to each case within the selected group in order to find the most similar case to get the forecasting flow time of the new case. Hopefully, the hybrid model could improve the effect of flow time forecasting. The framework of SGA-CBR can be described as figure 4. Totally 300 records of data are randomly divided into 240 records of training data and 60 records of testing data. Following briefly describes the operation process for the SGA-CBR: SOM CBR Cycle Retrieved NewCase Retrieve Case Reuse GA Reproduction / Selection Crossover Learned Case Clustered Case Base Adapted Case Mutation Replacement Stop Retain Weights Adjusting Evaluated Case Revise Fig. 4. The Framework of the research

Step 1. Classify the training data by SOM From the data collected, each new case is composed of six features: order quantities ( X 1 ), existing order qualities when the order arrived ( X 2 ), average shop workload when the order arrived ( X 3 ), average queuing length when the order arrived ( X 4 ), workstation queue when the order arrived ( X 5 ), and utilization rate of work station when the order arrived ( X 6 ). Uses these six features to be the input variables of SOM, and SOM will produce output-processing elements similar to neighboring elements, which means that the cases in the same group would have similar connection weight. Training Process Step 2. Initial weights generation Randomly generate the initial weights Step 3. Case retrieving i W j of the j th feature in sub-group i. This step would find out the most matching case from case base using similarity rule in order to predict the flow time for the new case. The similarity rule as follows: mn i i ( C C ) S = Dis,, n m (1) m n S mn is the similarity degree between case m ( C ) and n ( C ) in group i. And Dis () is the distance between two cases, Dis ( ) is compute as: i m i n Dis i i i m n ( C, C ) = W ( F F ) 2 m n f j f f (2) where i m F f means the value of the f th feature of case m. Thus, i n Dis ( C m, C ) computes the summarized weighted distance between case m and n. Step 4. Case reusing After the steps above, KNN is added to gain more matching cases to forecast the flow time of case. For example, when k = 5 in the sub-group, the forecast flow time of new case is determined by the 5 best matching cases. And the parameter k of each sub-group is generated by trail-and-error separately. Step 5. Error computing

Root of mean square error (RMSE) is adopted to be the performance measure in this research. RMSE = N ( forecasted value real value) l= 1 N 2 (3) where, N is the total number of case in the sub-group. Step 6. Weights revising by GA Uses GA approach to find the optimal weight for each feature in the sub-group. Some parameters setting of GA are list in following: Table 1. Parameters settings in GA Parameters Setting Selection Binary tournament Crossover Single point crossover Crossover rate 0.85 Mutation Swap mutation Mutation rate 0.1 Reproduction Elitism strategy Population size 30 Stopping criteria 1000 Step 7. Cases and weights retaining The best weight combination of each sub-group is retained for the further testing process. Testing Process Step 8. New testing case retrieving The same as process above, similarity rule is used to compute the similarity of cases. Step 9. New testing case reusing Find the most k similar cases of new case. Step 10. Forecasted flow time generating Forecast the flow time of new case from k similar cases.

5. Experimental Results 5.1. Data clustered by SOM The main purpose of data clustering is to reduce the effect of data noise. As mentioned previously, SOM is applied to cluster the data in this study. The cluster results diagram can be found in figure 5, which shows the results of two and three clusters for 240 data. The number of clusters might influence the forecasting result; therefore, the number of clusters will be discussed in the next sub-section. 6500 Data Classification (2 Clusters) Cluster 1 Cluster 2 Data Classification (3 Clusters) Cluster 1 Cluster 2 Cluster 3 5500 4500 3500 2500 1500 0 50 100 150 200 250 300 Index 6500 5500 4500 3500 2500 1500 0 50 100 150 200 250 300 Index Fig. 5. The two and three clustered results by using SOM 5.2. SGA-CBR with Different Clusters Forecast results under different number of cluster are shown in figure 6. By observing the figure 6, when the cluster number is increasing, the forecasting and real data will be more matched. Furthermore, according to the Mean Absolute Percentage Error (MAPE) and RMSE, figure 7 shows the performance of different number of clusters. SGACBR (2 clusters) SGACBR (3 clusters) Real data Real data 1000 Forecasting data 1000 Forecasting data 0 1 6 11 16 21 26 31 36 41 46 51 56 Testing data 0 1 6 11 16 21 26 31 36 41 46 51 56 Testing data SGACBR (4 clusters) SGACBR (5 clusters) Real data Real data 1000 Forecasting data 1000 Forecasting data 0 1 6 11 16 21 26 31 36 41 46 51 56 Testing data 0 1 6 11 16 21 26 31 36 41 46 51 56 Testing data Fig. 6. SGACBR with different number of cluster

400 0.08 300 0.06 RMSE 200 100 MAPE 0.04 0.02 0 2 3 4 5 Number of Cluster 0.00 2 3 4 5 Number of Cluster Fig. 7. The convergence chart of MAPE and RMSE from different number of cluster Table 2. Parameters settings in GA Number of Cluster 2 3 4 5 MAPE 7.44 % 5.18 % 5.01 % 4.73 % RMSE 312.5231 218.3532 218.6860 208.2776 We chose 5 clusters as the number of sub-groups in the research. As shown in Table 2, we can find that when the number of clusters is large than 3, the accuracy of forecasting will converge, and it has no obviously improvement when using large number of clusters. Therefore, further cluster number will stop to test. 5.3. Comparison with other Methodology Other forecasting methodologies are compared with SGA-CBR in this research, such as general CBR, Back-propagation neural network (BPN), GA and fuzzy rule based method (GA&WM), GA and CBR hybrid method (GA-CBR), and Fuzzy rule based SOM method (SOM&WM). The detail of these methods can refer to the previous research [3], [4], and [6]. By observing table 3, SGA-CBR proposed in this research performs superior to other methods that performed well in the previous research. The reason why SGA- CBR of this research outperforms others is because GA can fine-tune the weights. CBR is one of the famous forecasting methods while resolving this kind of forecasting problem with multiple features considering. By adopting the Euclidean distance to retrieve the similar cases, CBR is an effective and efficient method. Otherwise, in the real world, each feature may play a different important role. It means we should take different importance of each feature into consideration; thus, we use GA to search the best weights combination of features in our CBR process.

Table 3. Parameters settings in GA Methodology RMSE Improving rate CBR 538 - BPN 480 10.78% GA&WM 479 10.97% GA-CBR 391 27.32% SOM&WM 320 40.52% SGA-CBR 208 61.34% In the comparative study, the overall average RMSE of SGA-CBR is 208, the overall average RMSE of other methods can be found in table 3. Hence the results of our limited comparative studies show that the proposed SGA-CBR method produces the lowest RMSE value. 6 Conclusion The experimental results in section 5 demonstrate the effectiveness of the SGA-CBR that is superior to other effective approaches. In summary, this research has the following important contribution in the flow time prediction area and these contributions might be interested to other academic researchers and industrial engineers and managers: No matter what kind of data, some noise may influence the forecasting result a lot. In the recent research, data preprocessing seems to be more and more important. After the numerical testing of this study, data pre-clustering is a better way to increase the forecasting accuracy. As shown in table 3, the methods with SOM clustering (SOM&WM, and SGA-CBR) perform better than other method without data classifier. This research compared some well forecasting methods; RMSE was the performance measure index. SGA-CBR proposed in this research was the best one with the minimum RMSE. This research discussed how to integrate the SOM and GA-CBR approaches to construct a hybrid system of flow time prediction. It can help industrial managers to make a better project scheduling or some other forecasting matters. References 1. Brill, F.Z., Brown, D.E., Martin, W.N.: Fast Genetic Selection of Features for Neural Network Classifiers. IEEE Transactions on Neural Networks, Vol. 32. (1992) 324-328 2. Chang, P.C., Hsieh, J.C.: A Neural Networks Approach for Due Date Assignment in A Wafer Fabrication Factory. Int. J. Ind. Eng., Vol. 10. (2003) 55-61 3. Chang, P.C., Hsieh, J.C., Liao T.W.: Evolving Fuzzy Rules for Due-Date Assignment Problem in Semiconductor Manufacturing Factory. Journal of Intelligent Manufacturing, Vol. 16. 5 (2005) 549-557

4. Chang, P.C., Liao, T.W.: Combing SOM and Fuzzy Rule Base for Flow Time Prediction in Semiconductor Manufacturing Factory. Applied Soft Computing, Vol. 6. (2006) 198-206 5. Chiu C.C.: A Case-based Customer Classification Approach for Direct Marketing. Expert Systems with Applications, Vol. 22. (2002) 163-168 6. Chiu, C.C., Chang, P.C., Chiu, N.H.: A Case-based Expert Support System for Due Date Assignment in A Wafer Fabrication Factory. J. Intell. Manuf., Vol. 14. (2003) 287-296 7. Finnie, G.R., Witting, G.E.: Estimating Software Development Effort with Case-Based Reasoning. Lecture Notes in Computer Science, Vol. 1266. Springer-Verlag, Berlin Heidelberg New York (1995) 13-22 8. Jo, H., Han, I., Lee, H.: Bankruptcy Prediction Using Case-Based Reasoning, Neural Networks and Discriminant Analysis. Expert Systems and Applications, Vol. 13. (1997) 97-108. 9. Kim, K., Han, I.: Maintaining Case-based Reasoning Systems Using A Genetic Algorithms Approach. Expert Systems with Applications, Vol. 21. (2001) 139-145 10. Kim, S.H., Shin, S.W.: Identifying the Impact of Decision Variables for Nonlinear Classification Tasks. Expert Systems with Applications, Vol. 18. () 201-214 11. Kolodner, J.L.: An Introduction to Case-Based Reasoning. Artificial Intelligence Review, Vol. 6. (1992) 3-34 12. Liao, T.W., Zhang, Z.M., Mount, C.R.: A Case-Based Reasoning System for Identifying Failure Mechanisms. Engineering Applications of Artificial Intelligence, Vol. 13. () 199-213 13. Louis, S.J., Xu, Z.: Genetic Algorithms for Open Shop Scheduling and Re-Scheduling. In Proceedings of the ISCA 11th International Conference on Computers and Their Applications. (1996) 99-102 14. Ramsey, C., Grefensttete, J.: Case-based Initialization of Genetic Algorithms. Proceeding of the Fifth International Conference on Genetic Algorithms, San Mateo, California (1993) 15. Shin, K., Han, I.: Case-Based Reasoning Supported by Genetic Algorithms for Corporate Bond Rating. Expert Systems with Applications, Vol. 16. (1999) 85-95 16. Siedlecki, W., Sklansky, J.: A Note on Genetic Algorithms for Large-Scale Feature Selection. Pattern Recognition Letters, Vol. 10. (1989) 335-347 17. Watson, I., Gardingen, D.: A Distributed Cased-Based Reasoning Application for Engineering Sales Support. In Proc. 16th Int. Joint Conf. on Artificial Intelligence, Vol. 1. (1999) 600-605 18. Watson, I., Watson, H.: CAIRN: A Case-Based Document Retrieval System. In Proc. of the 3rd United Kingdom Case-Based Reasoning Wrokshop, University of Manchester, Filer, N & Watson, I (Eds). (1997)