APPLYING DATA MINING METHODS TO PREDICT DEFECTS ON STEEL SURFACE



Similar documents
Material Expansion Coefficients

MATERIAL SAFETY DATA SHEET SECTION 1 CHEMICAL PRODUCT AND COMPANY INFORMATION SECTION 2 COMPOSITION/INFORMATION ON INGREDIENTS

2015 Investor Day Summary Investor Presentation June 25, 2015

High-Strength Low-Alloy Steels

Multicrystalline solar silicon production for development of photovoltaic industry

Appendice Caratteristiche Dettagliate dei Materiali Utilizzati

EXTRACTION OF METALS

Influence of Steel Scrap on Microstructure and Mechanical Properties of Spheroidal Graphite Cast Iron

Chapter 8 Atomic Electronic Configurations and Periodicity

All answers must use the correct number of significant figures, and must show units!

Name Date Class CHAPTER 1 REVIEW. Answer the following questions in the space provided.

ME 612 Metal Forming and Theory of Plasticity. 1. Introduction

Determining Average Atomic Mass

Application of Web-Based Bar Fatigue Database. Database Development

Jefferies 2014 Global Industrials Conference Dr. Heinz Schimmelbusch CEO August 12, 2014

Standard Specification for Stainless Steel Bars and Shapes 1

S355 European Standard Steel

B) atomic number C) both the solid and the liquid phase D) Au C) Sn, Si, C A) metal C) O, S, Se C) In D) tin D) methane D) bismuth B) Group 2 metal

X15TN TM. A high hardness, corrosion and fatigue resistance martensitic grade CONTINUOUS INNOVATION RESEARCH SERVICE.

Chemistry CP Unit 2 Atomic Structure and Electron Configuration. Learning Targets (Your exam at the end of Unit 2 will assess the following:)

ASTM A860/A860M-09 Standard Specification for Wrought High Strength. Ferritic Steel Butt Welding Fittings. 1. Scope :- 2. Reference Documents :-

Standard Specification for Chromium and Chromium-Nickel Stainless Steel Plate, Sheet, and Strip for Pressure Vessels and for General Applications 1

TEST REPORT Job No./Report No TR Date:12 June 2014 Page 1 of 5

APPENDIX B: EXERCISES

Data Mining Algorithms Part 1. Dejan Sarka

PART 6 SPECIAL CONTRACT RULES FOR METALS CONTENTS

Texas Commission on Environmental Quality

Standard Specification for Tool Steel High Speed 1

Standard Specification for Seamless Ferritic and Austenitic Alloy-Steel Boiler, Superheater, and Heat-Exchanger Tubes 1

MATERIAL SAFETY DATA SHEET SECTION 1 CHEMICAL PRODUCT AND COMPANY INFORMATION SECTION 2 COMPOSITION/INFORMATION ON INGREDIENTS

How to read your Oil Analysis Report

3. What would you predict for the intensity and binding energy for the 3p orbital for that of sulfur?

Martin RICHEZ. Franck ZANONCELLI

MOLES AND MOLE CALCULATIONS

Rely on Our Material Competence for Your Demanding Industrial Processes

Moles, Molecules, and Grams Worksheet Answer Key

University of Arizona Bulletin

Naming Compounds Handout Key

47374_04_p25-32.qxd 2/9/07 7:50 AM Page Atoms and Elements

UNIT (2) ATOMS AND ELEMENTS

A Comparison Between Data Mining Prediction Algorithms for Fault Detection (Case study: Ahanpishegan co.)

Chapter Outline. 3 Elements and Compounds. Elements and Atoms. Elements. Elements. Elements 9/4/2013

Welding and Corrosion Performance of INCO-WELD 686CPT Filler Metal In Waste-To-Energy Power Plants

Theory of Heating by Induction

SUS. Company Profile. Ulrich Nell, Feldstr.23, D Oberhausen, Tel. 0049(0)208/ Fax 0049(0)208/658536

North American Stainless

How To Write A Recipe Card

The ground water region in the area of the proposed Project site is made-up of sedimentary rocks (Figure 5.6-1)

Periodic Table Bingo

Chapter 2 Lecture Notes: Atoms

ICP - Mono Element ppm Standard Solutions (Plasma HIQU)

VALIDATION REPORT. Investigation of the major interferences for proposed Horizontal ICP AES method. ECN protocol

MATERIAL SPECIFICATIONS CATALOGUE

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Wipe Analysis to Determine Metal Contamination on Critical Surfaces

THE ROLE OF METALLURGY IN ENHANCING BENEFICIATION IN THE SOUTH AFRICAN MINING INDUSTRY

Additional Terms And Conditions Taiwan Air Force (TAF) E-2T Aircraft Retrofit of TW 1-4 (Prime Contract No. N C-0040)

AUSTENITIC STAINLESS DAMASCENE STEEL

Steel Reinforcing Bars for the Reinforcement of Concrete

DIN STEEL PIPES FOR PIPE LINES FOR THE TRANSPORT OF COMBUSTIBLE FLUIDS AND GASES

XIX. Chemistry, High School

MODERN ATOMIC THEORY AND THE PERIODIC TABLE

Sustainable energy products Simulation based design for recycling

Durcomet 100 CD4MCuN. Bulletin A/7l

Determination of Metals in a 3% Sodium Chloride (NaCl) Matrix by Axially-Viewed ICP-OES

USED LUBE OIL ANALYSIS & ANALYTICAL FERROGRAPHY

Chemical Proportions in Compounds

Urea and Nitric Acid Plants: Improvement of Shut-Down, Revamping, Debottlenecking and Refurbishment

Sputtering Targets for Microelectronics. Sputtering Targets for Semiconductor Applications

Alloys & Their Phase Diagrams

Module 3 Machinability. Version 2 ME, IIT Kharagpur

Forward-Looking Statements

CLASS TEST GRADE 11. PHYSICAL SCIENCES: CHEMISTRY Test 6: Chemical change

Investor Presentation May 2014

CATALOGUE REFERENCE MATERIALS

Social Media Mining. Data Mining Essentials

Chapter 3 Mass Relationships in Chemical Reactions

M/s. D.S. Alloyd Pvt. Ltd.

CHAPTER Naming Ions. Chemical Names and Formulas. Naming Transition Metals. Ions of Transition Metals. Ions of Transition Metals

WEIGHTS AND MASS STANDARDS

TECHICAL SPECIFICATION Conductive Elastomer Extruded Gaskets

SOFIA UNIVERSITY ST. KLIMENT OHRIDSKI Faculty:...Chemistry and Pharmacy... Subject area: (code and name) C H L

The mechanical properties of metal affected by heat treatment are:

Grade 8 FCAT 2.0 Science Sample Questions

Impact of utilised bottom ash

Chalcophile and Key Element Distribution in the Eastern Goldfields: seismic traverse EGF01. Aleks Kalinowski Geoscience Australia, pmdcrc Y2 project

Customer Classification And Prediction Based On Data Mining Technique

THREE MAIN SOLIDIFICATION REACTIONS OF VANADIUM MODIFIED T1 TUNGSTEN HIGH SPEED TOOL STEEL. Hossam Halfa

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

Stainless steel grade chart

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

University of Pittsburgh Safety Manual Subject: COMBUSTIBLE METALS. EH&S Guideline Number: Effective Date 09/10/13.

Contents Nickel Alloys... 2 Copper Alloys Titanium Alloys Stainless Steel Miscellaneous Welding Material Contact...

SAFETY DATA SHEET BW 001 Solid Carbon steel wire electrodes & rods

Austenitic Stainless Steels

MINING CHAINS IN HIGH EFFICIENCY LONGWALL SYSTEMS. 1. Introduction. Maksymilian Klank* AGH Journal of Mining and Geoengineering vol. 36 No.

Evaluation of the Susceptibility of Simulated Welds In HSLA-100 and HY-100 Steels to Hydrogen Induced Cracking

Web Mining as a Tool for Understanding Online Learning

Transcription:

APPLYING DATA MINING METHODS TO PREDICT DEFECTS ON STEEL SURFACE 1 SAYED MEHRAN SHARAFI, 2 HAMID REZA ESMAEILY 1 Computer Engineering Department, Islamic Azad University, Najaf Abad Branch, Esfahan, Iran 2 Computer Engineering Department, Islamic Azad University, Najaf Abad Branch, Esfahan, Iran ABSTRACT In the steel industry, especially alloy steel, creating different defected product can impose a high cost for steel producers. One common defect in producing low carbon steel grades is Pits & Blister defect. To eliminate this drawback, we need to grind the surface of the product. In some cases, the severity of defects may lead to scrap part of the product. Grinding cause waste of time and cost of production will be increased. Incidence of defects is related to several factors including material analysis and production processes. In this study we want to create a model to predict this fault with data mining methods including decision tree, neural network and association rules. We will compare the efficiency and accuracy of these models and select the appropriate model. In this study, methodology used to perform data mining is CRISP (Cross Industry Standard Process for Data Mining). To create a decision tree and neural network respectively, entropy method and 24 hidden nodes are used. And to discover association rules, four members itemset is used. And applying data mining on data received from Iran Alloy Steel Company, the model created using the decision tree, has higher accuracy. Keywords: Data mining, neural network, Decision tree, Association rules, Alloy steel, Pits & Blister defect. 1. INTRODUCTION Data mining is an interdisciplinary science. It uses different methods including statistics, pattern recognition and machine learning to extract knowledge from mass of data [1]-[2]. In this study, predict Pits & Blister defect has been selected as a target. This type of surface defect is observed after rolling and it should be removed from the product with milling, or if not, it becomes scrap. This description is determined to reduce the incidence of this defect. It will cause reducing cost and time. The origin of defects has been a topic of interest for many research projects, but it is still hard to find literature on the modeling of this defect. No physical model for this defect formation has been formulated so far. The problem for modeling is due to high dimensional variable group with their interactions. In this study data mining techniques has been used to find the causes of this defect. Since data mining methods is based on data, selecting the correct data increase accuracy of models created by these methods [3]. Hence, we studied the production process and defected products with Pits & Blister in Iran Alloy Steel Company. Chemical analysis and casting parameters were identified as factors in the rise of this defect. And these data were selected as input to data mining algorithms. Since we had a higher number of chemical analysis instances rather than casting parameters instances, first we applied only chemical analysis data to the algorithms and after that the composition of chemical analysis and casting parameters were applied to the algorithms. For predicting this defect we created models using decision tree, neural network and discovery association rules. Several reasons for this defect are mentioned in the literature, including the effects of aluminum and nitrogen elements. 87

The result of our research not only confirms the effects of aluminum in the defect but also identifies other elements as effective elements. 2. DATA GATHERING At the beginning of the research, we held several meetings with experts to review and guess factors that cause this defect. The effective parameters in continuous casting and melting analysis known as factors in this defect. And since most pit & blister defects occur in low carbon steels, occurrence of this defect was evaluated in these steel grades. Data used in this research belong to Iran Alloy Steel Company. This data was available in Progress database on the VAX machine. We had two different datasets for our research. First dataset includes only chemical analysis. It contains 2300 instances, and is related to the products from 2003 to 2009. Second dataset includes chemical analysis and effective parameters of casting. It contains 350 instances, and is related to the products from 2009. For data preparation, we either repaired instances with lost attributes or just deleted them. For repairing the instances we used most frequently values to fill the lost attribute. For example, to replace the amounts of silicon attribute, the most frequently value had replaced. Class attribute in this collection has two values. 523 indicates Pits & Blister defect and 0 indicates lack of defects. Data mining operation was conducted in two stages. First phase was conducted only on melting analysis (dataset1) and second phase was done on the analysis of melting and effective parameters of casting (dataset2). The set of melting analysis attributes is shown in Tables (1) and effective parameters in continuous casting attribute is shown in Tables (2). Table 1.Attribiutes of melting analysis Attribute Ni Cr S P Mn Si C Description Nickel Chromium Sulfur Phosphorus Manganese Silicon Carbon Continue Table 1. Attribute Sn V W Cu Co Ti Al Description Tin Vanadium Tungsten Copper Cobalt Titanium Aluminum Continue Table 1. Attribute As B Ca Ce Nb Sb Mo Description Arsenic Blond Calcium Cerium Niobium Antimony Molybdenum Table 2.Attribiutes of casting parameters Attribute Description Param_Date Sampling time Cast_Speed Casting speed Cool_Flow1 Water flow rate(zone 1) Cool_Flow2 Water flow rate(zone 2) Cool_Flow3 Water flow rate(zone 3) Bloom_Temp Bloom temperature under straightening and withdrawal units Air_press1 Air pressure(zone 1) Air_press2 Air pressure(zone 2) Air_press3 Air pressure(zone 3) Water_press1 water pressure(zone 1) Water_press2 water pressure(zone 2) Water_press3 water pressure(zone 3) Mould_Water_Temp Mould water temperature Secondary_Water_Temp Secondary water temperature Mould_Oscil_Freq Mould frequency 3. APPLYING METHODS 88

For applying decision tree, neural network and association rules, Microsoft Sql Server 2008 is used for modeling [4]. To perform data mining, data set was divided to three categories with different levels of carbon. This division is shown in Table (3). Table 3.Records categories Description Name Carbon<0.15% product with very low carbon 0.15%<carbon<0.25% product with low carbon Consist Molybdenum product with Molybdenum To create a model, data randomly divided into two parts, namely training and testing datasets. Training dataset includes 70 percent of our data and testing dataset includes 30 percent of our data. In training stage, we used decision tree with entropy settings to compute the weight of nodes for break points [5]. And we used 24 hidden nodes for neural network [6]. Also we used quaternary itemset for discovery association rules. According to Figure (1), model of association rules for low carbon products (0.15% to 0.25% carbon) has 64 percent accuracy and is more accurate than the models created with decision tree and neural network. Figure 1. A comparison between algorithms for low carbon products (0.15% to 0.25% carbon) According to Figure (2), the model created with decision tree for very low carbon products (0.0% to 0.15% carbon) has 83 percent accuracy and is more accurate than the models created with association rules and neural network. 89

Figure 2. A comparison between algorithms for very low carbon products (0% to 0.15% carbon) According to Figure (3), model of neural models created with association rules and decision network for product with Molybdenum element has tree. 76 percent accuracy and is more accurate than the Figure 3. A comparison between algorithms for products with Molybdenum Comparing figures 1, 2 and 3 shows that the decision tree model, resulting from data mining on very low carbon has the highest accuracy and it is appropriate. Figure (4) shows the resulted tree after applying decision tree method on very low carbon data. 90

Figure 4. Tree resulting from the decision tree method for very low carbon products No defect product with gray color and defect product with black color were shown. And as it is clear, the aluminum element in the creation of this defect is very impressive and products with aluminum less than 0.004 do not have defect. Also vanadium, tin and titanium elements are effective in creation or prevention of this defect. In the next step data mining algorithms applied on data sets including analysis of melting and casting parameters. Accuracy of created models in this step was about 50 percent, so unacceptable. 4. CONCLUSIONS An effort to produce high quality steels in the steel industry continues. One way to achieve quality products is using the knowledge and experience from the past. And since usually high level of automation are used in steel production, after some time mass data is collected. Data mining algorithms can discover very useful knowledge and good patterns from the data. The non-uniform data and the complexity of the underlying process made the data synchronization a challenging task, which was successfully accomplished before the data analysis. Preprocessing continued with the selection of the most meaningful variables, and the number of variables was reduced from 186 to 36 at the modeling phase. The study also tried using these algorithms to reduce product defects with Pits & Blister defect. Applying different inputs on data mining algorithms can realize the fact that only chemical analysis has better results and especially very low carbon steel model gave us a better answer. And also confirmed the influence of Aluminum element as that is mentioned in various articles [7]. In addition, the effect of titanium in the prevention of this defect was revealed. Using this model, the expected benefits are the following: Reduce the process time due to omitting grinding step. Reduce energy costs. There is an optimal tool for predicting defect and also considering that for producing products with no defect. 91

Acknowledgements The authors would like to express their gratitude to Mr Jafarzadeh, Mr Safarzadeh and Mrs Jafari at Iran Alloy Steel Company for their expert knowledge support. REFRENCES: [1] D.Olson;D.Delen; Advanced Data Mining Techniques,Springer,2008. [2] J.Han;M.Kamber; Data Mining: concept and Techniques, Second Edition, Elsevier Inc, 2006. [3] D.Hand;H.Mannila;P.Smyth; Principles of data mining, The MIT Press. 2001. [4] J.MacLennan; B.Crivat; Data Mining With Microsoft SqlServer 2008, Wiley Publishing, 2009. [5] P.Tan;M.Steinbach;V.Kumar; Introduction to Data Mining, Addison Wesley,2005 [6] M.Bramer; Principle of data mining, Springer, 2007. [7]. Z.Ma,P.Valentine,A.Heinen, Improvement of castability and quality of continuously cast steel, ISIJ International,vol.40,No.1,pp.31-39,2000. 92