Cluster Analysis Using Data Mining Approach to Develop CRM Methodology



Similar documents
Management Science Letters

Combining Data Mining and Group Decision Making in Retailer Segmentation Based on LRFMP Variables

Developing a model for measuring customer s loyalty and value with RFM technique and clustering algorithms

Product Recommendation Based on Customer Lifetime Value

A Hybrid Model of Data Mining and MCDM Methods for Estimating Customer Lifetime Value. Malaysia

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

Implementing A Data Mining Solution To Customer Segmentation For Decayable Products A Case Study For A Textile Firm

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

Customer Classification And Prediction Based On Data Mining Technique

Data Mining Solutions for the Business Environment

Mining changes in customer behavior in retail marketing

Examining the role of Electronic Customer Relationship Management (E-CRM) on Attracting Customers in Internet Marketing

ISSN: (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies

DATA MINING AND CUSTOMER RELATIONSHIP MANAGEMENT FOR CLIENTS SEGMENTATION

Random forest algorithm in big data environment

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM

The impact of relationship marketing on customer loyalty enhancement (Case study: Kerman Iran insurance company)

ISSN: (Online) Volume 3, Issue 7, July 2015 International Journal of Advance Research in Computer Science and Management Studies

CUSTOMER RELATIONSHIP MANAGEMENT IN B2B MARKET

Data Mining Project Report. Document Clustering. Meryem Uzun-Per

DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM

Uncertain Supply Chain Management

Using Data Mining Techniques to Increase Efficiency of Customer Relationship Management Process

PREDICTIVE DATA MINING ON WEB-BASED E-COMMERCE STORE

Dr. U. Devi Prasad Associate Professor Hyderabad Business School GITAM University, Hyderabad

EFFECTIVENESS OF CUSTOMER RELATIONSHIP MANAGEMENT PRACTICES IN STAR HOTELS IN KARNATAKA

A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning

S.Thiripura Sundari*, Dr.A.Padmapriya**

Investigating the effective factors on Customer Relationship Management capability in central department of Refah Chain Stores

Clustering Marketing Datasets with Data Mining Techniques

THE IMPLEMENTATION OF CUSTOMER RELATIONSHIP MANAGEMENT WITHIN TAIWANESE BANKS

SEM Analysis of the Impact of Knowledge Management, Total Quality Management and Innovation on Organizational Performance

Knowledge Discovery from patents using KMX Text Analytics

Research on Clustering Analysis of Big Data Yuan Yuanming 1, 2, a, Wu Chanle 1, 2

Prediction of Stock Performance Using Analytical Techniques

K-Mean Clustering Method For Analysis Customer Lifetime Value With LRFM Relationship Model In Banking Services

How to Get More Value from Your Survey Data

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Comparison of K-means and Backpropagation Data Mining Algorithms

Factorial Analysis of Customer Relationship Management In Bangladesh *

Standardization and Its Effects on K-Means Clustering Algorithm

International Journal of Advance Research in Computer Science and Management Studies

USE OF DATA MINING TO DERIVE CRM STRATEGIES OF AN AUTOMOBILE REPAIR SERVICE CENTER IN KOREA

ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION

IBM SPSS Direct Marketing 23

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM

Enhancing Education Quality Assurance Using Data Mining. Case Study: Arab International University Systems.

OPTIMAL DESIGN OF A MULTITIER REWARD SCHEME. Amir Gandomi *, Saeed Zolfaghari **

JOURNAL OF INTERNATIONAL ACADEMIC RESEARCH FOR MULTIDISCIPLINARY Impact Factor 1.393, ISSN: , Volume 2, Issue 6, July 2014

IBM SPSS Direct Marketing 22

An Analytical Study of CRM Practices in Public and Private Sector Banks in the State of Uttar Pradesh

An Empirical Study of Application of Data Mining Techniques in Library System

Hybrid approaches to product recommendation based on customer lifetime value and purchase preferences

Prediction of Heart Disease Using Naïve Bayes Algorithm

Indian Institute of Management Bangalore. Customer Relationship Management

Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH

Data Mining Applications in Higher Education

An Overview of Knowledge Discovery Database and Data mining Techniques

Towards applying Data Mining Techniques for Talent Mangement

A New Approach for Evaluation of Data Mining Techniques

Managing Customer Retention

Use of Data Mining in Banking

Data quality in Accounting Information Systems

IMPROVING THE CRM SYSTEM IN HEALTHCARE ORGANIZATION

CUSTOMER RELATIONSHIP MANAGEMENT (CRM) AND ITS IMPACT ON PERFORMANCE IMPROVEMENT OF COMPANIES IN IRAN TELECOMMUNICATIONS

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

2.1. The Notion of Customer Relationship Management (CRM)

The Relationship of E-CRM, Customer Satisfaction and Customer Loyalty. The Moderating Role of Anxiety

Product recommendations and promotions (couponing and discounts) Cross-sell and Upsell strategies

Enhancing Quality of Data using Data Mining Method

Effective Analysis and Predictive Model of Stroke Disease using Classification Methods

Hello, Goodbye. The New Spin on Customer Loyalty. From Customer Acquisition to Customer Loyalty. Definition of CRM.

The Importance of Customer Satisfaction in Relation to Customer Loyalty and Retention. Harkiranpal Singh. UCTI Working Paper WP-06-06

Social Media Mining. Data Mining Essentials

Easily Identify Your Best Customers

Chapter 2: Relationship Marketing and the Concept of Customer Value

Customer Relationship Management based on Increasing Customer Satisfaction

Customer relationship management MB-104. By Mayank Kumar Pandey Assistant Professor at Noida Institute of Engineering and Technology

RELATIONSHIP BETWEEN BUSINESS INTELLIGENCE AND ORGANIZATIONAL PERFORMANCE (CASE STUDY: FOOD INDUSTRY COMPANIES IN RASHT INDUSTRIAL CITY)

How To Solve The Kd Cup 2010 Challenge

Customer Relationship Management using Adaptive Resonance Theory

Sales and Invoice Management System with Analysis of Customer Behaviour

2.1. Data Mining for Biomedical and DNA data analysis

Ira J. Haimowitz Henry Schwarz

SCIENCE ROAD JOURNAL

Measuring Service Supply Chain Management Processes: The Application of the Q-Sort Technique

Applying CRM in Information Product Pricing

Effect of some important factors on management of customer relationship with an emphasis on comprehensive banking

Data Mining Algorithms and Techniques Research in CRM Systems

Investigating Customer Satisfaction Factors in a Customized Software Development Company

Healthcare Data Mining: Prediction Inpatient Length of Stay

Critical Success Factors for Implementing CRM Using Data Mining*

CONCEPTUAL FRAMEWORK OF CRM PROCESS IN BANKING SYSTEM

e-crm: Latest Paradigm in the world of CRM

Transcription:

Cluster Analysis Using Data Mining Approach to Develop CRM Methodology Seyed Mohammad Seyed Hosseini, Anahita Maleki, Mohammad Reza Gholamian Industrial Engineering Department, Iran University of Science and Technology PhD, PE Professor, seyedhoseini@iustacir, MSc Student, anahitamaleki@yahoocom, Associate Professor, gholamian@iustacir Abstract Data Mining (DM) methodology has a tremendous contribution for researchers to extract the hidden knowledge and information which have been inherited in the data used by researchers This study has proposed a new procedure, based on expanded RM model by including one additional parameter, joining WRM-based method to K-means algorithm applied in DM with K-optimum according to Davies-Bouldin Index, and then classifying customer product loyalty in under BB concept The developed methodology has been implemented for SAPCO Co in Iran The result shows a tremendous capability to the firm to assess his customer loyalty in marketing strategy designed by this company in comparing with random selection commonly used by most companies in Iran Keywords: CRM, BB, K-means algorithm, Customer Loyalty, RM model

Introduction In a BB environment, suppliers and/or service providers usually need to understand the nature and characteristics of their Customers As customers attraction and satisfaction are the main objective of any leading company, so the main objective of this article is to provide an effective and efficient methodology to be used for implementing the firm s objective to the best of possible This part mainly reviews the studies related to customer relationship management, customer loyalty, K-means algorithm, RM model Customer Relationship Management ١ Since the early 0s, the concept of customer relationship management in marketing, and consists of four dimensions: Customer Identification, Customer Attraction, Customer Retention and Customer Development has gained its important It is difficult to find out a totally approved definition of CRM We can describe it as a comprehensive strategy and process of acquiring, retaining and partnering with selective customers to create superior value for the company and the customer [] CRM is a comprehensive business and marketing strategy that integrates technology, process, and all business activities around the customer [, ] Brown points out that CRM as the key competitive strategy you need to stay focused on the needs of your customers and to integrate a customer-facing approach throughout your organization [4] Chatterjee also defines CRM as a discipline which focuses on automating and improving the business processes associated with managing customer relationships in the area of sales, management, customer service, and support [] According to einberg and Kadam, profits increase by -0 percent when customer retention rates increase by five points [] CRM projects often fails and about 40 percent of CRM implementations are successful [7] Customer Loyalty Creating a loyal BB customer base is not only about maintaining numbers of customer overtime, but it is creating the relationship with business customers to encourage their future purchase and level of advocacy Equipped with the knowledge of their business customers' loyalty levels, a supplier will be able to figure how their endeavors to maintain good relationships can contribute to its profit levels Some authors believe that loyal customers offer a steady stream of revenue for a company by remaining with the brand/supplier and rejecting the overtures of competitors [, ] Considering this with the nature of large purchase and transactions in a BB setting; there are gigantic rewards for those suppliers who succeed in creating and maintaining loyal customers Some Authors have proposed a number of theories to link variables that one usually finds in relationship marketing and business marketing to the loyalty construct In the BB context, evidence shows that relationship elements affect customer loyalty or example, Ricard and Perrien found that relationship practices have a direct impact on customer loyalty [0] Other Authors provide empirical evidence linking several constructs such as relationship quality, trust, involvement, satisfaction, purchase development, organizational change, and switching costs to influence BB customer loyalty and retention [, ] CRM

In some researches, customer lifetime value (CLV) or loyalty is evaluated in terms of recency, frequency, monetary variables namely the integrating rate of each cluster, that is C j =w R C R j + w C j + w M C M j, where w R, w, w M are the relative importance of the RM variables [] Other researches for classifying customer value have proposed a model based on computing the distance between the center of cluster and zero point as high value refers to most customer loyalty [4] RM Analysis RM analysis has been used in direct marketing for several decades [] This technique identifies customer behavior and represents customer behavior characteristics by three variables as follows: () Recency of the last purchase which refers to the interval between the time that the latest consuming behavior happens and present () requency of the purchases which refers to the number of transactions in a particular period () Monetary value of the purchase which refers to consumption money amount in a particular period RM analysis is utilized in many ways by practitioners; therefore, RM analysis can mean different things to different people Classic RM implementation ranks each customer on valuable parameters against all the other customers, and creates an RM score for each customer/product The first step is to sort the customer file according to how recently customers have purchased from the firm Then database divided into equal quintiles and these quintiles are assigned the numbers to Therefore, the 0% of the customers who most recently purchased from the company are assigned the number ; the next 0% are assigned the number 4, and so on The next step involves sorting the frequently, monetary Therefore, the database is divided into roughly equal groups (cells) according to recency, frequency, and monetary value Customers/products with high scores are usually the most profitable [, 7] K-means Algorithm Clustering is the process of grouping a set of physical objects into similar groups A cluster is a collection of data objects that are similar to one another within the same cluster and are dissimilar to the objects in other clusters [] K-means is one of the well-known algorithms for clustering which is very sensitive to the choice of a starting point for partitioning the items into K initial clusters We can compare the performance of different clustering methods using intraclass method when the number of fixed cluster of K value is defined as k n ( K) = / K Dist( c, c ) [, 0] n= ci Customer Loyalty Assessment Model i We briefly introduce the proposed procedure for classifying customer value based on products Recency, requency, Monetary

The proposed model is based on the use of some data mining techniques and customer lifetime value analysis to improve CRM for enterprises, based on RM attributes and K-means method for clustering product value General purpose of this study is to determine loyalty degree of product to achieve an excellent CRM which are believed to maximize profits with win-to-win situation under BB concept ig illustrates the proposed method in this study Preparing and preprocessing data Expand RM model and determining the relative weights Determining optimum K by Davies-Bouldin Index Clustering by K-means base on weighted parameter Clustering by K-means base on non weighted parameter Compare the quality of Clusters and choose the best one Compute the integrating rate of each cluster Compute Distance between centers of each cluster with zero Determine the degree Product loyalty Evaluate the result ig Proposed method The proposed procedure is divided into six steps: () selecting the dataset and preparing data; () defining recency, frequency and monetary and period of product activity as the parameters of model then determining weight of each parameter by eigenvector technique and yielding the scale of RML attributes for the purpose of clustering, then partitioning all parts of firm into 4 classes while we evaluate the optimum K based on Davies-Bouldin Index in order to use in K- means algorithm; next the parts have been clustered in two ways which are based on weightedparameters and non-weighted-parameters in order to compare the quality of clusters based on (K); () Computing the integrated rate of each cluster namely that is C j =w R C j R + w C j + j w M C M where w R, w, w M and w L are the relative importance of the RML variables; (4) Computing Distance between center of each cluster with zero point namely D; () achieving appropriate equation for assessing CLV ranking under two parameter namely D and ; () finally, splitting dataset into training data and testing data and validating data in order to evaluate the results The computed process is introduced step by step as follows: 4

Step : Preparing and preprocessing data At first, select the dataset for empirical study To preprocess the dataset to make knowledge discovery easier is needed Thus, we firstly delete the records which include missing values or inaccurate values, eliminate the redundant attributes and transform the datum into a format that will be more easily and effectively processed for clustering customer value Step : Clustering customer value by K-means algorithm The following step is to define the variables of model and the scale of R M L attributes based on Hughes then yield quantitative value of RML attributes as input attributes for clustering customer value by using K-means algorithm with K optimum according to Davies- Bouldin Index and compare the performances of these clustering methods using intraclass method The detail process of this step is expressed into these sub-steps: () Defining the R M L variables for empirical study () The R M L attributes have different weight which is obtained by filling the questionnaire and using eigenvector technique for group decision makers in firm which is computed as follow: w R =0, w = 0, w M =0 and w L =0 () Defining the scale of four R M L attributes, which are, 4,, and and refering to the customer contributions to revenue for enterprises for each product or example refers to the most customer contribution to revenue and refers to the least contribution to revenue for m parameter So we totally have ( * * * ) combinations in the proposed model (4) We can evaluate the optimum k by Davies-Bouldin Index, in order to use in K-means algorithm for the purpose of clustering () The parts have been clustered in two ways which are based on weighted-parameters and non-weighted-parameters () We can compare the quality of clusters based on (K) Step : The CLV ranking was derived to help develop more effective strategies for retaining customers and thus identify and compare market segments Clustering the integrated rate of each cluster refers to C j =w R C R j + w C j + w M C M j + w L C L j namely, where w R, w, w M and w L are the relative importance of the RML variables inally, sort the to 4 by descendant order which can be indicated the degree of loyalty Step 4: Base on Davies-Bouldin Index, let k = 4 for the purpose of clustering urthermore, so we can obtain the center of clusters as follows: C ={S,S,, S P },, C 4 ={S 4,S 4,, S 4PK } where S ij denotes the jth element of C i, P i denotes the number of elements in C i, so we obtain c, c,, and c 4 as the center of the clusters as follow: c = (v, v, v, v4 ) c 4 = (v 4, v 4, v 4, v 44 ) Then compute the distance between c i and zero point namely D inally, sort the distance D to D 4 by descendant order which can be indicated the degree of loyalty Step : when we combine ranking of D and ranking of for indicating the degree of loyalty, we can obtain an appropriate equation for customer loyalty assessment based on the degree of loyalty and number of clusters Step : evaluating the result irst evaluation is R-squared value test in order to R-squared value which is shown on equation chart based on ranking of with, D, +D values, ranking of D with, D, +D values and ranking of +D with, D, +D values Second evaluation is the accuracy rate of the generated rules with Decision Tree and Artificial Neural Network methods; compare the proposed model with the non-weighted RM model of classes on output

Empirical Study The developed methodology was implemented in SAPCO Co this company was founded in and is one of the most leading supplier companies in Iran which provided items, equipment and car accessories for several car factories in Iran through some distributor agent companies such as ISACO Co that usually provides the customer with product after-sale services Recently she received the EQM award by successfully auditing her company under Iranian National Productivity and Business Excellence Award A practical dataset of SAPCO CO for automotive industry in Iran has been collected from 00//0 to 00// The computing process using for SAPCO can be shown according to the ig () with this parameters: R: supply percent of each product for ISACO : frequency of the purchase of each product by ISACO M: monetary value of the order from ISACO site L: period of product activity for above- mentioned period of time The matrix of group decision making for determining the relative importance weights of the RML variables as follow: Table the matrix of group decision making 0 0 0 0 04 4 0 4 4 Applying the concept shown in igure (), the results has been demonstrated in table (,, 4,,, 7) and ig (,, 4, ) Table the partial data of SAPCO CO ID R M L 770 40XX K40 407 YG0 K40A 4 00 0000 000 74 4 04000 404 7440 70 070000 00 7

Table the real scaling of R--M-L attributes Scaling 4 R - 00-4 - 7-4 0 - -4-- 0-- 7-- 4-- -- M 7000-4400 44-00 0400-47000 770-040 000-00 L -7-4- - Davies- Bouldin Index 000 0000 000 000 7000 000 000 4000 000 0 4 7 0 4 7 0 4 7 40 k ig Determine optimum k Table 4 Determine optimum k Method Clustering with non weighted parameter Clustering with weighted parameter (4) 004 0 Table the cluster result by K-means with 4 classes on output base on D,, +D Cluster No Member R M L D i D i Rank C j Rank + D i + D i Rank 4 07 4 44 7 4 0 000 7 4 4 4 444 4 0 00 000 00 400 7 00 7 4 4 0 07 4 0 70 0 7

y = -04x + 4 R = 04 y = -0x + 7 0 R = 04 4 0 y = -004x + 40 R = 04 7 7 7 D +D Linear (+D) Linear (D) Linear () ig Ranking of with, D, +D 4 0 4 0 y = -07x + 404 R = 044 y = -04x + R = 0 y = -00x + 47 R = 0 7 7 7 D +D Linear (+D) Linear (D) Linear () ig 4 Ranking of D with, D, +D y = -04x + 40 4 R = 0 y = -0x + 0 R = 07 4 0 y = -004x + 477 R = 07 7 7 7 D +D Linear (+D) Linear (D) Linear () So the result for validation: ig Ranking of +D with, D, +D Table irst evaluation D +D R / 0 0 0 R /D 0 0 ٠٩٤ R /+D 0 0 0 Mean 00 0 0

Table 7 Second evaluation Method Proposed Model Base Model Artificial Neural Network 0 0 Decision Tree 7 Analysis and Conclusion After assessing CLV ranking under two parameters namely D and, the results indicate that when we combine these two parameters, we can obtain a better degree of loyalty which has been confirmed with R-squared value test and accuracy rate of the generated rules with Decision Tree and Artificial Neural Network methods, also we can obtain customer loyalty assessment function which has been shown in ig Customer Loyalty Assessment unction +D value 0 y = -0004x - 007x + R = 04 4 7 0 4 7 0 4 7 0 4 +D degree ig Customer Loyalty Assessment unction The application of proposed model is implemented in SAPCO, one of the leading car manufacturing supplying companies in Iran The application of clustering and classifying procedure used for the above-mentioned company indicates that out of 0% of total sale is classified into 40% of the target parts as shown in ig 7 ig 7 the result of the application of proposed model in SAPCO

The proposed procedure has shown that for the purpose of clustering, when we combine the expanded WRM model into K-means algorithm, we can see a tremendous improvement in classifying accuracy in order to reach to an excellent CRM As the distance and the integrated rate of each cluster commonly used by many researchers as a separate and independent parameters, in this study the combination of these two parameters has been considered in clustering and classification analysis The result of statistical test for model validation has shown that the developed methodology for CRM has an acceptable result with a high level of confidence in comparing with other commonly used models by researchers The proposed CRM methodology can be used by industries as well as service sectors in evaluating their customers loyalty in a most efficient and effective manners 0

References Parvatiyar, A and Sheth, J, Conceptual framework of customer relationship management, in Sheth, JN, Parvatiyar, A and Shainesh, G (Eds), Customer Relationship Management: Emerging Concepts, Tools and Applications, th ed, Tata McGraw-Hill Publishing, New Delhi, 004 Anton, J, Customer Relationship Management, Prentice-Hall, New York, NY, Anton, J and Hoeck, M, E-business Customer Service, The Anton Press, Santa Monica, CA, 00 4 Handen, L Brown, The three Ws of technology, Customer Relationship Management: A Strategic Imperative in the World of E-business, Wiley, Toronto, pp -, 000 Chatterjee, J, Managing customer relationships in the e-business economy, Journal of Scientific and Industrial Research, August-September, pp 74-, 000 einberg, R and Kadam, R, E-CRM web service attributes as determinants of customer satisfaction with retail web sites, International Journal of Service Industry Management, Vol No, pp 4-, 00 7 einberg, R and Trotter, M, Immaculate deception: the unintended negative effects of the CRM revolution: maybe we would be better off without customer relations management, in Renner, DH (Ed), Defying the Limits, Montgomery Research, San ranscisco, CA, 00 Lam SY, Shankar V, Erramilli MK, Murthy B Customer value, satisfaction, loyalty, and switching costs: an illustration from a business-to-business service context, Vol No, pp, 004 Reichheld, Teal T The Loyalty Effect: The Hidden orce behind Growth, Profits, and Lasting Value Trans Boston, Mass: Harvard Business School Press; 0 Ricard L, Perrien J Explaining and evaluating the implementation of organizational relationship marketing in the banking industry: clients' perception J Bus Res; 4, pp 0, Chow S, Holden R Toward an understanding of loyalty: the moderating role of trust J Manage Issue; Vol No, pp 7, 7 Eriksson K, Vaghult LA Customer retention, purchasing behavior and relationship substance in professional services Ind Mark Manage; Vol : 7, 000 Duen-Ren Liu, Ya-Yueh Shih, Integrating AHP and data mining for product recommendation based on customer lifetime value, Information & Management, Vol 4, pp 7 400, 00 4 Ching-Hsue Cheng, You-Shyang Chen, Classifying the segmentation of customer value via RM model and RS theory, Expert Systems with Applications, 00 Baier M, Ruf KM, Chakraborty G Contemporary database marketing: concepts and applications Evanston: Racom Communications; 00 Hughes, A M Strategic database marketing Chicago: Probus Publishing Company, 4 7 Stone, B Successful direct marketing methods Lincolnwood, IL: NTC Business Books pp 7 7, Han, J, & Kamber, M Data mining: Concepts and techniques San rancisco: Morgan Kaufmann Publishers, 00 HWShina, SYSohnb, Segmentation of stock trading customers according to potential value, Expert Systems with Applications, pp 7-, 004 0 Michaud, P Clustering techniques uture Generation Computer System, Vol No pp 47, 7