School of athematcs and Systems Engneerng Reports from SI - Rapporter från SI Data nng Analyss and odelng for arketng Based on Attrbutes of Customer Relatonshp Xaoshan Du Sep 2006 SI Report 06129 Väö Unversty ISSN 1650-2647 SE-351 95 VÄXJÖ ISRN VXU/SI/DA/E/--06129/--SE
Data nng Analyss and odelng for arketng Based on Attrbutes of Customer Relatonshp Xaoshan Du School of athematcs and Systems Engneerng, Väö Unversty SE-351 95 Väö, Sweden Supervsor: Joakm Nvre
Abstract. Wth the rapd growng marketng busness, Data nng technology s playng a more and more mportant role n the demands of analyzng and utlzng the large scale nformaton gathered from customers. To predct the consequent busness strategy by usng Data nng, the Customer Relatonshp anagement CR nowadays s requred to evaluate the customer performance, dscover the trends or patterns n customer behavor, and understand the factual value of ther customers to ther company. In ths paper, we present an effectve model to apply Data nng to the CR problem of categorzng the customers n marketng to search for potental clents based on ther propertes by 1 computng Dstance n Cluster Analyss and Lft n Assocaton Rules accordng to the Attrbutes of Customer Relatonshp ACR ncludng Self-Relance Inde, Impact Inde and atr for customer value, and 2 n the Data nng modelng theory, constructng the Regresson odel n the ACR and mplementng the correspondng algorthm to mne the most proftable customer group. - 2 -
1 Introducton - 5-1.1 Background and motvaton... - 5-1.2 roblem State... - 6-1.3 Thess Outlne... - 6-2 Data nng ethods and odels - 7-2.1 Data nng rocess... - 7-2.1.1 Data reparaton... - 7-2.1.2 Knowledge dscovery n database... - 8-2.1.3 odel Eplan and Estmate... - 9-2.2 Categorzaton of Data nng ethods... - 9-2.2.1 Categorzaton Based on nng Tasks... - 9-2.2.2 Categorzaton Based on nng Obects... - 10-2.2.3 Categorzaton Based on nng Technques... - 10-2.3 Analyss and odelng for Data nng... - 12-2.3.1 Fundamentals of odel... - 12-2.3.2 Structures of redctve model... - 13-2.3.3 Lnear Regresson odel... - 13-2.3.4 redctve model for Classfcaton... - 15-2.3.5 Stochastc arts of Data nng odel... - 15-2.3.6 Summary... - 16-2.4 Data nng n arketng... - 17-2.4.1 Applcaton of Data nng n arketng... - 17-2.5 Applcaton of Data nng n CR... - 19-2.5.1 Introducton of CR... - 19-2.5.2 Concept of acr... - 20-2.6 Summary... - 22-3 odelng Based on Attrbutes of Customer Relatonshp ACR- 23-3.1 roblem statement... - 23-3.1.1 Crteron of Customer Value... - 23-3.1.2 Dscusson based on Customer Classfcaton... - 24-3.2 Segmentaton of Customer Value... - 24-3.3 Concept of Attrbutes of Customer Relatonshp ARC... - 26-3.4 Dssmlarty n Cluster Algorthm... - 28-3.5 Lft n Assocaton Rules... - 29-3.6 Search Reference ethod for Network Relaton... - 31-4 Implementaton of Data nng odel - 32 - - 3 -
4.1 Symbolc System... - 32-4.2 Estmate of urchase robablty... - 32-4.3 Evaluate Customer Value... - 35-5 Eperments - 37-5.1 Epermental Setup... - 37-5.1.1 Data Set... - 37-5.1.2 an Functons Name and Descrpton... - 39-5.1.3 nng Algorthm... - 40-5.2 The evaluated results... - 40-5.3 Dscusson... - 41-6 Concluson and Future Works - 43-7 Acknowledgement - 44-8 References - 45 - - 4 -
1 Introducton From ths Chapter you wll get a man dea of ths thess, nclude ts background, why we choose topc, and whch problems wll be solved n ths thess, as well as how can we solved them. 1.1 Background and motvaton Tradtonal Large-scale sales pattern s the most famlar sales pattern for companes. Based on ths atten, companes usually am at ther produces, products and then gve all the customers same sales promoton. However, ths knd of sales promotons neglects the dfferences among customers. In most cases, these promotons cost a lot, but only get few real profts from customers. That means many promotons are waste. In the meanwhle, data mnng technologes become more and more popular n commercal terran, such as n bankng ndustry, nsurance ndustry and retal trade. Data mnng can solve many typcal commercal problems, such as Database arketng, Customer Segmentaton and Classfcaton, rofle Analyss, Cross-sellng, Churn Analyss, Credt Scorng, Fraud Detecton, and so on. Snce those data mnng technologes appeared, companes have changed ther sales target from products to customers. How to classfy customers? How to fnd out the common character of customers from database? How to dg up the potental customers? How to fnd out the most valuable customers? These knds of questons become the most popular data mnng applcatons n marketng. Nevertheless, the recent customer relaton analyses have some serous drawbacks. The most mportant one s that based on those analyses company usually consder the customer as an solated obect and havng value only when he/she deals wth ths company. Neglect the network value of each customer and the value from potental purchase probablty. In ths paper, whch s based on the applcaton of data mnng n marketng and recent research result of Customer Relatonshp anagement, we would lke to try to use new vsual angle to mprove these drawbacks. - 5 -
1.2 roblem State In ths paper, two man problems wll be solved. The frst one s how to generally classfy the customers by ther value? The second one s how data mnng technques can be used to estmate the value of a customer gven a database contanng nformaton about hs/her name, age, professon, etc? Ths s the most mportant one. In ths paper the value of a customer should be consdered as how much proft he/she can brng to the company. Ths value should be calculated as a numercal scale. It can be prmarly defned lke ths: V= rm - rn C. Where r s the proft brought from a specfc produce when customer buys t. m s the customer s purchase probablty when there s a sales promoton, whle n s the customer s purchase probablty when there s no sales promoton. C s the cost of the sales promoton. To get the value of a customer V, the customer s purchase probablty must be calculated at frst. In ths paper, Attrbute of Customers Relaton ACR wll be defned to represent the probablty At last, the new model and algorthms should be epermented and the result should be evaluated. 1.3 Thess Outlne The frst Chapter s an ntroducton of the paper, ncludng background, motvaton and thess outlne. In the second Chapter, we are gong to ntroduce some related data mnng methods and models. Furthermore, gve some ntroductons of data mnng n marketng. Gve a short ntroducton of CR, ncludng ts categorzaton and applcatons of data mnng n CR. In thrd Chapter, we would lke to focus on modelng based on ACR. We are gong to defne some new concepts and algorthms n ths Chapter, n order to get ready for the followng chapter as well. In the fourth Chapter, the mplementaton of our data mnng model wll be defned. After that, the ffth Chapter wll gve out the general dea of eperments. In addton, we put concluson and future works, acknowledgement, and references at the end of ths paper. - 6 -
2 Data nng ethods and odels Frst of all, we d lke to gve out the defnton of Data nng. Data nng s the process of dentfyng hdden patterns and relatonshps wthn data. [11] In another word, Data nng s the process fndng hdden nformaton n a database. [12] From the defnton of data mnng, we learn that t s a knd of technologes that can help us know the useful thngs hdden n the data. Therefore, data mnng should be an nterestng work. 2.1 Data nng rocess There are three man steps of data mnng process. 2.1.1 Data reparaton In the whole data mnng process, data preparaton s somehow a sgnfcant process. Some book says that f data mnng s consdered as a process then? Data preparaton s at the heart of ths process. However, nowadays databases are hghly susceptble to nose, mssng and nconsstent data. So preprocessng data mprove the effcency and ease of the data mnng process, ths becomes an mportant problem. Several consultng frms, such as IB, have approved that data preparaton costs 50%~ 80% resource of the whole data mnng process. From ths vew, we really need to pay attenton to data preparaton. There are three data preprocessng technques should be consdered n data mnng: 1 Data cleanng a Inconsstent data: Not all the data we get s clean. For eample, a lst of Natonalty may have the values of Chna,.R.Chna, and anland Chna. These values refer to the same country, but are not known by the computer. Therefore, ths s a consstency problem. b ssng values Data from a company s database often contans mssng values. Sometmes the approaches requre rows of data to be complete n order to mne them, but the database may contan several attrbutes - 7 -
wth mssng values. If too many values are mssng n a data set, t becomes hard to gather useful nformaton from ths data. c Nosy data Nose s a random error or varance n a measured varable. 2 Data ntegraton Usually the data analyss task wll nvolve data ntegraton. It combnes data from multplyng sources nto a coherent data store. Those sources nclude multple database or flat fles. Several ssues should be consdered durng data ntegraton, such as schema ntegraton, correlaton analyss for detectng redundancy, and detecton and resoluton of data value conflcts. Careful ntegraton of the data can help mprove the accuracy and speed of the mnng process. 3 Data reducton If you select data from a data warehouse, you probably fnd the data set s huge. Data reducton technques can be appled to obtan a reduced representaton of the data set. nng on reduced data set should be more effcent yet produce the same analytcal results. It ncludes several strateges, such as data cube aggregaton, dmenson reducton, data compresson, numerosty reducton, and dscretzaton and concept herarchy generaton. 2.1.2 Knowledge dscovery n database As a core data mnng technques, knowledge and nformaton dscovery has several man components: 1 Determne the type of data mnng tasks We must confrm that the functons and tasks to be acheved by recent system belong to whch knd of classfcaton or clusterng. 2 Choose sutable technologes for data mnng We can choose the approprate data mnng technologes based on the tasks we have confrmed. Such as, classfcaton model often use learnng neural network or decson tree to realze; whle clusterng usually use clusterng analyss algorthms to realze; assocaton rules often use assocaton and sequence dscovery to realze. 3 Choose the algorthms Based on the technologes have been chosen, we can select a specfc algorthm. Furthermore, a new effcent algorthm can be desgned by the specfc mnng tasks. To choce data mnng algorthms, we should determne the hdden pattern n selectng data. - 8 -
4 nng data We are supposed to use the selected algorthms or algorthms portfolo to do repeated and teratve searchng. Etract the hdden and nnovatve patterns from data set. 2.1.3 odel Eplan and Estmate Eplan and estmate the patterns got from data mnng, get the useful knowledge. For nstance, remove some rrespectve and redundant patterns, after fltraton the nformaton should be presented to customers; Use vsualzaton technology to epress the meanngful model, n order to translate t nto understandable language for users. A good applcaton of data mnng can change prmal data to more compact and easly understand form and ths form can be defned defntely. It also ncludes solvng the potental conflct between mnng results and prevous knowledge, and usng statstcal methods to evaluate the current model, n order to decde whether t s necessary to repeat the prevous work to get the best and sutable model. The nformaton acheved by data mnng can be used later to eplan current or hstorcal phenomenon, predct the future, and help decson-makers make polcy from the ested facts. 2.2 Categorzaton of Data nng ethods There are several data mnng methods, and there are some dfferent ways to classfy them as well. 2.2.1 Categorzaton Based on nng Tasks Based on the dfferent mnng tasks, we can categorze date mnng methods as classfcaton, clusterng, regresson, assocaton rules, sequence dscovery, predcton, and so on. [13] 1 Classfcaton Classfcaton maps data nto predefned group or classes. Because the classes are determned before eamnng the data, classfcaton s often consdered as supervsed learnng. Classfcaton algorthms requre that the classes be defned based on data attrbute values. They often descrbe these classes by lookng at the characterstcs of data whch are already known to belong to the classes. 2 Clusterng - 9 -
Clusterng s smlar to classfcaton; the dfference s the groups are not predefned. It s alternatvely referred to as unsupervsed learnng. It s usually acheved by determnng the smlarty among the predefned attrbutes of the data. The most smlar data are grouped nto clusters. 3 Regresson Regresson s used to map a data tem to a real valued predcton varable. Regresson assumes that the target data ft nto some known type of functons and then determnes the best functon of ths type. A smple eample of regresson s the standard lnear regresson. 4 Assocaton rules Assocaton rules alternatvely referred to as affnty analyss. An assocaton rule s a model that dentfes specfc types of data assocatons. They are usually used n the retal sales communty to dentfy tems whch are often purchased together. 5 Sequence dscovery It s used to determne sequental patterns n data. Those patterns are based on a tme sequence of actons. The relatonshp of those patterns s based on tme, and they are smlar to assocatons. 6 redcton Based on past and current data, many real-world data mnng applcatons can be consdered as predctng future data states. redcton s vewed as a type of classfcaton. The dfference s that predcton s predctng a future state rather than a current state. redcton applcatons nclude floodng, speech recognton, machne leerng, and pattern recognton. 2.2.2 Categorzaton Based on nng Obects If we categorze based on mnng obects, the data mnng methods can be dvded nto based on Relatonal Database, Obect Orented Database, Spatal Database, Tet Data Sources, Temporal Database, ultmeda Database, Heterogeneous Database, and Web source. 2.2.3 Categorzaton Based on nng Technques There are many dfferent technques used to acheve D tasks, so we can prmarly dvded D methods nto achne learnng methods, Statstcal - 10 -
methods, and Neural Networks methods. And then subdvson them as follows:[13] 1 achne Learnng methods a Decson Trees Decson tree s one of the most popular classfcaton algorthms n current achne Learnng. A decson tree s usually used n classfcaton, clusterng and predcton tasks as a predctve modelng technque. They are deal methods for makng fnancal decsons where lots of comple nformaton needs to be taken nto account. b Genetc Algorthms Genetc algorthms are a method of breedng computer solutons to optmzaton problems by smulated evoluton. The processes based on crossover and mutaton repeatedly appled to a populaton of bnary strngs. Tme after tme, the better ft ndvduals and average ndvduals are created, and a good soluton to the problem s found. Genetc algorthms usually are used to predct and used to replace the mssng attrbutes. It means when there are some attrbutes mssed, we can analyses ts specmens usng genetc algorthms to get the possble value and replace the mssng one. 2 Statstcal ethods a Statstcal Analyss Statstcal analyss s one of the most mature and proven data mnng methods. The key of ths method s construct approprate statstcal models and mathematcal models to nterpret the data models. Ths approach requres the user has abundant knowledge of ths feld. Generally, statstcal analyss has two steps: frstly, the user chooses the approprate data from data warehouse. Secondly, the user uses vsualzaton functons and analyss functons provded by statstcal analyss tools, n order to fnd the relatonshp between the data and statstcal models and constructed mathematcal models to nterpret data. The second step needs to be repeated and contnuous carefully. b Cluster Analyss Cluster analyss classfcaton s based on ther characterstcs n order to dscover typcal pattern. When the data that should be analyzed mss the descrbe nformaton or can not be organzed nto any classfcaton model, the use of cluster analyss wll be automatcally dvded nto categores accordng to certan - 11 -
characterstcs. The substance of cluster analyss s an overall optmal problem, commonly used n the market subdvson, customer orentaton, performance evaluaton, and other aspects. 3 Neural Networks Neural Networks s an nformaton processng system that conssts of a graph representng the processng system as well as varous algorthms that access ths graph. It s structured as a graph wth many nodes and arcs between them. It can be consdered as a drected graph wth nput, output and nternal nodes. After accept a varety of nput, every nerve calculates the total nput value and then uses flterng mechansms to compare the total nput, n order to determne ts own output value. When change the lnk weght between two nerves or two layers, neural network s on a study or "tranng." After "tranng" the neural network can be used to predct the lkely outcome of the estng cases, the analyss could also be appled n customer relatons, or other felds. 2.3 Analyss and odelng for Data nng In the prevous sectons we brefly descrbed the process of data mnng and data analyss methods. As some of desgn modelng wll nvolved n ths paper, ths subsecton wll be used to gve more n-depth dscusson about the concept of modelng, and nspect several maor types of models for data mnng, n order to provde a theoretcal bass for the follow-up sectons. odel s hgh-level data sets, and a global summary. It usually treats the nteger through a large group of samples. odels can be descrptve, whch nduce data usng a concse manner; they can also be ratonal, whch allowng makng some certan nferences for the data nteger or future data. In ths paper, we wll ntegrate several forms of theoretcal models as a bass, such as Lnear Regresson odels, Hybrd odels, and arkov odel. In modelng, t must be noted that when summarzed data we should take account nto factors, such as tme factor. If the used method has certan lmtatons, t may lead to a dstorton n the model. Therefore, a model s good or bad, that needs to be tested by realty. 2.3.1 Fundamentals of odel - 12 -
odels are abstract descrpton of the real world process. We start from the smplest model to epatate ts meanng. For eample, a smple model mght take the form of Y = ax + c, where Y and X are varables and a and c are parameters of the model constants determned durng the course of the data mnng eercse. Here we would lke to say that the functonal form of the model s lnear, snce Y s a lnear functon of X. We can set a=1, c=2 to smplfy ths model. ore generally, we also can mprove ths model to Y = ax +c +e, where e s a random varable component of the mappng from X to Y. Normally, a, c are consdered as the parameters of the model, and we often use the notaton θ to epress a generc parameter or a parameters set, where θ = {a, c}. Gven the form of structure of a model, we choose the approprate values for ts parameters. Ths s acheved by mnmzng or mamzng an approprate score functons measurng the ft the model to the data. odelng s to dscuss ssues from a theoretcal perspectve, but we must be recognzed that the theory and practcal phenomenon are always dfferent. We should recognze that "all models are not perfect, but some are useful. For eample, we may assume that the estence of a lnear model abstracts a process, but the realty s that there are always some nonlnear roles. Ths s what the models can not take nto account. What we want s to fnd a model whch can summarze the man features of a process. The followng are eamples of several common model structures [16]. 2.3.2 Structures of redctve model In a predctve model, the varable s epressed by a functon of the other varables. We take Y = ax +c +e as an eample. The values of the response varable Y are predcted from gven values of predctor varables X. Generally, the responsor varable n predctve models s often denoted by Y, and the p predctor varables are denoted by 1,2, p. The model wll be yeld predctons, y=f 1, 2, p ; θ, where y s the predcton of the model and θ represents the parameters of the model structure. When Y s quanttatve, ths task of estmatng a mappng from p-dmensonal X to Y s known as Regresson. When Y s categorcal, the task of learnng a mappng from X to Y s called Classfcaton learnng. Both of them can be referred as Functon Appromaton problems n whch we are learnng a mappng from a p-dmensonal varable X to Y. 2.3.3 Lnear Regresson odel - 13 -
Net, we are gong to dscuss a lnear predctve model. Its structure s smple and easy to understand. Its response varable s a lnear functon of the predctor varables: Y = a 0 + p = 1 a X a a a Whereθ = { 0, 1,..., p }. We note that the model s purely emprcal, so that the estence of a well fttng and hghly predctve model does not mply any causal relatonshp. We can retan the addtve nature of the model, whle generalzng beyond lnear functons of the predctor varables: Where the Y = a p 0 + a f X = 1 f functons are smooth functons of X. f could be log, square-root, or related transformatons of orgnal X varables. The model assumes that the dependent varable Y depends on the ndependent varables X of the model n an addtve fashon. Ths may be a strong assumpton n practce, but t wll lead to a model n whch t may be easy to nterpret the contrbuton of each ndvdual X varable. Ths can be found n our frst model n ths paper. Furthermore, we can generalze ths lnear model structure to allow general polynomals n the Xs wth cross-product terms to allow nteracton among the X n the model. Note that by allowng models wth hgher order terms and nteractons between the components of X we can estmate a more comple surface than a smple lnear model. However, we note that as the dmensonalty p ncreases, the number of possble nteracton terms X n the model ncrease as a combnatoral functon of p. The nterpretaton and understandng of such a model makes the problem more dffcult, moreover t wll become more dffcult as p ncreasng. But the response varables compared to the parameters of model are stll lnear. So the estmate of the parameters wll become much easer. The generalzaton to polynomals s called the complety of the model. The more comple models contan the smpler models as specal cases. For eample, the 1st order a 1 X 1 +a 0 model can be consdered as a specal case of 2 the 2nd order polynomal model a 2X1 + a1 X1 + a0 by set a 2 = 0. So t s clear that a comple model can always ft the observed data at least as well as any smpler model can. Ths rases the complety of how we should choose one - 14 -
model rather than others when the complety of each s dfferent. There s always a queston. We may want a model whch s closest to some hypothess, a model that captures the man features of the data wthout beng too complcated, and so on. We must know how to fnd a model balance both precson and effcency. To generalze a lnear structure we can transform the predctor varables X. We can also transform the response varable. A good way for further generalzaton s to assume that Y s locally lnear n the X s, wth a dfferent local dependence varous regons of the X, that s a pecewse lnear model. The pecewse lnear model s a good way to solve how we can buld relatvely comple models for nonlnear phenomena by pecng together smple components. Ths s also why we use t as an mportant fundamental of our model. 2.3.4 redctve model for Classfcaton By now we have dscussed about predctve models whch the varable s predcted, where Y s quanttatve. Now we consder the case of a categorcal varable Y, whch only take a few possble categorcal values. Ths s a classfcaton problem. The am s to assgn a new obect to ts correct Y category on the bass of ts observance X value. In classfcaton problem what we need to do s to set dfferent data to dfferent categores. A classc approach s to use a lnear hyperplane n the p-dmensonal X space to defne a decson boundary between two classes. The model parttons the X space nto dsont decson regons, where the decson regons are separated by lnear boundares. We can use the hgher-order polynomal terms, yeldng smooth polynomal decson boundares, to get a more comple model. 2.3.5 Stochastc arts of Data nng odel In the prevous dscusson, we brefly referred to stochastc parts of data mnng model. Now we are gong to talk about the functons of stochastc parts. It s very hard to fnd a perfect functonal relatonshp between the predctor varables X and the response varable Y. The fact s for any gven predctor varables, more than one value of Y can be observed. The dstrbuton of the values Y at each value of X represents an aspect of varaton. The varaton can be dvded nto two categores: 1 Uneplanable varaton - 15 -
Ths knd of varaton wll be reduced by decreasng the complety of the model. We call t uneplanable or nonsystematc or random parts of the varaton. 2 Eplanable varaton Ths knd of varaton s also called Systematc varaton. The varaton n Y can be eplaned by the X varables. For eample: We have mentoned the regresson modelng before. We can etend t to nclude a stochastc part. We assume that for each X we can observe a partcular Y, but the Y s added some nose. So the relatonshp between X and Y become: y = g, θ + e Where g, θ s a determnstc functon of X, whle e s usually set to zero and assumed to be a random varable wth constant 2 varance σ, whch s ndependent from X. The random term e reflects the nose n the measurement process. ore generally, e reflects the fact that there are hdden varables. Ther affectons on Y can not be epressed by the determnstc functon of X. Rasng the stochastc parts gves a good way to mprove the accuracy of models. 2.3.6 Summary In the dscusson so far, we brefly ntroduced the modelng theory n data mnng. The core prncple s ncorporatng the relatvely smple models nto comple model, or usng dfferent methods generalze the smple model to comple model. Accordng to the modelng theory, none of the models for data mnng s absolutely solated, but nterconnected by a varety of relatons. Ths s not hard to understand. As a comple functon s ncorporated by several basc functons, each model for data mnng s a generalzaton of other models, or a specal case of other model. As we all know, n data mnng the key of establshng an effectve model s to select the best model form n order to solve current problem. Ths s not only the process of selectng a model, dealng wth date, and gvng out the results. The process of modelng needs contnuous fttng work, and repeated mprovng work. Ths process s knd of endless. - 16 -
2.4 Data nng n arketng In the recent decades, the development of nformaton and communcatons technologes nects new vtalty for enterprse marketng. For eample, bar code technology and the emergence of onlne stores greatly both enhance the effcency of the enterprse. Resultng, company managers are begnnng to face the enormous data. The data s ncreasng at a very rapd pace, probably 1000 tmes than fve years ago. However, the data and busness profts are not drectly proportonal. Unfortunately, the human bran can not handle so much data. In the meanwhle, data mnng technology becomes very mature n theory. Thess the technology-orented applcatons for enterprse decson makers wth a new perspectve to look at market. Those advanced technologes let enterprses obtan a lot of resources from dfferent channels, and use those effectve tools to translate data nto unlmted opportuntes. 2.4.1 Applcaton of Data nng n arketng D technology n the marketng s a relatvely unversal applcaton. Such applcatons are referred to a Boundary Scence, because t sets a varety of scentfc theores n all. Frst, two basc dscplnes: Informaton Technology and arketng. Another very mportant bass s Statstcs. In addton, t relates to the psychology and socology as well. The charm of ths area s ust about the wde scope of dscplnes study. Generally speakng, through the collecton, processng and dsposal of the large amount of nformaton nvolvng consumer behavor, dentfy the nterest of specfc consumer groups or ndvdual, consumpton habts, consumer preferences and demand, moreover nfer correspondng consumpton group and the net group or ndvdual consumpton behavor, then based on them sale produces to the dentfcaton consumer groups for a specfc content-orented marketng. Ths s the basc dea. As automaton s popular n all the ndustry operate processes, enterprses have a lot of operatonal data. The data are not collected for the purpose of analyss, but come from commercal operaton. Analyss of these data does not am at studyng t, but for gvng busness decson-maker the real valued nformaton, n order to get profts. Commercal nformaton comes from the market through varous channels. For eample, purchasng process by credt card, we can collect the customer s consumpton data, such as tme, place, nterestng goods or servces nterested, wllng prce and the level of recepton capacty; when buyng a brand of cosmetcs or fllng n a member form can collect customer purchase trends and frequency. In addton, - 17 -
enterprses can also buy a varety of customer nformaton from other consultng frms. arketng based on data mnng usually can gve the customer sales promoton accordng to hs prevenent purchase records. It should be emphaszed data mnng s applcaton-orented. There are several typcal applcatons n bankng, nsurance, traffc-system, retal and such knd of commercal feld. Generally speakng, the problems that can be solved by data mnng technologes nclude: analyss of market, such as Database arketng, Customer Segmentaton & classfcaton, rofle Analyss and Cross-sellng. And they are also used for Churn Analyss, Credt Scorng and Fraud Detecton. Fg 2.1 shows us the relaton between applcaton and data mnng technques clearly and completely. Fg.2.1 Applcaton of data mnng for marketng The basc process of data mnng n marketng show as follows: Fg.2.2 shows the prncple of data mnng applcaton n marketng a repare prmtve data. It ncludes ndvdual character nformaton such as age, gender, hobby, background, professon, address, postcode, and ncome, the prevous purchase eperence, and - 18 -
the relatonshp wthn customers. The preprocessng of prmtve data s very mportant for selectng potental customers. b Establsh a certan model. Ths model may utlze plenty of tradtonal data mnng technologes and many technologes from other related subects. However, the problem whch those technologes should solve s seekng for the best or acceptable market plan, wthn lmted data source, lmted tme, and lmted epense. The three lmts are the fundamentalty of modelng algorthm. c At last, accordng to the model, utlze testng data to get each pattern or parameter. Ultmately, use ths model to select customers and decde marketng plan. Data from nsde arketng Data from outsde Sample D methods Test data usng other models Create model & Eperment Evaluate epermental Fg. 2.2. Schematcs for D applcaton n arketng 2.5 Applcaton of Data nng n CR In ths subsecton, we gve out the ntroduce of CR. It ncludes both ocr and acr. Whle what we focus on s acr. 2.5.1 Introducton of CR Customer Relatonshp anagement CR s a strategy to acqure new customers, to retan them and to recover them f they defected. [15] In the recent days, ndvdual customer has brought pressure of change n marketng - 19 -
practces. One of the man goals of CR s: Generatng addtonal product benefts by means of communcatons and servces whch are desgned and delvered to match the ndvdual needs of customers. There are two knds of CR [15]: 1 Operatonal CR ocr actvty s mplemented n the enterprse processes: sales, marketng or servce. ocr nvolves all actvtes about the drect customer contact. 2 Analytcal CR acr provdes all components to analyze customer characterstcs n order to accomplsh ocr actvtes, wth respect to the customers needs and epectatons. There, the dealstc goal s to provde all nformaton necessary to create a talored cross-channel dalogue wth each sngle customer on the bass of hs or her actual reactons. We d better to look at CR whch ncludes ocr and acr as a cross enterprse process, n order to acheve the goal to show merely one company mappng to a customer. arketng, sales and servce departments have to coordnate ther responsbltes, actvtes, nformaton systems and data. The Fg.2.3 shows the cross functonal process of CR. arketng Sales Servce CR as cross-enterprse process 2.5.2 Concept of acr Indvdual Fg.2.3. CR as cross functonal process Data mnng n the CR applcaton s prmarly emboded n: customer classfcaton, analyss of customer relatons, market orentaton, and establshng predctve models. That s CR analyss module. Fg.2.4 shows the structure of acr. - 20 -
CR analyzng and estmatng subsystem Customer Classfcaton odule Analyss of Customer Behavors odule Analyss of arket odule Other Subsystems Customer Database & Routne Data Deposted Fg.2.4. Structure of acr 1 Customer classfcaton module Ths module classfes customers by customers value and set relevant customer level. It can lead the enterprses to dstrbute the resources of market, sales, and servces to the valuable customers. The enterprses can am at the valuable customers gvng them specal sales promotons, and provdng more personalzed servces, n order to get the mamum returns by least nvestment. Ths s what we are nterested n. Generally, classfcaton can be consdered through three aspects: a. Eteror attrbutes These attrbutes nclude the customer s regonal dstrbutng, holdng produces, and organzatonal attrbute Customer can be dvded to enterprse customer, government customer and ndvdual customer. Usually, ths knd of classfcaton s smple and ntutonst. The data s also very easy to get. However, ths knd of classfcaton s general. We stll don t know that those hgh valuable customers wthn n each classfcaton are. What we know s only whch category of customers have more purchase power than other category. b. Inherent attrbutes These attrbutes nclude age, gender, nterest, ncome, credt, and so on. c. Classfcaton of consume behavors The analyss of consume behavor s usually consdered as three aspect. That s RF: recent consume, frequency of consume, and magntude of consume. All of these data can be acheved from the accountng system. However, ths knd of classfcaton can only be used on estng customers. Snce - 21 -
there s no consume, the potental customers can not be classfed by t. CR can dvde customers nto many categores. The customers wthn one category have the same attrbutes, whle the attrbutes of customers from dfferent category are certanly dfferent. We can provde dfferent servce to customers from dfferent category, n order to enhance the satsfacton. We can easly fnd the advantages of classfcaton. Even a smple classfcaton can brng the enterprse a satsfyng result. 2 Analyss of customer behavors module Ths module manly process the analyss of customer s satsfacton, loyalty of customer, correspondence of customer, predcton of customer s lost, and cross sales. 3 Analyss of market module arket s the man goal of enterprse. The enterprse can wn the competton only by handlng the trend of market. redcton of market trend ncludes analyzng and predctng the development of produces, predctng the dfferent consume trend of customer from dfferent regon, and predctng the changes appearng as the season s change. 2.6 Summary Whle ocr and acr are accepted by enterprses, data mnng technologes also get a more mportant role wthn acr. The key of acr s how to fnd out the most valuable customer and customer group for enterprses by data mnng technologes. Ths s what we are gong to solve n the followng chapters. - 22 -
3 odelng Based on Attrbutes of Customer Relatonshp ACR In ths chapter, we are gong to let you know why we use Attrbutes of Customer Relatonshp ACR and how does t look lke. Ths chapter s basc nformaton for Chapter 4. 3.1 roblem statement In ths subsecton, we focus gve a general dea about customer value. Ths wll help us defne how to fnd bg customer or more valuable customer. 3.1.1 Crteron of Customer Value Applcaton of data mnng n CR s helpng enterprse to dg out the most valuable customers. any managers and marketng decson-makers usually focus on the ncome-flu brought to enterprses by customers. Commonly, consume quantum s the crteron of customer value. It means that customer wth hgh consume quantum wll get more attentons. oreover, they wll get more favorable prce and better servce. Nevertheless, ths crteron of customer value s doubted lately. ore and more companes fnd that many bg customers are not large proftless customers, furthermore, sometmes they brng the company negatve proft. The reason s that company ddn t use a reasonable crteron to estmate customer value. Usng consume quantum as the crteron, the company potentally thought the more consume quantum s, the more value customer have. As a result, they gve too many servces to the bg customers whch dd not get a good result. Now we gve out two customers who have the same consume quantum, after comparng you wll fnd the problem of the crteron mentoned above. Assumng we have two bg customers, named A and B. A and B look smlar, because they brought almost same purchase to the company n last 12 months, furthermore ther consume trends are smlar. If we only use ncome-flu as the crteron of customer value, A and B should have the same value. But the probable fact s A s a loyal customer of ths company for years. A do not only buy produces from the company, he also recommend them to hs colleagues - 23 -
and frends. After he bought one produce, hs frends maybe buy ten same produces. B as another bg customer buys produces, but he s always alone and rarely recommends others. Thus, we can fnd out the value of A and B s clearly dfferent, but ths dfference can not be udged by the smple crteron mentoned above. Va ths eample, we can fnd the lmt of untary crteron. Customer A certanly should get more attenton from the company. So we should fnd the more comprehensve crteron of customer value. 3.1.2 Dscusson based on Customer Classfcaton If we are not gong to use ncome-flu as the crteron to evaluate customers, we must fnd out another prncple for customer classfcaton. Customer classfcaton s the customer set parttoned by any attrbute of customer. We referred to much nformaton about customer classfcaton. Furthermore, we fnd that there are many questons when we make the classfcaton: 1 What s the dfference between customers? 2 What s the most comprehensve udgment of customer value? 3 What s the dfference of the customers who have the same purchase record? 4 Whch factor wll mpact the loyalty of customer? 5 What fashon does customer classfcaton have? Is there any other classfcaton varable besdes the proft brought by customer to company. 6 Is classfcaton untve? Thus, once a customer s classfed to a category, all the departments of ths company should have the same classfcaton behavor, rght? In order to consder all the questons above, we d better fnd the approprate crteron of classfcaton, and establsh approprate model to smulate customer group. Furthermore, use data mnng technology to partton certan customer group. And then use relatve marketng strategy on the most valuable customers. 3.2 Segmentaton of Customer Value Tradtonal concept of customer classfcaton n CR usually ust consders customers as many ndvdual unts or an obect. The methods we mentoned before are all based on the purchase probablty of potental customer and the profts from whch company can get. It assumes customers as a large obect. - 24 -
Every ndvdual customer s solaton and wthout any connecton. It merely consders the profts whch customers brng to enterprse. Ths s what we call self-value. However, customers and enterprses are all n the certan socal relatonshp network. When the customer s gong to make the decson of a purchase, the decson s not only depended on hs own nterest, but also mpacted by the opnons from others. In the meanwhle, he can mpact on the probablty of others of purchase the produce. Ths s what we call network-value. When two customers have the same self-value, we should gve more sales promotons to the one wth hgher network-value. In ths paper, we are gong to add the concept of network-value to the customer consderng system of CR. We dvde the large obect of customer, n order to do some deeper researches on the comple relatonshp of customer. Ths wll gve CR a new vsual angle of customer classfcaton. Frst of all accordng to the concept of network-value, we prmly dvde customer value nto two dmensonaltes. They are customer s self-value and network-value. Furthermore, we gve each dmensonalty two levels: hgh level and low level. By now we can dvde customers nto four groups. Ths subdvson wll be epressed as a matr, whch s atr for Customer Value Fg.5. III IV Network-value I II Self-value Fg 3.1. atr for Customer Value Accordng to ths Fg 3.1, we can fnd that IV category of customer s the one enterprse should focus on, whle I category s opposte. eanwhle, II and III category of customer should be dscussed. Category II s the favorte customer to tradtonal CR. They often can get the best prces and servces from enterprses. However, category III s usually neglected by tradtonal CR customer classfcaton. They rarely get the attentons from enterprse - 25 -
because of the low self-value. After a long tme, ths category of customer wll lose confdence of the enterprse, so they leave. Accordng to ther hgh network-value, ths wll cause a large loss for the enterprse. After the analyss above, we thnk that evaluatng both self-value and network-value of customer carefully s sgnfcant to the marketng of enterprses. How to evaluate customers self-value? How to get the network-value by data mnng tools? To solve these problems s an nterestng work and we wll talk about them later. Therenafter we are gong to do more analyss of the atr for Customer Value. 3.3 Concept of Attrbutes of Customer Relatonshp ARC As we talked before, prevenent sales market has the lmt of nformaton communcaton. The enterprse only consders customer as an solatve unt. As the development of nformaton and network technologes, a comple customer network s formed. Accordng to the matr from last subsecton, self-value s epressed by the purchase after sales promotons. Network-value s epressed by the mpact of the purchase to other customer. We must menton that customer network s an unordered network. The mpact s mutual. Whle the customer affects to others, he also get the mpact from others. For eample, before many purchases the customers wll ask for comments from others who have bought the produce or they wll search from webstes. And then they wll make ther own decsons. The same after ther purchases they may gve out ther comments to others va nternet. Wthn ths process, some customers have more self-leadng. They mostly make decsons by ther own nterests. However, there are some others may change ther thought after readng comments. At the same tme, some of the customers would lke to gve ther comments of produces to others, but some others maybe not lke to. So we thnk the network-value of a customer should be referred to the mpact on others. Thus, others attrbutes should be used as the crteron of a customer s network-value. Accordng to ths knd of comple stuatons, tradtonal CR analytcal strategy looks powerless. We must use a new strategy to evaluate customer value. So we subdvde customers based on the matr of customer value. We change the dmensonalty nto self-value and mpact-value. Lke Fg.3.2: - 26 -
Impact-value III Bdrectonal I Dvng IV Consultng II Self-centered Self-value Fg.3.2. atr for subdvson of Customer Value I. We call them Dvng customer. Ths group of customers would lke to search for the comments of a produce before they makng ther purchase decsons. And they may change ther mnd easly accordng to other s mpact. However, they don t lke to gve out ther comments to others. II. We call them Self-centered customer. Ths group of customers has more defnte dea. They usually make purchase decsons by themselves and they are not gong to make others lsten to ther opnon. III. We call them Bdrectonal customer. Ths group of customers would lke to lsten to others opnons and also lke to gve out ther own comments. IV. We call them Consultng customer. Ths group of customers s very sut to do consultng to potental customer. They have a lot defnte deas. In the meanwhle, they would lke to share ther purchase eperences wth others. After ths categorzng, enterprses should be able to dg out the most valuable customers and gve them approprate sales promotons. However, when we check the estent data we fnd out that the most valuable customers are not from the same category. For eample, seemngly the customers from category III should be gve the most sales promotons. Snce they are easly to accept others opnons and they have more opportuntes to persuade others to buy some produces. However, ths s not the truth. Let s take the customers from category I as an nstance. Although they rarely consder others opnon, they may have very hgh self-desre to buy a produce. Ths desre may be - 27 -
much more than the desre of Bdrectonal customer whch s gotten after others mpact. Therefore, we consder the purchase trend of customer as the synthetcal ehbton of customer relatonshp attrbutes. Thus, customer value depends on both self-relance nde wthn purchase behavor and mpact nde after purchase. In ths paper, we call them Attrbutes of Customer Relatonshp ACR. ACR s the key of customer classfcaton. And how to effectvely dg out these ACs s the man work of ths paper. 3.4 Dssmlarty n Cluster Algorthm In ths subsecton we are gong to dscuss about how to evaluate whether a customer trend for hearng hs own dea more. Here we would lke to use the concept of Dssmlarty n Clusterng Algorthm for data mnng, n order to gve out an evaluatng method whch suts to our model. Dssmlarty n Clusterng Algorthm for data mnng s used to represent the smlar degree of two obects. When the attrbutes of represented obects are dfferent, the algorthm for dssmlarty s dfferent as well. In the Clusterng problem doman, for hgh dmensonal sparse data of bnary varables, a dssmlarty measure algorthm named Spare Feature Dssmlarty SFD of a set was put forward [4]. SFD of a set represents the smlar degree of all the obects n a set. Defnton 3.1 SFD of a set: Gven n obects, each obect s descrbed by m attrbutes. m equals 1 or 0. X s a set of obects, n whch the number of obects s denoted as X, the number of attrbutes that equal 1 for all obects s ndcated by a, and the number of attrbutes that equal 1 for some obects and equal 0 for other obects s ndcated by e. SFD of set X, denoted as SFDX s defned as: [18] e X SFD X = a 3.1 SFDX represents the smlarty of the obects n set X. The smaller the SFD s, the more smlar the obects are. Ths feature of SFD totally accord wth the logstc defnton of customer s self-relance nde. The more the dssmlarty s, the more trend of hearng hs own dea the customer has. Thus, we gve out the defnton of self-relance nde based on the concept of dssmlarty. We abstract the behavor of customer based on the purchase record n database as follow: - 28 -
In database each customer s purchase record relates to produce ID. In order to make the follow data processng convenent, we defne each customer as an obect. All the produces wthn database are the attrbutes of ths obect. When the customer buyng a produce, the relatve attrbute of ths customer wll be set as 1, otherwse the attrbute wll be set as 0. We know that ths abstract completely accord wth the defnton of SFD. So we defne self-relance nde SR nde as follows: Defnton 3.2 SR nde: Gven n obects of customers, each obect has m attrbutes for produces, set as 1 or 0. X={ 1, 2 n } s an ordered obect subset, n whch the number of obects s denoted as X. The number of attrbutes, that equal 1 for the frst obect and equal 0 for the net X -1 obects and the attrbutes that equal 0 for the frst obect and equal 1 for the net X -1 obects, s ndcated by e; and the number of attrbutes that equal 1 for the frst obect and the equal 1 for at least one of net X -1 obects s ndcated by a. Thus, dssmlarty of the frst obect n X s defned as: e SFD 1 = X a 3.2 And self-relance nde s defned as SFD 1 SR 1 = 1+ SFD 1 3.3 So we can learn from ths defnton: the bgger the SFD s, the bgger the SR nde s. 3.5 Lft n Assocaton Rules By now we have confrmed Self-relance nde for each customer. That means we know how much the customer wll hear others opnon. The followng work s evaluatng how much the customer can mpact others. Now we are gong to defne our evaluatng method based on the concepts from assocaton rules. Here are some concepts from assocaton rules [4]: Defnton 3.3 Confdence: In the transacton set D, transacton T support temset X. There are C% transactons from T also support temset Y. C% s named as Confdence of assocaton rule X { Y. That s, T T DAnd X U Y T } ConfdenceX Y = 3.4 T T DAndX T { } - 29 -
Defnton 3.4 EpectedConfdence: In the transacton set D, e% of transacton T support temset Y. Thus e% s named EpectedConfdence of assocaton rule X Y. That s, { T T DAndY T } EpectedconfdeceX Y = 3.5 D Defnton 3.5 Lft: Lft s the rate of confdence and epectedconfdence. That s, confdence X Y LftX Y = 3.6 epectedconfdence X Y Lft represents how much mpact that the estng of temset X wll affect on the estng of temset Y. When the frst tme we saw ths defnton, we were very ectng. Because ths s ust the evaluatng method what we need to represent our mpact nde of customer. Thus, we gve out our defnton of mpact nde. Defnton 3.6.1: In the database D, customer X ests n the customer purchase record T, and c% of the stuaton ests customer Y as well. c% s called the confdence of X and Y purchases a produce at the same tme. That s the rate of when customer X purchases one produce customer Y also purchases t. { T T DAnd X U Y T} Confdence Y X = 3.7 T T DAndX T { } Defnton 3.6.2: In the database D, e% of the customer purchase record T has customer Y. e% s called as the epectedconfdence of X and Y purchases a produce at the same tme. Epectedconfdence represents the probablty of customer Y purchase one produce wthout any condton. { T T DAndY T} Epectedconfdence Y X = 3.8 D Defnton 3.6: Lft represents the rate of confdence and epectedconfdence. In our model, that s mpact nde. It represents when customer X purchase one produce how much mpact wll beng brng to customer Y. confdence Y X Impact Y X = 3.9 epectedconfdence Y X By then, we already can mne the self-relance nde and mpact nde of a customer. The net task s fndng an approprate model to estmate the value of each customer. - 30 -
3.6 Search Reference ethod for Network Relaton As the network mpact has been consdered n ths paper, ths mpact should be evaluated. Snce t has lot completes, we are gong to learn some related models and try to fnd nspre from them. After a long tme searchng, two models were found. The analyss of these two wll help us to buld our own model. 1 odel based on arkov random feld edro Domngos and att Rchardso gave out a model for evaluatng the customer s network value, whch s the epected proft from sales to other customers he may nfluence to buy.[5] Instead of vewng a market as a set of ndependent enttes, they regarded t as a socal network and model t as a arkov random feld. Ther soluton s based on modelng socal networks, where each customer s probablty of buyng s a functon of both the ntrnsc desrablty of the product for the customer and nfluence of other customers. 2 odel based on lnear network We found another model based on lnear network was gven out later. Ths model etends the prevous technques, acheves a large reducton n computatonal cost, and apples them to data from a knowledge-sharng ste. [11] They showed how to fnd optmal vral marketng plans, used contnuously valued marketng actons, and reduce computatonal costs. They employ a smple lnear model to appromate the nteracton between customers. It smply consders a customer s value s the combnaton of customer s ntrnsc value and network value. Those references gve us a good dea to establsh ours. - 31 -
4 Implementaton of Data nng odel In ths secton, we are gong to set up a lnear regresson model for customer value. The data mnng modelng theores and lnear math models mentoned n Chapter 2 wll gve us a good academc basc. That s the reason that we ntroduce them at frst. 4.1 Symbolc System a To make t smple, we focus on a certan produce when we mne the most valuable customer. The feature set of ths produce s Y={ y 1,y 2,,y m }. b Gven n potental customers to ths produce. It s set X={ 1, 2,, n }. Furthermore, we defne the neghbors of customer s N X - { }. Ths neghbor represents the one who get drect mpact from the purchase of. c Defne mappng s: X {0,1} to epress purchase states of all the customers. If customer bought ths produce set s =1, otherwse set s =0. Smply, we use to represent s. So we can use X={ 1, 2,, n } to represent the purchase state of each customer. Usng N to represent purchase state of the recent customer s neghbor. d At last, defne mappng m: X {0,1} to epress the recent sales promoton of enterprse. m represents the sales promoton degree to customer. We denote m as m.let ={ m 1,m 2,, m n } represent the state of the entre sales promoton plan. 4.2 Estmate of urchase robablty Now we are gong to estmate the purchase probablty of a customer. Based on certan sales promoton and purchase state, ths produce s purchase probablty of each customer s: X { }, Y,. As customer wll not be mpact by customer X-N, we have: X { }, Y, = N, Y, - 32 -
Furthermore, we do lnear estmate on N, Y,. We consder N, Y, equal to the lnear ncreased of purchase probablty from customer hmself and from neghbor s mpact. That s, X { }, Y, = N, Y, 4.1 = SR Y, + 1 SR N, Y, 0 Y, 0 N s the purchase probablty based on customer s self-wllng. We call t embedded probablty. It doesn t relate to customer s neghbor. N N, Y, s the purchase probablty based on mpact from neghbor. We call t network probablty. Whle SR s the self-relance nde whch we mentoned before. oreover, we do lnear estmate on N = 1 N, Y,. We consder t as the lnear plus of all neghbor s states. That s, = 1 N, Y, = mpact = 1 Y, 4.2 N N mpact represents how much customer mpact on customer. s the recent state of customer. That s s. If N, set mpact =0. From formula 4.1 and 4.2, we get = 1 N, Y, = SR = 1 Y, + 1 SR mpact = 1 Y, 0 N 4.3 Gven C N s the set of all the possble states of customer from N. That ~ s f N C N, N ~ wll be one of the states of customer from N. Generally, all the neghbor states are unknown, so = 1 Y, s what we need to calculate. So we have - 33 -
- 34 - + = = = = = CN N CN N CN N N N N N N ~ ~ 0 ~, ~ ~ 1, ~, 1, ~,, ~ 1,, 1 Y N mpact SR Y Y SR Y Y Y N 4.4 ~ N represent the value of based on N ~. As, 0 Y doesn t relate to the state of N, we have = CN N N ~ 0, ~, 1 Y Y SR =, 1 0 Y SR =. Snce the value of ~ N s 1 or 0, we have CN N N ~, ~ ~ 1 Y N mpact SR N = = N N Y N mpact SR, ~ ~ 1 1 ~ ~ N wth CN N = = N Y mpact SR, 1 1 At last, there s = + = = = N Y mpact SR Y SR Y, 1, 1, 1 0 1 4.5 Accordng to formula 4.5, we fnd that to teratve calculate all the customers purchase probablty we only need to know the embedded probablty of each customer, that s, 1 0 Y =. In the meanwhle, we should use t as the ntal value of customer purchase probablty. Accordng to Naïve Bayes odel, we have = = Y Y Y Y Y, 0 0 0 0 0 0 0 0 0 0 0 4.6
Now we only need to calculate 0 Y, and 0,. For the frst n Y + mp one, we can use 0 Y, = to calculate. n Y + p n Y represents the number of customer s purchase based on the attrbute Y. p s an ntal estmate of probablty, usually use ts eperence value. For the second one, we can suppose that 0 = 1 = 1 = mn α 0 = 1 = 0,1}. It means a customer s purchase probablty after sales promoton s α tmes as the probablty wthout sales promoton. That s the estmate of sales promoton s effect. Smply, we consder 0 = = 1 0.5, so we get: 0 = 0 = 0 = = 1 = 1 = 1 = = 1 = 1 = 1 = 1 + = 1 = 1 = 1 = 1 = 1 = 1 = 0 = 0 4.7 α = α + 1 1 0 = 1 = 0 = α + 1 By then, we can use formula 4.5 to calculate purchase probablty of each customer. 4.3 Evaluate Customer Value For evaluatng customer value, we are gong to use.domngos and.rchardson s evaluatng method. Ths can let us use the same crteron to estmate our eperment result. At frst, we defne 0 as a sales promoton confguraton when f =0. f 1 s a sales promoton confguraton when =1. The cost of sales promoton for one customer s c. The proft brought from the produce when customer buys t s r. So the proft whch enterprse can get from customer after sales promoton s: 1 1 0 EL = r = 1 Y, f r = 1 Y, f c 4.8-35 -
Whle based on a specfc sales promoton plan s state, enterprse can get the total proft s n ELY, = r = 1 Y, r = 1 Y, 0 c 4.9 = 1 0 ={0,0,,0} means there s no sales promoton. Thus, customer s entre value s 1 0 TOLV = EL Y, f EL Y, f 4.10 Utlze ths evaluatng method we can translate customer s purchase probablty nto the value for enterprse. - 36 -
5 Eperments In ths chapter, we ntent to gve you a general ntroduce about our eperments. It ncludes from data preparng to calculate result. Those fgures and tables wll help you get an ntutonstc dea. 5.1 Epermental Setup The ntenton of ths subsecton s to gve an abstract descrpton of the epermental setup. It ncludes the data sets, man functons and man algorthm. 5.1.1 Data Set We used 5726 purchase records as our epermental data. They are all true data from a real world company. Lookng at the orgnal data set, each purchase s epressed as a contract from, ncludng the nformaton as Fg 5.1-37 -
Fg 5.1 Contract sheet of the each purchase There are about 3800 Users, n our data set, And we also have the detal nformaton of each customer, ncludng User type company or ndvdual, User d, User name, ost code, rofesson type. as Fg5.2 Fg 5.2 User Informaton Sheet Fg 5.3 shows the dstrbutng of all the users. The pcture can not show each of those 3800 users clearly, but t s very obvous that some of the users purchased a lot of tmes, others dd not. That s easy for the company to suppose there are some bg customers n ths data set. They should have hgher value for the company, but ust from ths dstrbutng the company can not tell who the most valuable customers are. To solve ths problem s ust the goal of our model whch s defned n the prevous sectons. Furthermore, our eperment wll use the algorthm to set up before to mne the top ten valuable customers n ths data set. - 38 -
Fg 5.3 Dstrbutng of Customer urchases 5.1.2 an Functons Name and Descrpton Table 5.1 gves out the man functons n the eperment whch are used for calculate the value of customers. Functon Name Descrpton CalculateImpact Used to calculate the mpact among customers wthn one neghbor group. CalculateNeghbor Usng CABOSFV to classfy customers nto ther neghbor groups. CalculatenY Used to calculate N, Y, Calculateo Used to calculate, N 0 0 Y CalculateoY Used to calculate, CalculateoY Used to calculate Y, CalculateSR CalculateVOC 0 Used to calculate each customer s SR Used to calculate customers value based on both ACR and s model Table 5.1 Descrpton of man functons - 39 -
5.1.3 nng Algorthm 1 Code for mnng self-relance nde: vod CalculateSR { ths->calculateneghbor; } ths->sr.resze ths->u.sze ; fornt =0;<ths->u.sze;++ { ths->sr[] = ths->getsr ; } 2 Code for mnng mpact nde shows as follow: vod CalculateImpact { ths->i.resze ths->u.sze ; char tlne[1024]; fornt =0;<ths->u.sze;++ { ths->i[].resze ths->u.sze ; fornt =0;<ths->u.sze;++ { ths->i[][] = ths->getimpact, ; } } return; All the algorthms used by our eperment have been ntroduced and proved n secton 3 and 4. Furthermore, we used the formula 4.5 to calculate each customer s purchase probablty and formula 4.10 to calculate the total value of each customer. The evaluated results show n the net subsecton. 5.2 The evaluated results Here we choose 100 customers as a sample data set and dd a preprocess for ther purchase records, such as changng the user d to 1-100 n order to make t easy to be read. - 40 -
usr_d Our VoC usr_d 's VoC 62 9.3023 54 4.6846 75 8.6379 62 4.6512 88 8.3056 6 4.4877 76 7.9734 75 4.3189 79 7.9734 88 4.1528 97 7.6412 52 4.0253 66 6.9767 76 3.9867 73 5.9801 79 3.9867 6 4.7971 8 3.9410 54 4.0140 2 3.9401 Table 5.2 Evaluaton of the Value of Customer VoC For the table 5.2, the frst two rows are the top ten valuable customers d and the Value of Customer VoC gotten from our algorthm. Whle the last two rows show the top ten valuable customers d and Voc mned from s model. The mportant thng that must be emphaszed s the promoton costs from these two algorthms are the same. As we can see now, we have the smlar set of most valuable customer n our result and s. At the meantme, we have some dfferent customers. All of these show our model s correctness and possble mprove. oreover, when we look at the values we got. We fnd that the results from s model are wthout bg dssmlarty, whch means t hasn t gven a well estmate among those customers. However, our result can show the dfference between customers values more clearly. In another word, our model gves a promnence to ACR when we usng t calculate customer value and we emphasze the dfferences among them. Ths result s ust what we want at the begnnng of ths thess. 5.3 Dscusson The general dea of the eperments n ths paper has three steps. The frst one s gettng ecel fle data from a company s database and gvng preprocessng to the orgnal data. The preprocessng smplfed the contract form, and only kept roduce d as an attrbute set of User. The second step s realzng our algorthms usng VC++ whch have been set up n secton 3 and 4. - 41 -
Furthermore, usng tt fle as an nput data we calculate all the customers value based on ACR. The thrd step s usng the same data but s algorthm calculates all the customers value. Snce we know that s model s consdered as a mature model, we trust ts valdty and correctness. So when got the results, we take the top ten valuable customer as typcal results n order to make a contrast between to results. One the other hand, scence the data s from an nsurance company ther own eperences about bg customers also confrm our result as reasonable result. - 42 -
6 Concluson and Future Works By now the two man problems gven n the ntroducton has been solved. A more consummate segmentaton of customer has been gven out n Secton 3. It gave a more careful classfcaton based on the socal network envronment. Actually, t s also the basc dea of our model for estmatng customer value. In ths thess, the lnear regresson model was mentoned. It dvdes the huge obect of customer. oreover, t uses network relatonshp to gve a classfcaton of customers. Ths model may be not eact, because t used two lnear appromatons. But we vew t as reasonable appromatons. As when the probabltes are all small, as n most cases for marketng domans, lnear models usually perform well. When data s sparse they can provde sgnfcant advantages for computaton. Furthermore, by mplementng the model we gave n ths thess, the second problem gven n the ntroducton has been solved. However, we must say that the eperments n ths thess are very prelmnary. In the future work, we should try to mprove the vsualzaton of our applcaton. And the man contrbuton of the thess s n the proposal of methods, whch stll have to be evaluated on a large scale n realstc settngs. And also n the real world socal network s not that smple as we dscussed n ths paper. In the future works, we can stll mprove our models. For eample, we smply consder the mpact relatonshp of customers as one-to-one lnear ncreased. We dd not consder the stuaton of many-to-one mpact relatonshp. So the future work can be etended the lnear regresson model to a more comple trust relatonshp as well. - 43 -
7 Acknowledgement Ths paper was fnshed n my second semester n Vao Unversty. As an echange student, I want to thank ths unversty to gve me such a wonderful opportunty of dong a thess. Although t was hard, I apprecated t very much. Durng workng on t, I got a lot of help from both teachers and frends. Thanks to my supervsor ro. Joakm, he gave me plenty of good deas at the begnnng of ths paper, helped me to decde the eact problem doman whch should be solved, and gave me contnuous advses durng the work. Thanks to my teacher r. athas. He always tracks our work to make sure there s no problem, and f there s, he would gve you mmedate help to solve the problem. And he also gave me a good dea on how to wrte the thess and what s the process of ths course. Thanks to my frends, they gave me lots of encourage and help on both study and lfe. Thanks to my parents, they are always let me know they love me whch gves me huge motlty. At last, I want to thank ths echange program. I got abundant of wonderful memores from here. - 44 -
8 References 1. Jedd-Jah Jonker, Nanda ersma and Rob otharst, Drect alng Decsons for a Dutch Fundraser, Econometrc Insttute Report EI 2002-19 2. Keth C.C. Chan, Wa-Ho Au, Berry Cho, nng Fuzzy Rules n A Donor Database for Drect arketng by A Chartable Organzaton, 3. Rob otharst, Uzay Kaymak and Wm uls, Neural networks for target selecton n drect marketng, n ERI REORT SERIES RESEARCH IN ANAGEENT, arch 2001, ERS-2001-14-LIS 4.. Bastan, Data warehousng and data mnng 5.. Domngos and. Rchardson, nng the Network Value of Customers, In roceedngs of the Seventh Internatonal Conference on Knowledge Dscovery and Data nng, pages 57-66, San Francsco, CA, 2001. AC ress. 6. Fayyad, U. Et al., Knowledge Dscovery and Data nng Towards a Unfyng Framework, KDD-96 roc. 2nd Intl. Conf. On Knowledge Dscovery & Data nng, AAAI press, 1996 7. Charles X. Lng and Chenhu L : Data nng for Drect arketng roblems and Solutons, KDD-98 8. eter Van Der utten, Data nng In Drect arketng Databases 9. Data nng n the Insurance Industry,Solvng Busness roblems usng SAS Enterprse ner Software 10. Haughton, D. and S. Oulab 1993. Drect marketng modelng wth CART and CHAID, Journal of Drect arketng, 7, 16-26. 11. edro Domngos, att Rchardson, nng Knowledge-Sharng Stes for Vral arketng 12. Robert Groth, Data nng A hands-on Approach for Busness rofessonals 13. argaret H.Dunham, Data nng Introductory and Advanced Topcss 14. chael J. Shaw a,b,c,, Chandrasekar Subramanam a, Gek Woo Tan a,chael E. Welge b,knowledge management and data mnng for marketng. - 45 -
15. Data nng for arketng Applcatons, 12th European Conference on achne Learnng ECL'01 and 5th European Conference on rncples and ractce of Knowledge Dscovery n Databases KDD'01, September 7th, 2001 Freburg, Germany 16. Davd J. Hand, Hekk annla and adhrac Smyth, rncples of Data nng. 17. eter Van and Der utten, Data nng n Drect arketng Databases, World Scentfc, October 15, 1998 18. Sen Wu and Xuedong Gao, CABOSFV algorthm for hgh dmensonal sparse data clusterng - 46 -
Väö unverstet atematska och systemteknska nsttutonen SE-351 95 Väö tel 0470-70 80 00, fa 0470-840 04 www.ms.vu.se