Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques

Size: px
Start display at page:

Download "Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques"

Transcription

1 Expert Systems wth Applcatons Expert Systems wth Applcatons 34 (2008) Churn predcton n subscrpton servces: An applcaton of support vector machnes whle comparng two parameter-selecton technques Krstof Coussement, Drk Van den Poel * Ghent Unversty, Faculty of Economcs and Busness Admnstraton, Department of Marketng, Tweekerkenstraat 2, 9000 Ghent, Belgum Abstract CRM gans ncreasng mportance due to ntensve competton and saturated markets. Wth the purpose of retanng customers, academcs as well as practtoners fnd t crucal to buld a churn predcton model that s as accurate as possble. Ths study apples support vector machnes n a newspaper subscrpton context n order to construct a churn model wth a hgher predctve performance. Moreover, a comparson s made between two parameter-selecton technques, needed to mplement support vector machnes. Both technques are based on grd search and cross-valdaton. Afterwards, the predctve performance of both knds of support vector machne models s benchmarked to logstc regresson and random forests. Our study shows that support vector machnes show good generalzaton performance when appled to nosy marketng data. Nevertheless, the parameter optmzaton procedure plays an mportant role n the predctve performance. We show that only when the optmal parameter-selecton procedure s appled, support vector machnes outperform tradtonal logstc regresson, whereas random forests outperform both knds of support vector machnes. As a substantve contrbuton, an overvew of the most mportant churn drvers s gven. Unlke ample research, monetary value and frequency do not play an mportant role n explanng churn n ths subscrpton-servces applcaton. Even though most mportant churn predctors belong to the category of varables descrbng the subscrpton, the nfluence of several clent/company-nteracton varables cannot be neglected. Ó 2006 Elsever Ltd. All rghts reserved. Keywords: Data mnng; Churn predcton; Subscrpton servces; Support vector machnes; Parameter-selecton technque 1. Introducton * Correspondng author. Tel.: ; fax: E-mal addresses: Krstof.Coussement@UGent.be (K. Coussement), Drk.VandenPoel@UGent.be (D. Van den Poel). Nowadays, more and more companes start to focus on Customer Relatonshp Management, CRM. Indeed due to saturated markets and ntensve competton, a lot of companes do realze that ther exstng database s ther most valuable asset (Athanassopoulos, 2000; Jones, Mothersbaugh, & Beatty, 2000; Thomas, 2001). Ths trend s also notable n subscrpton servces. Companes start to shft away from ther tradtonal, mass marketng strateges, n favor of targeted marketng actons (Burez & Van den Poel, forthcomng). It s more proftable to keep and satsfy exstng customers than to constantly attract new customers who are characterzed by a hgh attrton rate (Renartz & Kumar, 2003). The dea of dentfyng those customers most prone to swtchng carres a hgh prorty (Keaveney & Parthasarathy, 2001). It has been shown that a small change n retenton rate can result n sgnfcant changes n contrbuton (Van den Poel & Larvère, 2004). In order to effectvely manage customer churn wthn a company, t s crucal to buld an effectve and accurate customer-churn model. To accomplsh ths, there are numerous predctvemodelng technques avalable. These data-mnng technques can effectvely assst wth the selecton of customers most prone to churn (Hung, Yen, & Wang, 2006). These technques vary n terms of statstcal technque (e.g., neural nets versus logstc regresson), varable-selecton method (e.g., theory versus stepwse selecton), number of /$ - see front matter Ó 2006 Elsever Ltd. All rghts reserved. do: /j.eswa

2 314 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) varables ncluded n the model, tme spent to buld the fnal model, as well as n terms of allocatng the tme across the dfferent tasks n the modelng process (Nesln, Gupta, Kamakura, Lu, & Mason, 2004). Ths study contrbutes to the exstng lterature by nvestgatng the effectveness of the support vector machnes (SVMs) approach n detectng customer churn n subscrpton servces. Ample research focuses on predctng customer churn n dfferent ndustres, ncludng nvestment products, nsurance, electrc utltes, health care provders, credt card provders, bankng, nternet servce provders, telephone servce provders, onlne servces,... Although SVMs have shown excellent generalzaton performance n a wde range of areas lke bonformatcs (Chen, Harrson, & Zhang, 2005; He, Hu, & Harrson, 2005; Zhong, He, Harrson, Ta, & Pan, forthcomng), beat recognton (Acr, 2006), automatc face authentcaton (Bcego, Grosso, & Tstarell, 2005), evaluaton of consumer loans (L, Shue, & Huang, 2006), estmatng producton values (Chen & Wang, 2007; Pa & Ln, 2005), text categorzaton (Bratko & Flpc, 2006), medcal dagnoss (Glotsos, Tohka, & Ravazoula, 2005), mage classfcaton (Km, Yang, & Seo, 2005) and hand-wrtten dgt recognton (Burges & Scholkopf, 1997; Cortes & Vapnk, 1995), the applcatons n marketng are rather scarce (Cu & Curry, 2005). To our knowledge only a few mplementatons of SVMs n a customer churn envronment are publshed (Km, Shn, & Park, 2005; Zhao, L, & L, 2005). Ths study wll extend the use of SVMs n a customer-churn context n two ways: (1) Unlke former studes that mplemented SVMs on a very small sample, ths study apples SVMs n a more realstc churn settng. Indeed, once a churn model has been bult, t must be able to accurately valdate a new marketng dataset whch contans n practce ten thousands of records and often a lot of nose. Ths study contrbutes to the exstng lterature by usng a suffcent sample sze for tranng and valdatng the SVM models n a subscrber churn framework. These SVMs are benchmarked to logstc regresson and state-of-the-art random forests. Nesln et al. (2004) concluded that logstc modelng may even outperform the more sophstcated technques (lke neural networks), whle n a marketng settng random forests already proved to be superor to other more tradtonal classfcaton technques (Bucknx & Van den Poel, 2005; Larvère & Van den Poel, 2005). (2) Before SVMs can be mplemented, several parameters have to be optmzed n order to construct a frst-class classfer. Extractng the optmal parameters s crucal when mplementng SVMs (Hsu, Chang, & Ln, 2004; Km, Shn et al., 2005; Km, Yang et al., 2005). Consequently, a fnetuned parameter-selecton procedure has to be appled. Hsu et al. (2004) proposed a grd search and a cross-valdaton to extract the optmal parameters for SVMs. Ths procedure tres dfferent parameter pars on the tranng set usng a cross-valdaton procedure. Hsu et al. (2004) propose to select that par of parameters wth the best cross-valdaton accuracy.e., percentage of cases correctly classfed (PCC). The second contrbuton of ths study les n extendng ths prncple by selectng one addtonal parameter par. Not only the parameters wth the best cross-valdaton accuracy as proposed by Hsu et al. (2004) are selected, also the parameter par whch results n the hghest cross-valdaton area under the recever operatng curve (AUC) s used. In contrast to PCC, AUC takes nto account the ndvdual class performance by use of the senstvty and specfcty for several thresholds on the classfer s posteror churn probabltes (Egan, 1975; Swets, 1989; Swets & Pckett, 1982). In the end, t s possble to compare the predctve performance of these two parameter-selecton technques wth that of logstc regresson and random forests. As a substantve contrbuton, an overvew of the most mportant churn predctors s gven wthn ths subscrpton-servces settng. As such, marketng managers gan nsght nto whch predctors are mportant n dentfyng churn. Consequently, t may be possble to adapt ther marketng strateges based on ths newly obtaned nformaton. Followng an ntroducton of the modelng technques (.e., SVMs, random forests and logstc regresson), Secton 3 explans the evaluaton measures used n ths study. The model-selecton procedure for SVMs s presented n Secton 4. Secton 5 presents the research data, whle Secton 6 explans the expermental results. Conclusons and drectons for future research are gven n Secton Modelng technques 2.1. Support vector machnes The SVM approach s a novel classfcaton technque based on neural network technology usng statstcal learnng theory (Vapnk, 1995, 1998). In a bnary classfcaton context, SVMs try to fnd a lnear optmal hyperplane so that the margn of separaton between the postve and the negatve examples s maxmzed. Ths s equvalent to solvng a quadratc optmzaton problem n whch only the support vectors,.e., the data ponts closest to the optmal hyperplane, play a crucal role. However, n practce, the data s often not lnearly separable. In order to enhance the feasblty of lnear separaton, one may transform the nput space va a non-lnear mappng nto a hgher dmensonal feature space. Ths transformaton s done by usng a kernel functon. There are some advantages n usng SVMs (Km, Shn et al., 2005; Km, Yang et al., 2005): (1) there are only two free parameters to be chosen, namely the upper bound and the kernel parameter; (2) the soluton of SVM s unque, optmal and global snce the tranng of a SVM s done by solvng a lnearly constraned quadratc problem; (3) SVMs are based on the structural rsk mnmzaton (SRM) prncple, whch means that ths type of classfer mnmzes the upper bound on the actual rsk, compared to other classfers whch mnmze the emprcal rsk. Ths results n a very good generalzaton performance.

3 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) We wll gve a general overvew of a SVM for a bnary classfcaton problem. For more detals about SVMs, we refer to the tutoral of Burges (1998). Gven a set of labeled tranng examples {x,y } wth = 1,2,3,...,N where y 2 { 1,1} and x 2 R n, and n the dmenson of the nput space. Suppose that the tranng data s lnearly separable, there exsts a weght vector w and a bas b such that the nequaltes w x þ b P 1; when y ¼ 1; ð1þ w x þ b 6 1; when y ¼ 1 ð2þ are vald for all elements of the tranng set. As such, we can rewrte these nequaltes n the form: y ðw x þ bþ P 1 wth ¼ 1; 2; 3;...; N: ð3þ Eq. (3) comes down to fnd two parallel boundares: B1: w x þ b ¼ 1; ð4þ B2: w x þ b ¼ 1 ð5þ at the opposte sdes of the optmal separatng hyperplane: H : w x þ b ¼ 0 wth margn wdth between the two boundares equal to 2/ kwk (Fg. 1). Thus one can fnd the par of boundares whch gves the maxmum margn by mnmzng 1 2 w2 ð7þ subject to y ðw x þ bþ P 1: ð8þ ð6þ maxmzng W ðaþ ¼ X a 1 X X a a j y 2 y j x x j j subject to a P 0 wth ¼ 1; 2; 3;...; N and X a y ¼ 0: The weght vector could be stated as follows: ð9þ ð10þ w ¼ X a y x : ð11þ The decson functon f(x) can be wrtten as " f ðxþ ¼sgnðw x þ bþ ¼sgn X # a y ðx x Þþb ; ð12þ where sgn s a sgn functon. In practce, the nput data wll often not be lnearly separable. However, one can stll mplement a lnear model by ntroducng a hgher dmensonal feature space to whch an nput vector s mapped va a non-lnear transformaton: H : X! X 0 ; x! Hðx Þ; ð13þ ð14þ where X s the nput space, H s the non-lnear transformaton and H(x ) represents the value of x mapped nto the hgher dmensonal feature space X 0. Therefore, Eq. (9) can be transformed to W ðaþ ¼ X a 1 X X a a j y 2 y j Hðx ÞHðx j Þ j ð15þ Ths constraned optmzaton problem can be solved usng the characterstcs of the Lagrange multplers (a) by subject to a P 0 wth ¼ 1; 2; 3;...; N and X a y ¼ 0: ð16þ By mappng the nput space nto a hgher dmensonal feature space, the problem of hgh dmensonalty and mplementaton complexty occurs. One can ntroduce the concept of nner product kernels. Consequently, there s no more need to know the exact value of H(x ), only the dot nner product s consdered whch facltates the mplementaton (Fg. 2). Kðx ; x j Þ¼Hðx ÞHðx j Þ ð17þ Fg. 1. Ths fgure shows the soluton for a bnary lnearly separable classfcaton problem. The boundares B1 and B2 separate the two classes. Data ponts on the boundares are called support vectors. Thus one tres to fnd the hyperplane H * where the margn s maxmal. Therefore, the decson functon becomes " # f ðxþ ¼sgn X a y HðxÞHðx Þþb " ¼ sgn X # a y Kðx x Þþb : ð18þ For resolvng ths decson functon, several types of kernel functons are avalable as gven n Table 1.

4 316 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) Random forests Fg. 2. The non-lnear boundary n the nput space s mapped va a kernel functon nto hgher dmensonal feature space. The data becomes lnearly separable n the feature space. Table 1 Overvew of the dfferent kernel functons Kernel functon Mathematcal form a Lnear kernel K(x,x )=(xæx ) Polynomal kernel of degree d K(x,x )=(cxæx + r) d Radal bass functon K(x,x ) = exp{ ckx x k 2 } Sgmod kernel wth r 2 N K(x,x ) = tanh(cxæx + r) a d, r 2 N; c 2 R +. It s possble to extend these deas to handle non-separable data. In ths case, the margn wll become very small and t wll be mpossble to separate the data wthout any msclassfcaton. To solve ths problem, we relax the constrants (1) and (2) by ntroducng postve slack varables (e) (Cortes & Vapnk, 1995). Eqs. (1) and (2) become w x þ b P 1 e ; when y ¼ 1; ð19þ w x þ b 6 1 þ e ; when y ¼ 1 ð20þ wth e P 0. Eqs. (19) and (20) can be rewrtten as y ðw x þ bþ P 1 e wth ¼ 1; 2; 3;...; N: ð21þ The goal of the optmzaton process s to fnd the hyperplane that maxmzes the margn and mnmzes the probablty of msclassfcaton: mnmze 1 2 w2 þ C X e ð22þ subject to y ðw x þ bþ P 1 e ð23þ wth C, the cost, the penalty parameter for the error term. The larger C, the hgher the penalty to errors. Adaptng Eq. (15) to the non-separable case, one receves the followng optmzaton problem: maxmzng W ðaþ¼ X subject to a 1 X X a a j y 2 y j Kðx ;x j Þ 0 6 a 6 C wth ¼ 1;2;3;...;N and X j a y ¼ 0: ð24þ ð25þ More detals concernng the optmzaton process can be found n Chang and Ln (2004). In a bnary classfcaton context, decson trees (DT) became very popular because of ther easness and nterpretablty (Duda, Hart, & Stork, 2001). Moreover, DTs have the ablty to handle covarates measured at dfferent measurement levels. One major problem wth DTs s ther hgh nstablty (Haste, Tbshran, & Fredman, 2001). A small change n the data often results n very dfferent seres of splts, whch s often suboptmal when valdatng the traned model. In the past, ths problem was extensvely researched. It was Breman (2001) who ntroduced a soluton to the prevously mentoned problem. The new classfcaton technque s called: Random Forests. Ths technque uses a subset of m randomly chosen predctors to grow each tree on a bootstrap sample of the tranng data. Typcally, ths number of selected varables.e., m s much lower than the total number of varables n the model. After a large number of trees are generated, each tree votes for the most popular class. By aggregatng these votes over the dfferent trees, each case s predcted a class label. Random forests are already appled n several domans lke bonformatcs, quanttatve crmnology, geology, pattern recognton, medcne,...however, the applcatons n marketng are rare (Bucknx & Van den Poel, 2005; Larvère & Van den Poel, 2005). Random forests are used as benchmark n ths study, manly for fve reasons: (1) Luo et al. (2004) stated that the predctve performance s among the best of the avalable technques. (2) The outcomes of the classfer are very robust to outlers and nose (Breman, 2001). (3) Ths classfer outputs useful nternal estmates of error, strength, correlaton and varable mportance (Breman, 2001). (4) Reasonable computaton tme s observed by Bucknx and Van den Poel (2005). (5) Random forests are easy to mplement because there are only two free parameters to be set, namely m, the number of randomly chosen predctors, and the total number of trees to be grown. We follow Breman s (2001) suggestons: m s set equal to the square root of the total number of varables.e., 9 because 82 explanatory varables are ncluded n the model and a large number of trees.e., 1000 are chosen Logstc regresson Logstc regresson s a well-known classfcaton technque for predctng a dchotomous dependent varable. In runnng a logstc regresson analyss, the maxmum lkelhood functon s produced and maxmzed n order to acheve an approprate ft to the data (Allson, 1999). Ths technque s very popular for manly three reasons: (1) logt modelng s conceptually smple (Buckln & Gupta, 1992). (2) A closed-form soluton for the posteror probabltes s avalable (n contrary to SVMs). (3) It provdes quck and robust results n comparson to other classfcaton technques (Nesln et al., 2004).

5 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) Evaluaton crtera After buldng a predctve model, marketers want to use these classfcaton models to predct future behavor. It s essental to evaluate the classfer n terms of performance. Frstly, the predctve model s estmated on a tranng set. Afterwards, ths model s valdated on an unseen dataset, the test set. It s essental to evaluate the performance on a test set, n order to ensure that the traned model s able to generalze well. For all three modelng technques, PCC, AUC and the top-decle lft are calculated. PCC, also known as accuracy, s undoubtedly the most commonly used evaluaton metrc of a classfer. Practcally, the posteror churn probabltes generated by the classfer are ranked from most lkely to churn to least lkely to churn. All cases above a certan threshold are classfed as churners; all cases havng a lower churn probablty are classfed as non-churners. In sum, PCC computes the rato of correctly classfed cases to the total number of cases to be classfed. It s mportant to notce that PCC s hghly dependent on the chosen threshold because only one threshold s consdered. Consequently, t does not gve an ndcaton how the performance wll vary when the cut-off s vared. Moreover, PCC does not consder the ndvdual class performance of a classfer. For example, wthn a skewed class dstrbuton, wrong predctons for the underrepresented class are very costly. Nevertheless, a model that predcts always the most common class thus neglectng the mnorty class stll provdes a relatvely good performance when evaluated on PCC. Unlke PCC, AUC takes nto account the ndvdual class performance for all possble thresholds. In other words, AUC wll compare the predcted class of an event wth the real class of that event, consderng all possble cut-off values for the predcted class. The recever operatng curve (ROC) s a graphcal plot of the senstvty.e., the number of true postves versus the total number of events and 1-specfcty.e., the number of true negatves versus the total number of non-events. The ROC can also be represented by plottng the fracton of true postves versus the fracton of false postves. The area under the recever operatng curve s used to evaluate the performance of a bnary classfcaton system (Hanley & McNel, 1982). In order to assess whether AUCs of the dfferent classfcaton technques are sgnfcantly dfferent from each other, the non-parametrc test of DeLong, DeLong, and Clarke-Pearson (1988) s used. In marketng applcatons, one s especally nterested n ncreasng the densty of the real events. The top 10% decle s an evaluaton measure that only focuses on the 10% cases most lkely to churn. Practcally, the cases are frst sorted from predcted most lkely to churn to predcted least lkely to churn. Afterwards, the proporton of real events n the top 10% most lkely to churn s compared wth the proporton of real events n the total dataset. Ths ncrease n densty s called the top-decle lft. For example, a top-decle lft of two means that the densty of churners n the top 10% s twce the densty of churners n the total dataset. The hgher the top-decle lft, the better the classfer. Potentally ths top-decle lft s very nterestng to target, because t contans a hgher number of real events. In other words, marketng analysts are nterested n just 10% of the customer base.e., those who are most lkely to churn because marketng budgets are lmted and actons to reduce churn would typcally nvolve only 10% of the entre lst of customers. 4. Model selecton for the support vector machnes Frst, we wll argue why the radal bass functon (RBF) kernel s used as the default kernel functon throughout ths study. Secondly, the grd-search method and cross-valdaton procedure for choosng the optmal penalty parameter C and kernel parameter c s explaned. In the thrd secton, two types of parameter-selecton technques are descrbed RBF kernel functon The RBF kernel functon s used as the default kernel functon wthn ths study, manly for four reasons (Hsu et al., 2004): (1) ths type of kernel makes t possble to map the non-lnear boundares of the nput space nto a hgher dmensonal feature space. So unlke the lnear kernel, the RBF kernel can handle a non-lnear relatonshp between the dependent and the explanatory varables. (2) In terms of performance Keerth and Ln (2003) concluded that the lnear kernel wth a parameter C has the same performance as the RBF kernel wth parameters (C,c). Ln and Ln (2003) showed that the sgmod kernel behaves lke the RBF kernel for certan parameters. (3) When lookng at the number of hyperparameters, the polynomal kernel has more hyperparameters than the RBF kernel. (4) The RBF kernel has less numercal dffcultes because the kernel values le between zero and one, whle the polynomal kernel values may go to nfnty or zero whle the degree s large. On the bass of these arguments, the RBF kernel s used as the default kernel functon Optmal parameter selecton usng grd search and cross-valdaton The RBF kernel needs two parameters to be set; C and c, wthc the penalty parameter for the error term and c as the kernel parameter. Both parameters play a crucal role n the performance of SVMs (Hsu et al., 2004; Km, Shn et al., 2005; Km, Yang et al., 2005). Improper selecton of these parameters can be counterproductve. Beforehand t s mpossble to know whch combnaton of (C,c) wll result n the hghest performance when valdatng the traned SVM to unseen data. Some knd of parameter-selecton procedure has to be done. Hsu et al. (2004) propose a grd search on C and c and a v-fold cross-valdaton on the tranng data. The goal of ths

6 318 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) procedure s to dentfy the optmal C and c, so that the classfer can accurately predct unseen data. A common way to accomplsh ths s 2-fold cross-valdaton, where the tranng set s dvded nto two parts of whch one s unseen n tranng the classfer. Ths performance better reflects the capabltes of the classfer n valdatng unknown data. More generally, n a v-fold cross-valdaton, the tranng data s splt nto v subsets of equal sze. Iteratvely, one part s left out for valdaton, whle the other remanng (v 1) parts are used for tranng. Fnally, each case n the tranng set s predcted once. The crossvaldaton performance wll better reflect the true performance as when valdatng the classfer to unseen data, whle the valdaton set stays untouched. In order to dentfy whch parameter par performs best, one can repeat ths procedure wth several pars of (C,c). As such t s possble to calculate a cross-valdated evaluaton measure for every parameter par. In the end, t s possble to select these parameters based on the best cross-valdated performance Two parameter-selecton technques In ths study, a grd search on C and c s performed on the tranng set usng a 5-fold cross-valdaton. The grd search s realzed by evaluatng exponental sequences of C and c (.e., C =2 5, 2 3,...,2 13 ; c =2 3, 2 1,...,2 15 ). Bascally, all combnatons of (C, c) are tred and two pars of parameters are restraned: (1) the one wth the best crossvaldated accuracy as proposed by Hsu et al., 2004 and (2) the one wth the bggest cross-valdated area under the recever operatng curve. Ths addtonal parameter par s selected for the reason that unlke PCC, AUC consders the senstvty and specfcty as ndvdual class performance metrcs over all possble thresholds. Once these optmal parameter pars are obtaned, the whole tranng set s traned agan. Both classfers wll be used to valdate an unseen dataset. In the end, one can compare and benchmark the performance of both knds of SVMs. 5. Research data For the purpose of ths study, data from a Belgan newspaper publshng company s used. The subscrbers have to pay a fxed amount of money dependng on the length of subscrpton and the promotonal offer gven. The company does not allow endng the subscrpton pror to the maturty date. The churn-predcton problem n ths subscrpton context comes down to predctng whether the subscrpton wll/wll not be renewed wthn a perod of four weeks after the maturty date. Durng ths four-week perod, the company stll delvers the newspapers to the subscrbers. In ths way, the company gves the subscrbers the opportunty to renew ther subscrpton. Fg. 3 graphcally traces back the tme wndow of analyss. We use subscrpton data from January 2002 through September Usng ths tme frame, t s possble to derve the dependent Fg. 3. Graphcal dsplay of the tme wndow used to buld the churn model. varable and the explanatory varables. For constructng the dependent varable, the renewal ponts between July 2004 and July 2005 are consdered. Consequently, a customer s consdered as churner when hs/her subscrpton s not renewed wthn four weeks after the expry date. The explanatory varables contan nformaton coverng a 30- month perod returnng from every ndvdual renewal pont. These varables contan nformaton about clent/ company-nteractons, renewal-related nformaton, socodemographcs and subscrpton-descrbng nformaton (see Appendx A). Ths varety of nformaton s gathered at two levels: subscrpton level and subscrber level. At the subscrpton level, all nformaton from the current subscrpton s ncluded, whle at the subscrber level, all nformaton related to the subscrber s covered. For nstance, one can calculate the total number of complants on the current subscrpton only.e., the subscrpton level whle one can also consder the total number of complants of a subscrber coverng all hs/her subscrptons.e., subscrber level. Fnally, one ends up wth an ndvdual tmelne per subscrber for every renewal pont n the tme nterval. We decded to randomly select two samples of suffcent sze; the tranng set s used to estmate the model, whle the test set s used to valdate the model. The tranng set con- Table 2 Dstrbuton of the tranng set and test set Number of observatons Tranng set Subscrptons not renewed 22, Subscrptons renewed 22, Total 45, Relatve percentage Test set Subscrptons not renewed Subscrptons renewed 39, Total 45,

7 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) Table 3 The cross-valdated accuracy per (C, c) c C Table 4 The cross-valdated performance (AUC) per (C, c) c C tans as many churners as non-churners because many authors emphasze the need for a balanced tranng sample n order to relably dfferentate between defectors and nondefectors (Dekmpe & Degraeve, 1997; Rust & Metters, 1996; Yamaguch, 1992). So t s not uncommon to tran a model wth a non-natural dstrbuton (Chan & Stolfo, 1998; Wess & Provost, 2001). The test set contans a proporton of churners that s representatve for the true populaton n order to approxmate the predctve performance n a real-lfe stuaton. For both datasets, all varables are constructed n the same way. The explanatory varables are compled over a 30-month perod, whle the dependent varable contans nformaton whether the subscrpton wll/wll not be renewed. 6. Emprcal analyss 6.1. SVM models After conductng the grd search on the tranng data, the optmal (C,c) s(2 13,2 7 ) wth a cross-valdated accuracy of %. Table 3 summarzes the results of the grd search usng the cross-valdated accuracy as an evaluaton crteron. Furthermore, parameter par (2 7,2 7 ) results n the hghest cross-valdated AUC, Table 4 consders the results of the grd-search procedure wth the cross-valdated AUC as a performance measure. These two parameters pars are used to tran a model on the complete tranng set. Two SVMs are obtaned, namely SVM acc 1 and SVM auc. 2 Fnally, both models can be valdated on a test set. On the one hand, one can compare the performance among both SVMs, whle on the other hand both SVMs can be benchmarked wth the performance of logstc regresson and random forests Comparng predctve performance among both knds of SVMs In ths secton, a comparson s made between the predctve performance of SVM acc and SVM auc. The evaluaton s performed n terms of AUC, PCC and top-decle lft. Both models are traned on a balanced tranng set, whle n the end these classfers have to be evaluated on a dataset whch represents the actual densty of churners (see Table 2). In order to assess the senstvty of the results to the actual proporton of churners n the dataset, we wll compare the performance of both SVMs on artfcal test sets wth dfferent class dstrbutons. More specfcally, 1 SVM acc = SVM generated usng parameters based on the model wth the best cross-valdated accuracy durng grd search. 2 SVM auc = SVM generated usng parameters based on the model wth the best cross-valdated AUC durng grd search.

8 320 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) svmauc svmacc AUC % 40% 30% 20% 18% 16% 14% 11.14% % churners Fg. 4. Area under the recever operatng curve for SVM acc and SVM auc appled to several test sets wth dfferent class dstrbutons svmauc svmacc PCC % 40% 30% 20% 18% 16% 14% 11.14% % churners Fg. 5. Percentage correctly classfed for SVM acc and SVM auc appled to several test sets wth dfferent class dstrbutons. 3.e., the dstrbuton that contans the proporton of churners that s representatve for the true populaton. we compare the natural dstrbuton 3 (11.14% churners) wth the artfcal ones (50%, 40%, 30%, 20%, 18%, 16%, 14%). These artfcal sets are created by randomly undersamplng the real test set.e., the one wth 11.14% churners. Fgs. 4 6 and Table 5 depct the performance of SVM acc and SVM auc for the dfferent class dstrbutons. As such a comparson can be made between both SVMs. As one may observe from Fg. 4, SVM auc performs better than SVM acc wthn all class dstrbutons n terms of AUC performance. In order to ensure that the dfferences n AUC are sgnfcant, the test proposed by DeLong et al. (1988) s appled. As such one can compare f the AUCs between SVM acc and SVM auc are sgnfcantly dfferent wthn a certan class dstrbuton. Table 5 reveals that on all test sets that contan 30% churners or less, SVM auc sgnfcantly outperforms SVM acc on a 90% confdence level (DeLong et al., 1988). When valdated on the natural dstrbuton, SVM auc sgnfcantly outperforms SVM acc at the 95% confdence level. Fg. 5 shows the performance of both SVMs n terms of PCC. Despte the fact that the dfferences n PCC are rather small, one may observe that SVM auc does not have an nferor performance compared to SVM acc when comng closer to the natural dstrbuton. Prevous fndngs are confrmed when evaluatng both SVMs usng the top-decle lft. There s a gap n top-decle lft between SVM acc and SVM auc. SVM auc has a hgher top-decle lft compared to SVM acc. Ths gap ncreases when devatng from the orgnal tranng dstrbuton.e., the one wth 50% churners. On the natural dstrbuton, SVM auc succeeds n retanng more churners wthn the top 10% customer most lkely to churn n comparson to SVM acc.

9 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) svmauc svmacc top-decle % 40% 30% 20% 18% 16% 14% 11.14% % churners Fg. 6. Top-decle lft for SVM acc and SVM auc appled to several test sets wth dfferent class dstrbutons. Table 5 Parwse comparson of performance (AUC) among several test sets usng dfferent class dstrbutons Number of churners (%) SVM acc SVM auc (1); (1); (1); a (1); a (1); a,b (1); a,b (1); a,b (1); a,b v 2 (df); p-value: a Sgnfcantly dfferent on 90% confdence level. b Sgnfcantly dfferent on 95% confdence level. Table 6 compares the predctve capabltes between SVM acc and SVM auc on the real test set (see Table 2). One can clearly see the gap n performance. SVM auc exhbts better predctve performance than SVM acc when both models are evaluated on the real test set. In terms of PCC, the ncrease s 0.55% ponts. There s also a sgnfcant mprovement n AUC of 0.24 (DeLong et al., 1988). Wth respect to the top-decle lft, an ncrease from to s acheved. In sum, when a SVM s traned wth a non-natural dstrbuton, t may be better to select ts parameters durng the grd search based on the cross-valdated AUC. The new parameter-selecton technque sgnfcantly mproves the AUC and the top-decle lft of the model, whle accuracy s certanly not decreased. Table 6 The performance of SVM acc and SVM auc : PCC and AUC on the real test set PCC AUC Top-decle lft SVM acc a SVM auc a a Sgnfcantly dfferent on 95% confdence level. In the followng part, we compare the performance of both knds of SVMs wth logstc regresson and random forests Comparng predctve performance of SVMs, logt and random forests The evaluaton measures on the real test set (see Table 2) for all models are represented n Tables 7 9. Table 7 compares the predctve performance of logt, random forests, SVM acc and SVM auc n terms of PCC and AUC. Table 8 shows the results from the test of DeLong et al. (1988) whch nvestgates f the AUCs of two models are sgnfcantly dfferent. One can fnd the top-decle lft for all models n Table 9. Addtonally, Tables 7 9 gve nformaton concernng the performance of SVM acc and SVM auc benchmarked to Table 7 The performance of the dfferent algorthms: PCC and AUC on the real test set Model PCC AUC Logt Random forests SVM acc SVM auc Table 8 Parwse comparson of performance (AUC) on the real test set Random forests SVM acc SVM auc Logt (1) a 2.53 (1) b,c (1) a Random forests (1) a (1) a SVM acc 6.04 (1) a Ch 2 (df): a Sgnfcantly dfferent on 95% confdence level. b Equal on a 95% confdence level. c Equal on a 90% confdence level.

10 322 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) Table 9 The performance of the dfferent algorthms: top-decle lft on the real test Logt Random forests SVM acc SVM auc logstc regresson. Only SVM auc dffers sgnfcantly n terms of predctve performance when compared to logstc regresson. In contrast to SVM auc, SVM acc classfes fewer cases correctly than logstc regresson. Moreover the test of DeLong et al. (1988) confrms that the AUC of SVM acc s not sgnfcantly dfferent from that of the logstc regresson. The need to select the rght parameter-selecton technque s confrmed when lookng at the top-decle lft crteron. SVM auc dentfes more churners than logstc regresson, whle the top-decle lft of SVM acc s lower than that of logt. From Tables 7 9, one can also compare the performance of both SVMs wth the performance of the random forests. It s clear that despte the parameter-selecton technque, SVMs are surpassed by random forests. In sum, t s shown that the parameter-selecton technque nfluences the predctve performance of SVMs. Consequently, when a SVM s traned on a balanced dstrbuton, t may be vable and preferable to consder other than the tradtonal parameter-selecton methods. Each mprovement n predctve performance wll result n a better return on nvestment of subscrber-retenton actons based on these predcton models. In ths study, SVMs are traned on a non-natural dstrbuton; t s shown that selectng the parameters based on the best cross-valdated AUC results n a better performance than when selectng them based on the hghest cross-valdated accuracy as was suggested n Hsu et al. (2004). In sum, one may say that choosng the rght parameter-selecton technque s vtal for optmzng a SVM applcaton. In the end, t would also be counterproductve to smply rely on tradtonal technques lke logstc regresson. SVMs n combnaton wth the correct parameter-selecton technque and random forests, both outperform logstc regresson. Nevertheless, n ths study random forests are better n predctng churn n the subscrpton servces than SVMs Varable mportance In ths secton, an overvew of the most mportant varables s gven. Ths s done based on the outcome of the random forest mportance measures for manly two reasons: () Random forests gve the best predctve performance compared to logstc regresson and SVM. () Unlke random forests, the SVM software does not produce an nternal rankng of varable mportance. Moreover, we do not report any measures for logstc regresson e.g., standardzed estmates because most measures are prone to multcollnearty. However, ths s not a problem when the focus les manly on predcton. In ths study, we wll elaborate the top-10 most mportant churn predctors. It s clear from Appendx B that the length of the subscrpton and recency.e., elapsed tme snce last renewal whch both belong to the category of varables descrbng a subscrpton 4 are ranked on top. Furthermore, another varable from the same category.e., the month of contract expraton s part of the top-10 most explanng churn varables. In contrast to extant research (e.g., Bauer, 1988), monetary value and frequency.e., the number of renewal ponts are not present wthn the top-10 lst of most mportant churn predctors n ths study. Although most mportant churn predctors are varables that belong to the group of varables descrbng a subscrpton, the mpact of some clent/company-nteracton varables cannot be neglected when nvestgatng the top-10 lst of most mportant varables: () Varables related to the ablty of voluntarly suspendng the subscrpton durng holday, durng a busness trp,... are present n the top-10. () Recency of complanng.e., the elapsed tme snce the last complant s also present n the top-10 most mportant churn predctors. Consequently, effcent-complant handlng strateges are mportant. Tax, Brown, and Chandrashekaran (1998) already stated that companes do not deal successfully wth servce falures because most companes underestmate the mpact of effcent complant handlng. () Moreover, ths study shows that the varable whch ndcates whether or not a subscrpton started from own ntatve belongs to the top-10 lst n contrast to smlar varables related to other purchase motvators lke drect malng campagns, tele-marketng actons, face-toface promotons,... In spte of the mportance of age, one can conclude that soco-demographcs do not play an mportant role n explanng churn n ths study whch confrms the fndng of Guadagn and Lttle (1983) and more recently, Ross, McCulloch, and Allenby (1996). 7. Conclusons and future research In ths study, we show that SVMs are able to predct churn n subscrpton servces. By mappng non-lnear nputs nto a hgh-dmensonal feature space, SVMs break down complex problems nto smpler dscrmnant functons. Because SVMs are based on the Structural Rsk Mnmzaton prncple that mnmzes the upper bound on the actual rsk, they show a very good performance when appled to a new, nosy marketng dataset. To valdate the performance of ths novel technque, we statstcally compare ts predctve performance wth those of logstc regresson and random forests. It s shown that a SVM whch s traned on a balanced dstrbuton outperforms a logstc regresson only when the approprate parameter-selecton technque s appled. However, when compar- 4 See Appendx A.

11 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) ng the predctve capabltes of these SVMs wth state-ofthe-art random forests, our study ndcates that SVMs are surpassed by the random forests. Partcularly n ths study, we mplement a grd search usng a 5-fold cross-valdaton for obtanng the optmal upper bound C and kernel parameter c that are the most mportant when mplementng a SVM. Ths study offers an alternatve parameter-selecton technque that outperforms the prevously used technque by Hsu et al. (2004). The way n whch the optmal parameters are selected, can have sgnfcant nfluences on the performance of a SVM. Takng nto account alternatve parameter-selecton technques s crucal because even the smallest change n predctve performance can have sgnfcantly ncreases n the return on nvestment of the marketng-retenton actons based on these predcton models (Van den Poel & Larvère, 2004). In addton, one can say that academcs as well as practtoners don t have to smply rely on tradtonal technques lke logstc regresson. SVMs n combnaton wth the rght parameter-selecton technque and random forests offer some alternatves. Nevertheless, a trade-off has to be made between the tme allocated to the modelng procedure and the performance acheved. In ths study, most mportant churn predctors are part of the group of varables descrbng the subscrpton. Unlke ample research, monetary value and frequency are not present n the top-10 most mportant churn drvers. On the other hand, several clent/company-nteracton varables play an mportant role n predctng churn. In spte of the mportance of age, soco-demographcs do not play an mportant role n explanng churn n ths study. Drectons for future research are gven by the fact that nowadays there s no complete workng meta-theory to assst wth the selecton of the correct kernel functon and SVM parameters. Dervng a procedure to select the proper kernel functon and correct parameter values accordng to a specfc type of classfcaton problem s an nterestng topc for further research. Furthermore, applyng SVMs usng a suffcent sample sze can be very tme-consumng due to the long computatonal tme and often requres specfc software. Before SVMs can be wdely adopted, easy-to-use computer software should be avalable n the tradtonal data mnng packages. Acknowledgements We would lke to thank the anonymous Belgan publshng company for dsposng ther data. Next, we also lke to thank (1) Ghent Unversty for fundng the PhD project of Krstof Coussement (BOF 01D26705) and (2) the Flemsh government and Ghent Unversty (BOF equpment 011B5901) for fundng our computng resources durng ths project. Also specal thanks to L. Breman (z) for freely dstrbutng the random forest software, as well as C.-C. Chang and C.-J Ln for sharng ther SVM-toolbox, LIBSVM. Appendx A. Explanatory varables ncluded n the churnpredcton model Clent/company-nteracton varables: varables descrbng the clent/company relatonshp the number of complants; elapsed tme snce the last complant; the average cost of a complant (n terms of compensaton newspapers); the average postonng of the complants n the current subscrpton; the purchase motvator of the subscrpton; how the newspaper s delvered; the conversons made n dstrbuton channel, payment method & edton; elapsed tme snce last converson n dstrbuton channel, payment method & edton; the number of responses on drect marketng actons; the number of suspensons; the average suspenson length (n number of days); elapsed tme snce last suspenson; elapsed tme snce last response on a drect marketng acton; the number of free newspapers. Renewal-related varable: varables contanng renewalspecfc nformaton whether the prevous subscrpton was renewed before the expry date; how many days before the expry date, the prevous subscrpton was renewed; the average number of days the prevous subscrptons are renewed before expry date; the varance n the number of days the prevous subscrptons are renewed before expry date; elapsed tme snce last step n renewal procedure; the number of tmes the churner dd not renew a subscrpton. Soco-demographc varables: varables descrbng the subscrber age, whether the age s known, gender, physcal person (s the subscrber a company or a physcal person), whether contact nformaton (telephone, moble number, emal) s avalable. Subscrpton-descrbng varables: group of varables descrbng the subscrpton elapsed tme snce last renewal, monetary value,

12 324 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) the number of renewal ponts, the length of the current subscrpton, the number of days a week the newspaper s delvered (ntensty ndcaton), what product the subscrber has, the month of contract expraton. Appendx B. Varable mportance measures No. AvgNormImp Varable name Level a Relatve varable b The length of the current subscrpton Subscrpton Elapsed tme snce last renewal Subscrpton Elapsed tme snce last suspenson Subscrber Elapsed tme snce last suspenson Subscrpton The month of contract expraton Subscrpton Age Subscrber Elapsed tme snce last complant Subscrber The average suspenson length (n number of days) Subscrber X The purchase motvator of the subscrpton: own ntatve Subscrpton The average suspenson length (n number of days) Subscrber Elapsed tme snce last complant Subscrpton Monetary value Subscrpton Elapsed tme snce last step n renewal procedure Subscrpton Physcal person: physcal person YES/NO Subscrber The varance n the number of days the prevous subscrptons are Subscrber renewed before expry date The average number of days the prevous subscrptons are renewed Subscrber before expry date Elapsed tme snce last response on a drect marketng acton Subscrber The average number of days the prevous subscrptons are renewed Subscrpton before expry date The number of renewal ponts Subscrpton The number of suspensons Subscrber X The average suspenson length (n number of days) Subscrpton X The purchase motvator of the subscrpton: drect marketng acton Subscrpton The number of suspensons Subscrpton X How many days before the expry date, the prevous subscrpton was Subscrpton renewed Elapsed tme snce last converson n payment method Subscrber Elapsed tme snce last converson n payment method Subscrpton The number of complants Subscrber X The average postonng of the complants n the current subscrpton Subscrpton The conversons made n payment method Subscrpton The average suspenson length (n number of days) Subscrpton The conversons made n payment method Subscrpton X The number of responses on drect marketng actons Subscrber X The varance n the number of days the prevous subscrptons are Subscrpton renewed before expry date The conversons made n payment method Subscrber What product the subscrber has: edton X Subscrpton The purchase motvator of the subscrpton: tele-marketng acton Subscrpton What product the subscrber has: edton Y Subscrpton The conversons made n payment method Subscrber X Elapsed tme snce last converson n edton Subscrber The number of responses on drect marketng actons Subscrber (contnued on next page)

13 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) Appendx B (contnued) No. AvgNormImp Varable name Level a Relatve varable b Elapsed tme snce last converson n dstrbuton channel Subscrpton The number of suspensons Subscrber The number of complants Subscrpton X Whether the prevous subscrpton was renewed before the Subscrpton expry date Elapsed tme snce last converson n edton Subscrpton The purchase motvator of the subscrpton: promotonal Subscrpton offer The number of suspensons Subscrpton Elapsed tme snce last converson n dstrbuton channel Subscrber The number of complants Subscrber How the newspaper s delvered: prvate dstrbuton channel Subscrpton Gender: female YES/NO Subscrber The number of complants Subscrpton Physcal person: company YES/NO Subscrber How the newspaper s delvered: ndvdual newsboy Subscrpton Whether the age s known Subscrber The number of tmes the subscrber dd not renew a Subscrber subscrpton The conversons made n dstrbuton channel Subscrber X The conversons made n edton Subscrber X The conversons made n dstrbuton channel Subscrber The purchase motvator of the subscrpton: drect marketng Subscrpton acton The purchase motvator of the subscrpton: face-to-face Subscrpton marketng The conversons made n edton Subscrber The conversons made n dstrbuton channel Subscrpton The conversons made n edton Subscrpton X The conversons made n dstrbuton channel Subscrpton X What product the subscrber has: edton Z Subscrpton The conversons made n edton Subscrpton The average cost of a complant (n terms of compensaton Subscrber X newspapers) The average cost of a complant (n terms of compensaton Subscrpton X newspapers) Gender: male YES/NO Subscrber The average cost of a complant (n terms of compensaton Subscrber newspapers) How the newspaper s delvered: publc dstrbuton channel Subscrpton Gender: prvate company YES/NO Subscrber The purchase motvator of the subscrpton: drect marketng Subscrpton malng acton How the newspaper s delvered: pck up newspaper at shop Subscrpton The average cost of a complant (n terms of compensaton Subscrpton newspapers) Gender: publc company YES/NO Subscrber The number of free newspapers Subscrpton The number of days a week the newspaper s delvered Subscrpton (ntensty ndcaton) Whether contact nformaton (telephone, moble number, Subscrber emal) s avalable How the newspaper s delvered: delvered abroad va courer Subscrpton What product the subscrber has: edton W Subscrpton a See Secton 5: Research data. b Correcton of the varable by usng the length of subscrpton.

14 326 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) References Acr, N. (2006). A support vector machne classfer algorthm based on a perturbaton method and ts applcaton to ECG beat recognton systems. Expert Systems wth Applcatons, 31(1), Allson, P. D. (1999). Logstc regresson usng the SAS system: Theory and applcaton. Cary, NC: SAS Insttute Inc. Athanassopoulos, A. D. (2000). Customer satsfacton cues to support market segmentaton and explan swtchng behavor. Journal of Busness Research, 47(3), Bauer, C. L. (1988). A drect mal customer purchase model. Journal of Drect Marketng, 2(3), Bcego, M., Grosso, E., & Tstarell, M. (2005). Face authentcaton usng one-class support vector machnes. Lecture Notes n Computer Scence, 3781, Bratko, A., & Flpc, B. (2006). Explotng structural nformaton for sem-structured document categorzaton. Informaton Processng and Management, 42(3), Breman, L. (2001). Random forests. Machne Learnng, 45(1), Bucknx, W., & Van den Poel, D. (2005). Customer base analyss: Partal defecton of behavourally loyal clents n a non-contractual FMCG retal settng. European Journal Of Operatonal Research, 164(1), Buckln, R. E., & Gupta, S. (1992). Brand choce, purchase ncdence and segmentaton: An ntegrated modelng approach. Journal of Marketng Research, 29, Burez, J., & Van den Poel, D. (forthcomng). CRM at Canal+ Belgque: Reducng customer attrton through targeted marketng. Expert Systems wth Applcatons. Burges, C. J. C., & Scholkopf, B. (1997). Improvng the accuracy and speed of support vector machnes. In M. Mozer, M. Jordan, & T. Petche (Eds.), Advances n neural nformaton processng systems. Cambrdge, MA: MIT Press. Burges, C. J. C. (1998). A tutoral on support vector machnes for pattern recognton. Data Mnng and Knowledge Dscovery, 2(2), Chan P. K., & Stolfo S. J. (1998). Learnng wth non-unform class and cost dstrbutons: A case study n credt card fraud detecton. Proceedngs fourth nternatonal conference on knowledge dscovery and data mnng (pp ). Chang, C.-C., & Ln, C.-J. (2004). LIBSVM: A lbrary for support vector machnes. Techncal Report, Department of Computer Scence and Informaton Engneerng, Natonal Tawan Unversty. Chen, K.-Y., & Wang, C.-H. (2007). A hybrd SARIMA and support vector machnes n forecastng the producton values of the machnery ndustry n Tawan. Expert Systems wth Applcatons, 32(1), Chen, X. J., Harrson, R., & Zhang, Y. Q. (2005). Mult-SVM fuzzy classfcaton and fuson method and applcatons n bonformatcs. Journal of Computatonal and Theoretcal Nanoscence, 2(4), Cortes, C., & Vapnk, V. (1995). Support-vector networks. Machne Learnng, 20(3), Cu, D., & Curry, D. (2005). Predctons n marketng usng the support vector machne. Marketng Scence, 24(4), Dekmpe, M. G., & Degraeve, Z. (1997). The attrton of volunteers. European Journal of Operatonal Research, 98(1), DeLong, E. R., DeLong, D. M., & Clarke-Pearson, D. L. (1988). Comparng the areas under two or more correlated recever operatng characterstc curves: A nonparametrc approach. Bometrcs, 44(3), Duda, R. O., Hart, P. E., & Stork, D. G. (2001). Pattern classfcaton. New York: Wley. Egan, J. P. (1975). Sgnal detecton theory and ROC analyss. Seres n cognton and percepton. New York: Academc Press. Glotsos, D., Tohka, J., & Ravazoula, P. (2005). Automated dagnoss of bran tumours astrocytomas usng probablstc neural network clusterng and support vector machnes. Internatonal Journal of Neural Systems, 15(1 2), Guadagn, P. M., & Lttle, J. D. C. (1983). A logt model of brand choce calbrated on scanner data. Marketng Scence, 2(3), Hanley, J. A., & McNel, B. J. (1982). The meanng and use of the area under a recever operatng characterstc (ROC) curve. Radology, 143(1), Haste, T., Tbshran, R., & Fredman, J. (2001). The elements of statstcal learnng: Data mnng, nference and predcton. New York: Sprnger-Verlag. He, J. Y., Hu, H. J., & Harrson, R. (2005). Understandng proten structure predcton usng SVM_DT. Lecture Notes n Computer Scence, 3759, Hsu, C.-W., Chang, C.-C., & Ln, C.-J. (2004). A practcal gude to support vector classfcaton. Techncal Report, Department of Computer Scence and Informaton Engneerng, Natonal Tawan Unversty. Hung, S.-Y., Yen, D. C., & Wang, H.-Y. (2006). Applyng data mnng to telecom churn management. Expert Systems wth Applcatons, 31(3), Jones, M. A., Mothersbaugh, D. L., & Beatty, S. E. (2000). Swtchng barrers and repurchase ntentons n servces. Journal of Retalng, 76(2), Keaveney, S., & Parthasarathy, M. (2001). Customer swtchng behavor n onlne servces: An exploratory study of the role of selected atttudnal, behavoral and demographc factors. Journal of the Academy of Marketng Scence, 29(4), Keerth, S. S., & Ln, C.-J. (2003). Asymptotc behavours of support vector machnes wth Gaussan kernel. Neural Computaton, 15(7), Km, S., Shn, K. S., & Park, K. (2005). An applcaton of support vector machnes for customer churn analyss: Credt card case. Lecture Notes n Computer Scence, 3611, Km, S. K., Yang, S., & Seo, K. S. (2005). Home photo categorzaton based on photographc regon templates. Lecture Notes n Computer Scence, 3689, Larvère, B., & Van den Poel, D. (2005). Predctng customer retenton and proftablty by usng random forests and regresson forests technques. Expert Systems Wth Applcatons, 29(2), L, S.-T., Shue, W., & Huang, M.-H. (2006). The evaluaton of consumer loans usng support vector machnes. Expert Systems wth Applcatons, 30(4), Ln, H.-T., & Ln, C.-J. (2003). A study on sgmod kernels for SVM and the tranng of non-psd kernels by SMO-type methods. Techncal report, Department of Computer Scence and Informaton Engneerng, Natonal Tawan Unversty. Luo, T., Kramer, K., Goldgof, D. B., Hall, L. O., Samson, S., Remsen, A., et al. (2004). Recognzng plankton mages from the shadow mage partcle proflng evaluaton recorder. IEEE Transactons on Systems Man and Cybernetcs Part B Cybernetcs, 34(4), Nesln, S. A., Gupta, S., Kamakura, W., Lu, J., & Mason, C. (2004). Defecton detecton: Improvng predctve accuracy of customer churn models. Workng Paper. Pa, P. F., & Ln, C. S. (2005). Usng support vector machnes to forecast the producton values of the machnery ndustry n Tawan. Internatonal Journal of Advanced Manufacturng Technology, 27(1 2), Renartz, W., & Kumar, V. (2003). The mpact of customer relatonshp characterstcs on proftable lfetme duraton. Journal of Marketng, 67(1), Ross, P. E., McCulloch, R. E., & Allenby, G. M. (1996). Value of household nformaton n target marketng. Marketng Scence, 15, Rust, R. T., & Metters, R. (1996). Mathematcal models of servce. European Journal of Operatonal Research, 91(3), Swets, J. A. (1989). ROC analyss appled to the evaluaton of medcal magng technques. Investgatve Radology, 14,

15 K. Coussement, D. Van den Poel / Expert Systems wth Applcatons 34 (2008) Swets, J. A., & Pckett, R. M. (1982). Evaluaton of dagnostc systems: Methods from sgnal detecton theory. New York: Academc Press. Tax, S. S., Brown, S. W., & Chandrashekaran, M. (1998). Customer evaluatons of servce complant experences: Implcatons for relatonshp marketng. Journal of Marketng, 62(Aprl), Thomas, J. S. (2001). A methodology for lnkng customer acquston to customer retenton. Journal of Marketng Research, 38(2), Van den Poel, D., & Larvère, B. (2004). Customer attrton analyss for fnancal servces usng proportonal hazard models. European Journal of Operatonal Research, 157, Vapnk, V. (1998). Statstcal learnng theory. New York: Wley. Vapnk, V. (1995). The nature of statstcal learnng theory. New York: Sprnger. Wess, G., & Provost, F. (2001). The effect of class dstrbuton on classfer learnng. Techncal Report ML-TR-43, Department of Computer Scence, Rutgers Unversty. Yamaguch, K. (1992). Accelerated falure-tme regresson models wth a regresson model of survvng fracton: An applcaton to the analyss of permanent employment n Japan. Journal of the Amercan Statstcal Assocaton, 87(418), Zhao, Y., L, B., & L, X. (2005). Customer churn predcton usng mproved one-class support vector machne. Lecture Notes n Artfcal Intellgence, 3584, Zhong, W., He, J., Harrson, R., Ta, P. C., & Pan, Y. (forthcomng). Clusterng support vector machnes for proten local structure predcton. Expert Systems wth Applcatons.

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements Lecture 3 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there

More information

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange

More information

The Greedy Method. Introduction. 0/1 Knapsack Problem

The Greedy Method. Introduction. 0/1 Knapsack Problem The Greedy Method Introducton We have completed data structures. We now are gong to look at algorthm desgn methods. Often we are lookng at optmzaton problems whose performance s exponental. For an optmzaton

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

Single and multiple stage classifiers implementing logistic discrimination

Single and multiple stage classifiers implementing logistic discrimination Sngle and multple stage classfers mplementng logstc dscrmnaton Hélo Radke Bttencourt 1 Dens Alter de Olvera Moraes 2 Vctor Haertel 2 1 Pontfíca Unversdade Católca do Ro Grande do Sul - PUCRS Av. Ipranga,

More information

L10: Linear discriminants analysis

L10: Linear discriminants analysis L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

CHAPTER 14 MORE ABOUT REGRESSION

CHAPTER 14 MORE ABOUT REGRESSION CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

Performance Analysis and Coding Strategy of ECOC SVMs

Performance Analysis and Coding Strategy of ECOC SVMs Internatonal Journal of Grd and Dstrbuted Computng Vol.7, No. (04), pp.67-76 http://dx.do.org/0.457/jgdc.04.7..07 Performance Analyss and Codng Strategy of ECOC SVMs Zhgang Yan, and Yuanxuan Yang, School

More information

Statistical Methods to Develop Rating Models

Statistical Methods to Develop Rating Models Statstcal Methods to Develop Ratng Models [Evelyn Hayden and Danel Porath, Österrechsche Natonalbank and Unversty of Appled Scences at Manz] Source: The Basel II Rsk Parameters Estmaton, Valdaton, and

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001 Proceedngs of the Annual Meetng of the Amercan Statstcal Assocaton, August 5-9, 2001 LIST-ASSISTED SAMPLING: THE EFFECT OF TELEPHONE SYSTEM CHANGES ON DESIGN 1 Clyde Tucker, Bureau of Labor Statstcs James

More information

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services An Evaluaton of the Extended Logstc, Smple Logstc, and Gompertz Models for Forecastng Short Lfecycle Products and Servces Charles V. Trappey a,1, Hsn-yng Wu b a Professor (Management Scence), Natonal Chao

More information

1. Measuring association using correlation and regression

1. Measuring association using correlation and regression How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a

More information

Improved SVM in Cloud Computing Information Mining

Improved SVM in Cloud Computing Information Mining Internatonal Journal of Grd Dstrbuton Computng Vol.8, No.1 (015), pp.33-40 http://dx.do.org/10.1457/jgdc.015.8.1.04 Improved n Cloud Computng Informaton Mnng Lvshuhong (ZhengDe polytechnc college JangSu

More information

Discussion Papers. Support Vector Machines (SVM) as a Technique for Solvency Analysis. Laura Auria Rouslan A. Moro. Berlin, August 2008

Discussion Papers. Support Vector Machines (SVM) as a Technique for Solvency Analysis. Laura Auria Rouslan A. Moro. Berlin, August 2008 Deutsches Insttut für Wrtschaftsforschung www.dw.de Dscusson Papers 8 Laura Aura Rouslan A. Moro Support Vector Machnes (SVM) as a Technque for Solvency Analyss Berln, August 2008 Opnons expressed n ths

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending

Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending Proceedngs of 2012 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 25 (2012) (2012) IACSIT Press, Sngapore Bayesan Network Based Causal Relatonshp Identfcaton and Fundng Success

More information

SVM Tutorial: Classification, Regression, and Ranking

SVM Tutorial: Classification, Regression, and Ranking SVM Tutoral: Classfcaton, Regresson, and Rankng Hwanjo Yu and Sungchul Km 1 Introducton Support Vector Machnes(SVMs) have been extensvely researched n the data mnng and machne learnng communtes for the

More information

BANKRUPTCY PREDICTION BY USING SUPPORT VECTOR MACHINES AND GENETIC ALGORITHMS

BANKRUPTCY PREDICTION BY USING SUPPORT VECTOR MACHINES AND GENETIC ALGORITHMS BANKRUPCY PREDICION BY USING SUPPOR VECOR MACHINES AND GENEIC ALGORIHMS SALEHI Mahd Ferdows Unversty of Mashhad, Iran ROSAMI Neda Islamc Azad Unversty Scence and Research Khorasan-e-Razav Branch Abstract:

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

1 Example 1: Axis-aligned rectangles

1 Example 1: Axis-aligned rectangles COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton

More information

Support Vector Machine Model for Currency Crisis Discrimination. Arindam Chaudhuri 1. Abstract

Support Vector Machine Model for Currency Crisis Discrimination. Arindam Chaudhuri 1. Abstract Support Vector Machne Model for Currency Crss Dscrmnaton Arndam Chaudhur Abstract Support Vector Machne (SVM) s powerful classfcaton technque based on the dea of structural rsk mnmzaton. Use of kernel

More information

Lecture 2: Single Layer Perceptrons Kevin Swingler

Lecture 2: Single Layer Perceptrons Kevin Swingler Lecture 2: Sngle Layer Perceptrons Kevn Sngler kms@cs.str.ac.uk Recap: McCulloch-Ptts Neuron Ths vastly smplfed model of real neurons s also knon as a Threshold Logc Unt: W 2 A Y 3 n W n. A set of synapses

More information

Logistic Regression. Steve Kroon

Logistic Regression. Steve Kroon Logstc Regresson Steve Kroon Course notes sectons: 24.3-24.4 Dsclamer: these notes do not explctly ndcate whether values are vectors or scalars, but expects the reader to dscern ths from the context. Scenaro

More information

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel

More information

Support vector domain description

Support vector domain description Pattern Recognton Letters 20 (1999) 1191±1199 www.elsever.nl/locate/patrec Support vector doman descrpton Davd M.J. Tax *,1, Robert P.W. Dun Pattern Recognton Group, Faculty of Appled Scence, Delft Unversty

More information

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching) Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton

More information

Transition Matrix Models of Consumer Credit Ratings

Transition Matrix Models of Consumer Credit Ratings Transton Matrx Models of Consumer Credt Ratngs Abstract Although the corporate credt rsk lterature has many studes modellng the change n the credt rsk of corporate bonds over tme, there s far less analyss

More information

STATISTICAL DATA ANALYSIS IN EXCEL

STATISTICAL DATA ANALYSIS IN EXCEL Mcroarray Center STATISTICAL DATA ANALYSIS IN EXCEL Lecture 6 Some Advanced Topcs Dr. Petr Nazarov 14-01-013 petr.nazarov@crp-sante.lu Statstcal data analyss n Ecel. 6. Some advanced topcs Correcton for

More information

The Current Employment Statistics (CES) survey,

The Current Employment Statistics (CES) survey, Busness Brths and Deaths Impact of busness brths and deaths n the payroll survey The CES probablty-based sample redesgn accounts for most busness brth employment through the mputaton of busness deaths,

More information

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background:

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background: SPEE Recommended Evaluaton Practce #6 efnton of eclne Curve Parameters Background: The producton hstores of ol and gas wells can be analyzed to estmate reserves and future ol and gas producton rates and

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy 4.02 Quz Solutons Fall 2004 Multple-Choce Questons (30/00 ponts) Please, crcle the correct answer for each of the followng 0 multple-choce questons. For each queston, only one of the answers s correct.

More information

Study on Model of Risks Assessment of Standard Operation in Rural Power Network

Study on Model of Risks Assessment of Standard Operation in Rural Power Network Study on Model of Rsks Assessment of Standard Operaton n Rural Power Network Qngj L 1, Tao Yang 2 1 Qngj L, College of Informaton and Electrcal Engneerng, Shenyang Agrculture Unversty, Shenyang 110866,

More information

Credit Limit Optimization (CLO) for Credit Cards

Credit Limit Optimization (CLO) for Credit Cards Credt Lmt Optmzaton (CLO) for Credt Cards Vay S. Desa CSCC IX, Ednburgh September 8, 2005 Copyrght 2003, SAS Insttute Inc. All rghts reserved. SAS Propretary Agenda Background Tradtonal approaches to credt

More information

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION Vson Mouse Saurabh Sarkar a* a Unversty of Cncnnat, Cncnnat, USA ABSTRACT The report dscusses a vson based approach towards trackng of eyes and fngers. The report descrbes the process of locatng the possble

More information

IMPACT ANALYSIS OF A CELLULAR PHONE

IMPACT ANALYSIS OF A CELLULAR PHONE 4 th ASA & μeta Internatonal Conference IMPACT AALYSIS OF A CELLULAR PHOE We Lu, 2 Hongy L Bejng FEAonlne Engneerng Co.,Ltd. Bejng, Chna ABSTRACT Drop test smulaton plays an mportant role n nvestgatng

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

Robust Design of Public Storage Warehouses. Yeming (Yale) Gong EMLYON Business School

Robust Design of Public Storage Warehouses. Yeming (Yale) Gong EMLYON Business School Robust Desgn of Publc Storage Warehouses Yemng (Yale) Gong EMLYON Busness School Rene de Koster Rotterdam school of management, Erasmus Unversty Abstract We apply robust optmzaton and revenue management

More information

An Empirical Study of Search Engine Advertising Effectiveness

An Empirical Study of Search Engine Advertising Effectiveness An Emprcal Study of Search Engne Advertsng Effectveness Sanjog Msra, Smon School of Busness Unversty of Rochester Edeal Pnker, Smon School of Busness Unversty of Rochester Alan Rmm-Kaufman, Rmm-Kaufman

More information

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES The goal: to measure (determne) an unknown quantty x (the value of a RV X) Realsaton: n results: y 1, y 2,..., y j,..., y n, (the measured values of Y 1, Y 2,..., Y j,..., Y n ) every result s encumbered

More information

Traffic-light a stress test for life insurance provisions

Traffic-light a stress test for life insurance provisions MEMORANDUM Date 006-09-7 Authors Bengt von Bahr, Göran Ronge Traffc-lght a stress test for lfe nsurance provsons Fnansnspetonen P.O. Box 6750 SE-113 85 Stocholm [Sveavägen 167] Tel +46 8 787 80 00 Fax

More information

Gender Classification for Real-Time Audience Analysis System

Gender Classification for Real-Time Audience Analysis System Gender Classfcaton for Real-Tme Audence Analyss System Vladmr Khryashchev, Lev Shmaglt, Andrey Shemyakov, Anton Lebedev Yaroslavl State Unversty Yaroslavl, Russa vhr@yandex.ru, shmaglt_lev@yahoo.com, andrey.shemakov@gmal.com,

More information

The Use of Analytics for Claim Fraud Detection Roosevelt C. Mosley, Jr., FCAS, MAAA Nick Kucera Pinnacle Actuarial Resources Inc.

The Use of Analytics for Claim Fraud Detection Roosevelt C. Mosley, Jr., FCAS, MAAA Nick Kucera Pinnacle Actuarial Resources Inc. Paper 1837-2014 The Use of Analytcs for Clam Fraud Detecton Roosevelt C. Mosley, Jr., FCAS, MAAA Nck Kucera Pnnacle Actuaral Resources Inc., Bloomngton, IL ABSTRACT As t has been wdely reported n the nsurance

More information

SIMPLE LINEAR CORRELATION

SIMPLE LINEAR CORRELATION SIMPLE LINEAR CORRELATION Smple lnear correlaton s a measure of the degree to whch two varables vary together, or a measure of the ntensty of the assocaton between two varables. Correlaton often s abused.

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

Improved Mining of Software Complexity Data on Evolutionary Filtered Training Sets

Improved Mining of Software Complexity Data on Evolutionary Filtered Training Sets Improved Mnng of Software Complexty Data on Evolutonary Fltered Tranng Sets VILI PODGORELEC Insttute of Informatcs, FERI Unversty of Marbor Smetanova ulca 17, SI-2000 Marbor SLOVENIA vl.podgorelec@un-mb.s

More information

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES In ths chapter, we wll learn how to descrbe the relatonshp between two quanttatve varables. Remember (from Chapter 2) that the terms quanttatve varable

More information

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,

More information

A study on the ability of Support Vector Regression and Neural Networks to Forecast Basic Time Series Patterns

A study on the ability of Support Vector Regression and Neural Networks to Forecast Basic Time Series Patterns A study on the ablty of Support Vector Regresson and Neural Networks to Forecast Basc Tme Seres Patterns Sven F. Crone, Jose Guajardo 2, and Rchard Weber 2 Lancaster Unversty, Department of Management

More information

A PROBABILITY-MAPPING ALGORITHM FOR CALIBRATING THE POSTERIOR PROBABILITIES: A DIRECT MARKETING APPLICATION

A PROBABILITY-MAPPING ALGORITHM FOR CALIBRATING THE POSTERIOR PROBABILITIES: A DIRECT MARKETING APPLICATION Document de traval du LEM 2011-06 A PROBABILITY-MAPPIG ALGORITHM FOR CALIBRATIG THE POSTERIOR PROBABILITIES: A DIRECT MARKETIG APPLICATIO Krstof Coussement *, Wouter Bucknx ** * IESEG School of Management

More information

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008 Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn

More information

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation Exhaustve Regresson An Exploraton of Regresson-Based Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The

More information

VRT012 User s guide V0.1. Address: Žirmūnų g. 27, Vilnius LT-09105, Phone: (370-5) 2127472, Fax: (370-5) 276 1380, Email: info@teltonika.

VRT012 User s guide V0.1. Address: Žirmūnų g. 27, Vilnius LT-09105, Phone: (370-5) 2127472, Fax: (370-5) 276 1380, Email: info@teltonika. VRT012 User s gude V0.1 Thank you for purchasng our product. We hope ths user-frendly devce wll be helpful n realsng your deas and brngng comfort to your lfe. Please take few mnutes to read ths manual

More information

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000 Problem Set 5 Solutons 1 MIT s consderng buldng a new car park near Kendall Square. o unversty funds are avalable (overhead rates are under pressure and the new faclty would have to pay for tself from

More information

IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS

IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS Chrs Deeley* Last revsed: September 22, 200 * Chrs Deeley s a Senor Lecturer n the School of Accountng, Charles Sturt Unversty,

More information

Customer Lifetime Value Modeling and Its Use for Customer Retention Planning

Customer Lifetime Value Modeling and Its Use for Customer Retention Planning Customer Lfetme Value Modelng and Its Use for Customer Retenton Plannng Saharon Rosset Enat Neumann Ur Eck Nurt Vatnk Yzhak Idan Amdocs Ltd. 8 Hapnna St. Ra anana 43, Israel {saharonr, enatn, ureck, nurtv,

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

Dynamic Pricing for Smart Grid with Reinforcement Learning

Dynamic Pricing for Smart Grid with Reinforcement Learning Dynamc Prcng for Smart Grd wth Renforcement Learnng Byung-Gook Km, Yu Zhang, Mhaela van der Schaar, and Jang-Won Lee Samsung Electroncs, Suwon, Korea Department of Electrcal Engneerng, UCLA, Los Angeles,

More information

Section 5.4 Annuities, Present Value, and Amortization

Section 5.4 Annuities, Present Value, and Amortization Secton 5.4 Annutes, Present Value, and Amortzaton Present Value In Secton 5.2, we saw that the present value of A dollars at nterest rate per perod for n perods s the amount that must be deposted today

More information

Fault tolerance in cloud technologies presented as a service

Fault tolerance in cloud technologies presented as a service Internatonal Scentfc Conference Computer Scence 2015 Pavel Dzhunev, PhD student Fault tolerance n cloud technologes presented as a servce INTRODUCTION Improvements n technques for vrtualzaton and performance

More information

ECE544NA Final Project: Robust Machine Learning Hardware via Classifier Ensemble

ECE544NA Final Project: Robust Machine Learning Hardware via Classifier Ensemble 1 ECE544NA Fnal Project: Robust Machne Learnng Hardware va Classfer Ensemble Sa Zhang, szhang12@llnos.edu Dept. of Electr. & Comput. Eng., Unv. of Illnos at Urbana-Champagn, Urbana, IL, USA Abstract In

More information

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL

More information

GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM

GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM BARRIOT Jean-Perre, SARRAILH Mchel BGI/CNES 18.av.E.Beln 31401 TOULOUSE Cedex 4 (France) Emal: jean-perre.barrot@cnes.fr 1/Introducton The

More information

A Genetic Programming Based Stock Price Predictor together with Mean-Variance Based Sell/Buy Actions

A Genetic Programming Based Stock Price Predictor together with Mean-Variance Based Sell/Buy Actions Proceedngs of the World Congress on Engneerng 28 Vol II WCE 28, July 2-4, 28, London, U.K. A Genetc Programmng Based Stock Prce Predctor together wth Mean-Varance Based Sell/Buy Actons Ramn Rajaboun and

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model

More information

Calculating the high frequency transmission line parameters of power cables

Calculating the high frequency transmission line parameters of power cables < ' Calculatng the hgh frequency transmsson lne parameters of power cables Authors: Dr. John Dcknson, Laboratory Servces Manager, N 0 RW E B Communcatons Mr. Peter J. Ncholson, Project Assgnment Manager,

More information

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE Yu-L Huang Industral Engneerng Department New Mexco State Unversty Las Cruces, New Mexco 88003, U.S.A. Abstract Patent

More information

Interpreting Patterns and Analysis of Acute Leukemia Gene Expression Data by Multivariate Statistical Analysis

Interpreting Patterns and Analysis of Acute Leukemia Gene Expression Data by Multivariate Statistical Analysis Interpretng Patterns and Analyss of Acute Leukema Gene Expresson Data by Multvarate Statstcal Analyss ChangKyoo Yoo * and Peter A. Vanrolleghem BIOMATH, Department of Appled Mathematcs, Bometrcs and Process

More information

Method for assessment of companies' credit rating (AJPES S.BON model) Short description of the methodology

Method for assessment of companies' credit rating (AJPES S.BON model) Short description of the methodology Method for assessment of companes' credt ratng (AJPES S.BON model) Short descrpton of the methodology Ljubljana, May 2011 ABSTRACT Assessng Slovenan companes' credt ratng scores usng the AJPES S.BON model

More information

Knowledge Discovery in a Direct Marketing Case using Least Squares Support Vector Machines

Knowledge Discovery in a Direct Marketing Case using Least Squares Support Vector Machines Knowledge Dscovery n a Drect Marketng Case usng Least Squares Support Vector Machnes S. Vaene, 1, * B. Baesens, 1 T. Van Gestel, 2 J. A. K. Suykens, 2 D. Van den Poel, 3 J. Vanthenen, 1 B. De Moor, 2 G.

More information

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall SP 2005-02 August 2005 Staff Paper Department of Appled Economcs and Management Cornell Unversty, Ithaca, New York 14853-7801 USA Farm Savngs Accounts: Examnng Income Varablty, Elgblty, and Benefts Brent

More information

Linear Circuits Analysis. Superposition, Thevenin /Norton Equivalent circuits

Linear Circuits Analysis. Superposition, Thevenin /Norton Equivalent circuits Lnear Crcuts Analyss. Superposton, Theenn /Norton Equalent crcuts So far we hae explored tmendependent (resste) elements that are also lnear. A tmendependent elements s one for whch we can plot an / cure.

More information

On-Line Fault Detection in Wind Turbine Transmission System using Adaptive Filter and Robust Statistical Features

On-Line Fault Detection in Wind Turbine Transmission System using Adaptive Filter and Robust Statistical Features On-Lne Fault Detecton n Wnd Turbne Transmsson System usng Adaptve Flter and Robust Statstcal Features Ruoyu L Remote Dagnostcs Center SKF USA Inc. 3443 N. Sam Houston Pkwy., Houston TX 77086 Emal: ruoyu.l@skf.com

More information

Hollinger Canadian Publishing Holdings Co. ( HCPH ) proceeding under the Companies Creditors Arrangement Act ( CCAA )

Hollinger Canadian Publishing Holdings Co. ( HCPH ) proceeding under the Companies Creditors Arrangement Act ( CCAA ) February 17, 2011 Andrew J. Hatnay ahatnay@kmlaw.ca Dear Sr/Madam: Re: Re: Hollnger Canadan Publshng Holdngs Co. ( HCPH ) proceedng under the Companes Credtors Arrangement Act ( CCAA ) Update on CCAA Proceedngs

More information

Data Mining Analysis and Modeling for Marketing Based on Attributes of Customer Relationship

Data Mining Analysis and Modeling for Marketing Based on Attributes of Customer Relationship School of athematcs and Systems Engneerng Reports from SI - Rapporter från SI Data nng Analyss and odelng for arketng Based on Attrbutes of Customer Relatonshp Xaoshan Du Sep 2006 SI Report 06129 Väö Unversty

More information

14.74 Lecture 5: Health (2)

14.74 Lecture 5: Health (2) 14.74 Lecture 5: Health (2) Esther Duflo February 17, 2004 1 Possble Interventons Last tme we dscussed possble nterventons. Let s take one: provdng ron supplements to people, for example. From the data,

More information

Learning to Classify Ordinal Data: The Data Replication Method

Learning to Classify Ordinal Data: The Data Replication Method Journal of Machne Learnng Research 8 (7) 393-49 Submtted /6; Revsed 9/6; Publshed 7/7 Learnng to Classfy Ordnal Data: The Data Replcaton Method Jame S. Cardoso INESC Porto, Faculdade de Engenhara, Unversdade

More information

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S S C H E D A E I N F O R M A T I C A E VOLUME 0 0 On Mean Squared Error of Herarchcal Estmator Stans law Brodowsk Faculty of Physcs, Astronomy, and Appled Computer Scence, Jagellonan Unversty, Reymonta

More information

An Efficient and Simplified Model for Forecasting using SRM

An Efficient and Simplified Model for Forecasting using SRM HAFIZ MUHAMMAD SHAHZAD ASIF*, MUHAMMAD FAISAL HAYAT*, AND TAUQIR AHMAD* RECEIVED ON 15.04.013 ACCEPTED ON 09.01.014 ABSTRACT Learnng form contnuous fnancal systems play a vtal role n enterprse operatons.

More information

Lecture 5,6 Linear Methods for Classification. Summary

Lecture 5,6 Linear Methods for Classification. Summary Lecture 5,6 Lnear Methods for Classfcaton Rce ELEC 697 Farnaz Koushanfar Fall 2006 Summary Bayes Classfers Lnear Classfers Lnear regresson of an ndcator matrx Lnear dscrmnant analyss (LDA) Logstc regresson

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

Gender differences in revealed risk taking: evidence from mutual fund investors

Gender differences in revealed risk taking: evidence from mutual fund investors Economcs Letters 76 (2002) 151 158 www.elsever.com/ locate/ econbase Gender dfferences n revealed rsk takng: evdence from mutual fund nvestors a b c, * Peggy D. Dwyer, James H. Glkeson, John A. Lst a Unversty

More information

THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION

THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION Internatonal Journal of Electronc Busness Management, Vol. 3, No. 4, pp. 30-30 (2005) 30 THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION Yu-Mn Chang *, Yu-Cheh

More information

HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA*

HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* Luísa Farnha** 1. INTRODUCTION The rapd growth n Portuguese households ndebtedness n the past few years ncreased the concerns that debt

More information

Formulating & Solving Integer Problems Chapter 11 289

Formulating & Solving Integer Problems Chapter 11 289 Formulatng & Solvng Integer Problems Chapter 11 289 The Optonal Stop TSP If we drop the requrement that every stop must be vsted, we then get the optonal stop TSP. Ths mght correspond to a ob sequencng

More information

Estimating Total Claim Size in the Auto Insurance Industry: a Comparison between Tweedie and Zero-Adjusted Inverse Gaussian Distribution

Estimating Total Claim Size in the Auto Insurance Industry: a Comparison between Tweedie and Zero-Adjusted Inverse Gaussian Distribution Avalable onlne at http:// BAR, Curtba, v. 8, n. 1, art. 3, pp. 37-47, Jan./Mar. 2011 Estmatng Total Clam Sze n the Auto Insurance Industry: a Comparson between Tweede and Zero-Adjusted Inverse Gaussan

More information

An Inductive Fuzzy Classification Approach applied to Individual Marketing

An Inductive Fuzzy Classification Approach applied to Individual Marketing An Inductve Fuzzy Classfcaton Approach appled to Indvdual Marketng Mchael Kaufmann, Andreas Meer Abstract A data mnng methodology for an nductve fuzzy classfcaton s ntroduced. The nducton step s based

More information