Knowledge Discovery in a Direct Marketing Case using Least Squares Support Vector Machines

Size: px
Start display at page:

Download "Knowledge Discovery in a Direct Marketing Case using Least Squares Support Vector Machines"

Transcription

1 Knowledge Dscovery n a Drect Marketng Case usng Least Squares Support Vector Machnes S. Vaene, 1, * B. Baesens, 1 T. Van Gestel, 2 J. A. K. Suykens, 2 D. Van den Poel, 3 J. Vanthenen, 1 B. De Moor, 2 G. Dedene 1 1 K.U. Leuven, Department of Appled Economc Scences, Naamsestraat 69, B-3000 Leuven, Belgum 2 K.U. Leuven, Department of Electrcal Engneerng ESAT-SISTA, Kasteelpark Arenberg 10, B-3001 Leuven, Belgum 3 Ghent Unversty, Department of Marketng, Hovenersberg 24, B-9000 Ghent, Belgum We study the problem of repeat-purchase modelng n a drect marketng settng usng Belgan data. More specfcally, we nvestgate the detecton and qualfcaton of the most relevant explanatory varables for predctng purchase ncdence. The analyss s based on a wrapped form of nput selecton usng a senstvty based prunng heurstc to gude a greedy, stepwse, and backward traversal of the nput space. For ths purpose, we make use of a powerful and promsng least squares support vector machne Ž LS-SVM. classfer formulaton. Ths study extends beyond the standard recency frequency monetary Ž RFM. modelng semantcs n two ways: Ž 1. by ncludng alternatve operatonalzatons of the RFM varables, and Ž. 2 by addng several other Ž non-rfm. predctors. Results ndcate that elmnaton of redundant rrelevant nputs allows sgnfcant reducton of model complexty. The emprcal fndngs also hghlght the mportance of frequency and monetary varables, whle the recency varable category seems to be of somewhat lesser mportance to the case at hand. Results also pont to the added value of ncludng non-rfm varables for mprovng customer proflng. More specfcally, customer company nteracton, measured usng ndcators of nformaton requests and complants, and merchandse returns provde addtonal predctve power to purchase ncdence modelng for database marketng John Wley & Sons, Inc. * Author to whom correspondence should be addressed; e-mal: Stjn.Vaene@ econ.kuleuven.ac.be. Contract grant sponsor: Flemsh government ŽResearch Councl K.U. Leuven: GOA-Mefsto; FWO-Flanders: ICCoS and AN-MMM; IWT: STWW Eureka.. Contract grant sponsor: Belgum government. Contract grant number: IUAP-IV 02, IV-24. Contract grant sponsor: European Commsson. Contract grant number: TMR, ERNSI. Ž. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, VOL. 16, John Wley & Sons, Inc.

2 1024 VIAENE ET AL. I. INTRODUCTION The man objectve of ths paper nvolves the detecton and qualfcaton of the most relevant varables for repeat-purchase modelng n a drect marketng settng. Ths knowledge s beleved to vastly enrch customer proflng and thus contrbute drectly to more targeted customer contact. The emprcal study focuses on the purchase ncdence,.e., the ssue whether or not a purchase s made from any product category offered by the drect malng company. Standard recency frequency monetary modelng seman- 1 tcs underle the dscussed purchase ncdence model. Ths bnary Žbuyer versus nonbuyer. classfcaton problem s beng tackled n ths paper by usng least squares support vector machne Ž LS-SVM. classfers. LS-SVMs have recently been ntroduced n the lterature 2 and excellent benchmark results have been reported. 3 Havng constructed an LS-SVM classfer wth all avalable predctors, we engage n an nput selecton experment. Input selecton has been an actve area of research n the datamnng feld for many years now. A compact, yet hghly accurate model may come n very handy n Ž on-lne. customer proflng systems. Furthermore, by reducng the number of nput features, both human understandng and computatonal performance can often be vastly enhanced. Secton II elaborates on some response modelng ssues ncludng a lterature revew and descrpton of the data set. In Secton III, we dscuss the basc underpnnngs of LS-SVMs for bnary classfcaton. The nput selecton experment and correspondng results are presented and dscussed n Secton IV. II. THE RESPONSE MODELING CASE FOR DIRECT MARKETING A. Response Modelng n Drect Marketng Cullnan s generally credted for dentfyng the three sets of varables most often used n response modelng: recency, frequency, and monetary Ž RFM.. 1,4,5 Snce then, the lterature has provded so many uses of these three varable categores, that there s overwhelmng evdence both from academcally revewed studes as well as from practtoners experence that the RFM varables are an mportant set of predctors for modelng mal-order repeat purchasng. However, the benefcal effect of ncludng other varables nto the response model has also been nvestgated. For mal-order response modelng, several alternatve problem formulatons have been proposed based on the choce of the dependent varable. The frst category s purchase ncdence modelng. 6 In ths problem formulaton, the man queston s whether a customer wll purchase durng the next malng perod,.e., one tres to predct the purchase ncdence wthn a fxed tme nterval Žtypcally half a year.. Other authors have nvestgated related problems dealng wth both the purchase ncdence and the amount of purchase n a jont model. 7,8 A thrd alternatve perspectve for response modelng s to model nterpurchase tme through survval analyss or Ž splt-. hazard rate models whch model whether a purchase takes place together wth the duraton of tme untl a purchase

3 KNOWLEDGE DISCOVERY IN DIRECT MARKETING 1025 occurs. 9,10 Table I provdes a summary of contrbutons wth regard to the three alternatve problem formulatons. We observe that the Ž frst. purchase ncdence formulaton s clearly the most popular n the exstng lterature. 11 Moreover, most studes nclude many predctors, even though only a mnorty ncludes all categores. Ths paper focuses on the frst type of problem,.e., purchase ncdence modelng. Ths choce s motvated by the fact that the majorty of prevous research n the drect marketng lterature focuses on the purchase ncdence problem. 12,13 Furthermore, ths s exactly the settng that mal-order companes are typcally confronted wth. They have to decde whether or not a specfc offerng wll be sent to a Ž potental. customer durng a certan malng perod. Gven a tendency of rsng malng costs and ncreasng competton, we can easly see an ncreasng mportance for response modelng. 14 Improvng the targetng of the offers may ndeed counter these two challenges by lowerng nonresponse. Moreover, from the perspectve of the recpent of the Ždrect mal. messages, mal-order companes do not want to overload consumers wth catalogs. The mportance of response modelng to the mal-order ndustry s further llustrated by the fact that the ssue of mprovng targetng was among the top three concerns wth 73.5% of the catalogers n the sample mentoned n Ref. 15. B. The Data Set From a major Belgan mal-order company, we obtaned data on past purchase behavor at the order-lne level,.e., we know when a customer purchased what quantty of a partcular product at whch prce as part of what order. Ths allowed us, n close cooperaton wth doman experts and guded by the extensve lterature, to derve all the necessary purchase behavor varables for a total sample sze of 5,000 customers, of whch 37.94% represent buyers. For each customer, these varables were measured for the perod between 1 July 1993 and 30 June The goal s to predct whether an exstng customer wll repurchase n the observaton perod between 1 July 1997 and 31 December 1997 usng the nformaton provded by the purchase behavor varables. Ths problem bols down to a bnary classfcaton problem: Wll a customer repurchase or not? Notce that the focus s on customer retenton and not on customer acquston. The recency, frequency, and monetary varables have then been modeled as descrbed n detal n Ref. 11. We used two tme horzons for all RFM varables. The Hst horzon refers to the fact that the varable s measured for the perod between 1 July 1993 and 30 June The Year horzon refers to the fact that the varable s measured over the last year. Includng both tme horzons allows us to check whether more recent data are more relevant than hstorcal data. All RFM varables are modeled both wth and wthout the occurrence of returned merchandse, ndcated by R and N n the varable name, respectvely. The former s operatonalzed by ncludng the counts of returned merchandse n the varable values, whereas n the latter case these counts are omtted. Takng

4 1026 VIAENE ET AL. Table I. Lterature revew of response modelng papers. Independent Varable Dependent Varable Context of Applcaton Other Soco- Catalog or Behavoral Demographc Bnary & Bnary & Fund- Catalog Specalty Reference R F M Varables Varables Bnary Amount Tmng Rasng Ž general. Malng Berger and Maglozz Ž X X X X X X Btran and Mondschen Ž X X X X X Bult and Wttnk Ž X X X X X Bult Ž X X X X X Bult et al. Ž X X X X X X X Gonul and Sh Ž X X X X Kaslow Ž X X X X X X X Levn and Zahav Ž X X X X X X X X Maglozz and Berger Ž X X X X X Maglozz Ž X X X X X Thrasher Ž X X X X X X Van den Poel & Leuns Ž X X X X X X Ž. 11 Van den Poel 1999 X X X X X X X Van der Scheer Ž X X X X Zahav and Levn Ž X X X X X X

5 KNOWLEDGE DISCOVERY IN DIRECT MARKETING 1027 nto account both tme horzons Ž Year versus Hst. and ncluson versus excluson of returned tems Ž R versus N., we arrve at a 2 2 desgn n whch each RFM varable s operatonalzed n four ways. For the recency varable, many operatonalzatons have already been suggested. In ths paper, we defne the recency varable as the number of days snce the last purchase wthn a specfc tme wndow Ž Hst versus Year. and nor excludng returned merchandse Ž R versus N.. 4 Recency has been found to be nversely related to the probablty of the next purchase,.e., the longer the tme delay snce the last purchase the lower the probablty of a next purchase wthn the specfc perod. 1 In the context of drect mal, t has generally been observed that multbuyers Ž buyers who already purchased several tmes. are more lkely to repurchase than buyers who only purchased once. 4,26 Although no detaled results are reported because of the propretary nature of most studes, the frequency varable s generally consdered to be the most mportant of the RFM varables 12. Bauer suggests to operatonalze the frequency varable as the number of purchases dvded by the tme on the customer lst snce the frst purchase. 4 We choose to operatonalze the frequency varable as the number of purchases made n a certan tme perod Ž Hst versus Year. whle n- or excludng returned merchandse Ž R versus N.. In the drect marketng lterature, the general conventon s that the more money a person has spent wth a company, the hgher hs her lkelhood of purchasng the next offerng. 27 Nash suggests to operatonalze monetary value as the hghest transacton sale or as the average order sze. 12 Levn and Zahav propose to use the average amount of money per purchase. 27 We model the monetary varable as the total accumulated monetary amount of spendng by a customer durng a certan tme perod Ž Hst versus Year. whle n- or excludng returned merchandse Ž R versus N.. Addtonally, we nclude the natural logarthmc transformaton Ž ln. of all monetary varables as a means to reduce the skewness of the dstrbutons. Apart from the RFM varables, we also ncluded nne other customer proflng nputs. 11 The type and frequency of contact whch customers have wth the mal-order company may yeld mportant nformaton about ther future purchasng behavor. The GenInfo and GenCust are bnary customer company nteracton varables ndcatng whether the customer asked for general nformaton Ž respectvely, fled general complants.. Snce customer Ž ds. satsfacton may not only be revealed by general complants but also by returnng tems, we ncluded two extra varables. The RetMerch varable s a bnary varable ndcatng whether the customer has ever returned an tem that was prevously ordered from the mal-order company. The RetPerc varable measures the total monetary amount of returned orders dvded by the total amount of spendng. The Ndays varable models the length of the customer relatonshp n days. It s commonly beleved that consumers households wth a longer relatonshp wth the company have a hgher probablty of repurchase than households wth shorter relatonshps. IncrHst and IncrYear are operatonalzatons of a behavoral loyalty measure. We propose to perform a medan splt of the length of the

6 1028 VIAENE ET AL. relatonshp Ž tme snce the household became a customer.. Ths enables us to compare the number of purchases Ž.e., frequency. between the frst and last half of the tme wndow. The followng formula s used: purchases second half purchases frst half purchases frst half Ž 1. When the above measure s postve, ths may gve us an ndcaton of ncreasng loyalty by that customer to the Ž mal-order. company, and pso facto satsfacton wth the current level of servce. Remember that the suffx Hst reflects that the whole purchase hstory s used, whereas n the case of the suffx Year, only transactons from the last year are ncluded. The ProdclaT respectvely ProdclaM varables represent the total Ž T. respectvely, mean Ž M. forward-lookng weghted product ndex. The weghtng procedure represents the forward-lookng nature of a product category purchase, derved from another sample of data. Table II gves an overvew of the varables dscussed above. Notce that all mssng values were handled by the mean mputaton procedure 28 and that all predctor varables were normalzed to zero mean and unt varance pror to ther ncluson n the model. 29 III. LEAST SQUARES SVM CLASSIFICATION A. LS-SVMs for Bnary Classfcaton Gven a tranng set x 4, y 1 N wth nput data x n and correspondng bnary class labels y 1, 1 4, the SVM classfer, accordng to Vapnk s orgnal formulaton, satsfes the followng condtons: w T Ž x. b 1 f y 1 Ž 2. w T Ž x. b 1 f y 1 Table II. A lstng of all nputs Ž both RFM and non-rfm. ncluded n the drect marketng case. Recency Frequency Monetary Other RecYearR FrYearR MonHstR ProdclaT RecYearN FrYearN MonHstN ProdclaM RecHstR FrHstR MonYearR GenCust RecHstN FrHstN MonYearN GenInfo lnž MonHstR. Ndays lnž MonHstN. IncrHst lnž MonYearR. lnž MonYearN. IncrYear RetMerch RetPerc

7 whch s equvalent to: KNOWLEDGE DISCOVERY IN DIRECT MARKETING 1029 Ž. Ž. T y w x b 1 1,...,N 3 Ž. n n The nonlnear functon : h maps the nput space to a hgh dmensonal Ž and possbly nfnte dmensonal. feature space. In prmal weght space the classfer then takes the form: T yž x. sgn w Ž x. b Ž 4. however, t s never evaluated n ths form. One defnes the optmzaton problem as: subject to: 1 T N Ý 1 mn T Ž w,. w w c Ž 5. 2 w, b, Ž. T y w x b 1 1,...,N 0 1,...,N The varables are slack varables whch are needed to allow msclassfcatons n the set of nequaltes Ž e.g., due to overlappng dstrbutons.. The postve real constant c should be consdered as a tunng parameter n the algorthm. For nonlnear SVMs, the QP-problem and the classfer are never solved and evaluated n ths form. Instead, a dual space formulaton and representaton are obtaned by applyng the Mercer condton Ž see Refs for detals.. Vapnk s SVM classfer formulaton was modfed by Suykens and Vandewalle 2 nto the followng LS-SVM formulaton: N T 2 Ý 1 Ž mn TŽ w, e. w w e Ž w, b, e subject to the equalty constrants: Ž. Ž. T y w x b 1 e, 1,...,N 8 Ths formulaton now conssts of equalty nstead of nequalty constrants and takes nto account a squared error wth a regularzaton term smlar to rdge regresson. The soluton s obtaned after constructng the Lagrangan: N T Ý 1 4 L Ž w, b, e;. TŽ w, e. y w Ž x. b 1 e Ž 9. where are the Lagrange multplers. After takng the condtons for optmal ty, one obtans the followng lnear system 2 : 0 Y T b 0 Ž Y I 1

8 1030 VIAENE ET AL. where Z Ž x. T y ;...; Ž x. T y, Y y ;...; y, 1 1;...;1, 1 1 N N 1 N T 2 ;...;, ZZ, and Mercer s condton s appled wthn the matrx: 1 N T j j Ž. Ž j. yy x x yykž x, x. j j For the kernel functon KŽ,. one typcally has the followng choces: KŽ x, x. x T x, Ž lnear kernel. Ž T d K x, x x x 1, polynomal kernel of degree d Ž. Ž. KŽ x, x. exp x x, Ž radal bass functon Ž RFB. kernel. Ž. KŽ x, x. tanh x T x, multlayer perceptron Ž MLP. kernel Ž 11. Ž. where d,,, and are constants. Notce that the Mercer condton holds for all and d values n the RBF and the polynomal cases, but not for all possble choces of and n the MLP case. The LS-SVM classfer s then constructed as follows: N Ý 1 yž x. sgn ykž x, x. b Ž 12. Note that the matrx n Ž 10. s of dmenson Ž N 1. Ž N 1.. For large values of N, ths matrx cannot easly be stored, such that an teratve soluton method for solvng t s needed. A Hestenes Stefel conjugate gradent algorthm s suggested n Ref. 34 to overcome ths problem. Bascally, the latter rests upon a Ž. 34 transformaton of the matrx n 10 to a postve defnte form. A straghtforward extenson of LS-SVMs to multclass problems has been proposed n Ref. 35, where addtonal outputs are taken to encode multclasses as s often done n classcal neural network methodology. 29 A drawback of LS-SVMs s that sparseness s lost due to the choce of a 2-norm. However, ths can be crcumvented n a second stage by a prunng procedure whch s based upon removng tranng ponts guded by the sorted support value spectrum. 36 B. Calbratng the RBF LS-SVM Classfer All classfers were traned usng RBF kernels. 3 Estmaton of the generalzaton ablty of the RBF LS-SVM classfer s then realzed by the followng expermental setup 3 : 3 1 Ž. 1 Set asde 4 of the data for the tranng set and the remanng 4 for testng, respectng the orgnal class dstrbuton. Ž. 2 Perform 10-fold cross valdaton on the tranng data for each Ž,. combnaton from the ntal canddate tunng sets and typcally chosen as follows: 0.5, 5, 10, 15, 25, 50, 100, 250, 5004 n , 0.5, 1, 10, 50, 100, 500, N '

9 KNOWLEDGE DISCOVERY IN DIRECT MARKETING 1031 The square root ' n of the number of nputs n s ntroduced, snce x x 2 2 n 1 the RBF kernel s proportonal to n and the factor N s ntroduced such that the msclassfcaton term Ý 1 N e 2 s normalzed wth the sze of the data set. Ž. 3 Choose optmal Ž,. from the ntal canddate tunng sets and by lookng at the best cross valdaton performance for each Ž,. combnaton. Ž. 4 Refne and teratvely by means of a grd search mechansm to further optmze the tunng parameters Ž,.. In our experments, we repeated ths step three tmes. Ž. 5 Construct the LS-SVM classfer usng the total tranng set for the optmal choce of the tuned hyperparameters Ž,.. Ž. 6 Assess the generalzaton ablty by means of the ndependent test set. Followng the procedure outlned above, one obtaned the results depcted 3 n Table III. The optmzed RBF LS-SVM classfer, traned on 4 of the data set, acheves a percentage correctly classfed on the tranng data of 77.54% wth and Performance on the ndependent test data amounts to 74.48% correctly classfed. We contrasted these results wth those obtaned usng a lnear kernel for the LS-SVM classfer. As can be observed from Table III, the percentage correctly classfed drops to 76.26% on the tranng set and to 73.76% on the ndependent test set. IV. THE INPUT SELECTION EXPERIMENT A. Input Selecton n a Nutshell Input selecton s a commonly adhered technque to reduce model complexty. The goal s to fnd a reduced coordnate system that allows one to project the data on a more compact representaton. The general assumpton underlyng ths operaton and justfyng t, s that the studed data approxmately le wthn the bounds of ths reduced space. As such, models wth fewer nputs are capable of mprovng both human understandng and computatonal performance. Moreover, elmnaton of redundant and or rrelevant nputs may also mprove the predctve power of an algorthm. 37 Selectng the best subset of a set of n predctors s a nontrval problem. Ths follows from the fact that the optmal nput subset can only be obtaned when the nput space s exhaustvely searched. When n nputs are present, ths would mply the need to evaluate 2 n 1 nput subsets. Unfortunately, as n grows, ths very quckly becomes computatonally nfeasble. For that reason, heurstc search procedures are often preferred. Input selecton can then ether be performed as a preprocessng step, ndepen- Table III. Classfcaton accuracy of the optmzed RFB LS-SVM classfer versus an LS-SVM optmzed usng a lnear kernel. LS-SVM LS-SVM Classfcaton Accuracy RBF Kernel Lnear Kernel Tranng Ž 3750 observatons % 76.26% Test Ž 1250 observatons % 73.76%

10 1032 VIAENE ET AL. dent of the nducton algorthm, or explctly make use of t. The former approach s termed flter, the latter wrapper. 38 Flter methods operate ndependently of the learnng algorthm. Undesrable nputs are fltered out of the data before nducton commences. Focus 39 and Relef 40 are well-known flter methods. Wrapper methods make use of the actual target learnng algorthm to evaluate the usefulness of nputs. Typcally, the nput evaluaton heurstc that s used s based upon nspecton of the traned parameters and or comparson of predctve performance under dfferent nput subset confguratons. Input selecton s then often performed n a sequental fashon, e.g., guded by a best-frst nput selecton strategy. The backward selecton scheme starts from a full nput set and stepwse prunes nput varables that are undesrable. The forward selecton scheme starts from the empty nput set and stepwse adds nput varables that are desrable. Hybrds of the above also exst. B. Wrappng the Optmzed LS-SVM Classfer Input selecton effectvely starts at the moment the LS-SVM classfer has been constructed on the full set of n avalable predctors. The nput selecton procedure s based upon a Ž greedy. best-frst heurstc, gudng a backward search mechansm through the nput space. 38 The mechancs of the mplemented heurstc for assessng the senstvty of the classfer to a certan nput are qute straghtforward. We apply a strategy of constant substtuton n whch an nput s perturbed to ts mean whle all other nputs keep ther values and compute the mpact of ths operaton on the performance of the obtaned LS-SVM classfer wthout reestmaton of the LS-SVM parameters and b. Ths assessment s done usng the separate prunng set to obtan an unbased estmate of the change n classfcaton accuracy of the constructed classfer. The prunng set conssts of 1250 observatons that were randomly selected from the tranng set of 3750 observatons. Fgure 1 provdes a concse overvew of the dfferent steps of the expermental procedure. Fgure 1.

11 KNOWLEDGE DISCOVERY IN DIRECT MARKETING 1033 Startng wth a full nput set F 1, all n nputs are pruned sequentally,.e., one by one. The frst nput fp to be removed, s determned at the end of Step 1 task Ž. 4. After havng removed ths nput from F 1, the reduced nput set F F f p s used for subsequent nput removal. At ths moment, an teraton of dentcal Steps s started, n whch, n a frst phase, the LS-SVM parameters and b are re-estmated on the tranng set task Ž. 1 of Step, however, wthout recalbraton for and, and the generalzaton ablty of the classfer s quantfed on the ndependent test set task Ž. 2 of Step. Notce that the orgnally optmzed and values obtaned n task Ž. 1 of Step 1 reman unchanged durng the entre nput selecton phase. Agan, nput senstvtes of the resultng classfcaton model Ž wthout re-estmaton of and b. are assessed on the prunng set to dentfy the nput to whch the classfer s least senstve when perturbed to ts mean task Ž. 3 of Step. Ths nput s then pruned from the remanng nput subset and dsregarded for further analyss. The prunng procedure s thereupon resumed wth a reduced nput set, untl all nputs are eventually removed. Once all nputs have been pruned, the preferred reduced model s then determned by means of the hghest prunng set performance. Table IV summarzes the emprcal fndngs of the prunng procedure for the RFM case. Observe how the suggested nput selecton method allows sgnfcant reducton of model complexty Ž from 25 to 9 nputs. wthout any sgnfcant degradaton of the generalzaton behavor on the ndependent test set. The test set performance amounts to 73.92% for the full model and 73.52% for the reduced model. The order of nput removal as depcted n Table V, provdes further nsght nto the relatve mportance of the predctor categores Ž cf. Table II.. The reduced model conssts of the nne nputs that are underlned n Table V. Ths reduced set of predctors conssts of frequency, monetary, and other Ž non-rfm. varables. It s especally mportant to note that the reduced model ncludes nformaton on returned merchandse. Furthermore, notce the absence of the recency component n the reduced nput set. Inspecton of the order of removal of nputs, whle further prunng ths reduced nput set, hghlghts the relatve mportance of the frequency varables. More specfcally, the last two varables to be removed belong to ths predctor category. Note that an nput set consstng of only these two nputs, stll yelds a percentage correctly classfed at 72.00% on the test set. Results also pont to the benefcal effect of ncludng the Table IV. Emprcal assessment of the RBF LS-SVM classfers for the full and reduced models. Classfcaton Accuracy Full Model Reduced Model Tranng Ž 2500 observatons % 76.04% Prunng Ž 1250 observatons % 77.20% Test Ž 1250 observatons % 73.52% Number of Inputs 25 9

12 1034 VIAENE ET AL. Table V. Order of nput removal. Each nput s qualfed by ts category wth r, f, m and o respectvely standng for recency, frequency, monetary and other Ž cf. Table II.. Prunng Steps RetPerc o ProdclaM o RecHstN r FrYearN f MonYearR m lnž MonHstN. m MonHstR m IncrHst o lnž MonHstR. m MonYearN m RecHstR r IncrYear o RecYearR r MonHstN m GenInfo o Ndays o lnž MonYearR. m RecYearN r GenCust o FrHstR f ProdclaT o lnž MonYearN. m FrHstN f RetMerch o FrYearR f non-rfm customer proflng varables GenInfo and GenCust for mprovng predctve accuracy. They underlne that customer company nteracton varables, here measured by ndcators of nformaton requests and complants, provde addtonal predctve power to purchase ncdence modelng for database marketng. V. CONCLUSION In ths paper, we appled an LS-SVM based nput selecton wrapper to a real-lfe drect marketng case nvolvng the modelng of repeat-purchase behavor. Based on a thorough revew of the lterature, we extended the well-known recency, frequency, monetary Ž RFM. framework Ž 1. by usng alternatve operatonalzatons of the orgnal varables, and Ž. 2 by ncludng several addtonal behavoral varables. The senstvty based, stepwse nput selecton method, constructed as a wrapper around the LS-SVM classfer, allows sgnfcant reducton of model complexty wthout degradng predctve performance. The emprcal fndngs hghlght the role of frequency and monetary varables n the reduced model, whle the recency varable category seems to be of somewhat lesser mportance wthn the response model. Results also pont to the benefcal effect of ncludng non-rfm customer proflng varables for mprovng predctve accuracy. More specfcally, customer company nteracton, measured by ndcators of nformaton requests and complants, and merchandse returns provde addtonal predctve power to purchase ncdence modelng for database marketng. Ths work was partly carred out at the Leuven Insttute for Research on Informaton Systems Ž LIRIS. of the Department of Appled Economc Scences of the K.U. Leuven n the framework of the KBC Insurance Research Char. Ths work was partly carred out at the ESAT laboratory and the Interdscplnary Center of Neural Networks ICNN of the K.U. Leuven. S. Vaene, holder of the KBC Insurance Research Char, and B. Baesens are both Research Assstants of LIRIS. T. Van Gestel s a research assstant wth the Fund for Scentfc Research Flanders Ž FWO-Flanders., J. Suykens s a postdoctoral researcher wth the Fund for Scentfc Research Flanders Ž FWO-Flanders., B. De Moor s a senor research assocate wth the Fund for Scentfc Research Flanders Ž FWO-Flanders.. D. Van den Poel s an assstant professor at the Department of

13 KNOWLEDGE DISCOVERY IN DIRECT MARKETING 1035 Marketng at Ghent Unversty. J. Vanthenen and G. Dedene are Senor Research Assocates of LIRIS. References 1. Cullnan GJ. Pckng them by ther battng averages recency-frequency-monetary method of controllng crculaton. Drect Mal Marketng Assocaton, New York, manual release 2103 edton, Suykens JAK, Vandewalle J. Least squares support vector machne classfers. Neural Process Lett 1999;9Ž. 3 : Van Gestel T, Suykens JAK, Baesens B, Vaene S, Vanthenen J, Dedene G, De Moor B, Vandewalle J. Benchmarkng least squares support vector machne classfers. Techncal Report 00-37, ESAT-SISTA, K.U. Leuven, Leuven, Belgum, Bauer A. A drect mal customer purchase model. J Drect Marketng 1988;2Ž. 3 : Kestnbaum RD. Quanttatve database methods. In: The drect marketng handbook. New York: McGraw Hll; p Bult JR. Target selecton for drect marketng, PhD thess, Gronngen Unversty, Levn N, Zahav J. Contnuous predctve modelng: a comparatve analyss. J Interactve Marketng 1998;12Ž. 2 : Van der Scheer HR. Quanttatve approaches for proft maxmzaton n drect marketng, PhD thess, Gronngen Unversty, Dekmpe MG, Degraeve Z. The attrton of volunteers. European J Operat Res 1997;98: Van den Poel D, Leuns J. Database marketng modelng for fnancal servces usng hazard rate models. Internat Rev Retal, Dstrbuton and Consumer Res 1998; 8Ž. 2 : Van den Poel D. Response Modelng for Database Marketng usng Bnary Classfcaton, PhD thess, K.U. Leuven, Nash EL. Drect marketng: strategy, plannng, executon. Thrd Ed. New York: McGraw Hll; Zahav J, Levn N. Issues and problems n applyng neural computng to target marketng. J Drect Marketng 1997;11Ž. 4 : Hauser B. Lst Segmentaton. In: The drect marketng handbook. New York: McGraw-Hll; p DMA. Statstcal fact book Twenteth Ed. New York: Drect Marketng Assocaton; Berger P, Maglozz T. The effect of sample sze and proporton of buyers n the sample on the performance of lst segmentaton equatons generated by regresson analyss. J Drect Marketng 1992;6Ž. 1 : Btran GR, Mondschen SV. Malng decsons n the catalog sales ndustry. Manage Sc 1996;42Ž. 9 : Bult JR, Wttnk DR. Estmatng and valdatng asymmetrc heterogeneous loss functons appled to health care fund rasng. Int J Res Marketng 1996;13: Bult JR. Semparametrc versus parametrc classfcaton models: an applcaton to drect marketng. J Marketng Res 1993;30: Bult JR, Van der Scheer H, Wansbeek T. Interacton between target and malng characterstcs n drect marketng, wth an applcaton to health care fund rasng. Int J Res Marketng 1997;14: Gonul F, Sh MZ. Optmal malng of catalogs: a new methodology usng estmable structural dynamc programmng models. Manage Sc 1998;44Ž. 9 : Kaslow GA. A mcroeconomc analyss of consumer response to drect marketng and mal order, PhD thess, Calforna Insttute of Technology, 1997.

14 1036 VIAENE ET AL. 23. Maglozz TL, Berger PD. Lst segmentaton strateges n drect marketng. Omega Int J Manage Sc 1993;21Ž. 1 : Maglozz TL. An emprcal nvestgaton of regresson meta-strateges for drect marketng lst segmentaton models, PhD thess, Boston Unversty, Thrasher RP. Cart: a recent advance n tree-structured lst segmentaton methodology. J Drect Marketng 1991;5Ž. 1 : Stone B. Successful drect marketng methods. Chcago: Cran books; Levn N, Zahav J. Segmentaton analyss wth manageral judgment. J Drect Marketng 1996;10Ž. 3 : Lttle RJA. Regresson wth mssng x s: a revew. J Amer Statst Assoc 1992;87Ž 420.: Bshop CM. Neural networks for pattern recognton. New York: Oxford Unversty Press; Crstann N, Shawe-Taylor J. An ntroducton to support vector machnes. Cambrdge, UK: Cambrdge Unversty Press; Smola A. Learnng wth kernels, PhD thess, Techncal Unversty, Berln, Vapnk V. The nature of statstcal learnng theory. New York: Sprnger-Verlag; Vapnk V. Statstcal learnng theory. New York: John Wley & Sons; Suykens JAK, Lukas L, Van Dooren P, De Moor B, Vandewalle J. Least squares support vector machne classfers: a large scale algorthm. In: 14th European Conference on Crcuts Theory and Desgn, Stresa, Italy, p Suykens JAK, Vandewalle J. Multclass least squares support vector machnes. In: 10th Internatonal Jont Conference on Neural Networks, Washngton DC, Suykens JAK, Lukas L, Vandewalle J. Sparse least squares support vector machne classfers. In: 9th European Symposum on Artfcal Neural Networks, Bruges, Belgum; p Bellman RE. Adaptve control processes. Prnceton: Prnceton Unversty Press; John G, Kohav R, Pfleger K. Irrelevant features and the subset selecton problem. In: Machne Learnng: Proc Eleventh Int Conf, San Francsco, Calforna, p Almuallm H, Detterch TG. Learnng wth many rrelevant features. In: Nnth Nat Conf on Artfcal Intellgence, Anahem, Calforna. Menlo Park, CA: AAAI Press; p Kra K, Rendell LA. The feature selecton problem: Tradtonal methods and a new algorthm. In: Tenth Nat Conf on Artfcal Intellgence, San Jose, Calforna. Menlo Park, CA: AAAI Press; p

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye [email protected] [email protected] [email protected] Abstract - Stock market s one of the most complcated systems

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna [email protected] Abstract.

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements Lecture 3 Densty estmaton Mlos Hauskrecht [email protected] 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there

More information

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange

More information

Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending

Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending Proceedngs of 2012 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 25 (2012) (2012) IACSIT Press, Sngapore Bayesan Network Based Causal Relatonshp Identfcaton and Fundng Success

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

Statistical Methods to Develop Rating Models

Statistical Methods to Develop Rating Models Statstcal Methods to Develop Ratng Models [Evelyn Hayden and Danel Porath, Österrechsche Natonalbank and Unversty of Appled Scences at Manz] Source: The Basel II Rsk Parameters Estmaton, Valdaton, and

More information

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel

More information

A study on the ability of Support Vector Regression and Neural Networks to Forecast Basic Time Series Patterns

A study on the ability of Support Vector Regression and Neural Networks to Forecast Basic Time Series Patterns A study on the ablty of Support Vector Regresson and Neural Networks to Forecast Basc Tme Seres Patterns Sven F. Crone, Jose Guajardo 2, and Rchard Weber 2 Lancaster Unversty, Department of Management

More information

IMPACT ANALYSIS OF A CELLULAR PHONE

IMPACT ANALYSIS OF A CELLULAR PHONE 4 th ASA & μeta Internatonal Conference IMPACT AALYSIS OF A CELLULAR PHOE We Lu, 2 Hongy L Bejng FEAonlne Engneerng Co.,Ltd. Bejng, Chna ABSTRACT Drop test smulaton plays an mportant role n nvestgatng

More information

Performance Analysis and Coding Strategy of ECOC SVMs

Performance Analysis and Coding Strategy of ECOC SVMs Internatonal Journal of Grd and Dstrbuted Computng Vol.7, No. (04), pp.67-76 http://dx.do.org/0.457/jgdc.04.7..07 Performance Analyss and Codng Strategy of ECOC SVMs Zhgang Yan, and Yuanxuan Yang, School

More information

Credit Limit Optimization (CLO) for Credit Cards

Credit Limit Optimization (CLO) for Credit Cards Credt Lmt Optmzaton (CLO) for Credt Cards Vay S. Desa CSCC IX, Ednburgh September 8, 2005 Copyrght 2003, SAS Insttute Inc. All rghts reserved. SAS Propretary Agenda Background Tradtonal approaches to credt

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

2008/8. An integrated model for warehouse and inventory planning. Géraldine Strack and Yves Pochet

2008/8. An integrated model for warehouse and inventory planning. Géraldine Strack and Yves Pochet 2008/8 An ntegrated model for warehouse and nventory plannng Géraldne Strack and Yves Pochet CORE Voe du Roman Pays 34 B-1348 Louvan-la-Neuve, Belgum. Tel (32 10) 47 43 04 Fax (32 10) 47 43 01 E-mal: [email protected]

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

Single and multiple stage classifiers implementing logistic discrimination

Single and multiple stage classifiers implementing logistic discrimination Sngle and multple stage classfers mplementng logstc dscrmnaton Hélo Radke Bttencourt 1 Dens Alter de Olvera Moraes 2 Vctor Haertel 2 1 Pontfíca Unversdade Católca do Ro Grande do Sul - PUCRS Av. Ipranga,

More information

Multiple-Period Attribution: Residuals and Compounding

Multiple-Period Attribution: Residuals and Compounding Multple-Perod Attrbuton: Resduals and Compoundng Our revewer gave these authors full marks for dealng wth an ssue that performance measurers and vendors often regard as propretary nformaton. In 1994, Dens

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION

More information

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,

More information

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy 4.02 Quz Solutons Fall 2004 Multple-Choce Questons (30/00 ponts) Please, crcle the correct answer for each of the followng 0 multple-choce questons. For each queston, only one of the answers s correct.

More information

Gender Classification for Real-Time Audience Analysis System

Gender Classification for Real-Time Audience Analysis System Gender Classfcaton for Real-Tme Audence Analyss System Vladmr Khryashchev, Lev Shmaglt, Andrey Shemyakov, Anton Lebedev Yaroslavl State Unversty Yaroslavl, Russa [email protected], [email protected], [email protected],

More information

Project Networks With Mixed-Time Constraints

Project Networks With Mixed-Time Constraints Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa

More information

How To Calculate The Accountng Perod Of Nequalty

How To Calculate The Accountng Perod Of Nequalty Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada [email protected] Abstract Ths s a note to explan support vector machnes.

More information

Financial market forecasting using a two-step kernel learning method for the support vector regression

Financial market forecasting using a two-step kernel learning method for the support vector regression Ann Oper Res (2010) 174: 103 120 DOI 10.1007/s10479-008-0357-7 Fnancal market forecastng usng a two-step kernel learnng method for the support vector regresson L Wang J Zhu Publshed onlne: 28 May 2008

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy Fnancal Tme Seres Analyss Patrck McSharry [email protected] www.mcsharry.net Trnty Term 2014 Mathematcal Insttute Unversty of Oxford Course outlne 1. Data analyss, probablty, correlatons, vsualsaton

More information

Traffic-light a stress test for life insurance provisions

Traffic-light a stress test for life insurance provisions MEMORANDUM Date 006-09-7 Authors Bengt von Bahr, Göran Ronge Traffc-lght a stress test for lfe nsurance provsons Fnansnspetonen P.O. Box 6750 SE-113 85 Stocholm [Sveavägen 167] Tel +46 8 787 80 00 Fax

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

The Application of Fractional Brownian Motion in Option Pricing

The Application of Fractional Brownian Motion in Option Pricing Vol. 0, No. (05), pp. 73-8 http://dx.do.org/0.457/jmue.05.0..6 The Applcaton of Fractonal Brownan Moton n Opton Prcng Qng-xn Zhou School of Basc Scence,arbn Unversty of Commerce,arbn [email protected]

More information

Robust Design of Public Storage Warehouses. Yeming (Yale) Gong EMLYON Business School

Robust Design of Public Storage Warehouses. Yeming (Yale) Gong EMLYON Business School Robust Desgn of Publc Storage Warehouses Yemng (Yale) Gong EMLYON Busness School Rene de Koster Rotterdam school of management, Erasmus Unversty Abstract We apply robust optmzaton and revenue management

More information

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo

More information

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services An Evaluaton of the Extended Logstc, Smple Logstc, and Gompertz Models for Forecastng Short Lfecycle Products and Servces Charles V. Trappey a,1, Hsn-yng Wu b a Professor (Management Scence), Natonal Chao

More information

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts Power-of-wo Polces for Sngle- Warehouse Mult-Retaler Inventory Systems wth Order Frequency Dscounts José A. Ventura Pennsylvana State Unversty (USA) Yale. Herer echnon Israel Insttute of echnology (Israel)

More information

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation Exhaustve Regresson An Exploraton of Regresson-Based Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network *

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 819-840 (2008) Data Broadcast on a Mult-System Heterogeneous Overlayed Wreless Network * Department of Computer Scence Natonal Chao Tung Unversty Hsnchu,

More information

Logistic Regression. Steve Kroon

Logistic Regression. Steve Kroon Logstc Regresson Steve Kroon Course notes sectons: 24.3-24.4 Dsclamer: these notes do not explctly ndcate whether values are vectors or scalars, but expects the reader to dscern ths from the context. Scenaro

More information

L10: Linear discriminants analysis

L10: Linear discriminants analysis L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

BERNSTEIN POLYNOMIALS

BERNSTEIN POLYNOMIALS On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful

More information

Scale Dependence of Overconfidence in Stock Market Volatility Forecasts

Scale Dependence of Overconfidence in Stock Market Volatility Forecasts Scale Dependence of Overconfdence n Stoc Maret Volatlty Forecasts Marus Glaser, Thomas Langer, Jens Reynders, Martn Weber* June 7, 007 Abstract In ths study, we analyze whether volatlty forecasts (judgmental

More information

The Current Employment Statistics (CES) survey,

The Current Employment Statistics (CES) survey, Busness Brths and Deaths Impact of busness brths and deaths n the payroll survey The CES probablty-based sample redesgn accounts for most busness brth employment through the mputaton of busness deaths,

More information

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S S C H E D A E I N F O R M A T I C A E VOLUME 0 0 On Mean Squared Error of Herarchcal Estmator Stans law Brodowsk Faculty of Physcs, Astronomy, and Appled Computer Scence, Jagellonan Unversty, Reymonta

More information

THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION

THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION Internatonal Journal of Electronc Busness Management, Vol. 3, No. 4, pp. 30-30 (2005) 30 THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION Yu-Mn Chang *, Yu-Cheh

More information

Searching for Interacting Features for Spam Filtering

Searching for Interacting Features for Spam Filtering Searchng for Interactng Features for Spam Flterng Chuanlang Chen 1, Yun-Chao Gong 2, Rongfang Be 1,, and X. Z. Gao 3 1 Department of Computer Scence, Bejng Normal Unversty, Bejng 100875, Chna 2 Software

More information

Mining Feature Importance: Applying Evolutionary Algorithms within a Web-based Educational System

Mining Feature Importance: Applying Evolutionary Algorithms within a Web-based Educational System Mnng Feature Importance: Applyng Evolutonary Algorthms wthn a Web-based Educatonal System Behrouz MINAEI-BIDGOLI 1, and Gerd KORTEMEYER 2, and Wllam F. PUNCH 1 1 Genetc Algorthms Research and Applcatons

More information

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Luby s Alg. for Maximal Independent Sets using Pairwise Independence Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent

More information

Calculating the high frequency transmission line parameters of power cables

Calculating the high frequency transmission line parameters of power cables < ' Calculatng the hgh frequency transmsson lne parameters of power cables Authors: Dr. John Dcknson, Laboratory Servces Manager, N 0 RW E B Communcatons Mr. Peter J. Ncholson, Project Assgnment Manager,

More information

New Approaches to Support Vector Ordinal Regression

New Approaches to Support Vector Ordinal Regression New Approaches to Support Vector Ordnal Regresson We Chu [email protected] Gatsby Computatonal Neuroscence Unt, Unversty College London, London, WCN 3AR, UK S. Sathya Keerth [email protected]

More information

Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques

Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques Expert Systems wth Applcatons Expert Systems wth Applcatons 34 (2008) 313 327 www.elsever.com/locate/eswa Churn predcton n subscrpton servces: An applcaton of support vector machnes whle comparng two parameter-selecton

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008 Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn

More information

行 政 院 國 家 科 學 委 員 會 補 助 專 題 研 究 計 畫 成 果 報 告 期 中 進 度 報 告

行 政 院 國 家 科 學 委 員 會 補 助 專 題 研 究 計 畫 成 果 報 告 期 中 進 度 報 告 行 政 院 國 家 科 學 委 員 會 補 助 專 題 研 究 計 畫 成 果 報 告 期 中 進 度 報 告 畫 類 別 : 個 別 型 計 畫 半 導 體 產 業 大 型 廠 房 之 設 施 規 劃 計 畫 編 號 :NSC 96-2628-E-009-026-MY3 執 行 期 間 : 2007 年 8 月 1 日 至 2010 年 7 月 31 日 計 畫 主 持 人 : 巫 木 誠 共 同

More information

A PROBABILITY-MAPPING ALGORITHM FOR CALIBRATING THE POSTERIOR PROBABILITIES: A DIRECT MARKETING APPLICATION

A PROBABILITY-MAPPING ALGORITHM FOR CALIBRATING THE POSTERIOR PROBABILITIES: A DIRECT MARKETING APPLICATION Document de traval du LEM 2011-06 A PROBABILITY-MAPPIG ALGORITHM FOR CALIBRATIG THE POSTERIOR PROBABILITIES: A DIRECT MARKETIG APPLICATIO Krstof Coussement *, Wouter Bucknx ** * IESEG School of Management

More information

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE Yu-L Huang Industral Engneerng Department New Mexco State Unversty Las Cruces, New Mexco 88003, U.S.A. Abstract Patent

More information

Prediction of Stock Market Index Movement by Ten Data Mining Techniques

Prediction of Stock Market Index Movement by Ten Data Mining Techniques Vol. 3, o. Modern Appled Scence Predcton of Stoc Maret Index Movement by en Data Mnng echnques Phchhang Ou (Correspondng author) School of Busness, Unversty of Shangha for Scence and echnology Rm 0, Internatonal

More information

1.1 The University may award Higher Doctorate degrees as specified from time-to-time in UPR AS11 1.

1.1 The University may award Higher Doctorate degrees as specified from time-to-time in UPR AS11 1. HIGHER DOCTORATE DEGREES SUMMARY OF PRINCIPAL CHANGES General changes None Secton 3.2 Refer to text (Amendments to verson 03.0, UPR AS02 are shown n talcs.) 1 INTRODUCTION 1.1 The Unversty may award Hgher

More information

USING GOAL PROGRAMMING TO INCREASE THE EFFICIENCY OF MARKETING CAMPAIGNS

USING GOAL PROGRAMMING TO INCREASE THE EFFICIENCY OF MARKETING CAMPAIGNS Journal of Internatonal & Interdscplnary Busness Research Volume 2 Journal of Internatonal & Interdscplnary Busness Research Artcle 6 1-1-2015 USING GOAL PROGRAMMING TO INCREASE THE EFFICIENCY OF MARKETING

More information

Enabling P2P One-view Multi-party Video Conferencing

Enabling P2P One-view Multi-party Video Conferencing Enablng P2P One-vew Mult-party Vdeo Conferencng Yongxang Zhao, Yong Lu, Changja Chen, and JanYn Zhang Abstract Mult-Party Vdeo Conferencng (MPVC) facltates realtme group nteracton between users. Whle P2P

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall SP 2005-02 August 2005 Staff Paper Department of Appled Economcs and Management Cornell Unversty, Ithaca, New York 14853-7801 USA Farm Savngs Accounts: Examnng Income Varablty, Elgblty, and Benefts Brent

More information

PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIGIOUS AFFILIATION AND PARTICIPATION

PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIGIOUS AFFILIATION AND PARTICIPATION PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIIOUS AFFILIATION AND PARTICIPATION Danny Cohen-Zada Department of Economcs, Ben-uron Unversty, Beer-Sheva 84105, Israel Wllam Sander Department of Economcs, DePaul

More information

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,

More information

Methodology to Determine Relationships between Performance Factors in Hadoop Cloud Computing Applications

Methodology to Determine Relationships between Performance Factors in Hadoop Cloud Computing Applications Methodology to Determne Relatonshps between Performance Factors n Hadoop Cloud Computng Applcatons Lus Eduardo Bautsta Vllalpando 1,2, Alan Aprl 1 and Alan Abran 1 1 Department of Software Engneerng and

More information

HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA*

HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* Luísa Farnha** 1. INTRODUCTION The rapd growth n Portuguese households ndebtedness n the past few years ncreased the concerns that debt

More information

An MILP model for planning of batch plants operating in a campaign-mode

An MILP model for planning of batch plants operating in a campaign-mode An MILP model for plannng of batch plants operatng n a campagn-mode Yanna Fumero Insttuto de Desarrollo y Dseño CONICET UTN [email protected] Gabrela Corsano Insttuto de Desarrollo y Dseño

More information

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching) Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton

More information

Least 1-Norm SVMs: a New SVM Variant between Standard and LS-SVMs

Least 1-Norm SVMs: a New SVM Variant between Standard and LS-SVMs ESANN proceedngs, European Smposum on Artfcal Neural Networks - Computatonal Intellgence and Machne Learnng. Bruges (Belgum), 8-3 Aprl, d-sde publ., ISBN -9337--. Least -Norm SVMs: a New SVM Varant between

More information

High Correlation between Net Promoter Score and the Development of Consumers' Willingness to Pay (Empirical Evidence from European Mobile Markets)

High Correlation between Net Promoter Score and the Development of Consumers' Willingness to Pay (Empirical Evidence from European Mobile Markets) Hgh Correlaton between et Promoter Score and the Development of Consumers' Wllngness to Pay (Emprcal Evdence from European Moble Marets Ths paper shows that the correlaton between the et Promoter Score

More information

Gaining Insights to the Tea Industry of Sri Lanka using Data Mining

Gaining Insights to the Tea Industry of Sri Lanka using Data Mining Proceedngs of the Internatonal MultConference of Engneers and Computer Scentsts 2008 Vol I Ganng Insghts to the Tea Industry of Sr Lanka usng Data Mnng H.C. Fernando, W. M. R Tssera, and R. I. Athauda

More information

Time Delayed Independent Component Analysis for Data Quality Monitoring

Time Delayed Independent Component Analysis for Data Quality Monitoring IWSSIP 1-17th Internatonal Conference on Systems, Sgnals and Image Processng Tme Delayed Independent Component Analyss for Data Qualty Montorng José Márco Faer Sgnal Processng Laboratory, COE/Pol Federal

More information

When Talk is Free : The Effect of Tariff Structure on Usage under Two- and Three-Part Tariffs

When Talk is Free : The Effect of Tariff Structure on Usage under Two- and Three-Part Tariffs 0 When Talk s Free : The Effect of Tarff Structure on Usage under Two- and Three-Part Tarffs Eva Ascarza Ana Lambrecht Naufel Vlcassm July 2012 (Forthcomng at Journal of Marketng Research) Eva Ascarza

More information

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION Vson Mouse Saurabh Sarkar a* a Unversty of Cncnnat, Cncnnat, USA ABSTRACT The report dscusses a vson based approach towards trackng of eyes and fngers. The report descrbes the process of locatng the possble

More information

PEER REVIEWER RECOMMENDATION IN ONLINE SOCIAL LEARNING CONTEXT: INTEGRATING INFORMATION OF LEARNERS AND SUBMISSIONS

PEER REVIEWER RECOMMENDATION IN ONLINE SOCIAL LEARNING CONTEXT: INTEGRATING INFORMATION OF LEARNERS AND SUBMISSIONS PEER REVIEWER RECOMMENDATION IN ONLINE SOCIAL LEARNING CONTEXT: INTEGRATING INFORMATION OF LEARNERS AND SUBMISSIONS Yunhong Xu, Faculty of Management and Economcs, Kunmng Unversty of Scence and Technology,

More information

Detecting Credit Card Fraud using Periodic Features

Detecting Credit Card Fraud using Periodic Features Detectng Credt Card Fraud usng Perodc Features Alejandro Correa Bahnsen, Djamla Aouada, Aleksandar Stojanovc and Björn Ottersten Interdscplnary Centre for Securty, Relablty and Trust Unversty of Luxembourg,

More information

Loop Parallelization

Loop Parallelization - - Loop Parallelzaton C-52 Complaton steps: nested loops operatng on arrays, sequentell executon of teraton space DECLARE B[..,..+] FOR I :=.. FOR J :=.. I B[I,J] := B[I-,J]+B[I-,J-] ED FOR ED FOR analyze

More information

The Use of Analytics for Claim Fraud Detection Roosevelt C. Mosley, Jr., FCAS, MAAA Nick Kucera Pinnacle Actuarial Resources Inc.

The Use of Analytics for Claim Fraud Detection Roosevelt C. Mosley, Jr., FCAS, MAAA Nick Kucera Pinnacle Actuarial Resources Inc. Paper 1837-2014 The Use of Analytcs for Clam Fraud Detecton Roosevelt C. Mosley, Jr., FCAS, MAAA Nck Kucera Pnnacle Actuaral Resources Inc., Bloomngton, IL ABSTRACT As t has been wdely reported n the nsurance

More information

A Genetic Programming Based Stock Price Predictor together with Mean-Variance Based Sell/Buy Actions

A Genetic Programming Based Stock Price Predictor together with Mean-Variance Based Sell/Buy Actions Proceedngs of the World Congress on Engneerng 28 Vol II WCE 28, July 2-4, 28, London, U.K. A Genetc Programmng Based Stock Prce Predctor together wth Mean-Varance Based Sell/Buy Actons Ramn Rajaboun and

More information

A Hierarchical Anomaly Network Intrusion Detection System using Neural Network Classification

A Hierarchical Anomaly Network Intrusion Detection System using Neural Network Classification IDC IDC A Herarchcal Anomaly Network Intruson Detecton System usng Neural Network Classfcaton ZHENG ZHANG, JUN LI, C. N. MANIKOPOULOS, JAY JORGENSON and JOSE UCLES ECE Department, New Jersey Inst. of Tech.,

More information

Feasibility of Using Discriminate Pricing Schemes for Energy Trading in Smart Grid

Feasibility of Using Discriminate Pricing Schemes for Energy Trading in Smart Grid Feasblty of Usng Dscrmnate Prcng Schemes for Energy Tradng n Smart Grd Wayes Tushar, Chau Yuen, Bo Cha, Davd B. Smth, and H. Vncent Poor Sngapore Unversty of Technology and Desgn, Sngapore 138682. Emal:

More information