Customer Segmentation Using Clustering and Data Mining Techniques

Size: px
Start display at page:

Download "Customer Segmentation Using Clustering and Data Mining Techniques"

Transcription

1 Internatonal Journal of Computer Theory and Engneerng, Vol. 5, No. 6, December 2013 Customer Segmentaton Usng Clusterng and Data Mnng Technques Kshana R. Kashwan, Member, IACSIT, and C. M. Velu fronter research topc n computer scence and engneerng. For clusterng method, the most mportant property s that a tuple of partcular cluster s more lely to be smlar to the other tuples wthn the same cluster than the tuples of other clusters. For classfcaton, the smlarty measure s defned as sm (t, tl), between any two tuples, t, tj D. For a gven cluster, Km of N ponts {tml, tm2... tmn}, the centrod s defned as the mddle of the cluster. Many of the clusterng algorthms assume that the cluster s represented by centrally located one object n the cluster, called a medod. The radus s the square root of the average mean squared dstance from any pont n the cluster to the centrod. We use the notaton Mm to ndcate the medod for cluster Km. For gven clusters K and Kj, there are several ways to determne the dstance between the clusters. A natural choce of dstance s Eucldean dstance measure [5]. Sngle ln s defned as smallest dstance between elements n dfferent clusters gven by ds(k, Kj) = mn(dst(t1, tjm)) tl K Kj and tjm K Kj. The complete ln s defned as the largest dstance between elements n dfferent clusters gven by ds (K, Kj) = max (ds (tl, tjm)), tl K Kj and tjm Kj Kj. The average ln s the average dstance between elements n dfferent clusters. We thus have, ds(k, Kj) = mean(ds(tl, tjm)), tl K Kj, tjm Kj Kj. If clusters are represented by centrods, the dstance between two clusters s the dstance between ther respectve centrods. We thus have, ds (K, Kj ) = ds (C, Cj), where C and Cj are the centrod for K and Kj respectvely. If each cluster s represented by ts medod then the dstance between the cluster can be defned as the dstance between medods whch can be gven as ds ( K, Kj )=ds (M, Mj ), where M and Mj are the Medod for K and Kj respectvely Abstract Clusterng technque s crtcally mportant step n data mnng process. It s a multvarate procedure qute sutable for segmentaton applcatons n the maret forecastng and plannng research. Ths research paper s a comprehensve report of -means clusterng technque and SPSS Tool to develop a real tme and onlne system for a partcular super maret to predct sales n varous annual seasonal cycles. The model developed was an ntellgent tool whch receved nputs drectly from sales data records and automatcally updated segmentaton statstcs at the end of day s busness. The model was successfully mplemented and tested over a perod of three months. A total of n = 2138, customer, were tested for observatons whch were then dvded nto = 4 smlar groups. The classfcaton was based on nearest mean. An ANOVA analyss was also carred out to test the stablty of the clusters. The actual day to day sales statstcs were compared wth predcted statstcs by the model. Results were qute encouragng and had shown hgh accuracy. Index Terms Cluster analyss, data mnng, customer segmentaton, ANOVA analyss. I. INTRODUCTION Hghlght Clusterng s a statstcal technque much smlar to classfcaton. It sorts raw data nto meanngful clusters and groups of relatvely homogeneous observatons. The objects of a partcular cluster have smlar characterstcs and propertes but dffer wth those of other clusters. The groupng s accomplshed by fndng smlartes among data accordng to characterstcs found n raw data [1]. The man objectve was to fnd optmum number of clusters. There are two basc types of clusterng methods, herarchcal and non-herarchcal. Clusterng process s not one tme tas but s contnuous and an teratve process of nowledge dscovery from huge quanttes of raw and unorganzed data [2]. For a partcular classfcaton problem, an approprate clusterng algorthm and parameters must be selected for obtanng optmum results. [3]. Clusterng s a type of exploratve data mnng used n many applcaton orented areas such as machne learnng, classfcaton and pattern recognton [4]. In recent tmes, data mnng s ganng much faster momentum for nowledge based servces such as dstrbuted and grd computng. Cloud computng s yet another example of II. K-MEANS CLUSTERING TECHNIQUE The algorthm s called -means due to the fact that the letter represents the number of clusters chosen. An observaton s assgned to a partcular cluster for whch ts dstance to the cluster mean s the smallest. The prncpal functon of algorthm nvolves fndng the -means. Frst, an ntal set of means s defned and then subsequent classfcaton s based on ther dstances to the centres [6]. Next, the clusters mean s computed agan and then reclassfcaton s done based on the new set of means. Ths s repeated untl cluster means don t change much between successve teratons [7]. Fnally, the means of the clusters once agan calculated and then all the cases are assgned to the permanent clusters. Gven a set of observatons (x1, x2,.., xn), where each observaton x s a d-dmensonal real vector. The -means clusterng algorthm ams to partton the n observatons nto Manuscrpt receved December 25, 2012; revsed February 28, Kshana R. Kashwan s wth the Department of Electroncs and Communcaton Engneerng PG, Sona College of Technology (An Autonomous Insttuton Afflated to Anna Unversty), TPT Road, Salem , Taml Nadu, Inda (e-mal: drrashwan@gmal.com, drrashwan@sonatech.ac.n). C. M. Velu s wth the Department of CSE, Dattaala Group of Insttutons, Swam Chnchol, Daund, Pune , Inda (e-mal: cmvelu41@gmal.com). DOI: /IJCTE.2013.V

2 groups of observatons called clusters where n, so as to mnmze the sum of squares of dstances between observatons wthn a partcular cluster [8]. As shown n Table I, the sum of squares of the dstance may be gven by the equaton arg mn S = Σ =1 Σ sj s x j μ j 2, where μ s the mean of ponts n S. Gven an ntal set, -means computes ntal means m (1) (1) 1,,m and t dentfes clusters n gven raw data set. TABLE I: K-MEANS ALGORITHM Smplfed smulaton flow of -means algorthm Begn Inputs: X = (x 1, x 2,.., x n) Determne: Clusters Intal Centrods - C 1, C 2,.., C Assgn each nput to the cluster wth the closest centrod Determne: Update Centrods - C 1, C 2,.., C Repeat: Untl Centrods don t change sgnfcantly (specfed threshold value) Output: Fnal Stable Centrods - C 1, C 2,.., C End In most of the cases, -means s qute slow to converge. For very accurate condtons, t taes qute a long tme to converge exponentally. A reasonable threshold value may be specfed for conversng n most of the cases to produce quc results wthout compromsng much accuracy [9]. As shown n Table II, the Sum of Square of Errors (SSE) may be consderably reduced by defnng more number of clusters. It s always desrable to mprove SSE wthout ncreasng number of clusters whch s possble due to the fact that -means converges to a local mnmum [10]. To decrease SSE, a cluster may be splt or a new cluster centrod may be ntroduced. TABLE II: BISECTING OF K-MEANS ALGORITHM Bsectng sequence of -means algorthm Begn Intalze clusters Do: Remove a cluster from lst Select a cluster and bsect t usng -means algorthm Compute SSE Choose from bsected clusters one wth least SSE Add bsected clusters to the lst of clusters Repeat: Untl the number of cluster have been reached to End To ncrease SSE, a cluster may be dspersed or two clusters may be merged. To obtan -clusters from a set of all observaton ponts, the observaton ponts are splt nto two clusters and agan one of these clusters s splt further nto two clusters. Intally a cluster of largest sze or a cluster wth largest SSE may be chosen for splttng process. Ths s repeated untl the numbers of clusters have been produced. Thus t s easly observable that the SSE can be changed by splttng or mergng the clusters [11]. Ths specfc property of the -means clusterng s very much desrable for maretng segmentaton research. The new SSE s agan computed after updatng cluster centrod. Ths s repeated untl SSE s reached to a mnmum value or becomes constant wthout changng further, a condton smlar to congruence. The SSE s represented mathematcally by SSE = Σ =1 (μ - x) 2 where μ s the centrod of th cluster represented by c and x s any pont n the same cluster. A condton for achevng mnmum SSE can be easly computed by dfferentatng SSE, settng t equal to 0 and then solvng the equaton [12]. SSE m 1 1 xc xc ( x) 2 ( x ) 0 xc xc x 2 ( x) Here m s total number of elements and μ K s centrod n th cluster c. Further t can be smplfed as 1 m xc Ths concludes that the mnmum SSE can be acheved under the condton of the centrod of the cluster beng equal to the mean of the ponts n the th cluster c. III. MARKET SEGMENTATION SURVEY The maret segmentaton s a process to dvde customers nto homogeneous groups whch have smlar characterstcs such as buyng habts, lfe style, food preferences etc. [13]. Maret segmentaton s one of the most fundamental strategc plannng and maretng concepts wheren groupng of people s done under dfferent categores such as the eenness, purchasng capablty and the nterest to buy. The segmentaton operaton s performed accordng to smlarty n people n several dmensons related to a product under consderaton. The more accurately and approprately the segments performed for targetng customers by a partcular organzaton, the more successful the organzaton s n the maretplace. The man objectve of maret segmentaton s accurately predctng the needs of customers and thereby ntern mprovng the proftablty by procurng or manufacturng products n rght quantty at tme for the rght customer at optmum cost. To meet these strngent requrements -means clusterng technque may be appled for maret segmentaton to arrve at an approprate forecastng and plannng decsons [14]. It s possble to classfy objects such as brands, products, utlty, durablty, ease of use etc wth cluster analyss [15]. For example, whch brands are clustered together n terms of consumer perceptons for a postonng exercse or whch ctes are clustered together n terms of ncome, qualfcaton etc. [16]. The data set conssted of usages of brands under dfferent condtons, demographc varables and varyng atttudes of the customers. The respondents consttuted a representatve random sample of 2138 as data ponts from customer transactons n a retal super maret where varous household x 2 857

3 products were sold to ts customers. The survey was carred for a perod of about 3 months. The modellng and testng of maret segmentaton usng clusterng for forecastng was based on the customers of a leadng super maret retal house hold suppler located at Chenna branch, Inda. The organzaton s name and the varables drectly related to the organzaton are delberately suppressed to mantan confdentalty as per our agreement. It was requred to map the profle of the target customers n terms of lfestyle, atttudes and perceptons. The man objectve was to measure mportant varables or factors whch can lead to vtal nputs for decson mang n forecastng. The survey contaned 15 dfferent questonnares as gven below n Table III. preferred emals to wrtng letters whereas 306 customers only agreed that they preferred emal. Smlarly 541 customers strongly dsagreed wth the dea of emals, may be they ddn t have access to nternet or otherwse and so on. Graphcal response for 5 scale pont s shown n Fg. 1. The cluster mappng vsualzaton of response matrx s llustrated n Fg. 2, whch shows that dstrbuton of the responses s qute wde and scattered but farly unform. TABLE III: VARIABLES CHOSEN Dfferent varables used for maretng segmentaton Var1: Prefer emal to wrtng a letter Var2: Feel that qualty products are prced hgher Var3: Thn wsely before buyng anythng Var4: Televson s a major source of entertanment Var5: A entertanment s a necessty rather than a luxury Var6: Prefers fast food and ready-to-use products Var7: More health-conscous Var8: Competton mproves the qualty Var9: Women are actve partcpants n purchase Var10:The advertsements can play a postve role Var11: Enjoy watchng moves Var12: Le modern style and fashon Var13: Prefers branded products Var14: Prefer outng on weeends Var15: Prefer to pay by credt card than cash Fg. 1. Response of customer on the fve pont scale. Fg. 2. Customer response mappng for questonnare. TABLE IV: CUSTOMER RESPONSE ON THE SCALE OF FIVE POINTS Varable Strongly agree (5) Agree (4) Dsagree (2) No Opnon (3) Strongly dsagree (1) Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var A fve pont ratng scale was used to represent varables n segmentaton. For ths, the customers were ased to gve ther response n categores of strongly agree as 5, agree as 4, No Opnon as 3, dsagree as 2 and strongly dsagree as 1. The Eucldean dstance was used to measure the clusterng analyss. Eucldean dstance s deally sutable for smlar nterval scaled varables. The nput data matrx of 2138 respondents wth 15 varables s shown n Table IV. Ths could be explaned as 412 customers strongly agreed that they TABLE V: INITIAL AND FINAL CONVERGED CUSTOMER CENTERS Varable Intal Cluster Centre Fnal Cluster Centre Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Further on, the clusterng was carred out as explaned n Secton II. The value of was chosen as 4 and t was desred to now that what nd of 4 groups exsted n the data set of customer response matrx. For the values gven n Table IV, -means clusterng s computed by usng standard SPSS pacage. Table V shows ntal and fnally converged cluster centers wth ther means. The ntal cluster vsualzaton s shown n Fg. 3 wth observaton of qute scattered dstrbuton. Intal centers were randomly selected, thus had wde varatons and then SPSS 858

4 teratons were performed untl there was no sgnfcant change n the poston of cluster centers. Ths condton s called as convergence and as a result of t, fnally refned and stable cluster centers, as llustrated n Fg. 4, was acheved. Fg. 3. Intal clusters dstrbuton as chosen randomly. remanng other varables are statstcally sgnfcant as they all have probabltes > TABLE VI: ANOVA ANALYSIS Varable Cluster MS Error MS F-Statstc P-value VAR VAR VAR VAR VAR VAR VAR VAR VAR VAR VAR VAR VAR VAR VAR Fg. 4. Fnal clusters dstrbuton as computed by SPSS. IV. STATISTICAL SIMULATION TESTS There are many statstcal tests whch are normally used to perform the clusterng process. In statstcs, qute often a very mportant tool called Analyss of Varance (ANOVA) s employed for varous analyss and data processng ncludng clusterng and data mnng. ANOVA test gves sutable nference whle splttng and mergng the clusters dynamcally. The stablty of the clusters can be checed through splttng the sample and repeatng the cluster analyss. Typcally an ANOVA conssts of varous statstcal models and ther related procedures, n whch varance s observed n a partcular random varable. The varance s dvded nto dfferent components whch can be attrbuted to dfferent sources of varaton n random varable. The ANOVA s nvarably used n comparng more than one means or centrods. In ts smplest form ANOVA provdes a statstcal test of whether or not the means or centrods of several groups of random varables are all equal or not. The ANOVA statstcs appled on data collected for maretng segmentatons s lsted n Table VI. It was very clear from that statstc test that whch of the 15 chosen varables were sgnfcantly dfferent across the 4 fnal clusters obtaned by -means clusterng as shown n Table VI. The last column ndcates that the varables 2, 7, 8, 10, 11, 12 and 13 are nsgnfcant at the confdence level of 0.10 snce all the probabltes of these varables are < The The Table V shows the dfference between ntal and fnal centers. It can be observed that there s a dfference of 0.8 n the coeffcents between the cluster-3 soluton of ntal stage and fnal stage for varable 2. The hghest dfference s 1.6, between ntal stage and fnal stage of cluster-1 of varable 3. Cluster formaton s spatally qute dfferent for ntal and fnal stages as llustrated n Fg. 3 and Fg. 4. For maret segmentaton problem, Table IV gves the outputs of -means clusterng for the = 4 (chosen value) for stable and refned fnal cluster centers. Subsequently null hypothess s tested by usng ANOVA method as llustrated by Table VI. Ths can be explaned further as cluster-1 s descrbed by the mean value of 1.80 for Var1. Smlarly the mean value s 2.60 for Var2 and 3.40 for Var3 and so on. In the same way cluster-2 s descrbed by the mean of 3.13 for Var1, 3.50 for Var2 and 2.50 for Var3 and so on. The 15 varables were taen nto consderaton and tred to nterpret the sgnfcance of the clusters formed thereof. The graphcal llustraton of ANOVA analyss s depcted n Fg. 5 for chosen statstcal values of cluster mean, mean square error, F-statstcs and P-value. Fg. 5. Statstcal dstrbuton analyss of clusters. For example, for cluster-1, the mean of 1.80 for Var1 nterprets that people prefer emal. Smlarly, a mean of 3.40 for Var3 ndcates that the peoples are careful whle spendng. 859

5 The mean of 2.60 for Var2 says that the qualty products come always at hgher prce. For these same varables, cluster-2 shows that people prefer conventonal letters to e-mal whch s ndcated by the mean of 3.13 for Var1. The people who do not prefer hgh prce for good qualty s shown by the mean of 3.50 for Var2 and tend to be neutral about care n spendng wth mean of 2.50 for Var3. Smlarly, the varables for cluster-3 and cluster-4 can be compared. For the gven maret segmentaton problem, 4 clusters were analyzed for varous consderaton. Cluster-1 ndcated that the varables whch were ncluded n ths cluster namely, varable1es 1, 2, 5, 12, 13, 14 not le varables 4, 6, 9, 11, 15 and not sure of varable 7. The derved nference from ths was thus exhbtng many tradtonal values, except that they had adopted to emal use. They had also begun to spend more lberally and were probably n the transton process of few other factors le acceptance of women as decson maers and more use of credt cards as convenence. Cluster-2 ndcated that the varables found n the clusters, namely, 1, 5, 8, 9, 15 were not n the same statstcal characterstcs as compared to varables 3, 4, 6, 7, 10, 11, 12, 13, 14. They beleved n negotatons or were aggressve buyers. It could be concluded that, t was a group whch led to use credt cards, spent more freely, beleved n women power, beleved n economcs rather than poltcs and felt qualty products could be worth purchasng. Also, they seemed to have taste of modern lfe style and were fashon orented. Cluster-4 gave out an analyss that the varables 2, 4, 5, 7, 10 belong to ths cluster, had opposte statstcal characterstcs to the varable 1, 3, 6, 8, 11, 12, 13 and were neutral n comparson to varables 14, 15. It was concluded that, ths group was optmstc, free spendng and a good target for TV advertsng, partcularly consumer durables tems and entertanment. But they need not to get necessarly nfluenced by brands. They wanted value for money, n case f they understood that the tem was worth, they would tend to buy the same. V. CONCLUSION AND DISCUSSIONS In summary, the cluster analyss of the chosen sample of respondents explaned a lot about the possble segments whch exsted n the target customer populaton. Once the number of clusters was dentfed, a -means clusterng algorthm, whch s a non-herarchcal method, was used. For computng -means clusterng, the ntal cluster centers were chosen and then fnal stable cluster centers were computed by contnung number of teratons untl means had stopped further changng wth next teratons. Ths convergent condton was also acheved by settng a threshold value for change n the mean. The fnal cluster centers contaned the mean values for each varable n each cluster. Also, ths was nterpreted n mult-dmensonal projectons related to maret forecastng and plannng. To chec the stablty of the clusters, the sample data was frst splt nto two parts and was checed that whether smlar stable and dstnct clusters emerged from both the sub-samples. These analyses at the end provded further llustratons of usng cluster method for maret segmentaton for forecastng. Computng based system developed was an ntellgent and t automatcally presented results to the mangers to nfer for quc and fast decson mang process. The smulaton tests were also computed for cluster brands and other characterstcs of the cluster representng a partcular class of people. The future wor wll nvolve more trals and automaton of the maret forecastng and plannng ACKNOWLEDGMENT The authors feel deeply ndebted and thanful to all who opned for techncal nowhow and helped n collecton of maret data. Authors also feel thanful to all customers who volunteered for feedbac and transactonal nformaton. Authors feel thanful to ther famly members for constant support and motvaton. REFERENCES [1] I. S. Dhllon and D. M. Modha, Concept decompostons for large sparse text data usng clusterng, Machne Learnng, vol. 42, ssue 1, pp , [2] T. Kanungo, D. M. Mount, N. S. Netanyahu, C. D. Pato, R. Slverman, and A. Y. Wu, An effcent K-means clusterng algorthm, IEEE Trans. Pattern Analyss and Machne Intellgence, vol. 24, pp , [3] MacKay and Davd, An Example Inference Tas: Clusterng, Informaton Theory, Inference and Learnng Algorthms, Cambrdge Unversty Press, pp , [4] M. Inaba, N. Katoh, and H. Ima, Applcatons of weghted Vorono dagrams and randomzaton to varance-based -clusterng, n Proc.10 th ACM Symposum on Computatonal Geometry, 1994, pp [5] D. Alose, A. Deshpande, P. Hansen, and P. Popat, NP-hard Eucldean sum-of-squares clusterng, Machne Learnng, vol. 75, pp , [6] S. Dasgupta and Y. Freund, Random Trees for Vector Quantzaton, IEEE Trans. on Informaton Theory, vol. 55, pp , [7] M. Mahajan, P. Nmbhorar, and K. Varadarajan, The Planar K-Means Problem s NP-Hard, LNCS, Sprnger, vol. 5431, pp , [8] A. Vattan, K-means exponental teratons even n the plane, Dscrete and Computatonal Geometry, vol. 45, no. 4, pp , [9] C. Elan, Usng the trangle nequalty to accelerate K-means, n Proc. the 12 th Internatonal Conference on Machne Learnng (ICML), [10] H. Zha, C. Dng, M. Gu, X. He, and H. D. Smon, Spectral Relaxaton for K-means Clusterng, Neural Informaton Processng Systems, Vancouver, Canada, vol.14, pp , [11] C. Dng and X.-F. He, K-means Clusterng va Prncpal Component Analyss, n Proc. Int'l Conf. Machne Learnng (ICML), 2004, pp [12] P.-N. Tan, V. Kumar, and M. Stenbach, Introducton to Data Mnng, Pearson Educato Inc. and Dorlng Kndersley (Inda) Pvt. Ltd., New Delh and Chenna Mcro Prnt Pvt. Ltd., Inda, [13] D. D. S. Garla and G. Charaborty, Comparson of Probablstc-D and -Means Clusterng n Segment Profles for B2B Marets, SAS Global Forum 2011, Management, SAS Insttute Inc., USA. [14] H.-B. Wang, D. Huo, J. Huang, Y.-Q. Xu, L.-X. Yan, W. Sun, X.-L. L, and Jr. A. R. Sanchez, An approach for mprovng K-means algorthm on maret segmentaton, n Proc. Internatonal Conference on Systme Scence and Engneerng (ICSSE), IEEE Xplore, [15] H. Hruscha and M. Natter, Comparng performance of feedforward neural nets and K-means for cluster-based maret segmentaton, European Journal of Operatonal Research, Elsver Scence, vol. 114, pp , [16] P. Ahmad, Pharmaceutcal Maret Segmentaton Usng GA- K-means, European Journal of Economcs, Fnance and Admnstratve Scences, ssue 22,

6 Kshana R. Kashwan receved the degrees of M. Tech. n Electroncs Desgn and Technology and Ph.D. n Electroncs and Communcaton Engneerng from Tezpur Unversty (a central unversty of Inda), Tezpur, Inda, n 2002 and 2007 respectvely. Presently he s a professor and Dean of Post Graduate Studes n the department of Electroncs and Engneerng (Post Graduate Studes), Sona College of Technology (An Autonomous Insttuton Afflated to Anna Unversty), TPT Road, Salem , Taml Nadu, Inda. He has publshed extensvely at nternatonal and natonal level and has travelled to many countres. Hs research areas are VLSI Desgn, Communcaton Systems, Crcuts and Systems and SoC / PSoC. He s also drector of the Centre of Excellence n VLSI Desgn and Embedded SoC at Sona College of Technology. He s a member of Academc Councl, Research Commttee and Board of Studes of Electroncs and Communcaton Engneerng at Sona College of Technology. He has successfully guded many scholars for ther master s and doctoral theses. Kashwan has completed many funded research projects. Currently, he s worng on a few funded projects from Government of Inda. Dr. Kashwan s a member of IEEE, IASTED and Senor Member of IACSIT. He s a lfe member of ISTE and Member of IE (Inda). C. M. Velu receved hs M. E n CSE from Sathyabama Unversty. He s currently pursung hs doctoral program under the faculty of Informaton and Communcaton Engneerng regstered at Anna Unversty Chenna, Inda. He has vsted UAE as a Computer faculty. He served as faculty of CSE for more than two and half decades. He has publshed many research papers n nternatonal and natonal journals. Hs areas of nterest are Data Warehousng and Data Mnng, Artfcal Intellgence, Artfcal Neural Networs, Dgtal Image Processng and Pattern Recognton. 861

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications CMSC828G Prncples of Data Mnng Lecture #9 Today s Readng: HMS, chapter 9 Today s Lecture: Descrptve Modelng Clusterng Algorthms Descrptve Models model presents the man features of the data, a global summary

More information

A DATA MINING APPLICATION IN A STUDENT DATABASE

A DATA MINING APPLICATION IN A STUDENT DATABASE JOURNAL OF AERONAUTICS AND SPACE TECHNOLOGIES JULY 005 VOLUME NUMBER (53-57) A DATA MINING APPLICATION IN A STUDENT DATABASE Şenol Zafer ERDOĞAN Maltepe Ünversty Faculty of Engneerng Büyükbakkalköy-Istanbul

More information

A Simple Approach to Clustering in Excel

A Simple Approach to Clustering in Excel A Smple Approach to Clusterng n Excel Aravnd H Center for Computatonal Engneerng and Networng Amrta Vshwa Vdyapeetham, Combatore, Inda C Rajgopal Center for Computatonal Engneerng and Networng Amrta Vshwa

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

Credit Limit Optimization (CLO) for Credit Cards

Credit Limit Optimization (CLO) for Credit Cards Credt Lmt Optmzaton (CLO) for Credt Cards Vay S. Desa CSCC IX, Ednburgh September 8, 2005 Copyrght 2003, SAS Insttute Inc. All rghts reserved. SAS Propretary Agenda Background Tradtonal approaches to credt

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

How To Calculate The Accountng Perod Of Nequalty

How To Calculate The Accountng Perod Of Nequalty Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.

More information

NEURO-FUZZY INFERENCE SYSTEM FOR E-COMMERCE WEBSITE EVALUATION

NEURO-FUZZY INFERENCE SYSTEM FOR E-COMMERCE WEBSITE EVALUATION NEURO-FUZZY INFERENE SYSTEM FOR E-OMMERE WEBSITE EVALUATION Huan Lu, School of Software, Harbn Unversty of Scence and Technology, Harbn, hna Faculty of Appled Mathematcs and omputer Scence, Belarusan State

More information

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts Power-of-wo Polces for Sngle- Warehouse Mult-Retaler Inventory Systems wth Order Frequency Dscounts José A. Ventura Pennsylvana State Unversty (USA) Yale. Herer echnon Israel Insttute of echnology (Israel)

More information

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services An Evaluaton of the Extended Logstc, Smple Logstc, and Gompertz Models for Forecastng Short Lfecycle Products and Servces Charles V. Trappey a,1, Hsn-yng Wu b a Professor (Management Scence), Natonal Chao

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12 14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,

More information

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION Vson Mouse Saurabh Sarkar a* a Unversty of Cncnnat, Cncnnat, USA ABSTRACT The report dscusses a vson based approach towards trackng of eyes and fngers. The report descrbes the process of locatng the possble

More information

Mining Multiple Large Data Sources

Mining Multiple Large Data Sources The Internatonal Arab Journal of Informaton Technology, Vol. 7, No. 3, July 2 24 Mnng Multple Large Data Sources Anmesh Adhkar, Pralhad Ramachandrarao 2, Bhanu Prasad 3, and Jhml Adhkar 4 Department of

More information

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES Zuzanna BRO EK-MUCHA, Grzegorz ZADORA, 2 Insttute of Forensc Research, Cracow, Poland 2 Faculty of Chemstry, Jagellonan

More information

THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION

THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION Internatonal Journal of Electronc Busness Management, Vol. 3, No. 4, pp. 30-30 (2005) 30 THE APPLICATION OF DATA MINING TECHNIQUES AND MULTIPLE CLASSIFIERS TO MARKETING DECISION Yu-Mn Chang *, Yu-Cheh

More information

Project Networks With Mixed-Time Constraints

Project Networks With Mixed-Time Constraints Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa

More information

320 The Internatonal Arab Journal of Informaton Technology, Vol. 5, No. 3, July 2008 Comparsons Between Data Clusterng Algorthms Osama Abu Abbas Computer Scence Department, Yarmouk Unversty, Jordan Abstract:

More information

Cluster Analysis. Cluster Analysis

Cluster Analysis. Cluster Analysis Cluster Analyss Cluster Analyss What s Cluster Analyss? Types of Data n Cluster Analyss A Categorzaton of Maor Clusterng Methos Parttonng Methos Herarchcal Methos Densty-Base Methos Gr-Base Methos Moel-Base

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

An Adaptive and Distributed Clustering Scheme for Wireless Sensor Networks

An Adaptive and Distributed Clustering Scheme for Wireless Sensor Networks 2007 Internatonal Conference on Convergence Informaton Technology An Adaptve and Dstrbuted Clusterng Scheme for Wreless Sensor Networs Xnguo Wang, Xnmng Zhang, Guolang Chen, Shuang Tan Department of Computer

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION

More information

Data Mining from the Information Systems: Performance Indicators at Masaryk University in Brno

Data Mining from the Information Systems: Performance Indicators at Masaryk University in Brno Data Mnng from the Informaton Systems: Performance Indcators at Masaryk Unversty n Brno Mkuláš Bek EUA Workshop Strasbourg, 1-2 December 2006 1 Locaton of Brno Brno EUA Workshop Strasbourg, 1-2 December

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

A Secure Password-Authenticated Key Agreement Using Smart Cards

A Secure Password-Authenticated Key Agreement Using Smart Cards A Secure Password-Authentcated Key Agreement Usng Smart Cards Ka Chan 1, Wen-Chung Kuo 2 and Jn-Chou Cheng 3 1 Department of Computer and Informaton Scence, R.O.C. Mltary Academy, Kaohsung 83059, Tawan,

More information

Gaining Insights to the Tea Industry of Sri Lanka using Data Mining

Gaining Insights to the Tea Industry of Sri Lanka using Data Mining Proceedngs of the Internatonal MultConference of Engneers and Computer Scentsts 2008 Vol I Ganng Insghts to the Tea Industry of Sr Lanka usng Data Mnng H.C. Fernando, W. M. R Tssera, and R. I. Athauda

More information

A Fast Incremental Spectral Clustering for Large Data Sets

A Fast Incremental Spectral Clustering for Large Data Sets 2011 12th Internatonal Conference on Parallel and Dstrbuted Computng, Applcatons and Technologes A Fast Incremental Spectral Clusterng for Large Data Sets Tengteng Kong 1,YeTan 1, Hong Shen 1,2 1 School

More information

Cluster Analysis of Data Points using Partitioning and Probabilistic Model-based Algorithms

Cluster Analysis of Data Points using Partitioning and Probabilistic Model-based Algorithms Internatonal Journal of Appled Informaton Systems (IJAIS) ISSN : 2249-0868 Foundaton of Computer Scence FCS, New York, USA Volume 7 No.7, August 2014 www.jas.org Cluster Analyss of Data Ponts usng Parttonng

More information

"Research Note" APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES *

Research Note APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES * Iranan Journal of Scence & Technology, Transacton B, Engneerng, ol. 30, No. B6, 789-794 rnted n The Islamc Republc of Iran, 006 Shraz Unversty "Research Note" ALICATION OF CHARGE SIMULATION METHOD TO ELECTRIC

More information

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,

More information

INVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS

INVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS 21 22 September 2007, BULGARIA 119 Proceedngs of the Internatonal Conference on Informaton Technologes (InfoTech-2007) 21 st 22 nd September 2007, Bulgara vol. 2 INVESTIGATION OF VEHICULAR USERS FAIRNESS

More information

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

Scale Dependence of Overconfidence in Stock Market Volatility Forecasts

Scale Dependence of Overconfidence in Stock Market Volatility Forecasts Scale Dependence of Overconfdence n Stoc Maret Volatlty Forecasts Marus Glaser, Thomas Langer, Jens Reynders, Martn Weber* June 7, 007 Abstract In ths study, we analyze whether volatlty forecasts (judgmental

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

CHAPTER 14 MORE ABOUT REGRESSION

CHAPTER 14 MORE ABOUT REGRESSION CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

Document Clustering Analysis Based on Hybrid PSO+K-means Algorithm

Document Clustering Analysis Based on Hybrid PSO+K-means Algorithm Document Clusterng Analyss Based on Hybrd PSO+K-means Algorthm Xaohu Cu, Thomas E. Potok Appled Software Engneerng Research Group, Computatonal Scences and Engneerng Dvson, Oak Rdge Natonal Laboratory,

More information

SPECIALIZED DAY TRADING - A NEW VIEW ON AN OLD GAME

SPECIALIZED DAY TRADING - A NEW VIEW ON AN OLD GAME August 7 - August 12, 2006 n Baden-Baden, Germany SPECIALIZED DAY TRADING - A NEW VIEW ON AN OLD GAME Vladmr Šmovć 1, and Vladmr Šmovć 2, PhD 1 Faculty of Electrcal Engneerng and Computng, Unska 3, 10000

More information

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S S C H E D A E I N F O R M A T I C A E VOLUME 0 0 On Mean Squared Error of Herarchcal Estmator Stans law Brodowsk Faculty of Physcs, Astronomy, and Appled Computer Scence, Jagellonan Unversty, Reymonta

More information

Improved SVM in Cloud Computing Information Mining

Improved SVM in Cloud Computing Information Mining Internatonal Journal of Grd Dstrbuton Computng Vol.8, No.1 (015), pp.33-40 http://dx.do.org/10.1457/jgdc.015.8.1.04 Improved n Cloud Computng Informaton Mnng Lvshuhong (ZhengDe polytechnc college JangSu

More information

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network *

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 819-840 (2008) Data Broadcast on a Mult-System Heterogeneous Overlayed Wreless Network * Department of Computer Scence Natonal Chao Tung Unversty Hsnchu,

More information

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching) Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton

More information

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy Fnancal Tme Seres Analyss Patrck McSharry patrck@mcsharry.net www.mcsharry.net Trnty Term 2014 Mathematcal Insttute Unversty of Oxford Course outlne 1. Data analyss, probablty, correlatons, vsualsaton

More information

Chapter 4 ECONOMIC DISPATCH AND UNIT COMMITMENT

Chapter 4 ECONOMIC DISPATCH AND UNIT COMMITMENT Chapter 4 ECOOMIC DISATCH AD UIT COMMITMET ITRODUCTIO A power system has several power plants. Each power plant has several generatng unts. At any pont of tme, the total load n the system s met by the

More information

An Enhanced Super-Resolution System with Improved Image Registration, Automatic Image Selection, and Image Enhancement

An Enhanced Super-Resolution System with Improved Image Registration, Automatic Image Selection, and Image Enhancement An Enhanced Super-Resoluton System wth Improved Image Regstraton, Automatc Image Selecton, and Image Enhancement Yu-Chuan Kuo ( ), Chen-Yu Chen ( ), and Chou-Shann Fuh ( ) Department of Computer Scence

More information

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL

More information

Implementation of Deutsch's Algorithm Using Mathcad

Implementation of Deutsch's Algorithm Using Mathcad Implementaton of Deutsch's Algorthm Usng Mathcad Frank Roux The followng s a Mathcad mplementaton of Davd Deutsch's quantum computer prototype as presented on pages - n "Machnes, Logc and Quantum Physcs"

More information

L10: Linear discriminants analysis

L10: Linear discriminants analysis L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss

More information

Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending

Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending Proceedngs of 2012 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 25 (2012) (2012) IACSIT Press, Sngapore Bayesan Network Based Causal Relatonshp Identfcaton and Fundng Success

More information

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy 4.02 Quz Solutons Fall 2004 Multple-Choce Questons (30/00 ponts) Please, crcle the correct answer for each of the followng 0 multple-choce questons. For each queston, only one of the answers s correct.

More information

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model

More information

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP) 6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes

More information

Statistical Methods to Develop Rating Models

Statistical Methods to Develop Rating Models Statstcal Methods to Develop Ratng Models [Evelyn Hayden and Danel Porath, Österrechsche Natonalbank and Unversty of Appled Scences at Manz] Source: The Basel II Rsk Parameters Estmaton, Valdaton, and

More information

Estimating the Number of Clusters in Genetics of Acute Lymphoblastic Leukemia Data

Estimating the Number of Clusters in Genetics of Acute Lymphoblastic Leukemia Data Journal of Al Azhar Unversty-Gaza (Natural Scences), 2011, 13 : 109-118 Estmatng the Number of Clusters n Genetcs of Acute Lymphoblastc Leukema Data Mahmoud K. Okasha, Khaled I.A. Almghar Department of

More information

PRACTICE 1: MUTUAL FUNDS EVALUATION USING MATLAB.

PRACTICE 1: MUTUAL FUNDS EVALUATION USING MATLAB. PRACTICE 1: MUTUAL FUNDS EVALUATION USING MATLAB. INDEX 1. Load data usng the Edtor wndow and m-fle 2. Learnng to save results from the Edtor wndow. 3. Computng the Sharpe Rato 4. Obtanng the Treynor Rato

More information

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation Exhaustve Regresson An Exploraton of Regresson-Based Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The

More information

Financial Mathemetics

Financial Mathemetics Fnancal Mathemetcs 15 Mathematcs Grade 12 Teacher Gude Fnancal Maths Seres Overvew In ths seres we am to show how Mathematcs can be used to support personal fnancal decsons. In ths seres we jon Tebogo,

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

Methodology to Determine Relationships between Performance Factors in Hadoop Cloud Computing Applications

Methodology to Determine Relationships between Performance Factors in Hadoop Cloud Computing Applications Methodology to Determne Relatonshps between Performance Factors n Hadoop Cloud Computng Applcatons Lus Eduardo Bautsta Vllalpando 1,2, Alan Aprl 1 and Alan Abran 1 1 Department of Software Engneerng and

More information

Fault tolerance in cloud technologies presented as a service

Fault tolerance in cloud technologies presented as a service Internatonal Scentfc Conference Computer Scence 2015 Pavel Dzhunev, PhD student Fault tolerance n cloud technologes presented as a servce INTRODUCTION Improvements n technques for vrtualzaton and performance

More information

Fixed income risk attribution

Fixed income risk attribution 5 Fxed ncome rsk attrbuton Chthra Krshnamurth RskMetrcs Group chthra.krshnamurth@rskmetrcs.com We compare the rsk of the actve portfolo wth that of the benchmark and segment the dfference between the two

More information

ECE544NA Final Project: Robust Machine Learning Hardware via Classifier Ensemble

ECE544NA Final Project: Robust Machine Learning Hardware via Classifier Ensemble 1 ECE544NA Fnal Project: Robust Machne Learnng Hardware va Classfer Ensemble Sa Zhang, szhang12@llnos.edu Dept. of Electr. & Comput. Eng., Unv. of Illnos at Urbana-Champagn, Urbana, IL, USA Abstract In

More information

Data Visualization by Pairwise Distortion Minimization

Data Visualization by Pairwise Distortion Minimization Communcatons n Statstcs, Theory and Methods 34 (6), 005 Data Vsualzaton by Parwse Dstorton Mnmzaton By Marc Sobel, and Longn Jan Lateck* Department of Statstcs and Department of Computer and Informaton

More information

Marginal Benefit Incidence Analysis Using a Single Cross-section of Data. Mohamed Ihsan Ajwad and Quentin Wodon 1. World Bank.

Marginal Benefit Incidence Analysis Using a Single Cross-section of Data. Mohamed Ihsan Ajwad and Quentin Wodon 1. World Bank. Margnal Beneft Incdence Analyss Usng a Sngle Cross-secton of Data Mohamed Ihsan Ajwad and uentn Wodon World Bank August 200 Abstract In a recent paper, Lanjouw and Ravallon proposed an attractve and smple

More information

Multi-sensor Data Fusion for Cyber Security Situation Awareness

Multi-sensor Data Fusion for Cyber Security Situation Awareness Avalable onlne at www.scencedrect.com Proceda Envronmental Scences 0 (20 ) 029 034 20 3rd Internatonal Conference on Envronmental 3rd Internatonal Conference on Envronmental Scence and Informaton Applcaton

More information

Calculating the high frequency transmission line parameters of power cables

Calculating the high frequency transmission line parameters of power cables < ' Calculatng the hgh frequency transmsson lne parameters of power cables Authors: Dr. John Dcknson, Laboratory Servces Manager, N 0 RW E B Communcatons Mr. Peter J. Ncholson, Project Assgnment Manager,

More information

The Greedy Method. Introduction. 0/1 Knapsack Problem

The Greedy Method. Introduction. 0/1 Knapsack Problem The Greedy Method Introducton We have completed data structures. We now are gong to look at algorthm desgn methods. Often we are lookng at optmzaton problems whose performance s exponental. For an optmzaton

More information

LIFETIME INCOME OPTIONS

LIFETIME INCOME OPTIONS LIFETIME INCOME OPTIONS May 2011 by: Marca S. Wagner, Esq. The Wagner Law Group A Professonal Corporaton 99 Summer Street, 13 th Floor Boston, MA 02110 Tel: (617) 357-5200 Fax: (617) 357-5250 www.ersa-lawyers.com

More information

Single and multiple stage classifiers implementing logistic discrimination

Single and multiple stage classifiers implementing logistic discrimination Sngle and multple stage classfers mplementng logstc dscrmnaton Hélo Radke Bttencourt 1 Dens Alter de Olvera Moraes 2 Vctor Haertel 2 1 Pontfíca Unversdade Católca do Ro Grande do Sul - PUCRS Av. Ipranga,

More information

VRT012 User s guide V0.1. Address: Žirmūnų g. 27, Vilnius LT-09105, Phone: (370-5) 2127472, Fax: (370-5) 276 1380, Email: info@teltonika.

VRT012 User s guide V0.1. Address: Žirmūnų g. 27, Vilnius LT-09105, Phone: (370-5) 2127472, Fax: (370-5) 276 1380, Email: info@teltonika. VRT012 User s gude V0.1 Thank you for purchasng our product. We hope ths user-frendly devce wll be helpful n realsng your deas and brngng comfort to your lfe. Please take few mnutes to read ths manual

More information

1.1 The University may award Higher Doctorate degrees as specified from time-to-time in UPR AS11 1.

1.1 The University may award Higher Doctorate degrees as specified from time-to-time in UPR AS11 1. HIGHER DOCTORATE DEGREES SUMMARY OF PRINCIPAL CHANGES General changes None Secton 3.2 Refer to text (Amendments to verson 03.0, UPR AS02 are shown n talcs.) 1 INTRODUCTION 1.1 The Unversty may award Hgher

More information

Conversion between the vector and raster data structures using Fuzzy Geographical Entities

Conversion between the vector and raster data structures using Fuzzy Geographical Entities Converson between the vector and raster data structures usng Fuzzy Geographcal Enttes Cdála Fonte Department of Mathematcs Faculty of Scences and Technology Unversty of Combra, Apartado 38, 3 454 Combra,

More information

Using Series to Analyze Financial Situations: Present Value

Using Series to Analyze Financial Situations: Present Value 2.8 Usng Seres to Analyze Fnancal Stuatons: Present Value In the prevous secton, you learned how to calculate the amount, or future value, of an ordnary smple annuty. The amount s the sum of the accumulated

More information

Brigid Mullany, Ph.D University of North Carolina, Charlotte

Brigid Mullany, Ph.D University of North Carolina, Charlotte Evaluaton And Comparson Of The Dfferent Standards Used To Defne The Postonal Accuracy And Repeatablty Of Numercally Controlled Machnng Center Axes Brgd Mullany, Ph.D Unversty of North Carolna, Charlotte

More information

METHODOLOGY TO DETERMINE RELATIONSHIPS BETWEEN PERFORMANCE FACTORS IN HADOOP CLOUD COMPUTING APPLICATIONS

METHODOLOGY TO DETERMINE RELATIONSHIPS BETWEEN PERFORMANCE FACTORS IN HADOOP CLOUD COMPUTING APPLICATIONS METHODOLOGY TO DETERMINE RELATIONSHIPS BETWEEN PERFORMANCE FACTORS IN HADOOP CLOUD COMPUTING APPLICATIONS Lus Eduardo Bautsta Vllalpando 1,2, Alan Aprl 1 and Alan Abran 1 1 Department of Software Engneerng

More information

Gender Classification for Real-Time Audience Analysis System

Gender Classification for Real-Time Audience Analysis System Gender Classfcaton for Real-Tme Audence Analyss System Vladmr Khryashchev, Lev Shmaglt, Andrey Shemyakov, Anton Lebedev Yaroslavl State Unversty Yaroslavl, Russa vhr@yandex.ru, shmaglt_lev@yahoo.com, andrey.shemakov@gmal.com,

More information

Efficient Project Portfolio as a tool for Enterprise Risk Management

Efficient Project Portfolio as a tool for Enterprise Risk Management Effcent Proect Portfolo as a tool for Enterprse Rsk Management Valentn O. Nkonov Ural State Techncal Unversty Growth Traectory Consultng Company January 5, 27 Effcent Proect Portfolo as a tool for Enterprse

More information

Performance Management and Evaluation Research to University Students

Performance Management and Evaluation Research to University Students 631 A publcaton of CHEMICAL ENGINEERING TRANSACTIONS VOL. 46, 2015 Guest Edtors: Peyu Ren, Yancang L, Hupng Song Copyrght 2015, AIDIC Servz S.r.l., ISBN 978-88-95608-37-2; ISSN 2283-9216 The Italan Assocaton

More information

Sample Design in TIMSS and PIRLS

Sample Design in TIMSS and PIRLS Sample Desgn n TIMSS and PIRLS Introducton Marc Joncas Perre Foy TIMSS and PIRLS are desgned to provde vald and relable measurement of trends n student achevement n countres around the world, whle keepng

More information

Lecture 2: Single Layer Perceptrons Kevin Swingler

Lecture 2: Single Layer Perceptrons Kevin Swingler Lecture 2: Sngle Layer Perceptrons Kevn Sngler kms@cs.str.ac.uk Recap: McCulloch-Ptts Neuron Ths vastly smplfed model of real neurons s also knon as a Threshold Logc Unt: W 2 A Y 3 n W n. A set of synapses

More information

An Analysis of Central Processor Scheduling in Multiprogrammed Computer Systems

An Analysis of Central Processor Scheduling in Multiprogrammed Computer Systems STAN-CS-73-355 I SU-SE-73-013 An Analyss of Central Processor Schedulng n Multprogrammed Computer Systems (Dgest Edton) by Thomas G. Prce October 1972 Techncal Report No. 57 Reproducton n whole or n part

More information

Evaluation of Coordination Strategies for Heterogeneous Sensor Networks Aiming at Surveillance Applications

Evaluation of Coordination Strategies for Heterogeneous Sensor Networks Aiming at Surveillance Applications Evaluaton of Coordnaton Strateges for Heterogeneous Sensor Networs Amng at Survellance Applcatons Edson Pgnaton de Fretas, *, Tales Hemfarth*, Carlos Eduardo Perera*, Armando Morado Ferrera, Flávo Rech

More information

Risk Model of Long-Term Production Scheduling in Open Pit Gold Mining

Risk Model of Long-Term Production Scheduling in Open Pit Gold Mining Rsk Model of Long-Term Producton Schedulng n Open Pt Gold Mnng R Halatchev 1 and P Lever 2 ABSTRACT Open pt gold mnng s an mportant sector of the Australan mnng ndustry. It uses large amounts of nvestments,

More information

Design and Development of a Security Evaluation Platform Based on International Standards

Design and Development of a Security Evaluation Platform Based on International Standards Internatonal Journal of Informatcs Socety, VOL.5, NO.2 (203) 7-80 7 Desgn and Development of a Securty Evaluaton Platform Based on Internatonal Standards Yuj Takahash and Yoshm Teshgawara Graduate School

More information

HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA*

HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* Luísa Farnha** 1. INTRODUCTION The rapd growth n Portuguese households ndebtedness n the past few years ncreased the concerns that debt

More information

Section 5.4 Annuities, Present Value, and Amortization

Section 5.4 Annuities, Present Value, and Amortization Secton 5.4 Annutes, Present Value, and Amortzaton Present Value In Secton 5.2, we saw that the present value of A dollars at nterest rate per perod for n perods s the amount that must be deposted today

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

taposh_kuet20@yahoo.comcsedchan@cityu.edu.hk rajib_csedept@yahoo.co.uk, alam_shihabul@yahoo.com

taposh_kuet20@yahoo.comcsedchan@cityu.edu.hk rajib_csedept@yahoo.co.uk, alam_shihabul@yahoo.com G. G. Md. Nawaz Al 1,2, Rajb Chakraborty 2, Md. Shhabul Alam 2 and Edward Chan 1 1 Cty Unversty of Hong Kong, Hong Kong, Chna taposh_kuet20@yahoo.comcsedchan@ctyu.edu.hk 2 Khulna Unversty of Engneerng

More information