Classification Data Mining with Hybrid Fuzzy Logic Aggregation

Size: px
Start display at page:

Download "Classification Data Mining with Hybrid Fuzzy Logic Aggregation"

Transcription

1 Classificatio Data Miig with Hybrid Fuzzy Logic ggregatio JOHN F. SNFORD Iformatio Systems Philadelphia Uiversity Philadelphia, P US [email protected] LES M. SZTNDER Computer Iformatio Systems Philadelphia Uiversity Philadelphia, P US [email protected] bstract - Fuzzy logic is applied to the category discrimiatio problem related to idetificatio of mammary lesios as beig or maligat. Results of other similar studies are reviewed. The curret aalysis expads the fuzzy logic approach by usig the ormal distributio fuctio as set membership fuctios ad usig a geetic algorithm to optimize performace with the traiig partitio. The approach is applicable to problems havig arbitrarily large umber of parameters. Two differet data sets are examied. Data is portioed ito a traiig set ad validatio set ad each set is segregated ito beig ad maligat records. Values of mea ad stadard deviatio are iitially computed from the associated attributes ad are differet for the beig ad maligat records. I oe traiig method the stadard deviatios are adjusted to miimize overall error. I a secod method a bias adjusts the importace of each membership fuctio. Defuzzificatio is accomplished i three ways: modified averagig ad OR process; compariso of multiplied fuzzy set values; ad compariso of the multiplied squared set values. Results are compared with results obtaied through statistical logistic regressio. Key Words: fuzzy data-aalysis discrimiatio statistical aalysis screeig 1. Itroductio 1.1 Geeral Sice its itroductio 1965 by Lotfi Zadeh [1] fuzzy logic has foud extesive applicatio i cotrol systems from small appliaces to cameras ad eve heavy equipmet. lthough cotrol systems are its forte, ot surprisigly it has foud its way ito most if ot all areas of data miig as a seriously competig aalytic tool. To this repertoire we would add umerous iformatio search ad utilizatio techiques that have spawed eve a fuzzy database query laguage developed by Takihashi i 1995 [2]. Fuzzy logic shows itself to great advatage whe it ca simplify a otherwise extremely difficult algebraic formulatio for a cotrol system without apparet loss of effectiveess. I fact the fuzzy approach may be more effective. For example i the automatic cotrol of a camera the iput parameters are light itesity, distace to object, motio of the object relative to the camera, ad capability of the recordig media. Of these the oly costat parameter is the capability of recordig media. Other parameters may exhibit variatios over a wide rage. Formulatio of a crisp (exact) cotrol system is quite difficult. The fuzzy approach classifies the iput parameters ito fuzzy sets. For example, these sets might be dim light, medium light, bright light, earby, some distace, ad remote distace. The task the is to determie the percet of membership that the actual parameter has i each associated set. For example light may have 80% membership i medium, 10 % membership i bright ad zero membership i dim. The settig of camera speed ad aperture the ISSN: Issue 11, Volume 10, November 2011

2 is based o liguistic rules. Oe such rule might be, If bright light ad remote distace ad zero relative motio the aperture settig is 16 ad speed is 1/100. other might be if medium light ad remote distace ad zero relative motio the aperture settig is 11ad speed is 1/100 Liguistic rules are logic statemets. They iclude itersectio ad uio. Typically these are resolved through a set of rules as follows. ND B where ad B are limited to the rage (0, 1), becomes equivalet to mi (, B) ad OR B, where ad B are limited to the rage (0, 1), becomes equivalet to max (, B). The capability of classifyig patters for subsequet decisio makig processes is oe of the most fudametal characteristics of busiess itelligece ad data miig. s a field of study, classificatio has bee evolvig sice the early 1950s, closely followig the emergece ad evolutio of computer techology ad classificatio techiques. s previously metioed fuzzy logic has bee applied to this type of problem. 1.2 Other papers i the area. There is a perfusio of articles o the topic of fuzzy data discrimiatio. We metio a few that deal with medical diagosis which is also the directio of the curret work. Cios J K, et al. [3] reported o Fuzzy logic used for classificatio of coroary steosis from plaar thallium-201 scitigraphs. The object was diagosis of steosis which might be occurrig i ay of the three major arteries, left aterior descedig (LD), right coroary artery (RC), the circumflex artery (CCX). The scitigraphy cosisted of three views, left vetricle-aterior (NT), left lateral (LT), ad aterior oblique (LO). These data are themselves fuzzy with cosiderable overlap i perfusio patters that might be created by steosis. Some other approaches to solvig this problem iclude: fuzzy sets with probability assiged membership fuctios described by Cios al i 1991[4], machie learig algorithms described by Cios al i 1994 [5] ad Cios et al i 1994 [6], ad a expert system approach described by Cios et al i 1990 [7]. Cios et al i 1994 [3] used a data set cotaiig 64 patiets 46 with steosis ad 18 without. They used trapezoidal membership fuctios for each of the three scitigraphy views. They evaluated three kow learig algorithms, LFS, CLILP2, ad EXP. They formulated 13 differet liguistic statemets to provide crisp results. The otatio used i formulatig the if-the statemets relates to the medical parameters described i the paper but oe statemet is preseted here as a demostratio of defuzzificatio approach. IF LT4 has perfusio defect value i the rage of [0,4], [15,34],[45,54],[85,1001] with (0.2) ND LT8 has perfusio defect value i the rage of [5,24],[35,44],[75,100] with (0.4) ND LT5 has perfusio defect value i the rage of [0,4] with (0.4) THEN STENOSIS is i CCX. They reported o the results of the three differet algorithms for defuzzificatio. ccuracy for classifyig ormal arteries raged from 96% to 100%. However accuracy for correctly classifyig steosis raged from 85% to 93% for two algorithms. third had lower accuracy. Malek J, et al [7] discussed a automated breast cacer diagosis system utilizig fuzzy aggregatio. They proposed a fuzzy istrumet for hardware implemetatio of the automated system usig a CMOS itegrated circuit. They reported o active research coducted with 200 patiets to prove the cocept ad fially proposed a cocept desig for the CMOS itegrated circuit. For the active research, cells were extracted from the patiets lesios ad fixed o microscope slides. For each patiet 19 attributes are aalyzed. The GVF- Sake model was used to idetify the cotour of a typical ucleus withi the object. Texture-aalysisbased feature extractio ivolved wavelet trasforms. Statistical attributes icluded mea, variace, etropy, ad ormalized Shao etropy. The process is mathematically itesive. The fuzzy clusterig C- meas algorithm as is available i MTLB was use to assig fuzzy set memberships. Each ew patter was assiged a degree of membership of maligat class or beig class. The sum of all membership fuctios at ay poit equals 100% They reported 97% correct classificatio of beig cases ad 93% correct classificatio of maligat cases. ISSN: Issue 11, Volume 10, November 2011

3 Bagher-Ebadia H, et al [8] i 2004 used a automated eural etwork ad fuzzy clusterig for diagosis of coroary artery disease. They report two studies, oe with 58 subjects ad oe with 115. Data was obtaied from plaar images i three projectios, aterior, left aterior oblique, ad left lateral usig a collimator. Images were take at rest ad agai uder stress. Backgroud oise was suppressed by segmetig myocardium from the backgroud usig the fuzzy clusterig Picard iteratio algorithm. Euclidea distace was used as the distace metric. Each myocardium was partitioed ito four sectios ad the ceter of activity foud for each sectio. Samplig radii were take every 2 degrees to project the sectors o a x-y plae. There were three trasformed projectios divided ito 4 regios for at rest ad the same for stress. Thus there were two vectors of dimesio12. The mea ad variace row vectors were defied as feature vectors. The umber of features was reduced through a selectio process based o maximum separatio i multidimesioal feature space The artificial eural etwork was traied ad the evaluated. The true values, ormal or abormal, were obtaied from coroary agiographies take withi three moths of the imagig. ccuracy i predictio positive results varied from 77% to 86% ad predictio of egative results from 63% to 66%. I 2010 Licata [10] preseted a most iterestig demostratio of fuzzy logic as applied to the more geeral diagostic process. This is differet from the previous citatios i that it does ot deal with data aalysis to idetify oe of two possible outcomes as a test for a specific disease. Rather, he suggested that the applicatio of fuzzy set theory i place of the physicia s usual reliace solely o probabilistic logic ca improve the geeral diagosis process which attempts to idetify the cause of observed symptoms. He developed the fuzzy set theory process through a example applicatio to oe case study. Specific symptoms i this case are: dyspea at rest (a), oedema (b), tachyarrhythmias (c), epatomegally (d), ascites (e), pleural effusio (f). Possible diagoses would be hepatic cirrhosis (t), ephrosic sydrome (u), peumoia (w), myocardial ischemia (x) cogestive heart failure (y), worseig of the supravetricular arrhythmias (z). typical If-the rule would have the form, If a ad b ad c ad d ad e ad f the t or w or x or y or Severity of these symptoms a, b, c, etc. are typically liguistic ad fuzzy such as tachyarrhythmias is slow, moderate, or fast. Numerous laboratory tests provide more data for assigmet of fuzzy membership fuctios. Liguistic observatios ad laboratory results are assiged set memberships betwee 0 ad 1 through the use of charts that have bee prepared. Defuzzificatio ivolves a cosiderable umber of predicate logic statemets. These are evaluated usig geeralized modus poes ad sometimes more directly. This likely ivolves a good deal more skill i logic tha most physicias would poses. Licata s work poits out that the fuzzy logic approach is i lieu of the probabilistic approach geerally employed. Licata suggests that the fuzzy set theory solutio is more exactig ad probably superior to the usual process where idividual decisios i the chai leadig to a coclusio are made o the basis of probabilistic estimates. This paper is of course ot able to preset accuracy figures sice there is oly oe case cosidered. It is recouted here as a idicatio of the pervasive ifluece of fuzzy logic o the medical disciplie. 1.3 Differet approach The work we preset i the curret paper offers aother variatio. It employs the ormal distributio as set membership fuctios. These fuctios are idividually adjusted by usig weight ad varyig each stadard deviatio. Optimizatio is obtaied through applicatio of a. Liguistic rules are ot employed ad hece the fudametal process is applicable to data sets cotaiig may parameters ad to classificatio ito more tha two categories. The process is applied to several differet data sets ad results are compared with the more commo statistical regressio aalysis. 2. alytic approach Step 1 of the process ivolves assigig data to memberships i the fuzzy sets. ISSN: Issue 11, Volume 10, November 2011

4 2.1 Triagular membership fuctio Numerous membership fuctios are recogized for creatig the fuzzy sets [11] These iclude triagle, rectagle, parabola, trapezoid, ad eve projectios of oe operatig characteristic oto aother plae. If, for example, a isosceles triagle is used the the height would geerally be cosidered equal to 1 ad the apex is located at the parameter s average value over the totality of past data. particular data record will geerally have a value for this parameter that is ot at the average. Membership will be determied by the height of the lie C i Figure 1. This height has a simple relatioship show i equatios 1 ad 2 B B OB O O (1) C = h = 1 = = 1 OB OB OB OB Membership = 1 = 0 O OB For O OB Otherwise It should be oted that the value of O eed ot be a exact match to the average value of the parameter for all existig data records. I some cases, particularly i cases ivolvig dyamic cotrol systems, the value of O may be desigated i accordace with egieerig aalysis or other cosideratios. compariso of performace with triagular membership fuctio ad the expoetial fuctio was reported i a earlier paper [14]. I this case some adjustmet was made to triagular base ad expoetial width i a effort to fid the better fuctio. The expoetial fuctio performed better i that istace but triagular, expoetial (or sie wave), ad trapezoidal are all commo. 2.2 Trapezoidal membership fuctios Suppose that there are geeralized fuzzy umbers 1, 2,...,, with trapezoidal membership fuctios i = (c i, a i, b i, d i, w i ). The trapezoidal membership fuctio of a geeralized fuzzy umber, i, is the give by: (3) i (x)= a i x b i (x-ai)/(bi-ai) b i x c i 1 c i x d i (d i -x)/(d i -c i ) 2.3 Expoetial membership fuctio Expoetial membership fuctios ad sig-wave fuctios have also bee employed. I view of the fact that all parameters i the data sets uder study here deal with physical attributes that are most likely ormally distributed, the ormal distributio suggested itself as a membership fuctio for the curret aalysis. This distributio is expoetial ad probability is measured as betwee zero ad oe. The membership fuctio has the happy property that o parameter will actually have zero membership but will have a dimiishigly small membership as its locatio diverges far from the mea. The probability of beig at the mea is ½ ad ot 1 so for o other reaso tha esthetics we divided our membership percet by ½ i order to produce membership of 1 for parameters actually at the mea. The mea for each membership fuctio is the mea of that parameter over existig data. Spread of the membership fuctio is achieved by adjustig the stadard deviatio. Before traiig this value is set at the stadard deviatio of that parameter over existig data. d membership percetage is determied by the probability of the tail. Equatio 3 describes this. for X (4) Membership = P( X ) /.5 = (1 P( X )) /.5for X > Thus for a give parameter such as thickess Stadard deviatio = σ Mea = Measured value = x If x > the membership = (1- P(x))/0.5 If x < the membership = P(x)/ Geeral ggregatio Operatios ggregatio operatios are operatios by which several fuzzy sets are combied ito a sigle set. I geeral, ay aggregatio operatio is defied by a mappig [14] ISSN: Issue 11, Volume 10, November 2011

5 (5) h:[0,1] [0,1] for >=2. Whe applied to fuzzy sets 1, 2,..., defied o X, h produces a aggregate fuzzy set by operatig o the membership grades, (x) of each x ε X i the aggregated sets. Thus, we have: (6) (x)=h ( 1 (x), 2 (x),. (x)) for each x εx: The followig requiremets express the essece of the otio of aggregatio, after Klir ad Yua (Klir ad Yua, 1995): xiom 1. h (0,0,...,0) = 0 ad h (1,... 1) = 1. xiom 2. For ay pair of umbers a i, b i, where a i ε [0,1], b i ε [0, 1], if a i b i for all i the h(a i ) h(b i ), that is, h is mootoic odecreasig i all its argumets. itersectio, respectively. Geeralized Dombi's operatios [13] possess properties of fuzzy uios ad itersectios, which will be utilized i this work. They are defied as follows: Dombi's Fuzzy Uio: where λ is a parameter by which differet uios are distiguished, ad λ ε (0, ) Dombi's Fuzzy Itersectio Two additioal axioms are also employed to characterize aggregatio operatios. xiom 3. h is a cotiuous fuctio, that is it guaratees that a ifiitesimal variatio i ay argumet of h does ot produce a oticeable chage i the aggregate. xiom 4. h is symmetric fuctio i all its argumets, that is h(a i ) = h(a p ) for ay permutatio, p, of the argumets, that is the aggregated sets are equally importat. Fuzzy uios ad itersectios ca be qualified as aggregatio operatios o oly two argumets, their property of associativity, provides a mechaism for extedig their defiitio to ay umber of argumets. lso, i the case of bi-symmetry ad strict mootoicity, the extesios are straightforward. Thus, geeralized fuzzy uios ad itersectios ca be viewed as special aggregatio operatios that are symmetric, usually cotiuous, ad are required to satisfy cojuctive ad disjuctive attitudes. s a result, geeralized fuzzy uios ad itersectios ca produce oly aggregates that are subject to the followig restrictios [12] (7) max (a, b) (a, b) max (a, b) where λ is a parameter by which differet itersectios are distiguished, λ ε (0, ). From the above iequalities, oe ca see that geeralized fuzzy uios ad itersectios do ot produce ay aggregates that geerate values betwee mi ad max. Hece the geeral rules previously stated. ND B where ad B are limited to the rage (0, 1), becomes equivalet to mi (, B) ad OR B, where ad B are limited to the rage (0, 1), becomes equivalet to max (, B). ggregates that are ot restricted i this way, however, are allowed by xioms 1 through 4. Operatios that produce them are called averagig operatios. There are several classes of averagig operatios are a compromise betwee mm ad mm Oe such class, which covers the etire iterval betwee the mi ad max operatios cosists of geeralized meas. This class of operatios was used i fuzzy predictio, defied after (Klir ad Yua, 1995) as: (8) ad i mi (a, b) i(a, b) mi (a, b) where u(a,b) ad i(a, b) is the fuzzy uio, ad fuzzy ISSN: Issue 11, Volume 10, November 2011

6 α α a1 + a a (11) ha ( a1, a2,... a) = where α is a parameter by which differet meas are distiguished, ad α ε R (α is ot equal to 0). I our applicatio, we opted for α = 4, based o our experiece. Fuctio ha clearly satisfies all axioms of aggregatio operatios ad, cosequetly, it represets a parameterized class of cotiuous ad symmetric aggregatio operatios. variat of this approach is used i the followig aalysis. The approach uses weightig values for each alpha ad these weightig value may be adjusted for optimum performace. further variat that is ivestigated ivolves multiplicatio rather tha averagig. I that case the idividual weightig factors are of o cosequece. Liguistic statemets are usually associated with defuzzificatio. However i a classificatio problem, whe there are may parameters of equal importace, it is difficult to formulate such liguistic statemets. 2.5 Fuzzy Sets ad Data Miig The typical aalytic approach to data miig employs radom selectio to partitio the data ito two groups (sometimes 3). The first group is used to trai the data miig algorithm. For example if regressio is employed, the traiig partitio is used to set the parameters used i the regressio formula. The usual measures for performace are used for this purpose, residual error, p-values of coefficiets, ad so o. Whe two partitios are use they are usually based o a 60/40 or 70/30 ratio. These are geerally accepted ratios but it ot really possible to demostrate a theoretical optimum ad if the data set is small more may be required for traiig. Oce the data miig model has bee selected it is evaluated agaist the secod partitio, usually called the validatio partitio, to see how it performs. If the data has bee divided ito three partitios the differet models may be evaluated agaist the secod data set ad small further adjustmets may be made to the most promisig model before it is evaluated agaist the third partitio of data. Oly two partitios were employed for the aalysis reported here. Data i the first, traiig partitio are separated ito α 1 α two groups. Oe cotaiig date of kow beig istaces ad the other cotaiig data of the kow maligat istaces. For each set the mea ad stadard deviatio is computed for each of the attributes. These values are the use to set the membership fuctio for each attribute as described i equatio (4) above. Whe the model is ru the data from each record is evaluated for membership i each of the two groups of fuzzy sets. The results are aggregated ad compared with the decisio for beig or maligat beig made o the basis of which aggregatio has the larger value. Data from both the first, traiig, partitio ad the secod, validatio, partitio are evaluated by the model. Usually the traiig data performs better as might be expected. 2.6 Methods employed here Several aggregatio methods have bee compared. There is ot clear aalytical presidet for some of the selectios. Establishig a data miig model is i some ways a empirical process. That is why the traiig partitio ad the validatio partitio methodology is employed. The averagig method would be defied as follows. I this case the idividual membership fuctios are ot weighted but the aggregate is. (12) x ) = ( x ) additive ( k= 1 k k ( k= 1 k k k (13) x ) = b ( x ) additive with idividual b k weight adjustmet. is the membership fuctio for class. 1 is the membership fuctio for class, parameter umber 1 2 is the membership fuctio for class, parameter umber 2. x1is the value of parameter # 1 associated with object x. x2 is the value of parameter # 2 associated with object x. Equatios (12) ad (13) would be repeated for B where is associated maligacy ad B with beig. The multiplicative aggregates are defied as: ISSN: Issue 11, Volume 10, November 2011

7 (14) ( x) = k = k ( x 1 k ) multiplicative 2 (15) ( x) = k = ( k ( x 1 k )) multiply squares I each istace the idividual k ( x k ) membership fuctios may be adjusted durig the traiig phase to achieve miimum classificatio error. 2.7 Traiig Traiig is the distictive feature of the somewhat heuristic model preseted here. Durig the traiig stage the stadard deviatio used i each membership fuctio ca be adjusted to improve overall performace of the traiig partitio data. Icreasig idividual stadard deviatio has the effect of icreasig the width of that membership expoetial fuctio. The fuctio ow departs from the true statistical probability fuctio, but the choice of the ormal probability fuctio for the expoetial was arbitrary ad ot usually used i fuzzy logic ayway. The data represets physical attributes ad such attributes might be assumed to have ormal distributios. However the data was obtaied from huma observatios that are quatized ad hece ot ormal. The uaided model performed quite well. However performace was improved by adjustig the value used for stadard deviatio of each idividual parameter associated with each class. The additive method of combiig idividual parameter memberships ad the product method performed comparably i practice. It would seem that usig the squares of membership values would emphasize parameters where there is good membership. However empirical work suggests that there was o sigificat differece. adjustig coefficiet may be used whe addig membership values. This allows a small amout of improvemet i some istaces. With some data miig models over traiig is possible. I that case the error percetage is very good for the traiig partitio but markedly degraded for the validatio set. This situatio did ot preset itself. 2.8 Traiig Optimizatio There are may parameters ad each oe has a adjustable stadard deviatio. djustig by eye is time cosumig. was used i a attempt to optimize this process. The geetic algorithm used was a Microsoft Excel add-i called xl bit ( [15] 3. Results with Data Set #1 2.1 Statistical Method Classificatio data miig is ofte accomplished o a statistical basis with a logistic regressio model. I order to compare performace of the fuzzy model logistic regressio was applied to the same data. The vehicle used was a Microsoft Excel add-i called XLMier [16]. We report results obtaied with several data sets here. Data set umber 1 is Mammographic Mass Data made public by Schulz-Wedtlad [17]. The mammographic Mass database cosisted of the followig parameters. 1. BI-RDS assessmet: 1 to 5 (ordial) 2. ge: patiet's age i years (iteger) 3. Shape: mass shape: roud=1 oval=2 lobular=3 irregular=4 (omial) 4. Margi: mass margi: circumscribed=1 microlobulated=2 obscured=3 ill-defied=4 spiculated=5 (omial) 5. Desity: mass desity high=1 iso=2 low=3 fatcotaiig=4 (ordial) 6. Severity: beig=0 or maligat=1 (biomial) Missig ttribute Values: - BI-RDS assessmet: 2 - ge: 5 - Shape: 31 - Margi: 48 - Desity: 76 - Severity: 0 The predicted classificatio is either beig or maligat. Cases with missig variables were removed. Summary of results are i the table. For these calculatios the gai of each idividual fuzzy set was left costat durig the aggregatio (averagig) process but idividual stadard deviatios were adjusted so as to arrow or icrease the width of the membership fuctio. Percetages show are error ISSN: Issue 11, Volume 10, November 2011

8 TBLE logistic regressio logistic regressio Fuzzy Logic Model sum of membership values Fuzzy Logic sum of membership values Fuzzy Logic Model usig membership product Fuzzy Logic Model usig membership product Membership product & geetic algorithm to adjust Sigma Membership product & geetic algorithm to adjust sigma usig sum of to adjust Sigma usig sum of to adjust sigma Beig 15.95% 41/ % 30/ % 51/ % 27/ / / % 46/ % 35/ % 39/ % 26/167 Cacer 15.61% 37 / % 26/ % 40/ % 29/ / / % 27/ % 20/ % 37/ % 27/163 Overall Error 15.79% 78/ % 56/ % 91/ % 56/ / / % 73/ % 55/ % 76/ /330 percetages. The ratios, such as 7/270, show error cout ad total records i that classificatio. Validatio scores are the importat oes from the poit of view of data aalysis. lso there is ot a great deal of variatio amog results. This seems to match results from some other researchers i this area. Traiig ad validatio scores are show. s aticipated, traiig scores are slightly better. Usig the to optimize the membership fuctios cotributed a slight improvemet whe the decisio rule was based o the sum of membership values ad made o improvemet i overall accuracy whe the product of membership values was used. I this latter istace it altered the probabilities for failures i detectig maligacy by icreasig them with a cosequet decrease i failures to classify as beig. The, like other traiig adjustmets, worked to miimize the overall error i the traiig process. The hope is that this will project ito improved performace with the validatio partitio data. It is true that some adjustmet of the sigma values i the idividual membership fuctios ca be used to shift errors from oe type to aother while still maitaiig overall error rate ear costat. I lieu of adjustig the stadard deviatios a weightig factor for each attribute ca be adjusted. Of course, this is oly possible whe the aggregatio fuctio is a summatio. Thus: (16) x ) = b ( x ) ( k= 1 k k k The coefficiet b k applies to the attribute k. Results i this simple case are show i Table B. d they are ot markedly differet from results previously obtaied ISSN: Issue 11, Volume 10, November 2011

9 TBLE B sum of to adjust weight sum of to adjust weights Beig 13.23% 34/ % 20/167 Cacer 20.25% 48/ % 33/163 Overall Error 16.6% 82/ /330 t least i some istaces weightig the membership fuctios is as useful as adjustig the idividual shapes of the membership fuctios. It is ot obvious that this would be the case. It does say that simple dyamic weightig adjustmet durig the traiig phase may be as useful as aythig else. Results also suggest that the statistical logistic regressio approach is about as valuable as the fuzzy logic approach. This is ot the case i cotrol systems where fuzzy logic performs exceedigly well. This paper reports o a empirical ivestigatio with a limited amout of data. It offers o proof that statistical methods will always perform as well or less well as fuzzy methods. Not all researchers have reported a compariso so oe hesitates to make geeralized observatios. 4. Results Data Set #2 Data set #2 was Breast Cacer data made public by Magasaria & Wolberg [18] [19]. This data set was also briefly reported i our previous paper. The methodology has ow bee greatly expaded with adjustmet of idividual membership fuctios ad use of the. Data set parameters are as follows: 1. Sample code umber id umber 2. Clump Thickess Uiformity of Cell Size Uiformity of Cell Shape Margial dhesio Sigle Epithelial Cell Size Bare Nuclei Blad Chromati Normal Nucleoli Mitoses Class 2= beig; 4 = maligat TBLE C logistic regressio logistic regressio usig sum of to adjust Sigma usig sum of to adjust sigma Beig 1.11% 3/ % 7/ % 11/ % 2/174 Cacer / % 3/ % 19/ % 10/98 Overall Error 1.71% 7/ % 10/ % 30/ % 12/272 The predicted classificatio is either beig or maligat. Stadard deviatio of membership fuctios were adjusted usig the. Sets were aggregated as a averagig process similar to equatio (11) ad selectio made o the basis of which was the larger. Summary of results are i the table C. gai results from the fuzzy approach are very good but ot better that statistical logistical regressio. They are i some respect comparable. Results for data set #2 were cosiderably better tha those for data set #1 possibly because data set #2 had more attributes from which to establish set memberships. ll results were really comparable with results of other researchers as reported earlier i this paper. The Fitess chart developed by the software for data set #2 is preseted as Figure 2. It shows that the has approached asymptotically a horizotal lie ad further iteratios may ot produce improvemet uless there are local miimums. Cosiderig the ature of the problem, local miimums are ulikely. ISSN: Issue 11, Volume 10, November 2011

10 the result. It ca lead to icorrect aswers whe usig regressio to fit higher order polyomials. But this is ot a problem here. I the heuristic approach the model is based o actual performace. No oe attempts so say it would be valid for classes differet from the oes with which the model is validated. 5. Solutio Efficacy There is aturally a questio as to why these aalytic attempts are ot more successful i discrimiatig amog the two classes. I most medical ad probably most biological etities various attributes will be highly correlated. Figure 3 preset the correlatio matrix for data set #2. Here it ca be see that a high degree of correlatio exists. I geeral this makes statistical regressio less accurate or questioable at least. This limitatio will certaily be preset i all attempts at discrimiatio of biological etities util or uless orthogoal characteristics ca be foud. 6. Observatios alysis shows that the parameters of data sets are strogly correlated. This is ot surprisig sice they usually represet physical characteristics of some class of objects. The multiple liear regressio model assumes idepedet variables. Colliearity may make it impossible to see the cotributios of each variable to The fuzzy logic approach preseted here is ot limited to selectio amog two categories. Clearly the resolutio through a logical OR operatio ca be applied to three or eve more categories ad presumably eve more. [14] There is o practical limit provided there is sufficiet data. Three classes requires that traiig data be divided ito three sets; for classes would require 4 sets, etc. Statistical evaluatio of results would require some reasoable umber of records for each class. 7. Coclusios It is clear that fuzzy logic employig a expoetial membership fuctio, weighted averagig, ad logical OR operatio is a effective tool i data aalysis ivolvig discrimiatio where may parameters are ivolved. It performs as well as results reported by other researchers lookig at similar problems with differet fuzzy methods ad hybrid fuzzy methods. Because membership fuctios may be adjusted idividually the usual method of dividig data ito a traiig partitio ad a validatio partitio is appropriate. Rather extesive adjustmet ( learig ) durig the traiig stage improves performace. This may iclude use of a. Results observed here must be classified as aecdotal because oly two data sets were used. However, from these observatios it appears that fuzzy logic is comparable but does ot outperform usual data aalysis methods such as logistic regressio. I istaces where image aalysis is a pricipal feature (as reported i other papers oted earlier) the situatio does ot led itself to stadard statistical regressio methods. Refereces [1] Zadeh, L.. "Fuzzy sets", Iformatio ad Cotrol 8 (3): [2] Takahashi Y. (1995) fuzzy Query Laguage for Relatioal Databases, i Fuzziess i Database Maagemet Systems, Physica Publisher, Heidelberg, ISSN: Issue 11, Volume 10, November 2011

11 [3] Cios K J, Goodeday L S, Sztadera L M Hybrid itelligece system for diagosig coroary steosis. Combiig fuzzy geeralized operators with decisio rules geerated by machie learig algorithms, Egieerig i Medicie ad Biology Magazie, IEEE, Vol. 13 Issue 5, 1994 pages [4] Cios K J, Shi I, ad Goodeday LS Usig Fuzzy Sets to Diagose Coroary rtery Steosis, IEEE Computer Magazie Pages 57-63, 1991 [5] Cios K J, Liu N (1994) machie Learig algoriothm Usig Iterger Liear Programmig KYBERNETES, MCB Uiversity Press,U.K [6] Cios K J, Noraes I 91991) iductive Learig lgorithm KYBERNETES, MCB Uiversity Press, U.K. 920:3), pages [7] Cios K J, Freasier R E, Goodeday L S, drews L T expert system for diagosis of coroary artery steosis based o 201TI scitigrams usig the Dempster-Shafer theory of evidece, Oxford Uiversity Press, U.K. (6) No. 4 pages [8] Malek J, Sebri, Mabrouk S, Torki K, Tourki R Joural of Sigal Processig Systems Volume 55 Issue 1-3, pages 49-66, pril 2009 Kluwer cademic Publishers [9] Bagher-Ebadia H, Soltaia-Zadeh H, Setayeshi S, Smith S T Neural etwork ad fuzzy clusterig approach for automatic diagosis of coroary artery disease i uclear medicie IEEE Trasactios o Nuclear Sciece Vol51 Issue 1 pages (Feb 2004) [10] Licata G "Employig fuzzy logic i the diagosis of a cliical case" Health., Vol.2, No.3, pages March 2010 [14] Saford J, Sztadera L (2010) Classificatio Data Miig Usig Fuzzy Logic, Iteratioal cademy of Busiess ad Public dmiistratio Disciplies New Orleas Coferece Proceedigs October 2010, pages [15] XLBIT Geetic algorithm for Microsoft Excel from based i Kuala Lumpur, Malaysia. [16] XL-Mier is a Excel dd-i developed by Cytel Software Corporatio ad distributed by Resamplig Stats Ic. [17] Schulz-Wedtlad 2007 Mammographic Mass Data. Origial ower: Prof. Dr. Rüdiger Schulz- Wedtlad, Istitute of Radiology, Gyaecological Radiology, Uiversity Erlage-Nuremberg, Uiversitätsstraße 21-23, Erlage, Germay. Relevat Paper: M. Elter, R. Schulz-Wedtlad ad T. Witteberg (2007), The predictio of breast cacer biopsy outcomes usig two CD approaches that both emphasize a itelligible decisio process. Medical Physics 34(11), pp [17] Magasaria O. & Wolberg W. (1990) "Cacer diagosis via liear programmig", SIM News, 23(5), [18] Magasaria O & Wolberg W, 1990, "Cacer diagosis via liear programmig", SIM News, Volume 23, Number 5, September 1990, pp 1 & 18. [19] Wolberg W. & Magasaria O. (1990) "Multisurface method of patter separatio for medical diagosis applied to breast cytology", Proceedigs of the Natioal cademy of Scieces, U.S.., Volume 87, pp [11] Pedrycz W. Fuzzy Cotrol ad Fuzzy Systems Joh Wiley & Sos, ISBN [12] Klir G. J. ad Yua B. (1995), Fuzzy Sets ad Fuzzy Logic Theory ad pplicatios, Pretice Hall, Eglewood Cliffs. [13] Sztadera L. M., Goodeday, L. S., ad Cios K. J. (1996), Neuro-Fuzzy lgorithm for Diagosis of Coroary rtery Steosis, Computers i Biology ad Medicie Joural, 26(2), ISSN: Issue 11, Volume 10, November 2011

Modified Line Search Method for Global Optimization

Modified Line Search Method for Global Optimization Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

Review: Classification Outline

Review: Classification Outline Data Miig CS 341, Sprig 2007 Decisio Trees Neural etworks Review: Lecture 6: Classificatio issues, regressio, bayesia classificatio Pretice Hall 2 Data Miig Core Techiques Classificatio Clusterig Associatio

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

Soving Recurrence Relations

Soving Recurrence Relations Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree

More information

A Combined Continuous/Binary Genetic Algorithm for Microstrip Antenna Design

A Combined Continuous/Binary Genetic Algorithm for Microstrip Antenna Design A Combied Cotiuous/Biary Geetic Algorithm for Microstrip Atea Desig Rady L. Haupt The Pesylvaia State Uiversity Applied Research Laboratory P. O. Box 30 State College, PA 16804-0030 [email protected] Abstract:

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich [email protected] [email protected] Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

Department of Computer Science, University of Otago

Department of Computer Science, University of Otago Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly

More information

CHAPTER 3 THE TIME VALUE OF MONEY

CHAPTER 3 THE TIME VALUE OF MONEY CHAPTER 3 THE TIME VALUE OF MONEY OVERVIEW A dollar i the had today is worth more tha a dollar to be received i the future because, if you had it ow, you could ivest that dollar ad ear iterest. Of all

More information

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical

More information

1 Correlation and Regression Analysis

1 Correlation and Regression Analysis 1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio

More information

Research Article Sign Data Derivative Recovery

Research Article Sign Data Derivative Recovery Iteratioal Scholarly Research Network ISRN Applied Mathematics Volume 0, Article ID 63070, 7 pages doi:0.540/0/63070 Research Article Sig Data Derivative Recovery L. M. Housto, G. A. Glass, ad A. D. Dymikov

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

A probabilistic proof of a binomial identity

A probabilistic proof of a binomial identity A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics

More information

Lesson 15 ANOVA (analysis of variance)

Lesson 15 ANOVA (analysis of variance) Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi

More information

Asymptotic Growth of Functions

Asymptotic Growth of Functions CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll

More information

Determining the sample size

Determining the sample size Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors

More information

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is 0_0605.qxd /5/05 0:45 AM Page 470 470 Chapter 6 Additioal Topics i Trigoometry 6.5 Trigoometric Form of a Complex Number What you should lear Plot complex umbers i the complex plae ad fid absolute values

More information

INVESTMENT PERFORMANCE COUNCIL (IPC)

INVESTMENT PERFORMANCE COUNCIL (IPC) INVESTMENT PEFOMANCE COUNCIL (IPC) INVITATION TO COMMENT: Global Ivestmet Performace Stadards (GIPS ) Guidace Statemet o Calculatio Methodology The Associatio for Ivestmet Maagemet ad esearch (AIM) seeks

More information

Normal Distribution.

Normal Distribution. Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued

More information

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling Taig DCOP to the Real World: Efficiet Complete Solutios for Distributed Multi-Evet Schedulig Rajiv T. Maheswara, Milid Tambe, Emma Bowrig, Joatha P. Pearce, ad Pradeep araatham Uiversity of Souther Califoria

More information

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy

More information

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

NEW HIGH PERFORMANCE COMPUTATIONAL METHODS FOR MORTGAGES AND ANNUITIES. Yuri Shestopaloff,

NEW HIGH PERFORMANCE COMPUTATIONAL METHODS FOR MORTGAGES AND ANNUITIES. Yuri Shestopaloff, NEW HIGH PERFORMNCE COMPUTTIONL METHODS FOR MORTGGES ND NNUITIES Yuri Shestopaloff, Geerally, mortgage ad auity equatios do ot have aalytical solutios for ukow iterest rate, which has to be foud usig umerical

More information

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical ad Mathematical Scieces 2015, 1, p. 15 19 M a t h e m a t i c s AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics

More information

5.4 Amortization. Question 1: How do you find the present value of an annuity? Question 2: How is a loan amortized?

5.4 Amortization. Question 1: How do you find the present value of an annuity? Question 2: How is a loan amortized? 5.4 Amortizatio Questio 1: How do you fid the preset value of a auity? Questio 2: How is a loa amortized? Questio 3: How do you make a amortizatio table? Oe of the most commo fiacial istrumets a perso

More information

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return EVALUATING ALTERNATIVE CAPITAL INVESTMENT PROGRAMS By Ke D. Duft, Extesio Ecoomist I the March 98 issue of this publicatio we reviewed the procedure by which a capital ivestmet project was assessed. The

More information

LECTURE 13: Cross-validation

LECTURE 13: Cross-validation LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M

More information

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. This documet was writte ad copyrighted by Paul Dawkis. Use of this documet ad its olie versio is govered by the Terms ad Coditios of Use located at http://tutorial.math.lamar.edu/terms.asp. The olie versio

More information

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportios HT - Iferece o Proportio Parameter: Populatio Proportio p (or π) (Percetage of people has o health isurace) x Statistic: Sample Proportio

More information

Lesson 17 Pearson s Correlation Coefficient

Lesson 17 Pearson s Correlation Coefficient Outlie Measures of Relatioships Pearso s Correlatio Coefficiet (r) -types of data -scatter plots -measure of directio -measure of stregth Computatio -covariatio of X ad Y -uique variatio i X ad Y -measurig

More information

Your organization has a Class B IP address of 166.144.0.0 Before you implement subnetting, the Network ID and Host ID are divided as follows:

Your organization has a Class B IP address of 166.144.0.0 Before you implement subnetting, the Network ID and Host ID are divided as follows: Subettig Subettig is used to subdivide a sigle class of etwork i to multiple smaller etworks. Example: Your orgaizatio has a Class B IP address of 166.144.0.0 Before you implemet subettig, the Network

More information

Systems Design Project: Indoor Location of Wireless Devices

Systems Design Project: Indoor Location of Wireless Devices Systems Desig Project: Idoor Locatio of Wireless Devices Prepared By: Bria Murphy Seior Systems Sciece ad Egieerig Washigto Uiversity i St. Louis Phoe: (805) 698-5295 Email: [email protected] Supervised

More information

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT Keywords: project maagemet, resource allocatio, etwork plaig Vladimir N Burkov, Dmitri A Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT The paper deals with the problems of resource allocatio betwee

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

Chapter 7: Confidence Interval and Sample Size

Chapter 7: Confidence Interval and Sample Size Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum

More information

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations CS3A Hadout 3 Witer 00 February, 00 Solvig Recurrece Relatios Itroductio A wide variety of recurrece problems occur i models. Some of these recurrece relatios ca be solved usig iteratio or some other ad

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

Chair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics

Chair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics Chair for Network Architectures ad Services Istitute of Iformatics TU Müche Prof. Carle Network Security Chapter 2 Basics 2.4 Radom Number Geeratio for Cryptographic Protocols Motivatio It is crucial to

More information

Data Analysis and Statistical Behaviors of Stock Market Fluctuations

Data Analysis and Statistical Behaviors of Stock Market Fluctuations 44 JOURNAL OF COMPUTERS, VOL. 3, NO. 0, OCTOBER 2008 Data Aalysis ad Statistical Behaviors of Stock Market Fluctuatios Ju Wag Departmet of Mathematics, Beijig Jiaotog Uiversity, Beijig 00044, Chia Email:

More information

ADAPTIVE NETWORKS SAFETY CONTROL ON FUZZY LOGIC

ADAPTIVE NETWORKS SAFETY CONTROL ON FUZZY LOGIC 8 th Iteratioal Coferece o DEVELOPMENT AND APPLICATION SYSTEMS S u c e a v a, R o m a i a, M a y 25 27, 2 6 ADAPTIVE NETWORKS SAFETY CONTROL ON FUZZY LOGIC Vadim MUKHIN 1, Elea PAVLENKO 2 Natioal Techical

More information

Measures of Spread and Boxplots Discrete Math, Section 9.4

Measures of Spread and Boxplots Discrete Math, Section 9.4 Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,

More information

Automatic Tuning for FOREX Trading System Using Fuzzy Time Series

Automatic Tuning for FOREX Trading System Using Fuzzy Time Series utomatic Tuig for FOREX Tradig System Usig Fuzzy Time Series Kraimo Maeesilp ad Pitihate Soorasa bstract Efficiecy of the automatic currecy tradig system is time depedet due to usig fixed parameters which

More information

One-sample test of proportions

One-sample test of proportions Oe-sample test of proportios The Settig: Idividuals i some populatio ca be classified ito oe of two categories. You wat to make iferece about the proportio i each category, so you draw a sample. Examples:

More information

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample

More information

Enhancing Oracle Business Intelligence with cubus EV How users of Oracle BI on Essbase cubes can benefit from cubus outperform EV Analytics (cubus EV)

Enhancing Oracle Business Intelligence with cubus EV How users of Oracle BI on Essbase cubes can benefit from cubus outperform EV Analytics (cubus EV) Ehacig Oracle Busiess Itelligece with cubus EV How users of Oracle BI o Essbase cubes ca beefit from cubus outperform EV Aalytics (cubus EV) CONTENT 01 cubus EV as a ehacemet to Oracle BI o Essbase 02

More information

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable Week 3 Coditioal probabilities, Bayes formula, WEEK 3 page 1 Expected value of a radom variable We recall our discussio of 5 card poker hads. Example 13 : a) What is the probability of evet A that a 5

More information

1. C. The formula for the confidence interval for a population mean is: x t, which was

1. C. The formula for the confidence interval for a population mean is: x t, which was s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value

More information

ODBC. Getting Started With Sage Timberline Office ODBC

ODBC. Getting Started With Sage Timberline Office ODBC ODBC Gettig Started With Sage Timberlie Office ODBC NOTICE This documet ad the Sage Timberlie Office software may be used oly i accordace with the accompayig Sage Timberlie Office Ed User Licese Agreemet.

More information

Math C067 Sampling Distributions

Math C067 Sampling Distributions Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters

More information

Chatpun Khamyat Department of Industrial Engineering, Kasetsart University, Bangkok, Thailand [email protected]

Chatpun Khamyat Department of Industrial Engineering, Kasetsart University, Bangkok, Thailand ocpky@hotmail.com SOLVING THE OIL DELIVERY TRUCKS ROUTING PROBLEM WITH MODIFY MULTI-TRAVELING SALESMAN PROBLEM APPROACH CASE STUDY: THE SME'S OIL LOGISTIC COMPANY IN BANGKOK THAILAND Chatpu Khamyat Departmet of Idustrial

More information

Domain 1: Designing a SQL Server Instance and a Database Solution

Domain 1: Designing a SQL Server Instance and a Database Solution Maual SQL Server 2008 Desig, Optimize ad Maitai (70-450) 1-800-418-6789 Domai 1: Desigig a SQL Server Istace ad a Database Solutio Desigig for CPU, Memory ad Storage Capacity Requiremets Whe desigig a

More information

A Fuzzy Model of Software Project Effort Estimation

A Fuzzy Model of Software Project Effort Estimation TJFS: Turkish Joural of Fuzzy Systems (eissn: 309 90) A Official Joural of Turkish Fuzzy Systems Associatio Vol.4, No.2, pp. 68-76, 203 A Fuzzy Model of Software Project Effort Estimatio Oumout Chouseioglou

More information

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth Questio 1: What is a ordiary auity? Let s look at a ordiary auity that is certai ad simple. By this, we mea a auity over a fixed term whose paymet period matches the iterest coversio period. Additioally,

More information

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork Solutios to Selected Problems I: Patter Classificatio by Duda, Hart, Stork Joh L. Weatherwax February 4, 008 Problem Solutios Chapter Bayesia Decisio Theory Problem radomized rules Part a: Let Rx be the

More information

CHAPTER 3 DIGITAL CODING OF SIGNALS

CHAPTER 3 DIGITAL CODING OF SIGNALS CHAPTER 3 DIGITAL CODING OF SIGNALS Computers are ofte used to automate the recordig of measuremets. The trasducers ad sigal coditioig circuits produce a voltage sigal that is proportioal to a quatity

More information

Estimating Probability Distributions by Observing Betting Practices

Estimating Probability Distributions by Observing Betting Practices 5th Iteratioal Symposium o Imprecise Probability: Theories ad Applicatios, Prague, Czech Republic, 007 Estimatig Probability Distributios by Observig Bettig Practices Dr C Lych Natioal Uiversity of Irelad,

More information

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection The aalysis of the Courot oligopoly model cosiderig the subjective motive i the strategy selectio Shigehito Furuyama Teruhisa Nakai Departmet of Systems Maagemet Egieerig Faculty of Egieerig Kasai Uiversity

More information

Entropy of bi-capacities

Entropy of bi-capacities Etropy of bi-capacities Iva Kojadiovic LINA CNRS FRE 2729 Site école polytechique de l uiv. de Nates Rue Christia Pauc 44306 Nates, Frace [email protected] Jea-Luc Marichal Applied Mathematics

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean 1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.

More information

*The most important feature of MRP as compared with ordinary inventory control analysis is its time phasing feature.

*The most important feature of MRP as compared with ordinary inventory control analysis is its time phasing feature. Itegrated Productio ad Ivetory Cotrol System MRP ad MRP II Framework of Maufacturig System Ivetory cotrol, productio schedulig, capacity plaig ad fiacial ad busiess decisios i a productio system are iterrelated.

More information

HCL Dynamic Spiking Protocol

HCL Dynamic Spiking Protocol ELI LILLY AND COMPANY TIPPECANOE LABORATORIES LAFAYETTE, IN Revisio 2.0 TABLE OF CONTENTS REVISION HISTORY... 2. REVISION.0... 2.2 REVISION 2.0... 2 2 OVERVIEW... 3 3 DEFINITIONS... 5 4 EQUIPMENT... 7

More information

5 Boolean Decision Trees (February 11)

5 Boolean Decision Trees (February 11) 5 Boolea Decisio Trees (February 11) 5.1 Graph Coectivity Suppose we are give a udirected graph G, represeted as a boolea adjacecy matrix = (a ij ), where a ij = 1 if ad oly if vertices i ad j are coected

More information

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here). BEGINNING ALGEBRA Roots ad Radicals (revised summer, 00 Olso) Packet to Supplemet the Curret Textbook - Part Review of Square Roots & Irratioals (This portio ca be ay time before Part ad should mostly

More information

Chapter 5 O A Cojecture Of Erdíos Proceedigs NCUR VIII è1994è, Vol II, pp 794í798 Jeærey F Gold Departmet of Mathematics, Departmet of Physics Uiversity of Utah Do H Tucker Departmet of Mathematics Uiversity

More information

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13 EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Note 13 Itroductio At this poit, we have see eough examples that it is worth just takig stock of our model of probability ad may

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as:

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as: A Test of Normality Textbook Referece: Chapter. (eighth editio, pages 59 ; seveth editio, pages 6 6). The calculatio of p values for hypothesis testig typically is based o the assumptio that the populatio

More information

Quadrat Sampling in Population Ecology

Quadrat Sampling in Population Ecology Quadrat Samplig i Populatio Ecology Backgroud Estimatig the abudace of orgaisms. Ecology is ofte referred to as the "study of distributio ad abudace". This beig true, we would ofte like to kow how may

More information

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book) MEI Mathematics i Educatio ad Idustry MEI Structured Mathematics Module Summary Sheets Statistics (Versio B: referece to ew book) Topic : The Poisso Distributio Topic : The Normal Distributio Topic 3:

More information

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals Overview Estimatig the Value of a Parameter Usig Cofidece Itervals We apply the results about the sample mea the problem of estimatio Estimatio is the process of usig sample data estimate the value of

More information

Measuring Magneto Energy Output and Inductance Revision 1

Measuring Magneto Energy Output and Inductance Revision 1 Measurig Mageto Eergy Output ad Iductace evisio Itroductio A mageto is fudametally a iductor that is mechaically charged with a iitial curret value. That iitial curret is produced by movemet of the rotor

More information

How to read A Mutual Fund shareholder report

How to read A Mutual Fund shareholder report Ivestor BulletI How to read A Mutual Fud shareholder report The SEC s Office of Ivestor Educatio ad Advocacy is issuig this Ivestor Bulleti to educate idividual ivestors about mutual fud shareholder reports.

More information

FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10

FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10 FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10 [C] Commuicatio Measuremet A1. Solve problems that ivolve liear measuremet, usig: SI ad imperial uits of measure estimatio strategies measuremet strategies.

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Lecture 2: Karger s Min Cut Algorithm

Lecture 2: Karger s Min Cut Algorithm priceto uiv. F 3 cos 5: Advaced Algorithm Desig Lecture : Karger s Mi Cut Algorithm Lecturer: Sajeev Arora Scribe:Sajeev Today s topic is simple but gorgeous: Karger s mi cut algorithm ad its extesio.

More information

INVESTMENT PERFORMANCE COUNCIL (IPC) Guidance Statement on Calculation Methodology

INVESTMENT PERFORMANCE COUNCIL (IPC) Guidance Statement on Calculation Methodology Adoptio Date: 4 March 2004 Effective Date: 1 Jue 2004 Retroactive Applicatio: No Public Commet Period: Aug Nov 2002 INVESTMENT PERFORMANCE COUNCIL (IPC) Preface Guidace Statemet o Calculatio Methodology

More information

Sequences and Series

Sequences and Series CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their

More information

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis Ruig Time ( 3.) Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects.

More information

THE HEIGHT OF q-binary SEARCH TREES

THE HEIGHT OF q-binary SEARCH TREES THE HEIGHT OF q-binary SEARCH TREES MICHAEL DRMOTA AND HELMUT PRODINGER Abstract. q biary search trees are obtaied from words, equipped with the geometric distributio istead of permutatios. The average

More information

MARTINGALES AND A BASIC APPLICATION

MARTINGALES AND A BASIC APPLICATION MARTINGALES AND A BASIC APPLICATION TURNER SMITH Abstract. This paper will develop the measure-theoretic approach to probability i order to preset the defiitio of martigales. From there we will apply this

More information

Evaluating Model for B2C E- commerce Enterprise Development Based on DEA

Evaluating Model for B2C E- commerce Enterprise Development Based on DEA , pp.180-184 http://dx.doi.org/10.14257/astl.2014.53.39 Evaluatig Model for B2C E- commerce Eterprise Developmet Based o DEA Weli Geg, Jig Ta Computer ad iformatio egieerig Istitute, Harbi Uiversity of

More information

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments Project Deliverables CS 361, Lecture 28 Jared Saia Uiversity of New Mexico Each Group should tur i oe group project cosistig of: About 6-12 pages of text (ca be loger with appedix) 6-12 figures (please

More information

Evaluation of Different Fitness Functions for the Evolutionary Testing of an Autonomous Parking System

Evaluation of Different Fitness Functions for the Evolutionary Testing of an Autonomous Parking System Evaluatio of Differet Fitess Fuctios for the Evolutioary Testig of a Autoomous Parkig System Joachim Wegeer 1, Oliver Bühler 2 1 DaimlerChrysler AG, Research ad Techology, Alt-Moabit 96 a, D-1559 Berli,

More information

Forecasting. Forecasting Application. Practical Forecasting. Chapter 7 OVERVIEW KEY CONCEPTS. Chapter 7. Chapter 7

Forecasting. Forecasting Application. Practical Forecasting. Chapter 7 OVERVIEW KEY CONCEPTS. Chapter 7. Chapter 7 Forecastig Chapter 7 Chapter 7 OVERVIEW Forecastig Applicatios Qualitative Aalysis Tred Aalysis ad Projectio Busiess Cycle Expoetial Smoothig Ecoometric Forecastig Judgig Forecast Reliability Choosig the

More information

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:

More information

Convention Paper 6764

Convention Paper 6764 Audio Egieerig Society Covetio Paper 6764 Preseted at the 10th Covetio 006 May 0 3 Paris, Frace This covetio paper has bee reproduced from the author's advace mauscript, without editig, correctios, or

More information

Subject CT5 Contingencies Core Technical Syllabus

Subject CT5 Contingencies Core Technical Syllabus Subject CT5 Cotigecies Core Techical Syllabus for the 2015 exams 1 Jue 2014 Aim The aim of the Cotigecies subject is to provide a groudig i the mathematical techiques which ca be used to model ad value

More information

AP Calculus AB 2006 Scoring Guidelines Form B

AP Calculus AB 2006 Scoring Guidelines Form B AP Calculus AB 6 Scorig Guidelies Form B The College Board: Coectig Studets to College Success The College Board is a ot-for-profit membership associatio whose missio is to coect studets to college success

More information

THE ARITHMETIC OF INTEGERS. - multiplication, exponentiation, division, addition, and subtraction

THE ARITHMETIC OF INTEGERS. - multiplication, exponentiation, division, addition, and subtraction THE ARITHMETIC OF INTEGERS - multiplicatio, expoetiatio, divisio, additio, ad subtractio What to do ad what ot to do. THE INTEGERS Recall that a iteger is oe of the whole umbers, which may be either positive,

More information