Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger

Size: px
Start display at page:

Download "Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger"

Transcription

1 Enrchng the Knowledge Sources Used n a Maxmum Entropy Part-of-Speech Tagger Krstna Toutanova Dept of Computer Scence Gates Bldg 4A, 353 Serra Mall Stanford, CA , USA krstna@cs.stanford.edu Chrstopher D. Mannng Depts of Computer Scence and Lngustcs Gates Bldg 4A, 353 Serra Mall Stanford, CA , USA mannng@cs.stanford.edu Abstract Ths paper presents results for a maxmumentropy-based part of speech tagger, whch acheves superor performance prncpally by enrchng the nformaton sources used for taggng. In partcular, we get mproved results by ncorporatng these features: () more extensve treatment of captalzaton for unknown words; () features for the dsambguaton of the tense forms of verbs; () features for dsambguatng partcles from prepostons and adverbs. The best resultng accuracy for the tagger on the Penn Treebank s 96.86% overall, and 86.91% on prevously unseen words. Introducton 1 There are now numerous systems for automatc assgnment of parts of speech ( taggng ), employng many dfferent machne learnng methods. Among recent top performng methods are Hdden Markov Models (Brants 2000), maxmum entropy approaches (Ratnaparkh 1996), and transformaton-based learnng (Brll 1994). An overvew of these and other approaches can be found n Mannng and Schütze (1999, ch. 10). However, all these methods use largely the same nformaton sources for taggng, and often almost the same features as well, and as a consequence they also offer very smlar levels of performance. Ths stands n contrast to the (manually-bul EngCG tagger, whch acheves better performance by usng lexcal and contextual nformaton sources and generalzatons beyond those avalable to such statstcal taggers, as Samuelsson and Voutlanen (1997) demonstrate. 1 We thank Dan Klen and Mchael Saunders for useful dscussons, and the anonymous revewers for many helpful comments. Ths paper explores the noton that automatcally bult tagger performance can be further mproved by expandng the knowledge sources avalable to the tagger. We pay specal attenton to unknown words, because the markedly lower accuracy on unknown word taggng means that ths s an area where sgnfcant performance gans seem possble. We adopt a maxmum entropy approach because t allows the ncluson of dverse sources of nformaton wthout causng fragmentaton and wthout necessarly assumng ndependence between the predctors. A maxmum entropy approach has been appled to partof-speech taggng before (Ratnaparkh 1996), but the approach s ablty to ncorporate nonlocal and non-hmm-tagger-type evdence has not been fully explored. Ths paper descrbes the models that we developed and the experments we performed to evaluate them. 1 The Baselne Maxmum Entropy Model We started wth a maxmum entropy based tagger that uses features very smlar to the ones proposed n Ratnaparkh (1996). The tagger learns a loglnear condtonal probablty model from tagged text, usng a maxmum entropy method. The model assgns a probablty for every tag t n the set Τ of possble tags gven a word and ts context h, whch s usually defned as the sequence of several words and tags precedng the word. Ths model can be used for estmatng the probablty of a tag sequence t 1 t n gven a sentence w 1 w n : t t w w ) = n t t, w w ) t h ) 1 n 1 n n = 1 = 1 t n As usual, taggng s the process of assgnng the maxmum lkelhood tag sequence to a strng of words. The dea of maxmum entropy modelng s to choose the probablty dstrbuton p that has the hghest entropy out of those dstrbutons

2 that satsfy a certan set of constrants. The constrants restrct the model to behave n accordance wth a set of statstcs collected from the tranng data. The statstcs are expressed as the expected values of approprate functons defned on the contexts h and tags t. In partcular, the constrants demand that the expectatons of the features for the model match the emprcal expectatons of the features over the tranng data. For example, f we want to constran the model to tag make as a verb or noun wth the same frequency as the emprcal model nduced by the tranng data, we defne the features: f ( h, = 1 1 f ( h, = 1 2 ff ff w w = make and t = NN = make and t = VB Some commonly used statstcs for part of speech taggng are: how often a certan word was tagged n a certan way; how often two tags appeared n sequence or how often three tags appeared n sequence. These look a lot lke the statstcs a Markov Model would use. However, n the maxmum entropy framework t s possble to easly defne and ncorporate much more complex statstcs, not restrcted to n-gram sequences. The constrants n our model are that the expectatons of these features accordng to the jont dstrbuton p are equal to the expectatons of the features n the emprcal (tranng data) dstrbuton ~ p : E p ( h, f ( h, = E ~ p ( h, f ( h,. Havng defned a set of constrants that our model should accord wth, we proceed to fnd the model satsfyng the constrants that maxmzes the condtonal entropy of p. The ntuton s that such a model assumes nothng apart from that t should satsfy the gven constrants. Followng Berger et al. (1996), we approxmate p ( h,, the jont dstrbuton of contexts and tags, by the product of ~ p ( h ), the emprcal dstrbuton of hstores h, and the condtonal dstrbuton p ( t h) : h, ~ p ( h) t h). Then for the example above, our constrants j 1,2 : would be the followng, for { } h H, t T ~ p ( h, f j ( h, = h H, t T ~ p ( h) t h) f j ( h, Ths approxmaton s used to enable effcent computaton. The expectaton for a feature f s: E f = h H, t T ~ p ( h) t h) f ( h, where H s the space of possble contexts h when predctng a part of speech tag t. Snce the contexts contan sequences of words and tags and other nformaton, the space H s huge. But usng ths approxmaton, we can nstead sum just over the smaller space of observed contexts X n the tranng sample, because the emprcal pror ~ p ( h ) s zero for unseen contexts h: E f = ~ h) t h) f ( h, (1) h X, t T The model that s a soluton to ths constraned optmzaton task s an exponental (or equvalently, loglnear) model wth the parametrc form: λ j f j ( h, e p t h = j= 1,, K ( ) λ j f j ( h, t ) t T j= 1, e, K where the denomnator s a normalzng term (sometmes referred to as the partton functon). The parameters j correspond to weghts for the features f j. We wll not dscuss n detal the characterstcs of the model or the parameter estmaton procedure used Improved Iteratve Scalng. For a more extensve dscusson of maxmum entropy methods, see Berger et al. (1996) and Jelnek (1997). However, we note that our parameter estmaton algorthm drectly uses equaton (1). Ratnaparkh (1996: 134) suggests use of an approxmaton summng over the tranng data, whch does not sum over possble tags: E f j n = 1 ~ p ( h ) t h ) f j ( h, t ) However, we beleve ths passage s n error: such an estmate s neffectve n the teratve scalng algorthm. Further, we note that expectatons of the form (1) appear n Ratnaparkh (1998: 12). 1.1 Features n the Baselne Model In our baselne model, the context avalable when predctng the part of speech tag of a word w n a sentence of words {w 1 w n } wth tags {t 1 t n } s {t -1 t -2 w w +1 }. The features that defne the constrants on the model are obtaned by nstantaton of feature templates as n Ratnaparkh (1996). Specal feature templates exst for rare words n the tranng data, to ncrease the model s predcton capacty for unknown words.

3 The actual feature templates for ths model are shown n the next table. They are a subset of the features used n Ratnaparkh (1996). No. Feature Type Template 1. General w =X & t =T 2. General t -1 =T 1 & t =T 3. General t -1 =T 1 & t -2 =T 2 & t =T 4. General w +1 =X & t =T 5. Rare Suffx of w =S, S <5 & t =T 6. Rare Prefx of w =P, 1< P <5 & t =T 7. Rare w contans a number & t =T 8. Rare w contans an uppercase character & t =T 9. Rare w contans a hyphen & t =T Table 1 Baselne Model Features General feature templates can be nstantated by arbtrary contexts, whereas rare feature templates are nstantated only by hstores where the current word w s rare. Rare words are defned to be words that appear less than a certan number of tmes n the tranng data (here, the value 7 was used). In order to be able to throw out features that would gve msleadng statstcs due to sparseness or nose n the data, we use two dfferent cutoff values for general and rare feature templates (n ths mplementaton, 5 and 45 respectvely). As seen n Table 1 the features are conjunctons of a boolean functon on the hstory h and a boolean functon on the tag t. Features whose frst conjuncts are true for more than the correspondng threshold number of hstores n the tranng data are ncluded n the model. The feature templates n Ratnaparkh (1996) that were left out were the ones that look at the prevous word, the word two postons before the current, and the word two postons after the current. These features are of the same form as template 4 n Table 1, but they look at words n dfferent postons. Our motvaton for leavng these features out was the results from some experments on successvely addng feature templates. Addng template 4 to a model that ncorporated the general feature templates 1 to 3 only and the rare feature templates 5 8 sgnfcantly ncreased the accuracy on the development set from 96.0% to 96.52%. The addton of a feature template that looked at the precedng word and the current tag to the resultng model slghtly reduced the accuracy. 1.2 Testng and Performance The model was traned and tested on the part-ofspeech tagged WSJ secton of the Penn Treebank. The data was dvded nto contguous parts: sectons 0 20 were used for tranng, sectons as a development test set, and sectons as a fnal test set. The data set szes are shown below together wth numbers of unknown words. Data Set Tokens Unknown Tranng 1,061,768 Words Development 116, (2.81%) Test 111, (2.59%) Table 2 Data Szes The testng procedure uses a beam search to fnd the tag sequence wth maxmal probablty gven a sentence. In our experments we used a beam of sze 5. Increasng the beam sze dd not result n mproved accuracy. The precedng tags for the word at the begnnng of the sentence are regarded as havng the pseudo-tag NA. In ths way, the nformaton that a word s the frst word n a sentence s avalable to the tagger. We do not have a specal end-of-sentence symbol. We used a tag dctonary for known words n testng. Ths was bult from tags found n the tranng data but augmented so as to capture a few basc systematc tag ambgutes that are found n Englsh. Namely, for regular verbs the -ed form can be ether a VBD or a VBN and smlarly the stem form can be ether a VBP or VB. Hence for words that had occurred wth only one of these tags n the tranng data the other was also ncluded as possble for assgnment. The results on the test set for the Baselne model are shown n Table 3. Model Overall Unknown Word Accuracy Accuracy Baselne 96.72% 84.5% Ratnaparkh (1996) 96.63% 85.56% Table 3 Baselne model performance Ths table also shows the results reported n Ratnaparkh (1996: 142) for convenence. The accuracy fgure for our model s hgher overall

4 but lower for unknown words. Ths may stem from the dfferences between the two models feature templates, thresholds, and approxmatons of the expected values for the features, as dscussed n the begnnng of the secton, or may just reflect dfferences n the choce of tranng and test sets (whch are not precsely specfed n Ratnaparkh (1996)). The dfferences are not great enough to justfy any defnte statement about the dfferent use of feature templates or other partculartes of the model estmaton. One concluson that we can draw s that at present the addtonal word features used n Ratnaparkh (1996) lookng at words more than one poston away from the current do not appear to be helpng the overall performance of the models. 1.3 Dscusson of Problematc Cases A large number of words, ncludng many of the most common words, can have more than one syntactc category. Ths ntroduces a lot of ambgutes that the tagger has to resolve. Some of the ambgutes are easer for taggers to resolve and others are harder. Some of the most sgnfcant confusons that the Baselne model made on the test set can be seen n Table 5. The row labels n Table 5 sgnfy the correct tags, and the column labels sgnfy the assgned tags. For example, the number 244 n the (NN, JJ) poston s the number of words that were NNs but were ncorrectly assgned the JJ category. These partcular confusons, shown n the table, account for a large percentage of the total error (2652/3651 = 72.64%). Table 6 shows part of the Baselne model s confuson matrx for just unknown words. Table 4 shows the Baselne model s overall assgnment accuraces for dfferent parts of speech. For example, the accuracy on nouns s greater than the accuracy on adjectves. The accuracy on NNPS (plural proper nouns) s a surprsngly low 41.1%. Tag Accuracy Tag Accuracy IN 97.3% JJ 93.0% NN 96.5% RB 92.2% NNP 96.2% VBN 90.4% VBD 95.2% RP 41.5% VB 94.0% NNPS 41.1% VBP 93.4% Table 4 Accuracy of assgnments for dfferent parts of speech for the Baselne model. Tagger errors are of varous types. Some are the result of nconsstency n labelng n the tranng data (Ratnaparkh 1996), whch usually reflects a lack of lngustc clarty or determnaton of the correct part of speech n context. For nstance, the status of varous noun premodfers (whether chef or maxmum s NN or JJ, or whether a word n -ng s actng as a JJ or VBG) s of ths type. Some, such as errors between NN/NNP/NNPS/NNS largely reflect dffcultes wth unknown words. But other cases, such as VBN/VBD and VB/VBP/NN, represent systematc tag ambguty patterns n Englsh, for whch the rght answer s nvarably clear n context, and for whch there are n general good structural contextual clues that one should be able to use to dsambguate. Fnally, n another class of cases, of whch the most promnent s probably the RP/IN/RB ambguty of words lke up, out, and on, the lngustc dstnctons, whle havng a sound emprcal bass (e.g., see Baker (1995: ), are qute subtle, and often requre semantc ntutons. There are not good syntactc cues for the correct tag (and furthermore, human taggers not nfrequently make errors). Wthn ths classfcaton, the greatest hopes for taggng mprovement appear to come from mnmzng errors n the second and thrd classes of ths classfcaton. In the followng sectons we dscuss how we nclude addtonal knowledge sources to help n the assgnment of tags to forms of verbs, captalzed unknown words, partcle words, and n the overall accuracy of part of speech assgnments. 2 Improvng the Unknown Words Model The accuracy of the baselne model s markedly lower for unknown words than for prevously seen ones. Ths s also the case for all other taggers, and reflects the mportance of lexcal nformaton to taggers: n the best accuracy fgures publshed for corpus-based taggers, known word accuracy s around 97%, whereas unknown word accuracy s around 85%. In followng experments, we examned ways of usng addtonal features to mprove the accuracy of taggng unknown words. As prevously dscussed n Mkheev (1999), t s possble to mprove the accuracy on captalzed words that mght be proper nouns or the frst word n a sentence, etc.

5 JJ NN NNP NNPS RB RP IN VB VBD VBN VBP Total JJ NN NNP NNPS RB RP IN VB VBD VBN VBP Total Table 5 Confuson matrx of the Baselne model showng top confuson pars overall JJ NN NNP NNS NNPS VBN Total JJ NN NNP NNPS NNS VBN Total Table 6 Confuson matrx of the Baselne model for unknown words showng top confuson pars Baselne Model 1 Model 2 Model 3 Captalzaton Verb forms Partcles Accuracy Test Set 96.72% 96.76% 96.83% 96.86% Unknown Words Accuracy Test Set 84.50% 86.76% 86.87% 86.91% Accuracy Development Set 96.53% 96.55% 96.58% 96.62% Unknown Words Accuracy Development Set 85.48% 86.03% 86.03% 86.06% Table 7 Accuraces of all models on the test and development sets Baselne Model 1 Captalzaton Model 2 Verb Forms Model 3 Partcles 1. Current word 15,832 15,832 15,837 15, Prevous tag 1,424 1,424 1,424 1, Prevous two tags 16,124 16,124 16,124 16, Next word 80,075 80,075 80,075 80, Suffxes 3,361 3,361 3,361 3, Prefxes 5, Contans uppercase character Contans number Contans hyphen Captalzed and md. sentence All letters uppercase VBP VB feature VBD VBN feature Partcles, type Partcles, type ,178 Total 122, , , ,944 Table 8 Number of features of dfferent types

6 For example, the error on the proper noun category (NNP) accounts for a sgnfcantly larger percent of the total error for unknown words than for known words. In the Baselne model, of the unknown word error 41.3% s due to words beng NNP and assgned to some other category, or beng of other category and assgned NNP. The percentage of the same type of error for known words s 16.2%. The ncorporaton of the followng two feature schemas greatly mproved NNP accuracy: (1) A feature that looks at whether all the letters of a word are uppercase. The feature that looked at captalzaton before (cf. Table 1, feature No. 8) s actvated when the word contans an uppercase character. Ths turns out to be a notable dstncton because, for example, n ttles n the WSJ data all words are n all uppercase, and the dstrbuton of tags for these words s dfferent from the overall dstrbuton for words that contan an uppercase character. (2) A feature that s actvated when the word contans an uppercase character and t s not at the start of a sentence. These word tokens also have a dfferent tag dstrbuton from the dstrbuton for all tokens that contan an uppercase character. Conversely, emprcally t was found that the prefx features for rare words were havng a net negatve effect on accuracy. We do not at present have a good explanaton for ths phenomenon. The addton of the features (1) and (2) and the removal of the prefx features consderably mproved the accuracy on unknown words and the overall accuracy. The results on the test set after addng these features are shown below: Overall Accuracy Unknown Word Accuracy 96.76% 86.76% Table 9 Accuracy when addng captalzaton features and removng prefx features. Unknown word error s reduced by 15% as compared to the Baselne model. It s mportant to note that (2) s composed of nformaton already known to the tagger n some sense. Ths feature can be vewed as the conjuncton of two features, one of whch s already n the baselne model, and the other of whch s the negaton of a feature exstng n the baselne model snce for words at the begnnng of a sentence, the precedng tag s the pseudo-tag NA, and there s a feature lookng at the precedng tag. Even though our maxmum entropy model does not requre ndependence among the predctors, t provdes for free only a smple combnaton of feature weghts, and addtonal nteracton terms are needed to model non-addtve nteractons (n log-space terms) between features. 3 Features for Dsambguatng Verb Forms Two of the most sgnfcant sources of classfer errors are the VBN/VBD ambguty and the VBP/VB ambguty. As seen n Table 5, VBN/VBD confusons account for 6.9% of the total word error. The VBP/VB confusons are a smaller 3.7% of the errors. In many cases t s easy for people (and for taggers) to determne the correct form. For example, f there s a to nfntve or a modal drectly precedng the VB/VBP ambguous word, the form s certanly non-fnte. But often the modal can be several postons away from the current poston stll obvous to a human, but out of sght for the baselne model. To help resolve a VB/VBP ambguty n such cases, we can add a feature that looks at the precedng several words (we have chosen 8 as a threshold), but not across another verb, and actvates f there s a to there, a modal verb, or a form of do, let, make, or help (verbs that frequently take a bare nfntve complemen. Rather than havng a separate feature look at each precedng poston, we defne one feature that looks at the chosen number of postons to the left. Ths both ncreases the scope of the avalable hstory for the tagger and provdes a better statstc because t avods fragmentaton. We added a smlar feature for resolvng VBD/VBN confusons. It actvates f there s a have or be auxlary form n the precedng several postons (agan the value 8 s used n the mplementaton). The form of these two feature templates was motvated by the structural rules of Englsh and not nduced from the tranng data, but t should be possble to look for predctors for certan parts of speech n the precedng words n the sentence by, for example, computng assocaton strengths. The addton of the two feature schemas helped reduce the VB/VBP and VBD/VBN confusons. Below s the performance on the test set

7 of the resultng model when features for dsambguatng verb forms are added to the model of Secton 2. The number of VB/VBP confusons was reduced by 23.1% as compared to the baselne. The number of VBD/VBN confusons was reduced by 12.3%. Overall Accuracy Unknown Word Accuracy 96.83% 86.87% Table 10 Accuracy of the extended model 4 Features for Partcle Dsambguaton As dscussed n secton 1.3 above, the task of determnng RB/RP/IN tags for words lke down, out, up s dffcult and n partcular examples, there are often no good local syntactc ndcators. For nstance, n (2), we fnd the exact same sequence of parts of speech, but (2a) s a partcle use of on, whle (2b) s a prepostonal use. Consequently, the accuracy on the rarer RP (partcles) category s as low as 41.5% for the Baselne model (cf. Table 4). (2) a. Km took on the monster. b. Km sat on the monster. We tred to mprove the tagger s capablty to resolve these ambgutes through addng nformaton on verbs preferences to take specfc words as partcles, or adverbs, or prepostons. There are verbs that take partcles more than others, and partcular words lke out are much more lkely to be used as a partcle n the context of some verb than other words ambguous between these tags. We added two dfferent feature templates to capture ths nformaton, consstng as usual of a predcate on the hstory h, and a condton on the tag t. The frst predcate s true f the current word s often used as a partcle, and f there s a verb at most 3 postons to the left, whch s known to have a good chance of takng the current word as a partcle. The verb-partcle pars that are known by the system to be very common were collected through analyss of the tranng data n a preprocessng stage. The second feature template has the form: The last verb s v and the current word s w and w has been tagged as a partcle and the current tag s t. The last verb s the pseudo-symbol NA f there s no verb n the prevous three postons. These features were some help n reducng the RB/IN/RP confusons. The accuracy on the RP category rose to 44.3%. Although the overall confusons n ths class were reduced, some of the errors were ncreased, for example, the number of INs classfed as RBs rose slghtly. There seems to be stll consderable room to mprove these results, though the attanable accuracy s lmted by the accuracy wth whch these dstnctons are marked n the Penn Treebank (on a quck nformal study, ths accuracy seems to be around 85%). The next table shows the fnal performance on the test set. Overall Accuracy Unknown Word Accuracy 96.86% 86.91% Table 11 Accuracy of the fnal model For ease of comparson, the accuraces of all models on the test and development sets are shown n Table 7. We note that accuracy s lower on the development set. Ths presumably corresponds wth Charnak s (2000: 136) observaton that Secton 23 of the Penn Treebank s easer than some others. Table 8 shows the dfferent number of feature templates of each knd that have been nstantated for the dfferent models as well as the total number of features each model has. It can be seen that the features whch help dsambguate verb forms, whch look at captalzaton and the frst of the feature templates for partcles are a very small number as compared to the features of the other knds. The mprovement n classfcaton accuracy therefore comes at the prce of addng very few parameters to the maxmum entropy model and does not result n ncreased model complexty. Concluson Even when the accuracy fgures for corpusbased part-of-speech taggers start to look extremely smlar, t s stll possble to move performance levels up. The work presented n ths paper explored just a few nformaton sources n addton to the ones usually used for taggng. Whle progress s slow, because each new feature apples only to a lmted range of cases, nevertheless the mprovement n accuracy as compared to prevous results s notceable, partcularly for the ndvdual decsons on whch we focused. The potental of maxmum entropy methods has not prevously been fully exploted for the task of assgnment of parts of speech. We ncorporated nto a maxmum entropy-based tagger

8 more lngustcally sophstcated features, whch are non-local and do not look just at partcular postons n the text. We also added features that model the nteractons of prevously employed predctors. All of these changes led to modest ncreases n taggng accuracy. Ths paper has thus presented some ntal experments n mprovng tagger accuracy through usng addtonal nformaton sources. In the future we hope to explore automatcally dscoverng nformaton sources that can be proftably ncorporated nto maxmum entropy part-of-speech predcton. References Baker, C. L Englsh Syntax. Cambrdge, MA: MIT Press, 2 nd edton. Berger, Adam L., Della Petra, Stephen A., and Della Petra, Vncent J A Maxmum Entropy Approach to Natural Language Processng. Computatonal Lngustcs 22: Brants, Thorsten TnT A Statstcal Part-of- Speech Tagger. In Proceedngs of the Sxth Appled Natural Language Processng Conference (ANLP 2000), Seattle, WA, pp Brll, Erc Some Advances n Transformaton- Based Part of Speech Taggng. Proceedngs of AAAI, Vol. 1, pp Charnak, Eugene A Maxmum-Entropy- Inspred Parser. Proceedngs of the 1 st Meetng of the North Amercan Chapter of the Assocaton for Computatonal Lngustcs, pp Jelnek, Frederck Statstcal Methods for Speech Recognton. Cambrdge, MA: MIT Press. Mannng, Chrstopher D. and Hnrch Schütze Foundatons of Statstcal Natural Language Processng. Cambrdge, MA: MIT Press. Mkheev, Andre Perods, Captalzed Words, etc. Ms., Unversty of Ednburgh. Avalable at: Ratnaparkh, Adwat A maxmum entropy model for part-of-speech taggng. In Proceedngs of the Conference on Emprcal Methods n Natural Language Processng, Unversty of Pennsylvana, pp Ratnaparkh, Adwat Maxmum Entropy Models for Natural Language Ambguty Resoluton. PhD Thess, Unversty of Pennsylvana. Samuelsson, Chrster and Atro Voutlanen Comparng a Lngustc and a Stochastc Tagger. In Proceedngs of the 25 th Annual Meetng of the Assocaton for Computatonal Lngustcs, pp

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Luby s Alg. for Maximal Independent Sets using Pairwise Independence Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

A Probabilistic Theory of Coherence

A Probabilistic Theory of Coherence A Probablstc Theory of Coherence BRANDEN FITELSON. The Coherence Measure C Let E be a set of n propostons E,..., E n. We seek a probablstc measure C(E) of the degree of coherence of E. Intutvely, we want

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

+ + + - - This circuit than can be reduced to a planar circuit

+ + + - - This circuit than can be reduced to a planar circuit MeshCurrent Method The meshcurrent s analog of the nodeoltage method. We sole for a new set of arables, mesh currents, that automatcally satsfy KCLs. As such, meshcurrent method reduces crcut soluton to

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

Lecture 2: Single Layer Perceptrons Kevin Swingler

Lecture 2: Single Layer Perceptrons Kevin Swingler Lecture 2: Sngle Layer Perceptrons Kevn Sngler kms@cs.str.ac.uk Recap: McCulloch-Ptts Neuron Ths vastly smplfed model of real neurons s also knon as a Threshold Logc Unt: W 2 A Y 3 n W n. A set of synapses

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

Logistic Regression. Steve Kroon

Logistic Regression. Steve Kroon Logstc Regresson Steve Kroon Course notes sectons: 24.3-24.4 Dsclamer: these notes do not explctly ndcate whether values are vectors or scalars, but expects the reader to dscern ths from the context. Scenaro

More information

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements Lecture 3 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange

More information

L10: Linear discriminants analysis

L10: Linear discriminants analysis L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss

More information

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S S C H E D A E I N F O R M A T I C A E VOLUME 0 0 On Mean Squared Error of Herarchcal Estmator Stans law Brodowsk Faculty of Physcs, Astronomy, and Appled Computer Scence, Jagellonan Unversty, Reymonta

More information

Assessing Student Learning Through Keyword Density Analysis of Online Class Messages

Assessing Student Learning Through Keyword Density Analysis of Online Class Messages Assessng Student Learnng Through Keyword Densty Analyss of Onlne Class Messages Xn Chen New Jersey Insttute of Technology xc7@njt.edu Brook Wu New Jersey Insttute of Technology wu@njt.edu ABSTRACT Ths

More information

Traffic State Estimation in the Traffic Management Center of Berlin

Traffic State Estimation in the Traffic Management Center of Berlin Traffc State Estmaton n the Traffc Management Center of Berln Authors: Peter Vortsch, PTV AG, Stumpfstrasse, D-763 Karlsruhe, Germany phone ++49/72/965/35, emal peter.vortsch@ptv.de Peter Möhl, PTV AG,

More information

CHAPTER 14 MORE ABOUT REGRESSION

CHAPTER 14 MORE ABOUT REGRESSION CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp

More information

The Greedy Method. Introduction. 0/1 Knapsack Problem

The Greedy Method. Introduction. 0/1 Knapsack Problem The Greedy Method Introducton We have completed data structures. We now are gong to look at algorthm desgn methods. Often we are lookng at optmzaton problems whose performance s exponental. For an optmzaton

More information

Statistical Methods to Develop Rating Models

Statistical Methods to Develop Rating Models Statstcal Methods to Develop Ratng Models [Evelyn Hayden and Danel Porath, Österrechsche Natonalbank and Unversty of Appled Scences at Manz] Source: The Basel II Rsk Parameters Estmaton, Valdaton, and

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

Title Language Model for Information Retrieval

Title Language Model for Information Retrieval Ttle Language Model for Informaton Retreval Rong Jn Language Technologes Insttute School of Computer Scence Carnege Mellon Unversty Alex G. Hauptmann Computer Scence Department School of Computer Scence

More information

Transition Matrix Models of Consumer Credit Ratings

Transition Matrix Models of Consumer Credit Ratings Transton Matrx Models of Consumer Credt Ratngs Abstract Although the corporate credt rsk lterature has many studes modellng the change n the credt rsk of corporate bonds over tme, there s far less analyss

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

An Analysis of Central Processor Scheduling in Multiprogrammed Computer Systems

An Analysis of Central Processor Scheduling in Multiprogrammed Computer Systems STAN-CS-73-355 I SU-SE-73-013 An Analyss of Central Processor Schedulng n Multprogrammed Computer Systems (Dgest Edton) by Thomas G. Prce October 1972 Techncal Report No. 57 Reproducton n whole or n part

More information

Calculating the high frequency transmission line parameters of power cables

Calculating the high frequency transmission line parameters of power cables < ' Calculatng the hgh frequency transmsson lne parameters of power cables Authors: Dr. John Dcknson, Laboratory Servces Manager, N 0 RW E B Communcatons Mr. Peter J. Ncholson, Project Assgnment Manager,

More information

HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA*

HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* Luísa Farnha** 1. INTRODUCTION The rapd growth n Portuguese households ndebtedness n the past few years ncreased the concerns that debt

More information

IMPACT ANALYSIS OF A CELLULAR PHONE

IMPACT ANALYSIS OF A CELLULAR PHONE 4 th ASA & μeta Internatonal Conference IMPACT AALYSIS OF A CELLULAR PHOE We Lu, 2 Hongy L Bejng FEAonlne Engneerng Co.,Ltd. Bejng, Chna ABSTRACT Drop test smulaton plays an mportant role n nvestgatng

More information

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES Zuzanna BRO EK-MUCHA, Grzegorz ZADORA, 2 Insttute of Forensc Research, Cracow, Poland 2 Faculty of Chemstry, Jagellonan

More information

Damage detection in composite laminates using coin-tap method

Damage detection in composite laminates using coin-tap method Damage detecton n composte lamnates usng con-tap method S.J. Km Korea Aerospace Research Insttute, 45 Eoeun-Dong, Youseong-Gu, 35-333 Daejeon, Republc of Korea yaeln@kar.re.kr 45 The con-tap test has the

More information

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001 Proceedngs of the Annual Meetng of the Amercan Statstcal Assocaton, August 5-9, 2001 LIST-ASSISTED SAMPLING: THE EFFECT OF TELEPHONE SYSTEM CHANGES ON DESIGN 1 Clyde Tucker, Bureau of Labor Statstcs James

More information

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall SP 2005-02 August 2005 Staff Paper Department of Appled Economcs and Management Cornell Unversty, Ithaca, New York 14853-7801 USA Farm Savngs Accounts: Examnng Income Varablty, Elgblty, and Benefts Brent

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

I I I I DIAGNOSIS AS A NOTION OF GRAMMAR. to deal with them. Mitchell Marcus Artificial Intelligence Laboratory M.I.T.

I I I I DIAGNOSIS AS A NOTION OF GRAMMAR. to deal with them. Mitchell Marcus Artificial Intelligence Laboratory M.I.T. DAGNOSS AS A NOTON OF GRAMMAR Mtchell Marcus Artfcal ntellgence Laboratory M..T. Ths paper wll sketch an approach to natural language parsng based on a new concepton of what makes up a recognton grammar

More information

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12 14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed

More information

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services An Evaluaton of the Extended Logstc, Smple Logstc, and Gompertz Models for Forecastng Short Lfecycle Products and Servces Charles V. Trappey a,1, Hsn-yng Wu b a Professor (Management Scence), Natonal Chao

More information

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL

More information

Gender Classification for Real-Time Audience Analysis System

Gender Classification for Real-Time Audience Analysis System Gender Classfcaton for Real-Tme Audence Analyss System Vladmr Khryashchev, Lev Shmaglt, Andrey Shemyakov, Anton Lebedev Yaroslavl State Unversty Yaroslavl, Russa vhr@yandex.ru, shmaglt_lev@yahoo.com, andrey.shemakov@gmal.com,

More information

Stochastic Protocol Modeling for Anomaly Based Network Intrusion Detection

Stochastic Protocol Modeling for Anomaly Based Network Intrusion Detection Stochastc Protocol Modelng for Anomaly Based Network Intruson Detecton Juan M. Estevez-Tapador, Pedro Garca-Teodoro, and Jesus E. Daz-Verdejo Department of Electroncs and Computer Technology Unversty of

More information

How To Calculate An Approxmaton Factor Of 1 1/E

How To Calculate An Approxmaton Factor Of 1 1/E Approxmaton algorthms for allocaton problems: Improvng the factor of 1 1/e Urel Fege Mcrosoft Research Redmond, WA 98052 urfege@mcrosoft.com Jan Vondrák Prnceton Unversty Prnceton, NJ 08540 jvondrak@gmal.com

More information

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching) Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton

More information

Single and multiple stage classifiers implementing logistic discrimination

Single and multiple stage classifiers implementing logistic discrimination Sngle and multple stage classfers mplementng logstc dscrmnaton Hélo Radke Bttencourt 1 Dens Alter de Olvera Moraes 2 Vctor Haertel 2 1 Pontfíca Unversdade Católca do Ro Grande do Sul - PUCRS Av. Ipranga,

More information

BERNSTEIN POLYNOMIALS

BERNSTEIN POLYNOMIALS On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful

More information

Topic Identification based on Bayesian Belief Networks in the context of an Air Traffic Control Task

Topic Identification based on Bayesian Belief Networks in the context of an Air Traffic Control Task Procesamento del Lenguaje Natural, núm. 35 (2005), pp. 327-332 recbdo 06-05-2005; aceptado 01-06-2005 Topc Identfcaton based on Bayesan Belef Networs n the context of an Ar Traffc Control Tas F. Fernández,

More information

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel

More information

8 Algorithm for Binary Searching in Trees

8 Algorithm for Binary Searching in Trees 8 Algorthm for Bnary Searchng n Trees In ths secton we present our algorthm for bnary searchng n trees. A crucal observaton employed by the algorthm s that ths problem can be effcently solved when the

More information

Using Supervised Clustering Technique to Classify Received Messages in 137 Call Center of Tehran City Council

Using Supervised Clustering Technique to Classify Received Messages in 137 Call Center of Tehran City Council Usng Supervsed Clusterng Technque to Classfy Receved Messages n 137 Call Center of Tehran Cty Councl Mahdyeh Haghr 1*, Hamd Hassanpour 2 (1) Informaton Technology engneerng/e-commerce, Shraz Unversty (2)

More information

Fault tolerance in cloud technologies presented as a service

Fault tolerance in cloud technologies presented as a service Internatonal Scentfc Conference Computer Scence 2015 Pavel Dzhunev, PhD student Fault tolerance n cloud technologes presented as a servce INTRODUCTION Improvements n technques for vrtualzaton and performance

More information

Learning from Multiple Outlooks

Learning from Multiple Outlooks Learnng from Multple Outlooks Maayan Harel Department of Electrcal Engneerng, Technon, Hafa, Israel She Mannor Department of Electrcal Engneerng, Technon, Hafa, Israel maayanga@tx.technon.ac.l she@ee.technon.ac.l

More information

An MILP model for planning of batch plants operating in a campaign-mode

An MILP model for planning of batch plants operating in a campaign-mode An MILP model for plannng of batch plants operatng n a campagn-mode Yanna Fumero Insttuto de Desarrollo y Dseño CONICET UTN yfumero@santafe-concet.gov.ar Gabrela Corsano Insttuto de Desarrollo y Dseño

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

Brigid Mullany, Ph.D University of North Carolina, Charlotte

Brigid Mullany, Ph.D University of North Carolina, Charlotte Evaluaton And Comparson Of The Dfferent Standards Used To Defne The Postonal Accuracy And Repeatablty Of Numercally Controlled Machnng Center Axes Brgd Mullany, Ph.D Unversty of North Carolna, Charlotte

More information

A Performance Analysis of View Maintenance Techniques for Data Warehouses

A Performance Analysis of View Maintenance Techniques for Data Warehouses A Performance Analyss of Vew Mantenance Technques for Data Warehouses Xng Wang Dell Computer Corporaton Round Roc, Texas Le Gruenwald The nversty of Olahoma School of Computer Scence orman, OK 739 Guangtao

More information

Open Access A Load Balancing Strategy with Bandwidth Constraint in Cloud Computing. Jing Deng 1,*, Ping Guo 2, Qi Li 3, Haizhu Chen 1

Open Access A Load Balancing Strategy with Bandwidth Constraint in Cloud Computing. Jing Deng 1,*, Ping Guo 2, Qi Li 3, Haizhu Chen 1 Send Orders for Reprnts to reprnts@benthamscence.ae The Open Cybernetcs & Systemcs Journal, 2014, 8, 115-121 115 Open Access A Load Balancng Strategy wth Bandwdth Constrant n Cloud Computng Jng Deng 1,*,

More information

Fixed income risk attribution

Fixed income risk attribution 5 Fxed ncome rsk attrbuton Chthra Krshnamurth RskMetrcs Group chthra.krshnamurth@rskmetrcs.com We compare the rsk of the actve portfolo wth that of the benchmark and segment the dfference between the two

More information

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo

More information

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE Yu-L Huang Industral Engneerng Department New Mexco State Unversty Las Cruces, New Mexco 88003, U.S.A. Abstract Patent

More information

Extending Probabilistic Dynamic Epistemic Logic

Extending Probabilistic Dynamic Epistemic Logic Extendng Probablstc Dynamc Epstemc Logc Joshua Sack May 29, 2008 Probablty Space Defnton A probablty space s a tuple (S, A, µ), where 1 S s a set called the sample space. 2 A P(S) s a σ-algebra: a set

More information

An Empirical Study of Search Engine Advertising Effectiveness

An Empirical Study of Search Engine Advertising Effectiveness An Emprcal Study of Search Engne Advertsng Effectveness Sanjog Msra, Smon School of Busness Unversty of Rochester Edeal Pnker, Smon School of Busness Unversty of Rochester Alan Rmm-Kaufman, Rmm-Kaufman

More information

Performance Analysis and Coding Strategy of ECOC SVMs

Performance Analysis and Coding Strategy of ECOC SVMs Internatonal Journal of Grd and Dstrbuted Computng Vol.7, No. (04), pp.67-76 http://dx.do.org/0.457/jgdc.04.7..07 Performance Analyss and Codng Strategy of ECOC SVMs Zhgang Yan, and Yuanxuan Yang, School

More information

Fast Fuzzy Clustering of Web Page Collections

Fast Fuzzy Clustering of Web Page Collections Fast Fuzzy Clusterng of Web Page Collectons Chrstan Borgelt and Andreas Nürnberger Dept. of Knowledge Processng and Language Engneerng Otto-von-Guercke-Unversty of Magdeburg Unverstätsplatz, D-396 Magdeburg,

More information

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6 PAR TESTS If a WEIGHT varable s specfed, t s used to replcate a case as many tmes as ndcated by the weght value rounded to the nearest nteger. If the workspace requrements are exceeded and samplng has

More information

How To Calculate The Accountng Perod Of Nequalty

How To Calculate The Accountng Perod Of Nequalty Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.

More information

Section 5.4 Annuities, Present Value, and Amortization

Section 5.4 Annuities, Present Value, and Amortization Secton 5.4 Annutes, Present Value, and Amortzaton Present Value In Secton 5.2, we saw that the present value of A dollars at nterest rate per perod for n perods s the amount that must be deposted today

More information

Web Spam Detection Using Machine Learning in Specific Domain Features

Web Spam Detection Using Machine Learning in Specific Domain Features Journal of Informaton Assurance and Securty 3 (2008) 220-229 Web Spam Detecton Usng Machne Learnng n Specfc Doman Features Hassan Najadat 1, Ismal Hmed 2 Department of Computer Informaton Systems Faculty

More information

Logical Development Of Vogel s Approximation Method (LD-VAM): An Approach To Find Basic Feasible Solution Of Transportation Problem

Logical Development Of Vogel s Approximation Method (LD-VAM): An Approach To Find Basic Feasible Solution Of Transportation Problem INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME, ISSUE, FEBRUARY ISSN 77-866 Logcal Development Of Vogel s Approxmaton Method (LD- An Approach To Fnd Basc Feasble Soluton Of Transportaton

More information

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation Exhaustve Regresson An Exploraton of Regresson-Based Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The

More information

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems Jont Schedulng of Processng and Shuffle Phases n MapReduce Systems Fangfe Chen, Mural Kodalam, T. V. Lakshman Department of Computer Scence and Engneerng, The Penn State Unversty Bell Laboratores, Alcatel-Lucent

More information

Improved Mining of Software Complexity Data on Evolutionary Filtered Training Sets

Improved Mining of Software Complexity Data on Evolutionary Filtered Training Sets Improved Mnng of Software Complexty Data on Evolutonary Fltered Tranng Sets VILI PODGORELEC Insttute of Informatcs, FERI Unversty of Marbor Smetanova ulca 17, SI-2000 Marbor SLOVENIA vl.podgorelec@un-mb.s

More information

A Secure Password-Authenticated Key Agreement Using Smart Cards

A Secure Password-Authenticated Key Agreement Using Smart Cards A Secure Password-Authentcated Key Agreement Usng Smart Cards Ka Chan 1, Wen-Chung Kuo 2 and Jn-Chou Cheng 3 1 Department of Computer and Informaton Scence, R.O.C. Mltary Academy, Kaohsung 83059, Tawan,

More information

Disagreement-Based Multi-System Tracking

Disagreement-Based Multi-System Tracking Dsagreement-Based Mult-System Trackng Quannan L 1, Xnggang Wang 2, We Wang 3, Yuan Jang 3, Zh-Hua Zhou 3, Zhuowen Tu 1 1 Lab of Neuro Imagng, Unversty of Calforna, Los Angeles 2 Huazhong Unversty of Scence

More information

1 Example 1: Axis-aligned rectangles

1 Example 1: Axis-aligned rectangles COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton

More information

Evaluating credit risk models: A critique and a new proposal

Evaluating credit risk models: A critique and a new proposal Evaluatng credt rsk models: A crtque and a new proposal Hergen Frerchs* Gunter Löffler Unversty of Frankfurt (Man) February 14, 2001 Abstract Evaluatng the qualty of credt portfolo rsk models s an mportant

More information

How To Solve An Onlne Control Polcy On A Vrtualzed Data Center

How To Solve An Onlne Control Polcy On A Vrtualzed Data Center Dynamc Resource Allocaton and Power Management n Vrtualzed Data Centers Rahul Urgaonkar, Ulas C. Kozat, Ken Igarash, Mchael J. Neely urgaonka@usc.edu, {kozat, garash}@docomolabs-usa.com, mjneely@usc.edu

More information

Estimating Total Claim Size in the Auto Insurance Industry: a Comparison between Tweedie and Zero-Adjusted Inverse Gaussian Distribution

Estimating Total Claim Size in the Auto Insurance Industry: a Comparison between Tweedie and Zero-Adjusted Inverse Gaussian Distribution Estmatng otal Clam Sze n the Auto Insurance Industry: a Comparson between weede and Zero-Adjusted Inverse Gaussan Dstrbuton Autora: Adrana Bruscato Bortoluzzo, Italo De Paula Franca, Marco Antono Leonel

More information

1. Measuring association using correlation and regression

1. Measuring association using correlation and regression How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a

More information

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES The goal: to measure (determne) an unknown quantty x (the value of a RV X) Realsaton: n results: y 1, y 2,..., y j,..., y n, (the measured values of Y 1, Y 2,..., Y j,..., Y n ) every result s encumbered

More information

Implementation of Deutsch's Algorithm Using Mathcad

Implementation of Deutsch's Algorithm Using Mathcad Implementaton of Deutsch's Algorthm Usng Mathcad Frank Roux The followng s a Mathcad mplementaton of Davd Deutsch's quantum computer prototype as presented on pages - n "Machnes, Logc and Quantum Physcs"

More information

FORMAL ANALYSIS FOR REAL-TIME SCHEDULING

FORMAL ANALYSIS FOR REAL-TIME SCHEDULING FORMAL ANALYSIS FOR REAL-TIME SCHEDULING Bruno Dutertre and Vctora Stavrdou, SRI Internatonal, Menlo Park, CA Introducton In modern avoncs archtectures, applcaton software ncreasngly reles on servces provded

More information

A Lyapunov Optimization Approach to Repeated Stochastic Games

A Lyapunov Optimization Approach to Repeated Stochastic Games PROC. ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, OCT. 2013 1 A Lyapunov Optmzaton Approach to Repeated Stochastc Games Mchael J. Neely Unversty of Southern Calforna http://www-bcf.usc.edu/

More information

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES In ths chapter, we wll learn how to descrbe the relatonshp between two quanttatve varables. Remember (from Chapter 2) that the terms quanttatve varable

More information

Joe Pimbley, unpublished, 2005. Yield Curve Calculations

Joe Pimbley, unpublished, 2005. Yield Curve Calculations Joe Pmbley, unpublshed, 005. Yeld Curve Calculatons Background: Everythng s dscount factors Yeld curve calculatons nclude valuaton of forward rate agreements (FRAs), swaps, nterest rate optons, and forward

More information

Prediction of Disability Frequencies in Life Insurance

Prediction of Disability Frequencies in Life Insurance Predcton of Dsablty Frequences n Lfe Insurance Bernhard Köng Fran Weber Maro V. Wüthrch October 28, 2011 Abstract For the predcton of dsablty frequences, not only the observed, but also the ncurred but

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

Dynamic Pricing for Smart Grid with Reinforcement Learning

Dynamic Pricing for Smart Grid with Reinforcement Learning Dynamc Prcng for Smart Grd wth Renforcement Learnng Byung-Gook Km, Yu Zhang, Mhaela van der Schaar, and Jang-Won Lee Samsung Electroncs, Suwon, Korea Department of Electrcal Engneerng, UCLA, Los Angeles,

More information

The Application of Fractional Brownian Motion in Option Pricing

The Application of Fractional Brownian Motion in Option Pricing Vol. 0, No. (05), pp. 73-8 http://dx.do.org/0.457/jmue.05.0..6 The Applcaton of Fractonal Brownan Moton n Opton Prcng Qng-xn Zhou School of Basc Scence,arbn Unversty of Commerce,arbn zhouqngxn98@6.com

More information

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy Fnancal Tme Seres Analyss Patrck McSharry patrck@mcsharry.net www.mcsharry.net Trnty Term 2014 Mathematcal Insttute Unversty of Oxford Course outlne 1. Data analyss, probablty, correlatons, vsualsaton

More information

Frequency Selective IQ Phase and IQ Amplitude Imbalance Adjustments for OFDM Direct Conversion Transmitters

Frequency Selective IQ Phase and IQ Amplitude Imbalance Adjustments for OFDM Direct Conversion Transmitters Frequency Selectve IQ Phase and IQ Ampltude Imbalance Adjustments for OFDM Drect Converson ransmtters Edmund Coersmeer, Ernst Zelnsk Noka, Meesmannstrasse 103, 44807 Bochum, Germany edmund.coersmeer@noka.com,

More information