WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce

Size: px
Start display at page:

Download "WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce"

Transcription

1 WSE-ntegrator: An Automatc ntegrator of Web Search nterfaces for E-Commerce Ha He, Wey Meng Dept. of Computer Scence SUNY at Bnghamton Bnghamton, NY Clement Yu Dept. of Computer Scence Unv. of llnos at Chcago Chcago, Zonghuan Wu Center for Adv. Compu. Studes Unv. of ousana at afayette afayette, A zwu@cacs.lousana.edu Abstract More and more databases are becomng Web accessble through form-based search nterfaces, and many of these sources are E-commerce stes. Provdng a unfed access to multple E- commerce search engnes sellng smlar products s of great mportance n allowng users to search and compare products from multple stes wth ease. One key task for provdng such a capablty s to ntegrate the Web nterfaces of these E- commerce search engnes so that user queres can be submtted aganst the ntegrated nterface. Currently, ntegratng such search nterfaces s carred out ether manually or sem-automatcally, whch s neffcent and dffcult to mantan. n ths paper, we present WSE-ntegrator - a tool that performs automatc ntegraton of Web nterfaces of Search Engnes. WSE-ntegrator employs sophstcated technques to dentfy matchng attrbutes from dfferent search nterfaces for ntegraton. t also resolves doman dfferences of matchng attrbutes. Our expermental results based on 20 and 50 nterfaces n two dfferent domans ndcate that WSE- ntegrator can acheve hgh attrbute matchng accuracy and can produce hgh-qualty ntegrated search nterfaces wthout human nteractons. 1. ntroducton More and more databases are becomng Web accessble through form-based search nterfaces. Among these web Permsson to copy wthout fee all or part of ths materal s granted provded that the copes are not made or dstrbuted for drect commercal advantage, the VDB copyrght notce and the ttle of the publcaton and ts date appear, and notce s gven that copyng s by permsson of the Very arge Data Base Endowment. To copy otherwse, or to republsh, requres a fee and/or specal permsson from the Endowment Proceedngs of the 29 th VDB Conference, Berln, Germany, 2003 sources, E-commerce search engnes (ESEs) account for a large proporton. t s of great mportance to provde a unfed access to multple ESEs sellng smlar products because ths would allow users to search and compare products from multple stes wth ease. n ths paper, we call a system that supports unfed access to multple ESEs as an E-commerce metasearch engne (EMSE for short). Currently, there are a number of EMSEs on the nternet, such as and However, ther technques are not publcly avalable. To the best of our knowledge, most exstng EMSEs are bult manually or sem-automatcally. Furthermore, as ESEs operate autonomously, changes/upgrades to them may affect the operaton of the EMSE. As a result, mantanng the operaton of an EMSE s a costly long-term effort. Our E-Metabase proect ams to automate the process of buldng large-scale EMSEs so as to sgnfcantly reduce the cost of buldng and mantanng EMSEs. Ths proect conssts of a number of components. Frst, a specal crawler s used to crawl the Web and dentfy ESEs from the fetched Web pages. Second, the found ESEs are clustered nto dfferent groups such that ESEs n the same group sell the same type of products (.e., n the same doman). Thrd, the nterfaces of the ESEs n the same group are ntegrated nto a unfed nterface that becomes the nterface of the EMSE for ths group. Fourth, a global query submtted to the EMSE s mapped to queres for the underlyng ESEs. Ffth, a component that s responsble for connectng to each ESE s bult so that a query can be passed to and results can be returned back from each ESE. Sxth, nformaton of every product returned by each ESE needs to be correctly extracted from the returned result pages by an nformaton extracton program. Fnally, the extracted results from dfferent ESEs need to be fltered accordng to the global query and then combned nto a sngle lst for presentaton to the user based on some desred features, say prce. WSE- ntegrator s desgned to automate the nterface ntegraton step. n ths paper, we present our technques used to buld WSE-ntegrator. WSE-ntegrator s appled to each group of ESEs to produce an ntegrated

2 nterface for ths group of ESEs. Wthout loss of generalty, n ths paper, we assume that all ESEs under consderaton are n the same product doman. (Our technques for clusterng ESEs can be found n [PMH03]) Ths paper has the followng contrbutons. Frst, we provde a comprehensve soluton to the nterface ntegraton problem. nterface ntegraton ncludes schema ntegraton, attrbute value mergng, format ntegraton and layout generaton of global attrbutes n the global (ntegrated) nterface. n contrast, related exstng works deal wth only schema ntegraton (see Secton 2). Second, we propose an automated soluton for nterface ntegraton usng only general (.e., doman-ndependent) knowledge. Most exstng works employ manual or semautomatc technques. One of the key ssues n nterface ntegraton s to dentfy matchng attrbutes from dfferent nterfaces and we propose a clusterng and weght-based two-step method to tackle ths problem. Furthermore, ths method also solves a rarely addressed ssue,.e., fndng approprate names for attrbutes n the global nterface automatcally. Our expermental results based on 20 and 50 nterfaces n two dfferent domans ndcate that WSE-ntegrator can acheve hgh attrbute matchng accuracy and can produce hgh-qualty ntegrated search nterfaces wthout human nteractons. The rest of ths paper s organzed as follows. Secton 2 brefly revews prevous research works related to our work. Secton 3 dscusses nterface representaton used by WSE-ntegrator. n Secton 4, we present our method for matchng attrbutes. n Secton 5, we dscuss mergng attrbute domans. n Secton 6, we dscuss global nterface constructon. Expermental results are reported n Secton 7. Secton 8 concludes the paper. 2. Related work A thorough survey of approaches for automatc schema matchng can be found n [RB01]. [GKD97] predefnes the mappng rules for each attrbute and assembles these rules nto a knowledge base for nterpretaton when a query s handled. [RO96] uses a world vew to represent all sources but t does not dscuss how to construct the world vew automatcally. [DEW96] predefnes each doman descrpton that ncludes nformaton about product attrbutes, and then uses some heurstcs and mappng functons for the felds of each search nterface but t does not provde much detal about user nterface. [BBB01, BCV01] use Descrpton ogcs, Common Thesaurus and clusterng technques for semantc schema ntegraton. WordNet [WDNT] s also used to dentfy semantc relatonshps between schema terms. Ths s a sem-automatc approach as the ntegraton process stll nvolves human nteracton. Furthermore, the approach used for matchng attrbutes s manly based on name affnty and structure affnty, and only a few metadata (such as key, foregn key) of schemas are used. [C00] uses neural network technques and focuses on utlzng both schema level and data contents level metadata to automatcally dentfy matchng attrbutes. Our approach has adopted some deas from [C00] but there are sgnfcant dfferences (see Secton 4.3 for more comparson wth [C00]). [MBR01] nvestgates algorthms for generc schema matchng. t combnes a number of past technques, such as lngustc-based matchng and some metadata of schemas. t proposes structure-matchng algorthms for herarchy schemas (tree structures) n whch a structural smlarty s computed between each par of schema elements. However, how a global schema s obtaned s not dscussed. [DDH01] uses and extends machne-learnng technques to semautomatcally fnd mappngs between source schemas and the medated schema. Ths approach needs human users to manually construct the semantc mappngs between a small set of data sources (tranng) and the medated schema. [HC03] uses a statstcal approach for schema ntegraton of query nterfaces of the deep web. t argues that as the Web sources prolferate the aggregate schema vocabulary of sources n the same doman tends to stablze at a relatvely small sze, and that underlyng these sources, there exsts a unfed hdden schema model. Then t uses statstcal probablty and goes through three steps (hypothess modelng, generaton and selecton) to obtan the hdden schema model. t uses only attrbute names for statstcs, and t does not apply other schema nformaton such as doman type, default value and attrbute values whch, we fnd, based on our experments, to be very effectve n nterface ntegraton. t s not clear how semantc relatonshps between names (such as synonymy and hypernymy) are obtaned n ths work. n addton, t dscusses only schema ntegraton, but not attrbute mergng and global nterface generaton. [MGR02] uses the dea of P packet floodng to flood the smlarty of elements. t converts each schema nto a drected labeled graph. On the bass of the graph model, a part of the smlarty of two elements propagates to ther respectve neghbors. The smlarty floodng algorthm termnates after a fx pont s reached and some flters are used to get a subset of the result mappng. No lngustc name matchng s done beyond utlzng a smple strng matcher to compare common prefxes and suffxes of lterals. Ths approach s not sutable for search nterfaces because name matchng plays an mportant role n the ntegraton of search nterfaces. [DR02] dscusses combnng dfferent matchng algorthms n a flexble way and supports dfferent ways to combne match results. n [DR02], schemas are represented as rooted drected acyclc graphs. t mantans a matcher lbrary for smple matchers such as approxmate strng matcher, synonym matcher, data type matcher and hybrd matchers (e.g. name matcher and structural matchers). t uses data type but not other schema and doman nformaton to help fnd matches. The man dfference between our work and exstng works s that we am to perform comprehensve nterface

3 ntegraton automatcally whle others perform only schema ntegraton, employng mostly manual or semautomatc technques. There are bascally no publshed work, to the best of our knowledge, on automatc attrbute value mergng, format ntegraton and layout generaton. Furthermore, compared wth other approaches for Web sources ntegraton, we utlze a rcher set of schema and doman nformaton to fnd matchng attrbutes and we utlze the nformaton dfferently (see Secton 4). 3. nterface representaton A search nterface for E-commerce s usually presented through an HTM form n a Web page [HT4]. t may contan elements such as text box, rado button, check box and selecton lst, and each element usually has a label (descrptve text) assocated wth t. Users fll out the form and then submt the flled form as a query through the browser to the remote server. The server then returns to users the results that satsfy the query condtons. n general, much useful nformaton s embedded n the HTM source fle of each local nterface and such nformaton needs to be extracted for nterface ntegraton. n ths paper, we do not dscuss how to extract the needed nformaton (some related work can be found n [RGM01, DEW96], and our work on ths wll be reported n another paper). nstead, we focus on what nformaton should be used to represent each search nterface for the purpose of nterface ntegraton. Each local ESE nterface can be conceptually vewed as a partal export relatonal schema of the underlyng product database. n our nterface representaton, each label s consdered as the name of an attrbute of the underlyng products. Each attrbute has one or more assocated elements. Each element has a format whch s the nput format of the element. There are generally four types of formats: text box, rado button, check box and selecton lst. Each element also has a doman that defnes the set of values that can be used to nstantate the element when formng a query. Text box allows users to nput whatever value they want and thus the correspondng doman can be consdered to be nfnte. A selecton lst provdes a fnte number of pre-determned values for users to select whle a check box and rado button have one assocated value. These three formats thus have a fnte doman. Often multple check boxes or multple rado buttons are used together to accomplsh the same functon as a selecton lst. n addton, each element or a group of elements may have ts or ther default value, whch s used to help formng a query when a user does not make a dfferent selecton. For each attrbute, there s a type for ts values. Sx value types are consdered and they are date, tme, currency, number, char and d. The d type ndcates that the attrbute s used for dentfcaton purpose (e.g., product number, order number). The type nformaton can be obtaned through analyzng attrbute name (contanng date, tme, prce etc.) and the pattern or format of attrbute values (that are vewable on the nterface). For example, $300 for currency and 3:00PM for tme. When the value type s dffcult to determne, a default value type,.e., char, s used. Whenever possble, the scale/unt of the attrbute values s also extracted. For example, all values wth US$ are consdered to have the same unt but US$ and CAN$ have dfferent unts even though they are both of currency value type. Fnally, each attrbute has ts layout poston n the nterface. The poston value s determned by the layout order of attrbutes n an nterface. More mportant attrbutes are usually arranged ahead of less mportant ones. n addton to the label of an attrbute, each element of the attrbute may have ts own label. For example, n Fgure 1, attrbute publcaton year has two text box elements wth ther own labels after and before, respectvely. Such label helps defne the semantc meanng of the element. Fgure 1: Examples of element relatonshp type When an attrbute has multple elements, these elements are related n some way. We dentfy the followng four relatonshp types among related elements based on our observatons. Range type: t refers to the stuaton where two or more elements are used to specfy the range semantcs for an attrbute. For example, n Fgure 1, the prce range has two related elements ndcatng the mnmum and the maxmum values allowed. Part type: t refers to the part-of relatonshp. For example, n Fgure 1, author has two elements frst name and last name and each of them s part of author. Range type s a specal case of part type. Group type: Multple check boxes/rado buttons are sometmes used together to form a sngle semantc concept (attrbute). n ths case, the labels assocated wth the check boxes/rado buttons are values of the attrbute. n Fgure 1, attrbute Platform has a group of check boxes. Constrant type: An element can be used as a constrant for another element. For example, for a text nput box, a check box may be used to specfy whether or not the nput s case senstve. n ths case, the check box s meanngless wthout beng related to the text nput box.

4 To summarze, n our approach, each attrbute A s represented as A = (N, P, DT, DF, VT, SU, ES, R), where N s the name (label) of A, P s the layout poston of A, DT s the doman type of A, DF s the default value of A (possbly null and there s at most one default value for each group of check boxes and rado buttons),vt s the value type of A, SU s the scale/unt of A, ES s the set of elements assocated wth A and R s the relatonshp type between the elements n ES. For example, for attrbute Prce Range n Fgure 1, ts ES contans two text box elements labeled between US$ and and US$, and ther relatonshp type s range type. f ES contans only one element, then R s null. Each element E n ES s tself represented as a quadruplet E(, F, V, DV), where s ts label (possbly empty), F s the format, V s the set of values (for fnte doman type of elements only), and DV s the default value of the element (possbly null ). 4. Matchng attrbutes n ths secton, we present our method for matchng attrbutes from multple local nterfaces. 4.1 Semantc relatonshps Semantc relatonshps between concepts or obects are very mportant n the database schema ntegraton and Web source ntegraton. n our approach, we dentfy the followng three semantc relatonshps between terms (attrbute names or element s values): Synonymy, Hypernymy/Hyponymy and Meronymy [M95, WDNT, BCV01, BBB01]. Gven a term, we use WordNet [M95, WDNT] to get ts synonyms, hypernyms and meronyms, f applcable. Synonymy. Term T 1 s a synonym of term T 2, denoted by S(T 1,T 2 ), f T 1 s n the synonym-set of T 2. Hypernymy/Hyponymy. Term T 1 s a hypernymy of term T 2, denoted by H(T 1,T 2 ), f T 1 s more generc than T 2. For example, H(tree, maple) and H(format, hardcover). Meronymy. Term T 1 s a meronym of term T 2, denoted by M(T 1,T 2 ), f T 1 s a part of T 2. For example, M(frst name, name) and M(last name, name). However, hypernymy and meronymy terms that can be found from WordNet are very lmted. n WSE- ntegrator, we also dentfy hypernymy and meronymy relatonshps of two terms usng the nformaton n the nterface representatons. For example, suppose we have two nterfaces, one has a hardcover attrbute and the other has a format attrbute that contans a value hardcover. From ths, we can dentfy the hypernymy between the two attrbutes: H(format, hardcover). For meronymy, we use the part relatonshp of elements. For example, f a search nterface contans an author attrbute that has two parts: frst name and last name, we can say M(frst name, author) and M(last name, author). Other nterfaces that contan frst name or last name wthout author can use the relatonshp to match. 4.2 Normalzaton Before ntegraton, attrbute names and element values are normalzed as follows to reduce msmatches. Convert each name or value strng to lower case equvalents. Remove all content n parentheses, ncludng parentheses. Replace all characters that are not alphanumerc wth a space character. Tokenze each strng usng space, replace abbrevaton and acronym (f any) [MBR01] and use WordNet to get the base form of each token. Remove stop words when a name or a value conssts of multple words. 4.3 Mergng attrbutes Mergng attrbutes has two tasks: one s to fnd the matchng attrbutes from search nterfaces to be ntegrated, and the other s to determne what global attrbute name should be used for each group of matchng attrbutes. To the best of our knowledge, no n-depth dscusson of the second task has been reported n the lterature. The SEMNT approach n [C00] utlzes and extends the metadata characterstcs n [NE89] to determne matchng attrbutes. SEMNT ntroduces three levels of metadata that can be used: attrbute names (the dctonary level), feld specfcaton (the schema level, e.g., data type and prmary key) and attrbute values and patterns (data content level). SEMNT ust focuses on usng the metadata at the schema level and data content level to determne attrbute correspondences. t descrbes 20 characterstcs at the two levels, such as data length, data type, nullable, prmary key, default scale, mnmum, maxmum, average and so on. We adopt the basc dea of the SEMNT approach for the attrbute-matchng task n the sense we also use metadata characterstcs n multple levels. Our approach dffers from the SEMNT approach n four aspects. Frst, the set of characterstcs used s dfferent. For example, prmary key nformaton and maxmum value are readly avalable n a database context but they are not avalable for nterface ntegraton. On the other hand, nformaton such as element format apples to only nterface ntegraton. Second, we utlze all three levels of metadata nstead of ust two. Thrd, we classfy matches based on dfferent metadata nto postve matches and predctve matches (see below). Fourth, SEMNT uses neural network technques but we don t. Furthermore, the SEMNT approach does not address the second task of mergng attrbutes. As mentoned above, n our approach, we use the three levels of metadata to determne matchng attrbutes. At the dctonary level, we explore sx possble matches on attrbute names: exact match, approxmate strng match [WM92], vector space smlarty match [FB92] (see secton 4.3.2), synonymy match, hypernymy match and meronymy match. At the schema level, scale, value type,

5 doman type, default value and Boolean property are used. At the data content level, we focus on comparng values n the elements. n our approach, we classfy the dfferent matches nto two types: postve matches and predctve matches. postve matches nclude exact name match, semantc (synonymy, hypernymy and meronymy) matches and value-based match. For value-based match, we employ exact match, approxmate strng match, synonymy match and hypernymy match to compare values. When enough values from the two attrbutes are matched (a threshold s used), value-based match s recognzed as succeeded. When one of the postve matches occurs durng our ntegraton process, the correspondng attrbutes are recognzed as matched. Predctve matches consst of approxmate name match, vector space smlarty match of names, and matches based on scale, doman type, value type, default value, Boolean property and value pattern. Predctve matches must be suffcently strong (based on a weght threshold) for two attrbutes to be recognzed as matched. Our approach for accomplshng the two tasks of mergng attrbutes s descrbed n the next two subsectons Clusterng (postve match) Ths s to group attrbutes nto clusters based on the postve matches between attrbutes. Ths step consders all nterfaces. There are three steps for the clusterng: Group attrbutes nto clusters based on the exact match of attrbute names n all nterfaces of the same doman. Thus, after ths step, all attrbutes n the same cluster have the same attrbute name. For each dstnct attrbute name, the number of nterfaces havng the attrbute s counted. Values of all attrbutes n each cluster, f any, are unoned. Merge the clusters produced n the frst step based on the matchng of values n each cluster and the semantc (synonymy, hypernymy and meronymy) matches of attrbute names. New clusters are generated n ths step. Determne the representatve attrbute name of each cluster produced n the second step. Ths attrbute name s a canddate to be the global attrbute name to whch other attrbutes n the cluster are mapped. To determne the representatve attrbute name of each cluster, generally we employ the maorty rule. n other words, the attrbute name that appears n most nterfaces n a cluster would be chosen as the representatve attrbute name of the cluster. However, we also consder the semantc relatonshps among attrbute names n the cluster. For example, f a cluster contans four dfferent attrbute names: format, bndng type hardcover and paperback, we do not choose hardcover or paperback as the representatve name of the cluster even f they appear n more nterfaces. The reason s that hardcover or paperback s a knd of format or bndng type. Therefore, durng the clusterng we buld hypernymy herarchy trees for attrbute names n the cluster. We then choose the representatve attrbute name among the roots usng the maorty rule. For the prevous example, we ust need to compare the number of occurrences between format and bndng type. n our approach, the clusterng step does prelmnary attrbute matchng and representatve attrbute dentfcaton. The step ust collects the knowledge about what attrbutes should be matched based on the postve nformaton. No ntermedate ntegrated nterface s yet constructed after ths step. There are several reasons to perform clusterng. Frst, count the number of nterfaces an attrbute name appears n; ths nformaton s mportant for determnng the global attrbute names. Second, determne the representatve attrbute name of one cluster n advance. Thrd, make sure that attrbutes that should be matched (based on postve matches) are matched together. Ths can smplfy the comparsons n the weghtbased match step and avod msmatches. Our experments ndcate that ths two-step approach s effectve Weght-based match (predctve match) Ths step s to utlze the knowledge obtaned n the frst step and the predctve matches to construct the ntegrated nterface and fnalze the global attrbute names. ntally, there s no ntermedate ntegrated nterface. n ths case, gven a local nterface, our approach takes an attrbute n t and looks up the representatve attrbute name of the cluster n whch the attrbute appears. Then the representatve attrbute name s added to the ntermedate nterface (t s empty ntally) as the global attrbute name. These two operatons are repeated untl all attrbutes n the local nterface are handled. The global attrbute names may be adusted later. Once the frst ntermedate nterface s generated, weght-based match begns to work. Frst, we present the defnton of weght-based match as follows: Defnton: Gven an ntegrated ntermedate nterface ={A 1, A 2, A 3,, A n } and a local nterface ={ A 1, A 2, A 3,, A m }, where A s an attrbute of, A s an attrbute of, the mapped attrbute n for an attrbute A s the one wth the hghest weght: W( A, A ) > W( A, A ) > w, = 1,..., n, k= 1,..., n, k, where W( A, k A ) s the weght of attrbutes A and A, w s the weght threshold. n our approach, the weght-based match computes the matchng weght between two attrbutes and then predcts whether the two attrbutes are matched based on the weght. The weght between attrbutes A and A can be computed based on the followng metrcs (predctve matches):

6 1. Approxmate strng match Compare the two attrbute names to fnd out f the edt-dstance between the two strngs s wthn the allowed threshold. We use an approxmate strngmatch algorthm [WM92] to fnd the match. f the edt-dstance s wthn the allowed threshold T, assgn a postve weght Wam; otherwse Wam s Vector space smlarty The vector space smlarty s the smlarty between two text strngs based on the Vector Space Model [FB92]. The approach s also used n [Coh98]. We tokenze each strng and get the term frequency of each term n each strng. The weght of ths metrc s the Cosne smlarty of two strngs. Wvss ( v, w ) = = 1 m v = 1 m ( v ) 2 m ( w where m s the number of unque terms n the two strngs, w s the term frequency of the th term n attrbute strng w and v s the term frequency of the th term n attrbute strng v. 3. Compatble doman We consder four doman types: fnte, nfnte, hybrd and range. The doman type of an attrbute s derved from ts assocated element(s). f the element(s) of the attrbute has the range semantcs, the doman type of the attrbute s range. Hybrd s the combnaton of fnte and nfnte. f an attrbute doman s hybrd, users can ether select from a lst of pre-compled values or fll n a new value. n our approach, the hybrd type s only lmted to the ntermedate nterface and the global nterface. Hybrd s compatble wth fnte and nfnte; the same types are compatble. f two attrbutes have compatble doman types, assgn a weght Wcd; otherwse Wcd s 0. n addton, we observed that range type s used much less often than fnte and nfnte types. Thus, f two attrbutes have range doman type, we double Wcd. 4. Value type match As mentoned n Secton 3, we consder sx value types: date, tme, currency, number, char and d. f two attrbutes have the same value type, assgn a weght Wvtm; otherwse Wvtm s 0 5. Scale/unt match Consder two attrbutes that have the same value type. f they also have the same scale or unt, assgn a weght Wcs; otherwse (.e., f they have dfferent value types or dfferent scales/unts), Wcs s 0. For example, f two attrbutes are both of currency type and ther values are n US$, then Wcs s assgned to the overall match of the two attrbutes. 6. Default value n a search nterface, some elements may have ther default values. n some cases, an element may have w = 1 ) 2 no assocated label, but t has a default value whch s mportant for the element to fnd a matchng attrbute. n addton, f an attrbute s n a cluster, then ts default value s consdered as one of the default values of the cluster. So when we check default values of two attrbutes we check default values of the two attrbutes themselves as well as ther clusters. f two attrbutes have the same default value, assgn a weght Wdv; otherwse Wdv s 0; 7. Boolean property f an attrbute has ust a sngle check box, ths check box s usually used to mark a yes-or-no selecton. Such an attrbute s consdered to have a Boolean property. f both attrbutes have the Boolean property, assgn a weght Wbp; otherwse Wbp s Value pattern We apply value pattern only to the numerc attrbutes. We compute the average of all numerc values n each attrbute. f the two averages are close, assgn a weght Wvp; otherwse Wvp s 0. The weght between attrbutes A and A s the sum of the above eght metrc weghts (the values of these weghts are determned expermentally, see Secton 7.2.2): W( A, A ) = Wam + Wvss + Wcd + Wvtm + Wcs + Wdv +Wbp+ Wvp Gven the ntermedate nterface and an attrbute A n a local nterface, the approach frst looks up the attrbute thesaurus to see f the attrbute s already mapped to a global attrbute n the ntermedate nterface. The attrbute thesaurus s establshed ncrementally durng the weght-based matchng process. f t has been mapped, the attrbute A would drectly be mapped to the global attrbute name. f t has not, the representatve attrbute name would be found usng the name of attrbute A. Then recheck the attrbute thesaurus usng the representatve attrbute name to see f the representatve attrbute name s mapped to a global attrbute. f the mappng s found, A s mapped to the global attrbute; otherwse compute the weghts between A and all attrbutes n the ntermedate nterface. After these weghts are computed, the attrbute wth the hghest weght s selected. f ths weght s greater than the threshold w, the selected attrbute s consdered as the matchng attrbute of the attrbute A ; otherwse, we assume that no matchng attrbute s found. n the former case, we have to determne the global attrbute name between the two attrbutes. n our approach, the attrbute name that appears n more local nterfaces would be selected. f applcable, the correspondng entry n the attrbute mappng table (whch keeps mappngs between

7 each global attrbute and ts correspondng local attrbutes) s changed and so s the thesaurus. n the latter case, the attrbute A s added as a new attrbute to the ntermedate nterface and a new entry for the attrbute s added n the attrbute mappng table Mantenance of ntegrated nterface After a global nterface s generated, t s lkely that new local nterfaces need to be added to or some exstng local nterfaces need to be removed from the global nterface from tme to tme. Ths requres mantanng the global nterface. For addng new local nterfaces, the frst step of clusterng needs to be performed on the new local nterfaces, followed by the second step of clusterng to cluster the output of the frst step to the exstng clusters. The representatve attrbute name may need to be updated based on the current and prevous statstcal and semantc knowledge. Then, the weght-based match s performed. For removng some local nterfaces from the global nterface, we remove the attrbute names and ther correspondng values from the clusters and the related mappng nformaton from the attrbute mappng table. n both cases, the count ndcatng the number of nterfaces contanng an attrbute needs to be updated accordngly, and f applcable, the global attrbute name of the global nterface may also need to be changed. 5. Mergng attrbute domans When a local attrbute s mapped to a global attrbute n the ntermedate nterface, we must determne the global attrbute doman after the mappng. Ths ncludes the followng two aspects: 1) The global doman type. As mentoned prevously, four doman types are supported n our approach and they are fnte, nfnte, hybrd and range. A compatble doman type between the two attrbutes should be used. 2) The attrbute values. Need to merge the values that represent the same concept and provde a set of values for the global attrbute. To deal wth these two ssues, dfferences between the two domans should be dentfed and resolved, ncludng format dfference, semantc conflct, scale dfference, range dfference and constrant dfference. Here we need to take a closer look at the range dfference. n Fgure 3, we can see that there are varous range formats. Two aspects need to be consdered n resolvng range conflct, one s about range modfers such as from, to, less than, under and so on, and the other s about range wdth. Fgure 3 shows that dfferent range domans may have dfferent range modfers and dfferent range wdth. The resoluton of range conflcts s to generate a global range doman that s compatble wth the range domans of the matchng attrbutes (see detals n secton 5.3). 5.1 Determne global doman type For a gven local attrbute A and a matchng global attrbute A, we use the followng rules to determne the new doman type for A : 1) fnte + fnte fnte 2) nfnte + nfnte nfnte 3) range + any type range 4) (fnte + nfnte) or (hybrd + fnte) or (hybrd + nfnte) hybrd The frst rule can be explaned as follows: f the local attrbute A s fnte and the global attrbute fnte, then the new global doman type of Other rules can be explaned smlarly. 5.2 Mergng alphabetc domans A s also A s fnte. f a local attrbute and ts matchng global attrbute are fnte or hybrd and have alphabetc values, we should consder how to merge ther values and form a new value set for the global attrbute. n WSE-ntegrator, ths s carred out n two phases. The frst phase s n the clusterng step dscussed n Secton n ths phase, attrbutes that have some values n common are grouped nto the same cluster. Furthermore, due to the matchng technques employed (exact match, approxmate strng match, synonymy match and hypernymy match), semantc relatonshps between values are dentfed. n the second phase, we use the knowledge of the relatonshps between values to merge values and generate a global value set. Phase 2 conssts of the followng steps. Frst, we cluster all values nto categores based on approxmate strng match, vector space smlarty match, synonymy match and hypernymy match. Thus, all values that are smlar, synonymy or hypernymy are clustered nto the same category. Next, we solve the followng two problems: (1) Whch value should be chosen as the global value f multple smlar and synonym values are n the same category? (2) How to provde values to users f the values n the same category have hypernymy relatonshps? For the frst problem, we can keep a counter for each value and use the maorty rule to choose the most popular value. As to the second problem, we need to make a tradeoff between choosng generc concepts and choosng specfc concepts as the choce would have dfferent effects on query cost and nterface frendlness. The cost of evaluatng a global query ncludes the cost of nvokng local ESEs to submt sub-queres, the cost of processng sub-queres at local ESEs, result transmsson cost and post-processng cost (e.g., result extracton and mergng). f we choose only generc concepts as global values and do not use specfc concepts, a query aganst the global nterface may need to be mapped to multple values (correspondng to specfc concepts) n some local

8 nterfaces, leadng to multple nvocatons to the local search engnes. On the other hand, f we keep only specfc concepts and gnore generc concepts, users who want to query generc concepts (.e., have broader coverage) may have to submt multple queres usng specfc concepts, resultng n less user-frendly nterface. Our approach s to provde a concept herarchy of values to users, ncludng generc and specfc concepts. Ths remedes the problems of the prevous two optons and gves the users more flexblty to form ther queres. Value clusterng may produce multple categores and a value herarchy s created for each category. Each herarchy s lmted to at most three levels to make t easer to use. After these two phases, the mappngs between global values and local values are establshed. Subects Network Databases Programmng - anguages Subect TCP/P Wrelessnetwork Oracle Sybase Sql server C C++ Java Pascal Network --TCP/P --Wreless network Databases --Oracle --Sybase --Sql server Programmng- anguages --C --C++ --Java --Pascal Fgure 2: Example of mergng doman values Example 1: Consder two Web bookstore nterfaces, one has an attrbute subects wth values Network, Databases, Programmng anguages and so on and the other has a correspondng attrbute subect wth values TCP/P, Wreless network, Oracle, Sybase, Sql server, C, C++, Java, Pascal and so on. After clusterng the values, some semantc herarches between the values from the two nterfaces can be dentfed. There are three possble ways to generate the global doman values for subect. One s to use only generc concept values,.e., values from the frst nterface, namely Network, Databases, Programmng anguages etc. n ths case, suppose a user wants to fnd nformaton about Oracle. Snce Oracle s not avalable, the user has to select Databases on the global nterface and submt the query. Ths global query would have to be mapped to three sub-queres for the second nterface, namely Oracle, Sybase, and Sql server. Obvously, searchng based on Sybase and Sql server wastes the resources at the second ste and returns more useless results to the user. The second opton s to use only the values wth more specfc concepts,.e., the values from the second nterface. n ths case, a user who wants to fnd nformaton about database (not of any specfc type) needs to submt three queres respectvely usng Oracle, Sybase and Sql server. Ths s nconvenent to the user. Our approach wll organze related values nto a herarchy (see the box on the rght sde n Fgure 2). n ths case, f the user selects Databases, then the metasearch engne wll generate three sub-queres for the second ste on behalf of the user. On the other hand, f any of the three sub-concepts of Databases s selected, only that concept wll be used for the second ste but Databases wll be used for the frst ste. Ths soluton remedes the problems of the frst two solutons. We should pont out that not every category of values can form a herarchy. n that case, we ust provde a lst of values. 5.3 Mergng numerc domans To merge numerc domans, we need to perform the followng tasks: 1) Resolve scale dfference. We assume the dentfcaton of the scale/unt of a numerc attrbute has already been done by the nterface extractor. n our approach, we buld a scale relatonshp dctonary n advance for some popular scales. The system can look up the dctonary to fnd out how to map one scale to another scale. The numerc values n those attrbutes are transformed to the same global scale durng value mergng. 2) Understand the semantc dfferences nvolved. 3) Generate a global doman wth query cost taken nto consderaton. We dentfy two types of numerc domans: range numerc doman and non-range numerc doman. Nonrange numerc doman attrbutes may come from the numerc attrbutes that are ether fnte or nfnte. f the domans of the matchng local attrbutes are non-range numerc, we ust unon all values of these attrbutes for the global attrbute. For the rest of ths subsecton, we focus on range numerc doman. For the range numerc doman, three types of formats can be dentfed as shown n Fgure 3. 1) One selecton lst. The range type conssts of only one selecton lst, for example, the frst four selecton lsts n Fgure 3. 2) One selecton lst and one text box. The range doman s lke the publcaton date n Fgure 3, whch has two elements, one s a selecton lst for range modfer and the other s a text box for numerc value. 3) Two textboxes or two selecton lsts. The type conssts of two elements and each of them may be a textbox or a selecton lst. The examples are prce range, publcaton year n Fgure 3. From Fgure 3, we can see that numerc values are mostly combned wth other semantc words. To help the system understand such formats, we need to let the system know the meanng of the range modfers such as less

9 than, from, to and over. For ths purpose, we buld a semantc dctonary that keeps all possble range modfers for numerc domans. n addton to these range modfers, we also save the meanng of other terms related to numerc values. For example, n Fgure 3, we can see that baby and teen are n reader age. We have to specfy the real meanngs of these words to help the system know what they are. We can say that baby represents under 3 years, teen s years and adult s over 18 years. Then, we desgn a specal extractor that can extract the range modfers and the numerc values, and use the semantc meanngs n that dctonary to buld a semantc range table that can be understood by the system. The semantc range table keeps multple ranges correspondng to the orgnal ranges n the element(s). Ths table can be used n query mappng and submsson. ths end, we keep a lst for the matchng numerc attrbutes. Every tme when a numerc attrbute s mapped or added to the ntermedate nterface, the numerc values that are prevously extracted from the numerc attrbute are added to the lst (scale conflcts are resolved before ths step). When all attrbutes are matched, the lst s sorted n ascendng order of the values. The ranges are generated usng every two consecutve numerc values n the lst. For the mnmum and the maxmum values, under and over range modfers are used, respectvely. Range modfers Meanng ess than < Over > Under < Greater than > From* >= To* <= Between* >= And* <= After > Before < All All range Any All range o H nternal value 0 5 lessthan lessthan lessthan lessthan lessthan lessthan50 0 allrange Table 1: Range modfers dctonary * modfers to be used n pars Table 2: A range element semantc table Fgure 3: Examples of dfferent range formats Suppose we handle the element that has less than range modfers n Fgure 3. From ths element, we can obtan numerc values: 5, 10, 15, 20, 25 and 50 by extracton. We can also get the semantc words: all prce ranges and less than. Wth the nformaton, we can buld a range semantc table as shown n Table 2. The nternal values are the values n the HTM text that correspond to the values of the element. So far we have solved the frst two problems of mergng range numerc domans. The last thng we need to do s to generate a global range format that s compatble wth the local doman formats of the matchng attrbutes. And the global range format should consder query effcency as much as possble. ntutvely, a larger range condton n the global nterface would lead to more nvocatons to some local stes, causng more local server processng effort, more data transmsson and more postprocessng effort. Therefore, we am to reduce the range wdth of each range condton n the global nterface. To ocal range Under $10 From $10 to $20 From $20 to $30 From $30 to $40 From $40 to $50 Over $50 ocal range ess than $5 ess than $10 ess than $15 ess than $20 ess than $25 ess than $50 Global range Under $5 From $5 to $10 From $10 to $15 From $15 to $20 Form $20 to $25 From $25 to $30 From $30 to $40 From $40 to $50 Over $50 Fgure 4: Example of a global range doman Example 2: Suppose n Fgure 3 two attrbutes wth from and less than range modfers are matched, then the lst of values kept for the two matchng attrbutes s: 5, 10, 15, 20, 25, 30, 40 and 50. The global range format s shown n Fgure 4. From Fgure 4, we see that one global range condton s translated to only one approprate local range condton. For example, from $10 to $15 n the global range format s respectvely mapped to from $10 to $20 and less than $15 n the local range formats. Thus multple query nvocatons to local nterfaces are avoded and other costs ncludng post-processng tme are also reduced.

10 6. Generatng global nterface WSE-ntegrator uses the results of both the attrbute matchng and the attrbute doman mergng to generate the global nterface and show the nterface n HTM format. t also has to decde whch attrbute should appear n the global nterface and the layout of all the attrbutes. 6.1 Attrbute poston Each attrbute has ts layout poston n a gven local nterface. These layout postons reflect the mportance of the attrbutes as perceved by local nterface desgners and ther users, and they may nfluence users behavors of selectng attrbutes to use. To be user-frendly, we aggregate local mportance of each attrbute and arrange more mportant attrbutes ahead of less mportant ones. n WSE-ntegrator, the global layout poston of a global attrbute s computed as follows. P ( A ) = m = 1 P ( A where P (A) denotes the poston value of the -th global attrbute A, m s the number of local nterfaces to be ntegrated, P (A ) s the layout poston of the local attrbute n the -th local nterface that s mapped to A ; P(A ) s assgned the total number of global attrbutes when no matchng local attrbute exsts n the -th local nterface. All global attrbutes are lad out n ncreasng order of ther poston values. Clearly, usng ths method, attrbutes that appear n hgh postons (the frst poston s the hghest) n many local nterfaces are lkely to appear n hgh postons n the global nterface. 6.2 Attrbute selecton When a large number of local nterfaces are ntegrated, the global nterface may have too many attrbutes to be user-frendly. Whle some key attrbutes about the underlyng products appear n most or all local nterfaces, some less mportant attrbutes appear n only a small number of local nterfaces. One way to remedy ths problem s to trm some less mportant attrbutes from the global nterface. We use the global poston of each global attrbute to trm off less mportant attrbutes,.e., those that have large global poston values. A user-adustable threshold can be used to control ths. 7. mplementaton and expermental results 7.1 mplementaton WSE-ntegrator s developed usng JDK1.4 and s operatonal. WordNet1.6 s embedded nto the system through APs based on C. The GU of the system s shown n Fgure 5. The system can read the nterface descrpton of each Web ste and then dsplay the nterface descrpton vsually n a tree structure. From the ) tree vew, users can see all nformaton on each search nterface. The global nterface and the attrbute matchng nformaton are shown after the ntegraton s fnshed. Through the GU, users can remove or add any nterface at any tme on the fly. And the new global nterface s generated wthout startng from scratch. n addton, a user can choose any parameter value to trm attrbutes from the global nterface. To see a demo of WSE-ntegrator, go to the Web ste: Fgure 5: WSE-ntegrator nterface 7.2 Experments Evaluaton crtera Three qualtatve crtera for measurng the qualty of a global conceptual schema n the context of database schema ntegraton are proposed n [BN86] and they are Completeness and Correctness, Mnmalty and Understandablty. We rephrase these crtera and propose the followng prncples to gude the evaluaton of search nterface ntegraton. Correctness. Attrbutes that should be matched are correctly matched; attrbute domans for the matchng attrbutes are correctly merged and constructed. Completeness. f a result can be retreved drectly through a local nterface, then the result can also be retreved through the global nterface. Effcency. Global nterface constructon should consder query cost. Whle query cost s usually consdered at the query evaluaton tme, a bad global nterface may cause a hgh query cost despte of good query evaluaton algorthms. For example, supportng only very wde range condtons n the global nterface may cause too many local queres to be submtted to a local engne and too useless results to be transmtted to the metasearch engne. Frendlness. A global nterface should be smple and easy to understand and use by users. As an example, t s better to provde users a lst of values for an attrbute

11 Doman The number of Total Case 1 Case 2 Case 3 ama(%) amc(%) nterfaces Attrbutes 10 (1 st round) (2 nd round) Book 30 (3 rd round) (4 th round) (5 th round) Electroncs 10 (1 st round) (2 nd round) Average when these values are avalable for the attrbute rather than let users fll out wthout any knowledge. As another example, frequently used attrbutes should be arranged ahead of less frequently used ones. Effcency and frendlness of the global schema are taken nto consderaton by WSE-ntegrator (see Sectons 5 and 6). n the next subsecton, we report our expermental results for completeness and correctness for matchng attrbutes Expermental results To perform the experments, we collected the search nterfaces of 50 book Web stores and 20 electroncs Web stes, and then constructed the nterface representaton for each search nterface by hand (Tools for automatc constructon of ESE nterface representaton s under development and wll be reported n another paper). Correctness requres that attrbutes that should be matched across all search nterfaces be matched and that attrbutes that should not be matched not be matched. t also requres that the attrbutes n the global nterface be semantcally unque. To help measure the correctness of attrbute matchng, the global attrbute name and semantcs are used as a reference to measure how well local attrbutes are matched to the global attrbute. f there exst multple global attrbutes that are semantcally the same n the global nterface, the global attrbute wth more local attrbutes matched s consdered as the only real global attrbute whle others should be matched to t. There exst three cases for attrbute matchng: 1) Attrbutes are correctly matched to a unque global attrbute. 2) Attrbutes are ncorrectly matched to a global attrbute. 3) Attrbutes are correctly matched to a global attrbute, but they should belong to another matched group that has more matchng attrbutes. Our evaluaton metrc s called Attrbute Matchng Accuracy (ama), whch defnes what percentage of all attrbutes s correctly matched. ama n m = 1 = n a = 1 Table 3: Attrbute matchng correctness and completeness where n s the number of all local nterfaces used for ntegraton, m s the number of correctly matched attrbutes n the -th nterface (case 1), a s the number of all attrbutes n the -th nterface. Completeness requres that all contents and capabltes of each local nterface be preserved n the global nterface. As we mentoned above, three cases exst for attrbute matchng. Among these three cases, case 2 would reduce the completeness because some attrbutes are msmatched to a global attrbute; usng such global attrbutes may lead to ncorrect results from some local search engnes. For case 3, although the unqueness requrement s not satsfed, usng these global attrbutes can stll retreve results from the matchng local nterfaces. Therefore, case 3 matches do not affect completeness. We defne the Attrbute Matchng Completeness (amc) measure as follows: amc n ( a ) r = 1 = n a where r s the number of msmatched attrbutes n the -th nterface (case 2). We performed 5 rounds of experments on book nterfaces. n the frst round, 10 nterfaces were randomly selected and a global nterface was generated for them. n each subsequent round, 10 addtonal nterfaces were randomly selected and added to prevously selected nterfaces. Then a global nterface was generated from all selected nterfaces from scratch. Then we manually checked how well the attrbutes are matched. We also performed 2 rounds of experments usng nterfaces of electroncs stes. The expermental results are shown n Table 3. We can see that, on the average, the overall correctness and completeness of our approach for the two domans are 95.25% and 97.91%, respectvely. n addton, the results are remarkable stable (wth all correctness and completeness values wthn a narrow range) despte the dfferences n the number of nterfaces used and the product types. n all experments, the weghts for the seven metrcs n secton (the other metrc, smlarty match, has no fxed weght) are: Wam=0.5, Wcs=0.2, Wcd=0.1, Wvtm=0.4, Wdv=0.6, Wbp=0.1 and Wvp=0 (value pattern match s not used n our experments) and the weght = 1

12 threshold w s These values are obtaned from the experments usng the book nterfaces and they are appled to the electroncs nterfaces wthout changes. As the nterfaces for books are very dfferent from those for electroncs, the expermental results ndcate that the above parameter/threshold values are robust. 8. Conclusons n ths paper, we provded a comprehensve soluton to the problem of automatcally ntegratng the nterfaces of E-commerce search engnes. The problem s sgnfcantly dfferent from schema ntegraton for tradtonal database applcatons. Here we need to deal wth not only schema ntegraton, but also attrbute value ntegraton, format ntegraton and layout ntegraton. n ths paper, we descrbed our technques used to buld WSE-ntegrator. Wth approprate nterface representaton of local nterfaces, WSE-ntegrator automatcally ntegrates them nto a global nterface usng only doman (applcaton) ndependent knowledge. Our two-step approach based on postve matches and predctve matches for mergng attrbutes was shown to be very effectve by our experments. We beleve that the proposed approach can also be appled to other domans of E-commerce or ones beyond E-commerce such as dgtal lbrary and some professonal databases on the nternet. Whle good results were obtaned usng our method, there s room for mprovement. One possblty s to use the Open Drectory Herarchy to fnd more hypernymy relatonshps. One possble way to reduce the case 3 problem n Secton s to allow an attrbute n a local nterface to match more than one attrbute n the ntermedate nterface n Secton We wll nvestgate these possbltes n the near future. Acknowledgements Ths work s supported n part by the followng grants from Natonal Scence Foundaton: S , S , EA and ARO We thank Mr. eond Botsov for provdng us hs agrep algorthm mplementaton (tman.narod.ru/englsh/aboutnotser.html). References [BN86] C. Batn, M. enzern, S. Navathe. A Comparatve Analyss of Methodologes for Database Schema ntegraton. ACM Computng Surveys, 18(4): , December [BBB01]. Benett, D. Beneventano, S. Bergamasch, F. Guerra and M. Vncn. S-Desgner: An ntegraton Framework for E-Commerce. 17 th JCA-01, Seattle. [BCV01] S. Bergamasch, S. Castano, M. Vncn, D. Beneventano. Semantc ntegraton of Heterogeneous nformaton Sources. Journal of Data and Knowledge Engneerng, 36(3): , [Coh98] W. Cohen. ntegraton of Heterogeneous Databases Wthout Common Domans Usng Queres Based on Textual Smlarty. ACM SGMOD Conference, Seattle, WA, [DR02] H. Do, E. Rahm. COMA- A System for Flexble Combnaton of Schema Matchng Approaches. The 28 th VDB conference, Hong Kong, [DDH01] A. Doan, P. Domngos, A. Halevy. Reconclng Schemas of Dsparate Data Sources: A Machne-learnng Approach. ACM SGMOD Conference, May [DEW96] R. B.Doorenbos, O. Etzon, and D. S.Weld. A Scalable Comparson-Shoppng Agent for the World Wde Web. Techncal Report UW-CSE , Unversty of Washngton, [FB92] W. Frakes and R. Baeza-Yates. nformaton Retreval: Data Structures & Algorthms. Prentce Hall, Englewood Clffs, N.J [GKD97] M. Genesereth, A. Keller, O. Duschka. nfomaster: An nformaton ntegraton System. ACM SGMOD Conference, May 1997 [HC03] B. He, K. Chang. Statstcal Schema ntegraton Across the Deep Web. ACM SGMOD Conference, [HT4] HTM4: [NE89] J. arson, S. Navathe, R. Elmasr. A Theory of Attrbute Equvalence n Databases wth Applcaton to Schema ntegraton. EEE Transactons on Software Engneerng, Vol.15, No.4, Aprl [RO96] A. evy, A. Raaraman, J. J.Ordlle. Queryng Heterogeneous nformaton Sources Usng Source Descrpton. The 22 nd VDB Conference, nda, 1996 [C00] W., and C. Clfton. SEMNT: A Tool for dentfyng Attrbute Correspondences n Heterogeneous Databases Usng Neural Networks. Data & Knowledge Engneerng, 33: 49-84, [MBR01] J. Madhavan, P. Bernsten, E. Rahm. Generc Schema Matchng wth Cupd. VDB Conference, [MGR02] S. Melnk, H. Garca-Molna, and E. Rahm. Smlarty Floodng: A Versatle Graph Matchng Algorthm and ts Applcaton to Schema Matchng. EEE Conference on Data Engneerng, San Jose, [M95] A. Mller. WordNet: A excal Database for Englsh. Communcatons of the ACM, 38(11): 39-41,1995. [PMH03] Q. Peng, W. Meng, H. He, and C. Yu. Clusterng of E-Commerce Search Engnes. Submtted for publcaton, [RGM01] S. Raghavan, H. Garca-Molna. Crawlng the Hdden Web. The 27 th VDB Conference, [RB01] E. Rahm, P. Bernsten. A Survey of Approaches to Automatc Schema Matchng. VDB Journal, 10: , [WDNT] WordNet: [WM92] S. Wu and U. Manber. Fast Text Searchng Allowng Errors. Communcatons of the ACM, 35(10):83-91, October 1992.

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

Web Object Indexing Using Domain Knowledge *

Web Object Indexing Using Domain Knowledge * Web Object Indexng Usng Doman Knowledge * Muyuan Wang Department of Automaton Tsnghua Unversty Bejng 100084, Chna (86-10)51774518 Zhwe L, Le Lu, We-Yng Ma Mcrosoft Research Asa Sgma Center, Hadan Dstrct

More information

1 Example 1: Axis-aligned rectangles

1 Example 1: Axis-aligned rectangles COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

Enterprise Master Patient Index

Enterprise Master Patient Index Enterprse Master Patent Index Healthcare data are captured n many dfferent settngs such as hosptals, clncs, labs, and physcan offces. Accordng to a report by the CDC, patents n the Unted States made an

More information

VRT012 User s guide V0.1. Address: Žirmūnų g. 27, Vilnius LT-09105, Phone: (370-5) 2127472, Fax: (370-5) 276 1380, Email: info@teltonika.

VRT012 User s guide V0.1. Address: Žirmūnų g. 27, Vilnius LT-09105, Phone: (370-5) 2127472, Fax: (370-5) 276 1380, Email: info@teltonika. VRT012 User s gude V0.1 Thank you for purchasng our product. We hope ths user-frendly devce wll be helpful n realsng your deas and brngng comfort to your lfe. Please take few mnutes to read ths manual

More information

8 Algorithm for Binary Searching in Trees

8 Algorithm for Binary Searching in Trees 8 Algorthm for Bnary Searchng n Trees In ths secton we present our algorthm for bnary searchng n trees. A crucal observaton employed by the algorthm s that ths problem can be effcently solved when the

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Luby s Alg. for Maximal Independent Sets using Pairwise Independence Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

A Performance Analysis of View Maintenance Techniques for Data Warehouses

A Performance Analysis of View Maintenance Techniques for Data Warehouses A Performance Analyss of Vew Mantenance Technques for Data Warehouses Xng Wang Dell Computer Corporaton Round Roc, Texas Le Gruenwald The nversty of Olahoma School of Computer Scence orman, OK 739 Guangtao

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

Extending Probabilistic Dynamic Epistemic Logic

Extending Probabilistic Dynamic Epistemic Logic Extendng Probablstc Dynamc Epstemc Logc Joshua Sack May 29, 2008 Probablty Space Defnton A probablty space s a tuple (S, A, µ), where 1 S s a set called the sample space. 2 A P(S) s a σ-algebra: a set

More information

The Greedy Method. Introduction. 0/1 Knapsack Problem

The Greedy Method. Introduction. 0/1 Knapsack Problem The Greedy Method Introducton We have completed data structures. We now are gong to look at algorthm desgn methods. Often we are lookng at optmzaton problems whose performance s exponental. For an optmzaton

More information

NEURO-FUZZY INFERENCE SYSTEM FOR E-COMMERCE WEBSITE EVALUATION

NEURO-FUZZY INFERENCE SYSTEM FOR E-COMMERCE WEBSITE EVALUATION NEURO-FUZZY INFERENE SYSTEM FOR E-OMMERE WEBSITE EVALUATION Huan Lu, School of Software, Harbn Unversty of Scence and Technology, Harbn, hna Faculty of Appled Mathematcs and omputer Scence, Belarusan State

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

Software project management with GAs

Software project management with GAs Informaton Scences 177 (27) 238 241 www.elsever.com/locate/ns Software project management wth GAs Enrque Alba *, J. Francsco Chcano Unversty of Málaga, Grupo GISUM, Departamento de Lenguajes y Cencas de

More information

Project Networks With Mixed-Time Constraints

Project Networks With Mixed-Time Constraints Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

IMPACT ANALYSIS OF A CELLULAR PHONE

IMPACT ANALYSIS OF A CELLULAR PHONE 4 th ASA & μeta Internatonal Conference IMPACT AALYSIS OF A CELLULAR PHOE We Lu, 2 Hongy L Bejng FEAonlne Engneerng Co.,Ltd. Bejng, Chna ABSTRACT Drop test smulaton plays an mportant role n nvestgatng

More information

Conversion between the vector and raster data structures using Fuzzy Geographical Entities

Conversion between the vector and raster data structures using Fuzzy Geographical Entities Converson between the vector and raster data structures usng Fuzzy Geographcal Enttes Cdála Fonte Department of Mathematcs Faculty of Scences and Technology Unversty of Combra, Apartado 38, 3 454 Combra,

More information

1.1 The University may award Higher Doctorate degrees as specified from time-to-time in UPR AS11 1.

1.1 The University may award Higher Doctorate degrees as specified from time-to-time in UPR AS11 1. HIGHER DOCTORATE DEGREES SUMMARY OF PRINCIPAL CHANGES General changes None Secton 3.2 Refer to text (Amendments to verson 03.0, UPR AS02 are shown n talcs.) 1 INTRODUCTION 1.1 The Unversty may award Hgher

More information

Lecture 2: Single Layer Perceptrons Kevin Swingler

Lecture 2: Single Layer Perceptrons Kevin Swingler Lecture 2: Sngle Layer Perceptrons Kevn Sngler kms@cs.str.ac.uk Recap: McCulloch-Ptts Neuron Ths vastly smplfed model of real neurons s also knon as a Threshold Logc Unt: W 2 A Y 3 n W n. A set of synapses

More information

Design and Development of a Security Evaluation Platform Based on International Standards

Design and Development of a Security Evaluation Platform Based on International Standards Internatonal Journal of Informatcs Socety, VOL.5, NO.2 (203) 7-80 7 Desgn and Development of a Securty Evaluaton Platform Based on Internatonal Standards Yuj Takahash and Yoshm Teshgawara Graduate School

More information

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network *

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 819-840 (2008) Data Broadcast on a Mult-System Heterogeneous Overlayed Wreless Network * Department of Computer Scence Natonal Chao Tung Unversty Hsnchu,

More information

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy Fnancal Tme Seres Analyss Patrck McSharry patrck@mcsharry.net www.mcsharry.net Trnty Term 2014 Mathematcal Insttute Unversty of Oxford Course outlne 1. Data analyss, probablty, correlatons, vsualsaton

More information

Automatic Integration of Web Search Interfaces with WISE-Integrator

Automatic Integration of Web Search Interfaces with WISE-Integrator Automatic Integration of Web Search Interfaces with WISE-Integrator HAI HE 1, WEIYI MENG 1, CLEMENT YU 2, ZONGHUAN WU 3 1 Department of Computer Science, SUNY at Binghamton, Binghamton, NY 13902, USA {haihe,

More information

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6 PAR TESTS If a WEIGHT varable s specfed, t s used to replcate a case as many tmes as ndcated by the weght value rounded to the nearest nteger. If the workspace requrements are exceeded and samplng has

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12 14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

Inter-Ing 2007. INTERDISCIPLINARITY IN ENGINEERING SCIENTIFIC INTERNATIONAL CONFERENCE, TG. MUREŞ ROMÂNIA, 15-16 November 2007.

Inter-Ing 2007. INTERDISCIPLINARITY IN ENGINEERING SCIENTIFIC INTERNATIONAL CONFERENCE, TG. MUREŞ ROMÂNIA, 15-16 November 2007. Inter-Ing 2007 INTERDISCIPLINARITY IN ENGINEERING SCIENTIFIC INTERNATIONAL CONFERENCE, TG. MUREŞ ROMÂNIA, 15-16 November 2007. UNCERTAINTY REGION SIMULATION FOR A SERIAL ROBOT STRUCTURE MARIUS SEBASTIAN

More information

A Secure Password-Authenticated Key Agreement Using Smart Cards

A Secure Password-Authenticated Key Agreement Using Smart Cards A Secure Password-Authentcated Key Agreement Usng Smart Cards Ka Chan 1, Wen-Chung Kuo 2 and Jn-Chou Cheng 3 1 Department of Computer and Informaton Scence, R.O.C. Mltary Academy, Kaohsung 83059, Tawan,

More information

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000 Problem Set 5 Solutons 1 MIT s consderng buldng a new car park near Kendall Square. o unversty funds are avalable (overhead rates are under pressure and the new faclty would have to pay for tself from

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

A DATA MINING APPLICATION IN A STUDENT DATABASE

A DATA MINING APPLICATION IN A STUDENT DATABASE JOURNAL OF AERONAUTICS AND SPACE TECHNOLOGIES JULY 005 VOLUME NUMBER (53-57) A DATA MINING APPLICATION IN A STUDENT DATABASE Şenol Zafer ERDOĞAN Maltepe Ünversty Faculty of Engneerng Büyükbakkalköy-Istanbul

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

ERP Software Selection Using The Rough Set And TPOSIS Methods

ERP Software Selection Using The Rough Set And TPOSIS Methods ERP Software Selecton Usng The Rough Set And TPOSIS Methods Under Fuzzy Envronment Informaton Management Department, Hunan Unversty of Fnance and Economcs, No. 139, Fengln 2nd Road, Changsha, 410205, Chna

More information

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching) Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton

More information

Survey on Virtual Machine Placement Techniques in Cloud Computing Environment

Survey on Virtual Machine Placement Techniques in Cloud Computing Environment Survey on Vrtual Machne Placement Technques n Cloud Computng Envronment Rajeev Kumar Gupta and R. K. Paterya Department of Computer Scence & Engneerng, MANIT, Bhopal, Inda ABSTRACT In tradtonal data center

More information

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL

More information

Implementations of Web-based Recommender Systems Using Hybrid Methods

Implementations of Web-based Recommender Systems Using Hybrid Methods Internatonal Journal of Computer Scence & Applcatons Vol. 3 Issue 3, pp 52-64 2006 Technomathematcs Research Foundaton Implementatons of Web-based Recommender Systems Usng Hybrd Methods Janusz Sobeck Insttute

More information

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S S C H E D A E I N F O R M A T I C A E VOLUME 0 0 On Mean Squared Error of Herarchcal Estmator Stans law Brodowsk Faculty of Physcs, Astronomy, and Appled Computer Scence, Jagellonan Unversty, Reymonta

More information

Instructions for Analyzing Data from CAHPS Surveys:

Instructions for Analyzing Data from CAHPS Surveys: Instructons for Analyzng Data from CAHPS Surveys: Usng the CAHPS Analyss Program Verson 4.1 Purpose of ths Document...1 The CAHPS Analyss Program...1 Computng Requrements...1 Pre-Analyss Decsons...2 What

More information

To manage leave, meeting institutional requirements and treating individual staff members fairly and consistently.

To manage leave, meeting institutional requirements and treating individual staff members fairly and consistently. Corporate Polces & Procedures Human Resources - Document CPP216 Leave Management Frst Produced: Current Verson: Past Revsons: Revew Cycle: Apples From: 09/09/09 26/10/12 09/09/09 3 years Immedately Authorsaton:

More information

Document Clustering Analysis Based on Hybrid PSO+K-means Algorithm

Document Clustering Analysis Based on Hybrid PSO+K-means Algorithm Document Clusterng Analyss Based on Hybrd PSO+K-means Algorthm Xaohu Cu, Thomas E. Potok Appled Software Engneerng Research Group, Computatonal Scences and Engneerng Dvson, Oak Rdge Natonal Laboratory,

More information

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy 4.02 Quz Solutons Fall 2004 Multple-Choce Questons (30/00 ponts) Please, crcle the correct answer for each of the followng 0 multple-choce questons. For each queston, only one of the answers s correct.

More information

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES Zuzanna BRO EK-MUCHA, Grzegorz ZADORA, 2 Insttute of Forensc Research, Cracow, Poland 2 Faculty of Chemstry, Jagellonan

More information

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications CMSC828G Prncples of Data Mnng Lecture #9 Today s Readng: HMS, chapter 9 Today s Lecture: Descrptve Modelng Clusterng Algorthms Descrptve Models model presents the man features of the data, a global summary

More information

Multiple-Period Attribution: Residuals and Compounding

Multiple-Period Attribution: Residuals and Compounding Multple-Perod Attrbuton: Resduals and Compoundng Our revewer gave these authors full marks for dealng wth an ssue that performance measurers and vendors often regard as propretary nformaton. In 1994, Dens

More information

) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance

) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance Calbraton Method Instances of the Cell class (one nstance for each FMS cell) contan ADC raw data and methods assocated wth each partcular FMS cell. The calbraton method ncludes event selecton (Class Cell

More information

Negative Selection and Niching by an Artificial Immune System for Network Intrusion Detection

Negative Selection and Niching by an Artificial Immune System for Network Intrusion Detection Negatve Selecton and Nchng by an Artfcal Immune System for Network Intruson Detecton Jungwon Km and Peter Bentley Department of omputer Scence, Unversty ollege London, Gower Street, London, W1E 6BT, U.K.

More information

Traffic State Estimation in the Traffic Management Center of Berlin

Traffic State Estimation in the Traffic Management Center of Berlin Traffc State Estmaton n the Traffc Management Center of Berln Authors: Peter Vortsch, PTV AG, Stumpfstrasse, D-763 Karlsruhe, Germany phone ++49/72/965/35, emal peter.vortsch@ptv.de Peter Möhl, PTV AG,

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

GENESYS BUSINESS MANAGER

GENESYS BUSINESS MANAGER GENESYS BUSINESS MANAGER e-manager Onlne Conference User Account Admnstraton User Gude Ths User Gude contans the followng sectons: Mnmum Requrements...3 Gettng Started...4 Sgnng On to Genesys Busness Manager...7

More information

Methodology to Determine Relationships between Performance Factors in Hadoop Cloud Computing Applications

Methodology to Determine Relationships between Performance Factors in Hadoop Cloud Computing Applications Methodology to Determne Relatonshps between Performance Factors n Hadoop Cloud Computng Applcatons Lus Eduardo Bautsta Vllalpando 1,2, Alan Aprl 1 and Alan Abran 1 1 Department of Software Engneerng and

More information

Nordea G10 Alpha Carry Index

Nordea G10 Alpha Carry Index Nordea G10 Alpha Carry Index Index Rules v1.1 Verson as of 10/10/2013 1 (6) Page 1 Index Descrpton The G10 Alpha Carry Index, the Index, follows the development of a rule based strategy whch nvests and

More information

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel

More information

+ + + - - This circuit than can be reduced to a planar circuit

+ + + - - This circuit than can be reduced to a planar circuit MeshCurrent Method The meshcurrent s analog of the nodeoltage method. We sole for a new set of arables, mesh currents, that automatcally satsfy KCLs. As such, meshcurrent method reduces crcut soluton to

More information

Rank Based Clustering For Document Retrieval From Biomedical Databases

Rank Based Clustering For Document Retrieval From Biomedical Databases Jayanth Mancassamy et al /Internatonal Journal on Computer Scence and Engneerng Vol.1(2), 2009, 111-115 Rank Based Clusterng For Document Retreval From Bomedcal Databases Jayanth Mancassamy Department

More information

Brigid Mullany, Ph.D University of North Carolina, Charlotte

Brigid Mullany, Ph.D University of North Carolina, Charlotte Evaluaton And Comparson Of The Dfferent Standards Used To Defne The Postonal Accuracy And Repeatablty Of Numercally Controlled Machnng Center Axes Brgd Mullany, Ph.D Unversty of North Carolna, Charlotte

More information

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION

Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION Vson Mouse Saurabh Sarkar a* a Unversty of Cncnnat, Cncnnat, USA ABSTRACT The report dscusses a vson based approach towards trackng of eyes and fngers. The report descrbes the process of locatng the possble

More information

We assume your students are learning about self-regulation (how to change how alert they feel) through the Alert Program with its three stages:

We assume your students are learning about self-regulation (how to change how alert they feel) through the Alert Program with its three stages: Welcome to ALERT BINGO, a fun-flled and educatonal way to learn the fve ways to change engnes levels (Put somethng n your Mouth, Move, Touch, Look, and Lsten) as descrbed n the How Does Your Engne Run?

More information

Canon NTSC Help Desk Documentation

Canon NTSC Help Desk Documentation Canon NTSC Help Desk Documentaton READ THIS BEFORE PROCEEDING Before revewng ths documentaton, Canon Busness Solutons, Inc. ( CBS ) hereby refers you, the customer or customer s representatve or agent

More information

PAS: A Packet Accounting System to Limit the Effects of DoS & DDoS. Debish Fesehaye & Klara Naherstedt University of Illinois-Urbana Champaign

PAS: A Packet Accounting System to Limit the Effects of DoS & DDoS. Debish Fesehaye & Klara Naherstedt University of Illinois-Urbana Champaign PAS: A Packet Accountng System to Lmt the Effects of DoS & DDoS Debsh Fesehaye & Klara Naherstedt Unversty of Illnos-Urbana Champagn DoS and DDoS DDoS attacks are ncreasng threats to our dgtal world. Exstng

More information

BERNSTEIN POLYNOMIALS

BERNSTEIN POLYNOMIALS On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful

More information

METHODOLOGY TO DETERMINE RELATIONSHIPS BETWEEN PERFORMANCE FACTORS IN HADOOP CLOUD COMPUTING APPLICATIONS

METHODOLOGY TO DETERMINE RELATIONSHIPS BETWEEN PERFORMANCE FACTORS IN HADOOP CLOUD COMPUTING APPLICATIONS METHODOLOGY TO DETERMINE RELATIONSHIPS BETWEEN PERFORMANCE FACTORS IN HADOOP CLOUD COMPUTING APPLICATIONS Lus Eduardo Bautsta Vllalpando 1,2, Alan Aprl 1 and Alan Abran 1 1 Department of Software Engneerng

More information

A Replication-Based and Fault Tolerant Allocation Algorithm for Cloud Computing

A Replication-Based and Fault Tolerant Allocation Algorithm for Cloud Computing A Replcaton-Based and Fault Tolerant Allocaton Algorthm for Cloud Computng Tork Altameem Dept of Computer Scence, RCC, Kng Saud Unversty, PO Box: 28095 11437 Ryadh-Saud Araba Abstract The very large nfrastructure

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP) 6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes

More information

Planning for Marketing Campaigns

Planning for Marketing Campaigns Plannng for Marketng Campagns Qang Yang and Hong Cheng Department of Computer Scence Hong Kong Unversty of Scence and Technology Clearwater Bay, Kowloon, Hong Kong, Chna (qyang, csch)@cs.ust.hk Abstract

More information

Master s Thesis. Configuring robust virtual wireless sensor networks for Internet of Things inspired by brain functional networks

Master s Thesis. Configuring robust virtual wireless sensor networks for Internet of Things inspired by brain functional networks Master s Thess Ttle Confgurng robust vrtual wreless sensor networks for Internet of Thngs nspred by bran functonal networks Supervsor Professor Masayuk Murata Author Shnya Toyonaga February 10th, 2014

More information

Selecting Best Employee of the Year Using Analytical Hierarchy Process

Selecting Best Employee of the Year Using Analytical Hierarchy Process J. Basc. Appl. Sc. Res., 5(11)72-76, 2015 2015, TextRoad Publcaton ISSN 2090-4304 Journal of Basc and Appled Scentfc Research www.textroad.com Selectng Best Employee of the Year Usng Analytcal Herarchy

More information

Section 5.4 Annuities, Present Value, and Amortization

Section 5.4 Annuities, Present Value, and Amortization Secton 5.4 Annutes, Present Value, and Amortzaton Present Value In Secton 5.2, we saw that the present value of A dollars at nterest rate per perod for n perods s the amount that must be deposted today

More information

POLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and

POLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and POLYSA: A Polynomal Algorthm for Non-bnary Constrant Satsfacton Problems wth and Mguel A. Saldo, Federco Barber Dpto. Sstemas Informátcos y Computacón Unversdad Poltécnca de Valenca, Camno de Vera s/n

More information

PEER REVIEWER RECOMMENDATION IN ONLINE SOCIAL LEARNING CONTEXT: INTEGRATING INFORMATION OF LEARNERS AND SUBMISSIONS

PEER REVIEWER RECOMMENDATION IN ONLINE SOCIAL LEARNING CONTEXT: INTEGRATING INFORMATION OF LEARNERS AND SUBMISSIONS PEER REVIEWER RECOMMENDATION IN ONLINE SOCIAL LEARNING CONTEXT: INTEGRATING INFORMATION OF LEARNERS AND SUBMISSIONS Yunhong Xu, Faculty of Management and Economcs, Kunmng Unversty of Scence and Technology,

More information

http://wrap.warwick.ac.uk

http://wrap.warwick.ac.uk Orgnal ctaton: Crstea, Alexandra I. and De Moo, A. (2003) Desgner adaptaton n adaptve hypermeda authorng. In: Internatonal Conference on Informaton Technology : Codng and Computng (ITCC 2003), Las Vegas,

More information

1. Measuring association using correlation and regression

1. Measuring association using correlation and regression How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a

More information

"Research Note" APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES *

Research Note APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES * Iranan Journal of Scence & Technology, Transacton B, Engneerng, ol. 30, No. B6, 789-794 rnted n The Islamc Republc of Iran, 006 Shraz Unversty "Research Note" ALICATION OF CHARGE SIMULATION METHOD TO ELECTRIC

More information

APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT

APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT Toshhko Oda (1), Kochro Iwaoka (2) (1), (2) Infrastructure Systems Busness Unt, Panasonc System Networks Co., Ltd. Saedo-cho

More information

Stochastic Protocol Modeling for Anomaly Based Network Intrusion Detection

Stochastic Protocol Modeling for Anomaly Based Network Intrusion Detection Stochastc Protocol Modelng for Anomaly Based Network Intruson Detecton Juan M. Estevez-Tapador, Pedro Garca-Teodoro, and Jesus E. Daz-Verdejo Department of Electroncs and Computer Technology Unversty of

More information

Fault tolerance in cloud technologies presented as a service

Fault tolerance in cloud technologies presented as a service Internatonal Scentfc Conference Computer Scence 2015 Pavel Dzhunev, PhD student Fault tolerance n cloud technologes presented as a servce INTRODUCTION Improvements n technques for vrtualzaton and performance

More information

Testing Database Programs using Relational Symbolic Execution

Testing Database Programs using Relational Symbolic Execution Testng Database Programs usng Relatonal Symbolc Executon Mchaël Marcozz 1, Wm Vanhoof, Jean-Luc Hanaut Faculty of Computer Scence Unversty of Namur Rue Grandgagnage, 21 5000 Namur, Belgum Abstract Symbolc

More information

Reporting Forms ARF 113.0A, ARF 113.0B, ARF 113.0C and ARF 113.0D FIRB Corporate (including SME Corporate), Sovereign and Bank Instruction Guide

Reporting Forms ARF 113.0A, ARF 113.0B, ARF 113.0C and ARF 113.0D FIRB Corporate (including SME Corporate), Sovereign and Bank Instruction Guide Reportng Forms ARF 113.0A, ARF 113.0B, ARF 113.0C and ARF 113.0D FIRB Corporate (ncludng SME Corporate), Soveregn and Bank Instructon Gude Ths nstructon gude s desgned to assst n the completon of the FIRB

More information

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange

More information

Overview of monitoring and evaluation

Overview of monitoring and evaluation 540 Toolkt to Combat Traffckng n Persons Tool 10.1 Overvew of montorng and evaluaton Overvew Ths tool brefly descrbes both montorng and evaluaton, and the dstncton between the two. What s montorng? Montorng

More information

Ants Can Schedule Software Projects

Ants Can Schedule Software Projects Ants Can Schedule Software Proects Broderck Crawford 1,2, Rcardo Soto 1,3, Frankln Johnson 4, and Erc Monfroy 5 1 Pontfca Unversdad Católca de Valparaíso, Chle FrstName.Name@ucv.cl 2 Unversdad Fns Terrae,

More information

HowHow to Find the Best Online Stock Broker

HowHow to Find the Best Online Stock Broker A GENERAL APPROACH FOR SECURITY MONITORING AND PREVENTIVE CONTROL OF NETWORKS WITH LARGE WIND POWER PRODUCTION Helena Vasconcelos INESC Porto hvasconcelos@nescportopt J N Fdalgo INESC Porto and FEUP jfdalgo@nescportopt

More information

Financial Mathemetics

Financial Mathemetics Fnancal Mathemetcs 15 Mathematcs Grade 12 Teacher Gude Fnancal Maths Seres Overvew In ths seres we am to show how Mathematcs can be used to support personal fnancal decsons. In ths seres we jon Tebogo,

More information

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,

More information

Period and Deadline Selection for Schedulability in Real-Time Systems

Period and Deadline Selection for Schedulability in Real-Time Systems Perod and Deadlne Selecton for Schedulablty n Real-Tme Systems Thdapat Chantem, Xaofeng Wang, M.D. Lemmon, and X. Sharon Hu Department of Computer Scence and Engneerng, Department of Electrcal Engneerng

More information