Network Maximal Correlation Soheil Feizi, Ali Makhdoumi, Ken Duffy, Manolis Kellis, and Muriel Medard

Size: px
Start display at page:

Download "Network Maximal Correlation Soheil Feizi, Ali Makhdoumi, Ken Duffy, Manolis Kellis, and Muriel Medard"

Transcription

1 Computer Scence and Artfcal Intellgence Laboratory Techncal Report MIT-CSAIL-TR September 2, 205 Network Maxmal Correlaton Sohel Fez, Al Makhdoum, Ken Duffy, Manols Kells, and Murel Medard massachusetts nsttute of technology, cambrdge, ma 0239 usa

2 Network Maxmal Correlaton Sohel Fez,3, Al Makhdoum,3, Ken Duffy 2, Manols Kells and Murel Médard September 205 Abstract Identfyng nonlnear relatonshps n large datasets s a dauntng task partcularly when the form of the nonlnearty s unknown. Here, we ntroduce Network Maxmal Correlaton (NMC) as a fundamental measure to capture nonlnear assocatons n networks wthout the knowledge of underlyng nonlnearty shapes. NMC nfers, possbly nonlnear, transformatons of varables wth zero means and unt varances by maxmzng total nonlnear correlaton over the underlyng network. For the case of havng two varables, NMC s equvalent to the standard Maxmal Correlaton. We characterze a soluton of the NMC optmzaton usng geometrc propertes of Hlbert spaces for both dscrete and jontly Gaussan varables. For dscrete random varables, we show that the NMC optmzaton s an nstance of the Maxmum Correlaton Problem and provde necessary condtons for ts global optmal soluton. Moreover, we propose an effcent algorthm based on Alternatng Condtonal Expectaton (ACE) whch converges to a local NMC optmum. For ths algorthm, we provde gudelnes for choosng approprate startng ponts to jump out of local maxmzers. We also propose a dstrbuted algorthm to compute a -ɛ approxmaton of the NMC value for large and dense graphs usng graph parttonng. For jontly Gaussan varables, under some condtons, we show that the NMC optmzaton can be smplfed to a Max-Cut problem, where we provde condtons under whch an NMC soluton can be computed exactly. Under some general condtons, we show that NMC can nfer the underlyng graphcal model for functons of latent jontly Gaussan varables. These functons are unknown, bjectve, and can be nonlnear. Ths result broadens the famly of contnuous dstrbutons whose graphcal models can be characterzed effcently. We llustrate the robustness of NMC n real world applcatons by showng ts contnuty wth respect to small perturbatons of jont dstrbutons. We also show that sample NMC (NMC computed usng emprcal dstrbutons) converges exponentally fast to the true NMC value. Fnally, we apply NMC to dfferent cancer datasets ncludng breast, kdney and lver cancers, and show that NMC nfers gene modules that are sgnfcantly assocated wth survval tmes of ndvduals whle they are not detected usng lnear assocaton measures. Introducton Identfyng relatonshps among varables n large datasets s an ncreasngly mportant task n systems bology [], socal scences [2], fnance [3], etc. Whle correlaton-based measures capture lnear assocatons, they can fal to nfer true nonlnear relatonshps among varables, whch can often occur n real-world applcatons [4]. One famly of measures to nfer nonlnear assocatons among Massachusetts Insttute of Technology (MIT), Cambrdge, US. 2 Hamlton Insttute, Maynooth Unversty, Ireland. 3 These authors contrbuted equally to ths work.

3 varables s based on mutual nformaton [5, 6]. Although mutual nformaton computes a measure of assocaton strength among varables, t does not provde functons through whch varables are related to each other. Moreover, relable computaton of mutual nformaton requres an excessve number of samples, partcularly for large number of varables [7]. A classcal measure to nfer a nonlnear relatonshp between two varables s Maxmal Correlaton (MC), ntroduced by Gebelen [8] and studed n references [9 2]. MC nfers, possbly nonlnear, transformatons of two varables wth zero means and unt varances by maxmzng ther parwse correlaton. MC can be computed effcently for both dscrete [3] and contnuous [4] random varables. For dscrete varables, under some mld condtons, MC s equal to the second largest sngular value of a normalzed jont probablty dstrbuton matrx [3]. In that case, transformatons of varables can be characterzed usng rght and left sngular vectors of the normalzed probablty dstrbuton matrx. Recently, MC has been used n dfferent applcatons n nformaton theory [5 7], nformaton-theoretc securty and prvacy [8 20], and data processng [2, 22]. Many modern applcatons nclude large number of varables wth possbly nonlnear relatonshps among them. Usng MC to capture parwse assocatons can cause sgnfcant over-fttng ssues because each varable can be assgned to multple nonlnear relatons. Here we propose Network Maxmal Correlaton (NMC) as a fundamental measure to capture nonlnear assocatons n networks wthout the knowledge of underlyng nonlnearty shapes. In the NMC optmzaton, each varable s assgned to at most one transformaton functon wth zero mean and unt varance. NMC nfers optmal transformatons of varables by maxmzng ther nner products over edges of the underlyng graph. For the case of two varables, NMC s equvalent to MC. The NMC defnton does not assume a specfc relatonshp among node varables and the graph structure. For llustraton, we consder ths relatonshp n dfferent NMC applcatons such as graphcal model nference. Furthermore, the NMC optmzaton can be regularzed to have even fewer nonlnear transformatons to avod over-fttng ssues. In ths paper, we characterze a soluton of the NMC optmzaton usng geometrc propertes of Hlbert spaces for both dscrete and contnuous jontly Gaussan varables. For dscrete random varables, we show that the NMC optmzaton s an nstance of the Maxmum Correlaton Problem (MCP) whch s NP-hard [23 26]. In ths case, usng results of the Multvarate Egenvector Problem (MEP) [23], we provde necessary condtons for a global NMC optmum. We also propose an effcent algorthm based on Alternatng Condtonal Expectaton (ACE) [3], whch converges to a local NMC optmum. We also provde gudelnes for choosng approprate startng ponts of the algorthm to jump out of local maxmzers. The proposed ACE algorthm does not requre formng jont dstrbuton matrces whch could be expensve for varables wth large alphabet szes. We also propose a dstrbuted verson of the ACE algorthm to compute a -ɛ approxmaton of the NMC value for large and dense graphs usng graph parttonng. For jontly Gaussan varables, we use projectons over Hermtte-Chebychev polynomals to characterze an optmal soluton of the NMC optmzaton. Under some condtons, we show that the NMC optmzaton s equvalent to the Max-Cut problem, whch s NP-complete [27]. However, there exst algorthms to approxmate ts soluton usng Semdefnte Programmng (SDP) wthn an approxmaton factor of [28]. In ths case, we provde condtons under whch an NMC soluton can be computed exactly. Usng these results, under some general condtons we show that NMC can nfer the underlyng graphcal model for functons of latent jontly Gaussan varables. These functons are unknown, bjectve, and can be nonlnear. Ths result broadens the famly of contnuous dstrbutons whose graphcal models can be characterzed effcently. 2

4 In real-world applcatons, often only nosy samples of jont dstrbutons are avalable. For ths case, we prove a fnte sample generalzaton bound, and error guarantees for NMC. In partcular, under general condtons we prove that NMC s contnuous wth respect to jont probablty dstrbutons. That s, a small perturbaton n the dstrbuton results n a small change n the NMC value. Moreover, we show that Sample NMC (.e., NMC computed usng emprcal dstrbutons) converges exponentally fast to the NMC value as the sample sze grows. Moreover, we use the NMC optmzaton to characterze a nonlnear global relevance graph wth a certan complexty [29] and propose a greedy algorthm to nfer such a nonlnear relevance graph approxmately. Fnally, we apply NMC to dfferent cancer datasets [30] ncludng breast, kdney and lver cancers and show that usng the NMC network, we can nfer gene modules that are sgnfcantly assocated wth survval tmes of ndvduals whle they are not detected usng lnear assocaton measures. 2 Maxmal Correlaton In ths secton, we ntroduce notatons and revew pror work on maxmal correlaton. 2. Notaton Suppose X and X 2 are two random varables defned on probablty space (Ω, F, P ) takng values n (X, B ) and (X 2, B 2 ), respectvely. The map X (Ω, F) (X, B ) generates the subalgebra F = X (B ) of F. Let P X be the restrcton of the measure P on F, =, 2. For dscrete varables, X and X 2 are ther fnte support sets wth cardnaltes X, for =, Defnton and General Propertes A Pearson s lnear correlaton coeffcent between real-valued varables X and X 2 s defned as cor(x, X 2 ) = E [(X E[X ]) (X 2 E[X 2 ])] var(x ), var(x 2 ) where var(x ) represents the varance of random varable X, for =, 2. Correlaton-based measures capture lnear assocatons between varables, gnorng possble nonlnear relatonshps. Example Suppose X s a Gaussan varable wth zero mean and unt varance. Let X 2 = X 2. In ths case, even though varables are strongly assocated wth each other, the correlaton coeffcent between them s close to zero (see e.g. Fgure -a). It s because these varables are related through a nonlnear transformaton. One way to capture such a nonlnear relatonshp between these varables s to quantfy maxmum correlaton between ther, possbly nonlnear, transformatons. In ths example, suppose φ (X ) = α X 2 + α 2 and φ 2 (X 2 ) = α 2 X 2 + α 22, where coeffcents α j are selected so that φ (X ) has zero mean and unt varance, for both =, 2. In ths case, the correlaton coeffcent between transformed varables s one (see e.g. Fgure -b), capturng a strong nonlnear assocaton between varables X and X 2. Maxmal correlaton (MC) between varables X and X 2 whch was ntroduced by Gebelen [8] captures a nonlnear assocaton between them by selectng, possbly nonlnear, transformaton functons φ (X ) and φ 2 (X 2 ) so that φ (X ) and φ (X ) have the hghest correlaton among all other transformaton functons wth zero means and unt varances. 3

5 (a) 0.25 (b) 0.06 X Maxmal Corrolaton X Fgure : (a) Samples of varables X and X 2 wth a nonlnear relatonshp consdered n Example. (b) φ (X ) and φ 2 (X 2 ) are transformed varables, capturng the nonlnear relatonshp between X and X 2. Defnton (Maxmal Correlaton) Maxmal correlaton between two random varables X and X 2 s defned as ρ(x, X 2 ) max φ,φ 2 E[φ (X ) φ 2 (X 2 )], (2.) subject to φ (X ) Ω R s measurable, E[φ (X )] = 0, and E[φ (X ) 2 ] =, for =, 2. For =, 2, let φ (X ) denote an optmal soluton of (2.). Maxmal correlaton ρ(x, X 2 ) s always between 0 and, where a hgh MC value ndcates a strong assocaton between two varables [8]. The study of maxmal correlaton and other prncple nerta components between two varables dates back to Hrschfeld [9], Gebelen [8], Sarmanov [0], Rény [], and Greenacre [2]. Recently, MC has been used n nformaton theory and appled probablty problems such as data processng, nference of common randomness among others [0, 4, 22, 3, 32]. Unlke lnear correlaton, MC only depends on the jont dstrbuton of varables P X,X 2 (, ), and not on ther alphabets X. Several works have nvestgated dfferent aspects of optmzaton (2.) for both dscrete [3] and contnuous [4] random varables. In partcular, the exstence of an optmal soluton for the MC optmzaton and the unqueness of such solutons have been nvestgated n [3]. Reference [4] has used projectons over Hlbert spaces to compute MC for Gaussan varables. We extend ths approach to derve exstng MC results for dscrete varables. In the next secton, we use a smlar approach based on Hlbert projectons to characterze network maxmal correlaton for both dscrete and jontly Gaussan varables. Defnton 2 For =, 2, we defne a Hlbert space H as H = {φ (X ) φ (X ) s measurable, E[φ (X )] = 0, E[(φ (X )) 2 ] < }, where the product s defned as φ, φ E[φ (X ) φ (X )]. Snce every Hlbert space has an orthonormal bass (Theorem 2.4, [33]), we let {ψ, } = and {ψ 2,} = be correspondng orthonormal bases of H and H 2, respectvely. Consder the followng optmza- φ s a mappng from X to R and X s a mappng from Ω to X. Thus, we have φ (X ) = φ X Ω R. 4

6 ton: max a,j where ρ j E[ψ, (X ) ψ 2,j (X 2 )]. a, a 2,j ρ j (2.2),j a 2,j =, =, 2, j= j= a,j E[ψ,j (X )] = 0, =, 2, Proposton Suppose φ ( ) and a,j are optmal solutons of optmzatons (2.) and (2.2), respectvely. Then, we have φ (x) = j= Moreover, the jont probablty dstrbuton can be wrtten as Proof A proof s presented n Secton 0.. a,j ψ,j (x). (2.3) P X X 2 (x, x 2 ) = ρ j ψ, (x )ψ 2,j (x 2 ).,j Proposton provdes an alternatve optmzaton (2.2) to solve the maxmal correlaton problem (2.). Selectng approprate orthonormal bases for Hlbert spaces H and H 2 s crtcal to obtanng a tractable optmzaton (2.2). In the followng, we use Proposton to solve the maxmal correlaton optmzaton for general dscrete varables as well as for jontly Gaussan varables. Example 2 (MC for Dscrete Random Varables) Suppose X and X 2 are two dscrete random varables wth a jont probablty functon P X,X 2 (, ). Let {,..., X } and {,..., X 2 } be alphabets of random varables X and X 2, respectvely. We choose the followng orthonormal bases for H and H 2 : ψ, (x) = {x = } PX () and ψ 2,j(x) = {x = j} PX2 (j). By these selectons of bases, we have ρ j = E[ψ, (X ) ψ 2,j (X 2 )] = P X X 2 (, j) PX () P X2 (j). Moreover, we have E[ψ,j (X )] = P X (j), =, 2. Thus, optmzaton (2.2) s smplfed to the followng optmzaton: max P X,X a, a 2 (, j) 2,j,j PX () P X2 (j) X j= X j= (a,j ) 2 =, =, 2, a,j PX (j) = 0, =, 2. (2.4) 5

7 Accordng to Proposton, to solve MC optmzaton (2.) t s suffcent to fnd an optmal soluton of optmzaton (2.4). In the followng, we show that an optmal soluton of optmzaton (2.4) can be computed n a closed form usng matrx spectral decomposton. Defne the normalzed jont dstrbuton matrx as whose sze s X X 2. Let be coeffcent vectors. Moreover, let Q(, j) P X,X 2 (, j) PX () P X2 (j) a (a,, a,2,..., a, X ) T and a 2 (a 2,, a 2,2,..., a 2, X2 ) T (2.5) p ( P X (), P X (2),..., P X ( X )) T (2.6) p2 ( P X2 (), P X2 (2),..., P X2 ( X 2 )) T be vectors of square roots of margnal probabltes. Optmzaton (2.4) can be re-wrtten as follows: max a T Q a 2 (2.7) a 2 =, =, 2, a p, =, 2. In the followng, we show that optmal coeffcent vectors a and a 2 of optmzaton (2.7) are equal to the left and rght sngular vectors of the matrx Q correspondng to ts second largest sngular value. Moreover, the optmal value (the maxmal correlaton between two varables X and X 2 ) s equal to the second largest sngular value of the matrx Q. To show ths, we defne random varables Z and Z 2 such that P Z = a, PX (), Z 2 = a 2,j = P PX2 (j) X,X 2 (, j), where a = and a 2 =. Usng the Cauchy-Schwartz nequalty, we have that a T Qa 2 = E[Z Z 2 ] E[Z 2]E[Z2 2 ] = a a 2 =. Therefore, the maxmum sngular value of Q s at most one. Usng (2.5), one can see that the rght and left sngular vectors of Q wth the sngular value one are p and p 2, respectvely. Thus, the feasble set of optmzaton (2.7) ncludes unt-norm vectors orthogonal to leadng sngular vectors of Q. Thus, the optmal value s equal to the second largest sngular value and optmal vectors a and a 2 are left and rght sngular vectors correspondng to the second largest sngular value. Example 3 (MC for Jontly Gaussan Random Varables) Ths example s studed n reference [4] to compute MC between two Gaussan varables. In Secton 6, we use a smlar approach to characterze network maxmal correlaton for jontly Gaussan varables. Suppose (X, X 2 ) are jontly Gaussan varables wth the correlaton coeffcent ρ. The k-th Hermtte-chebychev polynomal s defned as dk Ψ k (x) = ( ) k e x2 dx k e x2. (2.8) 6

8 These polynomals form an orthonormal bass wth respect to Gaussan dstrbutons. That s, H (x )H j (x 2 )f(x, x 2 )dx dx 2 = ρ =j, (2.9) where f(x, y) s the jont densty functon of Gaussan varables wth correlaton ρ, and =j s one when = j, otherwse t s zero. Let ψ,j to be the j-th Hermtte-Chebychev polynomal, for =, 2. Usng (2.9), we have Moreover, we have ρ j = E[ψ, (X ) ψ 2,j (X 2 )] = ρ =j. E[ψ,j (X )] = j=0, =, 2, (2.0) because all of these functons for j have zero means over a Gaussan dstrbuton. Therefore, optmzaton (2.2) can be wrtten as max a, a 2, ρ (2.) =0 j=0 (a,j ) 2 =, =, 2, a,0 = 0, =, 2. Snce ρ, an optmal soluton of optmzaton (2.) s obtaned when a, =, a 2, =, whle other coeffcents are equal to zero. The sgns of a, for =, 2 are determned so that a, a 2, ρ = ρ. Ths leads to the maxmal correlaton ρ between two varables that s equal to the absolute value of the correlaton coeffcent between them when the two random varables are jontly Gaussan. Moreover, optmal transformaton functons are φ (X ) = a, ψ, = ±X, =, 2, where sgns of varables are selected so that a, a 2, ρ = ρ. 3 Statstcal Propertes of Maxmal Correlaton In many applcatons, often only nosy samples of jont dstrbutons are observed. In ths secton, we prove a fnte sample generalzaton bound, and error guarantees for maxmal correlaton of dscrete random varables. Specfcally, under some general condtons, we prove that ˆ maxmal correlaton s a contnuous measure wth respect to the jont probablty dstrbuton. That s, a small perturbaton n the dstrbuton results n a small change n the MC value. ˆ sample maxmal correlaton between two varables, computed usng m samples from the jont dstrbuton, converges exponentally fast to the MC value, as m grows. These propertes establsh maxmal correlaton as a robust assocaton measure to capture nonlnear dependences between varables n real-world applcatons. 7

9 Throughout ths subsecton we only consder dscrete random varables and assume that all alphabet elements x X have postve probabltes (otherwse they can be neglected wthout loss of generalty). That s, f δ arg mn x X P X (x ), =, 2, (3.) then δ(p ) mn{δ, δ 2 } > 0. The emprcal dstrbuton of these varables usng m observed samples s defned as P (m) (x, x 2 ) = m m = {x() = x, x () 2 = x 2 }, where {x (), x() 2 }m = are..d. samples drawn accordng to a dstrbuton P X,X 2. The vector of observed samples of varable X s denoted by x = (x (), x (2),..., x (m) ). For any vector v = (v,..., v p ) R d and p, we let v p represent the standard p-norm of the vector v defned as d v p = ( v p p ). For p = 2, we drop the subscrpt f no confuson arses,.e., v = v Contnuty of Maxmal Correlaton = Let P X,X 2 (, ) and P X,X 2 (, ) be two dstrbutons over alphabets (X, X 2 ) wth the correspondng MC values ρ and ρ, respectvely. In the followng, we show that f the dstance between P and P s small (.e., P P ɛ), ther correspondng MC values (ρ and ρ) are close to each other as well. Theorem Let P P ɛ, for some ɛ > 0. Then, we have where D max{ X, X 2 }, and δ mn(δ(p ), δ( P )). Proof A proof s presented n Secton 0.2. ρ ρ 2 ɛ δ 2 D3/2, (3.2) The sketch of the proof s as follows: The normalzed jont dstrbuton matrx Q (2.5) can be wrtten as Q = D X (P ) 2 PX,X 2 D X2 (P ) 2, (3.3) where D X (P ) denotes a dagonal matrx whose dagonal s P X, for =, 2. Snce the matrx Q s a contnuous functon of P X,X 2 (, ), ts sngular values (and therefore ts second largest sngular value) are contnuous functons of P as well. 3.2 Sample Maxmal Correlaton Let {x, x 2 }m = be..d. samples drawn accordng to a jont probablty dstrbuton P X,X 2 (, ). Suppose P (m) (, ) denotes the emprcal dstrbuton obtaned from these samples. Maxmal correlaton computed usng ths emprcal probablty dstrbuton s called Sample Maxmal Correlaton and s denoted by ρ m (X, X 2 ). In the followng, we show that ρ m (X, X 2 ) converges to ρ(x, X 2 ) exponentally fast, as m. 8

10 Theorem 2 For any dstrbuton P, and any ɛ > 0, P [ ρ m (X, X 2 ) ρ(x, X 2 ) > ɛ] 0, exponentally fast. More precsely, f then, m 3 24 D log ( δ(p ) 2 ɛ η ), P [ ρ m (X, X 2 ) ρ(x, X 2 ) > ɛ] η, where D = max{ X, X 2 }. The bound can also be wrtten as P [ ρ m (X, X 2 ) ρ(x, X 2 ) > ɛ] 24 exp ( mδ(p )2 ɛ 3 D ). Proof A proof s presented n Secton 0.3. The proof follows from the facts that maxmal correlaton s a contnuous functon of the nput dstrbuton accordng to Theorem, and the emprcal dstrbuton converges exponentally fast to the true dstrbuton. 4 Network Maxmal Correlaton 4. Defnton and General Propertes In ths secton, we ntroduce Network Maxmal Correlaton (NMC) as a fundamental measure to capture nonlnear assocatons over networks. Let G = (V, E) be a graph wth n nodes and E edges. The graph G s un-weghted, does not have self-loops, and can be drected or undrected. Each node s assgned to a random varable X. Here, we ntroduce NMC wthout assumng a specfc relatonshp among node varables and the graph structure. We dscuss ths relatonshp n dfferent applcatons of NMC n Sectons 6, 7, and 8. NMC nfers best nonlnear transformaton functons assgned to each node varable so that the total parwse correlaton over the network s maxmzed. Suppose X,..., X n are n random varables defned on probablty space (Ω, F, P ), where X takes values n (X, B ). The map X (Ω, F) (X, B ) generates the subalgebra F = X (B ) of F. Let P X be the restrcton of the measure P on F, =,..., n. For dscrete varables, X and X 2 are ther fnte support sets wth cardnaltes n = X, for =,..., n. Defnton 3 (Network Maxmal Correlaton) Network maxmal correlaton among varables X,..., X n connected by a graph G = (V, E) s defned as ρ G (X,..., X n ) max φ,...,φ n E[φ (X ) φ j (X j )], (4.) (,j) E subject to φ (X ) Ω R s measurable, E[φ (X )] = 0, and E[φ (X ) 2 ] =, for n. The Optmzaton (4.) maxmzes total parwse correlaton over the network wthout dstngushng among postve and negatve correlatons. In some applcatons, the strength of an assocaton does not depend on the sgn of the correlaton coeffcent. In those cases, one can re-wrte the NMC optmzaton (4.) to maxmze the total absolute parwse correlatons over the network as follows: 9

11 Defnton 4 (Absolute Network Maxmal Correlaton) Consder the followng optmzaton: max φ,...,φ n E[φ (X ) φ j (X j )], (4.2) (,j) E subject to φ (X ) Ω R s measurable, E[φ (X )] = 0 and E[φ 2 (X )] =, for any n. We refer to ths optmzaton as an absolute NMC optmzaton. Let φ ( ) be an optmal soluton of the NMC optmzaton (4.) (n Proposton 2 we prove the exstence of such soluton). Then, an edge maxmal correlaton between varables and j s defned as ρ G (X, X j ) E[φ (X ) φ j (X j )], (4.3) where (, j) E. Unlke maxmal correlaton formulaton of (2.), transformaton functons n optmzaton (4.) are constraned by the network structure. Therefore, an edge maxmal correlaton between varables X and X j s always smaller than or equal to ther maxmal correlaton,.e., ρ G (X, X j ) ρ(x, X j ). Computaton of maxmal correlaton for each edge ndependently results n two nonlnear functons assgned to nodes of that edge. Therefore, f the network has E edges, t wll result n nference of 2 E possbly nonlnear functons. In that setup, each node can be assocated to dfferent nonlnear transformaton functons whch can rase over-fttng ssues partcularly for dense networks. On the other hand, n the NMC formulaton of (4.), we assgn a sngle functon to each node n the graph. Therefore, optmzaton (4.) results n n possbly nonlnear functons. Lemma The NMC optmzaton (4.) s equvalent to the followng MSE optmzaton: mn φ,...,φ n 2 E[(φ (X ) φ j (X j )) 2 ], (4.4) (,j) E where E[φ (X )] = 0 and E[φ 2 (X )] =, for any n. Proof A proof s presented n Secton 0.4. Smlarly to Defnton 2, for =, 2,..., n, we defne a Hlbert space H as H = {φ (X ) φ (X ) s measurable, E[φ (X )] = 0, E[(φ (X )) 2 ] < }, where the product s defned as φ, φ E[φ (X ) φ (X )]. The followng proposton shows the exstence of optmal transformatons of the NMC optmzaton (4.): Proposton 2 Under the assumpton that Hlbert spaces H s are compact, there exst functons φ such that E[φ (X )] = 0 and E[φ (X ) 2 ] = for n, that acheve the optmal value of optmzaton (4.). Proof A proof s presented n Secton 0.5. The assumpton that Hlbert spaces H s are compact holds when X s are dscrete random varables wth fnte support, or when X s are jontly Gaussan random varables. 0

12 Let P denote the projecton operaton from the space H j (for any j ) onto H, for any n. Accordng to Lemma 5 ths projecton can be characterzed usng condtonal expectatons a s follows: For random varable φ j H j, we have P φ j = argmn φ H E [(φ φ j ) 2 ] = E[φ j X ] E[φj X ] 2. The followng proposton characterzes optmal NMC transformaton functons usng projecton operators: Proposton 3 Optmal transformaton functons of NMC optmzaton (4.) {φ, n} satsfy φ = j N () P φ j j N () P φ j = E[ j N () φ j X ] E[ j N () φ j X ], (4.5) where N () represents neghbors of node n the graph G = (V, E). Proof A proof s presented n Secton 0.6. Note A smlar approach can be used to characterze the absolute NMC optmzaton (4.2) by ntroducng extra varables to represent correlaton sgns of edges: max s,j E[φ (X )φ j (X j )] (,j) E E[φ (X )] = 0, E[φ 2 (X )] =, n. (4.6) In ths case, smlarly to Proposton (0.6), we can wrte where φ = j N () s j P φ j j N () s j P φ j, s j = sgn (E[φ (X )φ j (X j )]). Proposton 3 characterzes a property of optmal transformatons of NMC (4.) usng projecton operatons wthout explctly computng the optmal NMC soluton. In the followng, we use orthonormal representatons of the Hlbert spaces H and propose a constructve approach to solve the NMC optmzaton. Recall that {ψ,j } j= represents an orthonormal bass for H. Consder the followng optmzaton: max (, ) E j= j= j,j a,j a,j a 2,j =, n, a,j E[ψ,j (X )] = 0, n, ρj,j, (4.7) where ρ j,j, E[ψ,j (X ) ψ,j (X )].

13 Theorem 3 Suppose φ ( ) and a,j are optmal solutons of optmzatons (4.) and (4.7), respectvely. Then, we have φ (x) = Proof A proof s presented n Secton 0.7. j= a,j ψ,j (x). (4.8) Smlarly to the case of two varables dscussed n Secton 2.2, selectng approprate Hlbert spaces H s crtcal to have a tractable optmzaton (4.7). In the followng, we consder the NMC optmzaton for dscrete varables, whle the Gaussan case s dscussed n Secton 6. Example 4 (NMC for Dscrete Random Varables) Suppose X s a dscrete random varable wth alphabet {,..., X }. Smlarly to Example 2, let ψ,j (x) = {x = j} be an orthonormal bass for H. Thus, we have ρ j,j, = E[ψ,j (X ) ψ,j (X )] = P X X (j, j ) PX (j) P X (j ). PX (j) Therefore, optmzaton (4.7) s smplfed to the followng optmzaton: max (, ) E X j= X j= P XX a,j a (j, j ),j j,j PX (j) P X (j ) (a,j ) 2 =, n, a,j PX (j) = 0, n. (4.9) Smlarly to Example 2, we defne the matrx Q, Q, (j, j ) whose sze s X X. Moreover, recall that for =,..., n, we have as P X,X (j, j ) PX (j) P X (j ), (4.0) a = (a,, a,2,..., a, X ) T p = ( P X (), P X (2),..., P X ( X )) T. Therefore, optmzaton (4.9) can be re-wrtten as follows: max a T Q, a (4.) (, ) E a 2 =, n, a p, n. Optmzaton (4.) s not convex nor concave n general. In Secton 5.2, we show that ths optmzaton s an nstance of the standard Maxmum Correlaton Problem (MCP) proposed by Hotellng [24, 25]. By makng ths connecton, we use establshed technques of solvng Multvarate Egenvalue Problem (MEP) to solve optmzaton (4.). 2

14 4.2 Statstcal Propertes of NMC In ths part, we nvestgate the robustness of NMC for dscrete varables wth fnte support aganst small perturbatons of jont probablty dstrbutons of varable pars. Moreover, we show that sample NMC (.e., NMC computed usng emprcal dstrbutons) converges to the true NMC value exponentally fast as the sample sze ncreases. To smplfy notaton, suppose P, s the matrx representaton of the jont probablty dstrbuton of varables X and X. Theorem 4 Network maxmal correlaton s a contnuous functon of the jont probablty dstrbutons P,, for all (, ) E. Let P, P, ɛ, for some ɛ > 0, and all (, ) E. Then, we have ρ G ρ G ɛ E D δ 2, (4.2) where D = max{ X,..., X n }, and δ = mn n (mn{δ(p X ), δ( P X )}). Proof A proof s presented n Secton 0.8. Next, we show that the sample NMC denoted by ρ m (G) converges to the true NMC value ρ G exponentally fast, as the sample sze m ncreases: Theorem 5 Sample NMC converges to NMC, exponentally fast. Partcularly, let δ = mn n δ(p X ) and D = max{ X,..., X n }. Then, for we have Proof A proof s presented n Secton 0.9. m ( 24 E 2 D 3 8 max{ V, E } ɛ 2 δ 2 ) log ( ), (4.3) η P[ ρ m (G) ρ G > ɛ] η. (4.4) Note that for the case of havng two varables, robustness bounds provded n Theorems 4 and 5 are more loose compared to bounds provded by Theorem and 2 owng to the generalty of relaxatons used n NMC performance characterzaton. 4.3 Regularzed NMC In ths secton, we assume that all varables are real-valued (note that ths s not a necessary condton for MC and NMC). The NMC optmzaton (4.) results n n possbly nonlnear transformaton functons φ (X ) whose dstances from orgnal varables can be arbtrarly large (.e., E[φ (X ) X ] can be arbtrarly small). In some applcatons, one may wsh to have fewer than n nonlnear transformaton functons assgned to varables, or alternatvely to control dstances among transformed and orgnal varables. Here, we propose a regularzed NMC optmzaton framework whch penalzes dstances among optmal transformaton functons φ (X ) and the orgnal varables X. Suppose varables have mean zero and unt varance. I.e., E[X ] = 0 and E[X 2 ] =. 3

15 Algorthm Alternatng Condtonal Expectaton Intalzaton: φ (0) (X ), φ (0) 2 (X 2) wth mean zero. for k=0,,... φ (k+) (X ) = E[φ (k) 2 (X 2) X ]. update: φ (k+) (X ) = φ (k+) 2 (X 2 ) = E[φ (k) update: φ (k+) 2 (X 2 ) = (X ) X 2 ]. φ (k+) (X ) E[(φ (k+) (X )) 2 ] φ (k+) 2 (X 2 ) E[(φ (k+) 2 (X 2 )) 2 ] update: ρ (k+) = E[φ (k+) (X )φ (k+) 2 (X 2 )] end Defnton 5 (Regularzed NMC) Regularzed NMC among varables X,..., X n connected by a graph G = (V, E) s defned as the soluton of the followng optmzaton: max ( λ) φ,...,φ n (,j) E E[φ (X ) φ j (X j )] + λ E[φ (X ) X ], (4.5) V where E[φ (X )] = 0 and E[φ 2 (X )] =, for any n. 0 λ s the regularzaton parameter. Unlke MC and NMC, whch only depend on the jont dstrbutons of varables, the regularzed NMC depends on both jont dstrbutons and alphabets of varables because of the regularzaton term. Moreover, one can defne regularzed absolute NMC smlarly to optmzaton (4.2). Let optmal transformaton functons computed by optmzaton (4.5) be φ,λ. If λ = 0, φ,λ = φ, whle f λ =, φ,λ = X. By varyng λ between 0 and, transformaton functons vary from φ to X. Suppose ρ G,λ (X,..., X n ) E[φ,λ (X ) φ j,λ (X j)]. (,j) E Therefore, ρ G,0 = ρ G and ρ G, s the total lnear correlatons over the network. By the defnton of NMC, ρ G,0 ρ G,. 5 Computaton of MC and NMC In ths secton, we frst revew an exstng algorthm to compute MC and then ntroduce an effcent algorthm to compute NMC. We also propose a parallelzable verson of the NMC algorthm based on network parttonng and show that ts expected performance s ɛ-away from the true NMC value. 5. Computaton of Maxmal Correlaton Gven the jont dstrbuton of varables, one can use Proposton to compute the MC value and optmal transformaton functons. In partcular, for dscrete random varables, Example 2 shows that a soluton of optmzaton (2.) can be characterzed by the second largest sngular value of the normalzed jont dstrbuton matrx Q (2.5). Gven samples of varables (.e., {x (), x() 2 }m = ), one can compute MC usng the emprcal jont dstrbuton of varables P m (, ). Robustness of MC 4

16 computaton usng emprcal dstrbutons s dscussed n Secton 3. If alphabet szes of varables are large, formng the jont dstrbuton matrx can be costly. An teratve approach to compute maxmal correlaton wthout formng the jont dstrbuton functon s based on Alternatng Condtonal Expectaton (ACE) [3]. Brefly, at each nteracton, the ACE algorthm computes optmal transformaton functons usng condtonal expectatons, assumng that the other transformaton functon s fxed (n a Gauss-Sedel manner [34]). If the correlaton value does not ncrease by a certan value, the algorthm termnates. We descrbe the steps of ths algorthm n Algorthm. Proposton 4 The sequence ρ (k) generated by Algorthm converges to a local optmum of optmzaton (2.). Moreover, f startng ponts of Algorthm are such that vectors and (φ (0) () p X (),..., φ (0) ( X ) p X ( X )) (φ (0) 2 () p X2 (),..., φ (0) 2 ( X 2 ) p X2 ( X 2 )) are not orthogonal to the span of the left and rght sngular vectors correspondng to the second largest sngular value of Q, then ACE algorthm converges to the global optmum. Moreover, f the Q matrx has unque sngular vectors (left and rght) correspondng to the second largest sngular value, optmal transformaton functons are unque maxmzers of optmzaton (2.). Proof See Theorems 5.4 and 5.5 of reference [3]. 5.2 Computaton of NMC In ths secton, we frst establsh a connecton between the NMC optmzaton (4.) wth Maxmum Correlaton Problem (MCP) and Multvarate Egenvalue Problem (MEP) ([23 26]). Then, we deploy technques used to solve MEP and MCP n order to compute NMC. The Maxmum Correlaton Problem (MCP), proposed by Hotellng [24, 25], s to fnd the lnear combnaton of one set of varables that correlates maxmally wth the lnear combnaton of another set of varables. Ths problem s defned as max b n b T C,j b j,j= b =, n, (5.) where b R n and C,j R n n j. Optmzaton (5.) s n the standard form of the MCP problem [24,25]. Upon employng the Lagrange multpler theory [34], the frst-order optmalty condton for optmzaton (5.7) s the exstence of real-valued scalars, namely, Lagrange multplers λ,..., λ n, such that the followng system of equatons s satsfed: n j= C j b j = λ b, n b =, n. (5.2) Ths system of equatons s called Multvarate Egenvalue Problem (MEP). We next establsh the connecton between NMC and MCP. To that end, we defne the followng notaton: For each, 5

17 snce I X p p T s postve semdefnte, we take ts square root 2 and wrte I p p T = B B T, where I X s a X X dentty matrx. Let b B a. Let U Σ U T be the sngular value decomposton of B where U (j) s the j-th column of U and σ (j) s the j-th sngular value of B. We wll show that only one sngular value of B s zero whch s equal to the sngular vector p. Wthout loss of generalty, suppose σ = 0, for all. Defne A a X X matrx as follows: A (2) [U,..., U ( X ) ] dag σ (2),..., σ ( X ) (2) [U,..., U ( X ) ] T. (5.3) Snce σ (j) 0, for all n, and j 2, thus A s well-defned accordng to (5.3). Theorem 6 The NMC optmzaton (4.) can be re-wrtten as follows: max b b T A T (Q p T p ) A b (5.4) (, ) E s.t. b 2 =. (5.5) Proof A proof s presented n Secton 0.0. Let C be a matrx consstng of submatrces C, where f (, ) E, C, A T (Q p p T ) A, (5.6) otherwse C, s an all zero matrx of sze X X. Let b (b T,..., bt n ) T R M, where b R X and M = n = X. Proposton 5 The NMC optmzaton (5.4) can be wrtten as follows: max b T Cb (5.7) b 2 =, n. Optmzaton (5.7) s n the standard form of the MCP problem [24, 25]. After showng that the NMC optmzaton can be reformulated as the MCP, we use the exstng technques n the lterature to solve t. Several local maxmzers exst for cases that fndng a global optmum of optmzaton (5.7) s computatonal dffcult [23, 35]. For example, an aggregated power method that terates on blocks of C was proposed by Horst [36] as a general technque for solvng the MEP numercally. Below, we summarze general algorthmc deas to solve MCP: () Frst, an effcent algorthm s used to solve MEP whch s the necessary frst order condton for MCP. Ths step s studed n references [23, 36]. (2) Next, a strategy s used to properly choose startng ponts of the algorthm or jump out of the local mnma of optmzaton (5.7). Ths step s studed n [26, 37]. 6

18 Algorthm 2 Gauss-Sedel Algorthm for MEP Input: C R M R M. Intalzaton: b (0) R M. for k = 0,,... for =,..., n end end b (k) λ (k) b (k+) = j= C jb (k+) j = b (k) 2. = b (k) λ (k) + n j= C jb (k) j. An effcent algorthm to solve MEP: Algorthm 2 s a Gauss-Sedel algorthm [34] to solve MEP whch s proposed by [23]. Ths algorthm s essentally a varant of the classcal power teraton method (see e.g. [38]). Let r(b (k) ) = (b (k) ) T Cb (k), λ (b) = b T [C,..., C n ]b, and Λ(b) = dag(λ (b)i X,..., λ n (b)i Xn ). Theorem 7 ([26]) Suppose the matrx C s symmetrc. We have a) The sequence {r(b (k) )} generated by Algorthm 2 s monotoncally ncreasng and convergent. b) Let (Λ, b ) be a soluton of MEP. If b s a local maxmzer of (5.7), then for any, we have λ (b ) σ X (C ). Moreover, f b s a global maxmzer of (5.7), then for any, we have λ (b ) σ (C ), where are egenvalues of the matrx C. σ (C ) σ X (C ) A strategy for avodng local optmums of MCP: Let b be a soluton of MEP. Usng Theorem 7, snce C s a zero matrx, n order to have b a global maxmzer of optmzaton (5.7), we need to have λ (b ) 0. Based on ths observaton, we have the followng strategy for choosng an startng pont for Algorthm 2. Let b be a soluton of (5.2) wth the correspondng Λ. Suppose that there exst an n such that λ < 0. Let w be the unt vector assocated wth the egenvalue of λ I X. Now let ˆb = b q, 2 Square root of a symmetrc postve semdefnte matrx A s defned as A = UΣ /2 U T where A = UΣU T. 7

19 Algorthm 3 Network ACE to compute NMC Input: G, X,..., X n, Intalzaton: φ (0) (X ),..., φ (0) n (X n ) wth mean zero. for k = 0,... φ (X ) = φ (k) (X ), n for = n φ (X ) = E [ j N φ j (X j ) X ] end end φ (k+) ρ (k+) G update: φ (X ) = φ (X ) E[φ (X ) 2 ] (X ) = φ (X ), n = (,j) E E [φ (k+) (X )φ (k+) j (X j )] where q s a vector where q = 0 for all and q = 2w T b w stll satsfes ˆb = for all =,..., n and gves r(ˆb) = r( b) 4λ (w T b ) 2 > r( b). We repeat ths process untl we have λ 0 for all =,..., n. Note that ths s not a suffcent condton for the global maxmzer of (5.7) and s only a necessary condton as Theorem 7 shows. After repeatng ths procedure, the condton gven n Theorem 7 holds. We then call upon Algorthm 2 to produce yet a better soluton for (5.7). Based on Algorthm 2, we ntroduce an algorthm to compute NMC usng alternatng condtonal expectaton. We prove that the proposed algorthm converges to the local optmum of the NMC optmzaton (4.). We then use a strategy explaned n ths secton to jump out of local maxmzers. At each teraton of the algorthm, we update transformaton functons as follows: Suppose at teraton r, transformaton functons are φ r j. If we fx all varables except the transformaton functon of node, an optmal soluton of φ r+ can be wrtten as the normalzed condtonal expectaton of functons of ts neghbors (see Proposton 3). In each update, the objectve functon of the NMC optmzaton ncreases or stays the same. Proposton 6 The sequence {ρ (k) G soluton of NMC optmzaton (4.). } k=0 generated by Algorthm 3 converges to a local optmum Proof A proof s presented n Secton 0.. Fgure 2 llustrates the convergence of the ACE algorthm to compute NMC of sx jontly Gaussan varables connected over a complete graph (for more detals, see Example 5). Proposton 7 The computatonal complexty of each teraton of Algorthm 3 s where d max s the maxmum node degree and D = max X. O(nDd max + E ), (5.8) Smlarly to Algorthm 2 for solvng MCP, Algorthm 3 fnds a local optmum soluton. Once the algorthm termnates, usng Theorem 7 and the strategy provded n the prevous secton, f the convergence pont does not satsfy the necessary condtons for a global optmum, we update the startng pont of Algorthm 3 and run t agan to reach a yet better soluton. 8

20 NMC value Number of teratons Fgure 2: An llustraton of the convergence of the ACE algorthm 3 to compute NMC over a complete graph connectng sx jontly Gaussan varables. Note that we can use a smlar algorthm to Algorthm 3 to compute regularzed NMC of Defnton 5. The objectve functon of the regularzed NMC optmzaton (4.5) can be wrtten as follows: V E[φ (X ) (( λ) j N () φ j (X j ) + λx )] (5.9) Thus, to compute the regularzed NMC, one can use a smlar ACE Algorthm 3 wth the followng updates for transformaton functons: φ (X ) = E[( λ) j N () φ j (X j ) + λx X ]. (5.0) If varables are contnuous and we only observe samples from ther jont dstrbutons, emprcal computaton of condtonal expectatons n Algorthm 3 may be challengng owng to the lack of suffcent samples at each pont. One way to overcome ths ssue s to dscretze contnuous varables by quantzng them. However, ths approach can ntroduce sgnfcant quantzaton errors. An alternatve approach to compute emprcal condtonal expectatons at pont x 0 R s to use all samples n ts B neghborhood (.e., x 0 B/2 x x 0 + B/2). By usng ths approach, the computaton of emprcal condtonal expectatons n Algorthm 3 can be wrtten as follows: φ (X = x ) = E[ φ j (X j ) x B/2 X x + B/2]. (5.) j N () We use ths approach n our ACE mplementatons to compute NMC for contnuous varables. If the graph G = (V, E) s sparse (.e., E = O(n)), Proposton 7 shows that the NMC computaton can be performed effcently n lnear tme complexty wth respect to the number of nodes n the network. However, f the graph s dense or the number of nodes n the network s large, ths computaton may be expensve. In the followng, we propose an approach to compute NMC usng parallel computatons. 9

21 5.3 Parallel Computaton of NMC For large and dense networks, exact computaton of NMC may become computatonally expensve (Proposton 7). For those cases, we propose a parallelzable algorthm whch approxmates NMC usng network parttonng. The dea can be descrbed as follow. For a gven graph G = (V, E), () Partton the graph nto small dsjont sets, (2) Estmate NMC for each partton ndependently, (3) Combne NMC solutons over sub-graphs to form an approxmaton of NMC for the orgnal graph. Defnton 6 An (ɛ, k)- parttonng of graph G = (V, E) s a dstrbuton on fnte parttons of V so that for any partton {V,..., V M } wth non zero probablty, V m k, for all m M. Moreover, the probablty that an edge falls across parttons s bounded by ɛ: for any e E, P[e E c ] ɛ, where E c = E m (V m V m ) s the set of cut edges. The probablty s wth respect to the dstrbuton on parttons. Defnton 7 A graph G s poly-growth f there exsts r > 0 and C > 0, such that for any vertex v n the graph, N v (d) Cd r, where N v (d) s the number of nodes wthn dstance d of v n G. Reference [39] descrbes the followng procedure for generatng an (ɛ, k) parttonng on a graph:. Gven G = (V, E), k, and ɛ > 0, we defne the truncated geometrc dstrbuton as follows: P[x = l] = { ɛ( ɛ)l, l < k, ( ɛ) k, l = k. (5.2) 2. We then order nodes arbtrarly,..., N. For node, we sample R accordng to dstrbuton (5.2) and assgn all nodes wthn that dstance from node to color (dstance s defned as the shortest path length on the graph). If a node has already colored, we re-color t wth color. 3. All nodes wth the same color form a partton. Proposton 8 If G s a poly-graph, then by selectng k = Θ( r ɛ log r ɛ ), the above procedure results n an (ɛ, Ck r ) partton. Proof See reference [39]. Next, we use an (ɛ, k)-graph parttonng to approxmate NMC over large graphs usng parallel computatons. Consder the followng approach: () Gven an (ɛ, k)- parttonng of G, we sample a partton {V,..., V M } of V. 20

22 (2) For each partton m M, we compute NMC over G m = (V m, E (V m V m )), denoted by ˆρ Gm. (3) Let ˆρ G = M m= ˆρ(G m ) be an approxmaton of ρ G. In the followng, we bound the approxmaton error by boundng boundary effects: Theorem 8 Consder an (ɛ, k)- parttonng of the graph G. We have, where the expectaton s over (ɛ, k)- parttons of graph G. Proof A proof s presented n Secton 0.2. E[ˆρ G ] ( ɛ)ρ G, (5.3) 6 NMC Applcaton n Inference of Nonlnear Gaussan Graphcal Models In ths secton, we dscuss an applcaton of NMC to nfer graphcal models for nonlnear functons of jontly Gaussan varables. Suppose (X,..., X n ) are jontly Gaussan varables wth zero means and unt varances. Let ρ, be the correlaton coeffcent between varables X and X. Let ψ,j to be the j-th Hermtte-Chebychev polynomal (2.8), for n. Recall that these polynomals form an orthonormal bass wth respect to Gaussan dstrbuton (see Example 3 and reference [4] for convergence detals). We have ρ j,j, = E[ψ,j (X ) ψ,j (X )] (6.) = ρ j, j=j, where j=j s equal to one f j = j, otherwse t s zero. Moreover, usng the defnton of Hermtte- Chebychev polynomals (2.8), we have E[ψ,j (X )] = j=0, n. (6.2) because all of these functons for j have zero means over a Gaussan dstrbuton. Therefore, optmzaton (4.7) can be wrtten as max (, ) E j=2 j=2 a,j a,j ρ j, (6.3) (a,j ) 2 =, n. In general, solvng optmzaton (6.3) s NP-complete. We establsh ths by dentfyng that one nstance of ths optmzaton s smplfed to the max-cut problem whch s NP-complete [27]. Theorem 9 Let s {, } for n. Let G = (V, E) be a complete graph. Suppose ( s s )ρ, 0, n, s s ρ, ρ 2,, n. Then, a = (0, s, 0,..., 0), for n s a global maxmzer of optmzaton (6.3). 2

23 Proof A proof s presented n Secton 0.3. Proposton 9 Under assumptons of Theorem 9, the NMC optmzaton (4.) s smplfed to the followng max-cut optmzaton: max s s ρ, (6.4) s {, }, n. Moreover, for all n, we have φ (X ) = s X, where φ and s are optmal solutons of optmzatons (3) and (6.4), respectvely. Proof A proof s presented n Secton 0.4. Note 2 Under the condtons of Theorem 9, one can compute the strength of the nonlnear relatonshps among varables by solvng multple parwse MC optmzaton (2.). However, one needs to solve optmzaton (6.4) to compute the sgns of covarance coeffcents. In general, Max-Cut optmzaton (6.4) s NP-complete [27]. However, there exst algorthms to approxmate ts soluton usng Semdefnte Programmng (SDP) wth approxmaton factor of [28]. Corollary Let φ (X ) be an optmal soluton of NMC optmzaton (3). Under assumptons of Theorem 9, f then, φ (X ) = X. ρ, ρ 2,, n, (6.5) Intutvely, the assumptons of Corollary consders jontly Gaussan varables wth correlaton coeffcents that are mostly postve. However, the covarance matrx can have negatve values as well. We wll show that ths assumpton s crtcal n graphcal model nference of nonlnear jontly Gaussan varables. Suppose (X,..., X n ) are jontly Gaussan varables wth the covarance matrx Λ X. Wthout loss of generalty, we assume all varables have zero means and unt varances. I.e., E[X ] = 0 and E[X 2 ] =, for all n. Let J X be the nformaton (precson) matrx [40] of these varables where J X = Λ X. Defne G X = (V X, E X ) such that, (, j) E X f and only f J X (, j) 0. Theorem 0 (e.g., [40]) If (, j) E X, then where represents ndependency between varables. X X j {X k, k, j} (6.6) Theorem 0 represents a way to explctly model the jont dstrbuton of Gaussan varables usng a graphcal model G X = (V X, E X ). Ths result s crtcal n several applcatons nvolved wth Gaussan varables whch requres computaton of margnal dstrbutons, or computaton of the mode of the dstrbuton. These computatons can be performed effcently over the graphcal model usng belef propagaton approaches [40]. Moreover, Gaussan graphcal models play an mportant 22

24 role n many applcatons such as lnear regresson [4], partal correlaton [42], maxmum lkelhood estmaton [43], etc. In many applcatons, even f varables are not jontly Gaussan, a Gaussan approxmaton s used often n practce, partally owng to the effcent nference of ther graphcal models. In the followng, under some condtons, we use the NMC optmzaton to characterze graphcal models for functons of latent jontly Gaussan varables. These functons are unknown, bjectve, and can be lnear or nonlnear. More precsely, let Y = f (X ), where f R R s a bjectve and dfferentable functon. Our goal s to characterze a graphcal model for varables (Y, Y 2,..., Y n ) wthout the knowledge of f ( ) functons. Consder the followng NMC optmzaton: max E[g (Y ) g (Y )], (6.7) (, ) E[g (Y )] = 0, n, E[g 2 (Y )] =, n. Suppose g ( ) represents an optmal soluton for optmzaton (6.7). Defne the matrx Λ nmc such that Λ nmc (, j) = E[g (Y )gj (Y j)]. Moreover, let J nmc = Λ nmc. Defne G nmc = (V nmc, E nmc ) such that, (, j) E nmc ff J nmc (, j) 0. The followng theorem characterzes the graphcal model of varables (Y,..., Y n ). Theorem Suppose X s satsfy the condtons of Corollary. If (, j) E nmc, then Proof A proof s presented n Secton 0.5. Y Y j {Y k, k, j}. (6.8) Under the condtons of Corollary, Theorem characterzes the graphcal model of varables Y s that are related to latent jontly Gaussan varables X s through unknown bjectve functons f s. The famly of dstrbutons consdered n Theorem s broad and ncludes many Gaussan dstrbutons as well as dstrbutons whose varables are bjectve functons of Gaussan varables. Graphcal models characterzed n Theorem can be used n computaton of margnal dstrbutons, computaton of the mode of the jont dstrbuton, and n other applcatons of estmaton and predcton smlarly to the case of Gaussan graphcal models. Note that Theorem only consders f functons that are bjectve and dfferentable. If these functons were not bjectve, the feasble set of optmzaton (6.7) s n fact smaller than the feasble set of the orgnal NMC optmzaton (4.) over Gaussan varables. Example 5 Consder sx jontly Gaussan varables X,...,X 6, each wth unt varance and mean 23

25 X X X X2 0. X X3 0. X X6 ph3 ph2 ph Y6 Y X ph6 X2 Y Y3 Y2 X ph4 0. after NMC Y 0.02 (b) before NMC 0.08 ph5 (a) X Fgure 3: (a) Relatonshps among latent jontly Gaussan varables X and ther nonlnear observatons Y. (b) Relatonshps among latent jontly Gaussan varables and nferred transformatons usng the NMC optmzaton. zero. We observe Y = f (X ) where, 0X, Y = f (X ) = 0 X, Y2 = f2 (X2 ) = e20x2, f X 0, otherwse, (6.9) Y3 = f3 (X3 ) = X3, Y4 = f4 (X4 ) = X43, e20x5, Y5 = f5 (X5 ) = 20X 5, e Y6 = f6 (X6 ) = X65. f X5 0, otherwse, The functons f ( ) reman unknown for the nference part. Relatonshps among orgnal, observed and NMC varables are depcted n Fgure 3. Then, we use φ (Y ) to nfer the underlyng covarance and precson matrces accordng to Theorem. As t s llustrated n Fgure 4, covarance and precson matrces computed usng observed varables Y show sgnfcant errors compared to the true networks owng to the exstence of extensve nonlnear relatonshps. However, nferred covarance and precson matrces usng the NMC optmzaton closely approxmate the true covarance and precson matrces, respectvely (Theorem ). Small errors n covarance coeffcent estmaton n ths example are owng to computaton of the NMC soluton usng emprcal dstrbutons accordng to the ACE algorthm 3. 24

26 (a) before NMC after NMC (b) before NMC after NMC error error.2 Fgure 4: For sx nonlnear functons of jontly Gaussan varables descrbed n Example 5, panel (a) plots covarance matrx errors before and after NMC. Usng NMC, estmated covarance coeffcents among varables are close to true ones. Panel (b) plots precson matrx errors before and after NMC. Usng NMC, estmated elements of the precson matrx are close to true ones. 7 NMC Applcaton n Inference of Nonlnear Relevance Graphs Relevance graphs (RG s) play an mportant role n many applcatons ncludng systems bology, socal and economc scences as they characterze varable pars wth hghest observed smlartes [29]. Let X,...,X n be n random varables wth zero means and unt varances. Consder a smlarty measure S(X, X j ) between varables X and X j. If the smlarty measure between varables X and X j s ndependent of other varables, the resultng graph s called a parwse relevance graph (PRG). Defnton 8 Consder the followng optmzaton: G S = arg max G=(V,E) E =k S(X, X j ). (7.) (,j) E Then, G S = (V, E ) s called a parwse relevance graph (PRG) of varables X,..., X n, wth k edges, correspondng to the smlarty measure S(, ). PRG s can be nferred effcently n practce. In the followng, we hghlght two examples of such graphs: Example 6 If S(, ) s a correlaton-based smlarty measure (e.g., S(X, X j ) = E[X X j ] ), the optmzaton (7.) results n a correlaton-based PRG. Ths graph only captures the top k lnear assocatons among varables, gnorng nonlnear ones. Example 7 If S(, ) s a mutual nformaton-based smlarty measure (.e., S(X, X j ) = I(X ; X j ) where I(.;.) represents the mutual nformaton functon [5]), optmzaton (7.) results n an MIbased PRG whch captures nonlnear assocatons among pars of varables. However, t does not provde explct forms of such nonlnear relatonshps. 25

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Luby s Alg. for Maximal Independent Sets using Pairwise Independence Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

L10: Linear discriminants analysis

L10: Linear discriminants analysis L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss

More information

v a 1 b 1 i, a 2 b 2 i,..., a n b n i.

v a 1 b 1 i, a 2 b 2 i,..., a n b n i. SECTION 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS 455 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS All the vector spaces we have studed thus far n the text are real vector spaces snce the scalars are

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

Production. 2. Y is closed A set is closed if it contains its boundary. We need this for the solution existence in the profit maximization problem.

Production. 2. Y is closed A set is closed if it contains its boundary. We need this for the solution existence in the profit maximization problem. Producer Theory Producton ASSUMPTION 2.1 Propertes of the Producton Set The producton set Y satsfes the followng propertes 1. Y s non-empty If Y s empty, we have nothng to talk about 2. Y s closed A set

More information

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements Lecture 3 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there

More information

How To Calculate The Accountng Perod Of Nequalty

How To Calculate The Accountng Perod Of Nequalty Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

PERRON FROBENIUS THEOREM

PERRON FROBENIUS THEOREM PERRON FROBENIUS THEOREM R. CLARK ROBINSON Defnton. A n n matrx M wth real entres m, s called a stochastc matrx provded () all the entres m satsfy 0 m, () each of the columns sum to one, m = for all, ()

More information

1 Example 1: Axis-aligned rectangles

1 Example 1: Axis-aligned rectangles COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo

More information

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange

More information

Extending Probabilistic Dynamic Epistemic Logic

Extending Probabilistic Dynamic Epistemic Logic Extendng Probablstc Dynamc Epstemc Logc Joshua Sack May 29, 2008 Probablty Space Defnton A probablty space s a tuple (S, A, µ), where 1 S s a set called the sample space. 2 A P(S) s a σ-algebra: a set

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

Fisher Markets and Convex Programs

Fisher Markets and Convex Programs Fsher Markets and Convex Programs Nkhl R. Devanur 1 Introducton Convex programmng dualty s usually stated n ts most general form, wth convex objectve functons and convex constrants. (The book by Boyd and

More information

We are now ready to answer the question: What are the possible cardinalities for finite fields?

We are now ready to answer the question: What are the possible cardinalities for finite fields? Chapter 3 Fnte felds We have seen, n the prevous chapters, some examples of fnte felds. For example, the resdue class rng Z/pZ (when p s a prme) forms a feld wth p elements whch may be dentfed wth the

More information

BERNSTEIN POLYNOMIALS

BERNSTEIN POLYNOMIALS On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

where the coordinates are related to those in the old frame as follows.

where the coordinates are related to those in the old frame as follows. Chapter 2 - Cartesan Vectors and Tensors: Ther Algebra Defnton of a vector Examples of vectors Scalar multplcaton Addton of vectors coplanar vectors Unt vectors A bass of non-coplanar vectors Scalar product

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 71 (2011) 62 76 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. journal homepage: www.elsever.com/locate/jpdc Optmzng server placement n dstrbuted systems n

More information

Quantization Effects in Digital Filters

Quantization Effects in Digital Filters Quantzaton Effects n Dgtal Flters Dstrbuton of Truncaton Errors In two's complement representaton an exact number would have nfntely many bts (n general). When we lmt the number of bts to some fnte value

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

1. Measuring association using correlation and regression

1. Measuring association using correlation and regression How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a

More information

The Greedy Method. Introduction. 0/1 Knapsack Problem

The Greedy Method. Introduction. 0/1 Knapsack Problem The Greedy Method Introducton We have completed data structures. We now are gong to look at algorthm desgn methods. Often we are lookng at optmzaton problems whose performance s exponental. For an optmzaton

More information

When Network Effect Meets Congestion Effect: Leveraging Social Services for Wireless Services

When Network Effect Meets Congestion Effect: Leveraging Social Services for Wireless Services When Network Effect Meets Congeston Effect: Leveragng Socal Servces for Wreless Servces aowen Gong School of Electrcal, Computer and Energy Engeerng Arzona State Unversty Tempe, AZ 8587, USA xgong9@asuedu

More information

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm

A hybrid global optimization algorithm based on parallel chaos optimization and outlook algorithm Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(7):1884-1889 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A hybrd global optmzaton algorthm based on parallel

More information

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6 PAR TESTS If a WEIGHT varable s specfed, t s used to replcate a case as many tmes as ndcated by the weght value rounded to the nearest nteger. If the workspace requrements are exceeded and samplng has

More information

Availability-Based Path Selection and Network Vulnerability Assessment

Availability-Based Path Selection and Network Vulnerability Assessment Avalablty-Based Path Selecton and Network Vulnerablty Assessment Song Yang, Stojan Trajanovsk and Fernando A. Kupers Delft Unversty of Technology, The Netherlands {S.Yang, S.Trajanovsk, F.A.Kupers}@tudelft.nl

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

Loop Parallelization

Loop Parallelization - - Loop Parallelzaton C-52 Complaton steps: nested loops operatng on arrays, sequentell executon of teraton space DECLARE B[..,..+] FOR I :=.. FOR J :=.. I B[I,J] := B[I-,J]+B[I-,J-] ED FOR ED FOR analyze

More information

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems

Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems Jont Schedulng of Processng and Shuffle Phases n MapReduce Systems Fangfe Chen, Mural Kodalam, T. V. Lakshman Department of Computer Scence and Engneerng, The Penn State Unversty Bell Laboratores, Alcatel-Lucent

More information

CHAPTER 14 MORE ABOUT REGRESSION

CHAPTER 14 MORE ABOUT REGRESSION CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp

More information

SIMPLE LINEAR CORRELATION

SIMPLE LINEAR CORRELATION SIMPLE LINEAR CORRELATION Smple lnear correlaton s a measure of the degree to whch two varables vary together, or a measure of the ntensty of the assocaton between two varables. Correlaton often s abused.

More information

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications

Descriptive Models. Cluster Analysis. Example. General Applications of Clustering. Examples of Clustering Applications CMSC828G Prncples of Data Mnng Lecture #9 Today s Readng: HMS, chapter 9 Today s Lecture: Descrptve Modelng Clusterng Algorthms Descrptve Models model presents the man features of the data, a global summary

More information

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching)

Face Verification Problem. Face Recognition Problem. Application: Access Control. Biometric Authentication. Face Verification (1:1 matching) Face Recognton Problem Face Verfcaton Problem Face Verfcaton (1:1 matchng) Querymage face query Face Recognton (1:N matchng) database Applcaton: Access Control www.vsage.com www.vsoncs.com Bometrc Authentcaton

More information

Project Networks With Mixed-Time Constraints

Project Networks With Mixed-Time Constraints Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa

More information

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model

More information

A Probabilistic Theory of Coherence

A Probabilistic Theory of Coherence A Probablstc Theory of Coherence BRANDEN FITELSON. The Coherence Measure C Let E be a set of n propostons E,..., E n. We seek a probablstc measure C(E) of the degree of coherence of E. Intutvely, we want

More information

Logistic Regression. Steve Kroon

Logistic Regression. Steve Kroon Logstc Regresson Steve Kroon Course notes sectons: 24.3-24.4 Dsclamer: these notes do not explctly ndcate whether values are vectors or scalars, but expects the reader to dscern ths from the context. Scenaro

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES The goal: to measure (determne) an unknown quantty x (the value of a RV X) Realsaton: n results: y 1, y 2,..., y j,..., y n, (the measured values of Y 1, Y 2,..., Y j,..., Y n ) every result s encumbered

More information

Single and multiple stage classifiers implementing logistic discrimination

Single and multiple stage classifiers implementing logistic discrimination Sngle and multple stage classfers mplementng logstc dscrmnaton Hélo Radke Bttencourt 1 Dens Alter de Olvera Moraes 2 Vctor Haertel 2 1 Pontfíca Unversdade Católca do Ro Grande do Sul - PUCRS Av. Ipranga,

More information

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

+ + + - - This circuit than can be reduced to a planar circuit

+ + + - - This circuit than can be reduced to a planar circuit MeshCurrent Method The meshcurrent s analog of the nodeoltage method. We sole for a new set of arables, mesh currents, that automatcally satsfy KCLs. As such, meshcurrent method reduces crcut soluton to

More information

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression

A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,

More information

Ring structure of splines on triangulations

Ring structure of splines on triangulations www.oeaw.ac.at Rng structure of splnes on trangulatons N. Vllamzar RICAM-Report 2014-48 www.rcam.oeaw.ac.at RING STRUCTURE OF SPLINES ON TRIANGULATIONS NELLY VILLAMIZAR Introducton For a trangulated regon

More information

Point cloud to point cloud rigid transformations. Minimizing Rigid Registration Errors

Point cloud to point cloud rigid transformations. Minimizing Rigid Registration Errors Pont cloud to pont cloud rgd transformatons Russell Taylor 600.445 1 600.445 Fall 000-014 Copyrght R. H. Taylor Mnmzng Rgd Regstraton Errors Typcally, gven a set of ponts {a } n one coordnate system and

More information

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation Exhaustve Regresson An Exploraton of Regresson-Based Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The

More information

Enabling P2P One-view Multi-party Video Conferencing

Enabling P2P One-view Multi-party Video Conferencing Enablng P2P One-vew Mult-party Vdeo Conferencng Yongxang Zhao, Yong Lu, Changja Chen, and JanYn Zhang Abstract Mult-Party Vdeo Conferencng (MPVC) facltates realtme group nteracton between users. Whle P2P

More information

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008 Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn

More information

Downlink Power Allocation for Multi-class. Wireless Systems

Downlink Power Allocation for Multi-class. Wireless Systems Downlnk Power Allocaton for Mult-class 1 Wreless Systems Jang-Won Lee, Rav R. Mazumdar, and Ness B. Shroff School of Electrcal and Computer Engneerng Purdue Unversty West Lafayette, IN 47907, USA {lee46,

More information

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL

More information

A Lyapunov Optimization Approach to Repeated Stochastic Games

A Lyapunov Optimization Approach to Repeated Stochastic Games PROC. ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, OCT. 2013 1 A Lyapunov Optmzaton Approach to Repeated Stochastc Games Mchael J. Neely Unversty of Southern Calforna http://www-bcf.usc.edu/

More information

POLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and

POLYSA: A Polynomial Algorithm for Non-binary Constraint Satisfaction Problems with and POLYSA: A Polynomal Algorthm for Non-bnary Constrant Satsfacton Problems wth and Mguel A. Saldo, Federco Barber Dpto. Sstemas Informátcos y Computacón Unversdad Poltécnca de Valenca, Camno de Vera s/n

More information

Logical Development Of Vogel s Approximation Method (LD-VAM): An Approach To Find Basic Feasible Solution Of Transportation Problem

Logical Development Of Vogel s Approximation Method (LD-VAM): An Approach To Find Basic Feasible Solution Of Transportation Problem INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME, ISSUE, FEBRUARY ISSN 77-866 Logcal Development Of Vogel s Approxmaton Method (LD- An Approach To Fnd Basc Feasble Soluton Of Transportaton

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12 14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed

More information

Abstract. Clustering ensembles have emerged as a powerful method for improving both the

Abstract. Clustering ensembles have emerged as a powerful method for improving both the Clusterng Ensembles: {topchyal, Models jan, of punch}@cse.msu.edu Consensus and Weak Parttons * Alexander Topchy, Anl K. Jan, and Wllam Punch Department of Computer Scence and Engneerng, Mchgan State Unversty

More information

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S

How To Know The Components Of Mean Squared Error Of Herarchcal Estmator S S C H E D A E I N F O R M A T I C A E VOLUME 0 0 On Mean Squared Error of Herarchcal Estmator Stans law Brodowsk Faculty of Physcs, Astronomy, and Appled Computer Scence, Jagellonan Unversty, Reymonta

More information

Generalizing the degree sequence problem

Generalizing the degree sequence problem Mddlebury College March 2009 Arzona State Unversty Dscrete Mathematcs Semnar The degree sequence problem Problem: Gven an nteger sequence d = (d 1,...,d n ) determne f there exsts a graph G wth d as ts

More information

On the Solution of Indefinite Systems Arising in Nonlinear Optimization

On the Solution of Indefinite Systems Arising in Nonlinear Optimization On the Soluton of Indefnte Systems Arsng n Nonlnear Optmzaton Slva Bonettn, Valera Ruggero and Federca Tnt Dpartmento d Matematca, Unverstà d Ferrara Abstract We consder the applcaton of the precondtoned

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

Period and Deadline Selection for Schedulability in Real-Time Systems

Period and Deadline Selection for Schedulability in Real-Time Systems Perod and Deadlne Selecton for Schedulablty n Real-Tme Systems Thdapat Chantem, Xaofeng Wang, M.D. Lemmon, and X. Sharon Hu Department of Computer Scence and Engneerng, Department of Electrcal Engneerng

More information

Energy Efficient Routing in Ad Hoc Disaster Recovery Networks

Energy Efficient Routing in Ad Hoc Disaster Recovery Networks Energy Effcent Routng n Ad Hoc Dsaster Recovery Networks Gl Zussman and Adran Segall Department of Electrcal Engneerng Technon Israel Insttute of Technology Hafa 32000, Israel {glz@tx, segall@ee}.technon.ac.l

More information

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP) 6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts Power-of-wo Polces for Sngle- Warehouse Mult-Retaler Inventory Systems wth Order Frequency Dscounts José A. Ventura Pennsylvana State Unversty (USA) Yale. Herer echnon Israel Insttute of echnology (Israel)

More information

Sketching Sampled Data Streams

Sketching Sampled Data Streams Sketchng Sampled Data Streams Florn Rusu, Aln Dobra CISE Department Unversty of Florda Ganesvlle, FL, USA frusu@cse.ufl.edu adobra@cse.ufl.edu Abstract Samplng s used as a unversal method to reduce the

More information

Least Squares Fitting of Data

Least Squares Fitting of Data Least Squares Fttng of Data Davd Eberly Geoetrc Tools, LLC http://www.geoetrctools.co/ Copyrght c 1998-2016. All Rghts Reserved. Created: July 15, 1999 Last Modfed: January 5, 2015 Contents 1 Lnear Fttng

More information

8 Algorithm for Binary Searching in Trees

8 Algorithm for Binary Searching in Trees 8 Algorthm for Bnary Searchng n Trees In ths secton we present our algorthm for bnary searchng n trees. A crucal observaton employed by the algorthm s that ths problem can be effcently solved when the

More information

Chapter 4 ECONOMIC DISPATCH AND UNIT COMMITMENT

Chapter 4 ECONOMIC DISPATCH AND UNIT COMMITMENT Chapter 4 ECOOMIC DISATCH AD UIT COMMITMET ITRODUCTIO A power system has several power plants. Each power plant has several generatng unts. At any pont of tme, the total load n the system s met by the

More information

SVM Tutorial: Classification, Regression, and Ranking

SVM Tutorial: Classification, Regression, and Ranking SVM Tutoral: Classfcaton, Regresson, and Rankng Hwanjo Yu and Sungchul Km 1 Introducton Support Vector Machnes(SVMs) have been extensvely researched n the data mnng and machne learnng communtes for the

More information

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background:

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background: SPEE Recommended Evaluaton Practce #6 efnton of eclne Curve Parameters Background: The producton hstores of ol and gas wells can be analyzed to estmate reserves and future ol and gas producton rates and

More information

On the Interaction between Load Balancing and Speed Scaling

On the Interaction between Load Balancing and Speed Scaling On the Interacton between Load Balancng and Speed Scalng Ljun Chen, Na L and Steven H. Low Engneerng & Appled Scence Dvson, Calforna Insttute of Technology, USA Abstract Speed scalng has been wdely adopted

More information

Solving Factored MDPs with Continuous and Discrete Variables

Solving Factored MDPs with Continuous and Discrete Variables Solvng Factored MPs wth Contnuous and screte Varables Carlos Guestrn Berkeley Research Center Intel Corporaton Mlos Hauskrecht epartment of Computer Scence Unversty of Pttsburgh Branslav Kveton Intellgent

More information

"Research Note" APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES *

Research Note APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES * Iranan Journal of Scence & Technology, Transacton B, Engneerng, ol. 30, No. B6, 789-794 rnted n The Islamc Republc of Iran, 006 Shraz Unversty "Research Note" ALICATION OF CHARGE SIMULATION METHOD TO ELECTRIC

More information

An MILP model for planning of batch plants operating in a campaign-mode

An MILP model for planning of batch plants operating in a campaign-mode An MILP model for plannng of batch plants operatng n a campagn-mode Yanna Fumero Insttuto de Desarrollo y Dseño CONICET UTN yfumero@santafe-concet.gov.ar Gabrela Corsano Insttuto de Desarrollo y Dseño

More information

The descriptive complexity of the family of Banach spaces with the π-property

The descriptive complexity of the family of Banach spaces with the π-property Arab. J. Math. (2015) 4:35 39 DOI 10.1007/s40065-014-0116-3 Araban Journal of Mathematcs Ghadeer Ghawadrah The descrptve complexty of the famly of Banach spaces wth the π-property Receved: 25 March 2014

More information

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,

More information

HÜCKEL MOLECULAR ORBITAL THEORY

HÜCKEL MOLECULAR ORBITAL THEORY 1 HÜCKEL MOLECULAR ORBITAL THEORY In general, the vast maorty polyatomc molecules can be thought of as consstng of a collecton of two electron bonds between pars of atoms. So the qualtatve pcture of σ

More information

Chapter XX More advanced approaches to the analysis of survey data. Gad Nathan Hebrew University Jerusalem, Israel. Abstract

Chapter XX More advanced approaches to the analysis of survey data. Gad Nathan Hebrew University Jerusalem, Israel. Abstract Household Sample Surveys n Developng and Transton Countres Chapter More advanced approaches to the analyss of survey data Gad Nathan Hebrew Unversty Jerusalem, Israel Abstract In the present chapter, we

More information

Efficient Project Portfolio as a tool for Enterprise Risk Management

Efficient Project Portfolio as a tool for Enterprise Risk Management Effcent Proect Portfolo as a tool for Enterprse Rsk Management Valentn O. Nkonov Ural State Techncal Unversty Growth Traectory Consultng Company January 5, 27 Effcent Proect Portfolo as a tool for Enterprse

More information

New Approaches to Support Vector Ordinal Regression

New Approaches to Support Vector Ordinal Regression New Approaches to Support Vector Ordnal Regresson We Chu chuwe@gatsby.ucl.ac.uk Gatsby Computatonal Neuroscence Unt, Unversty College London, London, WCN 3AR, UK S. Sathya Keerth selvarak@yahoo-nc.com

More information

Lecture 5,6 Linear Methods for Classification. Summary

Lecture 5,6 Linear Methods for Classification. Summary Lecture 5,6 Lnear Methods for Classfcaton Rce ELEC 697 Farnaz Koushanfar Fall 2006 Summary Bayes Classfers Lnear Classfers Lnear regresson of an ndcator matrx Lnear dscrmnant analyss (LDA) Logstc regresson

More information

An Analysis of Central Processor Scheduling in Multiprogrammed Computer Systems

An Analysis of Central Processor Scheduling in Multiprogrammed Computer Systems STAN-CS-73-355 I SU-SE-73-013 An Analyss of Central Processor Schedulng n Multprogrammed Computer Systems (Dgest Edton) by Thomas G. Prce October 1972 Techncal Report No. 57 Reproducton n whole or n part

More information

Improved SVM in Cloud Computing Information Mining

Improved SVM in Cloud Computing Information Mining Internatonal Journal of Grd Dstrbuton Computng Vol.8, No.1 (015), pp.33-40 http://dx.do.org/10.1457/jgdc.015.8.1.04 Improved n Cloud Computng Informaton Mnng Lvshuhong (ZhengDe polytechnc college JangSu

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

On the Interaction between Load Balancing and Speed Scaling

On the Interaction between Load Balancing and Speed Scaling On the Interacton between Load Balancng and Speed Scalng Ljun Chen and Na L Abstract Speed scalng has been wdely adopted n computer and communcaton systems, n partcular, to reduce energy consumpton. An

More information

Imperial College London

Imperial College London F. Fang 1, C.C. Pan 1, I.M. Navon 2, M.D. Pggott 1, G.J. Gorman 1, P.A. Allson 1 and A.J.H. Goddard 1 1 Appled Modellng and Computaton Group Department of Earth Scence and Engneerng Imperal College London,

More information

Stochastic epidemic models revisited: Analysis of some continuous performance measures

Stochastic epidemic models revisited: Analysis of some continuous performance measures Stochastc epdemc models revsted: Analyss of some contnuous performance measures J.R. Artalejo Faculty of Mathematcs, Complutense Unversty of Madrd, 28040 Madrd, Span A. Economou Department of Mathematcs,

More information

Learning from Multiple Outlooks

Learning from Multiple Outlooks Learnng from Multple Outlooks Maayan Harel Department of Electrcal Engneerng, Technon, Hafa, Israel She Mannor Department of Electrcal Engneerng, Technon, Hafa, Israel maayanga@tx.technon.ac.l she@ee.technon.ac.l

More information

Lecture 2 Sequence Alignment. Burr Settles IBS Summer Research Program 2008 bsettles@cs.wisc.edu www.cs.wisc.edu/~bsettles/ibs08/

Lecture 2 Sequence Alignment. Burr Settles IBS Summer Research Program 2008 bsettles@cs.wisc.edu www.cs.wisc.edu/~bsettles/ibs08/ Lecture 2 Sequence lgnment Burr Settles IBS Summer Research Program 2008 bsettles@cs.wsc.edu www.cs.wsc.edu/~bsettles/bs08/ Sequence lgnment: Task Defnton gven: a par of sequences DN or proten) a method

More information

Linear Circuits Analysis. Superposition, Thevenin /Norton Equivalent circuits

Linear Circuits Analysis. Superposition, Thevenin /Norton Equivalent circuits Lnear Crcuts Analyss. Superposton, Theenn /Norton Equalent crcuts So far we hae explored tmendependent (resste) elements that are also lnear. A tmendependent elements s one for whch we can plot an / cure.

More information

Optimal monitoring in large networks by Successive c-optimal Designs

Optimal monitoring in large networks by Successive c-optimal Designs Optmal montorng n large networs by Successve c-optmal Desgns Gullaume Sagnol, Stéphane Gaubert INRIA Saclay & CMAP, Ecole Polytechnque Emal: {gullaumesagnol,stephanegaubert}@nrafr Mustapha Bouhtou Orange

More information