High-dimensional support union recovery in multivariate regression

Size: px
Start display at page:

Download "High-dimensional support union recovery in multivariate regression"

Transcription

1 High-dimesioal support uio recovery i multivariate regressio Guillaume Oboziski Departmet of Statistics UC Berkeley gobo@stat.berkeley.edu Marti J. Waiwright Departmet of Statistics Dept. of Electrical Egieerig ad Computer Sciece UC Berkeley waiwright@stat.berkeley.edu Michael I. Jorda Departmet of Statistics Departmet of Electrical Egieerig ad Computer Sciece UC Berkeley jorda@stat.berkeley.edu Abstract We study the behavior of block l /l regularizatio for multivariate regressio, where a K-dimesioal respose vector is regressed upo a fixed set of p covariates. The problem of support uio recovery is to recover the subset of covariates that are active i at least oe of the regressio problems. Studyig this problem uder high-dimesioal scalig (where the problem parameters as well as sample size ted to ifiity simultaeously), our mai result is to show that exact recovery is possible oce the order parameter give by θ l/l (, p, s) : = /[ψ(b ) log(p s)] exceeds a critical threshold. Here is the sample size, p is the ambiet dimesio of the regressio model, s is the size of the uio of supports, ad ψ(b ) is a sparsity-overlap fuctio that measures a combiatio of the sparsities ad overlaps of the K-regressio coefficiet vectors that costitute the model. This sparsity-overlap fuctio reveals that block l /l regularizatio for multivariate regressio ever harms performace relative to a aive l -approach, ad ca yield substatial improvemets i sample complexity (up to a factor of K) whe the regressio vectors are suitably orthogoal relative to the desig. We complemet our theoretical results with simulatios that demostrate the sharpess of the result, eve for relatively small problems. Itroductio A recet lie of research i machie learig has focused o regularizatio based o block-structured orms. Such structured orms are well motivated i various settigs, amog them kerel learig [3, 8], grouped variable selectio [], hierarchical model selectio [3], simultaeous sparse approximatio [0], ad simultaeous feature selectio i multi-task learig [7]. Block-orms that compose a l -orm with other orms yield solutios that ted to be sparse like the Lasso, but the structured orm also eforces blockwise sparsity, i the sese that parameters withi blocks are more likely to be zero (or o-zero) simultaeously. The focus of this paper is the model selectio cosistecy of block-structured regularizatio i the settig of multivariate regressio. Our goal is to perform model or variable selectio, by which we mea extractig the subset of relevat covariates that are active i at least oe regressio. We refer to this problem as the support uio problem. I lie with a large body of recet work i statistical machie learig (e.g., [, 9, 4, ]), our aalysis is high-dimesioal i ature, meaig that we allow the model dimesio p (as well as other structural parameters) to grow alog with the sample size. A great deal of work has focused o the case of ordiary l -regularizatio (Lasso) [,, 4], showig for istace that the Lasso ca recover the support of a sparse sigal eve whe p.

2 Some more recet work has studied cosistecy issues for block-regularizatio schemes, icludig classical aalysis (p fixed) of the group Lasso [], ad high-dimesioal aalysis of the predictive risk of block-regularized logistic regressio [5]. Although there have bee various empirical demostratios of the beefits of block regularizatio, the geeralizatios of the result of [] obtaied by [6, 4] fail to capture the improvemets observed i practice. I this paper, our goal is to uderstad the followig questio: uder what coditios does block regularizatio lead to a quatifiable improvemet i statistical efficiecy, relative to more aive regularizatio schemes? Here statistical efficiecy is assessed i terms of the sample complexity, meaig the miimal sample size required to recover the support uio; we wish to kow how this scales as a fuctio of problem parameters. Our mai cotributio is to provide a fuctio quatifyig the beefits of block regularizatio schemes for the problem of multivariate liear regressio, showig i particular that, uder suitable structural coditios o the data, the block-orm regularizatio we cosider ever harms performace relative to aive l -regularizatio ad ca lead to substatial gais i sample complexity. More specifically, we cosider the followig problem of multivariate liear regressio: a group of K scalar outputs are regressed o the same desig matrix X R p. Represetig the regressio coefficiets as a p K matrix B, the regressio model takes the form Y = XB + W, () where Y R K ad W R K are matrices of observatios ad zero-mea oise respectively ad B has colums β (),..., β (K) which are the parameter vectors of each uivariate regressio. We are iterested { i recoverig the uio } of the supports of idividual regressios, more specifically if S k = i {,..., p}, β (k) i 0 we would like to recover S = k S k. The Lasso is ofte preseted as a relaxatio of the so-called l 0 regularizatio, i.e., the cout of the umber of o-zero parameter coefficiets, a itractable o-covex fuctio. More geerally, block-orm regularizatios ca be thought of as the relaxatio of a o-covex regularizatio which couts the umber of covariates i for which at least oe of the uivariate regressio parameters β (k) i is o-zero. More specifically, let βi deote the ith row of B, ad defie, for q, B l0 /l q = {i {,..., p}, β i q > 0} ad B l /l q = p βi q All l 0 /l q orms defie the same fuctio, but differ coceptually i that they lead to differet l /l q relaxatios. I particular the l /l regularizatio is the same as the usual Lasso. The other coceptually most atural block-orms are l /l ad l /l. While l /l is of iterest, it seems ituitively to be relevat essetially to situatios where the support is exactly the same for all regressios, a assumptio that we are ot willig to make. I the curret paper, we focus o the l /l case ad cosider the estimator B obtaied by solvig the followig disguised secod-order coe program: { } mi B R p K Y XB F + λ B l /l, () where M F : = ( i,j m ij )/ deotes the Frobeius orm. We study the support uio problem uder high-dimesioal scalig, meaig that the umber of observatios, the ambiet dimesio p ad the size of the uio of supports s ca all ted to ifiity. The mai cotributio of this paper is to show that uder certai techical coditios o the desig ad oise matrices, the model selectio performace of block-regularized l /l regressio () is govered by the cotrol ψ(b,σ SS ) log(p s) parameter θ l /l (, p ; B ) : =, where is the sample size, p is the ambiet dimesio, s = S is the size of the uio of the supports, ad ψ( ) is a sparsity-overlap fuctio defied below. More precisely, the probability of correct support uio recovery coverges to oe for all sequeces (, p, s, B ) such that the cotrol parameter θ l /l (, p ; B ) exceeds a fixed critical threshold θ crit < +. Note that θ l /l is a measure of the sample complexity of the problem that is, the sample size required for exact recovery as a fuctio of the problem parameters. Whereas the ratio (/ log p) is stadard for high-dimesioal theory o l -regularizatio (essetially due to coverig umberigs of l balls), the fuctio ψ(b, Σ SS ) is a ovel ad iterestig quatity, which i=

3 measures both the sparsity of the matrix B, as well as the overlap betwee the differet regressio tasks (colums of B ). I Sectio, we itroduce the models ad assumptios, defie key characteristics of the problem ad state our mai result ad its cosequeces. Sectio 3 is devoted to the proof of this mai result, with most techical results deferred to the appedix. Sectio 4 illustrates with simulatios the sharpess of our aalysis ad how quickly the asymptotic regime arises.. Notatios For a (possibly radom) matrix M R p K, ad for parameters a b, we distiguish the l a /l b block orms from the (a, b)-operator orms, defied respectively as { p ( K ) a } M la /l b : = m ik b b a ad M a, b : = sup Mx a, (3) x b = i= k= although l /l p orms belog to both families (see Lemma B.0.). For brevity, we deote the spectral orm M, as M, ad the l -operator orm M, = max i j M ij as M. Mai result ad some cosequeces The aalysis of this paper applies to multivariate liear regressio problems of the form (), i which the oise matrix W R K is assumed to cosist of i.i.d. elemets W ij N(0, σ ). I additio, we assume that the measuremet or desig matrices X have rows draw i a i.i.d. maer from a zero-mea Gaussia N(0, Σ), where Σ 0 is a p p covariace matrix. Suppose that we partitio the full set of covariates ito the support set S ad its complemet S c, with S = s, S c = p s. Cosider the followig block decompositios of the regressio coefficiet matrix, the desig matrix ad its covariace matrix: B = [ ] B S BS, X = [X S X S c], ad Σ = c [ ] ΣSS Σ SS c. Σ S c S Σ S c S c We use β i to deote the ith row of B, ad assume that the sparsity of B is assessed as follows: (A0) Sparsity: The matrix B has row support S : = {i {,..., p} β i 0}, with s = S. I additio, we make the followig assumptios about the covariace Σ of the desig matrix: (A) Bouded eigespectrum: There exist a costat C mi > 0 (resp. C max < + ) such that all eigevalues of Σ SS (resp. Σ) are greater tha C mi (resp. smaller tha C max ). (A) Mutual icoherece: There exists γ (0, ] such that Σ S c S(Σ SS ) γ. (A3) Self icoherece: There exists a costat D max such that (Σ SS ) D max. Assumptio A is a stadard coditio required to prevet excess depedece amog elemets of the desig matrix associated with the support S. The mutual icoherece assumptio A is also well kow from previous work o model selectio with the Lasso [0, 4]. These assumptios are trivially satisfied by the stadard Gaussia esemble (Σ = I p ) with C mi = C max = D max = γ =. More geerally, it ca be show that various matrix classes satisfy these coditios [4, ].. Statemet of mai result With the goal of estimatig the uio of supports S, our mai result is a set of sufficiet coditios usig the followig procedure. Solve the block-regularized problem () with regularizatio parameter λ > 0, thereby obtaiig a solutio B = B(λ ). Use this solutio to compute a estimate of the support uio as Ŝ( B) { } : = i {,..., p} βi 0. This estimator is uambiguously defied if the solutio B is uique, ad as part of our aalysis, we show that the solutio B is ideed uique with high probability i the regime of iterest. We study the behavior of this estimator for a 3

4 sequece of liear regressios idexed by the triplet (, p, s), for which the data follows the geeral model preseted i the previous sectio with defiig parameters B () ad Σ() satisfyig A0- A3. As (, p, s) teds to ifiity, we give coditios o the triplet ad properties of B for which B is uique, ad such that P[Ŝ = S]. The cetral objects i our mai result are the sparsity-overlap fuctio, ad the sample complexity parameter, which we defie here. For ay vector β i 0, defie ζ(β i ) : = β i β i. We exted the fuctio ζ to ay matrix B S R s K with o-zero rows by defiig the matrix ζ(b S ) R s K with i th row [ζ(b S )] i = ζ(β i ). With this otatio, we defie the sparsity-overlap fuctio ψ(b) ad the sample complexity parameter θ l /l (, p ; B ) as ψ(b) : = ζ(bs ) T (Σ SS ) ζ(b S ) ad θ l /l (, p ; B ): = ψ(b ) log(p s). (4) Fially, we use b mi : = mi i S β i to deote the miimal l row-orm of the matrix B S. With this otatio, we have the followig result: Theorem. Cosider a radom desig matrix X draw with i.i.d. N(0, Σ) row vectors, a observatio matrix Y specified by model (), ad a regressio matrix B such that (b mi ) decays strictly more slowly tha f(p) max {s, log(p s)}, for ay fuctio f(p) +. ( Suppose that we solve f(p) ) the block-regularized program () with regularizatio parameter λ = Θ log(p)/. For ay sequece (, p, B ) such that the l /l cotrol parameter θ l/l (, p ; B ) exceeds the critical threshold θ crit (Σ) : = Cmax γ, the with probability greater tha exp( Θ(log p)), (a) the block-regularized program () has a uique solutio B, ad (b) its support set Ŝ( B) is equal to the true support uio S. Remarks: (i) For the stadard Gaussia esemble (Σ = I p ), the critical threshold is simply θ crit (Σ) =. (ii) A techical coditio that we require o the regularizatio parameter is which is satisfied by the choice give i the statemet.. Some cosequeces of Theorem λ log(p s) (5) It is iterestig to cosider some special cases of our mai result. The simplest special case is the uivariate regressio problem (K = ), i which case the fuctio ζ(β ) outputs a s-dimesioal sig vector with elemets z i = sig(β i ), so that ψ(β ) = z T (Σ SS ) z = Θ(s). Cosequetly, the order parameter of block l /l -regressio for uivariate regresio is give by Θ(/(s log(p s)), which matches the scalig established i previous work o the Lasso []. More geerally, give our assumptio (A) o Σ SS, the sparsity overlap ψ(b ) always lies i the s s iterval [ KC max, C mi ]. At the most pessimistic extreme, suppose that B : = β K T that is, B cosists of K copies of the same coefficiet vector β R p, with support of cardiality S = s. We the have [ζ(b )] ij = sig(βi )/ K, from which we see that ψ(b ) = z T (Σ SS ) z, with z agai the s-dimesioal sig vector with elemets zi = sig(βi ), so that there is o beefit i sample complexity relative to the aive strategy of solvig separate Lasso problems ad costructig the uio of idividually estimated supports. This might seem a pessimistic result, sice uder model (), we essetially have K observatios of the coefficiet vector β with the same desig matrix but K idepedet oise realizatios. However, the thresholds as well as the rates of covergece i high-dimesioal results such as Theorem are ot determied by the oise variace, but rather by the umber of iterferig variables (p s). At the most optimistic extreme, cosider the case where Σ SS = I s ad (for s > K) suppose that B is costructed such that the colums of the s K matrix ζ(b ) are all orthogoal ad of equal legth. Uder this coditio, we have 4

5 Corollary (Orthoormal tasks). If the colums of the matrix ζ(b ) are all orthogoal with equal legth ad Σ SS = I s s the the block-regularized problem () succeeds i uio support recovery oce the sample complexity parameter /( s K log(p s)) is larger tha. For the stadard Gaussia esemble, it is kow [] that the Lasso fails with probability oe for all sequeces such that < ( ν)s log(p s) for ay arbitrarily small ν > 0. Cosequetly, Corollary shows that uder suitable coditios o the regressio coefficiet matrix B, l /l ca provides a K-fold reductio i the umber of samples required for exact support recovery. As a third illustratio, cosider, for Σ SS = I s s, the case where the supports S k of idividual regressio problems are all disjoit. The sample complexity parameter for each of the idividual Lassos is /(s k log(p s k )) where S k = s k, so that the sample size required to recover the support uio from idividual Lassos scales as = Θ(max k [s k log(p s k )]). However, if the supports are all disjoit, the the colums of the matrix Z S = ζ(b S ) are orthogoal, ad Z S T Z S = diag(s,..., s K ) so that ψ(b ) = max k s k ad the sample complexity is the same. I other words, eve though there is o sharig of variables at all there is surprisigly o pealty from regularizig joitly with the l /l -orm. However, this is ot always true if Σ SS I s s ad i may situatios l /l -regularizatio ca have higher sample complexity tha separate Lassos. 3 Proof of Theorem I additio to previous otatios, the proofs use the shorthads: Σ SS = XT S X S, Σ S c S= XT S cx S ad Π S = X S ( Σ SS ) X T S deotes the orthogoal projectio oto the rage of X S. High-level proof outlie: At a high level, our proof is based o the otio of what we refer to as a primal-dual witess: we first formulate the problem () as a secod-order coe program (SOCP), with the same primal variable B as i () ad a dual variable Z whose rows coicide at optimality with the subgradiet of the l /l orm. We the costruct a primal matrix B alog with a dual matrix Ẑ such that, uder the coditios of Theorem, with probability covergig to : (a) The pair ( B, Ẑ) satisfies the Karush-Kuh-Tucker (KKT) coditios of the SOCP. (b) I spite of the fact that for geeral high-dimesioal problems (with p ), the SOCP eed ot have a uique solutio a priori, a strict feasibility coditio satisfied by the dual variables Ẑ guaratees that B is the uique optimal solutio of (). (c) The support uio Ŝ of B is idetical to the support uio S of B. At the core of our costructive procedure is the followig covex-aalytic result, which characterizes a optimal primal-dual pair for which the primal solutio B correctly recovers the support set S: Lemma. Suppose that there exists a primal-dual pair ( B, Ẑ) that satisfy the coditios: Ẑ S = ζ( B S ) (6a) Σ SS ( B S BS) XT S W = λ Ẑ S (6b) λ Ẑ S c : = l /l Σ S c S( B S BS) XT S l /l cw < λ (6c) B S c = 0. (6d) The ( B, Ẑ) is the uique optimal solutio to the block-regularized problem, with Ŝ( B) = S by costructio. Appedix A proves Lemma, with the strict feasibility of ẐSc give by (6c) to certify uiqueess. 3. Costructio of primal-dual witess Based o Lemma, we costruct the primal dual pair ( B, Ẑ) as follows. First, we set B S c = 0, to satisfy coditio (6d). Next, we obtai the pair ( B S, ẐS) by solvig a restricted versio of (): { [ ] } B S = arg mi B S R s K Y X BS + λ 0 S c B S l/l. (7) F 5

6 Sice s <, the empirical covariace (sub)matrix Σ SS = XT S X S is strictly positive defiite with probability oe, which implies that the restricted problem (7) is strictly covex ad therefore has a uique optimum B S. We the choose ẐS to be the solutio of equatio (6b). Sice ay such matrix ẐS is also a dual solutio to the SOCP (7), it must be a elemet of the subdifferetial B S l /l. It remais to show that this costructio satisfies coditios (6a) ad (6c). I order to satisfy coditio (6a), it suffices to show that β i 0, i S. From equatio (6b) ad sice Σ SS is ivertible, we may solve as follows ( B ) [ ] X S BS) T = S W ( ΣSS λ Ẑ S = : U S. (8) For ay row i S, we have β i βi U S l /l. Thus, it suffices to show that the followig evet occurs with high probability { E(U S ) : = U S l /l } b mi (9) to show that o row of B S is idetically zero. We establish this result later i this sectio. Turig to coditio (6c), by substitutig expressio (8) for the differece ( B S BS ) ito equatio (6c), we obtai a (p s) K radom matrix V S c, whose row j S c is give by V j : = X T j ( [Π S I ] W λ X S ( Σ SS ) Ẑ S ). (0) I order for coditio (6c) to hold, it is ecessary ad sufficiet that the probability of the evet } E(V S c) : = { V S c l /l < λ () coverges to oe as teds to ifiity. Correct iclusio of supportig covariates: We begi by aalyzig the probability of E(U S ). Lemma. Uder assumptio A3 ad coditios (5) of Theorem, with probability exp( Θ(log s)), we have ( ) ( ( )) U S l /l O (log s)/ + λ D max + O s /. This lemma is proved i i the Appedix. With the assumed scalig = Ω (s log(p s)), ad the assumed slow decrease of b mi, which we write explicitly as (b mi ) f(p) max{s,log(p s)} ε for some ε 0, we have U S l /l b mi O(ε ), () so that the coditios of Theorem esure that E(U S ) occurs with probability covergig to oe. Correct exclusio of o-support: Next we aalyze the evet E(V S c). For simplicity, i the followig argumets, we drop the idex S c ad write V for V S c. I order to show that V l /l < λ with probability covergig to oe, we make use of the decompositio 3 V λ l /l T i where T : = E [V X S ] λ l /l, i= T : = λ E [V X S, W ] E [V X S ] l /l ad T 3 : = λ V E [V X S, W ] l /l. Lemma 3. Uder assumptio A, T γ. Uder coditios (5) of Theorem, T = o p (). Therefore, to show that λ V l /l < with high probability, it suffices to show that T 3 < γ with high probability. Util ow, we have t appealed to the sample complexity parameter θ l /l (, p ; B ). I the ext sectio, we prove that θ l /l (, p ; B ) > θ crit (Σ) implies that T 3 < γ with high probability. 6

7 Lemma 4. Coditioally o W ad X S, we have ( Vj E [V j X S, W ] W, X S ) d = ( ΣS c S) jj ξt j M ξ j, where ξ j N( 0 K, I K ) ad where the K K matrix M = M (X S, W ) is give by M : = λ ẐT S ( Σ SS ) Ẑ S + W T (Π S I )W. (3) But the covariace matrix M is itself cocetrated. Ideed, Lemma 5. Uder the coditios (5) of Theorem, for ay δ > 0, the followig evet T (δ) has probability covergig to : T (δ) : = { M λ ψ(b ) } ( + δ). (4) For ay fixed δ > 0, we have P[T 3 γ] P[T 3 γ T (δ)] + P[T (δ) c ], but, from lemma 5, P[T (δ) c ] 0, so that it suffices to deal with the first term. Give that (Σ S c S) jj (Σ S c S c) jj C max for all j, o the evet T (δ), we have max (Σ j S c S c S) jj ξj T M ξ j C max M max ξ j j S c C max λ ψ(b ) [ ] P[T 3 γ T (δ)] P max ξ j j S c t (, B ) with t (, B ) : = γ max j S c ξ j C max ad ψ(b ) ( + δ). Fially usig the uio boud ad a large deviatio boud for χ variates we get the followig coditio which is equivalet to the coditio of Theorem : θ l/l (, p ; B ) > θ crit (Σ): [ ] Lemma 6. P max ξ j j S c t (, B ) 0 if t (, B ) > ( + ν) log(p s) for some ν > 0. 4 Simulatios I this sectio, we illustrate the sharpess of Theorem ad furthermore ascertai how quickly the predicted behavior is observed as, p, s grow i differet regimes, for two regressio tasks (i.e., K = ). I the followig simulatios, the matrix B of regressio coefficiets is desiged with etries βij i { /, / } to yield a desired value of ψ(b ). The desig matrix X is sampled from the stadard Gaussia esemble. Sice βij = / i this costructio, we have B S = ζ(b S ), ad b mi =. Moreover, sice Σ = I p, the sparsity-overlap ψ(b ) is simply ζ(b ) T ζ(b ). From our aalysis, the sample complexity parameter θ l /l is cotrolled by the iterferece of irrelevat covariates, ad ot by the variace of a oise compoet. We cosider liear sparsity with s = αp, for α = /8, for various ambiet model dimesios p {3, 56, 04}. For each value of p, we perform simulatios varyig the sample size to match correspodig values of the basic Lasso sample complexity parameter, give by θ Las : = /(s log(p s)), i the iterval [0.5,.5]. I each case, we solve the blockregularized problem () with sample size = θ Las s log(p s) usig the regularizatio parameter λ = log(p s) (log s)/. I all cases, the oise level is set at σ = 0.. For our costructio of matrices B, we choose both p ad the scaligs for the sparsity so that the obtaied values for s that are multiples of four, ad costruct the colums Z () ad Z () of the matrix B = ζ(b ) from copies of vectors of legth 4. Deotig by the usual matrix tesor product, we cosider: Idetical regressios: We set Z () = Z () = s, so that the sparsity-overlap is ψ(b ) = s. Orthogoal regressio: Here B is costructed with Z () Z (), so that ψ(b ) = s, the most favorable situatio. To achieve this, we set Z () = s ad Z () = s/ (, ) T. Itermediate agles: I this itermediate case, the colums Z () ad Z () are at a 60 agle, which leads to ψ(b ) = 3 4 s. We set Z() = s ad Z () = s/4 (,,, ) T. Figure shows plots of all three cases ad the referece Lasso case for the three differet values of the ambiet dimesio ad the two types of sparsity described above. Note how the curves all udergo a threshold pheomeo, with the locatio cosistet with the predictios of Theorem. 7

8 L p=3 s=p/8=4 p=56 s=p/8=3 p=04 s=p/8=8 P(support correct) Z =Z Ð (Z,Z )=60 o Z Z P(support correct) P(support correct) θ θ θ Figure. Plots of support recovery probability P[Ŝ = S] versus the basic l cotrol parameter θ Las=/[s log(p s)] for liear sparsity s=p/8, ad for icreasig values of p {3, 56, 04} from left to right. Each graph shows four curves correspodig to the case of idepedet l regularizatio (pluses), ad for l /l regularizatio, the cases of idetical regressio (crosses), itermediate agles (ablas), ad orthogoal regressios (squares). As plotted i dotted vertical lies, Theorem predicts that idetical case should succeed for θ Las > (same as ordiary Lasso), itermediate case for θ Las >0.75, ad orthogoal case for θ Las >0.50. The shift of these curves cofirms this predictio. 5 Discussio We studied support uio recovery uder high-dimesioal scalig with the l /l regularizatio, ad show that its sample complexity is determied by the fuctio ψ(b ). The latter itegrates the sparsity of each uivariate regressio with the overlap of all the supports ad the discrepacies betwee each of the vectors of parameter estimated. I favorable cases, for K regressios, the sample complexity for l /l is K times smaller tha that of the Lasso. Moreover, this gai is ot obtaied at the expese of a assumptio of shared support over the data. I fact, for stadard Gaussia desigs, the regularizatio seems adaptive i sese that it does t perform worse tha the Lasso for disjoit supports. This is ot ecessarily the case for more geeral desigs ad i some situatios, which eed to be characterized i future work, it could do worse tha the Lasso. Refereces [] F. Bach. Cosistecy of the group Lasso ad multiple kerel learig. Techical report, INRIA - Départemet d Iformatique, Ecole Normale Supérieure, 008. [] F. Bach, G. Lackriet, ad M. Jorda. Multiple kerel learig, coic duality, ad the SMO algorithm. I Proc. It. Cof. Machie Learig (ICML). Morga Kaufma, 004. [3] D. Dooho, M. Elad, ad V. M. Temlyakov. Stable recovery of sparse overcomplete represetatios i the presece of oise. IEEE Tras. Ifo Theory, 5():6 8, Jauary 006. [4] H. Liu ad J. Zhag. O the l l q regularized regressio. Techical Report arxiv:080.57v, Caregie Mello Uiversity, 008. [5] L. Meier, S. va de Geer, ad P. Bühlma. The group lasso for logistic regressio. Techical report, Mathematics Departmet, Swiss Federal Istitute of Techology Zürich, 007. [6] Y. Nardi ad A. Rialdo. O the asymptotic properties of the group lasso estimator for liear models. Electroic Joural of Statistics, : , 008. [7] G. Oboziski, B. Taskar, ad M. Jorda. Joit covariate selectio ad joit subspace selectio for multiple classificatio problems. Statistics ad Computig, 009. To appear. [8] M. Potil ad C.A. Michelli. Learig the kerel fuctio via regularizatio. Joural of Machie Learig Research, 6:099 5, 005. [9] P. Ravikumar, J. Lafferty, H. Liu, ad L. Wasserma. SpAM: sparse additive models. I Neural Ifo. Proc. Systems (NIPS), Vacouver, Caada, December 007. [0] J. A. Tropp. Just relax: Covex programmig methods for idetifyig sparse sigals i oise. IEEE Tras. Ifo Theory, 5(3):030 05, March 006. [] M. J. Waiwright. Sharp thresholds for high-dimesioal ad oisy recovery of sparsity usig usig l -costraied quadratic programs. Techical Report 709, Departmet of Statistics, UC Berkeley, 006. [] M. Yua ad Y. Li. Model selectio ad estimatio i regressio with grouped variables. Joural of the Royal Statistical Society B, (68):4967, 006. [3] P. Zhao, G. Rocha, ad B. Yu. Grouped ad hierarchical model selectio through composite absolute pealties. Techical report, Statistics Departmet, UC Berkeley, 007. [4] P. Zhao ad B. Yu. Model selectio with the lasso. J. of Machie Learig Research, pages ,

SUPPORT UNION RECOVERY IN HIGH-DIMENSIONAL MULTIVARIATE REGRESSION 1

SUPPORT UNION RECOVERY IN HIGH-DIMENSIONAL MULTIVARIATE REGRESSION 1 The Aals of Statistics 2011, Vol. 39, No. 1, 1 47 DOI: 10.1214/09-AOS776 Istitute of Mathematical Statistics, 2011 SUPPORT UNION RECOVERY IN HIGH-DIMENSIONAL MULTIVARIATE REGRESSION 1 BY GUILLAUME OBOZINSKI,

More information

Department of Computer Science, University of Otago

Department of Computer Science, University of Otago Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

AMS 2000 subject classification. Primary 62G08, 62G20; secondary 62G99

AMS 2000 subject classification. Primary 62G08, 62G20; secondary 62G99 VARIABLE SELECTION IN NONPARAMETRIC ADDITIVE MODELS Jia Huag 1, Joel L. Horowitz 2 ad Fegrog Wei 3 1 Uiversity of Iowa, 2 Northwester Uiversity ad 3 Uiversity of West Georgia Abstract We cosider a oparametric

More information

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling Taig DCOP to the Real World: Efficiet Complete Solutios for Distributed Multi-Evet Schedulig Rajiv T. Maheswara, Milid Tambe, Emma Bowrig, Joatha P. Pearce, ad Pradeep araatham Uiversity of Souther Califoria

More information

Asymptotic Growth of Functions

Asymptotic Growth of Functions CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll

More information

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT Keywords: project maagemet, resource allocatio, etwork plaig Vladimir N Burkov, Dmitri A Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT The paper deals with the problems of resource allocatio betwee

More information

HIGH-DIMENSIONAL REGRESSION WITH NOISY AND MISSING DATA: PROVABLE GUARANTEES WITH NONCONVEXITY

HIGH-DIMENSIONAL REGRESSION WITH NOISY AND MISSING DATA: PROVABLE GUARANTEES WITH NONCONVEXITY The Aals of Statistics 2012, Vol. 40, No. 3, 1637 1664 DOI: 10.1214/12-AOS1018 Istitute of Mathematical Statistics, 2012 HIGH-DIMENSIONAL REGRESSION WITH NOISY AND MISSING DATA: PROVABLE GUARANTEES WITH

More information

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample

More information

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

CME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 8

CME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 8 CME 30: NUMERICAL LINEAR ALGEBRA FALL 005/06 LECTURE 8 GENE H GOLUB 1 Positive Defiite Matrices A matrix A is positive defiite if x Ax > 0 for all ozero x A positive defiite matrix has real ad positive

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

SUPPLEMENTARY MATERIAL TO GENERAL NON-EXACT ORACLE INEQUALITIES FOR CLASSES WITH A SUBEXPONENTIAL ENVELOPE

SUPPLEMENTARY MATERIAL TO GENERAL NON-EXACT ORACLE INEQUALITIES FOR CLASSES WITH A SUBEXPONENTIAL ENVELOPE SUPPLEMENTARY MATERIAL TO GENERAL NON-EXACT ORACLE INEQUALITIES FOR CLASSES WITH A SUBEXPONENTIAL ENVELOPE By Guillaume Lecué CNRS, LAMA, Mare-la-vallée, 77454 Frace ad By Shahar Medelso Departmet of Mathematics,

More information

Chapter 5: Inner Product Spaces

Chapter 5: Inner Product Spaces Chapter 5: Ier Product Spaces Chapter 5: Ier Product Spaces SECION A Itroductio to Ier Product Spaces By the ed of this sectio you will be able to uderstad what is meat by a ier product space give examples

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13 EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Note 13 Itroductio At this poit, we have see eough examples that it is worth just takig stock of our model of probability ad may

More information

Convexity, Inequalities, and Norms

Convexity, Inequalities, and Norms Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for

More information

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009)

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009) 18.409 A Algorithmist s Toolkit October 27, 2009 Lecture 13 Lecturer: Joatha Keler Scribe: Joatha Pies (2009) 1 Outlie Last time, we proved the Bru-Mikowski iequality for boxes. Today we ll go over the

More information

Modified Line Search Method for Global Optimization

Modified Line Search Method for Global Optimization Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o

More information

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem Lecture 4: Cauchy sequeces, Bolzao-Weierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits

More information

Maximum Likelihood Estimators.

Maximum Likelihood Estimators. Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio

More information

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S CONTROL CHART FOR THE CHANGES IN A PROCESS Supraee Lisawadi Departmet of Mathematics ad Statistics, Faculty of Sciece ad Techoology, Thammasat

More information

A probabilistic proof of a binomial identity

A probabilistic proof of a binomial identity A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two

More information

MARTINGALES AND A BASIC APPLICATION

MARTINGALES AND A BASIC APPLICATION MARTINGALES AND A BASIC APPLICATION TURNER SMITH Abstract. This paper will develop the measure-theoretic approach to probability i order to preset the defiitio of martigales. From there we will apply this

More information

Chapter 5 O A Cojecture Of Erdíos Proceedigs NCUR VIII è1994è, Vol II, pp 794í798 Jeærey F Gold Departmet of Mathematics, Departmet of Physics Uiversity of Utah Do H Tucker Departmet of Mathematics Uiversity

More information

LECTURE 13: Cross-validation

LECTURE 13: Cross-validation LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M

More information

Annuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL.

Annuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL. Auities Uder Radom Rates of Iterest II By Abraham Zas Techio I.I.T. Haifa ISRAEL ad Haifa Uiversity Haifa ISRAEL Departmet of Mathematics, Techio - Israel Istitute of Techology, 3000, Haifa, Israel I memory

More information

1. MATHEMATICAL INDUCTION

1. MATHEMATICAL INDUCTION 1. MATHEMATICAL INDUCTION EXAMPLE 1: Prove that for ay iteger 1. Proof: 1 + 2 + 3 +... + ( + 1 2 (1.1 STEP 1: For 1 (1.1 is true, sice 1 1(1 + 1. 2 STEP 2: Suppose (1.1 is true for some k 1, that is 1

More information

Irreducible polynomials with consecutive zero coefficients

Irreducible polynomials with consecutive zero coefficients Irreducible polyomials with cosecutive zero coefficiets Theodoulos Garefalakis Departmet of Mathematics, Uiversity of Crete, 71409 Heraklio, Greece Abstract Let q be a prime power. We cosider the problem

More information

Systems Design Project: Indoor Location of Wireless Devices

Systems Design Project: Indoor Location of Wireless Devices Systems Desig Project: Idoor Locatio of Wireless Devices Prepared By: Bria Murphy Seior Systems Sciece ad Egieerig Washigto Uiversity i St. Louis Phoe: (805) 698-5295 Email: bcm1@cec.wustl.edu Supervised

More information

THE ABRACADABRA PROBLEM

THE ABRACADABRA PROBLEM THE ABRACADABRA PROBLEM FRANCESCO CARAVENNA Abstract. We preset a detailed solutio of Exercise E0.6 i [Wil9]: i a radom sequece of letters, draw idepedetly ad uiformly from the Eglish alphabet, the expected

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

Lecture 4: Cheeger s Inequality

Lecture 4: Cheeger s Inequality Spectral Graph Theory ad Applicatios WS 0/0 Lecture 4: Cheeger s Iequality Lecturer: Thomas Sauerwald & He Su Statemet of Cheeger s Iequality I this lecture we assume for simplicity that G is a d-regular

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

Universal coding for classes of sources

Universal coding for classes of sources Coexios module: m46228 Uiversal codig for classes of sources Dever Greee This work is produced by The Coexios Project ad licesed uder the Creative Commos Attributio Licese We have discussed several parametric

More information

Soving Recurrence Relations

Soving Recurrence Relations Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree

More information

Plug-in martingales for testing exchangeability on-line

Plug-in martingales for testing exchangeability on-line Plug-i martigales for testig exchageability o-lie Valetia Fedorova, Alex Gammerma, Ilia Nouretdiov, ad Vladimir Vovk Computer Learig Research Cetre Royal Holloway, Uiversity of Lodo, UK {valetia,ilia,alex,vovk}@cs.rhul.ac.uk

More information

THE HEIGHT OF q-binary SEARCH TREES

THE HEIGHT OF q-binary SEARCH TREES THE HEIGHT OF q-binary SEARCH TREES MICHAEL DRMOTA AND HELMUT PRODINGER Abstract. q biary search trees are obtaied from words, equipped with the geometric distributio istead of permutatios. The average

More information

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection The aalysis of the Courot oligopoly model cosiderig the subjective motive i the strategy selectio Shigehito Furuyama Teruhisa Nakai Departmet of Systems Maagemet Egieerig Faculty of Egieerig Kasai Uiversity

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

Sequences and Series

Sequences and Series CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their

More information

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy

More information

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. This documet was writte ad copyrighted by Paul Dawkis. Use of this documet ad its olie versio is govered by the Terms ad Coditios of Use located at http://tutorial.math.lamar.edu/terms.asp. The olie versio

More information

Overview of some probability distributions.

Overview of some probability distributions. Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability

More information

Class Meeting # 16: The Fourier Transform on R n

Class Meeting # 16: The Fourier Transform on R n MATH 18.152 COUSE NOTES - CLASS MEETING # 16 18.152 Itroductio to PDEs, Fall 2011 Professor: Jared Speck Class Meetig # 16: The Fourier Trasform o 1. Itroductio to the Fourier Trasform Earlier i the course,

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. Powers of a matrix We begi with a propositio which illustrates the usefuless of the diagoalizatio. Recall that a square matrix A is diogaalizable if

More information

5 Boolean Decision Trees (February 11)

5 Boolean Decision Trees (February 11) 5 Boolea Decisio Trees (February 11) 5.1 Graph Coectivity Suppose we are give a udirected graph G, represeted as a boolea adjacecy matrix = (a ij ), where a ij = 1 if ad oly if vertices i ad j are coected

More information

Normal Distribution.

Normal Distribution. Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued

More information

Ekkehart Schlicht: Economic Surplus and Derived Demand

Ekkehart Schlicht: Economic Surplus and Derived Demand Ekkehart Schlicht: Ecoomic Surplus ad Derived Demad Muich Discussio Paper No. 2006-17 Departmet of Ecoomics Uiversity of Muich Volkswirtschaftliche Fakultät Ludwig-Maximilias-Uiversität Müche Olie at http://epub.ub.ui-mueche.de/940/

More information

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical ad Mathematical Scieces 2015, 1, p. 15 19 M a t h e m a t i c s AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics

More information

How To Solve The Homewor Problem Beautifully

How To Solve The Homewor Problem Beautifully Egieerig 33 eautiful Homewor et 3 of 7 Kuszmar roblem.5.5 large departmet store sells sport shirts i three sizes small, medium, ad large, three patters plaid, prit, ad stripe, ad two sleeve legths log

More information

Notes on exponential generating functions and structures.

Notes on exponential generating functions and structures. Notes o expoetial geeratig fuctios ad structures. 1. The cocept of a structure. Cosider the followig coutig problems: (1) to fid for each the umber of partitios of a -elemet set, (2) to fid for each the

More information

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

Overview on S-Box Design Principles

Overview on S-Box Design Principles Overview o S-Box Desig Priciples Debdeep Mukhopadhyay Assistat Professor Departmet of Computer Sciece ad Egieerig Idia Istitute of Techology Kharagpur INDIA -721302 What is a S-Box? S-Boxes are Boolea

More information

Section 11.3: The Integral Test

Section 11.3: The Integral Test Sectio.3: The Itegral Test Most of the series we have looked at have either diverged or have coverged ad we have bee able to fid what they coverge to. I geeral however, the problem is much more difficult

More information

Subject CT5 Contingencies Core Technical Syllabus

Subject CT5 Contingencies Core Technical Syllabus Subject CT5 Cotigecies Core Techical Syllabus for the 2015 exams 1 Jue 2014 Aim The aim of the Cotigecies subject is to provide a groudig i the mathematical techiques which ca be used to model ad value

More information

ARTICLE IN PRESS. Statistics & Probability Letters ( ) A Kolmogorov-type test for monotonicity of regression. Cecile Durot

ARTICLE IN PRESS. Statistics & Probability Letters ( ) A Kolmogorov-type test for monotonicity of regression. Cecile Durot STAPRO 66 pp: - col.fig.: il ED: MG PROD. TYPE: COM PAGN: Usha.N -- SCAN: il Statistics & Probability Letters 2 2 2 2 Abstract A Kolmogorov-type test for mootoicity of regressio Cecile Durot Laboratoire

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

Data Analysis and Statistical Behaviors of Stock Market Fluctuations

Data Analysis and Statistical Behaviors of Stock Market Fluctuations 44 JOURNAL OF COMPUTERS, VOL. 3, NO. 0, OCTOBER 2008 Data Aalysis ad Statistical Behaviors of Stock Market Fluctuatios Ju Wag Departmet of Mathematics, Beijig Jiaotog Uiversity, Beijig 00044, Chia Email:

More information

Theorems About Power Series

Theorems About Power Series Physics 6A Witer 20 Theorems About Power Series Cosider a power series, f(x) = a x, () where the a are real coefficiets ad x is a real variable. There exists a real o-egative umber R, called the radius

More information

A Recursive Formula for Moments of a Binomial Distribution

A Recursive Formula for Moments of a Binomial Distribution A Recursive Formula for Momets of a Biomial Distributio Árpád Béyi beyi@mathumassedu, Uiversity of Massachusetts, Amherst, MA 01003 ad Saverio M Maago smmaago@psavymil Naval Postgraduate School, Moterey,

More information

Perfect Packing Theorems and the Average-Case Behavior of Optimal and Online Bin Packing

Perfect Packing Theorems and the Average-Case Behavior of Optimal and Online Bin Packing SIAM REVIEW Vol. 44, No. 1, pp. 95 108 c 2002 Society for Idustrial ad Applied Mathematics Perfect Packig Theorems ad the Average-Case Behavior of Optimal ad Olie Bi Packig E. G. Coffma, Jr. C. Courcoubetis

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

Research Article Sign Data Derivative Recovery

Research Article Sign Data Derivative Recovery Iteratioal Scholarly Research Network ISRN Applied Mathematics Volume 0, Article ID 63070, 7 pages doi:0.540/0/63070 Research Article Sig Data Derivative Recovery L. M. Housto, G. A. Glass, ad A. D. Dymikov

More information

The Gompertz Makeham coupling as a Dynamic Life Table. Abraham Zaks. Technion I.I.T. Haifa ISRAEL. Abstract

The Gompertz Makeham coupling as a Dynamic Life Table. Abraham Zaks. Technion I.I.T. Haifa ISRAEL. Abstract The Gompertz Makeham couplig as a Dyamic Life Table By Abraham Zaks Techio I.I.T. Haifa ISRAEL Departmet of Mathematics, Techio - Israel Istitute of Techology, 32000, Haifa, Israel Abstract A very famous

More information

CONTROL CHART BASED ON A MULTIPLICATIVE-BINOMIAL DISTRIBUTION

CONTROL CHART BASED ON A MULTIPLICATIVE-BINOMIAL DISTRIBUTION www.arpapress.com/volumes/vol8issue2/ijrras_8_2_04.pdf CONTROL CHART BASED ON A MULTIPLICATIVE-BINOMIAL DISTRIBUTION Elsayed A. E. Habib Departmet of Statistics ad Mathematics, Faculty of Commerce, Beha

More information

Determining the sample size

Determining the sample size Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors

More information

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? JÖRG JAHNEL 1. My Motivatio Some Sort of a Itroductio Last term I tought Topological Groups at the Göttige Georg August Uiversity. This

More information

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork Solutios to Selected Problems I: Patter Classificatio by Duda, Hart, Stork Joh L. Weatherwax February 4, 008 Problem Solutios Chapter Bayesia Decisio Theory Problem radomized rules Part a: Let Rx be the

More information

Generalization Dynamics in LMS Trained Linear Networks

Generalization Dynamics in LMS Trained Linear Networks Geeralizatio Dyamics i LMS Traied Liear Networks Yves Chauvi Psychology Departmet Staford Uiversity Staford, CA 94305 Abstract For a simple liear case, a mathematical aalysis of the traiig ad geeralizatio

More information

DAME - Microsoft Excel add-in for solving multicriteria decision problems with scenarios Radomir Perzina 1, Jaroslav Ramik 2

DAME - Microsoft Excel add-in for solving multicriteria decision problems with scenarios Radomir Perzina 1, Jaroslav Ramik 2 Itroductio DAME - Microsoft Excel add-i for solvig multicriteria decisio problems with scearios Radomir Perzia, Jaroslav Ramik 2 Abstract. The mai goal of every ecoomic aget is to make a good decisio,

More information

Review: Classification Outline

Review: Classification Outline Data Miig CS 341, Sprig 2007 Decisio Trees Neural etworks Review: Lecture 6: Classificatio issues, regressio, bayesia classificatio Pretice Hall 2 Data Miig Core Techiques Classificatio Clusterig Associatio

More information

Interference Alignment and the Generalized Degrees of Freedom of the X Channel

Interference Alignment and the Generalized Degrees of Freedom of the X Channel Iterferece Aligmet ad the Geeralized Degrees of Freedom of the X Chael Chiachi Huag, Viveck R. Cadambe, Syed A. Jafar Electrical Egieerig ad Computer Sciece Uiversity of Califoria Irvie Irvie, Califoria,

More information

1 The Gaussian channel

1 The Gaussian channel ECE 77 Lecture 0 The Gaussia chael Objective: I this lecture we will lear about commuicatio over a chael of practical iterest, i which the trasmitted sigal is subjected to additive white Gaussia oise.

More information

Probabilistic Engineering Mechanics. Do Rosenblatt and Nataf isoprobabilistic transformations really differ?

Probabilistic Engineering Mechanics. Do Rosenblatt and Nataf isoprobabilistic transformations really differ? Probabilistic Egieerig Mechaics 4 (009) 577 584 Cotets lists available at ScieceDirect Probabilistic Egieerig Mechaics joural homepage: wwwelseviercom/locate/probegmech Do Roseblatt ad Nataf isoprobabilistic

More information

arxiv:1506.03481v1 [stat.me] 10 Jun 2015

arxiv:1506.03481v1 [stat.me] 10 Jun 2015 BEHAVIOUR OF ABC FOR BIG DATA By Wetao Li ad Paul Fearhead Lacaster Uiversity arxiv:1506.03481v1 [stat.me] 10 Ju 2015 May statistical applicatios ivolve models that it is difficult to evaluate the likelihood,

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

CHAPTER 3 THE TIME VALUE OF MONEY

CHAPTER 3 THE TIME VALUE OF MONEY CHAPTER 3 THE TIME VALUE OF MONEY OVERVIEW A dollar i the had today is worth more tha a dollar to be received i the future because, if you had it ow, you could ivest that dollar ad ear iterest. Of all

More information

A Combined Continuous/Binary Genetic Algorithm for Microstrip Antenna Design

A Combined Continuous/Binary Genetic Algorithm for Microstrip Antenna Design A Combied Cotiuous/Biary Geetic Algorithm for Microstrip Atea Desig Rady L. Haupt The Pesylvaia State Uiversity Applied Research Laboratory P. O. Box 30 State College, PA 16804-0030 haupt@ieee.org Abstract:

More information

Totally Corrective Boosting Algorithms that Maximize the Margin

Totally Corrective Boosting Algorithms that Maximize the Margin Mafred K. Warmuth mafred@cse.ucsc.edu Ju Liao liaoju@cse.ucsc.edu Uiversity of Califoria at Sata Cruz, Sata Cruz, CA 95064, USA Guar Rätsch Guar.Raetsch@tuebige.mpg.de Friedrich Miescher Laboratory of

More information

Chapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions

Chapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions Chapter 5 Uit Aual Amout ad Gradiet Fuctios IET 350 Egieerig Ecoomics Learig Objectives Chapter 5 Upo completio of this chapter you should uderstad: Calculatig future values from aual amouts. Calculatig

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

Coordinating Principal Component Analyzers

Coordinating Principal Component Analyzers Coordiatig Pricipal Compoet Aalyzers J.J. Verbeek ad N. Vlassis ad B. Kröse Iformatics Istitute, Uiversity of Amsterdam Kruislaa 403, 1098 SJ Amsterdam, The Netherlads Abstract. Mixtures of Pricipal Compoet

More information

Institute of Actuaries of India Subject CT1 Financial Mathematics

Institute of Actuaries of India Subject CT1 Financial Mathematics Istitute of Actuaries of Idia Subject CT1 Fiacial Mathematics For 2014 Examiatios Subject CT1 Fiacial Mathematics Core Techical Aim The aim of the Fiacial Mathematics subject is to provide a groudig i

More information

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006 Exam format UC Bereley Departmet of Electrical Egieerig ad Computer Sciece EE 6: Probablity ad Radom Processes Solutios 9 Sprig 006 The secod midterm will be held o Wedesday May 7; CHECK the fial exam

More information

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean 1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.

More information

Measures of Spread and Boxplots Discrete Math, Section 9.4

Measures of Spread and Boxplots Discrete Math, Section 9.4 Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,

More information

Permutations, the Parity Theorem, and Determinants

Permutations, the Parity Theorem, and Determinants 1 Permutatios, the Parity Theorem, ad Determiats Joh A. Guber Departmet of Electrical ad Computer Egieerig Uiversity of Wiscosi Madiso Cotets 1 What is a Permutatio 1 2 Cycles 2 2.1 Traspositios 4 3 Orbits

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable Week 3 Coditioal probabilities, Bayes formula, WEEK 3 page 1 Expected value of a radom variable We recall our discussio of 5 card poker hads. Example 13 : a) What is the probability of evet A that a 5

More information

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

, a Wishart distribution with n -1 degrees of freedom and scale matrix. UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that

More information

Infinite Sequences and Series

Infinite Sequences and Series CHAPTER 4 Ifiite Sequeces ad Series 4.1. Sequeces A sequece is a ifiite ordered list of umbers, for example the sequece of odd positive itegers: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29...

More information

FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10

FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10 FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10 [C] Commuicatio Measuremet A1. Solve problems that ivolve liear measuremet, usig: SI ad imperial uits of measure estimatio strategies measuremet strategies.

More information