Approximating Cross-validatory Predictive Evaluation in Bayesian Latent Variables Models with Integrated IS and WAIC
|
|
|
- Dale Hoover
- 10 years ago
- Views:
Transcription
1 Approxmatng Cross-valdatory Predctve Evaluaton n Bayesan Latent Varables Models wth Integrated IS and WAIC Longha L Department of Mathematcs and Statstcs Unversty of Saskatchewan Saskatoon, SK, CANADA 3 Aprl 2014
2 Acknowledgements Jont work wth Sh Qu, Be Zhang and Cndy X. Feng. The work was supported by grants from Natural Scences and Engneerng Research Councl of Canada (NSERC) and Canada Foundaton for Innovaton (CFI). Thank Dr. Yao and Dr. Du for ther warm hostng of my vst to Kansas State Unversty.
3 Outlne 1 Introducton 2 Bayesan Models wth Unt-specfc Latent Varables 3 Cross-valdatory Predctve Evaluaton 4 Importance Samplng (IS) Approxmatons Non-ntegrated Importance Samplng (nis) Integrated Importance Samplng (IS) 5 WAIC Approxmatons Non-Integrated WAIC Integrated WAIC 6 Real Data Examples Mxture Models Correlated Random Spatal Effect Models CV Posteror p-values n Logstc Regresson 7 Conclusons and Future Work 8 References
4 Approxmatons for Out-of-Sample Predctve Evaluaton Predctve evaluaton s often used for model comparson, dagnostcs, and detectng outlers n practce. There are three ways for ths wth ther own advantages and lmtatons: Out of sample Valdaton Leave One Out Cross Valdaton Tranng Valdaton + Bas Correcton y obs 1 y obs 1 y obs 1 y obs 1 y obs 1 y obs 2 y obs 2 y obs 2 y obs 2 y obs y obs n y obs n y obs n y obs n y obs n y obs n+1 + a Correcton for Optmstc Bas Cross-valdaton wth Integrated IS and WAIC/1. Introducton/ 4/46
5 Revews of Bas-corrected Tranng Valdaton 1 AIC, DIC and others (eg., Spegelhalter et al. (2002), Celeux et al. (2006), Plummer (2008), and Ando (2007)). Partcularly, ) DIC = 2 (log P(y obs ˆθ) p DIC, where, (1) p DIC = 2[log P(y obs ˆθ) E post ( log(p(y obs θ)) ) ] (2) Good for models wth dentfable parameters. 2 Importance Samplng (eg. Gelfand et al. (1992)). For each unt: P(y obs y obs ) = 1/E post (1/P(y obs θ)) (3) 3 Wdely Applcable Informaton Crteron (WAIC, proposed by Watanabe (2009)). For each unt: y obs) = E post(p(y obs θ))/ exp [ ( V post log(p(y obs θ)) )] (4) P(y obs Applcable to models wth non-dentfable parameters, but not to models wth correlated unts. Cross-valdaton wth Integrated IS and WAIC/1. Introducton/ 5/46
6 What Wll We Propose? Two mproved methods (namely IS, and WAIC) nspred by mportance samplng formulae for Bayesan models wth unt-specfc latent varables that may be correlated. Cross-valdaton wth Integrated IS and WAIC/1. Introducton/ 6/46
7 Bayesan Models wth Unt-specfc Latent Varables The two methods to be proposed am at mprovng IS and WAIC evaluaton for such models: for = 1,, n for = 1,, n x covarate varables y observable varables for = 1,, n b θ model parameters latent varables Fgure 1: Graphcal representaton. The double arrows n the box for b 1:n mean possble dependency between b 1:n. Note that the covarate x wll be omtted n the condtons of denstes for b and y throughout ths paper for smplcty. Cross-valdaton wth Integrated IS and WAIC/2. Bayesan Models wth Unt-specfc Latent Varables/ 7/46
8 Posteror Dstrbuton Gven Full Data Suppose condtonal on θ, we have specfed a densty for y gven b : P(y b, θ), a jont pror densty for latent varables b 1:n : P(b 1:n θ), and a pror densty for θ: P(θ). The posteror of (b 1:n, θ) gven observatons y obs 1:n s proportonal to the jont densty of y obs 1:n, b 1:n, and θ: P post (θ, b 1:n y obs 1:n) = n j=1 P(y obs j b j, θ)p(b 1:n θ)p(θ)/c 1, (5) where C 1 s the normalzng constant nvolvng only wth y obs 1:n. Cross-valdaton wth Integrated IS and WAIC/2. Bayesan Models wth Unt-specfc Latent Varables/ 8/46
9 CV Posteror Dstrbutons To do cross-valdaton, for each = 1,..., n, we omt observaton, and then draw MCMC samples from CV posteror dstrbuton: y obs P post(-) (θ, b 1:n y obs ) = P(y obs j b j, θ)p(b 1:n θ)p(θ) / C 2, (6) j If we drop b from samples of (θ, b 1:n ) (6), we obtan samples of (θ, b ) from the margnalzed CV posteror: P post(-), M (θ, b y obs ) = P(y obs j b j, θ)p(b θ)p(θ) / C 2, (7) j where P(b θ) = P(b 1:n θ)db. It s useful to note that P post(-) (θ, b 1:n y obs ) = P post(-), M (θ, b y obs )P(b b, θ) (8) Samplng P post(-) = samplng P post(-), M + drawng b P(b b, θ). Cross-valdaton wth Integrated IS and WAIC/3. Cross-valdatory Predctve Evaluaton/ 9/46
10 CV Posteror Predctve Evaluaton: General Suppose we specfy an evaluaton functon a(y obs, θ, b ) that measures certan goodness-of-ft (or dscrepancy) of the dstrbuton P(y θ, b ) to the actual observaton y obs. CV posteror predctve evaluaton s defned as the expectaton of the a(y obs 1:n,.,.) wth respect to P post(-)(θ, b 1:n y obs ) gven n equatons (8) or (6): E post(-) (a(y obs, θ, b )) = a(y obs, θ, b )P post(-) (θ, b 1:n y obs )dθdb 1:n (9) Cross-valdaton wth Integrated IS and WAIC/3. Cross-valdatory Predctve Evaluaton/ 10/46
11 CV Posteror Predctve Evaluaton: Two Specfc Cases 1 Let a be the value of predctve densty: a(y obs, θ, b ) = P(y obs θ, b ). (10) Then E post(-) (a(y obs, θ, b )) = P(y obs y obs ) (11) We call t CV posteror predctve densty for the held-out unt y obs. CV nformaton crteron (CVIC) for evaluatng a Bayesan model s: CVIC = 2 2 Let a be a tal probablty: a(y obs n =1 log(p(y obs y obs )). (12), θ, b ) = Pr(y > y obs θ, b ) + 0.5Pr(y = y obs θ, b ), (13) Then, E post(-) (a(y obs, θ, b )) = Pr(y > y obs call t CV posteror p-value. y obs ) + 0.5Pr(y = y obs y obs ). We Cross-valdaton wth Integrated IS and WAIC/3. Cross-valdatory Predctve Evaluaton/ 11/46
12 Non-ntegrated IS (nis) Approxmaton: General If our samples are from P post (θ, b 1:n y obs 1:n ), but we are nterested n estmatng the mean of a wth respect to P post(-) (θ, b 1:n y obs ) as n (9), mportance weghtng method s based on the followng equalty for CV expected evaluaton: E post(-) (a(y obs, θ, b )) = E [ post a(y obs, θ, b )W nis (θ, b 1:n ) ] [ E post W nis (θ, b 1:n ) ], (14) where E post [ ] s expectaton wth respect to P post (θ, b 1:n y obs 1:n ), and W nis ) (θ, b 1:n ) = P post(-)(θ, b 1:n y obs P post (θ, b 1:n y obs 1:n ) C 2 = C 1 1 P(y obs θ, b ). (15) Gelfand et al. (1992) may be the frst to propose ths method. Cross-valdaton wth Integrated IS and WAIC/4. Importance Samplng (IS) Approxmatons/nIS 12/46
13 nis Estmate of CVIC To estmate CVIC, we set a(y obs, θ, b ) = P(y obs θ, b ), the CV posteror predctve densty P(y obs y obs ) s equal to harmonc mean of the non-ntegrated predctve densty P(y obs θ, b ) wth respect to P(θ, b 1:n y obs 1:n ): P(y obs y obs ) = 1 E post [ 1/P(y obs θ, b ) ]. (16) Based on (16), nis estmates the CV posteror predctve densty by: ˆP nis (y obs y obs ) = 1 Ê post [ 1/P(y obs θ, b ) The correspondng nis estmate of CVIC usng (17) s ĈVIC nis = 2 n =1 ]. (17) log( ˆP nis (y obs y obs )) (18) However, nis often doesn t work well because b fts y obs too well. Cross-valdaton wth Integrated IS and WAIC/4. Importance Samplng (IS) Approxmatons/nIS 13/46
14 Theory for Integrated Importance Samplng (IS): I 1 Integrated Evaluaton Functon Rewrte the expectaton n (9) as E post(-) (a(y obs, θ, b )) = E post(-), M (A(y obs, θ, b )) (19) = A(y obs, θ, b )P(θ, b y obs )dθdb (20) where, A(y obs, θ, b ) = a(y obs, θ, b )P(b b, θ)db. (21) Note: In (21), we ntegrate a(y obs, θ, b ) wth respect to P(b b, θ), whch s uncondtonal on y obs. Cross-valdaton wth Integrated IS and WAIC/4. Importance Samplng (IS) Approxmatons/IS 14/46
15 Theory for Integrated Importance Samplng (IS): II 2 Integrated Predctve Densty The full data posteror of (θ, b ) s P post, M (θ, b y obs )= where, [ j P(y obs θ, b ) = ] P(y obs j b j, θ)p(b θ)p(θ) P(y obs θ, b )/C 1, (22) P(y obs b, θ)p(b b, θ)db. (23) We wll call (23) ntegrated predctve densty, because t ntegrates away b wthout reference to y obs. Cross-valdaton wth Integrated IS and WAIC/4. Importance Samplng (IS) Approxmatons/IS 15/46
16 Theory for Integrated Importance Samplng (IS): III 3 Integrated Importance Samplng Formula Usng the standard mportance weghtng method, we wll estmate (20) by [ A(y obs E post(-), M (A(y obs where W IS, θ, b )) = E post, M, θ, b ) W IS (θ, b ) ] [ E post, M W IS (θ, b ) ], s the ntegrated mportance weght: (24) W IS ) (θ, b ) = P post(-), M(θ, b y obs P post, M (θ, b y obs ) C 2 = C 1 1 P(y obs θ, b ). (25) In summary, n IS, we ntegrate the evaluaton functon a(y obs, θ, b ) and P(y obs θ, b ) over b drawn from P(b b, θ), whch s uncondtonal on, to fnd ntegrated evaluaton and predctve densty functons. y obs Cross-valdaton wth Integrated IS and WAIC/4. Importance Samplng (IS) Approxmatons/IS 16/46
17 IS Estmate for CVIC In the specal case of estmatng CVIC, the evaluaton functon a s just the predctve densty P(y obs θ, b ), therefore, A s just recprocal of W IS Therefore, the IS estmate for P(y obs y obs ) s ˆP IS (y obs y obs ) = Accordngly, IS estmate of CVIC s ĈVIC IS = 2 1 Ê post, M [ 1/P(y obs θ, b ) n =1. ]. (26) log( ˆP IS (y obs y obs )) (27) Cross-valdaton wth Integrated IS and WAIC/4. Importance Samplng (IS) Approxmatons/IS 17/46
18 WAIC for Models wthout Latent Varables Watanabe (2009) defnes a verson of WAIC for models wthout latent varables as follows: WAIC = 2 n =1 [ log(epost (P(y obs θ))) V post (log(p(y obs θ))) ], (28) where E post and V post stand for mean and varance over θ wth respect to P(θ y obs 1,..., y obs n ). By comparng the forms of WAIC and CVIC, we can thnk of that n WAIC, the CV posteror predctve densty s estmated by: ˆP WAIC (y obs y obs ) = exp { log(e post (P(y obs θ))) V post (log(p(y obs θ))) }. (29) Cross-valdaton wth Integrated IS and WAIC/5. WAIC Approxmatons/ 18/46
19 nwaic for Latent Varables Models For the models wth possbly correlated latent varables, a nave way to approxmate CVIC s to apply WAIC drectly to the non-ntegrated predctve densty of y obs condtonal on θ and b : ˆP nwaic (y obs y obs ) = exp { log(e post (P(y obs θ, b ))) V post (log(p(y obs θ, b ))) }. (30) We wll refer to (30) as non-ntegrated WAIC (or nwaic for short) method for approxmatng CV posteror predctve densty. The correspondng nformaton crteron based on (30) s: nwaic = 2 n =1 log( ˆP nwaic (y obs y obs )). (31) Cross-valdaton wth Integrated IS and WAIC/5. WAIC Approxmatons/nWAIC 19/46
20 WAIC for Latent Varables Models Usng heurstcs, we propose to apply WAIC approxmaton to the ntegrated predctve densty (23) to estmate the CV posteror predctve densty: ˆP WAIC (y obs y obs ) = exp { log(e post (P(y obs θ, b ))) V post (log(p(y obs θ, b ))) }. (32) Accordngly, WAIC for approxmatng CVIC s gven by : WAIC = 2 n =1 log( ˆP WAIC (y obs y obs )). (33) Cross-valdaton wth Integrated IS and WAIC/5. WAIC Approxmatons/WAIC 20/46
21 Galaxy Data We obtaned the data set from R package MASS. The data set s a numerc vector of veloctes (km/sec) of 82 galaxes from 6 well-separated conc sectons of an unflled survey of the Corona Boreals regon. Densty Densty Densty (a) K = 4 (b) K = 5 (c) K = 6 Fgure 2: Hstograms of Galaxy data and three estmated densty curves usng MCMC samples from fttng fnte mxture models wth dfferent numbers of components, K = 4, 5, 6 and the full data set. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Mxture Models 21/46
22 Mxture Models wth a Fxed Number, K, of Components We appled mxture models to ft the 82 numbers. The fnte mxture model that we used to ft Galaxy data s as follows: y z = k, µ 1:K, σ 1:K N(µ k, σk 2 ), for = 1,..., n (34) z p 1:K Category(p 1,..., p K ), for = 1,..., n (35) µ k N(20, 10 4 ), for k = 1,..., K (36) σ 2 k Inverse-Gamma(0.01, ), for k = 1,..., K (37) p k Drchlet(1,..., 1) for k = 1,..., K (38) Here we set the pror mean of µ k to 20, whch s the mean of the 82 numbers, and set the scale for Inverse Gamma pror for σk 2 to 20, whch s the varance of the 82 numbers. Our purpose of computng CVIC for fnte mxture models s to determne the numbers of mxture components, K, that can adequately capture the heterogenety n a data but don t overft the data. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Mxture Models 22/46
23 How Dd We Run MCMC? We used JAGS to run MCMC smulatons for fttng the above model to Galaxy data wth varous choce of K. To avod the problem that MCMC may get stuck n a model wth only one component, we followed JAGS eyes example to restrct the MCMC to have at least a data pont n each component. All MCMC smulatons started wth a randomly generated z 1:n, and ran 5 parallel chans, each dong 2000, 2000, and 100,000 teratons for adaptng, burnng, and samplng, respectvely. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Mxture Models 23/46
24 The Mxture Model s a Latent Varable Model The fnte mxture model (equatons (35) - (38)) falls n the class of models depcted by Fgure 1: the observed varable s y, the latent varable b s the mxture component ndcator z, and the model parameters θ s (µ 1:K, σ 2 1:K, p 1:K ). In ths model, the latent varables z 1,..., z n n ths model are ndependent gven the model parameter θ. It follows that y 1,..., y n are ndependent gven θ. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Mxture Models 24/46
25 Computng nis, IS, nwaic, WAIC n Mxture Models For each MCMC sample of (θ, z 1,..., z n ) and each unt, we compute The non-ntegrated predctve densty: P(y obs z, θ) = φ(y obs µ z, σ z ). The ntegrated predctve densty: P(y obs θ, z ) = P(y obs θ) = K k=1 p k φ(y obs µ k, σ k ) (39) Notes: 1) z and y are ndependent gven θ. 2) the large component, not the component close to y obs, domnates (39) Then we can compute nis, IS, nwaic and WAIC. We see that, to compute IS and WAIC, we just apply IS and WAIC to the margnalzed models wth z 1:n ntegrated out, although z 1:n are ncluded n MCMC smulatons. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Mxture Models 25/46
26 Comparson of 5 Informaton Crtera Table 1: Comparson of 5 nformaton crtera for mxture models. The numbers are the averages of ICs from 100 ndependent MCMC smulatons. The numbers n brackets ndcates standard devatons. K DIC nwaic nis WAIC IS CVIC (1.64) (0.39) (3.45) (0.14) (0.17) (45.12) (9.94) (6.17) (4.70) (3.79) (31.58) (1.87) (5.32) (0.53) (0.54) (25.34) (0.29) (3.70) (0.09) (0.10) (17.48) (1.33) (5.72) (0.27) (0.31) (18.57) (0.30) (4.49) (0.07) (0.12) Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Mxture Models 26/46
27 Comparson of Statstcal Sgnfcance CVIC s the sum of mnus twce of log CV posteror predctve denstes. Therefore, the statstcal sgnfcance of the dfferences of two CVICs (or estmates) can be accessed by lookng at the populaton mean dfferences of two groups of log CV posteror predctve denstes (or ther estmates). Table 2: One-sded pared t-test p-values for comparng means of 82 log posteror predctve denstes for Galaxy data gven by mxture models wth dfferent number of mxture components, K. par of models nwaic nis WAIC IS CVIC K =3 vs K = K = 4 vs K = K = 5 vs K = K = 6 vs K = K = 7 vs K = Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Mxture Models 27/46
28 Vsualze the Need of Integratng z Fgure 3: Scatter-plot of non-ntegrated predctve denstes aganst µ z, gven MCMC samples from the full data posteror (4a) and the actual CV posteror wth the 3rd number removed (4b), when K = 5 components are used log harmonc mean = µ Z log(p(y µz σ Z )) (a) log mean = µ Z log(p(y µz σ Z )) (b) Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Mxture Models 28/46
29 Scottsh Lp Cancer Data I The data represents male lp cancer counts (over the perod ) n the n = 56 dstrcts of Scotland. The data ncludes these columns: the number of observed cases of lp cancer, y ; the number of expected cases, E, whch are based on age effects, and are proportonal to a populaton at rsk after such effects have been taken nto account; the percent of populaton employed n agrculture, fshng and forestry, x, used as a covarate; and a lst of the neghbourng regons. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Correlated Random Spatal Effect Models 29/46
30 Scottsh Lp Cancer Data II Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Correlated Random Spatal Effect Models 30/46
31 Four Models Consdered: I The y s modelled as a Posson random varable: y E, λ Posson(λ E ), (40) where λ denotes the underlyng relatve rsk for dstrct. Let s = log(λ ). We consder four dfferent models for the vector s = (s 1,, s n ) : model 1 (spatal+lnear, full) : s N n (α + X β, Φτ 2 ), (41) model 2 (spatal) : s N n (α, Φτ 2 ), (42) model 3 (lnear) : s N n (α + X β, I n τ 2 ), (43) model 4 (exchangable) : s N n (α, I n τ 2 ), (44) where Φ specfy spatal assocaton between dstrcts, wth detals follow. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Correlated Random Spatal Effect Models 31/46
32 Four Models Consdered: II Φ = (I n φc) 1 M s a matrx modellng spatal dependency, n whch, c j = (E j /E ) 1/2 f areas and j are neghbours, equals to 0 otherwse, m = E 1 and m j = 0 f j. Ths model s called proper condtonal auto regresson (CAR) model. Lookng at the condtonal dstrbuton of s s, α, β, φ (48) may help understand ths dstrbuton. At a hgher level, we assgn β, τ, andφ wth very dffuse pror: τ 2 Inv-Gamma(0.5, ) (45) β N(0, ) (46) φ Unf(φ 0, φ 1 ), (47) where (φ 0, φ 1 ) s the nterval for φ such that Φ s postve-defnte. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Correlated Random Spatal Effect Models 32/46
33 How Dd we Run MCMC? We used OpenBUGS through R package R2OpenBUGS to run MCMC smulatons for fttng the above four models to lp cancer data. For each smulaton, we ran two parrallel chans, each for teratons, and the frst 5000 were dscarded as burnng. For replcatng computng nformaton crteron (wth each method), we ran 100 ndependent smulatons as above by randomzng ntal θ and randomzng bugs random seed for OpenBUGS. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Correlated Random Spatal Effect Models 33/46
34 The Posson Model s a Latent Varable Model the observed varable s y, the latent varable b s s (or λ ) the model parameters θ s (τ, β, φ). In models 1 and 2, the latent varables s 1,..., s n are dependent gven the model parameter θ. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Correlated Random Spatal Effect Models 34/46
35 Computng IS and WAIC n Model 1 For each unt, and for each MCMC sample of (s, θ): Condtonal dstrbuton (proper auto regresson): s s, θ N(α + x β + φ j N (c j (s j α x j β)), τ 2 m ), (48) where N s the set of neghbours of dstrct. Integrated predctve densty: P(y obs θ, s ) = dposson(y obs λ E )P(s θ, s )ds (49) We generate 200 random numbers of s from the dstrbuton (48), and then estmate the ntegral n (49). Then we can compute IS and WAIC wth the ntegrated predctve densty. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Correlated Random Spatal Effect Models 35/46
36 Comparson of 5 Informaton Crtera Table 3: Comparsons of nformaton crtera for lp cancer data. Each table entry shows the average of 100 nformaton crtera computed from 100 ndependent MCMC smulatons, and the standard devaton n bracket. Model CVIC DIC WAIC IS nwaic nis full (12.30) (0.12) (0.19) (0.21) (1.27) spatal (10.15) (0.06) (0.37) (0.18) (1.85) lnear (0.11) (0.05) (0.05) (0.21) (3.02) exch (0.12) (0.03) (0.03) (0.17) (3.46) Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/Correlated Random Spatal Effect Models 36/46
37 Seeds Data The example s taken from Table 3 of Crowder (1978). The study concerns about the proporton of seeds that germnated on each of 21 plates arranged accordng to a 2 by 2 factoral layout by seed and type of root extract. For = 1,..., 21, let r be the number of germnated seeds n the th plate, n be the total number of seeds n the th plate, x 1 be the seed type (0/1), and x 2 be root extract (0/1). Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/CV Posteror p-values n Logstc Regresson 37/46
38 Logstc Regresson Models wth Random Effects The condtonal dstrbuton of r gven n, x 1 and x 2 are specfed as follows: r n, p Bnomal(n, p ) (50) logt(p ) = α 0 + α 1 x 1 + α 2 x 2 + α 12 x 1 x 2 + b (51) b N(0, σ 2 ), (52) and parameters α 0, α 1, α 2, α 12 are assgned wth N(0, 10 6 ) as pror, and σ 2 s assgned wth Inverse-Gamma (0.001, 0.001) as pror. Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/CV Posteror p-values n Logstc Regresson 38/46
39 CV Posteror p-value for Outler Detecton The p-value (gven parameters and latent varable) defned by (13) for ths example s the rght tal probablty of Bnomal dstrbuton wth number of trals n and success rate p : p-value(r obs, θ, b ) = 1 pbnom(r obs ; n, p ) dbnom(r obs ; n, p ), (53) where r obs s the actual observaton of r, and pbnom and dbnom denote CDF and PMF of Bnomal dstrbuton. CV posteror p-value for observaton r obs s the mean of p-value(r obs, θ, b ) wth respect to the CV posteror dstrbuton P(θ, b r obs ). Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/CV Posteror p-values n Logstc Regresson 39/46
40 Four Methods for Approxmatng CV Posteror p-value: I Posteror check (Gelman et al. 1996) Average each p-value(r obs, θ, b ) wth respect to the posteror of (θ, b ) gven the full data set r obs 1:21. Ghostng method (Marshall and Spegelhater, 2003) For each MCMC sample, one averages p-value(r obs, θ, b ) wth respect to the condtonal dstrbuton of b gven θ (but wthout r obs ) to obtan ghostng p-value, then averages the ghostng p-values over all MCMC samples. nis: Average p-value(r obs, θ, b ) after beng weghted wth the nverse of probablty densty (mass) of r obs : 1/dbnom(r obs ; n, p ) Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/CV Posteror p-values n Logstc Regresson 40/46
41 Four Methods for Approxmatng CV Posteror p-value: II IS: For each MCMC sample, we frst average each of p-value(r obs, θ, b ) and dbnom(r obs ; n, p ) over 30 b randomly generated from b θ to fnd the ntegrated p-value and the ntegrated predctve densty respectvely. Then compute the weghted average of the ntegrated p-values wth the reversed ntegrated predctve densty as weghts over all MCMC samples usng formula (24). Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/CV Posteror p-values n Logstc Regresson 41/46
42 Comparson of Estmated CV Posteror p-value I Fgure 4: Scatterplots of estmated posteror p-values from an MCMC smulaton aganst actual CV posteror p-values. The number for ponts show ndces of plates Estmated p value by Posteror Check Estmated p value by Ghost method CV Posteror p value (a) Posteror checkng CV Posteror p value (b) Ghostng method Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/CV Posteror p-values n Logstc Regresson 42/46
43 Comparson of Estmated CV Posteror p-value II Estmated p value by nis Estmated p value by IS CV Posteror p value (c) Non-ntegrated IS (nis) CV Posteror p value (d) Integrated IS (IS) Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/CV Posteror p-values n Logstc Regresson 43/46
44 Replcaton Study To measure more precsely the accuracy of estmated p-values to the actual CV p-values, we use absolute relatve error n percentage scale defned as RE = (1/n) n =1 ˆp p 100, (54) mn(p, 1 p ) where ˆp 1:n are estmates of p 1:n. Ths measure emphaszes greatly on the error between ˆp and p when p s very small or very large, for whch we demand more on absolute error than when p s close to 0.5. Table 4: Comparsons of the averages of 100 absolute relatve errors (n percentage) of estmated CV p-values from 100 ndependent MCMC smulatons, for logstc regresson example. The numbers n brackets ndcate standard devatons. IS nis Ghostng Posteror checkng 2.319(0.399) 5.234(1.083) (1.267) (3.854) Cross-valdaton wth Integrated IS and WAIC/6. Real Data Examples/CV Posteror p-values n Logstc Regresson 44/46
45 Conclusons and Future Work The new proposed IS and WAIC sgnfcantly reduce the bas of nis and nwaic n evaluatng Bayesan models wth unt-specfc latent varables. In our studes, they gave results very close to what gven by the actual cross-valdaton. WAIC works very well n the spatal random effect models. The result s surprsng and encouragng. One may consder nvestgatng the valdty of WAIC theoretcally. IS and WAIC are lmted to Bayesan model wth unt-specfc latent varables. In many models, a latent varable s shared by multple unts. How to mprove IS and WAIC for such models? Advancng applcatons to other models. An nterestng model s auto logstc regresson model for spatal data, where the model s defned wth condtonal dstrbuton only. I wll nvestgate the applcablty of IS, WAIC or cross-valdaton tself for comparson and dagnostcs of such models. Cross-valdaton wth Integrated IS and WAIC/7. Conclusons and Future Work/ 45/46
46 References To read more of ths topc, the followng s a short lst of references: Vehtar, A. and Ojanen, J. (2012), A survey of Bayesan predctve methods for model assessment, selecton and comparson, Statstcs Surveys, 6, Spegelhalter, D. J., Best, N. G., Carln, B. P., and van der Lnde, A. (2002), Bayesan measures of model complexty and ft, JRSSB, 64, Watanabe, S. (2009), Asymptotc Equvalence of Bayes Cross Valdaton and Wdely Applcable Informaton Crteron n Sngular Learnng Theory, Journal of Machne Learnng Research, 11, Gelman, A., Hwang, J., and Vehtar, A. (2013), Understandng predctve nformaton crtera for Bayesan models, unpublshed onlne manuscrpt, avalable from Gelman s webste. The paper wth more detals about ths talk can be found from: Cross-valdaton wth Integrated IS and WAIC/8. References/ 46/46
CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements
Lecture 3 Densty estmaton Mlos Hauskrecht [email protected] 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there
benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).
REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or
PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12
14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed
Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting
Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of
THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek
HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo
NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6
PAR TESTS If a WEIGHT varable s specfed, t s used to replcate a case as many tmes as ndcated by the weght value rounded to the nearest nteger. If the workspace requrements are exceeded and samplng has
GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM
GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM BARRIOT Jean-Perre, SARRAILH Mchel BGI/CNES 18.av.E.Beln 31401 TOULOUSE Cedex 4 (France) Emal: [email protected] 1/Introducton The
What is Candidate Sampling
What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble
CHAPTER 14 MORE ABOUT REGRESSION
CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp
Can Auto Liability Insurance Purchases Signal Risk Attitude?
Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang
Forecasting the Direction and Strength of Stock Market Movement
Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye [email protected] [email protected] [email protected] Abstract - Stock market s one of the most complcated systems
DEFINING %COMPLETE IN MICROSOFT PROJECT
CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,
How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence
1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh
STATISTICAL DATA ANALYSIS IN EXCEL
Mcroarray Center STATISTICAL DATA ANALYSIS IN EXCEL Lecture 6 Some Advanced Topcs Dr. Petr Nazarov 14-01-013 [email protected] Statstcal data analyss n Ecel. 6. Some advanced topcs Correcton for
Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006
Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model
An Alternative Way to Measure Private Equity Performance
An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate
Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification
Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson
How To Calculate The Accountng Perod Of Nequalty
Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.
Regression Models for a Binary Response Using EXCEL and JMP
SEMATECH 997 Statstcal Methods Symposum Austn Regresson Models for a Bnary Response Usng EXCEL and JMP Davd C. Trndade, Ph.D. STAT-TECH Consultng and Tranng n Appled Statstcs San Jose, CA Topcs Practcal
Realistic Image Synthesis
Realstc Image Synthess - Combned Samplng and Path Tracng - Phlpp Slusallek Karol Myszkowsk Vncent Pegoraro Overvew: Today Combned Samplng (Multple Importance Samplng) Renderng and Measurng Equaton Random
8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by
6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng
The Application of Fractional Brownian Motion in Option Pricing
Vol. 0, No. (05), pp. 73-8 http://dx.do.org/0.457/jmue.05.0..6 The Applcaton of Fractonal Brownan Moton n Opton Prcng Qng-xn Zhou School of Basc Scence,arbn Unversty of Commerce,arbn [email protected]
Risk Model of Long-Term Production Scheduling in Open Pit Gold Mining
Rsk Model of Long-Term Producton Schedulng n Open Pt Gold Mnng R Halatchev 1 and P Lever 2 ABSTRACT Open pt gold mnng s an mportant sector of the Australan mnng ndustry. It uses large amounts of nvestments,
) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance
Calbraton Method Instances of the Cell class (one nstance for each FMS cell) contan ADC raw data and methods assocated wth each partcular FMS cell. The calbraton method ncludes event selecton (Class Cell
Calculation of Sampling Weights
Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample
Binomial Link Functions. Lori Murray, Phil Munz
Bnomal Lnk Functons Lor Murray, Phl Munz Bnomal Lnk Functons Logt Lnk functon: ( p) p ln 1 p Probt Lnk functon: ( p) 1 ( p) Complentary Log Log functon: ( p) ln( ln(1 p)) Motvatng Example A researcher
Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation
Exhaustve Regresson An Exploraton of Regresson-Based Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The
1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)
6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes
1 De nitions and Censoring
De ntons and Censorng. Survval Analyss We begn by consderng smple analyses but we wll lead up to and take a look at regresson on explanatory factors., as n lnear regresson part A. The mportant d erence
The OC Curve of Attribute Acceptance Plans
The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4
Luby s Alg. for Maximal Independent Sets using Pairwise Independence
Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent
Brigid Mullany, Ph.D University of North Carolina, Charlotte
Evaluaton And Comparson Of The Dfferent Standards Used To Defne The Postonal Accuracy And Repeatablty Of Numercally Controlled Machnng Center Axes Brgd Mullany, Ph.D Unversty of North Carolna, Charlotte
The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis
The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna [email protected] Abstract.
Survival analysis methods in Insurance Applications in car insurance contracts
Survval analyss methods n Insurance Applcatons n car nsurance contracts Abder OULIDI 1 Jean-Mare MARION 2 Hervé GANACHAUD 3 Abstract In ths wor, we are nterested n survval models and ther applcatons on
BERNSTEIN POLYNOMIALS
On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful
PRACTICE 1: MUTUAL FUNDS EVALUATION USING MATLAB.
PRACTICE 1: MUTUAL FUNDS EVALUATION USING MATLAB. INDEX 1. Load data usng the Edtor wndow and m-fle 2. Learnng to save results from the Edtor wndow. 3. Computng the Sharpe Rato 4. Obtanng the Treynor Rato
Estimation of Dispersion Parameters in GLMs with and without Random Effects
Mathematcal Statstcs Stockholm Unversty Estmaton of Dsperson Parameters n GLMs wth and wthout Random Effects Meng Ruoyan Examensarbete 2004:5 Postal address: Mathematcal Statstcs Dept. of Mathematcs Stockholm
Control Charts with Supplementary Runs Rules for Monitoring Bivariate Processes
Control Charts wth Supplementary Runs Rules for Montorng varate Processes Marcela. G. Machado *, ntono F.. Costa * * Producton Department, Sao Paulo State Unversty, Campus of Guaratnguetá, 56-4 Guaratnguetá,
Lecture 3: Force of Interest, Real Interest Rate, Annuity
Lecture 3: Force of Interest, Real Interest Rate, Annuty Goals: Study contnuous compoundng and force of nterest Dscuss real nterest rate Learn annuty-mmedate, and ts present value Study annuty-due, and
Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College
Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure
Imperial College London
F. Fang 1, C.C. Pan 1, I.M. Navon 2, M.D. Pggott 1, G.J. Gorman 1, P.A. Allson 1 and A.J.H. Goddard 1 1 Appled Modellng and Computaton Group Department of Earth Scence and Engneerng Imperal College London,
1. Measuring association using correlation and regression
How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a
Characterization of Assembly. Variation Analysis Methods. A Thesis. Presented to the. Department of Mechanical Engineering. Brigham Young University
Characterzaton of Assembly Varaton Analyss Methods A Thess Presented to the Department of Mechancal Engneerng Brgham Young Unversty In Partal Fulfllment of the Requrements for the Degree Master of Scence
An Interest-Oriented Network Evolution Mechanism for Online Communities
An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne
Analysis of Premium Liabilities for Australian Lines of Business
Summary of Analyss of Premum Labltes for Australan Lnes of Busness Emly Tao Honours Research Paper, The Unversty of Melbourne Emly Tao Acknowledgements I am grateful to the Australan Prudental Regulaton
Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008
Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn
Prediction of Disability Frequencies in Life Insurance
Predcton of Dsablty Frequences n Lfe Insurance Bernhard Köng Fran Weber Maro V. Wüthrch October 28, 2011 Abstract For the predcton of dsablty frequences, not only the observed, but also the ncurred but
Economic Interpretation of Regression. Theory and Applications
Economc Interpretaton of Regresson Theor and Applcatons Classcal and Baesan Econometrc Methods Applcaton of mathematcal statstcs to economc data for emprcal support Economc theor postulates a qualtatve
SIMPLE LINEAR CORRELATION
SIMPLE LINEAR CORRELATION Smple lnear correlaton s a measure of the degree to whch two varables vary together, or a measure of the ntensty of the assocaton between two varables. Correlaton often s abused.
Damage detection in composite laminates using coin-tap method
Damage detecton n composte lamnates usng con-tap method S.J. Km Korea Aerospace Research Insttute, 45 Eoeun-Dong, Youseong-Gu, 35-333 Daejeon, Republc of Korea [email protected] 45 The con-tap test has the
Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur
Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..
Vision Mouse. Saurabh Sarkar a* University of Cincinnati, Cincinnati, USA ABSTRACT 1. INTRODUCTION
Vson Mouse Saurabh Sarkar a* a Unversty of Cncnnat, Cncnnat, USA ABSTRACT The report dscusses a vson based approach towards trackng of eyes and fngers. The report descrbes the process of locatng the possble
A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña
Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION
L10: Linear discriminants analysis
L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss
Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network
700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School
Portfolio Loss Distribution
Portfolo Loss Dstrbuton Rsky assets n loan ortfolo hghly llqud assets hold-to-maturty n the bank s balance sheet Outstandngs The orton of the bank asset that has already been extended to borrowers. Commtment
Extending Probabilistic Dynamic Epistemic Logic
Extendng Probablstc Dynamc Epstemc Logc Joshua Sack May 29, 2008 Probablty Space Defnton A probablty space s a tuple (S, A, µ), where 1 S s a set called the sample space. 2 A P(S) s a σ-algebra: a set
Logistic Regression. Steve Kroon
Logstc Regresson Steve Kroon Course notes sectons: 24.3-24.4 Dsclamer: these notes do not explctly ndcate whether values are vectors or scalars, but expects the reader to dscern ths from the context. Scenaro
Estimation of Attrition Biases in SIPP
Estmaton of Attrton Bases n SIPP Erc V. Slud 1 and Leroy Baley 2 1 Census Bureau, SRD, and Unv. of Maryland College Park 2 Census Bureau, Statstcal Research Dvson Abstract Ths paper studes estmators for
An artificial Neural Network approach to monitor and diagnose multi-attribute quality control processes. S. T. A. Niaki*
Journal of Industral Engneerng Internatonal July 008, Vol. 4, No. 7, 04 Islamc Azad Unversty, South Tehran Branch An artfcal Neural Network approach to montor and dagnose multattrbute qualty control processes
CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol
CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL
CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES
CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES In ths chapter, we wll learn how to descrbe the relatonshp between two quanttatve varables. Remember (from Chapter 2) that the terms quanttatve varable
Statistical Methods to Develop Rating Models
Statstcal Methods to Develop Ratng Models [Evelyn Hayden and Danel Porath, Österrechsche Natonalbank and Unversty of Appled Scences at Manz] Source: The Basel II Rsk Parameters Estmaton, Valdaton, and
A Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy S-curve Regression
Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy S-curve Regresson Cheng-Wu Chen, Morrs H. L. Wang and Tng-Ya Hseh Department of Cvl Engneerng, Natonal Central Unversty,
ERP Software Selection Using The Rough Set And TPOSIS Methods
ERP Software Selecton Usng The Rough Set And TPOSIS Methods Under Fuzzy Envronment Informaton Management Department, Hunan Unversty of Fnance and Economcs, No. 139, Fengln 2nd Road, Changsha, 410205, Chna
SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background:
SPEE Recommended Evaluaton Practce #6 efnton of eclne Curve Parameters Background: The producton hstores of ol and gas wells can be analyzed to estmate reserves and future ol and gas producton rates and
Diagnostic Tests of Cross Section Independence for Nonlinear Panel Data Models
DISCUSSION PAPER SERIES IZA DP No. 2756 Dagnostc ests of Cross Secton Independence for Nonlnear Panel Data Models Cheng Hsao M. Hashem Pesaran Andreas Pck Aprl 2007 Forschungsnsttut zur Zukunft der Arbet
Fragility Based Rehabilitation Decision Analysis
.171. Fraglty Based Rehabltaton Decson Analyss Cagdas Kafal Graduate Student, School of Cvl and Envronmental Engneerng, Cornell Unversty Research Supervsor: rcea Grgoru, Professor Summary A method s presented
Georey E. Hinton. University oftoronto. Email: [email protected]. Technical Report CRG-TR-96-1. May 21, 1996 (revised Feb 27, 1997) Abstract
The EM Algorthm for Mxtures of Factor Analyzers Zoubn Ghahraman Georey E. Hnton Department of Computer Scence Unversty oftoronto 6 Kng's College Road Toronto, Canada M5S A4 Emal: [email protected] Techncal
How To Understand The Results Of The German Meris Cloud And Water Vapour Product
Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller
The Racial and Gender Interest Rate Gap. in Small Business Lending: Improved Estimates Using Matching Methods*
The Racal and Gender Interest Rate Gap n Small Busness Lendng: Improved Estmates Usng Matchng Methods* Yue Hu and Long Lu Department of Economcs Unversty of Texas at San Antono Jan Ondrch and John Ynger
Estimating Age-specific Prevalence of Testosterone Deficiency in Men Using Normal Mixture Models
Journal of Data Scence 7(2009), 203-217 Estmatng Age-specfc Prevalence of Testosterone Defcency n Men Usng Normal Mxture Models Yungta Lo Mount Sna School of Medcne Abstract: Testosterone levels declne
Searching for Interacting Features for Spam Filtering
Searchng for Interactng Features for Spam Flterng Chuanlang Chen 1, Yun-Chao Gong 2, Rongfang Be 1,, and X. Z. Gao 3 1 Department of Computer Scence, Bejng Normal Unversty, Bejng 100875, Chna 2 Software
Properties of Indoor Received Signal Strength for WLAN Location Fingerprinting
Propertes of Indoor Receved Sgnal Strength for WLAN Locaton Fngerprntng Kamol Kaemarungs and Prashant Krshnamurthy Telecommuncatons Program, School of Informaton Scences, Unversty of Pttsburgh E-mal: kakst2,[email protected]
Inverse Modeling of Tight Gas Reservoirs
Inverse Modelng of Tght Gas Reservors Der Fakultät für Geowssenschaften, Geotechnk und Bergbau der Technschen Unverstät Bergakademe Freberg engerechte Dssertaton Zur Erlangung des akademschen Grades Doktor-Ingeneur
Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy
Fnancal Tme Seres Analyss Patrck McSharry [email protected] www.mcsharry.net Trnty Term 2014 Mathematcal Insttute Unversty of Oxford Course outlne 1. Data analyss, probablty, correlatons, vsualsaton
Transition Matrix Models of Consumer Credit Ratings
Transton Matrx Models of Consumer Credt Ratngs Abstract Although the corporate credt rsk lterature has many studes modellng the change n the credt rsk of corporate bonds over tme, there s far less analyss
Bayesian Network Based Causal Relationship Identification and Funding Success Prediction in P2P Lending
Proceedngs of 2012 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 25 (2012) (2012) IACSIT Press, Sngapore Bayesan Network Based Causal Relatonshp Identfcaton and Fundng Success
Joint Scheduling of Processing and Shuffle Phases in MapReduce Systems
Jont Schedulng of Processng and Shuffle Phases n MapReduce Systems Fangfe Chen, Mural Kodalam, T. V. Lakshman Department of Computer Scence and Engneerng, The Penn State Unversty Bell Laboratores, Alcatel-Lucent
How Much to Bet on Video Poker
How Much to Bet on Vdeo Poker Trstan Barnett A queston that arses whenever a gae s favorable to the player s how uch to wager on each event? Whle conservatve play (or nu bet nzes large fluctuatons, t lacks
HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA*
HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* Luísa Farnha** 1. INTRODUCTION The rapd growth n Portuguese households ndebtedness n the past few years ncreased the concerns that debt
An Empirical Study of Search Engine Advertising Effectiveness
An Emprcal Study of Search Engne Advertsng Effectveness Sanjog Msra, Smon School of Busness Unversty of Rochester Edeal Pnker, Smon School of Busness Unversty of Rochester Alan Rmm-Kaufman, Rmm-Kaufman
Decision Tree Model for Count Data
Proceedngs of the World Congress on Engneerng 2012 Vol I Decson Tree Model for Count Data Yap Bee Wah, Norashkn Nasaruddn, Wong Shaw Voon and Mohamad Alas Lazm Abstract The Posson Regresson and Negatve
Scale Dependence of Overconfidence in Stock Market Volatility Forecasts
Scale Dependence of Overconfdence n Stoc Maret Volatlty Forecasts Marus Glaser, Thomas Langer, Jens Reynders, Martn Weber* June 7, 007 Abstract In ths study, we analyze whether volatlty forecasts (judgmental
Lecture 5,6 Linear Methods for Classification. Summary
Lecture 5,6 Lnear Methods for Classfcaton Rce ELEC 697 Farnaz Koushanfar Fall 2006 Summary Bayes Classfers Lnear Classfers Lnear regresson of an ndcator matrx Lnear dscrmnant analyss (LDA) Logstc regresson
A Model of Private Equity Fund Compensation
A Model of Prvate Equty Fund Compensaton Wonho Wlson Cho Andrew Metrck Ayako Yasuda KAIST Yale School of Management Unversty of Calforna at Davs June 26, 2011 Abstract: Ths paper analyzes the economcs
Fixed income risk attribution
5 Fxed ncome rsk attrbuton Chthra Krshnamurth RskMetrcs Group [email protected] We compare the rsk of the actve portfolo wth that of the benchmark and segment the dfference between the two
Support Vector Machines
Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada [email protected] Abstract Ths s a note to explan support vector machnes.
1 Example 1: Axis-aligned rectangles
COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton
