Journal of Statistical Software

Size: px
Start display at page:

Download "Journal of Statistical Software"

Transcription

1 JSS Journal of Statstcal Software July 2010, Volume 35, Issue 2. %QLS SAS Macro: A SAS Macro for Analyss of Correlated Data Usng Quas-Least Squares Hanjoo Km Forest Research Insttute, Inc. Justne Shults Unversty of Pennsylvana Abstract Quas-least squares (QLS) s an alternatve computatonal approach for estmaton of the correlaton parameter n the framework of generalzed estmatng equatons (GEE). QLS overcomes some lmtatons of GEE that were dscussed n Crowder (1995). In addton, t allows for easer mplementaton of some correlaton structures that are not avalable for GEE. We descrbe a user wrtten SAS macro called %QLS, and demonstrate applcaton of our macro usng a clncal tral example for the comparson of two treatments for a common toenal nfecton. %QLS also computes the lower and upper boundares of the correlaton parameter for analyss of longtudnal bnary data that were descrbed by Prentce (1988). Furthermore, t dsplays a warnng message f the Prentce constrants are volated. Ths warnng s not provded n exstng GEE software packages and other packages that were recently developed for applcaton of QLS (n Stata, MATLAB, and R). %QLS allows for analyss of contnuous, bnary, or count data wth one of the followng workng correlaton structures: the frst-order autoregressve, equcorrelated, Markov, or tr-dagonal structures. Keywords: longtudnal data, generalzed estmatng equatons, quas-least squares, SAS. 1. Introducton Quas-least squares (QLS, Chaganty 1997; Shults and Chaganty 1998; Chaganty and Shults 1999) s a two-stage approach for fttng longtudnal data that are correlated over tme usng an assumed workng correlaton structure n the framework of generalzed estmatng equatons (GEE, Lang and Zeger 1986). Both QLS and GEE estmate the regresson parameter β va soluton of the same estmatng equaton whch contans an addtonal unknown correlaton parameter α. These methods alternate between updatng the estmate of β, and updatng the estmate of α teratvely, but they dffer wth respect to estmaton of α. GEE uses a moment estmate of α based on the Pearson resdual, whle QLS solves estmatng equatons for α that

2 2 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS are orthogonal to the GEE estmatng equaton of β. There are prmarly two mportant reasons to consder the mplementaton of QLS over GEE. Crowder (1995) ponted out that there may be no feasble estmate of α under msspecfcaton of the workng correlaton structure, whch could result n a falure to converge for the teratve GEE estmaton procedure. On the other hand, the QLS estmates of α are guaranteed to be feasble for several structures, ncludng the frst-order autoregressve (AR(1)), equcorrelated, Markov, and tr-dagonal structures. The other man advantage of the applcaton of QLS s that t allows for relatvely straghtforward mplementaton of more complex and/or bologcally plausble correlaton structures than are currently avalable n GEE. For example, Shults and Chaganty (1998) and Chaganty and Shults (1999) descrbed estmatng equatons for α for the Markov correlaton structure that s a generalzaton of the AR(1) structure for measurements that are unequally spaced n tme. Furthermore, QLS has been successfully mplemented n mult-outcome longtudnal studes whch contan multple sources of correlatons, e.g., correlatons between multple outcomes as well as wthn subject for each outcome over tme, usng Kronecker product-based correlaton structures (Shults and Morrow 2002; Chaganty and Nak 2002; Shults et al. 2004; Shults and Ratclffe 2009). In addton, Shults et al. (2006) mplemented a banded Toepltz structure that had prevously not been mplemented for GEE. There has been a consderable effort to dssemnate and promote the use of QLS va the development of user wrtten QLS programs n some of the major statstcal software packages. For example, Shults et al. (2007) developed xtqls usng Stata s propretary hgher programmng language (StataCorp. 2003). Xe and Shults (2009) provde a user wrtten QLS program called qlspack usng R. These programs estmate the regresson parameter β by callng exstng GEE procedures, e.g., xtgee n Stata and the user wrtten package geepack (Halekoh et al. 2006) n R, and then solvng the stage one and two estmatng equatons for α teratvely. Lastly, Ratclffe and Shults (2008) developed GEEQBOX software n MATLAB for the mplementaton of both GEE and QLS. Pror to ntroducton of GEEQBOX, there was no software avalable for the mplementaton of GEE n MATLAB. All these QLS programs can be downloaded from whle GEEQBOX s also avalable for download on the webste of the Journal of Statstcal Software provded n Ratclffe and Shults (2008). Ths manuscrpt reflects our ongong effort to develop QLS software n SAS (SAS Insttute Inc. 2003) whch s one of the most wdely used software packages, due to ts versatlty ncludng vared and powerful procedures for data management and statstcal analyss. In ths manuscrpt, we present our user wrtten SAS macro called %QLS for the mplementaton of QLS developed under SAS verson 9.1 usng SAS/IML, an nteractve matrx language n SAS. We demonstrate %QLS usng a randomzed clncal tral reported by Backer et al. (1996) for the comparson of two treatments for a common toenal nfecton. (See Secton 4 for more descrpton of the study). %QLS can be used for analyss of contnuous, bnary, or count data, wth an AR(1), equcorrelated, Markov, or tr-dagonal correlaton structure to descrbe the pattern of assocaton among the repeated measurements on each subject, or cluster. A unque feature of %QLS, whch s not avalable n current GEE packages and other userwrtten QLS software, s that t computes the so called Prentce boundary (Prentce 1988) of the estmate of α for analyss of bnary data (see Secton 2.5 for more detals). Our outlne for ths manuscrpt s as follows: Secton 2 descrbes each workng correlaton

3 Journal of Statstcal Software 3 structure mplemented n %QLS. Then t brefly descrbes QLS and the Prentce constrants for α. Secton 3 descrbes a complete lst of the parameters n %QLS. Fnally, Secton 4 demonstrates %QLS usng the clncal tral data (toenal data) for testng the tme-averaged dfference between the two treatment groups. Ths ncludes a demonstraton of an analyss that results n a volaton of the Prentce constrants, n addton to an applcaton of the Markov structure that currently s unavalable for GEE. 2. Descrpton of quas-least squares In ths secton, we provde a bref descrpton of QLS smlar to that provded n Shults et al. (2007); Ratclffe and Shults (2008) and Xe and Shults (2009). For a more through treatment of the development of QLS and ts related asymptotc dstrbutonal propertes, see Chaganty (1997) for a descrpton of stage one of QLS for balanced and equally spaced data, Shults and Chaganty (1998) for stage one of QLS for unbalanced and unequally spaced data, and Chaganty and Shults (1999) for the second stage of QLS. Pror to Chaganty and Shults (1999), the QLS estmate of α was n general not consstent, even f the correlaton structure was correctly specfed. As noted earler, QLS s a method n the framework of GEE. For an excellent text on GEE, see Hardn and Hlbe (2002). We consder outcome measurements y = (y 1,..., y n ) wth assocated covarates x j = (x j1,..., x jk ) collected at measurement occasons j = 1,..., n, for each subject = 1,..., m. For the model specfcaton, we assume that the mean and varance of the outcome varable satsfy E(y j ) = g 1 (x j β) = µ j and Var(y j ) = φh(µ j ), respectvely, where φ s known as the dsperson parameter. Unlke the generalzed lnear model (GLM), both GEE and QLS assume that the covarance matrx Cov(y ) = φa 1/2 R (α)a 1/2 where A = dag(h(µ 1 ),..., h(µ n )), and R (α) s known as the workng correlaton matrx that descrbes the pattern of assocaton among the repeated measurements on each subject Workng correlaton structures mplemented n %QLS %QLS currently allows for applcaton of the AR(1), equcorrelated, Markov, and tr-dagonal structures that are descrbed as follows: 1. The frst-order autoregressve (AR(1)) structure: Ths structure assumes that Corr(y j, y k ) = α j k, wth feasble regon ( 1, 1). The feasble regon for α s defned as the nterval on whch α yelds a postve defnte correlaton matrx. 2. The equcorrelated structure: Ths structure assumes that Corr(y j, y k ) = α. The feasble regon for ths structure s ( 1/(n m 1), 1), where n m s the maxmum value of n over = 1, 2,..., m. Note that for large n m, ths structure may be unrealstc unless all parwse correlatons are roughly dentcal across all tme ponts. 3. The Markov correlaton structure: Ths structure assumes that Corr(y j, y k ) = α t j t k, wth feasble regon ( 1, 1). Ths structure s a generalzaton of the AR(1) structure and useful n modelng data that are unequally spaced n tme. In addton, ths structure has not been mplemented n any of the currently avalable packages that mplement GEE.

4 4 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS 4. The tr-dagonal correlaton structure: Ths structure assumes that Corr(y j, y k ) = α for j k = 1 and s zero otherwse. The feasble regon for ths structure s ( 1/c m, 1/c m ), where ( ) π[nm 1] c m = 2 sn 2[n m + 1] and n m s the maxmum value of n over = 1, 2,..., m. Ths nterval s approxmately ( 1/2, 1/2) for large n and contans ( 1/2, 1/2) for all n. %QLS does not allow for applcaton of the ndependent structure (dentty matrx) because QLS s dentcal to GEE for ths structure. In addton, applcaton of the unstructured matrx s complex for QLS. In SAS, we therefore suggest applcaton of PROC GENMOD wth the repeated statement and the opton corr=nd for applcaton of the ndependent correlaton structure, or corr=un for applcaton of the unstructured correlaton matrx for GEE Estmatng equatons and the algorthm for quas-least squares QLS s a two-stage procedure for estmaton of the regresson parameter β and the correlaton parameter α. In stage one t alternates between updatng the GEE estmatng equaton for β and the estmatng equaton for α. After convergence n stage one, an updated estmate of α s obtaned by solvng the stage two estmatng equaton for α. A fnal estmate of β s then obtaned by solvng the GEE estmatng equaton for β. The estmatng equatons are as follows: ˆ The GEE estmatng equaton for β: n =1 ˆ The stage one estmatng equaton for α: ( ) µ A 1/2 R 1 (α)a 1/2 [y µ (β)] = 0 (1) β { n Z α =1 (β) R 1 (α) Z (β) where Z (β) = (z 1,..., z n ) and z j = (y j µ j )/h(µ j ). } = 0, (2) ˆ The stage two estmatng equaton for α (evaluated at the stage one estmate ˆα): n { R 1 (δ) trace R (α)} = 0. (3) δ δ=ˆα =1 Note that the stage one and two estmatng equatons (2) and (3) nvolve the frst dervatve of the nverse of R (α), whch may not be easly obtaned for some correlaton structures, e.g., the tr-dagonal structure. However, t can be easly shown that R 1 (α) α = R 1 (α) R (α) α R 1 (α) (4)

5 Journal of Statstcal Software 5 where R (δ) δ s computed by takng the dervatve of each element n R (α). For example, for the tr-dagonal structure, R (δ) δ s an n n matrx wth ones on the off-dagonal and zeros elsewhere,.e., the (j, k)th element of R (α) α s 1 f j k = 1 and s 0 otherwse. %QLS nvolves applcaton of the followng algorthm to obtan the estmates of β and α: 1. Ft a generalzed lnear model wth an approprate lnk and varance functon usng PROC GENMOD n SAS to obtan an ntal estmate for β. 2. In stage one of QLS, repeat the followng steps untl a pre-specfed convergence crteron s met for estmatng β and α. () Compute the Pearson resduals at the current estmate of β, where the jth Pearson resdual on subject s gven by z j = (y j û j )/h(û j ). () Compute the estmate of α by solvng the QLS stage one estmatng equaton (2) for α usng a pre-specfed workng correlaton structure. () Update the estmate of β by solvng the GEE estmatng equaton (1) for β at the current estmate of α. 3. After convergence n the estmates of β and α n stage one, obtan an updated estmate of α by solvng the stage two estmatng equaton (3) for α. Then obtan a fnal estmate of β by solvng the GEE estmatng for β that s evaluated at the stage two estmate of α Algebrac expressons of the estmatng equatons for β and α At each teraton, %QLS uses the Gauss-Newton method to compute the current estmate β usng the prevous estmates β and α as follows: { n ( ) ( ) } 1 β µ = β + A 1/2 R 1 (α)a 1/2 µ β β =1 { n ( ) } µ A 1/2 R 1 (α)a 1/2 [y µ (β)] β =1 Although (4) can be used to obtan solutons to the stage one and two estmatng equatons (2) and (3), explct expressons for α are preferred snce ther applcaton s computatonally more effcent. Except for the tr-dagonal structure, %QLS mplements explct expressons for α that were obtaned by solvng the stage one and two estmatng equatons (2) and (3) for α, for partcular workng structures. For the AR(1) structure wth unbalanced data, Shults and Chaganty (1998) proved that the feasble stage one estmate ˆα A-ONE can be expressed as: m n m (z j + z j 1 ) 2 n (z j + z j 1 ) 2 m n (z j z j 1 ) 2 ˆα A-ONE = =1 j=2 =1 j=2 2 m n =1 j=2 =1 j=2 z j z j 1,

6 6 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS whle Chaganty and Shults (1999) showed that the stage two estmate ˆα A-TWO s gven by ˆα A-TWO = 2ˆα A-ONE 1 + ˆα A-ONE 2. For the equcorrelated structure wth unbalanced data, Shults (1996) showed that the stage one estmatng equaton of α has a unque feasble soluton ˆα A-ONE gven by :n >1 Z Z :n >1 1 + α 2 (n 1) (1 + α(n 1)) 2 (Z (β) e ) 2 = 0 where I n s the dentty matrx and e s a n 1 column vector of ones. Shults and Morrow (2002) obtaned the stage two estmate ˆα E-TWO gven by n (n 1) ˆα E-ONE (ˆα E-ONE (n 2) + 2) (1 + ˆα(n 1)) 2 / n (n 1) ( 1 + ˆα E-ONE 2 (n 1) ) (1 + ˆα E-ONE (n 1)) 2. :n >1 m n e j α e j =1 j=2 :n >1 For the Markov structure wth unbalanced data, Shults (1996) obtaned the QLS stage one estmatng equaton for α as follows: [ α 2e j z j z,j 1 α e j ( ) ] zj 2 + z2,j 1 + z j z,j 1 (1 α 2e j) 2 = 0 where e j = t j t,j 1. The stage two estmatng equaton of the Markov structure (Chaganty and Shults 1999) s then gven by m n 2e j δ 2ej 1 α ej [ e j δ e j 1 + δ 3e j 1 ] (1 δ 2e j) 2 = 0. δ=ˆαm-one =1 j= Estmates of the covarance matrx %QLS provdes two types of estmated covarance matrces of the estmated regresson parameter, the model-based and robust sandwch-based estmates. The robust sandwch-based estmate of the covarance matrx s the default matrx. It s often preferred when there s any uncertanty n the choce of workng correlaton structure. However, the standard errors are not necessarly smaller for the sandwch covarance matrx. Therefore, applcaton of both the model based and sandwch based covarance matrces mght be consdered n an analyss. %QLS computes the model-based covarance matrx as follows: Ĉov M ( ˆβ) = ˆφ n =1 X A 1/2 R 1 (ˆα)A 1/2 X. where ˆφ s the estmate of the dsperson parameter wth or wthout the bas correcton gven by ˆφ BC = 1 n Z ( N p ˆβ) Z ( ˆβ), or ˆφ B = 1 n Z ( N ˆβ) Z ( ˆβ) =1 =1

7 Journal of Statstcal Software 7 respectvely, where p s the dmenson of the regresson parameter β and N = m =1 n. By default, %QLS provdes the bas corrected estmate of the dsperson parameter ˆφ BC. For the robust sandwch-based estmate of the covarance matrx, %QLS computes } Ĉov R ( ˆβ) = W 1 n { n =1 X A 1/2 R 1 (ˆα)Z ( ˆβ)Z 1 ( ˆβ)R (ˆα)A 1/2 X W 1 n. where W n = n =1 X A 1/2 R 1 (ˆα)A 1/2 X. %QLS also computes the (1 α)100% confdence nterval, and a p value for testng each ndvdual regresson parameter β j = 0, based on ether the model-based or the robust sandwchbased estmate of the covarance matrx of β Prentce boundary of the estmate of α for analyzng bnary data Consder longtudnal bnary measurements y 1,..., y n wth expected values E(y j ) = Pr(y j = 1) = p j, wth q j = Pr(y j = 0) = 1 p j, and the correlaton between measurements y j and y k denoted by Corr(y j, y k ). An mportant feature of correlated bnary data s that the p j, q j and Corr(y j, y k ) completely determne the bvarate dstrbuton of y j and y k because the par-wse probabltes pr(y j, y k ) can be expressed as Pr(y j, y k ) = p y j j q1 y j j p y k k q1 y k k { 1 + Corr(y j, y k ) (y } j p j )(y k p k ) (p j p k q j q k ) 1/2. (5) Prentce (1988) ponted out that the probabltes n (5) wll be non-negatve,.e., Pr(y j, y k ) 0, only f the correlatons satsfy the followng constrants that depend on the margnal means: for all = 1,..., n. { } { } pj p k qj q k pj q k qj p k max, Corr(y j, y k ) mn, j k q j q k p j p k j k q j p k p j q k A man problem wth volaton of the Prentce constrants s that ths leads to a lack of theoretcal justfcaton for the exstence of a vald jont probablty mass functon, gven the estmated parameter values of β and α. However, as Molenberghs and Verbeke (2005) ponted out, the estmate of the workng correlaton s merely a devce to provde consstent and asymptotcally normal pont estmates for the margnal regresson parameters whch should not the made a part of formal nference. Rochon (1998) also noted that volaton of the Prentce constrants appears to cause no dffculty n practce, although the potental for volaton should be taken nto account n the desgn phase of a study. For example, when conductng sample sze calculatons, values of α that satsfy the Prentce constrants should be consdered. Shults et al. (2009) also suggest that a severe volaton of bounds mght be used to remove a partcular workng structure from a lst of canddate workng structures. As noted n Secton 1, the exstng GEE and user-wrtten QLS software (n Stata, MATLAB, and R) do not provde the estmated Prentce constrants (6) for analyss of bnary data. %QLS, on the other hand, computes the estmated Prentce constrants for each correlaton structure, (6)

8 8 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS and alerts users to a potental problem by provdng a warnng message f the estmate of α does not fall wthn the nterval. 3. Lst of parameters n the macro A complete lst of the parameters n %QLS s as follows: %QLS(data, y, x, d, tme, lnk, corr, robust, dsperson, alpha, ntalout, stage1out, stage2out, cmatrx, reference, converge, maxter) where ˆ data s the name of the data set n the usual longtudnal data format to be read n PROC GENMOD. The data set must not contan any mssng values. ˆ y s the outcome varable. ˆ x are the predctors (covarates) n the regresson model. ˆ d s the ID varable. ˆ tme s the tme varable. ˆ lnk equals 1 for the dentty lnk, 2 for the logt lnk, and 3 for the log lnk (default s 1). ˆ corr equals 1 for the AR(1), 2 for the equcorrelated, 3 for the Markov, 4 for the tr-dagonal (default s 1). ˆ robust equals 1 for robust sandwch-based standard errors, 2 for model-based standard errors (default s 1). ˆ dsperson equals 1 for bas not corrected, 0 for bas-corrected (default s 1). ˆ alpha s the sgnfcance level to be used n testng each regresson coeffcent (default s 0.05). ˆ ntalout equals 1 creates a SAS permanent data set n the current work space for the ntal output, 0 otherwse (default s 0). ˆ stage1out equals 1 creates a SAS permanent data set n the current work space for the stage 1 output, 0 otherwse (default s 0). ˆ stage2out equals 1 creates a SAS permanent data set n the current work space for the stage 2 output, 0 otherwse (default s 0). ˆ cmatrx equals 1 creates a SAS permanent data set n the current work space for the stage 2 correlaton matrx, 0 otherwse (default s 0). ˆ reference equals 1 prnts out the reference nformaton, 0 otherwse (default s 0). ˆ converge s the convergence crteron for estmaton of β and of α (default s ).

9 Journal of Statstcal Software 9 ˆ maxter s the maxmum number of allowable teratons for estmaton of β and α (default s 100). Note that many of the parameters have default values, so that they do not have to be specfed. %QLS assumes the usual longtudnal data format to be read n PROC GENMOD wthout any mssng observaton contaned n the data. If there are mssng observatons n the data that are coded as mssng, these must be deleted pror to mplementaton of %QLS. Ths s equvalent to assumng that the observatons are mssng completely at random (MCAR), as n the usual GEE analyss mplemented by PROC GENMOD wth the repeated statement. 4. Clncal tral example Backer et al. (1996) reported a 12-week, randomzed, double-blnd, mult-center tral for the comparson between the standard oral drug (terbnafne 250mg daly) and the expermental oral drug (thertraconazole 200mg daly) n the treatment of a common toenal nfecton called dermatophyte toe onychomycoss (DTO) whch affects more than 2% of the Brtsh populaton (Roberts 1992). The data was also descrbed n Molenberghs and Verbeke (2005), s avalable along wth ths manuscrpt, and can also be downloaded from ~sratclf/qlsproject.html. A total of 189 patents were randomzed to each treatment group and followed over 12 weeks, wth measurements taken at baselne, and at months 1, 2, 3, 6, 9, and 12. The prmary outcome measure was the severty of the toe nal nfecton, that was defned as 1 f the nfecton was severe, and 0 otherwse. For the purpose of demonstraton, we frst consder a smple logstc regresson model for comparson of the tme-averaged treatment dfference between the standard treatment group and the expermental treatment group. The toenal data, toenal.txt, contans 4 varables: tme, treatment, y, and d, where tme s the tme varable, treatment s the treatment ndcator (1 for the standard arm, and 0 otherwse), y s the outcome varable (1 f the nfecton s severe, and 0 otherwse), and d s the ID number assgned to each patent. Let y j follow the Bernoull dstrbuton wth pr(y j = 1) = p j such that ( ) pj log = β 0 + β 1 treatment (7) 1 p j where treatment s the treatment ndcator that equals 1 f the th subject s assgned to the standard drug and 0 otherwse. One advantage of mplementng the model n (7) s that the upper lmt of the Prentce constrant for α s always equal to 1. In general, any model that nvolves covarates that do not vary wthn clusters wll have an upper Prentce boundary (for α) of 1, due the fact that the estmated probabltes p j wll not vary wthn subjects (clusters) when only cluster level covarates are consdered n the model. Before we demonstrate %QLS to ft the model (7), frst we assume that the data, s read nto the current SAS workspace, e.g., data toenal; nfle "D:\toenal.txt"; nput tme treatment y d; run; toenal.txt,

10 10 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS where we assume that the toenal.txt s stored n the D:\ drectory Applcaton of the AR(1) correlaton structure The followng codes can be used to analyze the toenal data usng the QLS regresson model (7) wth the AR1 structure: %QLS(data = toenal, y = y, x = treatment, d = d, tme = tme, lnk = 2, corr = 1); The estmated standard errors are the robust sandwch-based estmates that are set by default. The outputs from the code are as follows: Quas-Least Squares SAS Macro Verson 1.0 Regresson Analyss usng Quas-Least Squares (QLS) QLS Model Informaton Varance Functon Lnk Functon Dependent Varable Correlaton Structure : Bnomal : Logt : Y : AR(1) Number of Observaton Read : 1907 Number of Clusters : 294 Maxmum Cluster Sze : 7 Mnmum Cluster Sze : 1 Correlaton Matrx Dmenson : 7 Number of Dstnct Tme Ponts : 7 TIME Number of Events : 408 Number of Trals : 1907 Analyss of Intal Parameter Estmates Parameter Estmate Stand Err Z Pr> Z [95% Con. Interval] INTERCEPT TREATMENT %QLS s modelng the probablty that Y= page 1/

11 Journal of Statstcal Software 11 Correlaton converged after 1 teratons ( tolerance = 0 ) Reg. coeff. converged after 2 teratons ( tolerance = ) Analyss of Stage 1 QLS Parameter Estmates Parameter Estmate Stand Err Z Pr> Z [95% Con. Interval] INTERCEPT TREATMENT Stage 1 Correlaton Parameter Estmate Dsperson Parameter Estmate at Stage 1 1 Stage 1 Workng Correlaton Matrx page 2/ Correlaton converged after 1 teratons ( tolerance = 0 ) Reg. coeff. converged after 3 teratons ( tolerance = E-9 ) Analyss of Stage 2 QLS Parameter Estmates Parameter Estmate Stand Err Z Pr> Z [95% Con. Interval] INTERCEPT TREATMENT Prentce Boundary Stage 2 Correlaton Parameter Estmate

12 12 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS Dsperson Parameter Estmate at Stage 2 1 Stage 2 Workng Correlaton Matrx page 3/ The output of %QLS contans the model nformaton followed by the estmates of the stage one and two estmates of β and of α. As noted earler, the upper lmt of the Prentce nterval s equal to 1 n the above output. From the stage two output, the p value correspondng to the tme-averaged treatment effect s equal to 0.38, whch suggests that there s no sgnfcant tme-averaged treatment dfference between the standard drug versus the expermental drug Applcaton of the Markov correlaton structure Here we demonstrate applcaton of the Markov correlaton structure, whch s currently unavalable for GEE. Ths s mportant because the toenal data s unequally spaced n tme, e.g., the varable tme n ths data set ndcates the month of measurement (post baselne) and takes value n {0, 1, 2, 3, 6, 9, 12} for each subject. Therefore, the Markov correlaton structure s preferable for the analyss of ths tral. The followng code can be used to ft the model (7) wth the Markov correlaton structure: %QLS(data = toenal, y = y, x = treatment, d = d, tme = tme, lnk = 2, corr = 3); Here we omt the stage one output, and present the ntal and stage two outputs. Quas-Least Squares SAS Macro Verson 1.0 Regresson Analyss usng Quas-Least Squares (QLS) QLS Model Informaton Varance Functon Lnk Functon Dependent Varable : Bnomal : Logt : Y

13 Journal of Statstcal Software 13 Correlaton Structure : Markov Number of Observaton Read : 1907 Number of Clusters : 294 Maxmum Cluster Sze : 7 Mnmum Cluster Sze : 1 Correlaton Matrx Dmenson : 7 Number of Dstnct Tme Ponts : 7 TIME Number of Events : 408 Number of Trals : 1907 Analyss of Intal Parameter Estmates Parameter Estmate Stand Err Z Pr> Z [95% Con. Interval] INTERCEPT TREATMENT %QLS s modelng the probablty that Y= page 1/ Correlaton converged after 27 teratons ( tolerance = ) Reg. coeff. converged after 3 teratons ( tolerance = E-7 ) Analyss of Stage 2 QLS Parameter Estmates Parameter Estmate Stand Err Z Pr> Z [95% Con. Interval] INTERCEPT TREATMENT Prentce Boundary Stage 2 Correlaton Parameter Estmate Dsperson Parameter Estmate at Stage 2 1

14 14 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS Stage 2 Workng Correlaton Matrx page 3/ The results are smlar to those for the AR(1) structure, wth an estmated α n stage two of ˆα = 0.79 versus ˆα = 0.74 for the AR(1) structure. Further, the p value wth respect to the tme-averaged treatment effect s Hence, the same concluson follows as wth the AR(1) structure Applcaton of the equcorrelated and tr-dagonal structures Although the equcorrelated and tr-dagonal structures may not be the best canddate correlaton structures for the toenal data, we nclude the mplementaton of theses structures for the purpose of demonstraton. Here we only present the codes for fttng the model (7) wth the equcorrelated and tr-dagonal correlaton structures, but omt ther outputs. For the equcorrelated correlaton structure, we have %QLS(data = toenal, y = y, x = treatment, d = d, tme = tme, lnk = 2, corr = 2); For the tr-dagonal correlaton structure, we have %QLS(data = toenal, y = y, x = treatment, d = d, tme = tme, lnk = 2, corr = 4); 4.4. Volaton of the Prentce boundary Here we brefly demonstrate volaton of the Prentce constrants usng the toenal data. Consder the followng model for testng the treatment effect over tme: ( ) πj log = β 0 + β 1 (treatment ) + β 2 (tme j ) + β 3 (treatment tme j ) (8) 1 π j where treatment s the treatment ndcator that equals 1 f the th subject s assgned to the standard drug and 0 otherwse, tme j represents the tme of the measurement collected on subject at the jth measurement occason, and treatment tme j s the treatment by tme nteracton. To ft the model n (8) usng %QLS, a new varable correspondng to the nteracton term must be created frst, e.g.,

15 Journal of Statstcal Software 15 data toenal; nfle "D:\toenal.txt"; nput tme treatment y d; nteracton=treatment*tme; run; To ft the model (8) wth the AR(1) structure, we use %QLS(data = toenal, y = y, x = treatment tme nteracton, d = d, tme = tme, lnk = 2, corr = 1); Here we provde the ntal and stage two outputs for the AR(1) structure. Quas-Least Squares SAS Macro Verson 1.0 Regresson Analyss usng Quas-Least Squares (QLS) QLS Model Informaton Varance Functon Lnk Functon Dependent Varable Correlaton Structure : Bnomal : Logt : Y : AR(1) Number of Observaton Read : 1907 Number of Clusters : 294 Maxmum Cluster Sze : 7 Mnmum Cluster Sze : 1 Correlaton Matrx Dmenson : 7 Number of Dstnct Tme Ponts : 7 TIME Number of Events : 408 Number of Trals : 1907 Analyss of Intal Parameter Estmates Parameter Estmate Stand Err Z Pr> Z [95% Con. Interval] INTERCEPT TREATMENT TIME INTERACTION

16 16 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS %QLS s modelng the probablty that Y= page 1/ Correlaton converged after 1 teratons ( tolerance = 0 ) Reg. coeff. converged after 5 teratons ( tolerance = ) Analyss of Stage 2 QLS Parameter Estmates Parameter Estmate Stand Err Z Pr> Z [95% Con. Interval] INTERCEPT TREATMENT TIME INTERACTION Prentce Boundary Stage 2 Correlaton Parameter Estmate Dsperson Parameter Estmate at Stage 2 1 Stage 2 Workng Correlaton Matrx Warnng! Correlaton parameter estmate s not wthn the boundary. The exstence of a multvarate bnary dstrbuton s questonable page 3/ From the stage two output, the estmated stage two α s 0.71, whch exceeds the upper lmt (0.31) of the Prentce constrants. It s also mportant to note that although the results are not shown here, applcaton of GEE for the AR(1) structure would also result n a volaton of the Prentce constrants.

17 Journal of Statstcal Software 17 The above results suggest somethng dfferent than the tme-averaged model, whch s that there s a dfference n the lkelhood of hgh severty between the two treatment condtons. However, graphcal dsplays (not shown) suggest that the assumpton of lnearty n the logt s not approprate for these data. For an extensve dscusson of approaches for assessment of the lnearty n the logt assumpton, see Hlbe (2009). For demonstraton purposes, we now present a model that dd not volate the lnearty n the logt assumpton, and that also dd not result n a volaton of the Prentce bounds for α. Ths model contans ndcator varables for the second (1 month), thrd (2 month), ffth (6 month), and seventh (12 month) measurements on each subject, an ndcator varable for the standard treatment, and a vst seven (12 month) by treatment ndcator varable. (All other treatment by vst ndcator varables dd not dffer sgnfcantly from zero). The correspondng data set, toenal2.txt, can be also downloaded from html. Here we present the code, and the ntal and stage two outputs. Frst, we read n the data: data toenal2; nfle "D:\toenal2.txt" delmter="," ; nput y tme2 tme3 tme5 tme7 treatment tme7_trt d tme; run; Next, we ft the QLS model. %QLS(data = toenal2, y = y, x = tme2 tme3 tme5 tme7 treatment tme7_trt, d = d, tme = tme, lnk = 2, corr = 1); Quas-Least Squares SAS Macro Verson 1.0 Regresson Analyss usng Quas-Least Squares (QLS) QLS Model Informaton Varance Functon Lnk Functon Dependent Varable Correlaton Structure : Bnomal : Logt : Y : AR(1) Number of Observaton Read : 1907 Number of Clusters : 294 Maxmum Cluster Sze : 7 Mnmum Cluster Sze : 1 Correlaton Matrx Dmenson : 7 Number of Dstnct Tme Ponts : 7 TIME Number of Events : 408

18 18 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS Number of Trals : 1907 Analyss of Intal Parameter Estmates Parameter Estmate Stand Err Z Pr> Z [95% Con. Interval] INTERCEPT TIME TIME TIME TIME TREATMENT TIME7_TRT %QLS s modelng the probablty that Y= page 1/ Correlaton converged after 1 teratons ( tolerance = 0 ) Reg. coeff. converged after 8 teratons ( tolerance = ) Analyss of Stage 2 QLS Parameter Estmates Parameter Estmate Stand Err Z Pr> Z [95% Con. Interval] INTERCEPT TIME TIME TIME TIME TREATMENT TIME7_TRT Prentce Boundary Stage 2 Correlaton Parameter Estmate Dsperson Parameter Estmate at Stage 2 1

19 Journal of Statstcal Software 19 Stage 2 Workng Correlaton Matrx page 3/ For the above model, the Prentce constrants are not volated. In addton, the results seem more n agreement wth the tme-averaged model, whch also dd not dentfy a sgnfcant dfference between the two treatment condtons wth respect to severty of toenal nfecton. 5. Dscusson %QLS can ft a model to longtudnal data usng the method of quas-least squares, and can consder data whch follows the normal, Bernoull, or Posson dstrbuton wth the AR(1), Markov, equcorrelated, and tr-dagonal structures. The syntax and the output of %QLS are smlar to the exstng GEE procedures n SAS,.e., PROC GENMOD wth the repeated statement, that would be famlar to SAS users. %QLS assumes that there are no mssng observatons n the dataset. Hence, any observatons that are coded as mssng should be deleted pror to the mplementaton of the macro. As noted earler, ths s equvalent to assumng that the data s mssng completely at random (MCAR), whch s a typcal assumpton n a GEE analyss. Further updates of %QLS wll be made to allow for mplementaton of other structures that are not currently avalable for GEE, ncludng the famlal structure, and Kronecker product-based structures whch account for multple sources of correlaton, e.g., mult-outcome longtudnal data for whch the sources of correlatons come from wthn subjects over tme, and between multple outcomes. It wll also be of nterest to develop and compare exstng methods for selecton of a workng correlaton structure and for assessment of goodness of ft of QLS (and GEE) models. Acknowledgments Work on ths manuscrpt was supported by the NIH grant R01CA Longtudnal Analyss for Dverse Populatons. We are also extremely grateful to the assocate edtor and referee who revewed our manuscrpt and software and provded very helpful suggestons for mprovement. We also thank edtor Achm Zeles for hs techncal suggestons.

20 20 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS References Backer MD, Keyser PD, Vroey CD, Lesaffre E (1996). A 12-Week Treatment for Dermatophyte Toe Onychomycoss Terbnafne 250mg/day vs. Itraconazole 200mg/day n a Double-Blnd Comparatve Tral. Brtsh Journal of Dermatology, 134, Chaganty NR (1997). An Alternatve Approach to the Analyss of Longtudnal Data Va Generalzed Estmatng Equatons. Journal of Statstcal Plannng and Inference, 63, Chaganty NR, Nak DN (2002). Analyss Of Multvarate Longtudnal Data Usng Quas- Least Squares. Journal of Statstcal Plannng and Inference, 103(1-2), Chaganty NR, Shults J (1999). On Elmnatng the Asymptotc Bas n the Quas-Least Squares Estmate of the Correlaton Parameter. Journal of Statstcal Plannng and Inference, 76, Crowder M (1995). On the Use of a Workng Correlaton Matrx n Usng Generalsed Lnear Models for Repeated Measures. Bometrka, 82, Halekoh U, Højsgaard S, Yan J (2006). The R Package geepack for Generalzed Estmatng Equatons. Journal of Statstcal Software, 15(2), URL org/v15/02/. Hardn J, Hlbe JM (2002). Generalzed Estmatng Equatons. Chapman & Hall/CRC Press, Boca Raton. Hlbe JM (2009). Logstc Regresson Models. Chapman & Hall/CRC Press, Boca Raton. Lang KY, Zeger SL (1986). Longtudnal Data Analyss Usng Generalzed Lnear Models. Bometrka, 73, Molenberghs G, Verbeke G (2005). Models for Dscrete Longtudnal Data. Sprnger-Verlag, New York. Prentce RL (1988). Correlated Bnary Regresson wth Covarates Specfc to Each Bnary Observaton. Bometrcs, 44(4), Ratclffe S, Shults J (2008). GEEQBOX: A MATLAB Toolbox for Generalzed Estmatng Equatons and Quas-Least Squares. Journal of Statstcal Software, 25(14), URL Roberts D (1992). Prevalence of Dermatophyte Onychomycoss n the Unted Kngdom: Results of an Omnbus Survey. Brtsh Journal of Dermatology, 126, Supplement 39. Rochon J (1998). Applcaton of GEE Procedures for Sample Sze Calculatons n Repeated Measures Experments. Statstcs n Medcne, 17(14), SAS Insttute Inc (2003). The SAS System, Verson 9.1. Cary, NC. URL com/.

21 Journal of Statstcal Software 21 Shults J (1996). The Analyss of Unbalanced and Unequally Spaced Longtudnal Data Usng Quas-Least Squares. Ph.D. thess, Department of Mathematcs and Statstcs, Old Domnon Unversty, Norfolk, Vrgna. Shults J, Chaganty NR (1998). Analyss of Serally Correlated Data Usng Quas-Least Squares. Bometrcs, 54(4), Shults J, Mazurck C, Lands R (2006). Analyss of Repeated Bouts of Measurements n the Framework of Generalzed Estmatng Equatons. Statstcs n Medcne, 25(23), Shults J, Morrow A (2002). Use of Quas-Least Squares to Adjust for Two Levels of Correlaton. Bometrcs, 58, Shults J, Ratclffe S (2009). Analyss of Mult-Level Correlated Data n the Framework of Generalzed Estmatng Equatons va xtmultcorr Procedures n Stata and qls Functons n MATLAB. Statstcs and Its Interface, 2, Shults J, Ratclffe S, Leonard M (2007). Improved Generalzed Estmatng Equaton Analyss va xtqls for Implementaton of Quas-Least Squares n Stata. Stata Journal, 7(2), Shults J, Sun W, Tu X, Km H, Amsterdam J, Hlbe JM, Ten-Have T (2009). A Comparson of Several Approaches for Choosng Between Workng Correlaton Structures n Generalzed Estmatng Equaton Analyss of Longtudnal Bnary Data. Statstcs n Medcne, 28(18), Shults J, Whtt M, Kumanyka S (2004). Analyss of Data wth Multple Sources of Correlaton n the Framework of Generalzed Estmatng Equatons. Statstcs n Medcne, 23(20), StataCorp (2003). Stata Statstcal Software: Release 8. StataCorp LP, College Staton, TX. URL Xe J, Shults J (2009). Implementaton of Quas-Least Squares wth the R Package qlspack. Workng Paper 32, UPenn Bostatstcs Workng Papers. URL com/upennbostat/papers/art32. Afflaton: Han-Joo Km Department of Bostatstcs Forest Research Insttute, Inc. Fnancal Harborsde Center Plaza V Jersey Cty, NJ , Unted States of Amerca Telephone: +1/201/427/8356 E-mal: hanjookm@gmal.com

22 22 %QLS: Analyss of Correlated Data Usng Quas-Least Squares n SAS Justne Shults Department of Bostatstcs and Epdemology Center for Clncal Epdemology and Bostatstcs Unversty of Pennsylvana School of Medcne 423 Guardan Drve, 610 Blockley Hall Phladelpha, PA , Unted States of Amerca Telephone: +1/215/573/6526 E-mal: jshults@mal.med.upenn.edu URL: Journal of Statstcal Software publshed by the Amercan Statstcal Assocaton Volume 35, Issue 2 Submtted: July 2010 Accepted:

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

Variance estimation for the instrumental variables approach to measurement error in generalized linear models

Variance estimation for the instrumental variables approach to measurement error in generalized linear models he Stata Journal (2003) 3, Number 4, pp. 342 350 Varance estmaton for the nstrumental varables approach to measurement error n generalzed lnear models James W. Hardn Arnold School of Publc Health Unversty

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6 PAR TESTS If a WEIGHT varable s specfed, t s used to replcate a case as many tmes as ndcated by the weght value rounded to the nearest nteger. If the workspace requrements are exceeded and samplng has

More information

Binomial Link Functions. Lori Murray, Phil Munz

Binomial Link Functions. Lori Murray, Phil Munz Bnomal Lnk Functons Lor Murray, Phl Munz Bnomal Lnk Functons Logt Lnk functon: ( p) p ln 1 p Probt Lnk functon: ( p) 1 ( p) Complentary Log Log functon: ( p) ln( ln(1 p)) Motvatng Example A researcher

More information

How To Calculate The Accountng Perod Of Nequalty

How To Calculate The Accountng Perod Of Nequalty Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

Statistical Methods to Develop Rating Models

Statistical Methods to Develop Rating Models Statstcal Methods to Develop Ratng Models [Evelyn Hayden and Danel Porath, Österrechsche Natonalbank and Unversty of Appled Scences at Manz] Source: The Basel II Rsk Parameters Estmaton, Valdaton, and

More information

CHAPTER 14 MORE ABOUT REGRESSION

CHAPTER 14 MORE ABOUT REGRESSION CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp

More information

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by 6 CHAPTER 8 COMPLEX VECTOR SPACES 5. Fnd the kernel of the lnear transformaton gven n Exercse 5. In Exercses 55 and 56, fnd the mage of v, for the ndcated composton, where and are gven by the followng

More information

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES The goal: to measure (determne) an unknown quantty x (the value of a RV X) Realsaton: n results: y 1, y 2,..., y j,..., y n, (the measured values of Y 1, Y 2,..., Y j,..., Y n ) every result s encumbered

More information

Regression Models for a Binary Response Using EXCEL and JMP

Regression Models for a Binary Response Using EXCEL and JMP SEMATECH 997 Statstcal Methods Symposum Austn Regresson Models for a Bnary Response Usng EXCEL and JMP Davd C. Trndade, Ph.D. STAT-TECH Consultng and Tranng n Appled Statstcs San Jose, CA Topcs Practcal

More information

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12 14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed

More information

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP) 6.3 / -- Communcaton Networks II (Görg) SS20 -- www.comnets.un-bremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL

More information

Chapter XX More advanced approaches to the analysis of survey data. Gad Nathan Hebrew University Jerusalem, Israel. Abstract

Chapter XX More advanced approaches to the analysis of survey data. Gad Nathan Hebrew University Jerusalem, Israel. Abstract Household Sample Surveys n Developng and Transton Countres Chapter More advanced approaches to the analyss of survey data Gad Nathan Hebrew Unversty Jerusalem, Israel Abstract In the present chapter, we

More information

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements Lecture 3 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there

More information

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation Exhaustve Regresson An Exploraton of Regresson-Based Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

STATISTICAL DATA ANALYSIS IN EXCEL

STATISTICAL DATA ANALYSIS IN EXCEL Mcroarray Center STATISTICAL DATA ANALYSIS IN EXCEL Lecture 6 Some Advanced Topcs Dr. Petr Nazarov 14-01-013 petr.nazarov@crp-sante.lu Statstcal data analyss n Ecel. 6. Some advanced topcs Correcton for

More information

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

BERNSTEIN POLYNOMIALS

BERNSTEIN POLYNOMIALS On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful

More information

1. Measuring association using correlation and regression

1. Measuring association using correlation and regression How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a

More information

SIMPLE LINEAR CORRELATION

SIMPLE LINEAR CORRELATION SIMPLE LINEAR CORRELATION Smple lnear correlaton s a measure of the degree to whch two varables vary together, or a measure of the ntensty of the assocaton between two varables. Correlaton often s abused.

More information

Evaluating the generalizability of an RCT using electronic health records data

Evaluating the generalizability of an RCT using electronic health records data Evaluatng the generalzablty of an RCT usng electronc health records data 3 nterestng questons Is our RCT representatve? How can we generalze RCT results? Can we use EHR* data as a control group? *) Electronc

More information

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Luby s Alg. for Maximal Independent Sets using Pairwise Independence Lecture Notes for Randomzed Algorthms Luby s Alg. for Maxmal Independent Sets usng Parwse Independence Last Updated by Erc Vgoda on February, 006 8. Maxmal Independent Sets For a graph G = (V, E), an ndependent

More information

L10: Linear discriminants analysis

L10: Linear discriminants analysis L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss

More information

The Current Employment Statistics (CES) survey,

The Current Employment Statistics (CES) survey, Busness Brths and Deaths Impact of busness brths and deaths n the payroll survey The CES probablty-based sample redesgn accounts for most busness brth employment through the mputaton of busness deaths,

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES In ths chapter, we wll learn how to descrbe the relatonshp between two quanttatve varables. Remember (from Chapter 2) that the terms quanttatve varable

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM

GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM GRAVITY DATA VALIDATION AND OUTLIER DETECTION USING L 1 -NORM BARRIOT Jean-Perre, SARRAILH Mchel BGI/CNES 18.av.E.Beln 31401 TOULOUSE Cedex 4 (France) Emal: jean-perre.barrot@cnes.fr 1/Introducton The

More information

IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS

IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS Chrs Deeley* Last revsed: September 22, 200 * Chrs Deeley s a Senor Lecturer n the School of Accountng, Charles Sturt Unversty,

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

PRACTICE 1: MUTUAL FUNDS EVALUATION USING MATLAB.

PRACTICE 1: MUTUAL FUNDS EVALUATION USING MATLAB. PRACTICE 1: MUTUAL FUNDS EVALUATION USING MATLAB. INDEX 1. Load data usng the Edtor wndow and m-fle 2. Learnng to save results from the Edtor wndow. 3. Computng the Sharpe Rato 4. Obtanng the Treynor Rato

More information

Economic Interpretation of Regression. Theory and Applications

Economic Interpretation of Regression. Theory and Applications Economc Interpretaton of Regresson Theor and Applcatons Classcal and Baesan Econometrc Methods Applcaton of mathematcal statstcs to economc data for emprcal support Economc theor postulates a qualtatve

More information

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts

Power-of-Two Policies for Single- Warehouse Multi-Retailer Inventory Systems with Order Frequency Discounts Power-of-wo Polces for Sngle- Warehouse Mult-Retaler Inventory Systems wth Order Frequency Dscounts José A. Ventura Pennsylvana State Unversty (USA) Yale. Herer echnon Israel Insttute of echnology (Israel)

More information

Sketching Sampled Data Streams

Sketching Sampled Data Streams Sketchng Sampled Data Streams Florn Rusu, Aln Dobra CISE Department Unversty of Florda Ganesvlle, FL, USA frusu@cse.ufl.edu adobra@cse.ufl.edu Abstract Samplng s used as a unversal method to reduce the

More information

PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIGIOUS AFFILIATION AND PARTICIPATION

PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIGIOUS AFFILIATION AND PARTICIPATION PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIIOUS AFFILIATION AND PARTICIPATION Danny Cohen-Zada Department of Economcs, Ben-uron Unversty, Beer-Sheva 84105, Israel Wllam Sander Department of Economcs, DePaul

More information

On fourth order simultaneously zero-finding method for multiple roots of complex polynomial equations 1

On fourth order simultaneously zero-finding method for multiple roots of complex polynomial equations 1 General Mathematcs Vol. 6, No. 3 (2008), 9 3 On fourth order smultaneously zero-fndng method for multple roots of complex polynomal euatons Nazr Ahmad Mr and Khald Ayub Abstract In ths paper, we present

More information

RECENT DEVELOPMENTS IN QUANTITATIVE COMPARATIVE METHODOLOGY:

RECENT DEVELOPMENTS IN QUANTITATIVE COMPARATIVE METHODOLOGY: Federco Podestà RECENT DEVELOPMENTS IN QUANTITATIVE COMPARATIVE METHODOLOGY: THE CASE OF POOLED TIME SERIES CROSS-SECTION ANALYSIS DSS PAPERS SOC 3-02 INDICE 1. Advantages and Dsadvantages of Pooled Analyss...

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.

More information

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification

Logistic Regression. Lecture 4: More classifiers and classes. Logistic regression. Adaboost. Optimization. Multiple class classification Lecture 4: More classfers and classes C4B Machne Learnng Hlary 20 A. Zsserman Logstc regresson Loss functons revsted Adaboost Loss functons revsted Optmzaton Multple class classfcaton Logstc Regresson

More information

Evaluating credit risk models: A critique and a new proposal

Evaluating credit risk models: A critique and a new proposal Evaluatng credt rsk models: A crtque and a new proposal Hergen Frerchs* Gunter Löffler Unversty of Frankfurt (Man) February 14, 2001 Abstract Evaluatng the qualty of credt portfolo rsk models s an mportant

More information

General Iteration Algorithm for Classification Ratemaking

General Iteration Algorithm for Classification Ratemaking General Iteraton Algorthm for Classfcaton Ratemakng by Luyang Fu and Cheng-sheng eter Wu ABSTRACT In ths study, we propose a flexble and comprehensve teraton algorthm called general teraton algorthm (GIA)

More information

MARKET SHARE CONSTRAINTS AND THE LOSS FUNCTION IN CHOICE BASED CONJOINT ANALYSIS

MARKET SHARE CONSTRAINTS AND THE LOSS FUNCTION IN CHOICE BASED CONJOINT ANALYSIS MARKET SHARE CONSTRAINTS AND THE LOSS FUNCTION IN CHOICE BASED CONJOINT ANALYSIS Tmothy J. Glbrde Assstant Professor of Marketng 315 Mendoza College of Busness Unversty of Notre Dame Notre Dame, IN 46556

More information

Estimation of Dispersion Parameters in GLMs with and without Random Effects

Estimation of Dispersion Parameters in GLMs with and without Random Effects Mathematcal Statstcs Stockholm Unversty Estmaton of Dsperson Parameters n GLMs wth and wthout Random Effects Meng Ruoyan Examensarbete 2004:5 Postal address: Mathematcal Statstcs Dept. of Mathematcs Stockholm

More information

1 Example 1: Axis-aligned rectangles

1 Example 1: Axis-aligned rectangles COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton

More information

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION

More information

) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance

) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance Calbraton Method Instances of the Cell class (one nstance for each FMS cell) contan ADC raw data and methods assocated wth each partcular FMS cell. The calbraton method ncludes event selecton (Class Cell

More information

1 De nitions and Censoring

1 De nitions and Censoring De ntons and Censorng. Survval Analyss We begn by consderng smple analyses but we wll lead up to and take a look at regresson on explanatory factors., as n lnear regresson part A. The mportant d erence

More information

Statistical algorithms in Review Manager 5

Statistical algorithms in Review Manager 5 Statstcal algorthms n Reve Manager 5 Jonathan J Deeks and Julan PT Hggns on behalf of the Statstcal Methods Group of The Cochrane Collaboraton August 00 Data structure Consder a meta-analyss of k studes

More information

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008 Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn

More information

Diagnostic Tests of Cross Section Independence for Nonlinear Panel Data Models

Diagnostic Tests of Cross Section Independence for Nonlinear Panel Data Models DISCUSSION PAPER SERIES IZA DP No. 2756 Dagnostc ests of Cross Secton Independence for Nonlnear Panel Data Models Cheng Hsao M. Hashem Pesaran Andreas Pck Aprl 2007 Forschungsnsttut zur Zukunft der Arbet

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

International University of Japan Public Management & Policy Analysis Program

International University of Japan Public Management & Policy Analysis Program Internatonal Unversty of Japan Publc Management & Polcy Analyss Program Practcal Gudes To Panel Data Modelng: A Step by Step Analyss Usng Stata * Hun Myoung Park, Ph.D. kucc65@uj.ac.jp 1. Introducton.

More information

14.74 Lecture 5: Health (2)

14.74 Lecture 5: Health (2) 14.74 Lecture 5: Health (2) Esther Duflo February 17, 2004 1 Possble Interventons Last tme we dscussed possble nterventons. Let s take one: provdng ron supplements to people, for example. From the data,

More information

v a 1 b 1 i, a 2 b 2 i,..., a n b n i.

v a 1 b 1 i, a 2 b 2 i,..., a n b n i. SECTION 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS 455 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS All the vector spaces we have studed thus far n the text are real vector spaces snce the scalars are

More information

Learning from Large Distributed Data: A Scaling Down Sampling Scheme for Efficient Data Processing

Learning from Large Distributed Data: A Scaling Down Sampling Scheme for Efficient Data Processing Internatonal Journal of Machne Learnng and Computng, Vol. 4, No. 3, June 04 Learnng from Large Dstrbuted Data: A Scalng Down Samplng Scheme for Effcent Data Processng Che Ngufor and Janusz Wojtusak part

More information

Logical Development Of Vogel s Approximation Method (LD-VAM): An Approach To Find Basic Feasible Solution Of Transportation Problem

Logical Development Of Vogel s Approximation Method (LD-VAM): An Approach To Find Basic Feasible Solution Of Transportation Problem INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME, ISSUE, FEBRUARY ISSN 77-866 Logcal Development Of Vogel s Approxmaton Method (LD- An Approach To Fnd Basc Feasble Soluton Of Transportaton

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

Project Networks With Mixed-Time Constraints

Project Networks With Mixed-Time Constraints Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa

More information

Robust Design of Public Storage Warehouses. Yeming (Yale) Gong EMLYON Business School

Robust Design of Public Storage Warehouses. Yeming (Yale) Gong EMLYON Business School Robust Desgn of Publc Storage Warehouses Yemng (Yale) Gong EMLYON Busness School Rene de Koster Rotterdam school of management, Erasmus Unversty Abstract We apply robust optmzaton and revenue management

More information

Demographic and Health Surveys Methodology

Demographic and Health Surveys Methodology samplng and household lstng manual Demographc and Health Surveys Methodology Ths document s part of the Demographc and Health Survey s DHS Toolkt of methodology for the MEASURE DHS Phase III project, mplemented

More information

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background:

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background: SPEE Recommended Evaluaton Practce #6 efnton of eclne Curve Parameters Background: The producton hstores of ol and gas wells can be analyzed to estmate reserves and future ol and gas producton rates and

More information

Control Charts with Supplementary Runs Rules for Monitoring Bivariate Processes

Control Charts with Supplementary Runs Rules for Monitoring Bivariate Processes Control Charts wth Supplementary Runs Rules for Montorng varate Processes Marcela. G. Machado *, ntono F.. Costa * * Producton Department, Sao Paulo State Unversty, Campus of Guaratnguetá, 56-4 Guaratnguetá,

More information

Efficient Project Portfolio as a tool for Enterprise Risk Management

Efficient Project Portfolio as a tool for Enterprise Risk Management Effcent Proect Portfolo as a tool for Enterprse Rsk Management Valentn O. Nkonov Ural State Techncal Unversty Growth Traectory Consultng Company January 5, 27 Effcent Proect Portfolo as a tool for Enterprse

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

Product-Form Stationary Distributions for Deficiency Zero Chemical Reaction Networks

Product-Form Stationary Distributions for Deficiency Zero Chemical Reaction Networks Bulletn of Mathematcal Bology (21 DOI 1.17/s11538-1-9517-4 ORIGINAL ARTICLE Product-Form Statonary Dstrbutons for Defcency Zero Chemcal Reacton Networks Davd F. Anderson, Gheorghe Cracun, Thomas G. Kurtz

More information

Risk Model of Long-Term Production Scheduling in Open Pit Gold Mining

Risk Model of Long-Term Production Scheduling in Open Pit Gold Mining Rsk Model of Long-Term Producton Schedulng n Open Pt Gold Mnng R Halatchev 1 and P Lever 2 ABSTRACT Open pt gold mnng s an mportant sector of the Australan mnng ndustry. It uses large amounts of nvestments,

More information

Online Appendix for Forecasting the Equity Risk Premium: The Role of Technical Indicators

Online Appendix for Forecasting the Equity Risk Premium: The Role of Technical Indicators Onlne Appendx for Forecastng the Equty Rsk Premum: The Role of Techncal Indcators Chrstopher J. Neely Federal Reserve Bank of St. Lous neely@stls.frb.org Davd E. Rapach Sant Lous Unversty rapachde@slu.edu

More information

1.2 DISTRIBUTIONS FOR CATEGORICAL DATA

1.2 DISTRIBUTIONS FOR CATEGORICAL DATA DISTRIBUTIONS FOR CATEGORICAL DATA 5 present models for a categorcal response wth matched pars; these apply, for nstance, wth a categorcal response measured for the same subjects at two tmes. Chapter 11

More information

Implementation of Deutsch's Algorithm Using Mathcad

Implementation of Deutsch's Algorithm Using Mathcad Implementaton of Deutsch's Algorthm Usng Mathcad Frank Roux The followng s a Mathcad mplementaton of Davd Deutsch's quantum computer prototype as presented on pages - n "Machnes, Logc and Quantum Physcs"

More information

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,

More information

Quantization Effects in Digital Filters

Quantization Effects in Digital Filters Quantzaton Effects n Dgtal Flters Dstrbuton of Truncaton Errors In two's complement representaton an exact number would have nfntely many bts (n general). When we lmt the number of bts to some fnte value

More information

Single and multiple stage classifiers implementing logistic discrimination

Single and multiple stage classifiers implementing logistic discrimination Sngle and multple stage classfers mplementng logstc dscrmnaton Hélo Radke Bttencourt 1 Dens Alter de Olvera Moraes 2 Vctor Haertel 2 1 Pontfíca Unversdade Católca do Ro Grande do Sul - PUCRS Av. Ipranga,

More information

A 'Virtual Population' Approach To Small Area Estimation

A 'Virtual Population' Approach To Small Area Estimation A 'Vrtual Populaton' Approach To Small Area Estmaton Mchael P. Battagla 1, Martn R. Frankel 2, Machell Town 3 and Lna S. Balluz 3 1 Abt Assocates Inc., Cambrdge MA 02138 2 Baruch College, CUNY, New York

More information

Method for assessment of companies' credit rating (AJPES S.BON model) Short description of the methodology

Method for assessment of companies' credit rating (AJPES S.BON model) Short description of the methodology Method for assessment of companes' credt ratng (AJPES S.BON model) Short descrpton of the methodology Ljubljana, May 2011 ABSTRACT Assessng Slovenan companes' credt ratng scores usng the AJPES S.BON model

More information

Rate-Based Daily Arrival Process Models with Application to Call Centers

Rate-Based Daily Arrival Process Models with Application to Call Centers Submtted to Operatons Research manuscrpt (Please, provde the manuscrpt number!) Authors are encouraged to submt new papers to INFORMS journals by means of a style fle template, whch ncludes the journal

More information

Characterization of Assembly. Variation Analysis Methods. A Thesis. Presented to the. Department of Mechanical Engineering. Brigham Young University

Characterization of Assembly. Variation Analysis Methods. A Thesis. Presented to the. Department of Mechanical Engineering. Brigham Young University Characterzaton of Assembly Varaton Analyss Methods A Thess Presented to the Department of Mechancal Engneerng Brgham Young Unversty In Partal Fulfllment of the Requrements for the Degree Master of Scence

More information

Ring structure of splines on triangulations

Ring structure of splines on triangulations www.oeaw.ac.at Rng structure of splnes on trangulatons N. Vllamzar RICAM-Report 2014-48 www.rcam.oeaw.ac.at RING STRUCTURE OF SPLINES ON TRIANGULATIONS NELLY VILLAMIZAR Introducton For a trangulated regon

More information

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy

Course outline. Financial Time Series Analysis. Overview. Data analysis. Predictive signal. Trading strategy Fnancal Tme Seres Analyss Patrck McSharry patrck@mcsharry.net www.mcsharry.net Trnty Term 2014 Mathematcal Insttute Unversty of Oxford Course outlne 1. Data analyss, probablty, correlatons, vsualsaton

More information

Time Domain simulation of PD Propagation in XLPE Cables Considering Frequency Dependent Parameters

Time Domain simulation of PD Propagation in XLPE Cables Considering Frequency Dependent Parameters Internatonal Journal of Smart Grd and Clean Energy Tme Doman smulaton of PD Propagaton n XLPE Cables Consderng Frequency Dependent Parameters We Zhang a, Jan He b, Ln Tan b, Xuejun Lv b, Hong-Je L a *

More information

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College

Feature selection for intrusion detection. Slobodan Petrović NISlab, Gjøvik University College Feature selecton for ntruson detecton Slobodan Petrovć NISlab, Gjøvk Unversty College Contents The feature selecton problem Intruson detecton Traffc features relevant for IDS The CFS measure The mrmr measure

More information

Survival analysis methods in Insurance Applications in car insurance contracts

Survival analysis methods in Insurance Applications in car insurance contracts Survval analyss methods n Insurance Applcatons n car nsurance contracts Abder OULIDI 1 Jean-Mare MARION 2 Hervé GANACHAUD 3 Abstract In ths wor, we are nterested n survval models and ther applcatons on

More information

Adaptive Clinical Trials Incorporating Treatment Selection and Evaluation: Methodology and Applications in Multiple Sclerosis

Adaptive Clinical Trials Incorporating Treatment Selection and Evaluation: Methodology and Applications in Multiple Sclerosis Adaptve Clncal Trals Incorporatng Treatment electon and Evaluaton: Methodology and Applcatons n Multple cleross usan Todd, Tm Frede, Ngel tallard, Ncholas Parsons, Elsa Valdés-Márquez, Jeremy Chataway

More information

Credit Limit Optimization (CLO) for Credit Cards

Credit Limit Optimization (CLO) for Credit Cards Credt Lmt Optmzaton (CLO) for Credt Cards Vay S. Desa CSCC IX, Ednburgh September 8, 2005 Copyrght 2003, SAS Insttute Inc. All rghts reserved. SAS Propretary Agenda Background Tradtonal approaches to credt

More information

ADVERSE SELECTION IN INSURANCE MARKETS: POLICYHOLDER EVIDENCE FROM THE U.K. ANNUITY MARKET *

ADVERSE SELECTION IN INSURANCE MARKETS: POLICYHOLDER EVIDENCE FROM THE U.K. ANNUITY MARKET * ADVERSE SELECTION IN INSURANCE MARKETS: POLICYHOLDER EVIDENCE FROM THE U.K. ANNUITY MARKET * Amy Fnkelsten Harvard Unversty and NBER James Poterba MIT and NBER * We are grateful to Jeffrey Brown, Perre-Andre

More information

Linear regression analysis of censored medical costs

Linear regression analysis of censored medical costs Bostatstcs (2), 1, 1,pp. 35 47 Prnted n Great Brtan Lnear regresson analyss of censored medcal costs D. Y. LIN Department of Bostatstcs, Box 357232, Unversty of Washngton, Seattle, WA 98195, USA danyu@bostat.washngton.edu

More information

Transition Matrix Models of Consumer Credit Ratings

Transition Matrix Models of Consumer Credit Ratings Transton Matrx Models of Consumer Credt Ratngs Abstract Although the corporate credt rsk lterature has many studes modellng the change n the credt rsk of corporate bonds over tme, there s far less analyss

More information

Analysis of Premium Liabilities for Australian Lines of Business

Analysis of Premium Liabilities for Australian Lines of Business Summary of Analyss of Premum Labltes for Australan Lnes of Busness Emly Tao Honours Research Paper, The Unversty of Melbourne Emly Tao Acknowledgements I am grateful to the Australan Prudental Regulaton

More information

Lecture 5,6 Linear Methods for Classification. Summary

Lecture 5,6 Linear Methods for Classification. Summary Lecture 5,6 Lnear Methods for Classfcaton Rce ELEC 697 Farnaz Koushanfar Fall 2006 Summary Bayes Classfers Lnear Classfers Lnear regresson of an ndcator matrx Lnear dscrmnant analyss (LDA) Logstc regresson

More information