Package metafuse. November 7, 2015
Type  Package
Title  Fused Lasso Approach in Regression Coefficient Clustering
Version
Date
Author  Lu Tang, Peter X.K. Song
Maintainer  Lu Tang
Description  For each covariate, cluster its coefficient effects across different data sets during data integration. Supports Gaussian, binomial and Poisson regression models.
License  GPL-2
Imports  glmnet, Matrix, MASS
Repository  CRAN
NeedsCompilation  no
Date/Publication  :24:15

R topics documented:

    metafuse-package
    datagenerator
    metafuse
    metafuse.l
    Index

metafuse-package    Fused Lasso Approach in Regression Coefficient Clustering

Description

For each covariate, cluster its coefficient effects across different data sets during data integration. Supports Gaussian, binomial and Poisson regression models.
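To make "clustering coefficient effects" concrete, the sketch below groups the study-specific coefficients of one covariate into clusters of equal values. It uses base R only; `cluster_coefs` is a hypothetical helper written for illustration and is not a function of metafuse, and the coefficient values are made up.

```r
# Hypothetical helper (not part of metafuse): label studies whose
# coefficients for one covariate are equal up to a tolerance.
cluster_coefs <- function(beta, tol = 1e-8) {
  ord <- order(beta)                    # studies sorted by coefficient value
  gaps <- diff(beta[ord]) > tol         # TRUE where the sorted value jumps
  labels <- integer(length(beta))
  labels[ord] <- cumsum(c(TRUE, gaps))  # cluster id, increasing with the value
  labels
}

# Made-up fused estimates of one covariate across K = 10 studies
beta_hat <- c(0, 0, 0, 0, 0.5, 0.5, 0.5, 1, 1, 1)
cluster_coefs(beta_hat)
# 1 1 1 1 2 2 2 3 3 3
```

Studies that share a label share one coefficient value for that covariate; a covariate with a single label is homogeneous across studies.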
Details

Very simple to use. Accepts X, y, and sid (study ID) for regression models.

Index of help topics:

    datagenerator       simulate data
    metafuse            fit a GLM with fusion penalty for data integration
    metafuse-package    Fused Lasso Approach in Regression Coefficient Clustering
    metafuse.l          fit a GLM with fusion penalty for data integration

Author(s)

Lu Tang, Peter X.K. Song
Maintainer: Lu Tang <lutang@umich.edu>

References

Lu Tang, Peter X.K. Song. Fused Lasso Approach in Regression Coefficients Clustering - Learning Parameter Heterogeneity in Data Integration. In preparation.

Fei Wang. Development of Joint Estimating Equation Approaches to Merging Clustered or Longitudinal Datasets from Multiple Biomedical Studies. PhD Thesis, The University of Michigan. (A Biometrics paper is tentatively accepted. We will revise this at a later time.)

Jiahua Chen and Zehua Chen. Extended Bayesian information criteria for model selection with large model spaces. Biometrika, 95(3).

Xin Gao and Peter X-K Song. Composite likelihood Bayesian information criteria for model selection in high-dimensional data. Journal of the American Statistical Association, 105(492).

Examples

    ########### Generate Data ###########
    n <- 200    # sample size in each study
    K <- 10     # number of studies
    p <- 3      # number of covariates in X (including intercept)
    N <- n*K    # total sample size

    # the coefficient matrix; use this to set the desired heterogeneity pattern (depends on p and K)
    beta0 <- matrix(c(0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,   # intercept
                      0.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,   # beta_1, etc.
                      0.0,0.0,0.0,0.0,0.5,0.5,0.5,1.0,1.0,1.0), K, p)

    # generate a data set, family=c("gaussian", "binomial", "poisson")
    data <- datagenerator(n=n, beta0=beta0, family="gaussian", seed=123)

    # prepare the input (y, X, studyid)
    y <- data$y
    sid <- data$group
    X <- data[,-c(1,ncol(data))]
    ########### metafuse runs ###########
    # fuse slopes of X1 (it is heterogeneous with 2 groups)
    metafuse(X=X, y=y, sid=sid, fuse.which=c(1), family="gaussian", intercept=TRUE, alpha=0)

    # fuse slopes of X2 (it is heterogeneous with 3 groups)
    metafuse(X=X, y=y, sid=sid, fuse.which=c(2), family="gaussian", intercept=TRUE, alpha=0)

    # fuse all three covariates
    metafuse(X=X, y=y, sid=sid, fuse.which=c(0,1,2), family="gaussian", intercept=TRUE, alpha=0)

    # fuse all three covariates, with sparsity penalty
    metafuse(X=X, y=y, sid=sid, fuse.which=c(0,1,2), family="gaussian", intercept=TRUE, alpha=1)

    # fit metafuse at a given lambda
    metafuse.l(X=X, y=y, sid=sid, fuse.which=c(0,1,2), family="gaussian", intercept=TRUE, alpha=1, lambda=0.5)

datagenerator    simulate data

Description

Simulate data for demonstration of flarcc.

Usage

    datagenerator(n = n, beta0 = beta0, family = "gaussian", seed = seed)

Arguments

    n        sample size for each study, a vector of length K, the number of studies; can also be a scalar, to specify equal sample sizes
    beta0    coefficient matrix, with dimension K * p, where K is the number of studies and p is the number of covariates
    family   "gaussian" for continuous response, "binomial" for binary response, "poisson" for count response
    seed     random seed for data generation

Details

These data sets are artificial, and are used to test out some features of flarcc.
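The following base R sketch shows one way data with a K * p coefficient matrix could be simulated for the Gaussian family. It is not the source of `datagenerator`: the function name `simulate_studies`, the standard normal covariates and the unit noise variance are all assumptions made for illustration. The output mimics the layout used in the examples (response `y` first, study ID `group` last).

```r
# Hypothetical stand-in for datagenerator(), Gaussian family only.
# beta0 is K x p; column 1 holds the study-specific intercepts.
simulate_studies <- function(n, beta0, seed = 1) {
  set.seed(seed)
  K <- nrow(beta0)
  p <- ncol(beta0)
  n <- rep(n, length.out = K)   # accept a scalar or a length-K vector
  pieces <- lapply(seq_len(K), function(k) {
    X <- matrix(rnorm(n[k] * (p - 1)), n[k], p - 1)  # covariates, no intercept column
    colnames(X) <- paste0("X", seq_len(p - 1))
    # study-specific linear predictor plus N(0, 1) noise (assumed)
    y <- drop(cbind(1, X) %*% beta0[k, ]) + rnorm(n[k])
    data.frame(y = y, X, group = k)
  })
  do.call(rbind, pieces)
}

K <- 10; p <- 3
beta0 <- matrix(c(rep(0, 10),                        # intercept
                  rep(c(0, 1), each = 5),            # beta_1: 2 groups
                  rep(c(0, 0.5, 1), c(4, 3, 3))),    # beta_2: 3 groups
                K, p)
sim <- simulate_studies(n = 200, beta0 = beta0, seed = 123)
dim(sim)            # 2000 rows; columns y, X1, X2, group
table(sim$group)    # 200 observations per study
```

Because `matrix()` fills column by column, each column of `beta0` holds one covariate's coefficients across the K studies, which is why the heterogeneity pattern is written one covariate at a time.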
Value

a simulated data frame will be returned, containing y, X, and study ID sid

Examples

    n <- 200    # sample size in each study
    K <- 10     # number of studies
    p <- 3      # number of covariates in X (including intercept)
    N <- n*K    # total sample size

    # the coefficient matrix; use this to set the desired heterogeneity pattern (depends on p and K)
    beta0 <- matrix(c(0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,   # intercept
                      0.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,   # beta_1, etc.
                      0.0,0.0,0.0,0.0,0.5,0.5,0.5,1.0,1.0,1.0), K, p)

    # generate a data set, family=c("gaussian", "binomial", "poisson")
    data <- datagenerator(n=n, beta0=beta0, family="gaussian", seed=123)

metafuse    fit a GLM with fusion penalty for data integration

Description

Fit a GLM with fusion penalty on coefficients within each covariate, and generate the solution path for model selection.

Usage

    metafuse(X = X, y = y, sid = sid, fuse.which = c(0:ncol(X)),
             family = "gaussian", intercept = TRUE, alpha = 0,
             criterion = "EBIC", verbose = TRUE, plots = TRUE,
             loglambda = TRUE)

Arguments

    X            a matrix (or vector) of predictor(s), with dimensions of N*p, where N is the total sample size of all studies
    y            a vector of response, with length N, the total sample size of all studies
    sid          study id, numbered from 1 to K
    fuse.which   a vector of a subset of integers from 0 to p, indicating which covariates to be considered for fusion; 0 corresponds to intercept
    family       "gaussian" for continuous response, "binomial" for binary response, "poisson" for count response
    intercept    if TRUE, intercept will be included in the model
    alpha        the ratio of sparsity penalty to fusion penalty, default is 0 (no penalty on sparsity)
    criterion    "AIC" for AIC, "BIC" for BIC, "EBIC" for extended BIC
    verbose      if TRUE, output fusion events and tuning parameter lambda
    plots        if TRUE, create plots of solution paths and clustering trees
    loglambda    if TRUE, lambda will be plotted on a log-10 transformed scale

Details

Adaptive lasso penalty is used. See Zou (2006) for detail.

Value

a list containing the following items will be returned:

    family      the model type
    criterion   model selection criterion used
    alpha       the ratio of sparsity penalty to fusion penalty
    if.fuse     whether the covariate is fused (1) or not (0)
    betahat     the estimated coefficients
    betainfo    additional information about the fit, including degrees of freedom, lambda optimal, lambda fuse, friction of fusion for each covariate

Examples

    n <- 200    # sample size in each study
    K <- 10     # number of studies
    p <- 3      # number of covariates in X (including intercept)
    N <- n*K    # total sample size

    # the coefficient matrix; use this to set the desired heterogeneity pattern (depends on p and K)
    beta0 <- matrix(c(0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,   # intercept
                      0.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,   # beta_1, etc.
                      0.0,0.0,0.0,0.0,0.5,0.5,0.5,1.0,1.0,1.0), K, p)

    # generate a data set, family=c("gaussian", "binomial", "poisson")
    data <- datagenerator(n=n, beta0=beta0, family="gaussian", seed=123)

    # prepare the input (y, X, studyid)
    y <- data$y
    sid <- data$group
    X <- data[,-c(1,ncol(data))]

    # fuse slopes of X1 (it is heterogeneous with 2 groups)
    metafuse(X=X, y=y, sid=sid, fuse.which=c(1), family="gaussian", intercept=TRUE, alpha=0)

    # fuse slopes of X2 (it is heterogeneous with 3 groups)
    metafuse(X=X, y=y, sid=sid, fuse.which=c(2), family="gaussian", intercept=TRUE, alpha=0)

    # fuse all three covariates
    metafuse(X=X, y=y, sid=sid, fuse.which=c(0,1,2), family="gaussian", intercept=TRUE, alpha=0)
    # fuse all three covariates, with sparsity penalty
    metafuse(X=X, y=y, sid=sid, fuse.which=c(0,1,2), family="gaussian", intercept=TRUE, alpha=1)

metafuse.l    fit a GLM with fusion penalty for data integration

Description

Fit a GLM with fusion penalty on coefficients within each covariate at a given lambda.

Usage

    metafuse.l(X = X, y = y, sid = sid, fuse.which = c(0:ncol(X)),
               family = "gaussian", intercept = TRUE, alpha = 0,
               lambda = lambda)

Arguments

    X            a matrix (or vector) of predictor(s), with dimensions of N*p, where N is the total sample size of all studies
    y            a vector of response, with length N, the total sample size of all studies
    sid          study id, numbered from 1 to K
    fuse.which   a vector of a subset of integers from 0 to p, indicating which covariates to be considered for fusion; 0 corresponds to intercept
    family       "gaussian" for continuous response, "binomial" for binary response, "poisson" for count response
    intercept    if TRUE, intercept will be included in the model
    alpha        the ratio of sparsity penalty to fusion penalty, default is 0 (no penalty on sparsity)
    lambda       tuning parameter for fusion penalty

Details

Adaptive lasso penalty is used. See Zou (2006) for detail.

Value

a list containing the following items will be returned:

    family     the model type
    alpha      the ratio of sparsity penalty to fusion penalty
    if.fuse    whether the covariate is fused (1) or not (0)
    betahat    the estimated coefficients
    betainfo   additional information about the fit, including degrees of freedom
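The roles of `lambda`, `alpha` and `fuse.which` can be illustrated with a simplified penalty evaluation. The sketch below is an assumption-laden illustration, not the package's internal code: it penalizes all pairwise differences between studies within each fused covariate, applies the sparsity term to the same covariates, and omits the adaptive weights mentioned in Details. The name `fusion_penalty` is hypothetical.

```r
# Hypothetical illustration of a fused-lasso-style penalty.
# B: K x p coefficient matrix (column 1 = intercept), as in the examples.
# fuse.which follows the documented convention: 0 = intercept, j = j-th covariate.
fusion_penalty <- function(B, fuse.which, lambda, alpha = 0) {
  fused_cols <- fuse.which + 1   # shift by one because 0 denotes the intercept
  pairwise <- sapply(fused_cols, function(j) {
    # sum of |beta_jk - beta_jk'| with each unordered pair of studies counted once
    sum(abs(outer(B[, j], B[, j], "-"))) / 2
  })
  fusion <- sum(pairwise)
  sparsity <- sum(abs(B[, fused_cols]))   # assumed: sparsity on the fused covariates
  lambda * (fusion + alpha * sparsity)    # alpha = sparsity-to-fusion ratio
}

K <- 10; p <- 3
beta0 <- matrix(c(rep(0, 10),
                  rep(c(0, 1), each = 5),
                  rep(c(0, 0.5, 1), c(4, 3, 3))), K, p)

fusion_penalty(beta0, fuse.which = c(1, 2), lambda = 0.5, alpha = 0)  # 23.75
fusion_penalty(beta0, fuse.which = c(1, 2), lambda = 0.5, alpha = 1)  # 28.5
fusion_penalty(beta0, fuse.which = 0, lambda = 0.5)                   # 0: intercepts are equal
```

The fusion term is zero exactly when all studies share one coefficient for every fused covariate, which is why a larger `lambda` pushes coefficients into fewer clusters, while `alpha > 0` additionally shrinks them toward zero.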
Examples

    n <- 200    # sample size in each study
    K <- 10     # number of studies
    p <- 3      # number of covariates in X (including intercept)
    N <- n*K    # total sample size

    # the coefficient matrix; use this to set the desired heterogeneity pattern (depends on p and K)
    beta0 <- matrix(c(0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,   # intercept
                      0.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,   # beta_1, etc.
                      0.0,0.0,0.0,0.0,0.5,0.5,0.5,1.0,1.0,1.0), K, p)

    # generate a data set, family=c("gaussian", "binomial", "poisson")
    data <- datagenerator(n=n, beta0=beta0, family="gaussian", seed=123)

    # prepare the input (y, X, studyid)
    y <- data$y
    sid <- data$group
    X <- data[,-c(1,ncol(data))]

    # fit metafuse at a given lambda
    metafuse.l(X=X, y=y, sid=sid, fuse.which=c(0,1,2), family="gaussian", intercept=TRUE, alpha=1, lambda=0.5)
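The `criterion` argument of `metafuse` accepts "AIC", "BIC" and "EBIC". As a rough guide to how these criteria differ, the sketch below computes them from a log-likelihood and a degrees-of-freedom count, using the extended BIC form of Chen and Chen cited in the references. How metafuse itself defines the number of candidate parameters `P` and the weight `gamma` is not stated in this manual, so both are assumptions here, as is the helper name `info_criterion`; the log-likelihood values are made up.

```r
# Hypothetical helper: model selection criteria from a fitted model summary.
# loglik: maximized log-likelihood; df: degrees of freedom of the fit;
# N: total sample size; P: number of candidate parameters; gamma: EBIC weight.
info_criterion <- function(loglik, df, N, P, criterion = "EBIC", gamma = 1) {
  switch(criterion,
    AIC  = -2 * loglik + 2 * df,
    BIC  = -2 * loglik + df * log(N),
    # EBIC adds a term for the number of models of size df among P parameters
    EBIC = -2 * loglik + df * log(N) + 2 * gamma * lchoose(P, df),
    stop("unknown criterion"))
}

# Two made-up candidate fits along a solution path: a smaller and a larger model
fits <- data.frame(loglik = c(-2850, -2846), df = c(6, 12))
ebic <- mapply(info_criterion, fits$loglik, fits$df,
               MoreArgs = list(N = 2000, P = 30, criterion = "EBIC"))
which.min(ebic)   # index of the candidate preferred by EBIC
```

With `gamma = 0` the EBIC value reduces to BIC; larger `gamma` penalizes model size more heavily, which favors solutions with more fusion.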
Index

    Topic data
        datagenerator
    Topic generator
        datagenerator
    Topic package
        metafuse-package

    datagenerator
    metafuse
    metafuse-package
    metafuse.l
More informationPackage lmertest. July 16, 2015
Type Package Title Tests in Linear Mixed Effects Models Version 2.0-29 Package lmertest July 16, 2015 Maintainer Alexandra Kuznetsova Depends R (>= 3.0.0), Matrix, stats, methods, lme4 (>=
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationPackage CoImp. February 19, 2015
Title Copula based imputation method Date 2014-03-01 Version 0.2-3 Package CoImp February 19, 2015 Author Francesca Marta Lilja Di Lascio, Simone Giannerini Depends R (>= 2.15.2), methods, copula Imports
More informationPackage SHELF. February 5, 2016
Type Package Package SHELF February 5, 2016 Title Tools to Support the Sheffield Elicitation Framework (SHELF) Version 1.1.0 Date 2016-01-29 Author Jeremy Oakley Maintainer Jeremy Oakley
More informationProbability Calculator
Chapter 95 Introduction Most statisticians have a set of probability tables that they refer to in doing their statistical wor. This procedure provides you with a set of electronic statistical tables that
More informationLogistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression
Logistic Regression Department of Statistics The Pennsylvania State University Email: jiali@stat.psu.edu Logistic Regression Preserve linear classification boundaries. By the Bayes rule: Ĝ(x) = arg max
More informationLogistic Regression (1/24/13)
STA63/CBB540: Statistical methods in computational biology Logistic Regression (/24/3) Lecturer: Barbara Engelhardt Scribe: Dinesh Manandhar Introduction Logistic regression is model for regression used
More informationPackage PACVB. R topics documented: July 10, 2016. Type Package
Type Package Package PACVB July 10, 2016 Title Variational Bayes (VB) Approximation of Gibbs Posteriors with Hinge Losses Version 1.1.1 Date 2016-01-29 Author James Ridgway Maintainer James Ridgway
More informationData Mining Lab 5: Introduction to Neural Networks
Data Mining Lab 5: Introduction to Neural Networks 1 Introduction In this lab we are going to have a look at some very basic neural networks on a new data set which relates various covariates about cheese
More informationFrom the help desk: Swamy s random-coefficients model
The Stata Journal (2003) 3, Number 3, pp. 302 308 From the help desk: Swamy s random-coefficients model Brian P. Poi Stata Corporation Abstract. This article discusses the Swamy (1970) random-coefficients
More informationPackage GSA. R topics documented: February 19, 2015
Package GSA February 19, 2015 Title Gene set analysis Version 1.03 Author Brad Efron and R. Tibshirani Description Gene set analysis Maintainer Rob Tibshirani Dependencies impute
More information6 Scalar, Stochastic, Discrete Dynamic Systems
47 6 Scalar, Stochastic, Discrete Dynamic Systems Consider modeling a population of sand-hill cranes in year n by the first-order, deterministic recurrence equation y(n + 1) = Ry(n) where R = 1 + r = 1
More information