Package msma. January 1, 2016

Size: px
Start display at page:

Download "Package msma. January 1, 2016"

Transcription

1 Type Package Title Multiblock Sparse Multivariable Analysis Version 0.7 Date Author Atsushi Kawaguchi Package msma January 1, 2016 Maintainer Atsushi Kawaguchi Depends mvtnorm There are several functions to implement the method for analysis in a multiblock multivariable data. If the input is only a matrix, then the principal components analysis (PCA) is implemented. If the input is a list of matrices, then the multiblock PCA is implemented. If the input is two matrices for exploratory and objective variables, then the partial least squares (PLS) analysis is implemented. If the input is two list of matrices for exploratory and objective variables, then the multiblock PLS analysis is implemented. Moreover, if the extra outcome variable is specified, then the supervised version for the methods above is implemented. For each methods, the sparse modeling is also incorporated. Functions to select the number of components and the regularized parameters are also provided. License GPL (>= 2) NeedsCompilation no Repository CRAN Date/Publication :47:02 R topics documented: msma-package cvmsma msma ncompsearch predict.msma regparasearch simdata summary.msma Index 13 1

2 2 cvmsma msma-package Multiblock Sparse Multivariable Analysis Package A Package for implementation of multiblock multivariable data analysis. Author(s) Atsushi Kawaguchi. <kawa_a24@yahoo.co.jp> See Also msma cvmsma Cross-Validation cross-validated method to evaluate the fit of msma. cvmsma(, = NULL, Z = NULL, comp = 1, lambda, lambda = NULL, eta = 1, type = "lasso", in = NULL, in = NULL, mu = 0, mu = 0, nfold = 5, seed = 1) Z comp lambda lambda eta type a (list of) matrix, explanatory variable(s) which is required. a (list of) matrix, objective variable(s). This is optional. If no input for, then the PCA method is implemented. a vector, response variable(s). This is optional. The length is the number of subjects. If no input for Z, then the unsupervised PLS/PCA is implemented. numeric scalar for the number of components to be considered. numeric vector of regularized parameters for with length equal to the number of blocks. If omitted, no regularization is conducted. numeric vector of regularized parameters for with length equal to the number of blocks. If omitted, no regularization is conducted. numeric scalar, the parameter indexing the penalty family. This version has only the choice 1. a character, the penalty family. This version has only the choice "lasso".

3 msma 3 in in mu mu a (list of) numeric vector to specify the variables of which are always in the a (list of) numeric vector to specify the variables of which are always in the a numeric scalar for the weight of for the supervised. 0<=mu<=1. a numeric scalar for the weight of for the supervised. 0<=mu<=1. nfold number of folds - default is 5. seed number of seed for the random number. k-fold cross-validation for msma. The evaluation is based on the matrix element wise errors. Value err The mean cross-validated errors which has three elements consisting of the mean of errors for and, the errors for and for in the PLS and only the errors for in the PCA. Examples ##### data ##### tmpdata = simdata(n = 50, rho = 0.8, ps = c(10, 12, 15), ps = 20, seed=1) = tmpdata$; = tmpdata$ ##### One Component CV ##### cv1 = cvmsma(,, comp = 1, lambda=2, lambda=1:3, nfold=5, seed=1) cv1 ##### Two Component CV ##### cv2 = cvmsma(,, comp = 2, lambda=2, lambda=1:3, nfold=5, seed=1) cv2 msma Multiblock Sparse Multivariable Analysis This is a function for a matrix decomposition method incorporating sparse and supervised modeling for a multiblock multivariable data analysis

4 4 msma msma(,...) ## Default S3 method: msma(, = NULL, Z = NULL, comp = 2, lambda = NULL, lambda = NULL, eta = 1, type = "lasso", in = NULL, in = NULL, mu = 0, mu = 0, defmethod = "canonical", scaling = TRUE, verbose = FALSE,...) ## S3 method for class msma print(x,...) ## S3 method for class msma plot(x,...) a (list of) matrix, explanatory variable(s) which is required.... further arguments passed to or from other methods. Z comp lambda lambda eta type in in mu mu defmethod scaling verbose x a (list of) matrix, objective variable(s). This is optional. If no input for, then the PCA method is implemented. a vector, response variable(s). This is optional. The length is the number of subjects. If no input for Z, then the unsupervised PLS/PCA is implemented. numeric scalar for the number of components to be considered. numeric vector of regularized parameters for with length equal to the number of blocks. If omitted, no regularization is conducted. numeric vector of regularized parameters for with length equal to the number of blocks. If omitted, no regularization is conducted. numeric scalar, the parameter indexing the penalty family. This version has only the choice 1. a character, the penalty family. This version has only the choice "lasso". a (list of) numeric vector to specify the variables of which are always in the a (list of) numeric vector to specify the variables of which are always in the a numeric scalar for the weight of for the supervised. 0<=mu<=1. a numeric scalar for the weight of for the supervised. 0<=mu<=1. a character, the deflation method, this version has only the choice "canonical". a logical, whether the scaling data is done, the default is TRUE. information an object of class "msma", usually, a result of a call to msma

5 msma 5 Value msma requires at least one input as the matrix or the list. In this case, the (multiblock) principal components analysis is conducted. If is also specified, the partial least squares with as explanatory variables and as objective variables. This function scaled each data matrix to mean 0 and variance 1 in the default. The block structure can be represented as the list. If Z is also specified, the supervised version is implemented and the degree is controlled by mu or mu where 0<=mu<=1, 0<=mu<=1, and 0<=mu+mu<1. If the positive lambda or lambda is specified, the sparse estimation based on L1 penalty is implemented. dmode scale scale comp wb sb wb sb ss ws ss ws nzwb nzwb selectnames selectnames Which modes "PLS" or "PCA" Scaled which has a list form. Scaled which has a list form. Scaling information for. The means and standard deviations for each block of are returned. Scaling information for. The means and standard deviations for each block of are returned. the number of components block loading for. The list has same length as that of the input list (the number of blocks) and consists of the matrix with the number of variables in the row and the number of components in the column. block score for. The list has same length as that of the input list (the number of blocks) and consists of the matrix with the number of subjects in the row and the number of components in the column. block loading for. The list has same length as that of the input list (the number of blocks) and consists of the matrix with the number of variables in the row and the number of components in the column. block score for. The list has same length as that of the input list (the number of blocks) and consists of the matrix with the number of subjects in the row and the number of components in the column. super score for. The matrix has the number of subjects in the row and the number of components in the column. super loading for. The matrix has the number of blocks in the row and the number of components in the column. super score for. The matrix has the number of subjects in the row and the number of components in the column. super loading for. The matrix has the number of blocks in the row and the number of components in the column. number of nonzeros in block loading for number of nonzeros in block loading for names of selected variables for. This returns the names of names of selected variables for. This returns the names of

6 6 ncompsearch Examples ##### data ##### tmpdata = simdata(n = 50, rho = 0.8, ps = c(10, 12, 15), ps = 20, seed=1) = tmpdata$; = tmpdata$ ##### One Component ##### fit1 = msma(,, comp=1, lambda=2, lambda=1:3) fit1 ##### Two Component ##### fit2 = msma(,, comp=2, lambda=2, lambda=1:3) fit2 ##### Matrix data ##### sigma = matrix(0.8, 10, 10) diag(sigma) = 1 2 = rmvnorm(50, rep(0, 10), sigma) 2 = rmvnorm(50, rep(0, 10), sigma) fit3 = msma(2, 2, comp=1, lambda=2, lambda=2) fit3 ##### Sparse Principal Component Analysis ##### fit5 = msma(2, comp=5, lambda=2.5) summary(fit5) ncompsearch Search for Number of Components Determination for the number of components based on cross-validated method or the Bayesian information criterion (BIC) ncompsearch(, = NULL, Z = NULL, comps = 1:3, lambda = NULL, lambda = NULL, eta = 1, type = "lasso", in = NULL, in = NULL, mu = 0, mu = 0, nfold = 5, regpara = FALSE, maxrep = 3, method = c("bic", "CV")[1]) ## S3 method for class ncompsearch print(x,...) ## S3 method for class ncompsearch plot(x,...)

7 ncompsearch 7 Z comps lambda lambda eta type in in mu mu a (list of) matrix, explanatory variable(s) which is required. a (list of) matrix, objective variable(s). This is optional. If no input for, then the PCA method is implemented. a vector, response variable(s). This is optional. The length is the number of subjects. If no input for Z, then the unsupervised PLS/PCA is implemented. numeric vector for the candidates of the numbers of components to be selected. numeric vector of regularized parameters for with length equal to the number of blocks. If omitted, no regularization is conducted. numeric vector of regularized parameters for with length equal to the number of blocks. If omitted, no regularization is conducted. numeric scalar, the parameter indexing the penalty family. This version has only the choice 1. a character, the penalty family. This version has only the choice "lasso". a (list of) numeric vector to specify the variables of which are always in the a (list of) numeric vector to specify the variables of which are always in the a numeric scalar for the weight of for the supervised. 0<=mu<=1. a numeric scalar for the weight of for the supervised. 0<=mu<=1. nfold number of folds - default is 5. regpara maxrep method x logical, If TRUE, the regularized parameters search is also conducted simultaneously. numeric scalar for the number of iteration. a character, the evaluation method, "CV" for cross-validation based on matrix element-wise error, and "BIC" for Bayesian information criteria. The default is the BIC. an object of class "ncompsearch", usually, a result of a call to ncompsearch... further arguments passed to or from other methods. This function provide an implementation of a search the optimal number of components. Value comps mincriterion criterions optncomp numbers of components minimum criterion value criterion values optimal number of components with the minimum criteria value

8 8 predict.msma Examples ##### data ##### tmpdata = simdata(n = 50, rho = 0.8, ps = c(10, 12, 15), ps = 20, seed=1) = tmpdata$; = tmpdata$ ##### number of components search ##### ncomp1 = ncompsearch(,, comps = c(1, 5, 10*(1:5)), nfold=5) plot(ncomp1) predict.msma Prediction predict method for class "msma". ## S3 method for class msma predict(object, new, new = NULL,...) object new Value an object of class "msma", usually, a result of a call to msma a matrix in which to look for variables with which to predict required. new a matrix in which to look for variables with which to predict.... further arguments passed to or from other methods. This function provide the prediction from new data based on the msma fit and is mainly used in the cross-validation predicted predicted Examples ##### data ##### tmpdata = simdata(n = 50, rho = 0.8, ps = c(10, 12, 15), ps = 20, seed=1) = tmpdata$; = tmpdata$ ##### Two Component ##### fit2 = msma(,, comp=2, lambda=2, lambda=1:3)

9 regparasearch 9 summary(fit2) ##### Predict ##### test = predict(fit2, new=, new=) regparasearch Regularized Parameters Search Regularized parameters search method for "msma". regparasearch(, = NULL, Z = NULL, eta = 1, type = "lasso", in = NULL, in = NULL, mu = 0, mu = 0, comp = 1, nfold = 5, maxrep = 3, minpct = 0, maxpct = 1, method = c("bic", "CV")[1]) ## S3 method for class regparasearch print(x,...) Z eta type in in mu mu comp a (list of) matrix, explanatory variable(s) which is required. a (list of) matrix, objective variable(s). This is optional. If no input for, then the PCA method is implemented. a vector, response variable(s). This is optional. The length is the number of subjects. If no input for Z, then the unsupervised PLS/PCA is implemented. numeric scalar, the parameter indexing the penalty family. This version has only the choice 1. a character, the penalty family. This version has only the choice "lasso". a (list of) numeric vector to specify the variables of which are always in the a (list of) numeric vector to specify the variables of which are always in the a numeric scalar for the weight of for the supervised. 0<=mu<=1. a numeric scalar for the weight of for the supervised. 0<=mu<=1. numeric scalar for the number of components to be considered. nfold number of folds - default is 5. maxrep minpct maxpct numeric scalar for the number of iteration. percent of minimum candidate parameters. percent of maximum candidate parameters.

10 10 regparasearch method x a character, the evaluation method, "CV" for cross-validation based on matrix element-wise error, and "BIC" for Bayesian information criteria. The default is the BIC. an object of class "regparasearch", usually, a result of a call to regparasearch... further arguments passed to or from other methods. This is a function to search regularized parameters of sparseness lambda and lambda for msma. The initial range of candidates are computed based on the fit with the values of regularized parameters of 0. The binary search is conducted for the divided parameter range into two regions. The representative value for the region is a median and the optimal region is selected with the minimum criteria obtained from the fit with the value. The CV error or BIC can be used as criteria. The selected region is also divided into two region and the same process is iterated by maxrep times. Thus, the final median value in the selected region is set to be the optimal regularized parameter. The search is conducted with combinations of parameters for and. The range of candidates for regularized parameters can be restricted with the percentile of the limit (minimum or maximum) of the range. Value optlambda optlambda mincriterion criterions pararange Optimal parameters for Optimal parameters for Minimum criterion value All resulting criterion values in the process Range of candidates parameters Examples ##### data ##### tmpdata = simdata(n = 50, rho = 0.8, ps = c(10, 12, 15), ps = 20, seed=1) = tmpdata$; = tmpdata$ ##### Regularized parameters search ##### opt1 = regparasearch(,, comp=1, nfold=5, maxrep=2) opt1 fit4 = msma(,, comp=1, lambda=opt1$optlambda, lambda=opt1$optlambda) fit4 summary(fit4) ##### Restrict search range ##### opt2 = regparasearch(,, comp=1, nfold=5, maxrep=2, minpct=0.5) opt2

11 simdata 11 simdata Generate Test Data Sets This is a function for generating multiblock data based on the multivariable normal distribution simdata(n = 100, rho = 0.8, ps = c(100, 120, 150), ps = 500, seed = 1) n rho ps ps seed a numeric scalar, sample size. a numeric scalar, correlation coefficient for all matrices. a numeric vector, numbers of columns for. The length of vector corresponds to the number of blocks. a numeric vector, numbers of columns for. The length of vector corresponds to the number of blocks. a seed number for generating random numbers for the reproductivity and should be changed in the iterative study. The output is a list of matrics. Value Simulated which has a list form Simulated which has a list form summary.msma Summarizing Fits summary method for class "msma". ## S3 method for class msma summary(object,...) ## S3 method for class summary.msma print(x,...)

12 12 summary.msma object, x an object of class "msma", usually, a result of a call to msma... further arguments passed to or from other methods. This function provide the summary of results. Examples ##### data ##### tmpdata = simdata(n = 50, rho = 0.8, ps = c(10, 12, 15), ps = 20, seed=1) = tmpdata$; = tmpdata$ ##### One Component ##### fit1 = msma(,, comp=1, lambda=2, lambda=1:3) summary(fit1)

13 Index Topic documentation msma-package, 2 cvmsma, 2 msma, 2, 3, 4, 12 msma-package, 2 ncompsearch, 6 plot.msma (msma), 3 plot.ncompsearch (ncompsearch), 6 predict.msma, 8 print.msma (msma), 3 print.ncompsearch (ncompsearch), 6 print.regparasearch (regparasearch), 9 print.summary.msma (summary.msma), 11 regparasearch, 9 simdata, 11 summary.msma, 11 13

Package bigdata. R topics documented: February 19, 2015

Package bigdata. R topics documented: February 19, 2015 Type Package Title Big Data Analytics Version 0.1 Date 2011-02-12 Author Han Liu, Tuo Zhao Maintainer Han Liu Depends glmnet, Matrix, lattice, Package bigdata February 19, 2015 The

More information

Package MDM. February 19, 2015

Package MDM. February 19, 2015 Type Package Title Multinomial Diversity Model Version 1.3 Date 2013-06-28 Package MDM February 19, 2015 Author Glenn De'ath ; Code for mdm was adapted from multinom in the nnet package

More information

Package missforest. February 20, 2015

Package missforest. February 20, 2015 Type Package Package missforest February 20, 2015 Title Nonparametric Missing Value Imputation using Random Forest Version 1.4 Date 2013-12-31 Author Daniel J. Stekhoven Maintainer

More information

Package multivator. R topics documented: February 20, 2015. Type Package Title A multivariate emulator Version 1.1-4 Depends R(>= 2.10.

Package multivator. R topics documented: February 20, 2015. Type Package Title A multivariate emulator Version 1.1-4 Depends R(>= 2.10. Type Package Title A multivariate emulator Version 1.1-4 Depends R(>= 2.10.0) Package multivator February 20, 2015 Imports utils, emulator (>= 1.2-13), mvtnorm, methods Suggests abind Date 2011-07-28 Author

More information

Package MixGHD. June 26, 2015

Package MixGHD. June 26, 2015 Type Package Package MixGHD June 26, 2015 Title Model Based Clustering, Classification and Discriminant Analysis Using the Mixture of Generalized Hyperbolic Distributions Version 1.7 Date 2015-6-15 Author

More information

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Review Jeopardy. Blue vs. Orange. Review Jeopardy Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?

More information

Package metafuse. November 7, 2015

Package metafuse. November 7, 2015 Type Package Package metafuse November 7, 2015 Title Fused Lasso Approach in Regression Coefficient Clustering Version 1.0-1 Date 2015-11-06 Author Lu Tang, Peter X.K. Song Maintainer Lu Tang

More information

Package polynom. R topics documented: June 24, 2015. Version 1.3-8

Package polynom. R topics documented: June 24, 2015. Version 1.3-8 Version 1.3-8 Package polynom June 24, 2015 Title A Collection of Functions to Implement a Class for Univariate Polynomial Manipulations A collection of functions to implement a class for univariate polynomial

More information

Package empiricalfdr.deseq2

Package empiricalfdr.deseq2 Type Package Package empiricalfdr.deseq2 May 27, 2015 Title Simulation-Based False Discovery Rate in RNA-Seq Version 1.0.3 Date 2015-05-26 Author Mikhail V. Matz Maintainer Mikhail V. Matz

More information

Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression

Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression Saikat Maitra and Jun Yan Abstract: Dimension reduction is one of the major tasks for multivariate

More information

Package ATE. R topics documented: February 19, 2015. Type Package Title Inference for Average Treatment Effects using Covariate. balancing.

Package ATE. R topics documented: February 19, 2015. Type Package Title Inference for Average Treatment Effects using Covariate. balancing. Package ATE February 19, 2015 Type Package Title Inference for Average Treatment Effects using Covariate Balancing Version 0.2.0 Date 2015-02-16 Author Asad Haris and Gary Chan

More information

Package support.ces. R topics documented: August 27, 2015. Type Package

Package support.ces. R topics documented: August 27, 2015. Type Package Type Package Package support.ces August 27, 2015 Title Basic Functions for Supporting an Implementation of Choice Experiments Version 0.4-1 Date 2015-08-27 Author Hideo Aizaki Maintainer Hideo Aizaki

More information

Package retrosheet. April 13, 2015

Package retrosheet. April 13, 2015 Type Package Package retrosheet April 13, 2015 Title Import Professional Baseball Data from 'Retrosheet' Version 1.0.2 Date 2015-03-17 Maintainer Richard Scriven A collection of tools

More information

Package EstCRM. July 13, 2015

Package EstCRM. July 13, 2015 Version 1.4 Date 2015-7-11 Package EstCRM July 13, 2015 Title Calibrating Parameters for the Samejima's Continuous IRT Model Author Cengiz Zopluoglu Maintainer Cengiz Zopluoglu

More information

Package NHEMOtree. February 19, 2015

Package NHEMOtree. February 19, 2015 Type Package Package NHEMOtree February 19, 2015 Title Non-hierarchical evolutionary multi-objective tree learner to perform cost-sensitive classification Depends partykit, emoa, sets, rpart Version 1.0

More information

Package MFDA. R topics documented: July 2, 2014. Version 1.1-4 Date 2007-10-30 Title Model Based Functional Data Analysis

Package MFDA. R topics documented: July 2, 2014. Version 1.1-4 Date 2007-10-30 Title Model Based Functional Data Analysis Version 1.1-4 Date 2007-10-30 Title Model Based Functional Data Analysis Package MFDA July 2, 2014 Author Wenxuan Zhong , Ping Ma Maintainer Wenxuan Zhong

More information

Package PRISMA. February 15, 2013

Package PRISMA. February 15, 2013 Package PRISMA February 15, 2013 Type Package Title Protocol Inspection and State Machine Analysis Version 0.1-0 Date 2012-10-02 Depends R (>= 2.10), Matrix, gplots, ggplot2 Author Tammo Krueger, Nicole

More information

Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models.

Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models. Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models. Dr. Jon Starkweather, Research and Statistical Support consultant This month

More information

Package COSINE. February 19, 2015

Package COSINE. February 19, 2015 Type Package Title COndition SpecIfic sub-network Version 2.1 Date 2014-07-09 Author Package COSINE February 19, 2015 Maintainer Depends R (>= 3.1.0), MASS,genalg To identify

More information

Package GSA. R topics documented: February 19, 2015

Package GSA. R topics documented: February 19, 2015 Package GSA February 19, 2015 Title Gene set analysis Version 1.03 Author Brad Efron and R. Tibshirani Description Gene set analysis Maintainer Rob Tibshirani Dependencies impute

More information

Package bigrf. February 19, 2015

Package bigrf. February 19, 2015 Version 0.1-11 Date 2014-05-16 Package bigrf February 19, 2015 Title Big Random Forests: Classification and Regression Forests for Large Data Sets Maintainer Aloysius Lim OS_type

More information

Package SurvCorr. February 26, 2015

Package SurvCorr. February 26, 2015 Type Package Title Correlation of Bivariate Survival Times Version 1.0 Date 2015-02-25 Package SurvCorr February 26, 2015 Author Meinhard Ploner, Alexandra Kaider and Georg Heinze Maintainer Georg Heinze

More information

Package neuralnet. February 20, 2015

Package neuralnet. February 20, 2015 Type Package Title Training of neural networks Version 1.32 Date 2012-09-19 Package neuralnet February 20, 2015 Author Stefan Fritsch, Frauke Guenther , following earlier work

More information

Package TSfame. February 15, 2013

Package TSfame. February 15, 2013 Package TSfame February 15, 2013 Version 2012.8-1 Title TSdbi extensions for fame Description TSfame provides a fame interface for TSdbi. Comprehensive examples of all the TS* packages is provided in the

More information

Package dunn.test. January 6, 2016

Package dunn.test. January 6, 2016 Version 1.3.2 Date 2016-01-06 Package dunn.test January 6, 2016 Title Dunn's Test of Multiple Comparisons Using Rank Sums Author Alexis Dinno Maintainer Alexis Dinno

More information

Package gazepath. April 1, 2015

Package gazepath. April 1, 2015 Type Package Package gazepath April 1, 2015 Title Gazepath Transforms Eye-Tracking Data into Fixations and Saccades Version 1.0 Date 2015-03-03 Author Daan van Renswoude Maintainer Daan van Renswoude

More information

Package HHG. July 14, 2015

Package HHG. July 14, 2015 Type Package Package HHG July 14, 2015 Title Heller-Heller-Gorfine Tests of Independence and Equality of Distributions Version 1.5.1 Date 2015-07-13 Author Barak Brill & Shachar Kaufman, based in part

More information

Package plan. R topics documented: February 20, 2015

Package plan. R topics documented: February 20, 2015 Package plan February 20, 2015 Version 0.4-2 Date 2013-09-29 Title Tools for project planning Author Maintainer Depends R (>= 0.99) Supports the creation of burndown

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Statistical Analysis. NBAF-B Metabolomics Masterclass. Mark Viant

Statistical Analysis. NBAF-B Metabolomics Masterclass. Mark Viant Statistical Analysis NBAF-B Metabolomics Masterclass Mark Viant 1. Introduction 2. Univariate analysis Overview of lecture 3. Unsupervised multivariate analysis Principal components analysis (PCA) Interpreting

More information

Package support.ces. R topics documented: April 27, 2015. Type Package

Package support.ces. R topics documented: April 27, 2015. Type Package Type Package Package support.ces April 27, 2015 Title Basic Functions for Supporting an Implementation of Choice Experiments Version 0.3-3 Date 2015-04-27 Author Hideo Aizaki Maintainer Hideo Aizaki

More information

Package hoarder. June 30, 2015

Package hoarder. June 30, 2015 Type Package Title Information Retrieval for Genetic Datasets Version 0.1 Date 2015-06-29 Author [aut, cre], Anu Sironen [aut] Package hoarder June 30, 2015 Maintainer Depends

More information

Package smoothhr. November 9, 2015

Package smoothhr. November 9, 2015 Encoding UTF-8 Type Package Depends R (>= 2.12.0),survival,splines Package smoothhr November 9, 2015 Title Smooth Hazard Ratio Curves Taking a Reference Value Version 1.0.2 Date 2015-10-29 Author Artur

More information

Package survpresmooth

Package survpresmooth Package survpresmooth February 20, 2015 Type Package Title Presmoothed Estimation in Survival Analysis Version 1.1-8 Date 2013-08-30 Author Ignacio Lopez de Ullibarri and Maria Amalia Jacome Maintainer

More information

Partial Least Squares (PLS) Regression.

Partial Least Squares (PLS) Regression. Partial Least Squares (PLS) Regression. Hervé Abdi 1 The University of Texas at Dallas Introduction Pls regression is a recent technique that generalizes and combines features from principal component

More information

Lasso on Categorical Data

Lasso on Categorical Data Lasso on Categorical Data Yunjin Choi, Rina Park, Michael Seo December 14, 2012 1 Introduction In social science studies, the variables of interest are often categorical, such as race, gender, and nationality.

More information

Package hier.part. February 20, 2015. Index 11. Goodness of Fit Measures for a Regression Hierarchy

Package hier.part. February 20, 2015. Index 11. Goodness of Fit Measures for a Regression Hierarchy Version 1.0-4 Date 2013-01-07 Title Hierarchical Partitioning Author Chris Walsh and Ralph Mac Nally Depends gtools Package hier.part February 20, 2015 Variance partition of a multivariate data set Maintainer

More information

Package plsgenomics. May 11, 2015

Package plsgenomics. May 11, 2015 Package plsgenomics May 11, 2015 Version 1.3-1 Date 2015-05-05 Title PLS Analyses for Genomics Author Anne-Laure Boulesteix , Ghislain Durif ,

More information

Package SHELF. February 5, 2016

Package SHELF. February 5, 2016 Type Package Package SHELF February 5, 2016 Title Tools to Support the Sheffield Elicitation Framework (SHELF) Version 1.1.0 Date 2016-01-29 Author Jeremy Oakley Maintainer Jeremy Oakley

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Package hybridensemble

Package hybridensemble Type Package Package hybridensemble May 30, 2015 Title Build, Deploy and Evaluate Hybrid Ensembles Version 1.0.0 Date 2015-05-26 Imports randomforest, kernelfactory, ada, rpart, ROCR, nnet, e1071, NMOF,

More information

Package cpm. July 28, 2015

Package cpm. July 28, 2015 Package cpm July 28, 2015 Title Sequential and Batch Change Detection Using Parametric and Nonparametric Methods Version 2.2 Date 2015-07-09 Depends R (>= 2.15.0), methods Author Gordon J. Ross Maintainer

More information

Package PACVB. R topics documented: July 10, 2016. Type Package

Package PACVB. R topics documented: July 10, 2016. Type Package Type Package Package PACVB July 10, 2016 Title Variational Bayes (VB) Approximation of Gibbs Posteriors with Hinge Losses Version 1.1.1 Date 2016-01-29 Author James Ridgway Maintainer James Ridgway

More information

Statistical Machine Learning

Statistical Machine Learning Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes

More information

Nonparametric statistics and model selection

Nonparametric statistics and model selection Chapter 5 Nonparametric statistics and model selection In Chapter, we learned about the t-test and its variations. These were designed to compare sample means, and relied heavily on assumptions of normality.

More information

Package trimtrees. February 20, 2015

Package trimtrees. February 20, 2015 Package trimtrees February 20, 2015 Type Package Title Trimmed opinion pools of trees in a random forest Version 1.2 Date 2014-08-1 Depends R (>= 2.5.0),stats,randomForest,mlbench Author Yael Grushka-Cockayne,

More information

Data Mining - Evaluation of Classifiers

Data Mining - Evaluation of Classifiers Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010

More information

Introduction to Matrix Algebra

Introduction to Matrix Algebra Psychology 7291: Multivariate Statistics (Carey) 8/27/98 Matrix Algebra - 1 Introduction to Matrix Algebra Definitions: A matrix is a collection of numbers ordered by rows and columns. It is customary

More information

Package CRM. R topics documented: February 19, 2015

Package CRM. R topics documented: February 19, 2015 Package CRM February 19, 2015 Title Continual Reassessment Method (CRM) for Phase I Clinical Trials Version 1.1.1 Date 2012-2-29 Depends R (>= 2.10.0) Author Qianxing Mo Maintainer Qianxing Mo

More information

Package changepoint. R topics documented: November 9, 2015. Type Package Title Methods for Changepoint Detection Version 2.

Package changepoint. R topics documented: November 9, 2015. Type Package Title Methods for Changepoint Detection Version 2. Type Package Title Methods for Changepoint Detection Version 2.2 Date 2015-10-23 Package changepoint November 9, 2015 Maintainer Rebecca Killick Implements various mainstream and

More information

Package robfilter. R topics documented: February 20, 2015. Version 4.1 Date 2012-09-13 Title Robust Time Series Filters

Package robfilter. R topics documented: February 20, 2015. Version 4.1 Date 2012-09-13 Title Robust Time Series Filters Version 4.1 Date 2012-09-13 Title Robust Time Series Filters Package robfilter February 20, 2015 Author Roland Fried , Karen Schettlinger

More information

Principal Component Analysis

Principal Component Analysis Principal Component Analysis ERS70D George Fernandez INTRODUCTION Analysis of multivariate data plays a key role in data analysis. Multivariate data consists of many different attributes or variables recorded

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Package ROC632. February 19, 2015

Package ROC632. February 19, 2015 Type Package Package ROC632 February 19, 2015 Title Construction of diagnostic or prognostic scoring system and internal validation of its discriminative capacities based on ROC curve and 0.633+ boostrap

More information

W6.B.1. FAQs CS535 BIG DATA W6.B.3. 4. If the distance of the point is additionally less than the tight distance T 2, remove it from the original set

W6.B.1. FAQs CS535 BIG DATA W6.B.3. 4. If the distance of the point is additionally less than the tight distance T 2, remove it from the original set http://wwwcscolostateedu/~cs535 W6B W6B2 CS535 BIG DAA FAQs Please prepare for the last minute rush Store your output files safely Partial score will be given for the output from less than 50GB input Computer

More information

Package schwartz97. February 20, 2015

Package schwartz97. February 20, 2015 Type Package Version 0.0.6 Date 2011-12-18 Package schwartz97 February 20, 2015 Title A package on the Schwartz two-factor commodity model Author Philipp Erb, David Luethi, Juri Hinz, Simon Otziger Maintainer

More information

Package decompr. August 17, 2016

Package decompr. August 17, 2016 Version 4.5.0 Title Global-Value-Chain Decomposition Package decompr August 17, 2016 Two global-value-chain decompositions are implemented. Firstly, the Wang-Wei-Zhu (Wang, Wei, and Zhu, 2013) algorithm

More information

Section 1.7 22 Continued

Section 1.7 22 Continued Section 1.5 23 A homogeneous equation is always consistent. TRUE - The trivial solution is always a solution. The equation Ax = 0 gives an explicit descriptions of its solution set. FALSE - The equation

More information

Package CoImp. February 19, 2015

Package CoImp. February 19, 2015 Title Copula based imputation method Date 2014-03-01 Version 0.2-3 Package CoImp February 19, 2015 Author Francesca Marta Lilja Di Lascio, Simone Giannerini Depends R (>= 2.15.2), methods, copula Imports

More information

Package pesticides. February 20, 2015

Package pesticides. February 20, 2015 Type Package Package pesticides February 20, 2015 Title Analysis of single serving and composite pesticide residue measurements Version 0.1 Date 2010-11-17 Author David M Diez Maintainer David M Diez

More information

These axioms must hold for all vectors ū, v, and w in V and all scalars c and d.

These axioms must hold for all vectors ū, v, and w in V and all scalars c and d. DEFINITION: A vector space is a nonempty set V of objects, called vectors, on which are defined two operations, called addition and multiplication by scalars (real numbers), subject to the following axioms

More information

Package mcmcse. March 25, 2016

Package mcmcse. March 25, 2016 Version 1.2-1 Date 2016-03-24 Title Monte Carlo Standard Errors for MCMC Package mcmcse March 25, 2016 Author James M. Flegal , John Hughes and Dootika Vats

More information

Package samplesize4surveys

Package samplesize4surveys Type Package Package samplesize4surveys May 4, 2015 Title Sample Size Calculations for Complex Surveys Version 2.4.0.900 Date 2015-05-01 Author Hugo Andres Gutierrez Rojas Maintainer Hugo Andres Gutierrez

More information

Package bcrm. September 24, 2015

Package bcrm. September 24, 2015 Type Package Package bcrm September 24, 2015 Title Bayesian Continual Reassessment Method for Phase I Dose-Escalation Trials Version 0.4.5 Date 2015-09-23 Author Michael Sweeting Maintainer Michael Sweeting

More information

Methods for Finding Bases

Methods for Finding Bases Methods for Finding Bases Bases for the subspaces of a matrix Row-reduction methods can be used to find bases. Let us now look at an example illustrating how to obtain bases for the row space, null space,

More information

Partial Least Square Regression PLS-Regression

Partial Least Square Regression PLS-Regression Partial Least Square Regression Hervé Abdi 1 1 Overview PLS regression is a recent technique that generalizes and combines features from principal component analysis and multiple regression. Its goal is

More information

Notes on Applied Linear Regression

Notes on Applied Linear Regression Notes on Applied Linear Regression Jamie DeCoster Department of Social Psychology Free University Amsterdam Van der Boechorststraat 1 1081 BT Amsterdam The Netherlands phone: +31 (0)20 444-8935 email:

More information

Linear Algebra Review. Vectors

Linear Algebra Review. Vectors Linear Algebra Review By Tim K. Marks UCSD Borrows heavily from: Jana Kosecka kosecka@cs.gmu.edu http://cs.gmu.edu/~kosecka/cs682.html Virginia de Sa Cogsci 8F Linear Algebra review UCSD Vectors The length

More information

Stephen du Toit Mathilda du Toit Gerhard Mels Yan Cheng. LISREL for Windows: SIMPLIS Syntax Files

Stephen du Toit Mathilda du Toit Gerhard Mels Yan Cheng. LISREL for Windows: SIMPLIS Syntax Files Stephen du Toit Mathilda du Toit Gerhard Mels Yan Cheng LISREL for Windows: SIMPLIS Files Table of contents SIMPLIS SYNTAX FILES... 1 The structure of the SIMPLIS syntax file... 1 $CLUSTER command... 4

More information

MATH 423 Linear Algebra II Lecture 38: Generalized eigenvectors. Jordan canonical form (continued).

MATH 423 Linear Algebra II Lecture 38: Generalized eigenvectors. Jordan canonical form (continued). MATH 423 Linear Algebra II Lecture 38: Generalized eigenvectors Jordan canonical form (continued) Jordan canonical form A Jordan block is a square matrix of the form λ 1 0 0 0 0 λ 1 0 0 0 0 λ 0 0 J = 0

More information

QUALITY ENGINEERING PROGRAM

QUALITY ENGINEERING PROGRAM QUALITY ENGINEERING PROGRAM Production engineering deals with the practical engineering problems that occur in manufacturing planning, manufacturing processes and in the integration of the facilities and

More information

Package acrm. R topics documented: February 19, 2015

Package acrm. R topics documented: February 19, 2015 Package acrm February 19, 2015 Type Package Title Convenience functions for analytical Customer Relationship Management Version 0.1.1 Date 2014-03-28 Imports dummies, randomforest, kernelfactory, ada Author

More information

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection Chapter 311 Introduction Often, theory and experience give only general direction as to which of a pool of candidate variables (including transformed variables) should be included in the regression model.

More information

Multiple Choice: 2 points each

Multiple Choice: 2 points each MID TERM MSF 503 Modeling 1 Name: Answers go here! NEATNESS COUNTS!!! Multiple Choice: 2 points each 1. In Excel, the VLOOKUP function does what? Searches the first row of a range of cells, and then returns

More information

Time Series Analysis

Time Series Analysis Time Series Analysis hm@imm.dtu.dk Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby 1 Outline of the lecture Identification of univariate time series models, cont.:

More information

Package CIFsmry. July 10, 2016. Index 6

Package CIFsmry. July 10, 2016. Index 6 Type Package Package CIFsmry July 10, 2016 Title Weighted summary of cumulative incidence functions Version 1.0.1.1 Date 2013-10-10 Author Jianing Li Maintainer Jianing Li Depends R(>= 3.0.1)

More information

Package MBA. February 19, 2015. Index 7. Canopy LIDAR data

Package MBA. February 19, 2015. Index 7. Canopy LIDAR data Version 0.0-8 Date 2014-4-28 Title Multilevel B-spline Approximation Package MBA February 19, 2015 Author Andrew O. Finley , Sudipto Banerjee Maintainer Andrew

More information

CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.

CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES From Exploratory Factor Analysis Ledyard R Tucker and Robert C MacCallum 1997 180 CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES In

More information

Regularized Logistic Regression for Mind Reading with Parallel Validation

Regularized Logistic Regression for Mind Reading with Parallel Validation Regularized Logistic Regression for Mind Reading with Parallel Validation Heikki Huttunen, Jukka-Pekka Kauppi, Jussi Tohka Tampere University of Technology Department of Signal Processing Tampere, Finland

More information

Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree of PhD of Engineering in Informatics

Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree of PhD of Engineering in Informatics INTERNATIONAL BLACK SEA UNIVERSITY COMPUTER TECHNOLOGIES AND ENGINEERING FACULTY ELABORATION OF AN ALGORITHM OF DETECTING TESTS DIMENSIONALITY Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree

More information

Package SCperf. February 19, 2015

Package SCperf. February 19, 2015 Package SCperf February 19, 2015 Type Package Title Supply Chain Perform Version 1.0 Date 2012-01-22 Author Marlene Silva Marchena Maintainer The package implements different inventory models, the bullwhip

More information

Effective Linear Discriminant Analysis for High Dimensional, Low Sample Size Data

Effective Linear Discriminant Analysis for High Dimensional, Low Sample Size Data Effective Linear Discriant Analysis for High Dimensional, Low Sample Size Data Zhihua Qiao, Lan Zhou and Jianhua Z. Huang Abstract In the so-called high dimensional, low sample size (HDLSS) settings, LDA

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Exploratory Factor Analysis and Principal Components. Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016

Exploratory Factor Analysis and Principal Components. Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016 and Principal Components Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016 Agenda Brief History and Introductory Example Factor Model Factor Equation Estimation of Loadings

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

7 Time series analysis

7 Time series analysis 7 Time series analysis In Chapters 16, 17, 33 36 in Zuur, Ieno and Smith (2007), various time series techniques are discussed. Applying these methods in Brodgar is straightforward, and most choices are

More information

Orthogonal Diagonalization of Symmetric Matrices

Orthogonal Diagonalization of Symmetric Matrices MATH10212 Linear Algebra Brief lecture notes 57 Gram Schmidt Process enables us to find an orthogonal basis of a subspace. Let u 1,..., u k be a basis of a subspace V of R n. We begin the process of finding

More information

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012 Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

More information

Factor Analysis. Factor Analysis

Factor Analysis. Factor Analysis Factor Analysis Principal Components Analysis, e.g. of stock price movements, sometimes suggests that several variables may be responding to a small number of underlying forces. In the factor model, we

More information

Cluster Analysis using R

Cluster Analysis using R Cluster analysis or clustering is the task of assigning a set of objects into groups (called clusters) so that the objects in the same cluster are more similar (in some sense or another) to each other

More information

MONTE-CARLO SIMULATION OF AMERICAN OPTIONS WITH GPUS. Julien Demouth, NVIDIA

MONTE-CARLO SIMULATION OF AMERICAN OPTIONS WITH GPUS. Julien Demouth, NVIDIA MONTE-CARLO SIMULATION OF AMERICAN OPTIONS WITH GPUS Julien Demouth, NVIDIA STAC-A2 BENCHMARK STAC-A2 Benchmark Developed by banks Macro and micro, performance and accuracy Pricing and Greeks for American

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

Lecture 5: Singular Value Decomposition SVD (1)

Lecture 5: Singular Value Decomposition SVD (1) EEM3L1: Numerical and Analytical Techniques Lecture 5: Singular Value Decomposition SVD (1) EE3L1, slide 1, Version 4: 25-Sep-02 Motivation for SVD (1) SVD = Singular Value Decomposition Consider the system

More information

New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Introduction

New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Introduction Introduction New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Predictive analytics encompasses the body of statistical knowledge supporting the analysis of massive data sets.

More information

Measures of Central Tendency and Variability: Summarizing your Data for Others

Measures of Central Tendency and Variability: Summarizing your Data for Others Measures of Central Tendency and Variability: Summarizing your Data for Others 1 I. Measures of Central Tendency: -Allow us to summarize an entire data set with a single value (the midpoint). 1. Mode :

More information

Package benford.analysis

Package benford.analysis Type Package Package benford.analysis November 17, 2015 Title Benford Analysis for Data Validation and Forensic Analytics Version 0.1.3 Author Carlos Cinelli Maintainer Carlos Cinelli

More information

Graphical Modeling for Genomic Data

Graphical Modeling for Genomic Data Graphical Modeling for Genomic Data Carel F.W. Peeters cf.peeters@vumc.nl Joint work with: Wessel N. van Wieringen Mark A. van de Wiel Molecular Biostatistics Unit Dept. of Epidemiology & Biostatistics

More information

Programming Exercise 3: Multi-class Classification and Neural Networks

Programming Exercise 3: Multi-class Classification and Neural Networks Programming Exercise 3: Multi-class Classification and Neural Networks Machine Learning November 4, 2011 Introduction In this exercise, you will implement one-vs-all logistic regression and neural networks

More information

Chapter 15. Mixed Models. 15.1 Overview. A flexible approach to correlated data.

Chapter 15. Mixed Models. 15.1 Overview. A flexible approach to correlated data. Chapter 15 Mixed Models A flexible approach to correlated data. 15.1 Overview Correlated data arise frequently in statistical analyses. This may be due to grouping of subjects, e.g., students within classrooms,

More information

CDD user guide. PsN 4.4.8. Revised 2015-02-23

CDD user guide. PsN 4.4.8. Revised 2015-02-23 CDD user guide PsN 4.4.8 Revised 2015-02-23 1 Introduction The Case Deletions Diagnostics (CDD) algorithm is a tool primarily used to identify influential components of the dataset, usually individuals.

More information