Introduction Basics Simple Statistics More on S. Using R for Data Analysis and Graphics. 1. Introduction

Size: px
Start display at page:

Download "Introduction Basics Simple Statistics More on S. Using R for Data Analysis and Graphics. 1. Introduction"

Transcription

1 Using R for Data Analysis and Graphics 1. Introduction

2 What is R? 1.1 What is R? R is a software environment for statistical computing. R is based on commands. Implements the S language. There is an inofficial menu based interface called R-Commander. Drawbacks of menus: difficult to store what you do. A script of commands documents the analysis and allows for easy repetition with changed data, options,... R is free software. Supported operating systems: Linux, Mac OS X, Windows Language for exchanging statistical methods among researchers

3 Other Statistical Software 1.2 Other Statistical Software S-Plus: same programming language, commercial. Features a GUI. SPSS: good for standard procedures. SAS: all-rounder, good for large data sets, complicated analyses. Systat: Analysis of Variance, easy-to-use graphics system. Excel: Very limited collection of statistical methods. Good for getting the dataset ready. Matlab: Mathematical methods. Statistical methods limited. Similar paradigm, less flexible structure.

4 Introductory examples 1.3 Introductory examples A dataset that we have stored before in the system is called d.sport weit kugel hoch disc stab speer punkte OBRIEN BUSEMANN DVORAK : : : : : : : : : : : : : : : : : : : : : : : : CHMARA Draw a histogram of the results of variable kugel! We type hist(d.sport[,"kugel"]) The graphics window is opened automatically. We have called the S-function hist with argument d.sport[,"kugel"]. [,] is used to select the column.

5 Introductory examples 1.3 Introductory examples Scatter plot: type plot(d.sport[,"kugel"], d.sport[,"speer"]) First argument: x coordinates; second: y coordinates Many optional arguments! plot(d.sport[,"kugel"], d.sport[,"speer"], xlab="ball push", ylab="javelin", pch=7) Scatter plot matrix pairs(d.sport) Every column of d.sport is plotted against all other columns.

6 Introductory examples 1.3 Introductory examples Get a dataset from a text file and assign it to a name: d.sport <- read.table(...) " /WBL/sport.dat", header=true) Start browser of operating system to get a file: d.sport <- read.table(file...())

7 Using R 1.4 Using R Within a window running R, you will see the prompt >. You type a command and get a result and a new prompt. > hist(d.sport[,"kugel"]) > An incomplete statement can be continued on the next line > plot(d.sport[,"kugel"], + d.sport[,"speer"]) R stores objects in your workspace > d.sport <- read.table(...) Objects have names like a, fun, d.sport R provides a huge number of functions and other objects

8 Using R 1.4 Using R An R statement consists of a name of an object object is displayed > d.sport a call to a function graphical or numerical result > hist(d.sport[,"kugel"]) an assignment > a <- 2*pi/360 > mn <- mean(d.sport[,"kugel"]) stores the mean of d.sport[,"kugel"] under the name mn

9 Using R 1.4 Using R Some special and useful functions (more details later): documentation on the arguments etc. of a function (or dataset provided by the system): > help(hist) or?hist list all objects (names) in the workspace: > objects() leave the R session: > q() You get the question: Save workspace image? [y/n/c]: If you answer y, your objects will be available for your next session.

10 Scripts and Editors 1.5 Scripts and Editors Instead of typing commands into the R window, you can generate commands by an editor and then send them to the R window.... and later modify (correct) them and send again. Text Editors supporting R WinEdt: Emacs: ESS: Tinn-R:

11 Scripts and Editors 1.5 Scripts and Editors The Tinn-R Window

12 Scripts and Editors 1.5 Scripts and Editors Define Tinn-R Keyboard Shortcuts: Use dialog R / Hotkeys of R

13 Using R for Data Analysis and Graphics 2. Basics

14 Vectors 2.1 Vectors Functions and operations are usually applied to whole collections instead of single numbers, including vectors, matrices, data.frames ( d.sport ) Numbers can be combined into vectors using the function c() ( combine ) > t.v <- c(4,2,7,8,2) > t.a <- c(3.1, 5, -0.7, 0.9, 1.7) > t.u <- c(t.v,t.a) > t.u

15 Vectors 2.1 Vectors Generate a sequence of consecutive integers: > seq(1, 9) [1] Since sequences of integers are needed very often, this can be abbreviated to 1:9. Equally spaced numbers: Use argument by (default: 1) > seq(0, 3, by=0.5) [1] Repetition: > rep(0.7, 5) [1] > rep(c(1, 3, 5), length=8) [1]

16 Vectors 2.1 Vectors Basic functions for vectors: Call, Example length(t.v) sum(t.v) mean(t.v) var(t.v) range(t.v) Description Length of a vector, number of elements Sum of all elements arithmetic mean empirical variance range

17 Arithmetic 2.2 Arithmetic Simple arithmetic is as expected: > 2+5 [1] 7 Operations: + - * / ˆ (Exponentiation) These operations are applied to vectors elementwise. > (2:5) ˆ c(2,3,1,0) [1] Priorities as usual. Use parentheses! > (2:5) ˆ 2 [1]

18 Arithmetic 2.2 Arithmetic Elements are recycled: > (1:6)*(1:2) [1] > (1:5)-(0:1) [1] Warning message: longer object length is not a multiple of shorter object length in: (1:5) - (0:1) > (1:6)-(0:1) [1] Be careful, there is no warning in this case!

19 Character Vectors 2.3 Character Vectors Character strings: "abc", nut 999 Combine strings into vector of mode character: > t.names <- c("urs", "Anna", "Max", "Pia") Length of strings: > nchar(t.names) [1] String manipulations: > substring(t.names,3,4) [1] "s" "na" "x" "ud" > paste(t.names,"z.") [1] "Urs Z." "Anna Z." "Max Z." "Pia Z." > paste("x",1:3, sep="") [1] "X1" "X2" "X3"

20 Logical Vectors 2.4 Logical Vectors Logical vectors contain elements TRUE or FALSE > rep(c(true, FALSE), length=6) [1] TRUE FALSE TRUE FALSE TRUE FALSE often result from comparisons: < <= > >= ==!= > (1:5)>=3 [1] FALSE FALSE TRUE TRUE TRUE Logical operations: & (and), (or),! (not). > t.i <- (t.a>2)&(t.a<5) > t.i [1] TRUE FALSE FALSE FALSE FALSE

21 Selecting elements 2.5 Selecting elements Select elements from vectors or data.frames: [ ], [,] > t.v[c(1,3,5)] [1] > d.sport[c(1,3,5),1:3] weit kugel hoch OBRIEN DVORAK HAMALAINEN For data.frames, use names of columns or rows: > d.sport[c("obrien","dvorak"), c("kugel","speer","punkte")] kugel speer punkte OBRIEN DVORAK

22 Selecting elements 2.5 Selecting elements Using logical vectors: > t.a[c(true,false,true,true,false,false)] [1] > d.sport[d.sport[,"kugel"] > 16, c(2,7)] kugel punkte HAMALAINEN PENALVER SMITH

23 Matrices 2.6 Matrices Matrices are data tables like data.frames, but they can only contain data of a single type (numeric or character) Generate a matrix: > t.m1 <- matrix(1:10, nrow=2, ncol=5) > t.m1 [,1] [,2] [,3] [,4] [,5] [1,] [2,] > t.m2 <- matrix(1:10, ncol=2, + byrow=true) Transpose: t(t.m1) equals t.m2.

24 Matrices 2.6 Matrices Selection of elements as with data.frames: > t.m1[2,1:3] [1] Matrix multiplication: > t.m1 %*% t.m2 [,1] [,2] [1,] [2,] Vectors are treated as 1-row or 1-column matrices (mostly) Functions for linear algebra are available.

25 Using R for Data Analysis and Graphics 3. Simple Statistics

26 Simple Statistical Functions 3.1 Simple Statistical Functions Count number of cases with same value: > table(d.blast[,"loc"]) L1 L2 L3 L4 L5 L Cross-table > table(d.blast[,"loc"], + d.blast[,"loading"]) L L

27 Simple Statistical Functions 3.1 Simple Statistical Functions Estimation of a location parameter : mean(x) median(x) Variance: var(x) ; correlation: > cor(d.sport[,"kugel"], d.sport[,"speer"]) Correlation matrix: > t.cor <- cor(d.sport[,1:3]) > round(100*t.cor) weit kugel hoch weit kugel hoch

28 Hypothesis Tests 3.2 Hypothesis Tests Do two groups differ in their location? Wilcoxon s Rank Sum Test > t.y1 <- sleep[sleep[, group ]==1, extra ] > t.y2 <- sleep[sleep[, group ]==2, extra ] > wilcox.test(t.y1, t.y2, paired=false) Wilcoxon rank sum test with continuity correction data: t.y1 and t.y2 W = 25.5, p-value = alternative hyp.: true location shift not equal to 0

29 Hypothesis Tests 3.2 Hypothesis Tests More well-known: t-test. Assumes normal distributions. > t.test(t.y2,t.y1,alternative="two.sided", + paired=f) Welch Two Sample t-test data: t.y1 and t.y2 t = , df = , p-value = alternative hyp.: true diff. in means not equal to 0 95 percent confidence interval: sample estimates: mean of x mean of y Confidence interval!

30 Two Groups 3.3 Two Groups Plots for two samples of data. > boxplot(t.y1,t.y2,ylab="extra") > plot(sleep[,"group"],sleep[,"extra"], + xlab="group", ylab="extra")

31 Statistical Models, Formula Objects 3.4 Statistical Models, Formula Objects Statistics is concerned with relations between variables. Prototype: Relationship between target variable Y and explanatory variables X1, X2,... Regression. Symbolic notation of such a relation: Y X1 + X2 This symbolic notation is an S object (of class formula ) (The notation is also used in other statistical packages.) Use of formula : > plot(punkte kugel + speer, + data = d.sport) gives 2 scatterplots, punkte (vertical) against kugel and speer, respectively (horizontal axis).

32 Statistical Models, Formula Objects 3.4 Statistical Models, Formula Objects Grouping or nominal or categorical variables, e.g., location, type, group, species, plot,... Role in models different from continuous variables S must know! stores them as factor s Character variables enter data.frame as factor s Grouping var. with numerical labels can be declared as factor > sleep[, group ] <- + factor(sleep[, group ]) > plot(extra group, data = sleep) produces two box plots.

OVERVIEW OF R SOFTWARE AND PRACTICAL EXERCISE

OVERVIEW OF R SOFTWARE AND PRACTICAL EXERCISE OVERVIEW OF R SOFTWARE AND PRACTICAL EXERCISE Hukum Chandra Indian Agricultural Statistics Research Institute, New Delhi-110012 1. INTRODUCTION R is a free software environment for statistical computing

More information

Getting Started with R and RStudio 1

Getting Started with R and RStudio 1 Getting Started with R and RStudio 1 1 What is R? R is a system for statistical computation and graphics. It is the statistical system that is used in Mathematics 241, Engineering Statistics, for the following

More information

R: A self-learn tutorial

R: A self-learn tutorial R: A self-learn tutorial 1 Introduction R is a software language for carrying out complicated (and simple) statistical analyses. It includes routines for data summary and exploration, graphical presentation

More information

Basics of using the R software

Basics of using the R software Basics of using the R software Experimental and Statistical Methods in Biological Sciences I Juulia Suvilehto NBE 10.9.2015 Demo sessions Demo sessions (Thu 14.15 17.00) x 5 Demos, example code, and exercises

More information

A Short Guide to R with RStudio

A Short Guide to R with RStudio Short Guides to Microeconometrics Fall 2013 Prof. Dr. Kurt Schmidheiny Universität Basel A Short Guide to R with RStudio 1 Introduction 2 2 Installing R and RStudio 2 3 The RStudio Environment 2 4 Additions

More information

5 Correlation and Data Exploration

5 Correlation and Data Exploration 5 Correlation and Data Exploration Correlation In Unit 3, we did some correlation analyses of data from studies related to the acquisition order and acquisition difficulty of English morphemes by both

More information

Using R for Windows and Macintosh

Using R for Windows and Macintosh 2010 Using R for Windows and Macintosh R is the most commonly used statistical package among researchers in Statistics. It is freely distributed open source software. For detailed information about downloading

More information

Scatter Plots with Error Bars

Scatter Plots with Error Bars Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each

More information

Introduction to R June 2006

Introduction to R June 2006 Introduction to R Introduction...3 What is R?...3 Availability & Installation...3 Documentation and Learning Resources...3 The R Environment...4 The R Console...4 Understanding R Basics...5 Managing your

More information

Package dsstatsclient

Package dsstatsclient Maintainer Author Version 4.1.0 License GPL-3 Package dsstatsclient Title DataSHIELD client site stattistical functions August 20, 2015 DataSHIELD client site

More information

Psychology 205: Research Methods in Psychology

Psychology 205: Research Methods in Psychology Psychology 205: Research Methods in Psychology Using R to analyze the data for study 2 Department of Psychology Northwestern University Evanston, Illinois USA November, 2012 1 / 38 Outline 1 Getting ready

More information

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application

More information

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques

More information

Using SPSS, Chapter 2: Descriptive Statistics

Using SPSS, Chapter 2: Descriptive Statistics 1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

More information

Getting started manual

Getting started manual Getting started manual XLSTAT Getting started manual Addinsoft 1 Table of Contents Install XLSTAT and register a license key... 4 Install XLSTAT on Windows... 4 Verify that your Microsoft Excel is up-to-date...

More information

Introduction to Matlab

Introduction to Matlab Introduction to Matlab Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course provides

More information

BIO503 - Lecture 1 Introduction to the R language

BIO503 - Lecture 1 Introduction to the R language BIO503 - Lecture 1 Introduction to the R language Bio503 January 2008, Aedin Culhane. I R, S and S-plus What is R? ˆ R is an environment for data analysis and visualization ˆ R is an open source implementation

More information

IBM SPSS Direct Marketing 23

IBM SPSS Direct Marketing 23 IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release

More information

Quickstart for Desktop Version

Quickstart for Desktop Version Quickstart for Desktop Version What is GeoGebra? Dynamic Mathematics Software in one easy-to-use package For learning and teaching at all levels of education Joins interactive 2D and 3D geometry, algebra,

More information

An introduction to using Microsoft Excel for quantitative data analysis

An introduction to using Microsoft Excel for quantitative data analysis Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing

More information

Prof. Nicolai Meinshausen Regression FS 2014. R Exercises

Prof. Nicolai Meinshausen Regression FS 2014. R Exercises Prof. Nicolai Meinshausen Regression FS 2014 R Exercises 1. The goal of this exercise is to get acquainted with different abilities of the R statistical software. It is recommended to use the distributed

More information

SAS R IML (Introduction at the Master s Level)

SAS R IML (Introduction at the Master s Level) SAS R IML (Introduction at the Master s Level) Anton Bekkerman, Ph.D., Montana State University, Bozeman, MT ABSTRACT Most graduate-level statistics and econometrics programs require a more advanced knowledge

More information

IBM SPSS Direct Marketing 22

IBM SPSS Direct Marketing 22 IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release

More information

Data analysis and regression in Stata

Data analysis and regression in Stata Data analysis and regression in Stata This handout shows how the weekly beer sales series might be analyzed with Stata (the software package now used for teaching stats at Kellogg), for purposes of comparing

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

Introduction to R and UNIX Working with microarray data in a multi-user environment

Introduction to R and UNIX Working with microarray data in a multi-user environment Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Introduction to R and UNIX Working with microarray data in a multi-user environment Carsten Friis Media glna tnra GlnA TnrA C2 glnr C3 C5

More information

Minitab Session Commands

Minitab Session Commands APPENDIX Minitab Session Commands Session Commands and the Session Window Most functions in Minitab are accessible through menus, as well as through a command language called session commands. You can

More information

Installing R and the psych package

Installing R and the psych package Installing R and the psych package William Revelle Department of Psychology Northwestern University August 17, 2014 Contents 1 Overview of this and related documents 2 2 Install R and relevant packages

More information

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Excel Tutorial Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Working with Data Entering and Formatting Data Before entering data

More information

PTC Mathcad Prime 3.0 Keyboard Shortcuts

PTC Mathcad Prime 3.0 Keyboard Shortcuts PTC Mathcad Prime 3.0 Shortcuts Swedish s Regions Inserting Regions Operator/Command Description Shortcut Swedish Area Inserts a collapsible area you can collapse or expand to toggle the display of your

More information

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

There are six different windows that can be opened when using SPSS. The following will give a description of each of them. SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet

More information

Step 2: Save the file as an Excel file for future editing, adding more data, changing data, to preserve any formulas you were using, etc.

Step 2: Save the file as an Excel file for future editing, adding more data, changing data, to preserve any formulas you were using, etc. R is a free statistical software environment that can run a wide variety of tests based on different downloadable packages and can produce a wide variety of simple to more complex graphs based on the data

More information

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce

More information

Introduction Course in SPSS - Evening 1

Introduction Course in SPSS - Evening 1 ETH Zürich Seminar für Statistik Introduction Course in SPSS - Evening 1 Seminar für Statistik, ETH Zürich All data used during the course can be downloaded from the following ftp server: ftp://stat.ethz.ch/u/sfs/spsskurs/

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Module 2 Basic Data Management, Graphs, and Log-Files

Module 2 Basic Data Management, Graphs, and Log-Files AGRODEP Stata Training April 2013 Module 2 Basic Data Management, Graphs, and Log-Files Manuel Barron 1 and Pia Basurto 2 1 University of California, Berkeley, Department of Agricultural and Resource Economics

More information

Beginner s Matlab Tutorial

Beginner s Matlab Tutorial Christopher Lum lum@u.washington.edu Introduction Beginner s Matlab Tutorial This document is designed to act as a tutorial for an individual who has had no prior experience with Matlab. For any questions

More information

Microsoft Excel. Qi Wei

Microsoft Excel. Qi Wei Microsoft Excel Qi Wei Excel (Microsoft Office Excel) is a spreadsheet application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot

More information

SPSS 12 Data Analysis Basics Linda E. Lucek, Ed.D. LindaL@niu.edu 815-753-9516

SPSS 12 Data Analysis Basics Linda E. Lucek, Ed.D. LindaL@niu.edu 815-753-9516 SPSS 12 Data Analysis Basics Linda E. Lucek, Ed.D. LindaL@niu.edu 815-753-9516 Technical Advisory Group Customer Support Services Northern Illinois University 120 Swen Parson Hall DeKalb, IL 60115 SPSS

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

2+2 Just type and press enter and the answer comes up ans = 4

2+2 Just type and press enter and the answer comes up ans = 4 Demonstration Red text = commands entered in the command window Black text = Matlab responses Blue text = comments 2+2 Just type and press enter and the answer comes up 4 sin(4)^2.5728 The elementary functions

More information

G563 Quantitative Paleontology. SQL databases. An introduction. Department of Geological Sciences Indiana University. (c) 2012, P.

G563 Quantitative Paleontology. SQL databases. An introduction. Department of Geological Sciences Indiana University. (c) 2012, P. SQL databases An introduction AMP: Apache, mysql, PHP This installations installs the Apache webserver, the PHP scripting language, and the mysql database on your computer: Apache: runs in the background

More information

Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller

Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Getting to know the data An important first step before performing any kind of statistical analysis is to familiarize

More information

SPSS Tests for Versions 9 to 13

SPSS Tests for Versions 9 to 13 SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list

More information

An R Tutorial. 1. Starting Out

An R Tutorial. 1. Starting Out An R Tutorial 1. Starting Out R is an interactive environment for statistical computing and graphics. This tutorial will assume usage of R 2.0.0 on a PC. However, except in rare situations, these commands

More information

Statistical Data Mining. Practical Assignment 3 Discriminant Analysis and Decision Trees

Statistical Data Mining. Practical Assignment 3 Discriminant Analysis and Decision Trees Statistical Data Mining Practical Assignment 3 Discriminant Analysis and Decision Trees In this practical we discuss linear and quadratic discriminant analysis and tree-based classification techniques.

More information

GeoGebra Statistics and Probability

GeoGebra Statistics and Probability GeoGebra Statistics and Probability Project Maths Development Team 2013 www.projectmaths.ie Page 1 of 24 Index Activity Topic Page 1 Introduction GeoGebra Statistics 3 2 To calculate the Sum, Mean, Count,

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

AMATH 352 Lecture 3 MATLAB Tutorial Starting MATLAB Entering Variables

AMATH 352 Lecture 3 MATLAB Tutorial Starting MATLAB Entering Variables AMATH 352 Lecture 3 MATLAB Tutorial MATLAB (short for MATrix LABoratory) is a very useful piece of software for numerical analysis. It provides an environment for computation and the visualization. Learning

More information

Baseline Question Types and Report Outcomes November 7, 2014

Baseline Question Types and Report Outcomes November 7, 2014 Instructions Explanatory text about the survey itself, instructional text for the current page or section, or background or consent information. Adding text in the Header Text field will display text to

More information

Data exploration with Microsoft Excel: analysing more than one variable

Data exploration with Microsoft Excel: analysing more than one variable Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

MATLAB Basics MATLAB numbers and numeric formats

MATLAB Basics MATLAB numbers and numeric formats MATLAB Basics MATLAB numbers and numeric formats All numerical variables are stored in MATLAB in double precision floating-point form. (In fact it is possible to force some variables to be of other types

More information

R Language Fundamentals

R Language Fundamentals R Language Fundamentals Data Types and Basic Maniuplation Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Where did R come from? Overview Atomic Vectors Subsetting

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

How Does My TI-84 Do That

How Does My TI-84 Do That How Does My TI-84 Do That A guide to using the TI-84 for statistics Austin Peay State University Clarksville, Tennessee How Does My TI-84 Do That A guide to using the TI-84 for statistics Table of Contents

More information

Getting Started With R

Getting Started With R Getting Started With R 1 The purpose of this chapter is to introduce you to the R language and interpreter. After describing some of the basics of R, we will proceed to illustrate its use in a typical,

More information

Gamma Distribution Fitting

Gamma Distribution Fitting Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics

More information

Introduction to the TI-Nspire CX

Introduction to the TI-Nspire CX Introduction to the TI-Nspire CX Activity Overview: In this activity, you will become familiar with the layout of the TI-Nspire CX. Step 1: Locate the Touchpad. The Touchpad is used to navigate the cursor

More information

R with Rcmdr: BASIC INSTRUCTIONS

R with Rcmdr: BASIC INSTRUCTIONS R with Rcmdr: BASIC INSTRUCTIONS Contents 1 RUNNING & INSTALLATION R UNDER WINDOWS 2 1.1 Running R and Rcmdr from CD........................................ 2 1.2 Installing from CD...............................................

More information

Education & Training Plan. Accounting Math Professional Certificate Program with Externship

Education & Training Plan. Accounting Math Professional Certificate Program with Externship Office of Professional & Continuing Education 301 OD Smith Hall Auburn, AL 36849 http://www.auburn.edu/mycaa Contact: Shavon Williams 334-844-3108; szw0063@auburn.edu Auburn University is an equal opportunity

More information

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA We Can Early Learning Curriculum PreK Grades 8 12 INSIDE ALGEBRA, GRADES 8 12 CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA April 2016 www.voyagersopris.com Mathematical

More information

Dataframes. Lecture 8. Nicholas Christian BIOST 2094 Spring 2011

Dataframes. Lecture 8. Nicholas Christian BIOST 2094 Spring 2011 Dataframes Lecture 8 Nicholas Christian BIOST 2094 Spring 2011 Outline 1. Importing and exporting data 2. Tools for preparing and cleaning datasets Sorting Duplicates First entry Merging Reshaping Missing

More information

Using R for Linear Regression

Using R for Linear Regression Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional

More information

Figure 1. An embedded chart on a worksheet.

Figure 1. An embedded chart on a worksheet. 8. Excel Charts and Analysis ToolPak Charts, also known as graphs, have been an integral part of spreadsheets since the early days of Lotus 1-2-3. Charting features have improved significantly over the

More information

January 26, 2009 The Faculty Center for Teaching and Learning

January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i

More information

An introduction to IBM SPSS Statistics

An introduction to IBM SPSS Statistics An introduction to IBM SPSS Statistics Contents 1 Introduction... 1 2 Entering your data... 2 3 Preparing your data for analysis... 10 4 Exploring your data: univariate analysis... 14 5 Generating descriptive

More information

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

IBM SPSS Statistics 20 Part 1: Descriptive Statistics CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 1: Descriptive Statistics Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the

More information

Financial Econometrics MFE MATLAB Introduction. Kevin Sheppard University of Oxford

Financial Econometrics MFE MATLAB Introduction. Kevin Sheppard University of Oxford Financial Econometrics MFE MATLAB Introduction Kevin Sheppard University of Oxford October 21, 2013 2007-2013 Kevin Sheppard 2 Contents Introduction i 1 Getting Started 1 2 Basic Input and Operators 5

More information

Systat: Statistical Visualization Software

Systat: Statistical Visualization Software Systat: Statistical Visualization Software Hilary R. Hafner Jennifer L. DeWinter Steven G. Brown Theresa E. O Brien Sonoma Technology, Inc. Petaluma, CA Presented in Toledo, OH October 28, 2011 STI-910019-3946

More information

Introduction. Chapter 1

Introduction. Chapter 1 Chapter 1 Introduction MATLAB (Matrix laboratory) is an interactive software system for numerical computations and graphics. As the name suggests, MATLAB is especially designed for matrix computations:

More information

CD-ROM Appendix E: Matlab

CD-ROM Appendix E: Matlab CD-ROM Appendix E: Matlab Susan A. Fugett Matlab version 7 or 6.5 is a very powerful tool useful for many kinds of mathematical tasks. For the purposes of this text, however, Matlab 7 or 6.5 will be used

More information

Simple Predictive Analytics Curtis Seare

Simple Predictive Analytics Curtis Seare Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use

More information

Microsoft Excel 2010 Part 3: Advanced Excel

Microsoft Excel 2010 Part 3: Advanced Excel CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES Microsoft Excel 2010 Part 3: Advanced Excel Winter 2015, Version 1.0 Table of Contents Introduction...2 Sorting Data...2 Sorting

More information

http://school-maths.com Gerrit Stols

http://school-maths.com Gerrit Stols For more info and downloads go to: http://school-maths.com Gerrit Stols Acknowledgements GeoGebra is dynamic mathematics open source (free) software for learning and teaching mathematics in schools. It

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

More information

PCHS ALGEBRA PLACEMENT TEST

PCHS ALGEBRA PLACEMENT TEST MATHEMATICS Students must pass all math courses with a C or better to advance to the next math level. Only classes passed with a C or better will count towards meeting college entrance requirements. If

More information

Regression and Programming in R. Anja Bråthen Kristoffersen Biomedical Research Group

Regression and Programming in R. Anja Bråthen Kristoffersen Biomedical Research Group Regression and Programming in R Anja Bråthen Kristoffersen Biomedical Research Group R Reference Card http://cran.r-project.org/doc/contrib/short-refcard.pdf Simple linear regression Describes the relationship

More information

IBM SPSS Direct Marketing 19

IBM SPSS Direct Marketing 19 IBM SPSS Direct Marketing 19 Note: Before using this information and the product it supports, read the general information under Notices on p. 105. This document contains proprietary information of SPSS

More information

Directions for using SPSS

Directions for using SPSS Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

More information

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences. 1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

More information

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

More information

PASW Direct Marketing 18

PASW Direct Marketing 18 i PASW Direct Marketing 18 For more information about SPSS Inc. software products, please visit our Web site at http://www.spss.com or contact SPSS Inc. 233 South Wacker Drive, 11th Floor Chicago, IL 60606-6412

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices: Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:

More information

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,

More information

Package neuralnet. February 20, 2015

Package neuralnet. February 20, 2015 Type Package Title Training of neural networks Version 1.32 Date 2012-09-19 Package neuralnet February 20, 2015 Author Stefan Fritsch, Frauke Guenther , following earlier work

More information

Precalculus REVERSE CORRELATION. Content Expectations for. Precalculus. Michigan CONTENT EXPECTATIONS FOR PRECALCULUS CHAPTER/LESSON TITLES

Precalculus REVERSE CORRELATION. Content Expectations for. Precalculus. Michigan CONTENT EXPECTATIONS FOR PRECALCULUS CHAPTER/LESSON TITLES Content Expectations for Precalculus Michigan Precalculus 2011 REVERSE CORRELATION CHAPTER/LESSON TITLES Chapter 0 Preparing for Precalculus 0-1 Sets There are no state-mandated Precalculus 0-2 Operations

More information

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs Using Excel Jeffrey L. Rummel Emory University Goizueta Business School BBA Seminar Jeffrey L. Rummel BBA Seminar 1 / 54 Excel Calculations of Descriptive Statistics Single Variable Graphs Relationships

More information

Big Data User s Guide for TIBCO Spotfire S+ 8.2

Big Data User s Guide for TIBCO Spotfire S+ 8.2 Big Data User s Guide for TIBCO Spotfire S+ 8.2 November 2010 TIBCO Software Inc. IMPORTANT INFORMATION SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED TIBCO

More information

Data Analysis in SPSS. February 21, 2004. If you wish to cite the contents of this document, the APA reference for them would be

Data Analysis in SPSS. February 21, 2004. If you wish to cite the contents of this document, the APA reference for them would be Data Analysis in SPSS Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Heather Claypool Department of Psychology Miami University

More information

JavaScript: Introduction to Scripting. 2008 Pearson Education, Inc. All rights reserved.

JavaScript: Introduction to Scripting. 2008 Pearson Education, Inc. All rights reserved. 1 6 JavaScript: Introduction to Scripting 2 Comment is free, but facts are sacred. C. P. Scott The creditor hath a better memory than the debtor. James Howell When faced with a decision, I always ask,

More information

APPLICATION FOR PART-TIME EMPLOYMENT AS A TUTOR TUTOR IN THE DOLCIANI MATHEMATICS LEARNING CENTER

APPLICATION FOR PART-TIME EMPLOYMENT AS A TUTOR TUTOR IN THE DOLCIANI MATHEMATICS LEARNING CENTER APPLICATION FOR PART-TIME EMPLOYMENT AS A TUTOR TUTOR IN THE DOLCIANI MATHEMATICS LEARNING CENTER Dear Applicant, As you consider applying for a position in the Dolciani Mathematics Learning Center, there

More information

Time Series Analysis AMS 316

Time Series Analysis AMS 316 Time Series Analysis AMS 316 Programming language and software environment for data manipulation, calculation and graphical display. Originally created by Ross Ihaka and Robert Gentleman at University

More information

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:

More information

LAYOUT OF THE KEYBOARD

LAYOUT OF THE KEYBOARD Dr. Charles Hofmann, LaSalle hofmann@lasalle.edu Dr. Roseanne Hofmann, MCCC rhofman@mc3.edu ------------------------------------------------------------------------------------------------- DISPLAY CONTRAST

More information

IBM SPSS Statistics for Beginners for Windows

IBM SPSS Statistics for Beginners for Windows ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning

More information

Analysis of System Performance IN2072 Chapter M Matlab Tutorial

Analysis of System Performance IN2072 Chapter M Matlab Tutorial Chair for Network Architectures and Services Prof. Carle Department of Computer Science TU München Analysis of System Performance IN2072 Chapter M Matlab Tutorial Dr. Alexander Klein Prof. Dr.-Ing. Georg

More information

Working with Excel in Origin

Working with Excel in Origin Working with Excel in Origin Limitations When Working with Excel in Origin To plot your workbook data in Origin, you must have Excel version 7 (Microsoft Office 95) or later installed on your computer

More information