Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16
|
|
|
- Pierce Long
- 9 years ago
- Views:
Transcription
1 Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16 Since R is command line driven and the primary software of Chapter 16, this document details all commands used to generate Chapter 16 s output and figures. To help familiarize yourself with the software, we suggest that you try to run these commands and compare your output to those in the chapter before trying to tackle the chapter exercises. Below is a screenshot of R for Windows. To utilize functions from an R package, you need to
2 load the package. We ll assume you already have installed the boot package, so loading it involves the following set of clicks starting with the Packages item on the R menu. Packages > Load Package > boot Once you have the package loaded, you are now able to utilize the functions within this package. Figure 16.1 First we need to load in the data set into R. R has a built in function read.csv that reads.csv files into R. We recommend that all Excel files to be read into R be saved as a.csv file. Once that is done, the following commands read the file into the data matrix bc and then attach it so that the columns can be referred to by their column names rather than referring to everything using rows and columns of bc. bc <- read.csv("u:/ips8/datasets/chapter16/time50.csv") attach(bc) The variable of interest here is Time. To get the mean and median, we can simply type the following commands. The output is shown below each command. Some of these basic summary statistic commands are helpful in checking that the data set has been read in correctly. mean(time) [1] median(time) [1] 12 The next set of commands generates the histogram and Normal quantile plot of Figure The default histogram in R is one that displays the counts. To get the percentages, we alter the histogram heights to be the percentages and then use the plot command to create the histogram with appropriate axis labels. The Normal quantile plot utilizes the qqnorm function. h <- hist(time) h$density = h$counts/sum(h$counts) plot(h,freq=f,main="",xlab="time (in days)",ylab="percent",las=1) qqnorm(time,main="",xlab="normal score",ylab="times (in days)")
3 Figures 16.3 and 16.5 These are the first figures that use a function from the boot package. The boot function requires the sample statistic of interest be defined as an R function, so the first command defines the mean as an R function called theta. The second command requests that 3000 bootstrap sample means be generated from the variable Time. These 3000 sample means are saved in the data object named time.boot. By simply typing time.boot, you get a summary of the bootstrapping. That is what is shown in Figure theta <- function(data,indices) { d <- data[indices] mean(d) time.boot <- boot(time,theta,r=3000) time.boot Figure 16.3 is generated by the following set of commands. By requesting prob=t in the hist function, a density histogram (area of the histograms bars sums to 1) is generated. The Normal curve based on the 3000 generated samples is then drawn (need the mean and standard deviation of the bootstrap sample), followed by vertical lines representing the sample mean and bootstrap sample mean. For the Normal quantile plot, we also draw the line the observations should follow. hist(time.boot$t,prob=t,axes=f,main="",xlab="mean times of resamples (in days)",ylab="") axis(1) jcc <- dnorm(seq(12,40,length=500),mean(time.boot$t),sqrt(var(time.boot$t))) lines(seq(12,40,length=500),jcc) lines(c(mean(time),mean(time)),c(0,dnorm(mean(time),mean(time.boot$t),sqrt(var(ti me.boot$t)))),col=2) afc <- mean(time.boot$t) lines(c(afc,afc),c(0,dnorm(afc,mean(time.boot$t),sqrt(var(time.boot$t)))),lty=2) qqnorm(time.boot$t,main="",xlab="normal score",ylab="mean times of resamples (in days)") qqline(time.boot$t, col = 2,lwd=1,lty=1)
4 A crude version of this plot can also be generated using the plot function. That command is simply plot(time.boot) Figures 16.6 and 16.7 For this figure, we create our variable with the data rather than reading in a data set. This is only a viable option when the data set is small. The rest of the commands are similar to those used with the starting time data set. stout <- c(94,46,78,66,140,24) theta <- function(data,indices) { d <- data[indices] mean(d) stout.boot <- boot(stout,theta,r=3000) stout.boot plot(stout.boot) Figure 16.8 This figure is similar to Figure 16.1 so the commands are similar. If you compare these commands to those for Figure 16.1, only the variable names and data set names have changed. gpa <- read.csv("u:/ips8/datasets/chapter16/gpa.csv") attach(gpa) h = hist(gpa,nclass=20) h$density = h$counts/sum(h$counts) plot(h,freq=f,main="",xlab="gpa",ylab="percent",las=1) qqnorm(gpa,main="",xlab="normal score",ylab="gpa") Figure 16.9 and accompanying output For this figure and output, we are using the trimmed mean statistic. This is available within the mean function. This means we need to tweak the function theta that will be used within the boot function. The rest of the commands are similar to those used for Figure mean(gpa, trim=0.25)
5 theta <- function(data,indices) { d <- data[indices] mean(d,trim=0.25) gpa.boot <- boot(gpa,theta,r=3000) hist(gpa.boot$t,prob=t,axes=f,main="",xlab="means of resamples",ylab="",ylim=c(0,max(dnorm(mean(gpa,trim=0.25),mean(gpa.boot$t),sqrt(v ar(gpa.boot$t)))))) axis(1) jcc <- dnorm(seq(2.5,3.5,length=500),mean(gpa.boot$t),sqrt(var(gpa.boot$t))) lines(seq(2.5,3.5,length=500),jcc) lines(c(mean(gpa,trim=0.25),mean(gpa,trim=0.25)),c(0,dnorm(mean(gpa,trim=0.25),m ean(gpa.boot$t),sqrt(var(gpa.boot$t)))),col=2) afc <- mean(gpa.boot$t) lines(c(afc,afc),c(0,dnorm(afc,mean(gpa.boot$t),sqrt(var(gpa.boot$t)))),lty=2) qqnorm(gpa.boot$t,main="",xlab="normal score",ylab="means of resamples") qqline(gpa.boot$t, col = 2,lwd=1,lty=1) Example 16.6 To generate the values that appear in the table, we use the length, mean, and var functions. Since sex is coded 1=Male and 2=Female, we can restrict attention to GPA scores of just one sex by adding the sex specification within brackets. mean(gpa[sex==1]) sqrt(var(gpa[sex==1])) length(gpa[sex==1]) mean(gpa[sex==2]) sqrt(var(gpa[sex==2])) length(gpa[sex==2])
6 Figure A function from another package is needed to generate the density curves. The first command loads this package. This command works only if the package has already been installed. The function is called sm.density.compare. This function allows you to place a legend anywhere on the plot using a click of the mouse. The other commands set up the legend for this example. Make sure you click on the figure to place the legend. Until you do this, you will not get another command prompt. The last two commands generate the Normal quantile plot. The first plots the data for the males (sex=1) and the last two add the data for the females (sex=2) to this plot. library(sm) # create value labels cyl.f <- factor(sex, levels= c(1,2), labels = c("1 Male", "2 Female")) # plot densities sm.density.compare(gpa, sex, xlab="gpa",xlim=c(0,4)) # add legend via mouse click colfill<-c(2:(2+length(levels(cyl.f)))) legend(locator(1), levels(cyl.f), fill=colfill) qqnorm(gpa[sex==1],main="",xlab="normal score",ylab="gpa",col=2) bcc <- qqnorm(gpa[sex==2],main="",xlab="normal score",ylab="gpa",plot=f) points(bcc$x,bcc$y,col=3) Figure and accompanying output The commands for this figure and output are similar to those used for the trimmed mean (Figure 16.9). The key is defining the function to be used with the boot function. Rather than call this function theta, we call it meandiff here to describe the function. Because this example compares the means of two groups, the function involves two columns from the gpa matrix. Column 2 contains the GPAs and Column 9 contains the sex variable.
7 meandiff <- function(x, w){ y <- tapply(x[w,2], x[w,9], mean) y[1]-y[2] gpa1.boot <- boot(gpa,meandiff,r=3000,strata=sex) gpa1.boot hist(gpa1.boot$t,prob=t,axes=f,main="",xlab="difference in means of resamples",ylab="",ylim=c(0,max(dnorm(mean(gpa1.boot$t),mean(gpa1.boot$t),sqrt(va r(gpa1.boot$t)))))) axis(1) jcc <- dnorm(seq(-0.6,0.4,length=500),mean(gpa1.boot$t),sqrt(var(gpa1.boot$t))) lines(seq(-0.6,0.4,length=500),jcc) lines(c( , ),c(0,dnorm( ,mean(gpa1.boot$t),sqrt(var(gpa1.boot$t)))),col=2) afc <- mean(gpa1.boot$t) lines(c(afc,afc),c(0,dnorm(afc,mean(gpa1.boot$t),sqrt(var(gpa1.boot$t)))),lty=2) qqnorm(gpa1.boot$t,main="",xlab="normal score",ylab="difference in means of resamples") qqline(gpa1.boot$t, col = 2,lwd=1,lty=1) Example 16.8 The R command to get the 2.5 and 97.5 percentiles uses the quantile function. You specify the variable containing the bootstrap sample statistics and then a vector containing the percentiles. quantile(gpa.boot$t,c(.025,.975)) Example 16.9 and Figures and Similar to Figure and accompanying output, which looked at the difference in the means of two groups, the function to be used in boot here is a bit more complicated. The function ratvar involves the second column, which contains the GPAs, and the ninth column, which contains the gender information. Rather than take a difference in means here, the statistic is a
8 ratio of variances. The other addition here is we utilize the function boot.ci to obtain the various confidence intervals based on our bootstrapping results. The remaining commands create the histogram in Figure ratvar <- function(x, w) { y <- tapply(x[w,2], x[w,9], var) y[1]/y[2] gpa2.boot <- boot(gpa,ratvar,r=5000,strata=sex) boot.ci(gpa2.boot) hist(gpa2.boot$t,prob=t,axes=f,main="",xlab="ratio of variances (male to female) of resamples",ylab="") axis(1) jcc <- dnorm(seq(0.5,3,length=500),mean(gpa2.boot$t),sqrt(var(gpa2.boot$t))) lines(seq(.5,3,length=500),jcc) lines(c(var(gpa[sex==1])/var(gpa[sex==2]), var(gpa[sex==1])/var(gpa[sex==2])),c(0,dnorm(var(gpa[sex==1])/var(gpa[sex==2]),me an(gpa2.boot$t),sqrt(var(gpa2.boot$t)))),col=2) afc <- mean(gpa2.boot$t) lines(c(afc,afc),c(0,dnorm(afc,mean(gpa2.boot$t),sqrt(var(gpa2.boot$t)))),lty=2) Example and Figure For this example, we first need to read in the data set and attach it so we can refer to variables by their column names. After that, we define the function to be used for the bootstrapping. In this case, it is the correlation between the rating, which appears in column 3 of the data set, and the price, which appears in column 4. With this function defined, we then call the boot function to do the bootstrapping. The remainder of the commands generates the histogram and Normal quantile plot that we ve seen before. bc <- read.csv("u:/ips8/datasets/ch16/laundry.csv") attach(bc)
9 theta <- function(data,indices){ d <- data[indices,] cov(d[,3],d[,4])/sqrt(var(d[,3])*var(d[,4])) corr.boot <- boot(bc,theta,r=5000) hist(corr.boot$t,prob=t,axes=f,main="",xlab="correlation coefficient of resamples",ylab="") axis(1) jcc <- dnorm(seq(0.25,.995,length=500),mean(corr.boot$t),sqrt(var(corr.boot$t))) lines(seq(.25,.995,length=500),jcc) lines(c( , ),c(0,dnorm( ,mean(corr.boot$t),sqrt(var(corr.boot$t )))),col=2) afc <- mean(corr.boot$t) lines(c(afc,afc),c(0,dnorm(afc,mean(corr.boot$t),sqrt(var(corr.boot$t)))),lty=2) qqnorm(corr.boot$t,main="",xlab="normal score",ylab="correlation coefficient") qqline(corr.boot$t, col = 2,lwd=1,lty=1) Software output of Example For this example, we again enter the data into two variables directly rather than reading in a file. To do a permutation test, a different package is needed. The first command loads the perm package (assuming is has already been installed). The second and third commands enter the data. The function we need is called permts and is controlled by a set of rules. Here we use the default rules except for the number of permutations, which we set to library(perm) trtgrp <- c(24,43,58,71,43,49,61,44,67,49,53,56,59,52,62,54,57,33,46,43,57) ctrlgp <- c(42,43,55,26,62,37,33,41,19,54,20,85,46,10,17,60,53,42,37,42,55,28,48) rules <- permcontrol(nmc=5000) permts(trtgrp,ctrlgp,alternative="greater",method="exact.mc",control=rules)
10 Example For this example we compare the male and female GPAs using a permutation test. The commands will be very similar to those for Example First the data set is read in. Then we call the function permts using the sex== within brackets to separate the GPAs by gender. bc <- read.csv("u:/ips8/datasets/ch16/gpa.csv") attach(bc) rules <- permcontrol(nmc=5000) permts(gpa[sex==1],gpa[sex==2],method="exact.mc",control=rules)
Scatter Plots with Error Bars
Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each
Using SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
4. Descriptive Statistics: Measures of Variability and Central Tendency
4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and
Data exploration with Microsoft Excel: analysing more than one variable
Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application
IBM SPSS Statistics for Beginners for Windows
ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning
5 Correlation and Data Exploration
5 Correlation and Data Exploration Correlation In Unit 3, we did some correlation analyses of data from studies related to the acquisition order and acquisition difficulty of English morphemes by both
If there is not a Data Analysis option under the DATA menu, you will need to install the Data Analysis ToolPak as an add-in for Microsoft Excel.
If there is not a Data Analysis option under the DATA menu, you will need to install the Data Analysis ToolPak as an add-in for Microsoft Excel. 1. Click on the FILE tab and then select Options from the
STEP TWO: Highlight the data set, then select DATA PIVOT TABLE
STEP ONE: Enter the data into a database format, with the first row being the variable names, and each row thereafter being one completed survey. For this tutorial, highlight this table, copy and paste
5. Correlation. Open HeightWeight.sav. Take a moment to review the data file.
5. Correlation Objectives Calculate correlations Calculate correlations for subgroups using split file Create scatterplots with lines of best fit for subgroups and multiple correlations Correlation The
Statistical Data analysis With Excel For HSMG.632 students
1 Statistical Data analysis With Excel For HSMG.632 students Dialog Boxes Descriptive Statistics with Excel To find a single descriptive value of a data set such as mean, median, mode or the standard deviation,
Exploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Getting to know the data An important first step before performing any kind of statistical analysis is to familiarize
SPSS Manual for Introductory Applied Statistics: A Variable Approach
SPSS Manual for Introductory Applied Statistics: A Variable Approach John Gabrosek Department of Statistics Grand Valley State University Allendale, MI USA August 2013 2 Copyright 2013 John Gabrosek. All
STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
TIPS FOR DOING STATISTICS IN EXCEL
TIPS FOR DOING STATISTICS IN EXCEL Before you begin, make sure that you have the DATA ANALYSIS pack running on your machine. It comes with Excel. Here s how to check if you have it, and what to do if you
A Picture Really Is Worth a Thousand Words
4 A Picture Really Is Worth a Thousand Words Difficulty Scale (pretty easy, but not a cinch) What you ll learn about in this chapter Why a picture is really worth a thousand words How to create a histogram
Dongfeng Li. Autumn 2010
Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis
Introduction Course in SPSS - Evening 1
ETH Zürich Seminar für Statistik Introduction Course in SPSS - Evening 1 Seminar für Statistik, ETH Zürich All data used during the course can be downloaded from the following ftp server: ftp://stat.ethz.ch/u/sfs/spsskurs/
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
There are six different windows that can be opened when using SPSS. The following will give a description of each of them.
SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet
Using Excel in Research. Hui Bian Office for Faculty Excellence
Using Excel in Research Hui Bian Office for Faculty Excellence Data entry in Excel Directly type information into the cells Enter data using Form Command: File > Options 2 Data entry in Excel Tool bar:
Name: Date: Use the following to answer questions 2-3:
Name: Date: 1. A study is conducted on students taking a statistics class. Several variables are recorded in the survey. Identify each variable as categorical or quantitative. A) Type of car the student
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
GeoGebra. 10 lessons. Gerrit Stols
GeoGebra in 10 lessons Gerrit Stols Acknowledgements GeoGebra is dynamic mathematics open source (free) software for learning and teaching mathematics in schools. It was developed by Markus Hohenwarter
Using Excel for descriptive statistics
FACT SHEET Using Excel for descriptive statistics Introduction Biologists no longer routinely plot graphs by hand or rely on calculators to carry out difficult and tedious statistical calculations. These
Data exploration with Microsoft Excel: univariate analysis
Data exploration with Microsoft Excel: univariate analysis Contents 1 Introduction... 1 2 Exploring a variable s frequency distribution... 2 3 Calculating measures of central tendency... 16 4 Calculating
Gamma Distribution Fitting
Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics
Gestation Period as a function of Lifespan
This document will show a number of tricks that can be done in Minitab to make attractive graphs. We work first with the file X:\SOR\24\M\ANIMALS.MTP. This first picture was obtained through Graph Plot.
Chapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
Psychology 205: Research Methods in Psychology
Psychology 205: Research Methods in Psychology Using R to analyze the data for study 2 Department of Psychology Northwestern University Evanston, Illinois USA November, 2012 1 / 38 Outline 1 Getting ready
Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.
1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis
Estimating a market model: Step-by-step Prepared by Pamela Peterson Drake Florida Atlantic University
Estimating a market model: Step-by-step Prepared by Pamela Peterson Drake Florida Atlantic University The purpose of this document is to guide you through the process of estimating a market model for the
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce
Directions for Frequency Tables, Histograms, and Frequency Bar Charts
Directions for Frequency Tables, Histograms, and Frequency Bar Charts Frequency Distribution Quantitative Ungrouped Data Dataset: Frequency_Distributions_Graphs-Quantitative.sav 1. Open the dataset containing
Figure 1. An embedded chart on a worksheet.
8. Excel Charts and Analysis ToolPak Charts, also known as graphs, have been an integral part of spreadsheets since the early days of Lotus 1-2-3. Charting features have improved significantly over the
IBM SPSS Statistics 20 Part 1: Descriptive Statistics
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 1: Descriptive Statistics Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
Statgraphics Getting started
Statgraphics Getting started The aim of this exercise is to introduce you to some of the basic features of the Statgraphics software. Starting Statgraphics 1. Log in to your PC, using the usual procedure
Variables. Exploratory Data Analysis
Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is
SPSS Workbook 1 Data Entry : Questionnaire Data
TEESSIDE UNIVERSITY SCHOOL OF HEALTH & SOCIAL CARE SPSS Workbook 1 Data Entry : Questionnaire Data Prepared by: Sylvia Storey [email protected] SPSS data entry 1 This workbook is designed to introduce
SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout
Analyzing Data SPSS Resources 1. See website (readings) for SPSS tutorial & Stats handout Don t have your own copy of SPSS? 1. Use the libraries to analyze your data 2. Download a trial version of SPSS
DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
An introduction to using Microsoft Excel for quantitative data analysis
Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing
Using Excel for Statistical Analysis
Using Excel for Statistical Analysis You don t have to have a fancy pants statistics package to do many statistical functions. Excel can perform several statistical tests and analyses. First, make sure
Multiple Regression. Page 24
Multiple Regression Multiple regression is an extension of simple (bi-variate) regression. The goal of multiple regression is to enable a researcher to assess the relationship between a dependent (predicted)
Information Technology Services will be updating the mark sense test scoring hardware and software on Monday, May 18, 2015. We will continue to score
Information Technology Services will be updating the mark sense test scoring hardware and software on Monday, May 18, 2015. We will continue to score all Spring term exams utilizing the current hardware
Chapter 4 Displaying and Describing Categorical Data
Chapter 4 Displaying and Describing Categorical Data Chapter Goals Learning Objectives This chapter presents three basic techniques for summarizing categorical data. After completing this chapter you should
PloneSurvey User Guide (draft 3)
- 1 - PloneSurvey User Guide (draft 3) This short document will hopefully contain enough information to allow people to begin creating simple surveys using the new Plone online survey tool. Caveat PloneSurvey
Advanced Excel for Institutional Researchers
Advanced Excel for Institutional Researchers Presented by: Sandra Archer Helen Fu University Analysis and Planning Support University of Central Florida September 22-25, 2012 Agenda Sunday, September 23,
Cell Phone Impairment?
Cell Phone Impairment? Overview of Lesson This lesson is based upon data collected by researchers at the University of Utah (Strayer and Johnston, 2001). The researchers asked student volunteers (subjects)
Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering
Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques
Drawing a histogram using Excel
Drawing a histogram using Excel STEP 1: Examine the data to decide how many class intervals you need and what the class boundaries should be. (In an assignment you may be told what class boundaries to
Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
3.4 The Normal Distribution
3.4 The Normal Distribution All of the probability distributions we have found so far have been for finite random variables. (We could use rectangles in a histogram.) A probability distribution for a continuous
Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs
Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)
Describing, Exploring, and Comparing Data
24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter
Chapter 7 Section 7.1: Inference for the Mean of a Population
Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used
GeoGebra Statistics and Probability
GeoGebra Statistics and Probability Project Maths Development Team 2013 www.projectmaths.ie Page 1 of 24 Index Activity Topic Page 1 Introduction GeoGebra Statistics 3 2 To calculate the Sum, Mean, Count,
Using MS Excel to Analyze Data: A Tutorial
Using MS Excel to Analyze Data: A Tutorial Various data analysis tools are available and some of them are free. Because using data to improve assessment and instruction primarily involves descriptive and
January 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
Statistics Revision Sheet Question 6 of Paper 2
Statistics Revision Sheet Question 6 of Paper The Statistics question is concerned mainly with the following terms. The Mean and the Median and are two ways of measuring the average. sumof values no. of
containing Kendall correlations; and the OUTH = option will create a data set containing Hoeffding statistics.
Getting Correlations Using PROC CORR Correlation analysis provides a method to measure the strength of a linear relationship between two numeric variables. PROC CORR can be used to compute Pearson product-moment
Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; [email protected], and Dr. J.A. Dobelman
Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; [email protected], and Dr. J.A. Dobelman Statistics lab will be mainly focused on applying what you have learned in class with
Simulating Investment Portfolios
Page 5 of 9 brackets will now appear around your formula. Array formulas control multiple cells at once. When gen_resample is used as an array formula, it assures that the random sample taken from the
6 3 The Standard Normal Distribution
290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since
Dealing with Data in Excel 2010
Dealing with Data in Excel 2010 Excel provides the ability to do computations and graphing of data. Here we provide the basics and some advanced capabilities available in Excel that are useful for dealing
Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.
Excel Tutorial Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Working with Data Entering and Formatting Data Before entering data
How To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
The Normal Distribution
Chapter 6 The Normal Distribution 6.1 The Normal Distribution 1 6.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Recognize the normal probability distribution
Descriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
SECTION 2-1: OVERVIEW SECTION 2-2: FREQUENCY DISTRIBUTIONS
SECTION 2-1: OVERVIEW Chapter 2 Describing, Exploring and Comparing Data 19 In this chapter, we will use the capabilities of Excel to help us look more carefully at sets of data. We can do this by re-organizing
Directions for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
R Graphics Cookbook. Chang O'REILLY. Winston. Tokyo. Beijing Cambridge. Farnham Koln Sebastopol
R Graphics Cookbook Winston Chang Beijing Cambridge Farnham Koln Sebastopol O'REILLY Tokyo Table of Contents Preface ix 1. R Basics 1 1.1. Installing a Package 1 1.2. Loading a Package 2 1.3. Loading a
AMS 7L LAB #2 Spring, 2009. Exploratory Data Analysis
AMS 7L LAB #2 Spring, 2009 Exploratory Data Analysis Name: Lab Section: Instructions: The TAs/lab assistants are available to help you if you have any questions about this lab exercise. If you have any
Package SHELF. February 5, 2016
Type Package Package SHELF February 5, 2016 Title Tools to Support the Sheffield Elicitation Framework (SHELF) Version 1.1.0 Date 2016-01-29 Author Jeremy Oakley Maintainer Jeremy Oakley
Simple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
Tutorial Segmentation and Classification
MARKETING ENGINEERING FOR EXCEL TUTORIAL VERSION 1.0.8 Tutorial Segmentation and Classification Marketing Engineering for Excel is a Microsoft Excel add-in. The software runs from within Microsoft Excel
AMATH 352 Lecture 3 MATLAB Tutorial Starting MATLAB Entering Variables
AMATH 352 Lecture 3 MATLAB Tutorial MATLAB (short for MATrix LABoratory) is a very useful piece of software for numerical analysis. It provides an environment for computation and the visualization. Learning
General instructions for the content of all StatTools assignments and the use of StatTools:
General instructions for the content of all StatTools assignments and the use of StatTools: An important part of Business Management 330 is learning how to conduct statistical analyses and to write text
Excel 2003: Ringtones Task
Excel 2003: Ringtones Task 1. Open up a blank spreadsheet 2. Save the spreadsheet to your area and call it Ringtones.xls 3. Add the data as shown here, making sure you keep to the cells as shown Make sure
Using Microsoft Excel to Plot and Analyze Kinetic Data
Entering and Formatting Data Using Microsoft Excel to Plot and Analyze Kinetic Data Open Excel. Set up the spreadsheet page (Sheet 1) so that anyone who reads it will understand the page (Figure 1). Type
STAB22 section 1.1. total = 88(200/100) + 85(200/100) + 77(300/100) + 90(200/100) + 80(100/100) = 176 + 170 + 231 + 180 + 80 = 837,
STAB22 section 1.1 1.1 Find the student with ID 104, who is in row 5. For this student, Exam1 is 95, Exam2 is 98, and Final is 96, reading along the row. 1.2 This one involves a careful reading of the
Introduction to R and Exploratory data analysis
Introduction to R and Exploratory data analysis Gavin Simpson November 2006 Summary In this practical class we will introduce you to working with R. You will complete an introductory session with R and
Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics
Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This
Session 10. Laboratory Works
Session 10 Laboratory Works Adding-in the data analysis tool pack to excel Statistical analysis such as descriptive statistics and regression requires the Excel Data Analysis add-in. The default configuration
SPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,
Linear Models in STATA and ANOVA
Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples
GETTING YOUR DATA INTO SPSS
GETTING YOUR DATA INTO SPSS UNIVERSITY OF GUELPH LUCIA COSTANZO [email protected] REVISED SEPTEMBER 2011 CONTENTS Getting your Data into SPSS... 0 SPSS availability... 3 Data for SPSS Sessions... 4
Beginner s Matlab Tutorial
Christopher Lum [email protected] Introduction Beginner s Matlab Tutorial This document is designed to act as a tutorial for an individual who has had no prior experience with Matlab. For any questions
Multivariate Analysis of Ecological Data
Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology
Chapter 7 Section 1 Homework Set A
Chapter 7 Section 1 Homework Set A 7.15 Finding the critical value t *. What critical value t * from Table D (use software, go to the web and type t distribution applet) should be used to calculate the
Getting started in Excel
Getting started in Excel Disclaimer: This guide is not complete. It is rather a chronicle of my attempts to start using Excel for data analysis. As I use a Mac with OS X, these directions may need to be
KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management
KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To
When to use Excel. When NOT to use Excel 9/24/2014
Analyzing Quantitative Assessment Data with Excel October 2, 2014 Jeremy Penn, Ph.D. Director When to use Excel You want to quickly summarize or analyze your assessment data You want to create basic visual
EXST SAS Lab Lab #4: Data input and dataset modifications
EXST SAS Lab Lab #4: Data input and dataset modifications Objectives 1. Import an EXCEL dataset. 2. Infile an external dataset (CSV file) 3. Concatenate two datasets into one 4. The PLOT statement will
Microsoft Excel Tutorial
Microsoft Excel Tutorial by Dr. James E. Parks Department of Physics and Astronomy 401 Nielsen Physics Building The University of Tennessee Knoxville, Tennessee 37996-1200 Copyright August, 2000 by James
