An Introduction to Statistical Methods in GenStat
|
|
|
- Jeremy Atkins
- 9 years ago
- Views:
Transcription
1 An Introduction to Statistical Methods in GenStat Alex Glaser VSN International, 5 The Waterhouse, Waterhouse Street, Hemel Hempstead, UK [email protected] [email protected] Many thanks to Roger Payne for the original slides Aberystwyth, January 2011
2 Programme Day 1 Introduction to GenStat From t-test to one-way anova Basic principles of design and blocking Treatment structure factorials & interactions and checking the assumptions Day 2 Simple linear regression Multiple linear regression GLM counts and binomial data GLM further models and extensions
3 Aim of course To give you an overall introduction to the GenStat 13th Edition system.
4 Learning objectives By the end of the course, you will be able to Navigate the GenStat interface Obtain help from the system where necessary Input and manage data Analyse data through GenStat menus All without the help of the trainer.
5 Exercise 1.1 What happens when you select input log in the window navigator? Can you see yourself using this feature in you work? If so, how? What happens to status bar when you click the button? Resize the input log and output window so that you can see both simultaneously What happens when you click the button? Use the tools customize toolbar menu to add or remove buttons from the toolbar to suit your needs.
6 Exercise 1.2 What happens to the text in right hand corner of the status bar if you press the insert key? What do you think this part of the status bar means? Open a new text window using the button. In this window, type the following GenStat command PRINT This is my first time using GenStat Execute the command using the Run Submit Line menu option. Now select the Window Event Log entry for this action. Is there an Event log for this action?
7 GenStat Client Menus Commands GenStat Server
8 Exercise 2 Find help for what s new in the 13 th edition of GenStat Find help on the GenStat spreadsheet Open the Tools Options menu and find help about the ECHO COMMANDS setting on the AUDIT TRAIL tab. Open a new test window and type in the word FIT. Place the cursor in the word and press the F1 key. What is FIT? Type in a statistical term and press the F1 key. View the Introduction to GenStat guide (pdf format) View an example program for a two-sample t- test.
9 Data / Load Menu File Menu ASCII Spreadsheet Database files Other Statistics packages GenStat Save Set up ODBC query Saved ODBC Queries DDE links Spread Menu Spreadsheet Other Statistics packages GIS GenStat Save GenStat session Database files Saved ODBC Queries Saved DDE links Blank / type data Data in GenStat to edit From clipboard Excel Set up ODBC query DDE link Central Data Core
10 Exercise 3.1 Clear all the data from GenStat and use the file open menu to read the data from the file sulphur.xls from installsets\data Clear all the data from GenStat. Go to the tools spreadsheet options file menu and uncheck the use excel import wizard on file open option. Repeat part 1 using the file open menu. Which approach best suits your way of working? The file bacteria.xls, that you met earlier, contains data from a second experiment in the worksheet called Bacteria Counts. The data are not stored in standard format; the data can be found in the range of cells D3:E13. Clear the data core. Read the data into GenStat using the Excel import wizard button.
11 Exercise 3.2 & 3.3 Using the data in the iris.gsh file: Produce a scatter plot of Sepal Width versus Petal Width. There is one point in this plot that stands alone. What are the coordinates of this point? Can you suggest a method of easily identifying to which species of iris this unusual point belongs? Produce a scatter plot of Sepal Length versus Petal Length. Give each factor a different symbol and colours. Experiment with labelling. Produce a histogram of Petal lengths versus Petal widths. Using your own data, experiment with the different aspects of the graphics window. That is, explore the different menus and toolbars. If you have not brought your on data sets, experiment with any of the course data files.
12 Exercise 4.1 Using the Excel Import Wizard, load in the file Traffic.xls On the second screen enter B3:D43 in the Specified Range box. Click OK on the Select Columns to Convert to Factors menu Convert Day and Month to factors using the methods of your choice.
13 Exercise 4.2 Continue using the file Traffic.xls Select a cell in the Day column. Delete the value, type F and then press return. Repeat the process but with the value G. What property of the GenStat spreadsheet do you think this illustrates. Select the Tools Spreadsheet Options Conversions menu. Check the Allow new factor levels in Edit box. Now repeat the above question. What happens now?
14 Exercise 4.3 Continue using the file Traffic.xls Create a new variate which contains the log of the Counts. Sort the columns in descending order of the Counts. Use the Spread Manipulate Unstack to create separate variables for each day of the week. Experiment with the Calculate menu with your own data.
15 1 From t test to one way anova In this session you will learn how to use the t-test to compare two treatments the T-Test menu how to use one-way ANOVA to compare several treatments the model fitted in one-way anova the statistical philosophy behind one-way anova the relationship between one-way anova and the t-test for two treatments how to use the One- and two-way ANOVA menu for oneway anova how to plot the means from one-way anova how to do multiple comparisons Note: topics marked are optional
16 t test suppose we have 2 sets of units, that have received 2 different treatments: animals that have been fed two different diets plots that have been given different fertilisers subjects with different drugs plants with different fungicides. assume the units do not have any special structure e.g. the animals are all of the same breed the plots are in a fairly uniform field the subjects are of similar ages, weights and heights with 2 treatments we may then do a t-test assume each group from a Normal distribution usually assume distributions have the same s.e. (can check) but may have different means
17 Data sets data sets for the examples and practicals can be accessed using the Example Data Sets menu filter by the course Guide to Anova and Design select the file click on Open data
18 t test experiment to study yields from 2 manufacturing methods data in Manufacture.gsh do yields differ more than we would expect from the random variation? can we estimate mean yields from each method?
19 t test menu Use GenStat menus for simplicity
20 Output
21 Practical 1.2 spreadsheet Pots.gsh stores data from a fertilizer experiment 7 plants grown in pots with no fertilizer 8 plants grown in similar conditions with fertilizer do a two-sample t-test to assess whether fertilizer has an effect
22 One way analysis of variance linear model y ij = μ + a i + ε ij represent each mean by grand mean μ + effect a i observations described by fitted value μ + a i + residual ε ij
23 Residual variation may arise from many different causes: the units may not be absolutely identical (discuss later how to allocate units to treatments to take account of this) they may experience slightly different conditions during the experiment there may be measurement errors they may be being dealt with by different people during the experiment and you can no doubt think of others! so estimation is not exact analysis must estimate the amount of variation and take account of it in drawing conclusions
24 One way anova linear model y ij = μ + a i + ε ij if treatments have no effect a 1 = a 2 = 0 y ij = μ + ε ij estimate grand mean by average of all data values assess lack of fit of model by sum of squared residuals (RSS 0 ) degrees of freedom (d.f.) is n 1 +n 2 1 (fitted 1 parameter μ) fit full model estimate a i by average for group i minus grand mean assess lack of fit of model by sum of squared residuals (RSS 1 ) this has n 1 +n 2 2 d.f. (2 parameters as (n 1 a 1 +n 2 a 2 )/(n 1 +n 2 )=0) assess treatments sum of squares due to treatments is TSS=RSS 0 RSS 1 on 1 d.f. assess underlying variation by residual from full model RSS 1 variance ratio is treatment mean square / residual mean square VR = {TSS / 1} / {RSS 1 / (n 1 +n 2 2)} on 1 and (n 1 +n 2 2) d.f.
25 One and two way ANOVA menu
26 Output aov table tables of means s.e.'s for differences between means (m1 m2)/sed = t
27 ANOVA Options menu Options menu controls the output
28 ANOVA Further Output menu Further Output menu provides more output (without redoing the analysis)
29 ANOVA Means Plots menu Means Plots menu plots means as points or joined by lines or with original data points too or in a bar chart
30 Practical 1.4 spreadsheet Pots.gsh stores data from a fertilizer experiment used in Practical plants grown in pots with no fertilizer 8 plants grown in similar conditions with fertilizer do a one-way analysis of variance to assess if fertilizer has an effect compare results with t-test from Practical 1.2
31 One way anova with >2 treatments spreadsheet Rat.gsh has data from an experiment to study effect of dietary supplements on gain in weight of rats 5 diet treatments (a-e) 20 rats allocated at random, 4 per treatment can use One-and two-way ANOVA menu, and plot means, as before
32 Output aov table means s.e.d
33 Plot of means suppose a-e represent amounts 0-4 of supplement might want to assess linear (& quadratic?) effects of supplement
34 Multiple comparison tests in favour there may be many possible comparisons between pairs of treatment means (with t treatments there are t (t 1)/2) so some researchers feel their significance levels should be adjusted to take account of all the tests that they might make against multiple-comparisons are unnecessary if you have only a small number of comparisons to make either because there are few treatments, or because you should have identified beforehand the comparisons that you feel are likely to be of interest they are inappropriate also if the treatments have any sort of structure e.g. levels may represent different amounts of a substance like a fertiliser or a drug, then illogical to assume that only some of the amounts might have an effect see on-line help for the menu
35 Multiple comparisons check that they are enabled on the Menus tab of the Options menu
36 Multiple comparisons the Multiple Comparisons button will then be available to click on the ANOVA Further Output menu check Multiple Comparisons select Treatment and type of Test click OK (and then Run on the Further Output menu)
37 Practical 1.9 spreadsheet Octane.gsh stores data from an experiment to study the effect of different additives A-E on the octane level of gasoline used in Practical 1.7 do a one-way analysis of variance to assess if Gasoline has an effect do a Bonferroni multiplecomparison test to compare the types of gasoline
38 2 Blocking structures In this session you will learn how to improve the precision of an experiment by grouping the units into similar sets called "blocks" how randomization can avoid bias by guarding against unforeseen differences amongst the units how to design and analyse a complete randomized block design how to recognise situations that may require more than one type of blocking how to design and analyse a Latin square design Note: topics marked are optional
39 Completely randomized design design used for all examples so far no formal structure is imposed on the units assumes units effectively identical e.g. in a field experiment, no systematic differences in underlying fertility, drainage etc of the plots in a glasshouse, assumes that light and temperature are the same for each row of pots in a factory, that workforce behaves in essentially the same way at different times of day, days of the week etc in educational studies, that children in different schools are approximately the same, or students studying different subjects at Universities, or in different year groups etc treatments allocated to units at random
40 Non uniform units for example field experiment on a slope best plots may be at top of slope random allocation of treatments to plots may not seem "fair" e.g. replicates of treatment A mainly on "good" plots & replicates of treatment B mainly on "bad" plots if no actual difference between A & B, could lead to A appearing to be much better than B systematic differences between plots increase the residual sum of squares, & hence the estimate of random variability treatment differences must be larger to give a significant F-test standard errors of differences between treatments will be larger i.e. experiment will give less precise results if you know there are differences between units avoid bias & improve precision by grouping (blocking) units into homogenous groups (i.e. groups that are effectively identical)
41 Randomized block design single grouping factor usually known as blocks within each block same number of units for each treatment (one per treatment in a randomized-complete-block design) treatments are allocated randomly to the units in analysis block-effects are estimated and removed, leading to more-precise estimates e.g.
42 One way anova with blocks another experiment to study effect of dietary supplements on gain in weight of rats 8 litters of 5 rats assume rats from same litter more similar than those from different litters 5 Diet treatments (A-E), allocated at random to rats within each litter
43 No blocking residual m.s variance ratio 0.42 s.e.d. 7.19
44 With litters as blocks Differences between litters residual m.s (c.f ) variance ratio 2.13 (c.f. 0.42) s.e.d (c.f. 7.19)
45 Practical 2.3 spreadsheet Wheatstrains.gsh contains the results from a randomized block design to assess 4 strains of wheat analyse the experiment give your assessment of whether the blocking was worthwhile
46 Blocking in 2 directions e.g. experiment on pot plants in a glasshouse door in east wall which may cause temperature differences sunlight mainly from the south other e.g. weekday time-of-day school year-group factory weekday time location
47 Latin square design a design for t treatments arranged in t rows and t columns (i.e. t 2 units) each treatment occurs exactly once in each row and once in each column randomized by randomly permuting rows & columns e.g.
48 Latin square example experiment to assess the (in?)consistency of 6 samplers in assessing the heights of wheat plants 6 areas of wheat to assess may also be ordering effects (accuracy of samplers may vary during experiment) so 6 6 Latin square used with blocking factors Areas and Orders
49 Analysis of Variance menu select Design to be Latin Square
50 Output between Areas between Orders Samplers more precisely estimated (residual m.s c.f )
51 Practical 2.5 spreadsheet Fabric.gsh contains the results from a Latin square design to assess wear resistance of rubber-covered fabrics column factor is 4 different runs row factor is four positions on testing machine used to generate wear under simulated natural conditions analyse the results
52 3 Treatment structure In this session you will learn how to recognise the need for more than one treatment factor analyse designs with two treatment factors using the Oneand two-way ANOVA menu define and interpret interactions between factors analyse designs with two treatment factors using the general Analysis of Variance menu use the Anova Contrasts menu estimate comparisons between levels of treatments interpret interactions between treatment contrasts use model formulae to define the treatment terms to be fitted include control treatments in a factorial experiment use covariates to improve precision by using additional background information about the experimental units (not used for blocking Note: topics marked are optional
53 Types of treatment experiments may study different types of treatment e.g. several different drugs at a range of different doses several different types of fertiliser varieties of wheat and types of fungicide represent each type of treatment by a different treatment factor, with levels to represent the various possibilities e.g. Drug levels Morphine, Amidone, Phenadoxone, Pethidine; Dose levels 2.5, 5, 10, 15; Nitrogen levels 0, 50, 100, 150; Phosphate levels 50, 100; Fungicide levels Carbendazim, Prochloraz; Amount levels 2, 3, 4.
54 Two treatment factors experiment on canola (oil-seed rape) 2 treatment factors N (nitrogen) 0, 180, 230 S (sulphur) 0, 10, 20, 40 randomized-block design with 3 blocks (factor block) and 12 plots per block
55 One and two way ANOVA menu Two-way analysis (Treatment factors N & S) with Blocks (factor block)
56 Output line for each term: N & S main effects, and N.S interaction table of means for each treatment term s.e.d. for each table of means
57 Linear model y ijk = μ + β i + n j + s k + ns jk + ε ijk β i represent the block effects (block stratum in the aov) ε ijk are the residuals n j represent the main effect of nitrogen (N) s k represent the main effect of sulphur (S) ns jk represent the interaction between nitrogen & sulphur (N.S) analysis fits each term in turn, so you can decide how complicated a model is required analysis-of-variance table has a line for each term, so you can assess whether its parameters are needed in the model conclusions will be much clearer if there is no interaction
58 With interaction
59 Without interaction lines are parallel can decide on best level of S without considering N or best level of N without considering S need present only one-way tables of means
60 General Analysis of Variance menu Design: Two-way ANOVA (in Randomized Blocks) click on Contrasts button to fit comparisons (or other contrasts)
61 Comparison contrasts 1 comparison between levels of N clicking OK opens matrix spreadsheet Cont type information into Cont to define comparison
62 General Analysis of Variance menu notice function Comp in Treatment 1 (1 comparison of N defined by Cont)
63 Output extra line for N assesses the comparison also extra line for N.S to assess interaction of comparison with S
64 Practical 3.3 spreadsheet Ratfactorial.gsh contains the results from an experiment to study the effect of 6 different diets on the gain in weight of rats treatment factors concern the protein in the diet Amount (High or Low) Source (Beef, Cereal or Pork) analyse the data as a twoway factorial fit 2 comparison contrasts between levels of Source Animal vs Vegetable Beef vs Pork
65 Model formula define a model to be fitted in an analysis formed automatically by the menus or can define your own list of model terms, linked by operator "+ e.g. A + B 2 terms representing main effects of factors A & B Higher-order terms specified as series of factors separated by dots (e.g. interactions): meaning depends on contents of formula e.g. N + S + N.S N.S is an interaction e.g. Block + Block.Plot Block.Plot represents plotwithin-block effects: differences between individual plots after removing the overall similarity between plots in same block
66 Operators for formulae crossing operator * specifies factorial structures e.g. N * S is expanded automatically to become N + S + N.S nesting operator / occurs most often in block formulae e.g Block / Plot is expanded to become Block + Block.Plot
67 Several operators 3-factor factorial model A * B * C becomes A + B + C + A.B + A.C + B.C + A.B.C 3 nested factors (e.g. block model of split-plot) block / wplot / subplot becomes block + block.wplot + block.wplot.subplot factorial-plus-added-control treatment structure Control / (Drug * Dose) expands to Control + Control.Drug + Control.Dose + Control.Drug.Dose NB: many commands and menus have a FACTORIAL option to control the number of factors/variates in the terms to fit
68 Factorial plus added control 4 different fumigants to control nematodes CN, CS, CM and CK 2 levels of dose single and double also include a control treatment none (no fumigant at any dose) randomized-block design 4 blocks 12 plots per block (4 replicates of control treatment in each block) effects proportional analyse log counts
69 Analysis of Variance menu select Design to be General Treatment Structure (in Randomized Blocks)
70 Factorial plus added control treatment structure Fumigant / ( Level * Type ) Fumigant represents the overall effect of any fumigant at any (non-zero) dose Fumigant.Level represents comparison between single and double doses (averaged over different types) Fumigant.Type represents overall differences between types (averaged over single and double doses) Fumigant.Level.Type represents the interaction between Level and Type (given that some sort of fumigant has been applied)
71 Output
72 Output notice different sed's according to the replication of the means
73 Covariates provide additional background information often measurements made before expt (not used for blocking) e.g. (log) prior nematode counts incorporated in model as linear (regression) terms y ijkl = μ + β i + f j + ft jk + fl jl + ftl jkl + b (x ijkl x mean ) + ε ijkl improve precision remove potential biases caused by non-uniformity of units in aov table extra line(s) to assess effect of covariate(s) on y-variate, after removing effects of treatments treatment s.s. (and effects) adjusted to take account of the fact that the plots with the various treatments have different covariate values cov.ef. for treatment is efficiency remaining after adjustment cov.ef. for residual is amount by which its m.s. has decreased
74 Output regression coefficient for adjustment in Blocks stratum regression coefficient for adjustment within Blocks combined estimate
75 Output
76 Practical 3.7 spreadsheet Ratmuscles.gsh contains data from an experiment to study the effect of electrical stimulation in preventing the wasting away of denervated muscles of rats 3 treatment factors length of each treatment number of treatment periods per day type of current randomized block design with 2 blocks denervated muscles were gastrocnemius muscles on one side of each rat the normal muscle on the other side of each rat was also measured, for use as a covariate in the analysis analyse the experiment
77 4 Checking the assumptions In this session you will learn what assumptions are needed to ensure validity of an aov why the variance must be homogeneous (e.g. variability of residuals should be the same at high as low response values) how to assess whether the variance is homogeneous that residuals should come from identical and independent Normal distributions how to assess the Normality of the residuals why the model must be additive (i.e. differences between treatment effects must remain the same however large or small the underlying size of the response variable) how to identify outliers how transforming the response variate may correct for failures in the assumptions how to print back-transformed tables of means how to do a random permutation test Note: topics marked are optional
78 Homogeneity of variance random variation must be similar over all units beware: it may change with the size of response assess by plotting residuals against fitted values homogeneous increasing with response
79 Non homogeneity of variance if variation increases with size of response s.e.d.'s between treatment means will be over-estimated for differences between low means under-estimated for differences between larger means this could lead you to the wrong conclusions! if plot of residuals against fitted values indicates non-homogeneity of variances consider transforming the response variate (or using a generalized linear model; see Guide to Linear, Nonlinear and Generalized Linear Models in GenStat)
80 Normality of residuals histogram should be "bell-shaped" Normal plot residuals in ascending order plotted against Normal quantiles should give an approximately straight line half-normal plot similar to Normal plot but plots absolute residual values
81 Additivity differences between treatment effects remain the same however large or small the underlying size of the response e.g. in randomized-block design, assume that theoretical value of difference between two treatments remains the same within a block where responses are low, as in one where they are high fitting an additive model when non-additivity is present often leads to detection of (spurious) interactions analysis will be harder to interpret predictions will be unreliable but take care genuine interactions may also occur e.g. if one treatment modifies the mode of action of another data that shows signs of non-additivity often also violates other assumptions use background knowledge of the process if a multiplicative model appropriate take a log transformation for percentage data, consider a logit transformation
82 Outliers are extreme observation, leading to very large residuals look for warnings in ANOVA Information Summary or for extreme points in histogram of residuals or high or low points in plot of residuals against fitted values or points away from line at end of Normal or half-normal plot outliers may arise from errors in recording or punching data if the wrong treatment has been applied to a unit where there is a problem in the experimental procedure outliers distort treatment means inflate the error variance, decreasing the precision of estimates if you have outliers investigate to see if errors have occurred if you find an error try to recover the correct data value if you cannot find the correct data value, insert a missing value if you cannot find any possible source of error, perhaps the outlier might be a true data value is your model wrong?
83 Transformations can correct failures of assumptions e.g. to stabilize variance counts binomial percentages s.e. proportional to mean e.g. non-additivity multiplicative effects percentages square root angular i.e. arcsine(sqrt(p/100)) log log e.g. log10(n+1) for counts logit = log(p/(100-p)) p=100 (r+½)/(n+1) for binomial note: must make inferences on transformed scale but can present back-transformed means using Save and Calculate menus
84 Log transformed data study of plankton numbers 4 types of plankton (treatments) sampled in 12 hauls (blocks) compare analyses for untransformed and log10 transformed numbers
85 Save the means
86 Backtransform and print
87 Practical 4.6 spreadsheet Wine.gsh contains results from an experiment to assess the % alcohol of wine 5 types of wine A-E 3 bottles of each type were tested in a random order analyse the percentages & plot residuals against fitted values transform the percentages using a logit transformation, re-analyse the data & replot residuals against fitted values
88 Permutation tests if the distributional assumptions are not satisfied, you might use a random permutation test as an alternative way to assess the significance of the terms in the analysis model must still be additive for results to be meaningful but residuals need no longer follow Normal distributions with equal variances click on Permutation Test in ANOVA Further Output menu to open ANOVA Permutation Test menu specify Number of permutations select Seed (0 automatic) click on Run probability for each treatment term is now determined from its distribution over the randomly permuted data sets
89 Practical 4.8 spreadsheet Wine.gsh contains results from an experiment to assess the % alcohol of wine used in Practical types of wine A-E 3 bottles of each type were tested in a random order analyse the percentages & plot residuals against fitted values assess the differences between the types using a permutation test
Analysis of Variance. MINITAB User s Guide 2 3-1
3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced
5 Analysis of Variance models, complex linear models and Random effects models
5 Analysis of Variance models, complex linear models and Random effects models In this chapter we will show any of the theoretical background of the analysis. The focus is to train the set up of ANOVA
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
Minitab Tutorials for Design and Analysis of Experiments. Table of Contents
Table of Contents Introduction to Minitab...2 Example 1 One-Way ANOVA...3 Determining Sample Size in One-way ANOVA...8 Example 2 Two-factor Factorial Design...9 Example 3: Randomized Complete Block Design...14
Using Excel for Statistics Tips and Warnings
Using Excel for Statistics Tips and Warnings November 2000 University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID Contents 1. Introduction 3 1.1 Data Entry and
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application
Figure 1. An embedded chart on a worksheet.
8. Excel Charts and Analysis ToolPak Charts, also known as graphs, have been an integral part of spreadsheets since the early days of Lotus 1-2-3. Charting features have improved significantly over the
Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data
Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable
Using Excel in Research. Hui Bian Office for Faculty Excellence
Using Excel in Research Hui Bian Office for Faculty Excellence Data entry in Excel Directly type information into the cells Enter data using Form Command: File > Options 2 Data entry in Excel Tool bar:
Randomized Block Analysis of Variance
Chapter 565 Randomized Block Analysis of Variance Introduction This module analyzes a randomized block analysis of variance with up to two treatment factors and their interaction. It provides tables of
APPLYING BENFORD'S LAW This PDF contains step-by-step instructions on how to apply Benford's law using Microsoft Excel, which is commonly used by
APPLYING BENFORD'S LAW This PDF contains step-by-step instructions on how to apply Benford's law using Microsoft Excel, which is commonly used by internal auditors around the world in their day-to-day
business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar
business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel
KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management
KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To
Getting Started with Minitab 17
2014 by Minitab Inc. All rights reserved. Minitab, Quality. Analysis. Results. and the Minitab logo are registered trademarks of Minitab, Inc., in the United States and other countries. Additional trademarks
Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0.
Statistical analysis using Microsoft Excel Microsoft Excel spreadsheets have become somewhat of a standard for data storage, at least for smaller data sets. This, along with the program often being packaged
Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.
Excel Tutorial Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Working with Data Entering and Formatting Data Before entering data
Business Objects Version 5 : Introduction
Business Objects Version 5 : Introduction Page 1 TABLE OF CONTENTS Introduction About Business Objects Changing Your Password Retrieving Pre-Defined Reports Formatting Your Report Using the Slice and Dice
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
INTRODUCTION TO EXCEL
INTRODUCTION TO EXCEL 1 INTRODUCTION Anyone who has used a computer for more than just playing games will be aware of spreadsheets A spreadsheet is a versatile computer program (package) that enables you
Using Excel s Analysis ToolPak Add-In
Using Excel s Analysis ToolPak Add-In S. Christian Albright, September 2013 Introduction This document illustrates the use of Excel s Analysis ToolPak add-in for data analysis. The document is aimed at
Multivariate Analysis of Variance (MANOVA)
Chapter 415 Multivariate Analysis of Variance (MANOVA) Introduction Multivariate analysis of variance (MANOVA) is an extension of common analysis of variance (ANOVA). In ANOVA, differences among various
SPSS (Statistical Package for the Social Sciences)
SPSS (Statistical Package for the Social Sciences) What is SPSS? SPSS stands for Statistical Package for the Social Sciences The SPSS home-page is: www.spss.com 2 What can you do with SPSS? Run Frequencies
MS-EXCEL: STATISTICAL PROCEDURES
MS-EXCEL: STATISTICAL PROCEDURES Cini Varghese I.A.S.R.I., Library Avenue, New Delhi 110 012 [email protected] Microsoft (MS) Excel ( ) is a powerful spreadsheet that is easy to use and allows you to
GeoGebra Statistics and Probability
GeoGebra Statistics and Probability Project Maths Development Team 2013 www.projectmaths.ie Page 1 of 24 Index Activity Topic Page 1 Introduction GeoGebra Statistics 3 2 To calculate the Sum, Mean, Count,
Excel Guide for Finite Mathematics and Applied Calculus
Excel Guide for Finite Mathematics and Applied Calculus Revathi Narasimhan Kean University A technology guide to accompany Mathematical Applications, 6 th Edition Applied Calculus, 2 nd Edition Calculus:
Tutorial 2: Using Excel in Data Analysis
Tutorial 2: Using Excel in Data Analysis This tutorial guide addresses several issues particularly relevant in the context of the level 1 Physics lab sessions at Durham: organising your work sheet neatly,
How To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
Appendix 2.1 Tabular and Graphical Methods Using Excel
Appendix 2.1 Tabular and Graphical Methods Using Excel 1 Appendix 2.1 Tabular and Graphical Methods Using Excel The instructions in this section begin by describing the entry of data into an Excel spreadsheet.
Recall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
Spreadsheets and Laboratory Data Analysis: Excel 2003 Version (Excel 2007 is only slightly different)
Spreadsheets and Laboratory Data Analysis: Excel 2003 Version (Excel 2007 is only slightly different) Spreadsheets are computer programs that allow the user to enter and manipulate numbers. They are capable
Assignment objectives:
Assignment objectives: Regression Pivot table Exercise #1- Simple Linear Regression Often the relationship between two variables, Y and X, can be adequately represented by a simple linear equation of the
Monthly Payroll to Finance Reconciliation Report: Access and Instructions
Monthly Payroll to Finance Reconciliation Report: Access and Instructions VCU Reporting Center... 2 Log in... 2 Open Folder... 3 Other Useful Information: Copying Sheets... 5 Creating Subtotals... 5 Outlining
Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.
1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis
Basic Pivot Tables. To begin your pivot table, choose Data, Pivot Table and Pivot Chart Report. 1 of 18
Basic Pivot Tables Pivot tables summarize data in a quick and easy way. In your job, you could use pivot tables to summarize actual expenses by fund type by object or total amounts. Make sure you do not
Using Microsoft Excel to Plot and Analyze Kinetic Data
Entering and Formatting Data Using Microsoft Excel to Plot and Analyze Kinetic Data Open Excel. Set up the spreadsheet page (Sheet 1) so that anyone who reads it will understand the page (Figure 1). Type
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
TIPS FOR DOING STATISTICS IN EXCEL
TIPS FOR DOING STATISTICS IN EXCEL Before you begin, make sure that you have the DATA ANALYSIS pack running on your machine. It comes with Excel. Here s how to check if you have it, and what to do if you
Microsoft Excel Tutorial
Microsoft Excel Tutorial by Dr. James E. Parks Department of Physics and Astronomy 401 Nielsen Physics Building The University of Tennessee Knoxville, Tennessee 37996-1200 Copyright August, 2000 by James
SECTION 2-1: OVERVIEW SECTION 2-2: FREQUENCY DISTRIBUTIONS
SECTION 2-1: OVERVIEW Chapter 2 Describing, Exploring and Comparing Data 19 In this chapter, we will use the capabilities of Excel to help us look more carefully at sets of data. We can do this by re-organizing
Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs
Using Excel Jeffrey L. Rummel Emory University Goizueta Business School BBA Seminar Jeffrey L. Rummel BBA Seminar 1 / 54 Excel Calculations of Descriptive Statistics Single Variable Graphs Relationships
Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
SPSS Manual for Introductory Applied Statistics: A Variable Approach
SPSS Manual for Introductory Applied Statistics: A Variable Approach John Gabrosek Department of Statistics Grand Valley State University Allendale, MI USA August 2013 2 Copyright 2013 John Gabrosek. All
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Spreadsheet software for linear regression analysis
Spreadsheet software for linear regression analysis Robert Nau Fuqua School of Business, Duke University Copies of these slides together with individual Excel files that demonstrate each program are available
Advanced Excel for Institutional Researchers
Advanced Excel for Institutional Researchers Presented by: Sandra Archer Helen Fu University Analysis and Planning Support University of Central Florida September 22-25, 2012 Agenda Sunday, September 23,
EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:
EXCEL Analysis TookPak [Statistical Analysis] 1 First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it: a. From the Tools menu, choose Add-Ins b. Make sure Analysis
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
One-Way Analysis of Variance (ANOVA) Example Problem
One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means
ADD-INS: ENHANCING EXCEL
CHAPTER 9 ADD-INS: ENHANCING EXCEL This chapter discusses the following topics: WHAT CAN AN ADD-IN DO? WHY USE AN ADD-IN (AND NOT JUST EXCEL MACROS/PROGRAMS)? ADD INS INSTALLED WITH EXCEL OTHER ADD-INS
There are six different windows that can be opened when using SPSS. The following will give a description of each of them.
SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet
Assessing Measurement System Variation
Assessing Measurement System Variation Example 1: Fuel Injector Nozzle Diameters Problem A manufacturer of fuel injector nozzles installs a new digital measuring system. Investigators want to determine
Summary of important mathematical operations and formulas (from first tutorial):
EXCEL Intermediate Tutorial Summary of important mathematical operations and formulas (from first tutorial): Operation Key Addition + Subtraction - Multiplication * Division / Exponential ^ To enter a
Chapter 7. One-way ANOVA
Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks
January 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
AP Physics 1 and 2 Lab Investigations
AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks
Microsoft Excel. Qi Wei
Microsoft Excel Qi Wei Excel (Microsoft Office Excel) is a spreadsheet application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot
Getting Started with Excel 2008. Table of Contents
Table of Contents Elements of An Excel Document... 2 Resizing and Hiding Columns and Rows... 3 Using Panes to Create Spreadsheet Headers... 3 Using the AutoFill Command... 4 Using AutoFill for Sequences...
Elementary Statistics Sample Exam #3
Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to
General instructions for the content of all StatTools assignments and the use of StatTools:
General instructions for the content of all StatTools assignments and the use of StatTools: An important part of Business Management 330 is learning how to conduct statistical analyses and to write text
Statistics Review PSY379
Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses
Algebra 1 Course Information
Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce
DataPA OpenAnalytics End User Training
DataPA OpenAnalytics End User Training DataPA End User Training Lesson 1 Course Overview DataPA Chapter 1 Course Overview Introduction This course covers the skills required to use DataPA OpenAnalytics
Directions for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
HOW TO USE MINITAB: DESIGN OF EXPERIMENTS. Noelle M. Richard 08/27/14
HOW TO USE MINITAB: DESIGN OF EXPERIMENTS 1 Noelle M. Richard 08/27/14 CONTENTS 1. Terminology 2. Factorial Designs When to Use? (preliminary experiments) Full Factorial Design General Full Factorial Design
Chapter 4 and 5 solutions
Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,
Microsoft Access 2010 Overview of Basics
Opening Screen Access 2010 launches with a window allowing you to: create a new database from a template; create a new template from scratch; or open an existing database. Open existing Templates Create
TI-Inspire manual 1. Instructions. Ti-Inspire for statistics. General Introduction
TI-Inspire manual 1 General Introduction Instructions Ti-Inspire for statistics TI-Inspire manual 2 TI-Inspire manual 3 Press the On, Off button to go to Home page TI-Inspire manual 4 Use the to navigate
Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics
Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This
Basic Excel Handbook
2 5 2 7 1 1 0 4 3 9 8 1 Basic Excel Handbook Version 3.6 May 6, 2008 Contents Contents... 1 Part I: Background Information...3 About This Handbook... 4 Excel Terminology... 5 Excel Terminology (cont.)...
Point Biserial Correlation Tests
Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable
Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering
Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques
Using Microsoft Excel for Probability and Statistics
Introduction Using Microsoft Excel for Probability and Despite having been set up with the business user in mind, Microsoft Excel is rather poor at handling precisely those aspects of statistics which
1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
Preface of Excel Guide
Preface of Excel Guide The use of spreadsheets in a course designed primarily for business and social science majors can enhance the understanding of the underlying mathematical concepts. In addition,
ECDL. European Computer Driving Licence. Spreadsheet Software BCS ITQ Level 2. Syllabus Version 5.0
European Computer Driving Licence Spreadsheet Software BCS ITQ Level 2 Using Microsoft Excel 2010 Syllabus Version 5.0 This training, which has been approved by BCS, The Chartered Institute for IT, includes
An introduction to using Microsoft Excel for quantitative data analysis
Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing
Information Server Documentation SIMATIC. Information Server V8.0 Update 1 Information Server Documentation. Introduction 1. Web application basics 2
Introduction 1 Web application basics 2 SIMATIC Information Server V8.0 Update 1 System Manual Office add-ins basics 3 Time specifications 4 Report templates 5 Working with the Web application 6 Working
How To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
An analysis method for a quantitative outcome and two categorical explanatory variables.
Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that
Creating an Access Database. To start an Access Database, you should first go into Access and then select file, new.
To start an Access Database, you should first go into Access and then select file, new. Then on the right side of the screen, select Blank database. Give your database a name where it says db1 and save
An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression
Chapter 9 Simple Linear Regression An analysis appropriate for a quantitative outcome and a single quantitative explanatory variable. 9.1 The model behind linear regression When we are examining the relationship
Tutorial for proteome data analysis using the Perseus software platform
Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information
seven Statistical Analysis with Excel chapter OVERVIEW CHAPTER
seven Statistical Analysis with Excel CHAPTER chapter OVERVIEW 7.1 Introduction 7.2 Understanding Data 7.3 Relationships in Data 7.4 Distributions 7.5 Summary 7.6 Exercises 147 148 CHAPTER 7 Statistical
Disciplined Use of Spreadsheet Packages for Data Entry
Disciplined Use of Spreadsheet Packages for Data Entry January 2001 The University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID Contents 1. Introduction 3 2. An
Gamma Distribution Fitting
Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics
Introduction to Microsoft Access 2003
Introduction to Microsoft Access 2003 Zhi Liu School of Information Fall/2006 Introduction and Objectives Microsoft Access 2003 is a powerful, yet easy to learn, relational database application for Microsoft
To change title of module, click on settings
HTML Module: The most widely used module on the websites. This module is very flexible and is used for inserting text, images, tables, hyperlinks, document downloads, and HTML code. Hover the cursor over
Statistical Models in R
Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Linear Models in R Regression Regression analysis is the appropriate
Drawing a histogram using Excel
Drawing a histogram using Excel STEP 1: Examine the data to decide how many class intervals you need and what the class boundaries should be. (In an assignment you may be told what class boundaries to
Stat 5303 (Oehlert): Tukey One Degree of Freedom 1
Stat 5303 (Oehlert): Tukey One Degree of Freedom 1 > catch
A Guide to Survey Analysis in GenStat. by Steve Langton. Defra Environmental Observatory, 1-2 Peasholme Green, York YO1 7PX, UK.
Survey Analysis A Guide to Survey Analysis in GenStat by Steve Langton Defra Environmental Observatory, 1-2 Peasholme Green, York YO1 7PX, UK. GenStat is developed by VSN International Ltd, in collaboration
Excel Companion. (Profit Embedded PHD) User's Guide
Excel Companion (Profit Embedded PHD) User's Guide Excel Companion (Profit Embedded PHD) User's Guide Copyright, Notices, and Trademarks Copyright, Notices, and Trademarks Honeywell Inc. 1998 2001. All
Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; [email protected], and Dr. J.A. Dobelman
Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; [email protected], and Dr. J.A. Dobelman Statistics lab will be mainly focused on applying what you have learned in class with
TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics
UNIVERSITY OF DUBLIN TRINITY COLLEGE Faculty of Engineering, Mathematics and Science School of Computer Science & Statistics BA (Mod) Enter Course Title Trinity Term 2013 Junior/Senior Sophister ST7002
Tutorial: Get Running with Amos Graphics
Tutorial: Get Running with Amos Graphics Purpose Remember your first statistics class when you sweated through memorizing formulas and laboriously calculating answers with pencil and paper? The professor
Intro to Excel spreadsheets
Intro to Excel spreadsheets What are the objectives of this document? The objectives of document are: 1. Familiarize you with what a spreadsheet is, how it works, and what its capabilities are; 2. Using
