Software for Intro Stats: Is Excel an Option?



Similar documents
STATISTICAL ANALYSIS WITH EXCEL COURSE OUTLINE

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Predictor Coef StDev T P Constant X S = R-Sq = 0.0% R-Sq(adj) = 0.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Using Excel for Statistical Analysis

An introduction to using Microsoft Excel for quantitative data analysis

3 The spreadsheet execution model and its consequences

Using Excel for Data Manipulation and Statistical Analysis: How-to s and Cautions

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

USING EXCEL IN AN INTRODUCTORY STATISTICS COURSE: A COMPARISON OF INSTRUCTOR AND STUDENT PERSPECTIVES

Problems With Using Microsoft Excel for Statistics

Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; and Dr. J.A. Dobelman

Description. Textbook. Grading. Objective

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

E x c e l : Data Analysis Tools Student Manual

1.1. Simple Regression in Excel (Excel 2010).

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Executive Master of Public Administration. QUANTITATIVE TECHNIQUES I For Policy Making and Administration U6311, Sec. 003

Regression step-by-step using Microsoft Excel

Using Excel s Analysis ToolPak Add-In

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.

I ~ 14J... <r ku...6l &J&!J--=-O--

Applied Statistics and Data Analysis

MATH 205 STATISTICAL METHODS

12: Analysis of Variance. Introduction

RUSRR048 COURSE CATALOG DETAIL REPORT Page 1 of 6 11/11/ :33:48. QMS 102 Course ID

AP Statistics: Syllabus 1

Chapter 7 Section 1 Homework Set A

EDMS 769L: Statistical Analysis of Longitudinal Data 1809 PAC, Th 4:15-7:00pm 2009 Spring Semester

Directions for using SPSS

Street Address: 1111 Franklin Street Oakland, CA Mailing Address: 1111 Franklin Street Oakland, CA 94607

UNIVERSITY of MASSACHUSETTS DARTMOUTH Charlton College of Business Decision and Information Sciences Fall 2010

ECO 250 Economics and Business Statistics I Fall 2009 Online Course

Graduate School of Education Program: Doctoral Studies in Education Fall Semester, 2014 SYLLABUS

Minitab Tutorials for Design and Analysis of Experiments. Table of Contents

Notes on computer programmes for statistical analysis

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)

Statistics with Ms Excel

NAVAL POSTGRADUATE SCHOOL

1. Go to your programs menu and click on Microsoft Excel.

Better decision making under uncertain conditions using Monte Carlo Simulation

Content and Method in the Teaching of Marketing Research Revisited

COURSE PLAN BDA: Biomedical Data Analysis Master in Bioinformatics for Health Sciences Academic Year Qualification.

Chapter 13 Introduction to Linear Regression and Correlation Analysis

MATH 205 STATISTICAL METHODS

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone:

Simple Predictive Analytics Curtis Seare

Master of Science in Healthcare Informatics and Analytics Program Overview

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.

2015 TUHH Online Summer School: Overview of Statistical and Path Modeling Analyses

Teaching Biostatistics to Postgraduate Students in Public Health

Analysis of categorical data: Course quiz instructions for SPSS

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Executive Master of Public Administration. QUANTITATIVE TECHNIQUES I For Policy Making and Administration U6310, Sec. 03

Quantitative Analysis for Business BSNS102. Course Outline Semester One 2014

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Education & Training Plan Accounting Math Professional Certificate Program with Externship

QMB 3302 Business Analytics CRN Spring 2015 T R -- 11:00am - 12:15pm -- Lutgert Hall 2209

Data Analysis Tools. Tools for Summarizing Data

Simulating Chi-Square Test Using Excel

Learning Objectives for Selected Programs Offering Degrees at Two Academic Levels

STAT 2080/MATH 2080/ECON 2280 Statistical Methods for Data Analysis and Inference Fall 2015

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Final Exam Practice Problem Answers

one Introduction chapter OVERVIEW CHAPTER

The University of Texas at Austin School of Social Work SOCIAL WORK STATISTICS

Advanced Excel for Institutional Researchers

Additional sources Compilation of sources:

ENGG1811 Computing for Engineers. Data Analysis using Excel (weeks 2 and 3)

Economic Statistics (ECON2006), Statistics and Research Design in Psychology (PSYC2010), Survey Design and Analysis (SOCI2007)

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

MATH 140 HYBRID INTRODUCTORY STATISTICS COURSE SYLLABUS

ANALYTICS CENTER LEARNING PROGRAM

Industrial and Systems Engineering Master of Science Program Data Analytics and Optimization

A spreadsheet Approach to Business Quantitative Methods

SPSS Guide: Regression Analysis

Ad Hoc and Statistical Model Visualization Using JMP, SAS and Microsoft Excel

POL 204b: Research and Methodology

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

Regression and Multivariate Data Analysis

SPSS Manual for Introductory Applied Statistics: A Variable Approach

A DATA ANALYSIS TOOL THAT ORGANIZES ANALYSIS BY VARIABLE TYPES. Rodney Carr Deakin University Australia

Forecasting Tourism Demand: Methods and Strategies. By D. C. Frechtling Oxford, UK: Butterworth Heinemann 2001

seven Statistical Analysis with Excel chapter OVERVIEW CHAPTER

Easily Identify Your Best Customers

STAT 360 Probability and Statistics. Fall 2012

ANALYSING LIKERT SCALE/TYPE DATA, ORDINAL LOGISTIC REGRESSION EXAMPLE IN R.

BIOM611 Biological Data Analysis

How To Check For Differences In The One Way Anova

Using Analysis Tools: Goal Seek, Solver, and Data Tables

SAS Certificate Applied Statistics and SAS Programming

MTH 140 Statistics Videos

QUAN 102 STATISTICS FOR BUSINESS

Elements of statistics (MATH0487-1)

Using Excel in Research. Hui Bian Office for Faculty Excellence

Online Course Syllabus Page 1

Transcription:

Software for Intro Stats: Is Excel an Option? Roger L. Berger Arizona State University August 2006 New Researchers Conference Seattle, WA 1

OUTLINE 1. Course description 2. Survey 3. Why use Excel? 4. Why not use Excel? 5. Conclusions 6. Bonus survey 7. Other topics 2

Course Description Undergraduate, The Basic Practice of Statistics, David Moore Students from many different disciplines Maybe fulfills a general studies requirement Teaches methods descriptive statistics populations & samples t tests & confidence intervals oneway ANOVA simple linear regression χ 2 tests for contingency tables Not a concepts course 3

Survey Choose one answer. What kind of software/technology should be used in this kind of course? A. None, use only pencil & paper B. A hand calculator, nothing more C. Educational software that comes with the text, e.g., CrunchIt D. Microsoft Excel E. Full-featured statistics package, e.g., SAS, SPSS F. Statistics programming language, e.g., R, S-Plus 4

Some Factors to Consider 1. Accessible easily available to students? school? home? during exams? 2. Easy to use 3. Serviceable do what is needed for this course? 4. Affordable 5. Useable after this course? 5

Why Use Excel? 1. Most students have access to Excel on school computers 2. Many students have access to Excel on personal computers 3. No additional expense to students 4. Many students have used Excel to some degree 5. Excel has built-in tools that will carry out most of the analyses that are covered in this course 6. Many resources related to Excel; textbooks, online, etc. 7. Familiarity with Excel is desired by many employers 6

Hot off the press: Nash, J. C. (2006). Spreadsheets in Statistical Practice Another Look, The American Statistician, 60, 287-289. Spreadsheets are ubiquitous. We should work to ensure that they are used correctly for statistical practice. 7

Statistical analysis methods are provided in two forms in Excel: 1. Functions give a single number as output, e.g., =average(a1:a30) =stdev(a1:a30) =percentile(a1:a30,.6) 2. Procedures in the Data Analysis Toolpack add-in two-sample t tests ANOVA oneway and twoway multiple regression 8

Why Not Use Excel? 1. Poor graphics no boxplot or stemplot (many online, e.g., boxplot by Dawson from Moore & McCabe webpage) poor and inaccurate histograms 2. Some missing methods no one sample t tests or confidence intervals no χ 2 tests for contingency tables 3. Poor descriptions of some methods ZTEST function 9

4. Computational inaccuracies use of poor algorithms in earlier versions worst problems corrected in 2003 version accuracy probably adequate for this course 5. Limited capabilities beyond this course. Cannot do serious data analysis. no 3-way ANOVA no logistic regression rigid input formats, e.g., multiple rgression 10

Before the 2003 version, Excel used the hand calculation formula for calculation of the sample variance s 2 = n i=1 x2 i ( n i=1 x i) 2 n 1 n This formula leads to computational inaccuracies if n is large and the x i s cover a wide range. 11

References for Problems with Statistical Computations in Excel McCullough, B. D., and Wilson, Berry (1999). On the accuracy of statistical procedures in Microsoft Excel 97. Computational Statistics and Data Analysis, 31, 27 37.... and later papers by these authors in 2002 and 2005 Website by David A. Heiser www.daheiser.info/excel/frontpage.html Microsoft Excel 2000 and 2003 Faults, Problems, Workarounds and Fixes (Detailed and up-to-date descriptions.) 12

Conclusions Some Positives Reasonable computational tool for an introductory, undergraduate, statistical methods course Accessible to most students, even at home Performs most computations needed for intro course Easy to introduce because many students are familiar with Excel Excel familiarity useful in other contexts 13

Conclusions Some Negatives Not appropriate for advanced data analysis Not appropriate for undergrad statistics majors Not appropriate for graduate students Some accuracy issues 14

Bonus Survey Evaluate the following expressions: 1. 3 3 2. 0 3 3 3. 0 3 2 4. 3 2 15

Bonus Survey Evaluate the following expressions: Answers 1. 3 3 9 2. 0 3 3 9 3. 0 3 2 9 4. 3 2 +9 16

What s going on? In Excel, unary negation, as in 3 2, has a higher order of precedence than exponentiation or any other operation 3 2 is interpreted as ( 3) 2 17

Two high school students, Donald Brandl & Jebina Rajbhandari, discovered this when considering an example from McCullough & Wilson (1999). They were trying to use the Excel Solver Toolbox add-in to fit this nonlinear least squares regression. y = β 1 e β2x + β 3 e (x β 4) 2 /β5 2 + β 6 e (x β 7) 2 /β8 2 + ɛ 18

About Excel: Questions? Comments? Other topics? 19