# Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation

Save this PDF as:

Size: px
Start display at page:

Download "Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation"

## Transcription

1 Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation Leslie Chandrakantha Department of Mathematics & Computer Science John Jay College of Criminal Justice of CUNY USA. Abstract: Computer simulation methods have been used in upper level statistics classes for many years. Lately, many instructors are adopting computer simulation to introduce the concepts in the introductory level. Students in introductory statistics classes struggle to understand the basic concepts. Research has shown that the use of computer simulation methods as an alternative to traditional methods of books and lecture enhance the understanding of the concepts. Computer simulation using spreadsheets such as Excel allows students to experiment with data and to visualize the results. In this paper, we will describe how to use the simulation using Excel Data Tables facility and standard functions to teach confidence intervals and hypothesis testing in introductory statistics classes. We believe, by using this hands on approach, students get a better feel for these abstract concepts. Our preliminary assessment shows that this approach would enhance the student learning of the concepts. 1. Introduction The understanding of the statistical inferences concepts is critical in making accurate conclusions in research findings. Almost all college students will have to do some form of research and summarize their findings. Statistics courses give the necessary skills for these tasks. College students will take at least one statistics course to gain such critical skills. The introductory statistics is the first statistics course they normally take at college level. Fundamental statistical concepts such as sampling distributions, central limit theorem, confidence intervals, hypothesis testing, and p-values are vital in an introductory statistics course. Many students struggle to understand these concepts. The use of computer simulation methods to mimic the real life sampling or repeated sampling from a population helps to understand these concepts. Cobb [3] noted that incorporating computer simulation techniques to illustrate key concepts and to allow students to discover important principles themselves will enhance their knowledge. Almost all software packages offer ways to perform the simulation. Mills [7] has given a comprehensive review of literature of computer simulation methods used in all areas of statistics to help students understand difficult concepts. Butler et al. [1] has developed macros that can run on Minitab environment for resampling methods in teaching statistics. Many introductory statistics students do not have the necessary skills to write or implement macros to perform these tasks. Excel provides ways to accomplish the same task without writing macros. Furthermore Excel is preferred due to the availability to students and the ease of presenting the data for the underlying statistical concepts in multiple rows and columns.

2 In this article, we describe how to use the Excel standard functions and the Data Table facility to generate different random samples from a population, compute confidence intervals, and perform hypothesis tests using repeated simulated sampling. Using this approach in the classroom, we are directly incorporating several recommended guidelines in the GAISE report [9] of American Statistical Association. The simulation and resampling have been used in teaching statistics for many years. Hagtvedt et al. [6] have developed simulation tools to compute multiple confidence intervals. Christie [2] has used the Excel Data Tables for estimating the population mean and the correlation. Winston [9] explained how to use Excel Data tables to simulate stock prices in asset allocation models. A valuable introduction to Excel Data Tables is given by Ecklund [4]. In next section, we give an introduction to Excel Data Tables and how to use them in generating different random samples. The next two sections will show how to use these simulation methods to teach the lessons of confidence intervals and hypothesis testing. Next, we illustrate how this approach meets the guidelines of the GAISE report. Finally, we give a preliminary comparison of traditional method of teaching and computer simulation approach, and our concluding remarks. 2. Excel Data Tables Data Tables are part of a group of commands that are called what-if analysis tools in Excel. Data Table function allows a table of what if questions to be posed and answered simply in sensitivity analysis, and is useful in simulation. What-if analysis is the process of changing the values in cells to see how those changes will affect the outcome of formulas on the worksheet. We can use the Data Table function to compute the values of a test statistic for different random samples. The Data Table function can be accessed from menu bar Data > What IF Analysis > Data Tables in Excel 2007 and To generate the values of a statistic for different samples using Data Table, first we calculate the value of the statistic using a random sample. This can be done using Excel random number generating functions for the appropriate population and other standard functions. This statistic value will be our original input value. This value (formula) will be put in the top cell of the right column of the Data Table. We set up our Data Table by selecting two columns and a certain number of rows depending on the number of values of the statistic we need to calculate. Leave the left column blank. The menu bar Data > What IF Analysis > Data Tables gives the Data Table dialog box. In this dialog box, leave Row input cell blank and type an empty cell reference that has no part of this Data Table setup for the Column input cell. Excel generates a new random sample and computes the value of the statistic for each substitution of this empty input cell and fills the table. Copying the formula down in the output cells does not work in this case. If we do this manually, we need to recalculate the statistic by repeated sampling by pressing the F9 key and recording these values in a column. The Data Table function provides a convenient way of generating values of the statistic for different samples.

3 3. Simulation of Confidence Intervals The topic of confidence intervals of parameter estimation is a difficult lesson to teach in introductory statistics classes. Confidence intervals give the most likely range of the unknown population parameter. In this discussion, we only consider the creating and interpreting confidence interval for mean (µ) assuming population standard deviation (σ) is known. Beginning of the class, we introduce the background and define the confidence interval for the mean as X Z *, where Z* n is the value of the standard normal curve with area C between critical points Z* and Z* and n is the sample size. The confidence level C is the probability that the confidence interval actually does contain the population mean μ, assuming the estimation process is repeated a large number of times, Moore [8]. Students have major difficulty in understanding this last statement. Many students misunderstand this statement as that majority of individual values are in this interval. It is important in this lesson that students understand in repeated sampling form a population, C percent of intervals (say 95%) would capture the true unknown mean. In using the traditional way of teaching, we only consider one sample and calculate one interval. This leads them to believe the wrong interpretation of the interval that there is a 95% chance that this interval will have the true mean. A computer simulation method using Excel will allow students to understand the true meaning of the confidence interval. After introducing the basic facts about confidence intervals to the class, we will calculate 95% confidence intervals for the population mean. First, we will show how to generate one random sample from the normal distribution with an assumed mean (say 100) and assumed standard deviation (say 10) of a convenient size (say 30). That is μ = 100, σ = 10, and n = 30. The Excel formula = norminv(rand(), 100, 10) is typed in a cell and copied using the fill handle to cells below in the column to generate a random sample. In the class room, everybody has computers and students follow our instructions and create their own samples. Then we calculate the mean of this sample using the average function in Excel, and use this as the top value to generate another 999 sample means using Excel Data Table. Now we have a table with 1000 sample means. For each sample mean, the confidence interval bounds are calculated using the formula X Z *. The Excel n formulas for these are= E2-1.96*10/sqrt(30) and = E *10/sqrt(30). The cell E2 contains the sample mean of the first cell of the Data Table. Then we copy these formulas to generate the rest of the confidence interval bounds. Finally, the proportion of intervals contain the true mean μ of 100 is calculated using IF and COUNTIF functions. If the interval contains the true mean, we assign one and otherwise zero using = IF(AND(100>=G2,100<=H2),1,0) formula and then compute the proportion of ones using = COUNTIF(J2:J1001,1)/1000 formula. This proportion should be closer to 95% which is the assumed confidence level C. Since students are doing these steps themselves, they visualize each step and get a clear understanding of the meaning of confidence intervals. Figure 3.1 shows a portion of the spreadsheet implementation of this simulation.

4 Figure 3.1 Confidence Intervals Calculation Spreadsheet Notice that 95% of the confidence intervals do contain the true mean. If we generate another set of samples and compute the confidence intervals by pressing the F9 key, we will find the new proportion which should be 95% or closer to it. Changing the confidence levels (to 90% or 99%) by just changing the confidence coefficient Z* to appropriate values in the formulas will help students to understand the meaning of confidence intervals. This lesson can be easily done in one class period of 75 minutes. 4. Simulation of Hypothesis Testing Hypothesis testing is another key topic of inferences in statistics classes and yet it is one of the abstract concepts to understand. A sound knowledge about terms such as null and alternative hypotheses, significance level, and p-value is essential in performing a hypothesis test and making the correct decision. Since all statistical software calculate p-values, more and more instructors are using the p-value approach to make decisions. Many students do not have a good understanding about the p- value and they blindly use the rejection criterion that if the p-value is less than the significance level, rejects the null hypothesis. The definition of the p-value is the probability, assuming the null hypothesis is true, that the test statistics would take a value as extreme or more extreme than that actually observed, Moore [8]. Computer simulation methods can give a better understanding of the p-value because it generates many samples assuming null hypothesis and then one can find the proportion of the test statistic values that are extreme or more extreme than the actual value observed from sample data.

5 Erickson [5] has used Fathom Dynamic Data Software to simulate flipping coins in teaching hypothesis testing concepts and found out that it makes difficult concepts more visible and understandable. Now we describe how to use Excel Data Tables to simulate hypothesis testing and compute the corresponding p-value. At the beginning of the class, we introduce the key components of hypothesis tests for mean µ assuming population standard deviation σ is known. Then we use an example to set up the null and alternative hypothesis, find the value of the statistic based on the actual sample, generate many random samples assuming null hypothesis, compute the test statistic values, and compute the p-value by finding the proportion of values exceeding the actual value. If the p-value is too small (less than the significance level, normally 0.05), we have evidence against the null hypothesis. The following example is discussed in the class. EXAMPLE: One sample z test. Data for this example are the reading times, Moore [8]. Does the use of fancy type fonts slow down the reading of text on a computer screen? Adults can read four paragraphs of text in an average time of 22 seconds in the common Times New Roman font. 25 adults were asked to read this text in the ornate font named Gigi. Here are their times: 23.2, 21.2, 28.9, 27.7, 23.4, 27.3, 16.1, 22.6, 25.6, 32.6, 23.9, 26.8, 18.9, 27.8, 21.4, 30.7, 21.5, 30.6, 31.5, 24.6, 23.0, 28.6, 24.4, 28.1, Suppose that reading times are normally distributed with σ = 6 seconds. Is there good evidence that the mean reading time for Gigi fonts is greater than 22 seconds? In other words, is μ greater than 22 seconds for Gigi fonts? The null and alternative hypotheses are: H 0 : μ = 22 seconds and H 1 : μ > 22 seconds. The test statistics used for this test is ( x μ )/ σ and that follows the standard normal distribution. The n value of the test statistics based on the above sample assuming null hypothesis is To find the p- value using simulation, we show how to generate one sample of 25 values from normal distribution with a mean of 22 and a standard deviation of 6 and calculate the value of the test statistic. Then use this value as the top value of the Data Table to generate 999 more random samples from the same distribution and calculate value of the test statistic. Now the Data Table has 1000 values of the test statistic. Then we use the Excel formula = COUNTIF(G2:G1001, ">= 2.627")/1000 to find the proportion of those values which exceed the actual value observed (2.627). This will allow students to understand that if the null hypothesis is true, and if we repeat the process large number of times, this is the chance that the value of the statistic is extreme or more extreme. Students will realize that if this chance is too small, the null hypothesis is unlikely. Since students are also doing this in the classroom, they clearly experience the process themselves. Figure 4.1 shows a portion of the spreadsheet implementation of this simulation.

6 More Frequency Figure 4.1 p-value Calculation Spreadsheet This p-value is clearly equal to the exact value calculated from standard normal distribution {P(Z > 2.627) = 0.004}. Rather than using the normal distribution table to find the p-value, this method explains the process better. The empirical distribution of the test statistic is also tabulated in the class and it shows the values are approximately standard normally distributed as it should be. Figure 4.2 shows the histogram Histogram of Test Statistic Test Statistic Values Figure 4.2 Histogram of the Test Statistics Values 5. Meeting GAISE Report Recommendations The activity of teaching sampling distributions using computer simulation meets three of the six recommendations suggested in the GAISE report [9] of the American Statistical Association. The

7 purpose of this report is to lay the foundation to help students achieve a goal of being sound statistically literate citizens who can apply the concepts well and think statistically. This report has revolutionized the way we teach introductory statistics. Our approach is somewhat closer to the active learning in the classroom since students are experimenting and obtaining the results by simulating the sampling distribution using Excel software. The following recommendations are met from this approach: a) Stress conceptual understanding rather than mere knowledge of procedures. Sampling distribution is the key to understand the topics of statistical inferences such as confidence intervals and hypothesis testing. This activity in the classroom stresses the importance of the understanding of randomness of the sampling distribution and that helps to learn the meaning of confidence intervals and p-values in hypothesis tests. b) Foster active learning in the classroom. While we show how to do it, students conduct the simulation and generate the sampling distribution themselves. They can verify the properties of the sampling distribution of the mean and the test statistics using the simulated results. Regularly we ask questions during the lesson to see whether students understand the concepts. c) Use technology for developing conceptual understanding and analyzing data. Students are using classroom computers and Excel software for this activity. They generate random samples, compute confidence intervals, values of test statistics and corresponding p-values to understand the underline concepts of these topics. This will visually illustrate the abstract concepts and enhance the conceptual understanding. 6. Comparison of Two Methods We have taught two introductory statistics sections last semester, one using computer simulation methods with Excel described in this paper and the other by the traditional method of not using simulation. Both classes have the same course content, same exams, quizzes, and assignments. Table 6.1 shows the final exam scores statistics: Table 6.1 Exam Score Statistics Class n Mean Median Std. Dev. CSM used Traditional method used The two sample t-test performed using Excel to test the hypothesis that the CSM class performs better on average than the traditional method class. The p-value produced by Excel was which indicates that CSM class performs significantly better at 0.05 level of significance. We have to caution that these sample sizes are not large enough to make a firm judgment on the conclusion. We plan to use larger sample sizes in the future classes to make a formal assessment.

8 7. Conclusions Many students have difficulties understanding introductory statistics concepts such as confidence intervals and hypothesis testing. Statistics instructors are always searching for new and efficient teaching methods to improve statistics instruction in hopes of enhancing student learning. Computer simulation methods as teaching tools are considered to be effective methods. We have demonstrated the use of simulation using Excel and Data Tables in teaching these topics. This is a very useful way to visualize the sampling distribution, confidence intervals, and to comprehend the p-value. Preliminary comparison of the two methods showed better outcomes using computer simulation methods. These simulation methods are acceptable to students with varying backgrounds of mathematics. References [1] Butler, A., Rothery, P., & Roy, D. (2003). Minitab Macros for Resampling Methods. Teaching Statistics, 25 (1), [2] Christie, D. (2004). Resampling with Excel. Teaching Statistics, 26 (1), [3] Cobb, P. (1994). Where is the Mind? Constructivist and Sociocultural Perspectives on Mathematical Development. Educational Researcher, 23, [4] Ecklund, P. (2009). Introduction to Excel 2007 Data tables and Data Table Exercises. https://www.amstat.org/publications/jse/secure/v7n3/delmas.cfm. [5] Erickson, T. (2007). Using Simulation to Learn About Inferences. International Conference on Teaching Statistics, 7, 1-6. [6] Hagtvedt, R., Jones, G. T., & Jones, K. (2008). Teaching Confidence Intervals Using Simulation, Teaching Statistics, 30 (2), [7] Mills, J. D. (2002). Using Computer Simulation Methods to Teach Statistics: A Review of the Literature. Journal of Statistics Education (Online), 10 (1). [8] Moore, D. S. (1996). Essential Statistics. New York, USA: W. H. Freeman & Company. [9] Winston, W. L. (2007). Excel 2007, Data Analysis and Business Modeling: Microsoft Press.

### Simulating Chi-Square Test Using Excel

Simulating Chi-Square Test Using Excel Leslie Chandrakantha John Jay College of Criminal Justice of CUNY Mathematics and Computer Science Department 524 West 59 th Street, New York, NY 10019 lchandra@jjay.cuny.edu

### LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

### t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com www.excelmasterseries.com

### CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,

### Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

### CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS CENTRAL LIMIT THEOREM (SECTION 7.2 OF UNDERSTANDABLE STATISTICS) The Central Limit Theorem says that if x is a random variable with any distribution having

### 9-3.4 Likelihood ratio test. Neyman-Pearson lemma

9-3.4 Likelihood ratio test Neyman-Pearson lemma 9-1 Hypothesis Testing 9-1.1 Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental

### Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; hflores@rice.edu, and Dr. J.A. Dobelman

Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; hflores@rice.edu, and Dr. J.A. Dobelman Statistics lab will be mainly focused on applying what you have learned in class with

### KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application

### Finite Mathematics Using Microsoft Excel

Overview and examples from Finite Mathematics Using Microsoft Excel Revathi Narasimhan Saint Peter's College An electronic supplement to Finite Mathematics and Its Applications, 6th Ed., by Goldstein,

### MINITAB ASSISTANT WHITE PAPER

MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

### Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

### Statistical Functions in Excel

Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.

### LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-

### Chapter 6: t test for dependent samples

Chapter 6: t test for dependent samples ****This chapter corresponds to chapter 11 of your book ( t(ea) for Two (Again) ). What it is: The t test for dependent samples is used to determine whether the

### Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

### MATH 140 HYBRID INTRODUCTORY STATISTICS COURSE SYLLABUS

MATH 140 HYBRID INTRODUCTORY STATISTICS COURSE SYLLABUS Instructor: Mark Schilling Email: mark.schilling@csun.edu (Note: If your CSUN email address is not one you use regularly, be sure to set up automatic

### Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................

### 6 3 The Standard Normal Distribution

290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

### Step 3: Go to Column C. Use the function AVERAGE to calculate the mean values of n = 5. Column C is the column of the means.

EXAMPLES - SAMPLING DISTRIBUTION EXCEL INSTRUCTIONS This exercise illustrates the process of the sampling distribution as stated in the Central Limit Theorem. Enter the actual data in Column A in MICROSOFT

### TIPS FOR DOING STATISTICS IN EXCEL

TIPS FOR DOING STATISTICS IN EXCEL Before you begin, make sure that you have the DATA ANALYSIS pack running on your machine. It comes with Excel. Here s how to check if you have it, and what to do if you

### Statistical Inference

Statistical Inference Idea: Estimate parameters of the population distribution using data. How: Use the sampling distribution of sample statistics and methods based on what would happen if we used this

### Using MS Excel to Analyze Data: A Tutorial

Using MS Excel to Analyze Data: A Tutorial Various data analysis tools are available and some of them are free. Because using data to improve assessment and instruction primarily involves descriptive and

### Online 12 - Sections 9.1 and 9.2-Doug Ensley

Student: Date: Instructor: Doug Ensley Course: MAT117 01 Applied Statistics - Ensley Assignment: Online 12 - Sections 9.1 and 9.2 1. Does a P-value of 0.001 give strong evidence or not especially strong

### Cell Phone Impairment?

Cell Phone Impairment? Overview of Lesson This lesson is based upon data collected by researchers at the University of Utah (Strayer and Johnston, 2001). The researchers asked student volunteers (subjects)

### AP Statistics: Syllabus 1

AP Statistics: Syllabus 1 Scoring Components SC1 The course provides instruction in exploring data. 4 SC2 The course provides instruction in sampling. 5 SC3 The course provides instruction in experimentation.

### MAT 12O ELEMENTARY STATISTICS I

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 12O ELEMENTARY STATISTICS I 3 Lecture Hours, 1 Lab Hour, 3 Credits Pre-Requisite:

### Measuring Evaluation Results with Microsoft Excel

LAURA COLOSI Measuring Evaluation Results with Microsoft Excel The purpose of this tutorial is to provide instruction on performing basic functions using Microsoft Excel. Although Excel has the ability

### Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB DIRECTIONS: Welcome! Your TA will help you apply your Calculator and MINITAB to review Business Stats, and will

### Technology Step-by-Step Using StatCrunch

Technology Step-by-Step Using StatCrunch Section 1.3 Simple Random Sampling 1. Select Data, highlight Simulate Data, then highlight Discrete Uniform. 2. Fill in the following window with the appropriate

### Teaching Statistics with Fathom

Teaching Statistics with Fathom UCB Extension X369.6 (2 semester units in Education) COURSE DESCRIPTION This is a professional-level, moderated online course in the use of Fathom Dynamic Data software

### Unit 29 Chi-Square Goodness-of-Fit Test

Unit 29 Chi-Square Goodness-of-Fit Test Objectives: To perform the chi-square hypothesis test concerning proportions corresponding to more than two categories of a qualitative variable To perform the Bonferroni

### Module 4 (Effect of Alcohol on Worms): Data Analysis

Module 4 (Effect of Alcohol on Worms): Data Analysis Michael Dunn Capuchino High School Introduction In this exercise, you will first process the timelapse data you collected. Then, you will cull (remove)

### Chapter 23 Inferences About Means

Chapter 23 Inferences About Means Chapter 23 - Inferences About Means 391 Chapter 23 Solutions to Class Examples 1. See Class Example 1. 2. We want to know if the mean battery lifespan exceeds the 300-minute

### STAT 360 Probability and Statistics. Fall 2012

STAT 360 Probability and Statistics Fall 2012 1) General information: Crosslisted course offered as STAT 360, MATH 360 Semester: Fall 2012, Aug 20--Dec 07 Course name: Probability and Statistics Number

### Chapter 7 Appendix. Inference for Distributions with Excel, JMP, Minitab, SPSS, CrunchIt!, R, and TI-83/-84 Calculators

Chapter 7 Appendix Inference for Distributions with Excel, JMP, Minitab, SPSS, CrunchIt!, R, and TI-83/-84 Calculators Inference for the Mean of a Population Excel t Confidence Interval for Mean Confidence

### Using Excel for Statistical Analysis

Using Excel for Statistical Analysis You don t have to have a fancy pants statistics package to do many statistical functions. Excel can perform several statistical tests and analyses. First, make sure

### MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

MONT 07N Understanding Randomness Solutions For Final Examination May, 00 Short Answer (a) (0) How are the EV and SE for the sum of n draws with replacement from a box computed? Solution: The EV is n times

### CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS CHI-SQUARE TESTS OF INDEPENDENCE (SECTION 11.1 OF UNDERSTANDABLE STATISTICS) In chi-square tests of independence we use the hypotheses. H0: The variables are independent

In-Depth Guide Advanced Spreadsheet Techniques Learning Objectives By reading and completing the activities in this chapter, you will be able to: Create PivotTables using Microsoft Excel Create scenarios

### Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

### Stats for Strategy Exam 1 In-Class Practice Questions DIRECTIONS

Stats for Strategy Exam 1 In-Class Practice Questions DIRECTIONS Choose the single best answer for each question. Discuss questions with classmates, TAs and Professor Whitten. Raise your hand to check

### AP Statistics 2002 Scoring Guidelines

AP Statistics 2002 Scoring Guidelines The materials included in these files are intended for use by AP teachers for course and exam preparation in the classroom; permission for any other use must be sought

### Econometrics and Data Analysis I

Econometrics and Data Analysis I Yale University ECON S131 (ONLINE) Summer Session A, 2014 June 2 July 4 Instructor: Doug McKee (douglas.mckee@yale.edu) Teaching Fellow: Yu Liu (dav.yu.liu@yale.edu) Classroom:

### Microsoft Excel. Qi Wei

Microsoft Excel Qi Wei Excel (Microsoft Office Excel) is a spreadsheet application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot

### Fairfield Public Schools

Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

### Standard Deviation Calculator

CSS.com Chapter 35 Standard Deviation Calculator Introduction The is a tool to calculate the standard deviation from the data, the standard error, the range, percentiles, the COV, confidence limits, or

### Inferential Statistics

Inferential Statistics Sampling and the normal distribution Z-scores Confidence levels and intervals Hypothesis testing Commonly used statistical methods Inferential Statistics Descriptive statistics are

### 03 The full syllabus. 03 The full syllabus continued. For more information visit www.cimaglobal.com PAPER C03 FUNDAMENTALS OF BUSINESS MATHEMATICS

0 The full syllabus 0 The full syllabus continued PAPER C0 FUNDAMENTALS OF BUSINESS MATHEMATICS Syllabus overview This paper primarily deals with the tools and techniques to understand the mathematics

### Figure 1. An embedded chart on a worksheet.

8. Excel Charts and Analysis ToolPak Charts, also known as graphs, have been an integral part of spreadsheets since the early days of Lotus 1-2-3. Charting features have improved significantly over the

### Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

### How to Create a Data Table in Excel 2010

How to Create a Data Table in Excel 2010 Introduction Nicole Bernstein Excel 2010 is a useful tool for data analysis and calculations. Most college students are familiar with the basic functions of this

### Measuring the Power of a Test

Textbook Reference: Chapter 9.5 Measuring the Power of a Test An economic problem motivates the statement of a null and alternative hypothesis. For a numeric data set, a decision rule can lead to the rejection

### An introduction to using Microsoft Excel for quantitative data analysis

Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing

### Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Part 3 Comparing Groups Chapter 7 Comparing Paired Groups 189 Chapter 8 Comparing Two Independent Groups 217 Chapter 9 Comparing More Than Two Groups 257 188 Elementary Statistics Using SAS Chapter 7 Comparing

2. DATA AND EXERCISES (Geos2911 students please read page 8) 2.1 Data set The data set available to you is an Excel spreadsheet file called cyclones.xls. The file consists of 3 sheets. Only the third is

### Preparation of Two-Year College Mathematics Instructors to Teach Statistics with GAISE Session on Assessment

Preparation of Two-Year College Mathematics Instructors to Teach Statistics with GAISE Session on Assessment - 1 - American Association for Higher Education (AAHE) 9 Principles of Good Practice for Assessing

### A Hands-On Exercise Improves Understanding of the Standard Error. of the Mean. Robert S. Ryan. Kutztown University

A Hands-On Exercise 1 Running head: UNDERSTANDING THE STANDARD ERROR A Hands-On Exercise Improves Understanding of the Standard Error of the Mean Robert S. Ryan Kutztown University A Hands-On Exercise

### HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

### Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

### 2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

### 1 SAMPLE SIGN TEST. Non-Parametric Univariate Tests: 1 Sample Sign Test 1. A non-parametric equivalent of the 1 SAMPLE T-TEST.

Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

### PELLISSIPPI STATE COMMUNITY COLLEGE MASTER SYLLABUS INTRODUCTION TO STATISTICS MATH 2050

PELLISSIPPI STATE COMMUNITY COLLEGE MASTER SYLLABUS INTRODUCTION TO STATISTICS MATH 2050 Class Hours: 2.0 Credit Hours: 3.0 Laboratory Hours: 2.0 Date Revised: Fall 2013 Catalog Course Description: Descriptive

### Microsoft Excel 2010 and Tools for Statistical Analysis

Appendix E: Microsoft Excel 2010 and Tools for Statistical Analysis Microsoft Excel 2010, part of the Microsoft Office 2010 system, is a spreadsheet program that can be used to organize and analyze data,

### Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

### Teaching Business Statistics through Problem Solving

Teaching Business Statistics through Problem Solving David M. Levine, Baruch College, CUNY with David F. Stephan, Two Bridges Instructional Technology CONTACT: davidlevine@davidlevinestatistics.com Typical

### Chapter 7 Section 1 Homework Set A

Chapter 7 Section 1 Homework Set A 7.15 Finding the critical value t *. What critical value t * from Table D (use software, go to the web and type t distribution applet) should be used to calculate the

### Standard Deviation Estimator

CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

### Drawing a histogram using Excel

Drawing a histogram using Excel STEP 1: Examine the data to decide how many class intervals you need and what the class boundaries should be. (In an assignment you may be told what class boundaries to

### How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

### 5.1 Identifying the Target Parameter

University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying

### Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques

### Cents and the Central Limit Theorem Overview of Lesson GAISE Components Common Core State Standards for Mathematical Practice

Cents and the Central Limit Theorem Overview of Lesson In this lesson, students conduct a hands-on demonstration of the Central Limit Theorem. They construct a distribution of a population and then construct

### UNIVERSITY of MASSACHUSETTS DARTMOUTH Charlton College of Business Decision and Information Sciences Fall 2010

UNIVERSITY of MASSACHUSETTS DARTMOUTH Charlton College of Business Decision and Information Sciences Fall 2010 COURSE: POM 500 Statistical Analysis, ONLINE EDITION, Fall 2010 Prerequisite: Finite Math

### How to Use a Data Spreadsheet: Excel

How to Use a Data Spreadsheet: Excel One does not necessarily have special statistical software to perform statistical analyses. Microsoft Office Excel can be used to run statistical procedures. Although

### An interval estimate (confidence interval) is an interval, or range of values, used to estimate a population parameter. For example 0.476

Lecture #7 Chapter 7: Estimates and sample sizes In this chapter, we will learn an important technique of statistical inference to use sample statistics to estimate the value of an unknown population parameter.

### Introduction to Stata and Hypothesis testing.

Introduction to Stata and Hypothesis testing. The goals today are simple let s open Stata, understand basically how it works, understand what a dofile is, and then run some basic hypothesis tests for testing

### Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different

### Statistics Software Survey Results

Statistics Software Survey Results Dr. Nancy Pfenning University of Pittsburgh Department of Statistics The following survey was administered in October, 24 to two business statistics classes (139 students

### Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

### Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:

### Name: Date: Use the following to answer questions 3-4:

Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

### Economic Statistics (ECON2006), Statistics and Research Design in Psychology (PSYC2010), Survey Design and Analysis (SOCI2007)

COURSE DESCRIPTION Title Code Level Semester Credits 3 Prerequisites Post requisites Introduction to Statistics ECON1005 (EC160) I I None Economic Statistics (ECON2006), Statistics and Research Design

Chapter 11 Testing Hypotheses About Proportions Hypothesis testing method: uses data from a sample to judge whether or not a statement about a population may be true. Steps in Any Hypothesis Test 1. Determine

### Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015

Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation

### An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice

### Truman College-Mathematics Department Math 125-CD: Introductory Statistics Course Syllabus Fall 2012

Instructor: Dr. Abdallah Shuaibi Office #: 3816 Email: ashuaibi1@ccc.edu URL: http://faculty.ccc.edu/ashuaibi/ Phone #: (773)907-4085 Office Hours: Truman College-Mathematics Department Math 125-CD: Introductory

### Street Address: 1111 Franklin Street Oakland, CA 94607. Mailing Address: 1111 Franklin Street Oakland, CA 94607

Contacts University of California Curriculum Integration (UCCI) Institute Sarah Fidelibus, UCCI Program Manager Street Address: 1111 Franklin Street Oakland, CA 94607 1. Program Information Mailing Address:

### How to Conduct a Hypothesis Test

How to Conduct a Hypothesis Test The idea of hypothesis testing is relatively straightforward. In various studies we observe certain events. We must ask, is the event due to chance alone, or is there some

### Math 108 Exam 3 Solutions Spring 00

Math 108 Exam 3 Solutions Spring 00 1. An ecologist studying acid rain takes measurements of the ph in 12 randomly selected Adirondack lakes. The results are as follows: 3.0 6.5 5.0 4.2 5.5 4.7 3.4 6.8

### Data exploration with Microsoft Excel: analysing more than one variable

Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical

### Chi Square Tests. Chapter 10. 10.1 Introduction

Contents 10 Chi Square Tests 703 10.1 Introduction............................ 703 10.2 The Chi Square Distribution.................. 704 10.3 Goodness of Fit Test....................... 709 10.4 Chi Square

### SPSS for Exploratory Data Analysis Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav)

Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav) Organize and Display One Quantitative Variable (Descriptive Statistics, Boxplot & Histogram) 1. Move the mouse pointer

### Recitation, Week 3: Basic Descriptive Statistics and Measures of Central Tendency:

Recitation, Week 3: Basic Descriptive Statistics and Measures of Central Tendency: 1. What does Healey mean by data reduction? a. Data reduction involves using a few numbers to summarize the distribution

### p ˆ (sample mean and sample

Chapter 6: Confidence Intervals and Hypothesis Testing When analyzing data, we can t just accept the sample mean or sample proportion as the official mean or proportion. When we estimate the statistics