11.2 POINT ESTIMATES AND CONFIDENCE INTERVALS


Point Estimates

Suppose we want to estimate the proportion of Americans who approve of the president. In the previous section we took a random sample of size 1200 from the population and used the proportion of the people in the sample who approved of the president to estimate the proportion of the people in the entire country who approve of the president. The proportion of people in the sample who approve of the president is an example of what is called a point estimate. It is called a point estimate because it is a single number that estimates a population parameter, here the proportion of the people in the country who approve of the president.

Example 11.3 Suppose that out of 1200 people in the sample, 615 approve of the president. The point estimate of the proportion of the people in the country who approve of the president is 615/1200 = 0.5125.

Example 11.4 Suppose that in the example of the 10 containers of sampled water, the sample average of the 10 observed E. coli bacteria densities is 1500/cm². The point estimate of the density of E. coli bacteria in the swimming pool is 1500/cm².

Example 11.5 It was desired to find the population mean and standard deviation of the ages of the students enrolled at a particular community college. It was impossible to survey all the students, so a random sample of 100 students was taken and the mean and the standard deviation of the ages of the students in the sample were calculated. The sample mean age, $\bar{X}$, is a point estimate of the population mean age, $\mu$. The sample standard deviation of the ages, $S = 2.25$, is a point estimate of the population standard deviation, $\sigma$.

In summary, a point estimate for a population parameter of interest is a statistic computed from the sample. It is believed to be an effective estimate of the unknown population parameter. Often, the formula to use for the statistic is obvious. For example, for the population parameter $\mu$, which is the mean over a (presumably large) population, it seems that the sample mean $\bar{X}$ will be a good estimate. However, in more complex situations the choice of an estimate of a population parameter is not always clear.
Statisticians may find that either they have no idea how to use the sample data to estimate the population parameter of interest, or they have several equally plausible competing estimates to choose from. A relatively simple example of the latter case is the estimation of the population variance $\sigma^2$. Let $n$ denote the sample size. Many statisticians would use $S^2 = \sum_i (X_i - \bar{X})^2 / n$, while many others would use $S^2 = \sum_i (X_i - \bar{X})^2 / (n - 1)$. Here $X_i$ denotes the $i$th observation of the random sample. A solid argument can be made for either estimate. In fact, a significant amount of statistical theory is devoted to finding the best estimate of a population parameter.

Let's consider an example in which the formula for a good estimate is not obvious.

Example 11.6 Suppose the waiting time for a train that goes to the parking area from a particular terminal at Chicago's O'Hare International Airport obeys a continuous uniform distribution on the time interval [0, T], where T is an unknown population parameter. Thus a train always arrives within T minutes, but it is equally likely to arrive at any moment during this waiting time. Suppose we interview five randomly selected passengers and find that their waiting times were (in minutes) 1.4, 4.7, 4.2, 5.1, and 2.1. We want to estimate the parameter T. Clearly the sample mean is a poor choice for an estimate of T: here it equals 3.5 minutes, yet one passenger waited 5.1 minutes, so T must be at least 5.1. What formula should we use? A widely used technique for producing an estimate when we have none in mind, called maximum likelihood estimation, leads us to an estimate that is equal to the maximum of the five observed times, which is 5.1 minutes. This choice of the maximum of the observed values as our point estimate is not obvious. Even though it might not be the best choice for an estimate of T, it seems clearly better than $\bar{X}$. We will not consider maximum likelihood estimation in this book.

A point estimate reports one single value that we estimate to be the true value of the population parameter. However, point estimates have the important limitation of not informing us how much the estimate is likely to be in error. Whenever we estimate a population parameter, we lack total accuracy. Thus our estimate will almost always be different from the actual population parameter, and so it will almost always have some amount of error associated with it.
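The two competing variance formulas and the estimates discussed in Example 11.6 can be tried on the five waiting times. The following is only a minimal sketch: the variable names are hypothetical, and applying the variance formulas to the waiting-time data is an illustration chosen here, not something the text does.

```python
# Waiting times (minutes) of the five passengers in Example 11.6.
waits = [1.4, 4.7, 4.2, 5.1, 2.1]

n = len(waits)
mean = sum(waits) / n

# Sum of squared deviations from the sample mean.
ss = sum((x - mean) ** 2 for x in waits)

# Two competing estimates of the population variance.
var_n = ss / n                # divides by n
var_n_minus_1 = ss / (n - 1)  # divides by n - 1

# Estimate of T under the uniform [0, T] model: the largest observation.
t_hat = max(waits)

print(f"sample mean          = {mean:.3f}")
print(f"variance (divide n)  = {var_n:.3f}")
print(f"variance (divide n-1)= {var_n_minus_1:.3f}")
print(f"estimate of T (max)  = {t_hat:.1f}")
```

The divisor n - 1 always gives the larger of the two variance estimates, and the gap between them shrinks as n grows. The sample mean falls below the largest observed wait, which is why it cannot be a sensible estimate of T.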
By simply reporting the point estimate of a parameter, we have essentially ignored the important issue of the likely error size associated with the estimate.

Example 11.7 Reconsider the example of the ages of the community college students. A second random sample, this time of 200 students, was taken. Suppose, to keep our explanation simple, that the point estimate of the population mean is the same in both cases. The standard deviation of the second sample mean would be lower than the standard deviation of the first sample mean, because the second sample is larger. So even though the two point estimates of the population mean are the same, the second one is surely more accurate, because it is based on more information, namely twice as many observations.
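As a rough worked illustration of Example 11.7, assume (the text does not say this) that both samples have the same sample standard deviation, $S = 2.25$ as in Example 11.5, and that the standard deviation of a sample mean is estimated by the usual formula $S/\sqrt{n}$:

```latex
\frac{S}{\sqrt{100}} = \frac{2.25}{10} = 0.225,
\qquad
\frac{S}{\sqrt{200}} = \frac{2.25}{\sqrt{200}} \approx 0.159 .
```

Under these assumptions, doubling the sample size multiplies the standard deviation of the sample mean by $1/\sqrt{2} \approx 0.71$, so the second estimate is subject to noticeably less error even though the two point estimates coincide.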

Confidence Intervals

To improve on point estimates, statisticians usually report an interval of values that they believe the parameter is highly likely to lie in. Usually the point estimate is the midpoint of the interval, and the endpoints of the interval communicate the size of the error associated with the estimate (recall that point estimates ignore this error) and how confident we are that the population parameter is in the interval. These intervals are called confidence intervals. Typical confidence levels used in practice for confidence intervals are 90%, 95%, or 99%, with 95% occurring most frequently in applications.

In Examples 11.1 and 11.3, if we are given a 95% confidence interval for the proportion of the population that approves of the president, which can be shown to be the interval (0.48, 0.54), we say we are 95% confident that the population proportion is contained in the confidence interval. In Examples 11.2 and 11.4, given a 90% confidence interval for the density of bacteria in the swimming pool, (1490, 1510) say, we say we are 90% confident that the density of bacteria in the swimming pool is contained in the confidence interval. (We will learn how to calculate confidence intervals for different population parameters later in the chapter.)

What does it really mean to state a 95% confidence interval for the unknown population proportion approving of the president? Although 95% sounds impressive, we cannot be satisfied unless we understand what it means. Theoretically, it means that the probability is 0.95 that such a confidence interval, which is random because the sample it is formed from is random, will contain (surround) the unknown proportion in the population approving of the president. Our experimental view of probability based on the five-step method will help us understand more clearly and deeply what this probability of 0.95 means in practice.
Just as we do simulations over and over in the five-step method, imagine that a statistician does the sampling experiment of Example 11.1 over and over (1000 times, say) and each time computes a 95% confidence interval from the 1200 sampled people. Now we can find the experimental probability given by the proportion of the 1000 confidence intervals that actually cover the true fraction of the population favoring the president. Since 0.95 is the theoretical probability of the interval containing the population proportion, this experimental probability of the 1000 confidence intervals including the true value will also be close to 0.95. (Below, we will simulate 100 such 95% confidence intervals and calculate the experimental confidence interval probability.)

Of course, in a real application there will only be one random sample and hence only one such confidence interval, such as the (0.48, 0.54) interval of Examples 11.1 and 11.3. But the statistician obtaining this one sample knows, because of the experimental probability viewpoint, that this confidence interval is very likely to be correct in the sense that it contains the true value (since about 95% of such confidence intervals would cover the true population proportion). In the case of the (0.48, 0.54) interval of Examples 11.1 and 11.3, we know it is very likely that the true proportion of people favoring the president lies between 0.48 and 0.54.

Now, in light of this insight into how to interpret the confidence interval percentage, let's return to the Key Problem. The St. Louis Post Dispatch explained the concept of its reported confidence interval this way: "[A 95% confidence interval] means if the survey were taken 100 times, the results for the [random] group of respondents would each vary no more than 5.7 percent in either direction from the true population percentage opposing the stadium [about] 95% of these times."

The Post Dispatch quote is a bit roundabout and hence forces us to go through a slightly tricky piece of logic (draw yourself a picture if needed). If the midpoint of the interval, which extends 5.7% in either direction from that midpoint, indeed varies no more than 5.7% in either direction from the true population percentage about 95% of the time, then about 95% of these intervals must contain the true population percentage, as desired. The Post Dispatch could have told its readers more simply and directly that such an interval can be expected to contain the true population parameter about 95% of the time. In other words, just as explained above in the presidential popularity example, the Post Dispatch is pointing out that if you take 100 random samples from the same population and calculate the confidence interval for the population proportion for each sample, about 95% of the confidence intervals will include the true population proportion.
That is, you will be correct in your claim that the unknown population proportion is in the interval computed using the sample for about 95 of the 100 samples. Using our five-step method, we now simulate 100 confidence intervals and determine how many of them contain the true population parameter. Suppose that for the Key Problem the true proportion of people in the county who are opposed to the stadium is 50% (remember that the parameter value is never known to the statistician). In that case the probability that a person in the sample will be opposed to the stadium is the same as the probability of heads seen in flipping a fair coin: 0.5. Our goal is to obtain 100 samples of 301 people and calculate the confidence interval for each of these 100 samples (we will learn to compute such confidence intervals in Section 11.6 below). Each sampling is the same as flipping a coin 301 times, recording the number of heads, and calculating the confidence interval for the proportion of heads in the sample. We then repeat the process 100 times. The 100 confidence intervals obtained from this process are represented in Figure 11.1.
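The simulation just described can be sketched in Python. This is only an illustration under stated assumptions: the interval formula used here (the normal-approximation, or Wald, interval with z = 1.96) is an assumption, since the text defers the actual computation to Section 11.6, and the function names and the random seed are hypothetical.

```python
import math
import random


def wald_interval(successes, n, z=1.96):
    """Approximate 95% interval for a proportion (assumed formula)."""
    p_hat = successes / n
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width


def simulate_coverage(true_p=0.5, n=301, repetitions=100, seed=0):
    """Count how many simulated intervals contain the true proportion."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(repetitions):
        # One survey: n people, each opposed with probability true_p,
        # just like flipping a fair coin n times when true_p is 0.5.
        opposed = sum(rng.random() < true_p for _ in range(n))
        low, high = wald_interval(opposed, n)
        if low <= true_p <= high:
            hits += 1
    return hits


hits = simulate_coverage()
print(f"{hits} of 100 simulated intervals contain 0.50")
```

The exact count depends on the seed, but it should typically land near 95. With 301 people and a sample proportion near 0.5, the half-width of this assumed interval is about 0.056, in line with the 5.7 percent quoted by the Post Dispatch.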

Figure 11.1 One hundred simulated 95% confidence intervals for the Key Problem assuming a 50/50 population split.

The line across the middle of the graph represents the true population proportion, 0.50. The 100 confidence intervals are the vertical lines on the graph. If the confidence interval covers the 0.50 line, then we say that the true population proportion is contained in the interval. Likewise, if the confidence interval does not cover the 0.50 line, then we say that the true population proportion is not contained in the interval. For the graph of Figure 11.1, we see that 96 out of the 100 confidence intervals cover the 0.50 line. So for the 100 confidence intervals, 96% of them (about 95%, as expected) contain the true population parameter, 0.50.

Thus, this example of the five-step method clearly illustrates how we are to correctly interpret a 95% confidence interval. As already discussed above, it is a sort of statistician's success rate or batting average. If a statistician constructs 95% confidence intervals during a year's work, then, as our five-step simulation confirms, we can expect about 95% of them to be "hits": cases in which the population parameter is contained in the interval. Similarly, we would expect about 5% of them to be "outs," or misses: cases in which the population parameter is not contained in the interval. Compared with baseball, in which a batting average far below 0.500 is considered great, a coverage rate of 0.95 is what is usually required of statisticians. A given batting average is never guaranteed in baseball, but a statistician can in fact guarantee a 95% confidence level, as we shall see.

Confidence intervals have two basic characteristics that we need to understand. First, given the same set of data, a 95% confidence interval is wider than a 90% confidence interval, and a 99% confidence interval is wider than a 95% confidence interval. Thus, the higher the confidence level we require, the wider the interval we are forced to accept!
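The first characteristic can be checked numerically. The sketch below assumes the normal-approximation (Wald) interval for a proportion, which the text has not yet introduced, and applies it to the 615 approvals out of 1200 from Example 11.3. The function name `wald_interval` is hypothetical.

```python
import math
from statistics import NormalDist


def wald_interval(successes, n, level=0.95):
    """Normal-approximation interval for a proportion (assumed formula)."""
    p_hat = successes / n
    # Two-sided critical value, e.g. about 1.96 for level = 0.95.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width


levels = (0.90, 0.95, 0.99)
intervals = {level: wald_interval(615, 1200, level) for level in levels}

for level, (low, high) in intervals.items():
    print(f"{level:.0%}: ({low:.4f}, {high:.4f})  length {high - low:.4f}")
```

Each increase in the confidence level lengthens the interval around the same point estimate. Calling `wald_interval(1230, 2400)` instead keeps the point estimate at 0.5125 but gives a shorter 95% interval, which illustrates the effect of sample size taken up in the rest of this section. Intervals computed this way need not agree exactly with rounded intervals quoted for other examples, since those may rest on different data.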
Of course, a very wide interval is of little use to the scientist who has sought statistical advice. Thus there is no free lunch in specifying a 99% confidence instead of a 95% confidence, because the price paid is a wider interval. Here are the 90%, 95%, and 99% confidence intervals for the proportion of people in the country who approve of the president in Example 11.1:

90%: (0.49, 0.53)
95%: (0.48, 0.54)
99%: (0.47, 0.55)

As you can see from this example, the higher the level of confidence, the wider the confidence interval needs to be in order to contain the population proportion with the specified confidence.

The second characteristic of confidence intervals is that, given the same confidence level, a shorter and hence more informative confidence interval is associated with more data points. Suppose in the situation of Example 11.1 a sample of 2400 people was taken from the population and the number of people in the sample that approved of the president was 1230. The point estimate of the population proportion would be the same: 1230/2400 = 0.5125. However, a 95% confidence interval for the population proportion based on this sample of 2400 can be shown to have lower endpoint 0.4925 and length about 0.04. This is shorter than the 95% confidence interval for the original sample of 1200, which can be shown to have lower endpoint 0.4842 and length about 0.057. Indeed, the result of increasing the sample size is a shorter interval for the same confidence level of 95%.

In summary, point estimates provide only a single number to estimate the value of a population parameter. By contrast, confidence intervals give a range of values that we reasonably expect will contain the population parameter. Again, a 95% confidence interval means that if we were to take a large number of samples (like 100 or 1000) of equal size from the same population and calculate a confidence interval for the population parameter from each, about 95% of the confidence intervals would contain the true value of the population parameter.

SECTION 11.2 EXERCISES

1. Suppose we want to estimate the proportion of a city's residents who drive to work. What is a good choice for the point estimate of this proportion?

2. Suppose, instead, we want to estimate the average number of miles people living in a city drive to work. What are two possible choices for the point estimate of this average?

3. Explain the meaning of a 99% confidence level.

4. Suppose you want a confidence interval for a population proportion. You want to be as accurate as possible, so you select a 100% confidence level. What would your confidence interval have to be?

5. Which confidence interval, when based on the same data, is wider: an 80% or an 85% confidence interval?

6. True or false: If you flip a fair coin 100 times, then calculate a 95% confidence interval, there is an approximate 95% chance 1/2 will be in the interval.

7. A Gallup poll of 1013 adults found 61% of the people in the sample drink alcoholic beverages, which yields a confidence interval of (58%, 64%). True or false: There is an approximate 95% chance that the percentage of adults in the population who drink alcoholic beverages is between 58% and 64%.


More information

Kenken For Teachers. Tom Davis tomrdavis@earthlink.net http://www.geometer.org/mathcircles June 27, 2010. Abstract

Kenken For Teachers. Tom Davis tomrdavis@earthlink.net http://www.geometer.org/mathcircles June 27, 2010. Abstract Kenken For Teachers Tom Davis tomrdavis@earthlink.net http://www.geometer.org/mathcircles June 7, 00 Abstract Kenken is a puzzle whose solution requires a combination of logic and simple arithmetic skills.

More information

Significant Figures, Propagation of Error, Graphs and Graphing

Significant Figures, Propagation of Error, Graphs and Graphing Chapter Two Significant Figures, Propagation of Error, Graphs and Graphing Every measurement has an error associated with it. If you were to put an object on a balance and weight it several times you will

More information

Basic Proof Techniques

Basic Proof Techniques Basic Proof Techniques David Ferry dsf43@truman.edu September 13, 010 1 Four Fundamental Proof Techniques When one wishes to prove the statement P Q there are four fundamental approaches. This document

More information

WRITING A CRITICAL ARTICLE REVIEW

WRITING A CRITICAL ARTICLE REVIEW WRITING A CRITICAL ARTICLE REVIEW A critical article review briefly describes the content of an article and, more importantly, provides an in-depth analysis and evaluation of its ideas and purpose. The

More information

The Standard Normal distribution

The Standard Normal distribution The Standard Normal distribution 21.2 Introduction Mass-produced items should conform to a specification. Usually, a mean is aimed for but due to random errors in the production process we set a tolerance

More information

Characteristics of Binomial Distributions

Characteristics of Binomial Distributions Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation

More information

Inference for two Population Means

Inference for two Population Means Inference for two Population Means Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison October 27 November 1, 2011 Two Population Means 1 / 65 Case Study Case Study Example

More information

Sample Size Issues for Conjoint Analysis

Sample Size Issues for Conjoint Analysis Chapter 7 Sample Size Issues for Conjoint Analysis I m about to conduct a conjoint analysis study. How large a sample size do I need? What will be the margin of error of my estimates if I use a sample

More information

Objectives. Materials

Objectives. Materials Activity 4 Objectives Understand what a slope field represents in terms of Create a slope field for a given differential equation Materials TI-84 Plus / TI-83 Plus Graph paper Introduction One of the ways

More information

THE STATISTICAL TREATMENT OF EXPERIMENTAL DATA 1

THE STATISTICAL TREATMENT OF EXPERIMENTAL DATA 1 THE STATISTICAL TREATMET OF EXPERIMETAL DATA Introduction The subject of statistical data analysis is regarded as crucial by most scientists, since error-free measurement is impossible in virtually all

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Social Return on Investment

Social Return on Investment Social Return on Investment Valuing what you do Guidance on understanding and completing the Social Return on Investment toolkit for your organisation 60838 SROI v2.indd 1 07/03/2013 16:50 60838 SROI v2.indd

More information

Exact Nonparametric Tests for Comparing Means - A Personal Summary

Exact Nonparametric Tests for Comparing Means - A Personal Summary Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

Executive Summary. Viability of the Return of a Major League Baseball Franchise to Montreal (the Expos )

Executive Summary. Viability of the Return of a Major League Baseball Franchise to Montreal (the Expos ) Executive Summary Viability of the Return of a Major League Baseball Franchise to Montreal (the Expos ) November 2013 Table of Contents 1. CONTEXT AND OBJECTIVES... 3 2. RESEARCH METHODS... 5 3. KEY RESULTS...

More information

STATISTICS 8: CHAPTERS 7 TO 10, SAMPLE MULTIPLE CHOICE QUESTIONS

STATISTICS 8: CHAPTERS 7 TO 10, SAMPLE MULTIPLE CHOICE QUESTIONS STATISTICS 8: CHAPTERS 7 TO 10, SAMPLE MULTIPLE CHOICE QUESTIONS 1. If two events (both with probability greater than 0) are mutually exclusive, then: A. They also must be independent. B. They also could

More information

RATIOS, PROPORTIONS, PERCENTAGES, AND RATES

RATIOS, PROPORTIONS, PERCENTAGES, AND RATES RATIOS, PROPORTIOS, PERCETAGES, AD RATES 1. Ratios: ratios are one number expressed in relation to another by dividing the one number by the other. For example, the sex ratio of Delaware in 1990 was: 343,200

More information

Means, standard deviations and. and standard errors

Means, standard deviations and. and standard errors CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard

More information

Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

More information

7.6 Approximation Errors and Simpson's Rule

7.6 Approximation Errors and Simpson's Rule WileyPLUS: Home Help Contact us Logout Hughes-Hallett, Calculus: Single and Multivariable, 4/e Calculus I, II, and Vector Calculus Reading content Integration 7.1. Integration by Substitution 7.2. Integration

More information

Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS

Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS Sensitivity Analysis 3 We have already been introduced to sensitivity analysis in Chapter via the geometry of a simple example. We saw that the values of the decision variables and those of the slack and

More information

Fundamentals of Probability

Fundamentals of Probability Fundamentals of Probability Introduction Probability is the likelihood that an event will occur under a set of given conditions. The probability of an event occurring has a value between 0 and 1. An impossible

More information

Formal Languages and Automata Theory - Regular Expressions and Finite Automata -

Formal Languages and Automata Theory - Regular Expressions and Finite Automata - Formal Languages and Automata Theory - Regular Expressions and Finite Automata - Samarjit Chakraborty Computer Engineering and Networks Laboratory Swiss Federal Institute of Technology (ETH) Zürich March

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1]. Probability Theory Probability Spaces and Events Consider a random experiment with several possible outcomes. For example, we might roll a pair of dice, flip a coin three times, or choose a random real

More information

If A is divided by B the result is 2/3. If B is divided by C the result is 4/7. What is the result if A is divided by C?

If A is divided by B the result is 2/3. If B is divided by C the result is 4/7. What is the result if A is divided by C? Problem 3 If A is divided by B the result is 2/3. If B is divided by C the result is 4/7. What is the result if A is divided by C? Suggested Questions to ask students about Problem 3 The key to this question

More information

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Stat 5102 Notes: Nonparametric Tests and. confidence interval Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions

More information

The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces

The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces Or: How I Learned to Stop Worrying and Love the Ball Comment [DP1]: Titles, headings, and figure/table captions

More information

c 2008 Je rey A. Miron We have described the constraints that a consumer faces, i.e., discussed the budget constraint.

c 2008 Je rey A. Miron We have described the constraints that a consumer faces, i.e., discussed the budget constraint. Lecture 2b: Utility c 2008 Je rey A. Miron Outline: 1. Introduction 2. Utility: A De nition 3. Monotonic Transformations 4. Cardinal Utility 5. Constructing a Utility Function 6. Examples of Utility Functions

More information

Chapter 6: Probability

Chapter 6: Probability Chapter 6: Probability In a more mathematically oriented statistics course, you would spend a lot of time talking about colored balls in urns. We will skip over such detailed examinations of probability,

More information

SYSTEMS OF EQUATIONS AND MATRICES WITH THE TI-89. by Joseph Collison

SYSTEMS OF EQUATIONS AND MATRICES WITH THE TI-89. by Joseph Collison SYSTEMS OF EQUATIONS AND MATRICES WITH THE TI-89 by Joseph Collison Copyright 2000 by Joseph Collison All rights reserved Reproduction or translation of any part of this work beyond that permitted by Sections

More information

WRITING PROOFS. Christopher Heil Georgia Institute of Technology

WRITING PROOFS. Christopher Heil Georgia Institute of Technology WRITING PROOFS Christopher Heil Georgia Institute of Technology A theorem is just a statement of fact A proof of the theorem is a logical explanation of why the theorem is true Many theorems have this

More information

QUEUING THEORY. 1. Introduction

QUEUING THEORY. 1. Introduction QUEUING THEORY RYAN BERRY Abstract. This paper defines the building blocks of and derives basic queuing systems. It begins with a review of some probability theory and then defines processes used to analyze

More information

arxiv:1112.0829v1 [math.pr] 5 Dec 2011

arxiv:1112.0829v1 [math.pr] 5 Dec 2011 How Not to Win a Million Dollars: A Counterexample to a Conjecture of L. Breiman Thomas P. Hayes arxiv:1112.0829v1 [math.pr] 5 Dec 2011 Abstract Consider a gambling game in which we are allowed to repeatedly

More information

1 Prior Probability and Posterior Probability

1 Prior Probability and Posterior Probability Math 541: Statistical Theory II Bayesian Approach to Parameter Estimation Lecturer: Songfeng Zheng 1 Prior Probability and Posterior Probability Consider now a problem of statistical inference in which

More information

NON-PROBABILITY SAMPLING TECHNIQUES

NON-PROBABILITY SAMPLING TECHNIQUES NON-PROBABILITY SAMPLING TECHNIQUES PRESENTED BY Name: WINNIE MUGERA Reg No: L50/62004/2013 RESEARCH METHODS LDP 603 UNIVERSITY OF NAIROBI Date: APRIL 2013 SAMPLING Sampling is the use of a subset of the

More information

Chapter 27: Taxation. 27.1: Introduction. 27.2: The Two Prices with a Tax. 27.2: The Pre-Tax Position

Chapter 27: Taxation. 27.1: Introduction. 27.2: The Two Prices with a Tax. 27.2: The Pre-Tax Position Chapter 27: Taxation 27.1: Introduction We consider the effect of taxation on some good on the market for that good. We ask the questions: who pays the tax? what effect does it have on the equilibrium

More information

Stat 20: Intro to Probability and Statistics

Stat 20: Intro to Probability and Statistics Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 14.30 Introduction to tatistical Methods in Economics pring 2009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Math 541: Statistical Theory II Lecturer: Songfeng Zheng Maximum Likelihood Estimation 1 Maximum Likelihood Estimation Maximum likelihood is a relatively simple method of constructing an estimator for

More information

9. Sampling Distributions

9. Sampling Distributions 9. Sampling Distributions Prerequisites none A. Introduction B. Sampling Distribution of the Mean C. Sampling Distribution of Difference Between Means D. Sampling Distribution of Pearson's r E. Sampling

More information

Random variables, probability distributions, binomial random variable

Random variables, probability distributions, binomial random variable Week 4 lecture notes. WEEK 4 page 1 Random variables, probability distributions, binomial random variable Eample 1 : Consider the eperiment of flipping a fair coin three times. The number of tails that

More information

Sampling. COUN 695 Experimental Design

Sampling. COUN 695 Experimental Design Sampling COUN 695 Experimental Design Principles of Sampling Procedures are different for quantitative and qualitative research Sampling in quantitative research focuses on representativeness Sampling

More information

People have thought about, and defined, probability in different ways. important to note the consequences of the definition:

People have thought about, and defined, probability in different ways. important to note the consequences of the definition: PROBABILITY AND LIKELIHOOD, A BRIEF INTRODUCTION IN SUPPORT OF A COURSE ON MOLECULAR EVOLUTION (BIOL 3046) Probability The subject of PROBABILITY is a branch of mathematics dedicated to building models

More information

1(a). How many ways are there to rearrange the letters in the word COMPUTER?

1(a). How many ways are there to rearrange the letters in the word COMPUTER? CS 280 Solution Guide Homework 5 by Tze Kiat Tan 1(a). How many ways are there to rearrange the letters in the word COMPUTER? There are 8 distinct letters in the word COMPUTER. Therefore, the number of

More information