CONFIDENCE INTERVALS I

Size: px
Start display at page:

Download "CONFIDENCE INTERVALS I"

Transcription

1 CONFIDENCE INTERVALS I ESTIMATION: the sample mean Gx is an estimate of the population mean µ point of sampling is to obtain estimates of population values Example: for 55 students in Section 105, 45 of 55 work: p s = 8%; for those who work, the mean number of hours xg = inference: 8% of ASU students work an average of hours a week; p = 0.8 and µ = Problem: the sampling distribution is a continuous distribution the probability that Gx actually equals µ is zero Gx is not an accurate estimate of µ; in this case we cannot even state the probability that Gx is accurate Gx is called a point estimate of µ statisticians generally prefer to give an interval estimate: "There is a 90% probability that µ is between 1 and 17.5." interval estimate has two features the estimate in interval form a probability statement: taken as an assessment of the reliability or accuracy of the estimate the probability hinges on the probabilities found in the sampling distribution of Gx INTUITION Consider the sampling distribution of sample means for samples of size 100 drawn from a population of salaries in which µ = 33,000 and σ = 5,000 E(Gx ) = 33,000 σ xg = 5, = 500.

2 In the z table find z values that demarcate the middle 95% of a normal distribution: z = ± 1.96 The interval µ 1.96 σ xg to µ σ xg contains 95% of the sampling distribution or contains 95% of all the possible Gx s that could ever be drawn from this population the interval noted is 33,000 ± or 33,000 ± 980. Any sample mean in this interval differs from the actual population mean by no more than % of all possible sample means differ from µ by no more than 980. for any Gx, there is a 95% probability that it differs from µ by no more than 980; that is, by no more than the amount 1.96 σ xg choose a sample and calculate Gx; now consider an interval of the form Gx ± 1.96 σ xg necessarily, the population mean m lies within the limits of 95% of all such intervals there is a 95% probability that the population mean lies within the limits of any interval of the form Gx ± 1.96 σ xg A C% confidence interval: An interval of the form A C% xg ± confidence z C σ interval xg for the population mean is given by where z C is chosen so that C% of the normal distribution Gx ± z C σ lies within the interval xg z C to +z C. CALCULATING CONFIDENCE INTERVALS FOR THE POPULATION MEAN C is the confidence level z C is found as a z value such that z C to +z C incorporates the middle C% of a normal distribution; ±z C demarcates a symmetric interval which has area C

3 Example: Find the appropriate z value for a 9% confidence interval the interval must be symmetric: take out the middle 9% leaves 8% to be split between upper and lower tails. The required z values demarcate the lower 4% and lower 96% of the z distribution. Alternatively, let L = (100 C)/; here L = (100 9)/ = 4% or In the cumulative z table find area The closest seems to be ; reading back to the margins z = 1.75; therefore the required z C = 1.75 Check by finding a z value such that 96% of the distribution is less than that value. Examples: A population of Christmas trees has unknown µ, but it is known that the population is normally distributed with σ = 4. A sample of 5 trees has Gx = Find a 95% confidence interval for the mean height of the population. given n = 5, so that σ xg = σ n = 4/5 = 0.8 L = (100 C)/ = 5/ =.5% or From the z table 0.05 of the z distribution is less than 1.96, so z C = applying the formula above Gx ± z C σ xg 16.6 ± ± 1.568, or the interval to Stating the interval: A 95% confidence interval for the population mean is 16.6 ± We are 95% confident that the population mean is in the interval to There is a 95% probability that µ is at least but no more than Find a 90% confidence interval for the same population, same sample find z 90 by reference to z table. L = (100 C)/ = the nearest entry is z = then we have 16.6 ± = 16.6 ± 1.31, or the interval to For a sample of 64 drawn from this population, we got the same Gx. Find a 90% confidence interval for the population mean. since n = 64, σ xg = 4 64 = 0.5 confidence interval: 16.6 ± = 16.6 ± 0.8, or the interval to 17.4

4 Messages: the width of the confidence interval varies in the same direction as the confidence level in our first example, width = = 3.136, while in the second example, width = =.64 width of the interval is z C σ xg : called the precision of the estimate there is a trade-off between precision and confidence common sense: for very wide intervals, we can be quite confident that we've captured µ, but as the interval narrows, the probability that it includes µ drops to zero As the sample size increases, precision increases at the same level of confidence the third interval above has width 1.64 with sufficiently large sample, we can achieve whatever combination of confidence and precision we desire as n increases, σ xg decreases FINDING THE RIGHT SAMPLE SIZE The distance e = z C σ xg is the error in the estimate e is one-half the width of the confidence interval within the limits of our confidence statement, we are sure that the population mean differs from the sample mean by no more than e: we might say we're 90% confident that the true mean differs from the sample mean by no more than e. Hence, e is the maximum error in the estimate Suppose that there is some maximum tolerable value for e, or maximum tolerable error for a given confidence level, the value of n necessary to keep e within tolerable limits σ e = zc, solve for n to find n z n = C σ e for given z, chosen for the appropriate confidence level, this formula gives us the sample size necessary to achieve an error of no more than e in general, the result of this calculation is not an integer, so the rule is to make the sample size equal to the next largest integer. NOTE CAREFULLY: This refers to the maximum tolerable error in the sampling procedure, or in the estimate of µ, NOT to the tolerance in a manufacturing process.

5 Examples: Cigarette filters are supposed to have µ = 15 mm in length; σ = 0.1 mm. Machinery will jam if the length of a filter exceeds 15.3 mm, and the probability of such a filter increases as the mean length increases; must have an accurate estimate of the mean length of filters. Let us require e 0.01 mm. and 90% confidence intervals. How large must n be? z n = C σ = e = the next greatest integer is the required sample size (that is, ALWAYS round n upwards in these problems); here n = 69 ordering T-shirts to give to contestants in a road race; average chest size unknown but for all chests everywhere σ = 4 in. Measure a sample of the participants when they register, and require that the sample be accurate to within ±1.5 in. How large must the sample be to have 99% confidence in the result? first, z 99 =? n = (.58 4 / 1.5) = rounding upwards, we require n = 48

6 CONFIDENCE INTERVALS II: σ UNKNOWN WHEN TO USE A z VALUE IN CONSTRUCTING CONFIDENCE INTERVALS To this point, we have assumed the population standard deviation known. IF NOT Population is normally distributed and σ NOT known the sampling distribution of Gx is NOT normal but rather conforms to Student's t distribution If population is NOT normal and σ is NOT known but the sample is large (that is, n 30), then the sampling distribution of Gx approximates the t distribution In either of these cases, s, the sample standard deviation, estimates σ. RECALL: s = Σ(x xg ) /(n 1) and s = s The standard error of the mean is estimated by s xg = s/ n confidence intervals have the form Gx ± t C s xg the t values used here are numbers of standard deviations in this case, numbers of standard deviations on a t distribution CHARACTERISTICS OF THE t DISTRIBUTION Continuous Symmetric Values near the mean are more probable than values further out so that t distribution looks like a bell-shaped curve. How is that any different from a normal distribution? 1. the t distribution has fatter tails and less mass in the center for a given number of standard deviations, probability is higher on the normal distribution than on a t distribution put another way, a given probability level will be further from the center of a t distribution that from the center of the normal distribution or, a given probability level will be more standard deviations (t values) away from the mean than would be the case on a normal distribution Note: t values will always be larger than z values for corresponding confidence level intervals constructed with t will always be wider (less precise) than those constructed with z

7 . there is not one t distribution but a large number, depending on the number of "degrees of freedom" Digression: the concept of degrees of freedom Mechanically df = n k, where n is the sample size and k the number of parameters that must be estimated from the sample before estimating the standard deviation for example: s, the sample standard deviation, is an estimate of σ. To calculate s, we must estimate µ. µ is estimated by xg, and xg is the only statistic we must calculate before we can calculate s. We must thus estimate one parameter, µ, before deriving and estimate of σ, and there are thus n 1 degrees of freedom in our estimate of σ more generally, degrees of freedom represents the number of independent (in the probability sense) random variables in a problem in calculating s we must use Gx. Suppose we are given Gx and n 1 of the values in the sample; then the n-th value is already determined and can be derived from what we know The t-distribution: pages E-7 and E-8 in your textbook how to read the table Upper tail (α) values across the top are the area in one tail of the distribution for a confidence interval use an upper tail value corresponding to the area in one tail of the distribution this will be only half the difference between the confidence level and 1 For example: in preparing a 95% confidence interval, there will be 5% in the tails of the distribution, thus 0.05 in each tail: we should use a t value for upper-tail area 0.05 and the appropriate number of degrees of freedom if C is the confidence level, expressed as decimal fraction, use α = (1 C)/ degrees of freedom are in the left hand column as df infinity, the t-value z value Examples: Find the appropriate t value for 0 degrees of freedom and 90% confidence interval. α = (1 0.9)/ = 0.05 t = for a sample of size 37, find the t value for a 99% confidence interval d.f. = n 1 = 36; α = (1 0.99)/ = t =.7195 CONFIDENCE INTERVAL FOR µ WITH NORMAL POPULATION AND σ UNKNOWN Problem requires use of t with n 1 degrees of freedom. Confidence intervals will have the form ( n 1) d. f. C x ± t s where s xg = s/ n, s being the sample standard deviation note similarity to earlier confidence intervals x

8 Examples: 7 male students are selected at random and an alcoholic beverage is poured down them in tenth-ounce increments until distinct signs of non-sobriety are observed. The following results were obtained: Individual Amount of Beverage (oz) Researchers feel safe in assuming that the distribution of ounces until non-sobriety is normal in the population. Construct a 95% confidence interval for amount of drink it takes to get the average member of the population drunk. calculate Gx and s: Gx = 3.39, s = Σ(x xg ) /(n 1) = [( ) + + ( ) ] (7 1) = s = = calculate s xg = s/ n = 0.846/ 7 = 0.846/.65 = find appropriate t value, for c =.95 and 6 df =.4469 multiply s xg by t value = Gx ± t s xg = 3.39 ± or the interval.546 to Each of 9 cars in a sample is driven 0,000 miles, the gallons of fuel used recorded, and the fuel mileage calculated. For the sample mean fuel mileage Gx = 34.6 and s = 1.. Assuming that the distribution of fuel mileage is normally distributed, find a 90% confidence interval for the mileage to be expected from all cars of this make. s xg = s/ n = (1.)/3 = 0.4 α = (1.9)/ = 0.05 and d.f. = n 1 = 8 t = Gx ± t s xg = 34.6 ± ± or the interval to In a sample of 41 students who work, xg = and s = Find a 95% confidence interval for the average hours worked by all ASU students who work. s xg = s n = = for 40 degrees of freedom, t 95 =.011 confidence interval: ± ± 1.8

9 We wish to establish the average weight of a population of turkeys; we have chosen a sample of 36, weighed them and have the following results: Construct a 98% confidence interval for the population mean of these turkeys first, find t C =.438 next, find Gx and s: Gx = 14.5, s = 4.90 find s xg = s/ n = 4.90/6 = Gx ± t C s xg 14.5 ± ± 1.99 or 1.6 to 16.4 SAMPLING DISTRIBUTIONS FOR SMALL SAMPLES The t distribution is often thought of as primarily of value with small samples applies whenever population is known to be normal and σ unknown, no matter how small n footnote: who was "Student"? A pseudonym for William Gosset, an Irish brewmaster concerned with controlling biochemical processes in brewing with large samples, if population is not normal, we must rely on Central Limit Theorem And many statisticians and other practitioners will use z procedures with any sample of 30 or more: this is especially prevalent in older practice Another possibility: sample is small, so that CLT does not apply population is not normally distributed or the distribution is unknown Safest course is to take a larger sample and rely on CLT

10 Following schematic may be used to determine proper distribution to use in constructing confidence intervals. Population standard deviation known? Yes No Population normal? Population normal? Yes No Yes No Sample Size Sample Size z value n >= 30 n < 30 n >= 30 n < 30 NOTES: z or t (see note) ERROR t value z or t (see note) ERROR 1. For a non-normal population and large samples, different practitioners may proceed differently. Some argue that the Central Limit Theorem justifies use of a z value in this case, while others feel that it is more appropriate to use a t value since that gives a less precise estimate (a wider confidence interval). For purposes of this course, use a t in such cases.. For small samples from non-normal populations: there are techniques which can be used to derive an interval estimate in this case, but they are beyond the scope of this course.

11 CONFIDENCE INTERVALS III CONFIDENCE INTERVAL FOR THE POPULATION PROPORTION Purpose: to use the sample proportion, p s, as the basis of an interval estimate of the population proportion p Reminders: the sample proportion p s = x/n the sampling distribution of p has parameters E(p s ) = p σ ps = p (1 p)/n p s is normally distributed, so that probabilities are found by reference to the z table typically p is unknown, so that we must estimate σ ps by s ps = [p s (1 p s )]/n A confidence interval for p then will have the form p s ± z C s ps Examples: of 55 students in a sample, 45 work. Construct a 95% confidence interval for the proportion in the population who work. p s = 45/55 = 0.8 s ps = [.8 (1.8)]/55 = z C = ±1.96 confidence interval: 0.8 ± ± 0.10 We are 95% confident that in the population somewhere between 7% and 9% work. In a sample of 800 North Carolinians 51% express the intention to vote for Jesse Helms in the next election. Find a 98% confidence interval for the proportion in the population who intend to vote for Helms. p s = 51%; s ps = (51 49)/800 = then we have 51 ± = 51% ± 4.1% or the interval 46.9% to 55.1% from this, we can say, strictly and properly, "We are 95% sure that the proportion in the population who intend to vote Helms is within 4.1% of 51%." or, as we might loosely and a bit improperly put it, "Our survey shows that 51% of the population intend to vote Helms, and this result is accurate to within plus or minus 4%." the election is a toss-up or too close to call.

12 Suppose same result with a sample of size n = 1600 s ps = 1.497, and s p z = =.9% confidence interval would be 51% ± 3% POINT: is the minor increase in precision worth the extra cost? FINDING THE NECESSARY SAMPLE SIZE IN PROPORTION PROBLEMS Since we have ± z C s ps, the estimate p s differs from p by at most that amount substituting the definition of s ps, the error is at most z C [p (1 p)]/n notice the use of p in the above expression; the concepts advanced here involve what we know about the sampling distribution before sampling begins for a given confidence level, this error can be reduced by increasing n in the last example above, we noted that doubling the sample size would reduce the error from 4% to 3% suppose we require e < 0.01, that is, accuracy to within ± 1%. How large must n be? the maximum error in the estimate: solve for n, giving n = p (1 e = e z C p) z C p ( 1 p) n

13 A major problem: p, the population proportion is unknown solution 1: assume p = 0.5 this will give largest possible value for n since p (1 p) reaches a maximum when p = 0.5 may result in an unnecessarily large and expensive sample solution : use other information do a pilot study on a small sample and use the resulting p s to estimate p previous experience or knowledge of other populations may give an approximate value for p lacks certainty of solution 1, but may result in somewhat smaller sample Examples: applying the formula above to solution 1, we have. 5 (1.5).33 = 0.01 n = 13,573 this is the sample size necessary to be absolutely sure that a 98% confidence interval is accurate to within ± 1% In the work example above, 95% confidence interval and sample of 55 gave accuracy of ±0.10. What sample size is necessary to hold the error to ±0.015 (1.5%)? solution 1: n = [( ) 1.96 ] = ; taking the next greatest integer, we have 469 solution : for n = 55, we had p s = 0.8. Take that as an estimate of the unknown p. Then n = [( ) 1.96 ] = or 50 using the pilot-study approach reduces the required sample size by more than 1,749 and might save a considerable amount of money A footnote: in most proportion problems, it doesn t matter whether you use percentages or decimal fractions, as long as you keep them straight. In the sample-size formula above, however, you must use decimal fractions. To use percentages, substitute 100 for 1, so the formula becomes n = [p* (100 p*) z ] e* where p* and e* are defined as percentages. THE z VS. THE t DISTRIBUTION In constructing confidence intervals, use the z distribution whenever the population standard deviation σ is known AND the population is known to normally distributed you wish to calculate a confidence interval for a proportion rule of thumb: n p 5 AND n (1 p) 5 for sufficiently accurate approximation In constructing confidence intervals, use the t distribution if the population is known to be normally distributed AND the population standard deviation σ is UNKNOWN: this holds for any sample size if the population s distribution is NOT normal AND the sample size is at least 30 AND the population standard deviation σ is UNKNOWN

5.1 Identifying the Target Parameter

5.1 Identifying the Target Parameter University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

Point and Interval Estimates

Point and Interval Estimates Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number

More information

Estimation and Confidence Intervals

Estimation and Confidence Intervals Estimation and Confidence Intervals Fall 2001 Professor Paul Glasserman B6014: Managerial Statistics 403 Uris Hall Properties of Point Estimates 1 We have already encountered two point estimators: th e

More information

Chapter 7 Review. Confidence Intervals. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter 7 Review. Confidence Intervals. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Chapter 7 Review Confidence Intervals MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) Suppose that you wish to obtain a confidence interval for

More information

Math 251, Review Questions for Test 3 Rough Answers

Math 251, Review Questions for Test 3 Rough Answers Math 251, Review Questions for Test 3 Rough Answers 1. (Review of some terminology from Section 7.1) In a state with 459,341 voters, a poll of 2300 voters finds that 45 percent support the Republican candidate,

More information

Unit 26 Estimation with Confidence Intervals

Unit 26 Estimation with Confidence Intervals Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference

More information

6 3 The Standard Normal Distribution

6 3 The Standard Normal Distribution 290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

More information

Objectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI)

Objectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI) Objectives 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) Statistical confidence (CIS gives a good explanation of a 95% CI) Confidence intervals. Further reading http://onlinestatbook.com/2/estimation/confidence.html

More information

Confidence Intervals for One Standard Deviation Using Standard Deviation

Confidence Intervals for One Standard Deviation Using Standard Deviation Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Constructing and Interpreting Confidence Intervals

Constructing and Interpreting Confidence Intervals Constructing and Interpreting Confidence Intervals Confidence Intervals In this power point, you will learn: Why confidence intervals are important in evaluation research How to interpret a confidence

More information

Lesson 17: Margin of Error When Estimating a Population Proportion

Lesson 17: Margin of Error When Estimating a Population Proportion Margin of Error When Estimating a Population Proportion Classwork In this lesson, you will find and interpret the standard deviation of a simulated distribution for a sample proportion and use this information

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Confidence Intervals for the Difference Between Two Means

Confidence Intervals for the Difference Between Two Means Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means

More information

A Short Guide to Significant Figures

A Short Guide to Significant Figures A Short Guide to Significant Figures Quick Reference Section Here are the basic rules for significant figures - read the full text of this guide to gain a complete understanding of what these rules really

More information

Need for Sampling. Very large populations Destructive testing Continuous production process

Need for Sampling. Very large populations Destructive testing Continuous production process Chapter 4 Sampling and Estimation Need for Sampling Very large populations Destructive testing Continuous production process The objective of sampling is to draw a valid inference about a population. 4-

More information

Pre-Algebra Lecture 6

Pre-Algebra Lecture 6 Pre-Algebra Lecture 6 Today we will discuss Decimals and Percentages. Outline: 1. Decimals 2. Ordering Decimals 3. Rounding Decimals 4. Adding and subtracting Decimals 5. Multiplying and Dividing Decimals

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

7 Confidence Intervals

7 Confidence Intervals blu49076_ch07.qxd 5/20/2003 3:15 PM Page 325 c h a p t e r 7 7 Confidence Intervals and Sample Size Outline 7 1 Introduction 7 2 Confidence Intervals for the Mean (s Known or n 30) and Sample Size 7 3

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Types of Error in Surveys

Types of Error in Surveys 2 Types of Error in Surveys Surveys are designed to produce statistics about a target population. The process by which this is done rests on inferring the characteristics of the target population from

More information

Chapter Study Guide. Chapter 11 Confidence Intervals and Hypothesis Testing for Means

Chapter Study Guide. Chapter 11 Confidence Intervals and Hypothesis Testing for Means OPRE504 Chapter Study Guide Chapter 11 Confidence Intervals and Hypothesis Testing for Means I. Calculate Probability for A Sample Mean When Population σ Is Known 1. First of all, we need to find out the

More information

Normal distribution. ) 2 /2σ. 2π σ

Normal distribution. ) 2 /2σ. 2π σ Normal distribution The normal distribution is the most widely known and used of all distributions. Because the normal distribution approximates many natural phenomena so well, it has developed into a

More information

The Margin of Error for Differences in Polls

The Margin of Error for Differences in Polls The Margin of Error for Differences in Polls Charles H. Franklin University of Wisconsin, Madison October 27, 2002 (Revised, February 9, 2007) The margin of error for a poll is routinely reported. 1 But

More information

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous

More information

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph. MBA/MIB 5315 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?

More information

Math and FUNDRAISING. Ex. 73, p. 111 1.3 0. 7

Math and FUNDRAISING. Ex. 73, p. 111 1.3 0. 7 Standards Preparation Connect 2.7 KEY VOCABULARY leading digit compatible numbers For an interactive example of multiplying decimals go to classzone.com. Multiplying and Dividing Decimals Gr. 5 NS 2.1

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

Chapter 7 - Practice Problems 1

Chapter 7 - Practice Problems 1 Chapter 7 - Practice Problems 1 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) Define a point estimate. What is the

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Review of basic statistics and the simplest forecasting model: the sample mean

Review of basic statistics and the simplest forecasting model: the sample mean Review of basic statistics and the simplest forecasting model: the sample mean Robert Nau Fuqua School of Business, Duke University August 2014 Most of what you need to remember about basic statistics

More information

8 6 X 2 Test for a Variance or Standard Deviation

8 6 X 2 Test for a Variance or Standard Deviation Section 8 6 x 2 Test for a Variance or Standard Deviation 437 This test uses the P-value method. Therefore, it is not necessary to enter a significance level. 1. Select MegaStat>Hypothesis Tests>Proportion

More information

Chapter 27: Taxation. 27.1: Introduction. 27.2: The Two Prices with a Tax. 27.2: The Pre-Tax Position

Chapter 27: Taxation. 27.1: Introduction. 27.2: The Two Prices with a Tax. 27.2: The Pre-Tax Position Chapter 27: Taxation 27.1: Introduction We consider the effect of taxation on some good on the market for that good. We ask the questions: who pays the tax? what effect does it have on the equilibrium

More information

Mind on Statistics. Chapter 10

Mind on Statistics. Chapter 10 Mind on Statistics Chapter 10 Section 10.1 Questions 1 to 4: Some statistical procedures move from population to sample; some move from sample to population. For each of the following procedures, determine

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

Probability. Distribution. Outline

Probability. Distribution. Outline 7 The Normal Probability Distribution Outline 7.1 Properties of the Normal Distribution 7.2 The Standard Normal Distribution 7.3 Applications of the Normal Distribution 7.4 Assessing Normality 7.5 The

More information

Revision Notes Adult Numeracy Level 2

Revision Notes Adult Numeracy Level 2 Revision Notes Adult Numeracy Level 2 Place Value The use of place value from earlier levels applies but is extended to all sizes of numbers. The values of columns are: Millions Hundred thousands Ten thousands

More information

1.7 Graphs of Functions

1.7 Graphs of Functions 64 Relations and Functions 1.7 Graphs of Functions In Section 1.4 we defined a function as a special type of relation; one in which each x-coordinate was matched with only one y-coordinate. We spent most

More information

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name: Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours

More information

Comparing Means in Two Populations

Comparing Means in Two Populations Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we

More information

How To Calculate Confidence Intervals In A Population Mean

How To Calculate Confidence Intervals In A Population Mean Chapter 8 Confidence Intervals 8.1 Confidence Intervals 1 8.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Calculate and interpret confidence intervals for one

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

8. THE NORMAL DISTRIBUTION

8. THE NORMAL DISTRIBUTION 8. THE NORMAL DISTRIBUTION The normal distribution with mean μ and variance σ 2 has the following density function: The normal distribution is sometimes called a Gaussian Distribution, after its inventor,

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Stat 5102 Notes: Nonparametric Tests and. confidence interval Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions

More information

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice! Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!) Part A - Multiple Choice Indicate the best choice

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A) ±1.88 B) ±1.645 C) ±1.96 D) ±2.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A) ±1.88 B) ±1.645 C) ±1.96 D) ±2. Ch. 6 Confidence Intervals 6.1 Confidence Intervals for the Mean (Large Samples) 1 Find a Critical Value 1) Find the critical value zc that corresponds to a 94% confidence level. A) ±1.88 B) ±1.645 C)

More information

Confidence Intervals for Cp

Confidence Intervals for Cp Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process

More information

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1. Lecture 6: Chapter 6: Normal Probability Distributions A normal distribution is a continuous probability distribution for a random variable x. The graph of a normal distribution is called the normal curve.

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Section 1.3 P 1 = 1 2. = 1 4 2 8. P n = 1 P 3 = Continuing in this fashion, it should seem reasonable that, for any n = 1, 2, 3,..., = 1 2 4.

Section 1.3 P 1 = 1 2. = 1 4 2 8. P n = 1 P 3 = Continuing in this fashion, it should seem reasonable that, for any n = 1, 2, 3,..., = 1 2 4. Difference Equations to Differential Equations Section. The Sum of a Sequence This section considers the problem of adding together the terms of a sequence. Of course, this is a problem only if more than

More information

= 2.0702 N(280, 2.0702)

= 2.0702 N(280, 2.0702) Name Test 10 Confidence Intervals Homework (Chpt 10.1, 11.1, 12.1) Period For 1 & 2, determine the point estimator you would use and calculate its value. 1. How many pairs of shoes, on average, do female

More information

COMP 250 Fall 2012 lecture 2 binary representations Sept. 11, 2012

COMP 250 Fall 2012 lecture 2 binary representations Sept. 11, 2012 Binary numbers The reason humans represent numbers using decimal (the ten digits from 0,1,... 9) is that we have ten fingers. There is no other reason than that. There is nothing special otherwise about

More information

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

7. Normal Distributions

7. Normal Distributions 7. Normal Distributions A. Introduction B. History C. Areas of Normal Distributions D. Standard Normal E. Exercises Most of the statistical analyses presented in this book are based on the bell-shaped

More information

The Standard Normal distribution

The Standard Normal distribution The Standard Normal distribution 21.2 Introduction Mass-produced items should conform to a specification. Usually, a mean is aimed for but due to random errors in the production process we set a tolerance

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9

More information

Chapter 3 Review Math 1030

Chapter 3 Review Math 1030 Section A.1: Three Ways of Using Percentages Using percentages We can use percentages in three different ways: To express a fraction of something. For example, A total of 10, 000 newspaper employees, 2.6%

More information

Social Studies 201 Notes for November 19, 2003

Social Studies 201 Notes for November 19, 2003 1 Social Studies 201 Notes for November 19, 2003 Determining sample size for estimation of a population proportion Section 8.6.2, p. 541. As indicated in the notes for November 17, when sample size is

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

2 ESTIMATION. Objectives. 2.0 Introduction

2 ESTIMATION. Objectives. 2.0 Introduction 2 ESTIMATION Chapter 2 Estimation Objectives After studying this chapter you should be able to calculate confidence intervals for the mean of a normal distribution with unknown variance; be able to calculate

More information

Solving Quadratic Equations

Solving Quadratic Equations 9.3 Solving Quadratic Equations by Using the Quadratic Formula 9.3 OBJECTIVES 1. Solve a quadratic equation by using the quadratic formula 2. Determine the nature of the solutions of a quadratic equation

More information

5.4 Solving Percent Problems Using the Percent Equation

5.4 Solving Percent Problems Using the Percent Equation 5. Solving Percent Problems Using the Percent Equation In this section we will develop and use a more algebraic equation approach to solving percent equations. Recall the percent proportion from the last

More information

Independent samples t-test. Dr. Tom Pierce Radford University

Independent samples t-test. Dr. Tom Pierce Radford University Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of

More information

MEASURES OF VARIATION

MEASURES OF VARIATION NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS Mathematics Revision Guides Histograms, Cumulative Frequency and Box Plots Page 1 of 25 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

More information

Year 9 set 1 Mathematics notes, to accompany the 9H book.

Year 9 set 1 Mathematics notes, to accompany the 9H book. Part 1: Year 9 set 1 Mathematics notes, to accompany the 9H book. equations 1. (p.1), 1.6 (p. 44), 4.6 (p.196) sequences 3. (p.115) Pupils use the Elmwood Press Essential Maths book by David Raymer (9H

More information

SAMPLING DISTRIBUTIONS

SAMPLING DISTRIBUTIONS 0009T_c07_308-352.qd 06/03/03 20:44 Page 308 7Chapter SAMPLING DISTRIBUTIONS 7.1 Population and Sampling Distributions 7.2 Sampling and Nonsampling Errors 7.3 Mean and Standard Deviation of 7.4 Shape of

More information

5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives. The Normal Distribution C H 6A P T E R The Normal Distribution Outline 6 1 6 2 Applications of the Normal Distribution 6 3 The Central Limit Theorem 6 4 The Normal Approximation to the Binomial Distribution

More information

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing 1) Hypothesis testing and confidence interval estimation are essentially two totally different statistical procedures

More information

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 1. Does vigorous exercise affect concentration? In general, the time needed for people to complete

More information

Linear Programming. Solving LP Models Using MS Excel, 18

Linear Programming. Solving LP Models Using MS Excel, 18 SUPPLEMENT TO CHAPTER SIX Linear Programming SUPPLEMENT OUTLINE Introduction, 2 Linear Programming Models, 2 Model Formulation, 4 Graphical Linear Programming, 5 Outline of Graphical Procedure, 5 Plotting

More information

Section 1-4 Functions: Graphs and Properties

Section 1-4 Functions: Graphs and Properties 44 1 FUNCTIONS AND GRAPHS I(r). 2.7r where r represents R & D ependitures. (A) Complete the following table. Round values of I(r) to one decimal place. r (R & D) Net income I(r).66 1.2.7 1..8 1.8.99 2.1

More information

1. How different is the t distribution from the normal?

1. How different is the t distribution from the normal? Statistics 101 106 Lecture 7 (20 October 98) c David Pollard Page 1 Read M&M 7.1 and 7.2, ignoring starred parts. Reread M&M 3.2. The effects of estimated variances on normal approximations. t-distributions.

More information

5.1 Radical Notation and Rational Exponents

5.1 Radical Notation and Rational Exponents Section 5.1 Radical Notation and Rational Exponents 1 5.1 Radical Notation and Rational Exponents We now review how exponents can be used to describe not only powers (such as 5 2 and 2 3 ), but also roots

More information

The Crescent Primary School Calculation Policy

The Crescent Primary School Calculation Policy The Crescent Primary School Calculation Policy Examples of calculation methods for each year group and the progression between each method. January 2015 Our Calculation Policy This calculation policy has

More information

Notes on Orthogonal and Symmetric Matrices MENU, Winter 2013

Notes on Orthogonal and Symmetric Matrices MENU, Winter 2013 Notes on Orthogonal and Symmetric Matrices MENU, Winter 201 These notes summarize the main properties and uses of orthogonal and symmetric matrices. We covered quite a bit of material regarding these topics,

More information

Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools. Tools for Summarizing Data Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

More information

Confidence intervals

Confidence intervals Confidence intervals Today, we re going to start talking about confidence intervals. We use confidence intervals as a tool in inferential statistics. What this means is that given some sample statistics,

More information

Unit 1 Number Sense. In this unit, students will study repeating decimals, percents, fractions, decimals, and proportions.

Unit 1 Number Sense. In this unit, students will study repeating decimals, percents, fractions, decimals, and proportions. Unit 1 Number Sense In this unit, students will study repeating decimals, percents, fractions, decimals, and proportions. BLM Three Types of Percent Problems (p L-34) is a summary BLM for the material

More information

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple

More information

Overview for Families

Overview for Families unit: Ratios and Rates Mathematical strand: Number The following pages will help you to understand the mathematics that your child is currently studying as well as the type of problems (s)he will solve

More information

Notes on Continuous Random Variables

Notes on Continuous Random Variables Notes on Continuous Random Variables Continuous random variables are random quantities that are measured on a continuous scale. They can usually take on any value over some interval, which distinguishes

More information

Probability Distributions

Probability Distributions Learning Objectives Probability Distributions Section 1: How Can We Summarize Possible Outcomes and Their Probabilities? 1. Random variable 2. Probability distributions for discrete random variables 3.

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

CHI-SQUARE: TESTING FOR GOODNESS OF FIT CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity

More information

Characteristics of Binomial Distributions

Characteristics of Binomial Distributions Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation

More information

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Describing Populations Statistically: The Mean, Variance, and Standard Deviation Describing Populations Statistically: The Mean, Variance, and Standard Deviation BIOLOGICAL VARIATION One aspect of biology that holds true for almost all species is that not every individual is exactly

More information

ALGEBRA. sequence, term, nth term, consecutive, rule, relationship, generate, predict, continue increase, decrease finite, infinite

ALGEBRA. sequence, term, nth term, consecutive, rule, relationship, generate, predict, continue increase, decrease finite, infinite ALGEBRA Pupils should be taught to: Generate and describe sequences As outcomes, Year 7 pupils should, for example: Use, read and write, spelling correctly: sequence, term, nth term, consecutive, rule,

More information