Math C067 Sampling Distributions

Save this PDF as:
 WORD  PNG  TXT  JPG

Size: px
Start display at page:

Download "Math C067 Sampling Distributions"

Transcription

1 Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters i favor of a particular presidetial cadidate by pollig a radom sample of voters. A statisticia may wat to estimate the average startig icome of ew college graduates by fidig the mea of a radom sample of ew college graduates. Numbers Populatio size: N Sample size: (usually much smaller tha N) Samplig with Replacemet Choose 1 perso, Choose a 2d perso (who could be the same as the first), Choose a 3rd perso (who could be the same as the first ad/or secod),..., Choose a th perso Samplig without Replacemet Choose differet people Rules of thumb Our formulas will be correct for samplig with replacemet. Our formulas will be approximately correct for samplig without replacemet provided that the populatio size N is large. 1

2 Sample Mea Suppose that we are tryig to estimate a radom variable that is based o a large populatio, e.g., we wat to estimate average icome. Radom Variable that we wish to estimate: X Expected value (mea) of X: µ Stadard deviatio of X: σ We defie a radom variable X called the Sample Mea via the followig radom process: Evaluate X at radom poits, i.e., choose radom people ad fid out how much each of them ears Fid the average of those values Formulas The expected value of X is the same as the the expected value of X, i.e., µ X = µ (correct eve if we sample without replacemet). The variace of X is smaller tha the variace of X by a factor of (the sample size), i.e., σ 2 X = 1 σ2 The stadard deviatio of X is smaller tha the stadard deviatio of X by a factor of (the sample size), i.e., σ X = 1 σ 2

3 Example: Test scores. Oe millio studets take a stadardized test. Their average score is 500. The stadard deviatio of their scores is 100. A statisticia chooses 400 studets at radom with replacemet ad computes the mea X of their test scores. What is the expected value of X? What is the stadard deviatio of X? µ X = µ = 500 σ X = 1 σ = = = 5 Note: these formulas do ot deped i ay way o the populatio size N. Example: Test scores. Oe millio studets take a stadardized test. Their average score is 500. The stadard deviatio of their scores is 100. A statisticia chooses a group of 400 differet studets at radom ad computes the mea X of their test scores. What is the expected value of X? What is the stadard deviatio of X? µ X = µ = 500. This formula works with or without replacemet. Because the populatio size is large we ca use the same formula we used with replacemet to estimate the stadard deviatio. σ X 1 σ = = = 5 Example: Average icome 10 millio people work i Metropolis. Their average icome is $100,000. The stadard deviatio of their icomes is A statisticia chooses 100 differet Metropolis workers at radom ad computes their average icome I. Was the sample chose with replacemet or without replacemet? What is the expected value of I? What is the stadard deviatio of I? What is the probability that I is betwee $99,500 ad $100,500. I is just a sample mea, so we ca call it X. (But the problem statemet ca call it aythig. It s up to you to recogize that I is the sample mea.) µ I = µ X = µ = σ I = σ X 1 σ = = = The Sample Mea is approximately ormally distributed so we ca use the methods of chapter 6 to estimate Pr[99500 X ]. We are lookig at 2 stadard deviatios o each side of the mea so Pr[99500 X ] = Note: 30. You ca approximate a Sample Mea by a Normal Distributio provided that 3

4 Sample Proportio Suppose that a fractio (proportio) p of a populatio favors cadidate C for presidet. We take a radom sample cosistig of people Let X be a radom variable that deotes the umber of people i the sample who favor cadidate C The the distributio of X is B(, p). Let ˆP be a radom variable that deotes the fractio (proportio) of people i the sample that favor cadidate C The ˆP = 1 X ˆP is called the Sample Proportio E(X) = p, E( ˆP ) = p Var(X) = p(1 p), Var( ˆP ) = p(1 p) σ X = p(1 p), σ ˆP = p(1 p) Example: A Electio Suppose that 55 percet of the voters favor Cadidate C for presidet. If a radom sample of 1000 voters is chose, estimate the probability that betwee 52 ad 58 percet of the voters i the sample will favor Cadidate C. To aswer this questio we ca work with B(, p) ad estimate Pr[520 X 580] or we ca work directly with the Sample Proportio ad ask whether Pr[0.52 ˆP 0.58]. Both radom variables X ad ˆP are approximately ormally distributed. Solve this o your ow ad the compare your aswer to the aswers o the ext page. 4

5 Solved Example: A Electio Suppose that 55 percet of the voters favor Cadidate C for presidet. If a radom sample of 1000 voters is chose, estimate the probability that betwee 52 ad 58 percet of the voters i the sample will favor Cadidate C. To aswer this questio we ca work with B(, p) ad estimate Pr[520 X 580] or we ca work directly with the Sample Proportio ad ask whether Pr[0.52 ˆP 0.58]. Both radom variables X ad ˆP are approximately ormally distributed. Solve this o your ow ad the compare your aswer to the aswers o the ext page. Solutio usig Biomial Distributio E[X] = p = = 550. Var[X] = pq = p(1 p) = = σ X = Var[X] = Pr[520 X 580] = Pr[519.5 X 580.5] = Pr[519.5 X 550] + Pr[550 < X 580.5] The first iterval cosists of stadard deviatios. The secod iterval cosists of stadard deviatios. Therefore Pr[520 X 580] = Did you get the correct aswer? If ot, go back ad try agai. Solutio usig Sample Proportio E[ ˆP ] = p = 0.55 Var[ ˆP ] = pq σ ˆP = = p(1 p) = = Var[ ˆP ] = Pr[0.52 ˆP 0.58] = Pr[0.52 ˆP 0.55] + Pr[0.55 < ˆP 0.58] The first iterval cosists of stadard deviatios. The secod iterval cosists of stadard deviatios. Therefore Pr[0.52 ˆP 058] = (The aswer we got by usig the Biomial Distributio was more accurate because we expaded the iterval by 0.5 i both directios.) Did you get the correct aswer? If ot, go back ad try agai. 5

6 Solved Example: 1936 Presidetial Electio Suppose that 61% of the voters favor Cadidate R for Presidet. You poll a radom sample cosistig of 3000 voters. What is the probability that a majority of your sample favors Cadidate R? Try this yourself before readig o. 6

7 Solved Example: 1936 Presidetial Electio Suppose that 61% of the voters favor Cadidate R for Presidet. You poll a radom sample cosistig of 3000 voters. What is the probability that a majority of your sample favors Cadidate R? Solutio usig Biomial Distributio Let X deote the umber of voters i your sample who favor Cadidate R. The distributio of X is B(3000, 0.61). We wish to estimate Pr[X > 1500], which is the same as Pr[X 1501]. E[X] = p = = Var[X] = pq = p(1 p) = = σ X = Var[X] = Pr[X 1501] = Pr[X ] = Pr[ X] = Pr[ X 1830] + Pr[1830 < X] The first iterval cosists of approximately stadard deviatios. The secod iterval cosists of half of the distributio. Therefore, Solutio usig Sample Proportio E[ ˆP ] = p = 0.61 Var[ ˆP ] = pq σ ˆP = Pr[X 1501] = 1.0 = p(1 p) = = Var[ ˆP ] = Pr[ ˆP > 0.5] = Pr[0.5 < ˆP ] = Pr[0.5 < ˆP 0.61] + Pr[0.61 < ˆP ] The first iterval cosists of approximately stadard deviatios. The secod iterval cosists of half of the distributio. Therefore, Pr[ ˆP > 0.5] = 1.0 7

8 Gallup s Gambit I a 1935, George Gallup came up with a clever marketig strategy for his opiio polls. Gallup bet that he would correctly predict the results of the 1936 Presidetial Electio. He promised to refud all subscriptio fees for the etire year if his predictio was icorrect. Gallup did predict the Electio results correctly based o a sample of approximately 3000 voters. Reader s Digest icorrectly predicted the result based o a sample of 10,000,000 voters. How could Gallup predict correctly based o such a small sample? Because the actual electio was t really close, eve his small sample provided a comfort level of 12 stadard deviatios. How could Reader s Digest predict icorrectly based o such a large sample? Their sample was ot represetative, but cosisted maily of people who had a car or a telephoe. Lesso: Radomess is much more importat tha sample size. (Accuracy i samplig is also importat, sice some people lie to pollsters. There is a sciece to desigig questioaires.) Some recet electios have bee decided by less tha a 1% margi. It is iterestig to look at how Gallup s poll might have tured out, had the 1936 Electio ot bee a ladslide. Example: What if the Electio had bee close? Suppose that 50.5% of the voters favor Cadidate R for Presidet. You poll a radom sample cosistig of 3000 voters. What is the probability that a majority of your sample favors Cadidate R? Try this oe yourself before readig the solutio. 8

9 Example: What if the Electio had bee close? Suppose that 50.5% of the voters favor Cadidate R for Presidet. You poll a radom sample cosistig of 3000 voters. What is the probability that a majority of your sample favors Cadidate R? Solutio usig Biomial Distributio Let X deote the umber of voters i your sample who favor Cadidate R. The distributio of X is B(3000, 0.505). We wish to estimate Pr[X > 1500], which is the same as Pr[X 1501]. E[X] = p = = Var[X] = pq = p(1 p) = = σ X = Var[X] = Pr[X 1501] = Pr[X ] = Pr[ X] = Pr[ X 1515] + Pr[1515 < X] The first iterval cosists of approximately stadard deviatios. The secod iterval cosists of half of the distributio. Therefore, Pr[X 1501] = Thus Gallup s chace of predictig the Electio correctly would have bee about 70%, assumig he used a represetative sample. If his sample was skewed (like Reader s Digest s), the his chace could have bee much smaller. Solutio usig Sample Proportio E[ ˆP ] = p = Var[ ˆP ] = pq σ ˆP = = p(1 p) = = Var[ ˆP ] = Pr[ ˆP > 0.5] = Pr[0.5 < ˆP ] = Pr[0.5 < ˆP 0.505] + Pr[0.505 < ˆP ] The first iterval cosists of approximately stadard deviatios. The secod iterval cosists of half of the distributio. Therefore, Pr[ ˆP > 0.5] = Thus Gallup s chace of predictig the Electio correctly would have bee about 71%, assumig he used a represetative sample. If his sample was skewed (like Reader s Digest s), the his chace could have bee much smaller. 9

10 Example: How good was Gallup s Sample? Oly 54% of Gallup s sample favored Roosevelt. Suppose that 61% of the voters favor Cadidate R for Presidet. You poll a radom sample cosistig of 3000 voters. What is the probability that 54% or less your sample favors Cadidate R? Try this yourself before readig o. 10

11 Solved Example: How good was Gallup s Sample? Suppose that 61% of the voters favor Cadidate R for Presidet. You poll a radom sample cosistig of 3000 voters. What is the probability that 54% or less your sample favors Cadidate R? Solutio usig Biomial Distributio Let X deote the umber of voters i your sample who favor Cadidate R. The distributio of X is B(3000, 0.61). 54% of 3000 is We wish to estimate Pr[X 1620]. E[X] = p = = Var[X] = pq = p(1 p) = = σ X = Var[X] = Pr[X 1620] = Pr[X ] = Pr[X 1830] Pr[ < X 1830] The first iterval cosists of half of the distributio stadard deviatios. Therefore, The secod iterval cosists of Pr[X 1620] = 0.0 This suggests that Gallup s sample was ot close to uiform, i.e., ot represetative of the populatio. Perhaps Gallup just got lucky. They say that Fortue favors the bold. Solutio usig Sample Proportio E[ ˆP ] = p = 0.61 Var[ ˆP ] = pq σ ˆP = = p(1 p) = = Var[ ˆP ] = Pr[ ˆP 0.54] = Pr[ ˆP 0.61] Pr[0.54 < ˆP 0.61] The first iterval cosists of approximately stadard deviatios. The secod iterval cosists of half of the distributio. Therefore, Pr[ ˆP 0.54] = 0.0 This suggests that Gallup s sample was ot close to uiform, i.e., ot represetative of the populatio. Perhaps Gallup just got lucky. They say that Fortue favors the bold. 11

12 Gallup vs. the Tout If the 1936 Electio had bee a close oe (decided by oly a 1% margi), there is substatial probability (30% or more) that Gallup would have predicted it wrog. Was Gallup takig a big risk? Perhaps Gallup would have take a larger sample if it looked like the electio was goig to be close. Or, perhaps Gallup had othig to lose. Suppose that you are gambler. A tout is a perso who sells predictios, usually writte o somethig called a tout sheet. For example, the tout might predict whether the Phillies will beat the Cardials toight. The tout charges $1 for his predictio but he always offers a moey-back guaratee. Key facts about tout sheets: 1. Tout sheets are worthless. Why? I reality, the tout will tell oe customer that Philadelphia will wi ad tell aother customer that Philadelphia will lose. So half of the predictios are guarateed to be correct. (Not like Gallup Polls) 2. The tout ever loses. Whe he predicts correctly, he ears $1. Whe he predicts icorrectly he ears $0. (Like Gallup, although Gallup also exteded his offer to previous customers.) 3. Like most somethig-for-othig schemes, sellig tout sheets is illegal. (Not like Gallup.) 4. Touts thrive o repeat busiess. People who wi are likely to retur to the tout for aother predictio, which is usually more expesive tha the first. (Like Gallup, although I do t thik he jacked up his rates.) 5. A tout s customers are kow i gamblig circles as suckers. (Like Gallup s customers, if they were fooled by the gambit.) 12

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

Determining the sample size

Determining the sample size Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors

More information

Confidence Intervals for the Mean of Non-normal Data Class 23, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Confidence Intervals for the Mean of Non-normal Data Class 23, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom Cofidece Itervals for the Mea of No-ormal Data Class 23, 8.05, Sprig 204 Jeremy Orloff ad Joatha Bloom Learig Goals. Be able to derive the formula for coservative ormal cofidece itervals for the proportio

More information

1. C. The formula for the confidence interval for a population mean is: x t, which was

1. C. The formula for the confidence interval for a population mean is: x t, which was s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

Sampling Distribution And Central Limit Theorem

Sampling Distribution And Central Limit Theorem () Samplig Distributio & Cetral Limit Samplig Distributio Ad Cetral Limit Samplig distributio of the sample mea If we sample a umber of samples (say k samples where k is very large umber) each of size,

More information

Section 7-3 Estimating a Population. Requirements

Section 7-3 Estimating a Population. Requirements Sectio 7-3 Estimatig a Populatio Mea: σ Kow Key Cocept This sectio presets methods for usig sample data to fid a poit estimate ad cofidece iterval estimate of a populatio mea. A key requiremet i this sectio

More information

Confidence Intervals for the Population Mean

Confidence Intervals for the Population Mean Cofidece Itervals Math 283 Cofidece Itervals for the Populatio Mea Recall that from the empirical rule that the iterval of the mea plus/mius 2 times the stadard deviatio will cotai about 95% of the observatios.

More information

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:

More information

Measures of Central Tendency

Measures of Central Tendency Measures of Cetral Tedecy A studet s grade will be determied by exam grades ( each exam couts twice ad there are three exams, HW average (couts oce, fial exam ( couts three times. Fid the average if the

More information

Confidence Intervals

Confidence Intervals Cofidece Itervals Cofidece Itervals are a extesio of the cocept of Margi of Error which we met earlier i this course. Remember we saw: The sample proportio will differ from the populatio proportio by more

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

Key Ideas Section 8-1: Overview hypothesis testing Hypothesis Hypothesis Test Section 8-2: Basics of Hypothesis Testing Null Hypothesis

Key Ideas Section 8-1: Overview hypothesis testing Hypothesis Hypothesis Test Section 8-2: Basics of Hypothesis Testing Null Hypothesis Chapter 8 Key Ideas Hypothesis (Null ad Alterative), Hypothesis Test, Test Statistic, P-value Type I Error, Type II Error, Sigificace Level, Power Sectio 8-1: Overview Cofidece Itervals (Chapter 7) are

More information

Using Excel to Construct Confidence Intervals

Using Excel to Construct Confidence Intervals OPIM 303 Statistics Ja Stallaert Usig Excel to Costruct Cofidece Itervals This hadout explais how to costruct cofidece itervals i Excel for the followig cases: 1. Cofidece Itervals for the mea of a populatio

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

Definition. Definition. 7-2 Estimating a Population Proportion. Definition. Definition

Definition. Definition. 7-2 Estimating a Population Proportion. Definition. Definition 7- stimatig a Populatio Proportio I this sectio we preset methods for usig a sample proportio to estimate the value of a populatio proportio. The sample proportio is the best poit estimate of the populatio

More information

Review for 1 sample CI Name. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Review for 1 sample CI Name. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Review for 1 sample CI Name MULTIPLE CHOICE. Choose the oe alterative that best completes the statemet or aswers the questio. Fid the margi of error for the give cofidece iterval. 1) A survey foud that

More information

Statistics Lecture 14. Introduction to Inference. Administrative Notes. Hypothesis Tests. Last Class: Confidence Intervals

Statistics Lecture 14. Introduction to Inference. Administrative Notes. Hypothesis Tests. Last Class: Confidence Intervals Statistics 111 - Lecture 14 Itroductio to Iferece Hypothesis Tests Admiistrative Notes Sprig Break! No lectures o Tuesday, March 8 th ad Thursday March 10 th Exteded Sprig Break! There is o Stat 111 recitatio

More information

Measures of Spread and Boxplots Discrete Math, Section 9.4

Measures of Spread and Boxplots Discrete Math, Section 9.4 Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

x : X bar Mean (i.e. Average) of a sample

x : X bar Mean (i.e. Average) of a sample A quick referece for symbols ad formulas covered i COGS14: MEAN OF SAMPLE: x = x i x : X bar Mea (i.e. Average) of a sample x i : X sub i This stads for each idividual value you have i your sample. For

More information

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

Confidence Intervals and Sample Size

Confidence Intervals and Sample Size 8/7/015 C H A P T E R S E V E N Cofidece Itervals ad Copyright 015 The McGraw-Hill Compaies, Ic. Permissio required for reproductio or display. 1 Cofidece Itervals ad Outlie 7-1 Cofidece Itervals for the

More information

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles The followig eample will help us uderstad The Samplig Distributio of the Mea Review: The populatio is the etire collectio of all idividuals or objects of iterest The sample is the portio of the populatio

More information

AQA STATISTICS 1 REVISION NOTES

AQA STATISTICS 1 REVISION NOTES AQA STATISTICS 1 REVISION NOTES AVERAGES AND MEASURES OF SPREAD www.mathsbox.org.uk Mode : the most commo or most popular data value the oly average that ca be used for qualitative data ot suitable if

More information

Chapter 10. Hypothesis Tests Regarding a Parameter. 10.1 The Language of Hypothesis Testing

Chapter 10. Hypothesis Tests Regarding a Parameter. 10.1 The Language of Hypothesis Testing Chapter 10 Hypothesis Tests Regardig a Parameter A secod type of statistical iferece is hypothesis testig. Here, rather tha use either a poit (or iterval) estimate from a simple radom sample to approximate

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Meas ad Proportios Itroductio: We wat to kow the value of a parameter for a populatio. We do t kow the value of this parameter for the etire populatio because

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

One-sample test of proportions

One-sample test of proportions Oe-sample test of proportios The Settig: Idividuals i some populatio ca be classified ito oe of two categories. You wat to make iferece about the proportio i each category, so you draw a sample. Examples:

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

Practice Problems for Test 3

Practice Problems for Test 3 Practice Problems for Test 3 Note: these problems oly cover CIs ad hypothesis testig You are also resposible for kowig the samplig distributio of the sample meas, ad the Cetral Limit Theorem Review all

More information

Overview of some probability distributions.

Overview of some probability distributions. Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability

More information

Ch 7.1 pg. 364 #11, 13, 15, 17, 19, 21, 23, 25

Ch 7.1 pg. 364 #11, 13, 15, 17, 19, 21, 23, 25 Math 7 Elemetary Statistics: A Brief Versio, 5/e Bluma Ch 7.1 pg. 364 #11, 13, 15, 17, 19, 1, 3, 5 11. Readig Scores: A sample of the readig scores of 35 fifth-graders has a mea of 8. The stadard deviatio

More information

Hypothesis Tests Applied to Means

Hypothesis Tests Applied to Means The Samplig Distributio of the Mea Hypothesis Tests Applied to Meas Recall that the samplig distributio of the mea is the distributio of sample meas that would be obtaied from a particular populatio (with

More information

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number. GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all

More information

Chapter 7: Confidence Interval and Sample Size

Chapter 7: Confidence Interval and Sample Size Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown Z-TEST / Z-STATISTIC: used to test hypotheses about µ whe the populatio stadard deviatio is kow ad populatio distributio is ormal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses about

More information

Chapter 5 Discrete Probability Distributions

Chapter 5 Discrete Probability Distributions Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide Chapter 5 Discrete Probability Distributios Radom Variables Discrete Probability Distributios Epected Value ad Variace Poisso Distributio

More information

3. Covariance and Correlation

3. Covariance and Correlation Virtual Laboratories > 3. Expected Value > 1 2 3 4 5 6 3. Covariace ad Correlatio Recall that by takig the expected value of various trasformatios of a radom variable, we ca measure may iterestig characteristics

More information

Confidence Intervals for One Mean with Tolerance Probability

Confidence Intervals for One Mean with Tolerance Probability Chapter 421 Cofidece Itervals for Oe Mea with Tolerace Probability Itroductio This procedure calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) with

More information

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

Research Method (I) --Knowledge on Sampling (Simple Random Sampling) Research Method (I) --Kowledge o Samplig (Simple Radom Samplig) 1. Itroductio to samplig 1.1 Defiitio of samplig Samplig ca be defied as selectig part of the elemets i a populatio. It results i the fact

More information

Section 7.2 Confidence Interval for a Proportion

Section 7.2 Confidence Interval for a Proportion Sectio 7.2 Cofidece Iterval for a Proportio Before ay ifereces ca be made about a proportio, certai coditios must be satisfied: 1. The sample must be a SRS from the populatio of iterest. 2. The populatio

More information

Repeating Decimals are decimal numbers that have number(s) after the decimal point that repeat in a pattern.

Repeating Decimals are decimal numbers that have number(s) after the decimal point that repeat in a pattern. 5.5 Fractios ad Decimals Steps for Chagig a Fractio to a Decimal. Simplify the fractio, if possible. 2. Divide the umerator by the deomiator. d d Repeatig Decimals Repeatig Decimals are decimal umbers

More information

Gregory Carey, 1998 Linear Transformations & Composites - 1. Linear Transformations and Linear Composites

Gregory Carey, 1998 Linear Transformations & Composites - 1. Linear Transformations and Linear Composites Gregory Carey, 1998 Liear Trasformatios & Composites - 1 Liear Trasformatios ad Liear Composites I Liear Trasformatios of Variables Meas ad Stadard Deviatios of Liear Trasformatios A liear trasformatio

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

Covariance and correlation

Covariance and correlation Covariace ad correlatio The mea ad sd help us summarize a buch of umbers which are measuremets of just oe thig. A fudametal ad totally differet questio is how oe thig relates to aother. Stat 0: Quatitative

More information

Unit 20 Hypotheses Testing

Unit 20 Hypotheses Testing Uit 2 Hypotheses Testig Objectives: To uderstad how to formulate a ull hypothesis ad a alterative hypothesis about a populatio proportio, ad how to choose a sigificace level To uderstad how to collect

More information

Hypothesis testing in a Nutshell

Hypothesis testing in a Nutshell Hypothesis testig i a Nutshell Summary by Pamela Peterso Drake Itroductio The purpose of this readig is to discuss aother aspect of statistical iferece, testig. A is a statemet about the value of a populatio

More information

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportios HT - Iferece o Proportio Parameter: Populatio Proportio p (or π) (Percetage of people has o health isurace) x Statistic: Sample Proportio

More information

Lesson 17 Pearson s Correlation Coefficient

Lesson 17 Pearson s Correlation Coefficient Outlie Measures of Relatioships Pearso s Correlatio Coefficiet (r) -types of data -scatter plots -measure of directio -measure of stregth Computatio -covariatio of X ad Y -uique variatio i X ad Y -measurig

More information

1 Correlation and Regression Analysis

1 Correlation and Regression Analysis 1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio

More information

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals Overview Estimatig the Value of a Parameter Usig Cofidece Itervals We apply the results about the sample mea the problem of estimatio Estimatio is the process of usig sample data estimate the value of

More information

Discrete Random Variables and Probability Distributions. Random Variables. Chapter 3 3.1

Discrete Random Variables and Probability Distributions. Random Variables. Chapter 3 3.1 UCLA STAT A Applied Probability & Statistics for Egieers Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistat: Neda Farziia, UCLA Statistics Uiversity of Califoria, Los Ageles, Sprig

More information

7. Sample Covariance and Correlation

7. Sample Covariance and Correlation 1 of 8 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 7. Sample Covariace ad Correlatio The Bivariate Model Suppose agai that we have a basic radom experimet, ad that X ad Y

More information

Hypergeometric Distributions

Hypergeometric Distributions 7.4 Hypergeometric Distributios Whe choosig the startig lie-up for a game, a coach obviously has to choose a differet player for each positio. Similarly, whe a uio elects delegates for a covetio or you

More information

7.1 Inference for a Population Proportion

7.1 Inference for a Population Proportion 7.1 Iferece for a Populatio Proportio Defiitio. The statistic that estimates the parameter p is the sample proportio cout of successes i the sample ˆp = cout of observatios i the sample. Assumptios for

More information

4.1 Sigma Notation and Riemann Sums

4.1 Sigma Notation and Riemann Sums 0 the itegral. Sigma Notatio ad Riema Sums Oe strategy for calculatig the area of a regio is to cut the regio ito simple shapes, calculate the area of each simple shape, ad the add these smaller areas

More information

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error STA 2023 Practice Questios Exam 2 Chapter 7- sec 9.2 Formulas Give o the test: Case parameter estimator stadard error Estimate of stadard error Samplig Distributio oe mea x s t (-1) oe p ( 1 p) CI: prop.

More information

23.3 Sampling Distributions

23.3 Sampling Distributions COMMON CORE Locker LESSON Commo Core Math Stadards The studet is expected to: COMMON CORE S-IC.B.4 Use data from a sample survey to estimate a populatio mea or proportio; develop a margi of error through

More information

9.8: THE POWER OF A TEST

9.8: THE POWER OF A TEST 9.8: The Power of a Test CD9-1 9.8: THE POWER OF A TEST I the iitial discussio of statistical hypothesis testig, the two types of risks that are take whe decisios are made about populatio parameters based

More information

8.1 Arithmetic Sequences

8.1 Arithmetic Sequences MCR3U Uit 8: Sequeces & Series Page 1 of 1 8.1 Arithmetic Sequeces Defiitio: A sequece is a comma separated list of ordered terms that follow a patter. Examples: 1, 2, 3, 4, 5 : a sequece of the first

More information

Chapter 14 Nonparametric Statistics

Chapter 14 Nonparametric Statistics Chapter 14 Noparametric Statistics A.K.A. distributio-free statistics! Does ot deped o the populatio fittig ay particular type of distributio (e.g, ormal). Sice these methods make fewer assumptios, they

More information

Notes on Hypothesis Testing

Notes on Hypothesis Testing Probability & Statistics Grishpa Notes o Hypothesis Testig A radom sample X = X 1,..., X is observed, with joit pmf/pdf f θ x 1,..., x. The values x = x 1,..., x of X lie i some sample space X. The parameter

More information

1 Hypothesis testing for a single mean

1 Hypothesis testing for a single mean BST 140.65 Hypothesis Testig Review otes 1 Hypothesis testig for a sigle mea 1. The ull, or status quo, hypothesis is labeled H 0, the alterative H a or H 1 or H.... A type I error occurs whe we falsely

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics

More information

OMG! Excessive Texting Tied to Risky Teen Behaviors

OMG! Excessive Texting Tied to Risky Teen Behaviors BUSIESS WEEK: EXECUTIVE EALT ovember 09, 2010 OMG! Excessive Textig Tied to Risky Tee Behaviors Kids who sed more tha 120 a day more likely to try drugs, alcohol ad sex, researchers fid TUESDAY, ov. 9

More information

Sequences II. Chapter 3. 3.1 Convergent Sequences

Sequences II. Chapter 3. 3.1 Convergent Sequences Chapter 3 Sequeces II 3. Coverget Sequeces Plot a graph of the sequece a ) = 2, 3 2, 4 3, 5 + 4,...,,... To what limit do you thik this sequece teds? What ca you say about the sequece a )? For ǫ = 0.,

More information

15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011

15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011 15.075 Exam 3 Istructor: Cythia Rudi TA: Dimitrios Bisias November 22, 2011 Gradig is based o demostratio of coceptual uderstadig, so you eed to show all of your work. Problem 1 A compay makes high-defiitio

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006 Exam format UC Bereley Departmet of Electrical Egieerig ad Computer Sciece EE 6: Probablity ad Radom Processes Solutios 9 Sprig 006 The secod midterm will be held o Wedesday May 7; CHECK the fial exam

More information

TIEE Teaching Issues and Experiments in Ecology - Volume 1, January 2004

TIEE Teaching Issues and Experiments in Ecology - Volume 1, January 2004 TIEE Teachig Issues ad Experimets i Ecology - Volume 1, Jauary 2004 EXPERIMENTS Evirometal Correlates of Leaf Stomata Desity Bruce W. Grat ad Itzick Vatick Biology, Wideer Uiversity, Chester PA, 19013

More information

7818 Interval estimation and hypothesis testing - Set

7818 Interval estimation and hypothesis testing - Set 7 7818 Iterval estimatio ad hypothesis testig - Set revised Nov 9, 010 You might wat to read some of the chapter i MGB o Parametric Iterval Estimatio. There are subtle di ereces across questios. uderstad

More information

Stat 104 Lecture 16. Statistics 104 Lecture 16 (IPS 6.1) Confidence intervals - the general concept

Stat 104 Lecture 16. Statistics 104 Lecture 16 (IPS 6.1) Confidence intervals - the general concept Statistics 104 Lecture 16 (IPS 6.1) Outlie for today Cofidece itervals Cofidece itervals for a mea, µ (kow σ) Cofidece itervals for a proportio, p Margi of error ad sample size Review of mai topics for

More information

7 b) 0. Guided Notes for lesson P.2 Properties of Exponents. If a, b, x, y and a, b, 0, and m, n Z then the following properties hold: 1 n b

7 b) 0. Guided Notes for lesson P.2 Properties of Exponents. If a, b, x, y and a, b, 0, and m, n Z then the following properties hold: 1 n b Guided Notes for lesso P. Properties of Expoets If a, b, x, y ad a, b, 0, ad m, Z the the followig properties hold:. Negative Expoet Rule: b ad b b b Aswers must ever cotai egative expoets. Examples: 5

More information

Maximum Likelihood Estimators.

Maximum Likelihood Estimators. Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio

More information

Example Consider the following set of data, showing the number of times a sample of 5 students check their per day:

Example Consider the following set of data, showing the number of times a sample of 5 students check their  per day: Sectio 82: Measures of cetral tedecy Whe thikig about questios such as: how may calories do I eat per day? or how much time do I sped talkig per day?, we quickly realize that the aswer will vary from day

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

Lesson 12. Sequences and Series

Lesson 12. Sequences and Series Retur to List of Lessos Lesso. Sequeces ad Series A ifiite sequece { a, a, a,... a,...} ca be thought of as a list of umbers writte i defiite order ad certai patter. It is usually deoted by { a } =, or

More information

1 The Binomial Theorem: Another Approach

1 The Binomial Theorem: Another Approach The Biomial Theorem: Aother Approach Pascal s Triagle I class (ad i our text we saw that, for iteger, the biomial theorem ca be stated (a + b = c a + c a b + c a b + + c ab + c b, where the coefficiets

More information

Confidence Intervals

Confidence Intervals 1 Cofidece Itervals Recall: Iferetial statistics are used to make predictios ad decisios about a populatio based o iformatio from a sample. The two major applicatios of iferetial statistics ivolve the

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable Week 3 Coditioal probabilities, Bayes formula, WEEK 3 page 1 Expected value of a radom variable We recall our discussio of 5 card poker hads. Example 13 : a) What is the probability of evet A that a 5

More information

Hypothesis Testing. Definitions. H 0 : The Null Hypothesis This is the hypothesis or claim that is initially assumed to be true.

Hypothesis Testing. Definitions. H 0 : The Null Hypothesis This is the hypothesis or claim that is initially assumed to be true. Hypothesis Testig Hypothesis testig allows us to use a sample to decide betwee two statemets made about a Populatio characteristic. These two statemets are called the Null Hypothesis ad the Alterative

More information

Estimating the Mean and Variance of a Normal Distribution

Estimating the Mean and Variance of a Normal Distribution Estimatig the Mea ad Variace of a Normal Distributio Learig Objectives After completig this module, the studet will be able to eplai the value of repeatig eperimets eplai the role of the law of large umbers

More information

3. Continuous Random Variables

3. Continuous Random Variables Statistics ad probability: 3-1 3. Cotiuous Radom Variables A cotiuous radom variable is a radom variable which ca take values measured o a cotiuous scale e.g. weights, stregths, times or legths. For ay

More information

Riemann Sums y = f (x)

Riemann Sums y = f (x) Riema Sums Recall that we have previously discussed the area problem I its simplest form we ca state it this way: The Area Problem Let f be a cotiuous, o-egative fuctio o the closed iterval [a, b] Fid

More information

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here). BEGINNING ALGEBRA Roots ad Radicals (revised summer, 00 Olso) Packet to Supplemet the Curret Textbook - Part Review of Square Roots & Irratioals (This portio ca be ay time before Part ad should mostly

More information

Lesson 15 ANOVA (analysis of variance)

Lesson 15 ANOVA (analysis of variance) Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi

More information

The Stable Marriage Problem

The Stable Marriage Problem The Stable Marriage Problem William Hut Lae Departmet of Computer Sciece ad Electrical Egieerig, West Virgiia Uiversity, Morgatow, WV William.Hut@mail.wvu.edu 1 Itroductio Imagie you are a matchmaker,

More information

Confidence intervals and hypothesis tests

Confidence intervals and hypothesis tests Chapter 2 Cofidece itervals ad hypothesis tests This chapter focuses o how to draw coclusios about populatios from sample data. We ll start by lookig at biary data (e.g., pollig), ad lear how to estimate

More information

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem Lecture 4: Cauchy sequeces, Bolzao-Weierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits

More information

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. This documet was writte ad copyrighted by Paul Dawkis. Use of this documet ad its olie versio is govered by the Terms ad Coditios of Use located at http://tutorial.math.lamar.edu/terms.asp. The olie versio

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics We leared to describe data sets graphically. We ca also describe a data set umerically. Measures of Locatio Defiitio The sample mea is the arithmetic average of values. We deote

More information

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean 1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.

More information

Elementary Theory of Russian Roulette

Elementary Theory of Russian Roulette Elemetary Theory of Russia Roulette -iterestig patters of fractios- Satoshi Hashiba Daisuke Miematsu Ryohei Miyadera Itroductio. Today we are goig to study mathematical theory of Russia roulette. If some

More information