the proportion of voters who intend on voting for the incumbent prime minister in the next election.
|
|
- Arthur Elliott
- 7 years ago
- Views:
Transcription
1 1 Iferece for Populatio Proportios So far we were iterested i estimatig ad aswerig questios for populatio meas. I these cases our parameters of iterest were either a populatio mea µ or a differece betwee two populatio meas, µ 1 µ 2. We will ow study the aalysis of populatio proportios: the proportio of voters who ited o votig for the icumbet prime miister i the ext electio. the proportio of cacer patiets who are goig to survive at least 5 years after treatmet the proportio of batteries which lasts at least 6 hours the proportio of studets who receive a A i Stat 151 We ca also iterpret the populatio proportio, as the probability of the evet of iterest (whe radomly choosig a idividual from the populatio). 2 Estimatio of a populatio proportio p The sample proportio ˆp is defied for a sample a give by: ˆp = x where x deotes the umber of members i the sample that have the specified attribute (or umber of successes), ad deotes the sample size. It seems atural to use the sample proportio for estimatig a populatio proportio. I order to cofirm that this is statistically reasoable, we eed to study the distributio of ˆp (why is this a radom variable?). The followig is also called the Cetral Limit Theorem for proportios. For samples of size : 1. (mea) muˆp = p 2. (stadard deviatio) σˆp = p(1 p)/ 3. (shape) If is large the ˆp is approximately ormally distributed. The first property meas, that ˆp is a ubiased estimator for p, the secod property meas the larger, the more likely ˆp is fallig close to p, ad the last property lets us costruct cofidece itervals ad tests (yippee!) Example A study showed, that the proportio of people i the 20 to 34 age group with a IQ (o the Wechsler Itelligece Scale) of over 120 is about Calculate the probability for the evet that i a sample of 50 there are more tha 20 people with a IQ of at least 120. For this 1
2 sample ˆp = 20/20 = 0.4 We will calculate how likely a sample proportio of 0.4 (or larger) is occurrig i a sample of size 50, with a true populatio proportio of 0.35 ˆp 0.35 P (ˆp 0.4 = P ( 0.35(0.65)/ (0.65)/50 ) stadardize = P (Z 0.74) = 1 P (Z < 0.74) = =.2296 (table II) We calculated that the probability that more tha 20 out of 50 people (betwee 20 ad 34) have a IQ greater tha 120 is.23. Not that ulikely. 2.1 Large-Sample Cofidece Iterval for a Populatio Proportio p Let p be the probability of a evet of iterest. We saw before that ˆp = x is a ubiased estimate for p, if x is the umber of successes i trials. Usually p is ukow ad based o a radom sample we ca calculate a (1 α)100% cofidece iterval. A (1 α)100% Large Sample Cofidece Iterval for a Populatio Proportio p. ˆp ± z α/2 p(1 p) where z 1 α/2 is the 1 α/2 percetile of a stadard ormal distributio. Sice p is ukow, it is estimated usig ˆp. The sample size is cosidered large whe the ormal approximatio to the biomial distributio is adequate amely whe the umber of successes ad the umber of failures are both at least five. Proof: P ( ˆp z 1 α/2 p(1 p) ) p ˆp + z p(1 p) 1 α/2 = P ( z 1 α/2 ) p ˆp p(1 p)/ z 1 α/2 ( ) ( ) p ˆp = P p(1 p)/ z p ˆp 1 α/2 P p(1 p)/ z 1 α/2 = 1 α 2 (1 (1 α 2 )) = 1 α sice p ˆp p(1 p)/ is accordig to the Cetral Limit Theorem stadard ormal distributed. Remark: A cofidece iterval is calculated, whe p is ukow. So the boudaries will be calculated by replacig p by the ubiased estimator ˆp. This is oly appropriate if is large ad will result 2
3 i a approximate cofidece iterval, that meas the probability for the parameter to fall ito the iterval is approximately 1 α. So we use: Let z α/2 the (1 α/2) percetile of the stadard ormal distributio ad p > 5 ad (1 p) > 5. The is ˆp(1 ˆp) ˆp(1 ˆp) ˆp z α/2 ; ˆp + z α/2 a approximate (1 α) cofidece iterval for p. Example: Cosider flippig a coi 1000 times. I oly 400 of the experimets HEAD was observed. Is this a surprisig umber, if the coi is ubiased. To aswer this questio calculate a 95% cofidece iterval from this data ad check if 0.5 (the probability for HEAD, whe tossig a ubiased coi) is i the cofidece iterval. First check if the coditios are met: p = (1 p) = = We coclude that we ca apply the Cetral Limit Theorem ad ca use the above described method for obtaiig a cofidece iterval. [ ˆp z α/2 ˆp(1 ˆp) ; ˆp + z α/2 ˆp(1 ˆp) ] = [ ; = [ ; ] = [0.37 ; 0.43] We ca be 95% cofidet, that the true probability for HEAD is i the iterval [0.37; 0.43]. Sice 0.5 is ot i the iterval, it seems to be ulikely that 0.5 is the true probability for HEAD. Check the coi, what makes it biased! 2.2 Choosig the Sample Size The Margi of Error for the estimatio of p is E = z α/2 p(1 p)/ Choosig the sample size for estimatig a proportio p follows the same argumet, as fidig the sample size for estimatig a mea µ, oly that the formula is based o aother cofidece iterval. Assume a probability p shall be estimated withi a margi of error of E with a (1 α)100% cofidece iterval, the ( ) 2 z( α/2) p(1 p) E Sice p is ot kow, use a guess, or use p = 0.5 as a coservative value i this formula. Example A poll shall be coducted to fid the proportio of Caadias supportig the Liberal party withi a margi of error of 3% (E = 0.03) the ( ) (0.5) = A sample size of 1068 would be required to make this goal. (This is why most polls are based o samples of size of a little above 1000). 3 ]
4 2.3 A Large Sample Test Cocerig a Proportio p For developig a test agai the facts we kow from the CLT have to be cosidered. The poit estimator for a proportio is the sample proportio ˆp. From the Cetral Limit Theorem we kow about the samplig distributio of ˆp that: 1. µˆp = p 2. σˆp = p(1 p) 3. If is large the samplig distributio of ˆp is approximately ormal. So we get that z = p ˆp p(1 p) is stadard ormally distributed for large sample sizes. Usig these properties it ca be proved that the followig procedure, is a statistical test, that esures, that the probability to make a error of type I is less or equal tha α. A Large Sample Test cocerig a Proportio p 1. Hypotheses: Test type Upper tail H 0 : p p 0 versus H a : p > p 0 Lower tail H 0 : p p 0 versus H a : p < p 0 Two tail H 0 : p = p 0 versus H a : p p 0 Choose α. 2. Assumptio:Radom sample ad, the sample size is large, that is that ˆp > 5 ad (1 ˆp) > Test statistic: Let p 0 be a value betwee zero ad oe ad defie the test statistic z 0 = ˆp p 0 (p 0 (1 p 0 ))/ 4. p-value ad Rejectio Regio: Test type p-value Rejectio Regio Upper tail P (z > z 0 ) z 0 > z α Lower tail P (z < z 0 ) z 0 < z α Two tail 2 P (z > abs(z 0 )) abs(z 0 ) > z α/2 4
5 Where z α is the 1 α percetile of the stadard ormal distributio. 5. Decisio: If P-value α or z 0 falls i the rejectio regio, the reject H 0 If P-value> α or z 0 does ot fall i the rejectio regio the do ot reject H 0 6. Cotext. Example: Suppose that you wat to show that the proportio of adults above 40 who are participatig i fitess activities is below So you wat to test ( puttig what you wat to show ito the alterative hypothesis H a ) H 0 : p 0.2 vs. H a : p < 0.2 at a sigificace level of α = The sample size is = 100 ad the umber of people sampled who participate i those activities equals 19, so that ˆp = 0.19, ˆp = 19 > 5 ad (1 ˆp) = 81 > 5, so the assumptios are met (assumig the sample was radomly chose). 3. The z 0 = = Now calculate the P-value, accordig to the choice of H a it is a lower tail test, so the P-value is the lower tail probability. P value = P (z < 0.25) = (from table II.) 5. Decisio: Sice =P-value> 0.05 = α, H 0 is ot rejected. 6. Cotext: At sigificace level of 5% the sample data do ot provide sufficiet evidece that less tha 20% of adults 40 ad older take part i fitess activities. 5
6 2.4 Estimatig the Differece betwee Two Populatio Proportios Istead of comparig two populatio meas let s ow compare two populatio proportios. Assume you wat to compare the rate of people who play computer games i the age groups of 20 to 30 ad 30 to 40 The proportio of defective items maufactured i two productio lies The statistic for estimatig the differece i two populatio proportios that comes to mid is the differece i the sample proportio (ˆp 1 ˆp 2 ). Let study the samplig distributio of this statistic to costruct a cofidece iterval. Properties of the Samplig Distributio of the Differece betwee two Sample Proportios (ˆp 1 ˆp 2 ) Cosider that you have two idepedet samples of sizes 1 ad 2 from biomial populatios with parameters p 1 ad p 2, respectively. The samplig distributio of (ˆp 1 ˆp 2 ) has these properties: 1. The mea of (ˆp 1 ˆp 2 ) is ad the stadard error is µˆp1 ˆp 2 = p 1 p 2 SE = p1 (1 p 1 ) 1 + p 2(1 p 2 ) 2 which is estimated by SE ˆ ˆp1 (1 ˆp 1 ) = + ˆp 2(1 ˆp 2 ) The samplig distributio of (ˆp 1 ˆp 2 ) is approximately ormal distributed, whe the sample sizes 1 ad 2 are large, that is whe 1 p 1 > 5 ad 1 (1 p 1 ) > 5 ad 2 p 2 > 5 ad 2 (1 p 2 ) > 5 These results ow lead to the descriptio of the estimatio of (p 1 p 2 ). Large Sample Poit Estimatio of (p 1 p 2 ) Poit estimate: (ˆp 1 ˆp 2 ) Margi of error: z α/2 p1 (1 p 1 ) 1 + p 2(1 p 2 ) 2 Large Sample (1 α)100% Cofidece Iterval for (p 1 p 2 ) (ˆp 1 ˆp 2 ) ± z α/2 p1 (1 p 1 ), 1 + p 2(1 p 2 ) 2 6
7 For this we have to assume agai that 1 ad 2 are large, that is 1 p 1 5, 1 (1 p 1 ), 2 p 2, 2 (1 p 2 ) are greater tha 5. I order to apply the tools described above, fid that p 1 ad p 2, the populatio proportios, are ukow. I order to use the above procedures, we have to replace the populatio proportios by their estimates ˆp 1 ad ˆp 2. So that you will estimate the margi of error by ±1.96SE ˆ ˆp1 (1 ˆp 1 ) = ± ˆp 2(1 ˆp 2 ) 1 2 ad use the followig Approximate Large Sample (1 α)100% Cofidece Iterval for (p 1 p 2 ) (ˆp 1 ˆp 2 ) ± z α/2 ˆp1 (1 ˆp 1 ) 1 + ˆp 2(1 ˆp 2 ) 2 For this we have to assume agai that 1 ad 2 are large, that is 1 p 1, 1 (1 p 1 ), 2 p 2, 2 (1 p 2 ) are greater tha 5. Example: Suppose we wat to compare therapies. The criteria for the compariso is the probability to survive at least 5 years after therapy. The study produced the followig data: Populatio 1 Populatio x ˆp = x/ That is 90 out of 100 patiets, who uderwet therapy 1 survived at least 5 years. If we use ˆp 1 as estimate for p 1 ad ˆp 2 as estimate for p 2, we fid that 1 p 1, 1 (1 p 1 ), 2 p 2, 2 (1 p 2 ) are all greater tha 5. So we ca use the formula from above for calculatig a 95% cofidece iterval for p 1 p 2. ˆp1 (1 ˆp 1 ) (ˆp 1 ˆp 2 )±z α/2 + ˆp 2(1 ˆp 2 ) 0.9(0.1) = (0.025)± (0.125) = 0.025± or [ ; 0.118]. Sice 0 is captured i this iterval, we fid, that this data does ot provide evidece, that the two therapies result i differet probabilities to survive 5 years. They ca be differet, but this data does ot show it. 7
8 2.5 Statistical Test for Two Populatio Proportios p 1 ad p 2 Notatio: populatio sample proportio size proportio populatio 1 p 1 1 ˆp 1 populatio 2 p 2 2 ˆp 2 Large-Sample z Test for comparig p 1 ad p 2 Hypotheses Test type Upper tail H 0 : p 1 p 2 0 versus H a : p 1 p 2 > 0 Lower tail H 0 : p 1 p 2 0 versus H a : p 1 p 2 < 0 Two tail H 0 : p 1 p 2 = 0 versus H a : p 1 p 2 0 Assumptio: Both sample sizes are large: Radom samples, 1ˆp 1 > 5, 1 (1 ˆp 1 ) > 5, 2ˆp 2 > 5, 2 (1 ˆp 2 ) > 5 Test statistic: z 0 = P-value ad Rejectio Regio: ˆpc(1 ˆp c) (ˆp 1 ˆp 2 ) 1 + ˆpc(1 ˆpc) 2 Test type P-value Rejectio Regio Upper tail P (z > z 0 ) z 0 > z α Lower tail P (z < z 0 ) z 0 < z α Two tail 2 P (z < abs(z 0 )) abs(z 0 ) > z α/2 Decisio Cotext Example: Fid if the proportios of red M&M s i the plai ad peaut variety do differ at a sigificace level of The sample Plai(1) Peaut(2) Sample Size Number of red M&Ms 12 8 This results i ˆp 1 = 12/56 = ad ˆp 2 = 8/32 = 0.25 ad ˆp c = (12+8)/(56+32) = 20/88 =
9 1. The questio asks for a test of H 0 : p 1 p 2 = 0 vs. H a : p 1 p 2 0. α = Assumptio: Sice ˆp 1 1, (1 ˆp 1 ) 1, ˆp 2 2, (1 ˆp 2 ) 2 are all greater tha 5, the assumptios are met ad the test will deliver a reliable result. 3. Test statistic: z 0 = ˆpc(1 ˆp c) (ˆp 1 ˆp 2 ) 1 + ˆpc(1 ˆpc) 2 = ( ) 0.227(0.773) (0.773) = = Rejectio regio: With α = 0.05 the rejectio regio for a two tailed test is: abs(z 0 ) > z α/2 = or usig the p-value: 2-tailed p-value=2p (z > abs(z 0 )) = 2P (z > ) = 2( ) = Decisio: Sice the P-value is ot smaller tha α = 0.05 do ot reject H 0 at sigificace level of At sigificace level of 5% we coclude that we do ot have eough evidece, that the proportio of red M&M s is differet for the plai ad peaut variety. 9
1. C. The formula for the confidence interval for a population mean is: x t, which was
s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value
More informationHypothesis testing. Null and alternative hypotheses
Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate
More informationInference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval
Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportios HT - Iferece o Proportio Parameter: Populatio Proportio p (or π) (Percetage of people has o health isurace) x Statistic: Sample Proportio
More information5: Introduction to Estimation
5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample
More informationPractice Problems for Test 3
Practice Problems for Test 3 Note: these problems oly cover CIs ad hypothesis testig You are also resposible for kowig the samplig distributio of the sample meas, ad the Cetral Limit Theorem Review all
More informationCHAPTER 7: Central Limit Theorem: CLT for Averages (Means)
CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:
More informationConfidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.
Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).
More informationPSYCHOLOGICAL STATISTICS
UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics
More informationI. Chi-squared Distributions
1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.
More informationSampling Distribution And Central Limit Theorem
() Samplig Distributio & Cetral Limit Samplig Distributio Ad Cetral Limit Samplig distributio of the sample mea If we sample a umber of samples (say k samples where k is very large umber) each of size,
More informationOne-sample test of proportions
Oe-sample test of proportios The Settig: Idividuals i some populatio ca be classified ito oe of two categories. You wat to make iferece about the proportio i each category, so you draw a sample. Examples:
More informationThe following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles
The followig eample will help us uderstad The Samplig Distributio of the Mea Review: The populatio is the etire collectio of all idividuals or objects of iterest The sample is the portio of the populatio
More informationConfidence Intervals
Cofidece Itervals Cofidece Itervals are a extesio of the cocept of Margi of Error which we met earlier i this course. Remember we saw: The sample proportio will differ from the populatio proportio by more
More informationOverview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals
Overview Estimatig the Value of a Parameter Usig Cofidece Itervals We apply the results about the sample mea the problem of estimatio Estimatio is the process of usig sample data estimate the value of
More informationCenter, Spread, and Shape in Inference: Claims, Caveats, and Insights
Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the
More information0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5
Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.
More informationZ-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown
Z-TEST / Z-STATISTIC: used to test hypotheses about µ whe the populatio stadard deviatio is kow ad populatio distributio is ormal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses about
More informationSTA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error
STA 2023 Practice Questios Exam 2 Chapter 7- sec 9.2 Formulas Give o the test: Case parameter estimator stadard error Estimate of stadard error Samplig Distributio oe mea x s t (-1) oe p ( 1 p) CI: prop.
More informationMath C067 Sampling Distributions
Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters
More informationDetermining the sample size
Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors
More information15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011
15.075 Exam 3 Istructor: Cythia Rudi TA: Dimitrios Bisias November 22, 2011 Gradig is based o demostratio of coceptual uderstadig, so you eed to show all of your work. Problem 1 A compay makes high-defiitio
More informationConfidence Intervals for One Mean
Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a
More informationStatistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
More informationLesson 15 ANOVA (analysis of variance)
Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi
More informationCase Study. Normal and t Distributions. Density Plot. Normal Distributions
Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca
More informationSTATISTICAL METHODS FOR BUSINESS
STATISTICAL METHODS FOR BUSINESS UNIT 7: INFERENTIAL TOOLS. DISTRIBUTIONS ASSOCIATED WITH SAMPLING 7.1.- Distributios associated with the samplig process. 7.2.- Iferetial processes ad relevat distributios.
More information1 Computing the Standard Deviation of Sample Means
Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.
More informationChapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:
Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries
More informationOutput Analysis (2, Chapters 10 &11 Law)
B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should
More informationConfidence intervals and hypothesis tests
Chapter 2 Cofidece itervals ad hypothesis tests This chapter focuses o how to draw coclusios about populatios from sample data. We ll start by lookig at biary data (e.g., pollig), ad lear how to estimate
More informationProperties of MLE: consistency, asymptotic normality. Fisher information.
Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout
More informationUniversity of California, Los Angeles Department of Statistics. Distributions related to the normal distribution
Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.
More informationChapter 7: Confidence Interval and Sample Size
Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum
More informationMeasures of Spread and Boxplots Discrete Math, Section 9.4
Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,
More informationA Mathematical Perspective on Gambling
A Mathematical Perspective o Gamblig Molly Maxwell Abstract. This paper presets some basic topics i probability ad statistics, icludig sample spaces, probabilistic evets, expectatios, the biomial ad ormal
More informationMEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)
MEI Mathematics i Educatio ad Idustry MEI Structured Mathematics Module Summary Sheets Statistics (Versio B: referece to ew book) Topic : The Poisso Distributio Topic : The Normal Distributio Topic 3:
More informationGCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.
GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all
More informationChapter 7 Methods of Finding Estimators
Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of
More informationChapter 14 Nonparametric Statistics
Chapter 14 Noparametric Statistics A.K.A. distributio-free statistics! Does ot deped o the populatio fittig ay particular type of distributio (e.g, ormal). Sice these methods make fewer assumptios, they
More information1 Correlation and Regression Analysis
1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio
More informationChapter 6: Variance, the law of large numbers and the Monte-Carlo method
Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value
More informationMann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)
No-Parametric ivariate Statistics: Wilcoxo-Ma-Whitey 2 Sample Test 1 Ma-Whitey 2 Sample Test (a.k.a. Wilcoxo Rak Sum Test) The (Wilcoxo-) Ma-Whitey (WMW) test is the o-parametric equivalet of a pooled
More informationMaximum Likelihood Estimators.
Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio
More informationOMG! Excessive Texting Tied to Risky Teen Behaviors
BUSIESS WEEK: EXECUTIVE EALT ovember 09, 2010 OMG! Excessive Textig Tied to Risky Tee Behaviors Kids who sed more tha 120 a day more likely to try drugs, alcohol ad sex, researchers fid TUESDAY, ov. 9
More informationTHE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n
We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample
More informationLECTURE 13: Cross-validation
LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M
More informationOverview of some probability distributions.
Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability
More informationPresent Value Factor To bring one dollar in the future back to present, one uses the Present Value Factor (PVF): Concept 9: Present Value
Cocept 9: Preset Value Is the value of a dollar received today the same as received a year from today? A dollar today is worth more tha a dollar tomorrow because of iflatio, opportuity cost, ad risk Brigig
More informationNormal Distribution.
Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued
More informationHypergeometric Distributions
7.4 Hypergeometric Distributios Whe choosig the startig lie-up for a game, a coach obviously has to choose a differet player for each positio. Similarly, whe a uio elects delegates for a covetio or you
More informationWeek 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable
Week 3 Coditioal probabilities, Bayes formula, WEEK 3 page 1 Expected value of a radom variable We recall our discussio of 5 card poker hads. Example 13 : a) What is the probability of evet A that a 5
More informationThe Stable Marriage Problem
The Stable Marriage Problem William Hut Lae Departmet of Computer Sciece ad Electrical Egieerig, West Virgiia Uiversity, Morgatow, WV William.Hut@mail.wvu.edu 1 Itroductio Imagie you are a matchmaker,
More information3 Basic Definitions of Probability Theory
3 Basic Defiitios of Probability Theory 3defprob.tex: Feb 10, 2003 Classical probability Frequecy probability axiomatic probability Historical developemet: Classical Frequecy Axiomatic The Axiomatic defiitio
More informationDefinition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean
1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.
More informationApproximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find
1.8 Approximatig Area uder a curve with rectagles 1.6 To fid the area uder a curve we approximate the area usig rectagles ad the use limits to fid 1.4 the area. Example 1 Suppose we wat to estimate 1.
More informationCentral Limit Theorem and Its Applications to Baseball
Cetral Limit Theorem ad Its Applicatios to Baseball by Nicole Aderso A project submitted to the Departmet of Mathematical Scieces i coformity with the requiremets for Math 4301 (Hoours Semiar) Lakehead
More informationIn nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008
I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces
More informationQuadrat Sampling in Population Ecology
Quadrat Samplig i Populatio Ecology Backgroud Estimatig the abudace of orgaisms. Ecology is ofte referred to as the "study of distributio ad abudace". This beig true, we would ofte like to kow how may
More informationThis document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC.
SPC Formulas ad Tables 1 This documet cotais a collectio of formulas ad costats useful for SPC chart costructio. It assumes you are already familiar with SPC. Termiology Geerally, a bar draw over a symbol
More informationResearch Method (I) --Knowledge on Sampling (Simple Random Sampling)
Research Method (I) --Kowledge o Samplig (Simple Radom Samplig) 1. Itroductio to samplig 1.1 Defiitio of samplig Samplig ca be defied as selectig part of the elemets i a populatio. It results i the fact
More informationDiscrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13
EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Note 13 Itroductio At this poit, we have see eough examples that it is worth just takig stock of our model of probability ad may
More informationLecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem
Lecture 4: Cauchy sequeces, Bolzao-Weierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits
More informationA Recursive Formula for Moments of a Binomial Distribution
A Recursive Formula for Momets of a Biomial Distributio Árpád Béyi beyi@mathumassedu, Uiversity of Massachusetts, Amherst, MA 01003 ad Saverio M Maago smmaago@psavymil Naval Postgraduate School, Moterey,
More informationTopic 5: Confidence Intervals (Chapter 9)
Topic 5: Cofidece Iterval (Chapter 9) 1. Itroductio The two geeral area of tatitical iferece are: 1) etimatio of parameter(), ch. 9 ) hypothei tetig of parameter(), ch. 10 Let X be ome radom variable with
More information0.674 0.841 1.036 1.282 1.645 1.960 2.054 2.326 2.576 2.807 3.091 3.291 50% 60% 70% 80% 90% 95% 96% 98% 99% 99.5% 99.8% 99.9%
Sectio 10 Aswer Key: 0.674 0.841 1.036 1.282 1.645 1.960 2.054 2.326 2.576 2.807 3.091 3.291 50% 60% 70% 80% 90% 95% 96% 98% 99% 99.5% 99.8% 99.9% 1) A simple radom sample of New Yorkers fids that 87 are
More informationDescriptive Statistics
Descriptive Statistics We leared to describe data sets graphically. We ca also describe a data set umerically. Measures of Locatio Defiitio The sample mea is the arithmetic average of values. We deote
More informationUnit 8: Inference for Proportions. Chapters 8 & 9 in IPS
Uit 8: Iferece for Proortios Chaters 8 & 9 i IPS Lecture Outlie Iferece for a Proortio (oe samle) Iferece for Two Proortios (two samles) Cotigecy Tables ad the χ test Iferece for Proortios IPS, Chater
More informationMulti-server Optimal Bandwidth Monitoring for QoS based Multimedia Delivery Anup Basu, Irene Cheng and Yinzhe Yu
Multi-server Optimal Badwidth Moitorig for QoS based Multimedia Delivery Aup Basu, Iree Cheg ad Yizhe Yu Departmet of Computig Sciece U. of Alberta Architecture Applicatio Layer Request receptio -coectio
More information, a Wishart distribution with n -1 degrees of freedom and scale matrix.
UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that
More informationHypothesis testing using complex survey data
Hypotesis testig usig complex survey data A Sort Course preseted by Peter Ly, Uiversity of Essex i associatio wit te coferece of te Europea Survey Researc Associatio Prague, 5 Jue 007 1 1. Objective: Simple
More informationLesson 17 Pearson s Correlation Coefficient
Outlie Measures of Relatioships Pearso s Correlatio Coefficiet (r) -types of data -scatter plots -measure of directio -measure of stregth Computatio -covariatio of X ad Y -uique variatio i X ad Y -measurig
More informationPROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM
PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical ad Mathematical Scieces 2015, 1, p. 15 19 M a t h e m a t i c s AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics
More informationIncremental calculation of weighted mean and variance
Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically
More informationTHE TWO-VARIABLE LINEAR REGRESSION MODEL
THE TWO-VARIABLE LINEAR REGRESSION MODEL Herma J. Bieres Pesylvaia State Uiversity April 30, 202. Itroductio Suppose you are a ecoomics or busiess maor i a college close to the beach i the souther part
More informationhp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation
HP 1C Statistics - average ad stadard deviatio Average ad stadard deviatio cocepts HP1C average ad stadard deviatio Practice calculatig averages ad stadard deviatios with oe or two variables HP 1C Statistics
More informationSoving Recurrence Relations
Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree
More informationLecture 3. denote the orthogonal complement of S k. Then. 1 x S k. n. 2 x T Ax = ( ) λ x. with x = 1, we have. i = λ k x 2 = λ k.
18.409 A Algorithmist s Toolkit September 17, 009 Lecture 3 Lecturer: Joatha Keler Scribe: Adre Wibisoo 1 Outlie Today s lecture covers three mai parts: Courat-Fischer formula ad Rayleigh quotiets The
More informationUC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006
Exam format UC Bereley Departmet of Electrical Egieerig ad Computer Sciece EE 6: Probablity ad Radom Processes Solutios 9 Sprig 006 The secod midterm will be held o Wedesday May 7; CHECK the fial exam
More informationCS103X: Discrete Structures Homework 4 Solutions
CS103X: Discrete Structures Homewor 4 Solutios Due February 22, 2008 Exercise 1 10 poits. Silico Valley questios: a How may possible six-figure salaries i whole dollar amouts are there that cotai at least
More informationSection 11.3: The Integral Test
Sectio.3: The Itegral Test Most of the series we have looked at have either diverged or have coverged ad we have bee able to fid what they coverge to. I geeral however, the problem is much more difficult
More informationElementary Theory of Russian Roulette
Elemetary Theory of Russia Roulette -iterestig patters of fractios- Satoshi Hashiba Daisuke Miematsu Ryohei Miyadera Itroductio. Today we are goig to study mathematical theory of Russia roulette. If some
More informationChapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions
Chapter 5 Uit Aual Amout ad Gradiet Fuctios IET 350 Egieerig Ecoomics Learig Objectives Chapter 5 Upo completio of this chapter you should uderstad: Calculatig future values from aual amouts. Calculatig
More informationTHE ABRACADABRA PROBLEM
THE ABRACADABRA PROBLEM FRANCESCO CARAVENNA Abstract. We preset a detailed solutio of Exercise E0.6 i [Wil9]: i a radom sequece of letters, draw idepedetly ad uiformly from the Eglish alphabet, the expected
More informationG r a d e. 2 M a t h e M a t i c s. statistics and Probability
G r a d e 2 M a t h e M a t i c s statistics ad Probability Grade 2: Statistics (Data Aalysis) (2.SP.1, 2.SP.2) edurig uderstadigs: data ca be collected ad orgaized i a variety of ways. data ca be used
More informationAsymptotic Growth of Functions
CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll
More informationChair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics
Chair for Network Architectures ad Services Istitute of Iformatics TU Müche Prof. Carle Network Security Chapter 2 Basics 2.4 Radom Number Geeratio for Cryptographic Protocols Motivatio It is crucial to
More informationAnalyzing Longitudinal Data from Complex Surveys Using SUDAAN
Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical
More informationDepartment of Computer Science, University of Otago
Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly
More informationA Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as:
A Test of Normality Textbook Referece: Chapter. (eighth editio, pages 59 ; seveth editio, pages 6 6). The calculatio of p values for hypothesis testig typically is based o the assumptio that the populatio
More informationA probabilistic proof of a binomial identity
A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two
More informationTHE ARITHMETIC OF INTEGERS. - multiplication, exponentiation, division, addition, and subtraction
THE ARITHMETIC OF INTEGERS - multiplicatio, expoetiatio, divisio, additio, ad subtractio What to do ad what ot to do. THE INTEGERS Recall that a iteger is oe of the whole umbers, which may be either positive,
More informationNon-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring
No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy
More informationTaking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling
Taig DCOP to the Real World: Efficiet Complete Solutios for Distributed Multi-Evet Schedulig Rajiv T. Maheswara, Milid Tambe, Emma Bowrig, Joatha P. Pearce, ad Pradeep araatham Uiversity of Souther Califoria
More informationCHAPTER 3 DIGITAL CODING OF SIGNALS
CHAPTER 3 DIGITAL CODING OF SIGNALS Computers are ofte used to automate the recordig of measuremets. The trasducers ad sigal coditioig circuits produce a voltage sigal that is proportioal to a quatity
More informationInfinite Sequences and Series
CHAPTER 4 Ifiite Sequeces ad Series 4.1. Sequeces A sequece is a ifiite ordered list of umbers, for example the sequece of odd positive itegers: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29...
More informationParametric (theoretical) probability distributions. (Wilks, Ch. 4) Discrete distributions: (e.g., yes/no; above normal, normal, below normal)
6 Parametric (theoretical) probability distributios. (Wilks, Ch. 4) Note: parametric: assume a theoretical distributio (e.g., Gauss) No-parametric: o assumptio made about the distributio Advatages of assumig
More informationLecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009)
18.409 A Algorithmist s Toolkit October 27, 2009 Lecture 13 Lecturer: Joatha Keler Scribe: Joatha Pies (2009) 1 Outlie Last time, we proved the Bru-Mikowski iequality for boxes. Today we ll go over the
More informationExample 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).
BEGINNING ALGEBRA Roots ad Radicals (revised summer, 00 Olso) Packet to Supplemet the Curret Textbook - Part Review of Square Roots & Irratioals (This portio ca be ay time before Part ad should mostly
More informationarxiv:1506.03481v1 [stat.me] 10 Jun 2015
BEHAVIOUR OF ABC FOR BIG DATA By Wetao Li ad Paul Fearhead Lacaster Uiversity arxiv:1506.03481v1 [stat.me] 10 Ju 2015 May statistical applicatios ivolve models that it is difficult to evaluate the likelihood,
More informationHere are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.
This documet was writte ad copyrighted by Paul Dawkis. Use of this documet ad its olie versio is govered by the Terms ad Coditios of Use located at http://tutorial.math.lamar.edu/terms.asp. The olie versio
More information