Expected values, standard errors, Central Limit Theorem. Statistical inference
|
|
- Warren McCarthy
- 7 years ago
- Views:
Transcription
1 Expected values, standard errors, Central Limit Theorem FPP Statistical inference Up to this point we have focused primarily on exploratory statistical analysis We know dive into the realm of statistical inference The ideas associated with sampling distributions, p-values, and confidence intervals are more abstract and are therefore slightly harder These concepts are also very powerful For good if used correctly For bad if used incorrectly 1
2 Statistics vs probability modeling Probability: know the truth, want to estimate the chances that data occur Statistics: know the data that occur, want to infer about the truth Parameter Population Inference Statistic x Sample Law of averages What does the law of averages say? Toss a coin As # of tosses increase the #heads 0.5(#tosses %heads 50% 2
3 Chance processes When tossing a coin: Actual #heads Expected #heads What is the likely size of the difference? Strategy: Find an analogy between the process being studied and drawing numbers at random from a box (box model) Box models A so called box model is a good starting point into statistical inference The purpose of these very simple models is to analyze chance variability They are a construction for learning about characteristics of populations 3
4 Motivating example Population: 119,106 graduates of Duke Variable: donation amount in $$ to Duke Annual Fund in 2001 Box model: make a ticket for every alumnus containing his/her donation amount Put all these tickets in a hypothetical box. Box models: typical questions Pick 100 tickets at random from the box, with replacement 1. Before collecting the data, what do you expect the sum of these 100 alumni donations to equal? 2. What do you think is a typical deviation from this expected value? 3. Before collecting the data how many of the 100 alumni people do you expect to be donators? 4. What do you think is a typical deviation from this expected value? 4
5 Characteristics of alumni donations For the 119,106 alumni: Average of all donations = $735 SD of donations = $23,827 42,938 donated (36%) 76,168 did not donate (64%) Learning about the sample sum When we sample randomly, the sum of the 100 tickets will differ for different samples What is the expected value (EV) of the sample sum E(sample sum) = n*(average of box) What is a typical deviation of a sample sum from this expected value Standard error (SE) of sum = n *(SD of box) 5
6 Sample sum of donations for 100 alumni So the sum of the 100 alumni donations should be: E(sample sum) = 100*($735) = $73,500 give or take the SE SE = 100($23,827) = $238,270 How sure are we about the sum of donations using a sample of 100? Key idea If we take independent samples of 100 alumni over and over again, recording each sample then The average of the sample sums should be around $73,500 The SD of the sample sums should be around $238,270 Box model for binary (dichotomous) outcomes 42,938 donated and 76,168 did not Make a box with tickets comprised of 42,938 ones and 76,168 zeros. Average of box = % of ones = 0.36 = p SD of box = 0.48 Short cut for SD for binary box models (and only for binary box models) Sample 100 tickets out of the box with replacement. What does this process remind you of? 6
7 Sample number of donators out of 100 alumni The number of donators in the sample equals the sample sum of the 0-1 tickets Thus, the expected number of donators is EV of sample sum = n * (Average of box) = 100 * 0.36 = 36 The typical deviation of the sample sum for expected value is The Standard error (SE) of sum = * (SD of box) n = 10 *.48 = 4.8 Sample number of donators out of 100 alumni Hence, the number of alumni who donated out of a random sample of 100 should be 36, give or take around 5 people (SD = 4.8). Compared to the average donation per alumni how confident are we that any give sample of 100 will produce 36 donors. Key idea If we take independent samples of 100 alumni over and over again, recording the number of donators in each sample The average of the sample number of donators should be around 36 The SD of the sample numbers of donators should be around 4.8 7
8 A problem from the text 100 draws are made with replacement from a box containing the seven numbers Suppose you were betting. The closer your guess is to the sample sum, the more money you win. What number would you guess? How much would you expect the sample sum to be off from the expected value of the sum? Difference between SD and SE SD is the typical deviation from the average in a box. SD is a property of the box; it doesn t depend on a random sampling SE is the typical deviation from the expected value in a random sample. SE results from random sampling SE gives an idea of how large the chance error is Sum of draws is likely to be around its expected value, but to be off by a chance error similar in size to its SE Sum of draws = EV ± chance error 8
9 EV and SE of the sample average or percent Since sample average(percent) = sample sum /n we get 1. Just like sample sums, sample averages and sample percentages are subject to chance variation 2. EV for sample average ( or %) = EV of sample sum / n = Avg. of box. 3. SE for sample average (or %) = SE for sample sum / n = SD of box / n Common theme for SE of sample average and sample percentage Fir a binary variable, the population SD = So both the sample average and sample percentage have a standard error of the form SE = Population SD / n 9
10 Sample averages and percentages In a random sample of 100 alumni, we expect the sample average donation In a random sample of 100 alumni, we expect the sample average donation to equal $735, give or take $2, We expect 36% to donate, give or take 4.8% (SE =.048). If we take independent samples of 100 alumni over and over again, recording the average donation and the percentage of donators in each sample: a) the average of the sample averages of donations should be around 735. b) the SD of the sample averages of donations should be around 2, c) the average of the sample percentages of donators should be around.36. d) the SD of the sample percentages of donators should be around Sample averages and percentages In a random sample of 100 alumni, we expect the sample average donation to equal $735 give or take $2, We expect 36% to donate, give or take 4.8% If we take independent samples of 100 alumni over and over again, recording the average donation and the percentage of donators in each sample The average of the sample averages of donations should be around 735 The SD of the sample averages of donations should be around 2, The average of the sample percentages of donators should be around 0.36 The SD of the sample percentages of donators should be around
11 Law of averages Plot the SE of sample average donation for an increasing sample taken from the box As n in increases, the SE of the sample average decreases This is called the law of averages Vegas was built on this law Shape of chance process The expected value and the standard error provide a measure of center and spread for the chance process What about the shape Book introduces something called the probability histogram This is a histogram of the samples take from the box model. What shape will this histogram take on 11
12 The central limit theorem Take many random samples from a box model, all of the samples of size n. When n is sufficiently large, the distribution of the sample average (or sample %) is welldescribed by a normal curve The mean of this normal curve is the EV and the standard deviation for this normal curve is the SE The Central Limit Theorem What does the CLT give us? A ton of stuff We can find probabilities and percentiles using the the normal table Can predict fairly accurately how unlikely it is to sample an observed sample mean Can assess rather accurately how likely a population mean lies within an interval 12
13 Central Limit Theorem What happens if the distribution of the original variable is not symmetric The central limit theorem still kicks in (the sample size n just needs to be bigger) What happens if the distribution of the original variable is bimodal The central limit theorem still kicks in (the sample size n just needs to be bigger) This is absolutely a fantastic result!!! Central Limit Theorem M&Ms Pick 50 M&Ms at random (from a bag). How likely is it to have less than 40% yellow and brown M&Ms in the bag? Assume 50% of all M&M s are yellow and brown (source: M&M s home page) For a sample proportion of yellow and brown M&Ms EV = 0.5 and SE = 13
14 Size of sample For binomial data, the CLT usually kicks in pretty well when both of the following conditions on sample size are met In practice we don t know p so we will plug in the sample percentage CLT and M&Ms Since n=50, CLT applies The probability of getting less than 40% yellow and brown M&Ms in a bag of 50 is It is somewhat unusual to get less than 40% yellow and brown M&Ms (about 8 chances in 100) 14
15 CLT household example The average size of U.S. households is 2.6 people. The SD of household size is (These are true values from the U.S. Census). Pick 200 houses at random in the U.S. How likely is it that we ll get a sample average household size of 3 or more? CLT household example For a sample average of 200 households EV = 2.6 and SE = The chance of getting an average household size greater than 3 equals the area under the standard normal curve to the right of 4. This is a very small chance 15
16 Alumni donations example In a random sample of 100 alumni, what is the chance that more than half donated? Alumni donations example What is the chance that the sample average of donations from 100 randomly picked alumni will be between $50 and $100 16
17 CLT under three conditions 1. If original variable follows a normal distribution the CLT holds regardless of sample size 2. If distribution of original variable is symmetric and unimodal then CLT holds for a small sample size (say less than 15) 3. If distribution is skewed, not unimodal then the CLT holds after a larger sample size how large depends on the sharpness of the skew. In this class we will follow convention and say
AMS 5 CHANCE VARIABILITY
AMS 5 CHANCE VARIABILITY The Law of Averages When tossing a fair coin the chances of tails and heads are the same: 50% and 50%. So if the coin is tossed a large number of times, the number of heads and
More informationTwo-sample inference: Continuous data
Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As
More informationElementary Statistics and Inference. Elementary Statistics and Inference. 17 Expected Value and Standard Error. 22S:025 or 7P:025.
Elementary Statistics and Inference S:05 or 7P:05 Lecture Elementary Statistics and Inference S:05 or 7P:05 Chapter 7 A. The Expected Value In a chance process (probability experiment) the outcomes of
More informationThe Normal Approximation to Probability Histograms. Dice: Throw a single die twice. The Probability Histogram: Area = Probability. Where are we going?
The Normal Approximation to Probability Histograms Where are we going? Probability histograms The normal approximation to binomial histograms The normal approximation to probability histograms of sums
More informationStat 20: Intro to Probability and Statistics
Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum
More informationMONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010
MONT 07N Understanding Randomness Solutions For Final Examination May, 00 Short Answer (a) (0) How are the EV and SE for the sum of n draws with replacement from a box computed? Solution: The EV is n times
More informationExploratory Data Analysis. Psychology 3256
Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More informationStat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015
Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation
More informationLecture 2: Discrete Distributions, Normal Distributions. Chapter 1
Lecture 2: Discrete Distributions, Normal Distributions Chapter 1 Reminders Course website: www. stat.purdue.edu/~xuanyaoh/stat350 Office Hour: Mon 3:30-4:30, Wed 4-5 Bring a calculator, and copy Tables
More information5.1 Identifying the Target Parameter
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying
More informationChapter 4. Probability and Probability Distributions
Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the
More informationJohn Kerrich s coin-tossing Experiment. Law of Averages - pg. 294 Moore s Text
Law of Averages - pg. 294 Moore s Text When tossing a fair coin the chances of tails and heads are the same: 50% and 50%. So, if the coin is tossed a large number of times, the number of heads and the
More informationThe Binomial Distribution
The Binomial Distribution James H. Steiger November 10, 00 1 Topics for this Module 1. The Binomial Process. The Binomial Random Variable. The Binomial Distribution (a) Computing the Binomial pdf (b) Computing
More informationWeek 4: Standard Error and Confidence Intervals
Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.
More information4. Continuous Random Variables, the Pareto and Normal Distributions
4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random
More informationUnbeknownst to us, the entire population consists of 5 cloned sheep with ages 10, 11, 12, 13, 14 months.
Activity #14: Sampling distributions and the Central Limit Theorem So far, this unit has focused on distributions of discrete and continuous random variables. In this activity, we ll investigate sampling
More informationCharacteristics of Binomial Distributions
Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation
More informationDescriptive Statistics and Measurement Scales
Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample
More informationDensity Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve
More informationLesson 4 Measures of Central Tendency
Outline Measures of a distribution s shape -modality and skewness -the normal distribution Measures of central tendency -mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central
More information13.0 Central Limit Theorem
13.0 Central Limit Theorem Discuss Midterm/Answer Questions Box Models Expected Value and Standard Error Central Limit Theorem 1 13.1 Box Models A Box Model describes a process in terms of making repeated
More informationYou flip a fair coin four times, what is the probability that you obtain three heads.
Handout 4: Binomial Distribution Reading Assignment: Chapter 5 In the previous handout, we looked at continuous random variables and calculating probabilities and percentiles for those type of variables.
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationPr(X = x) = f(x) = λe λx
Old Business - variance/std. dev. of binomial distribution - mid-term (day, policies) - class strategies (problems, etc.) - exponential distributions New Business - Central Limit Theorem, standard error
More informationCurriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different
More informationWeek 3&4: Z tables and the Sampling Distribution of X
Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal
More informationChicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this
More informationThe normal approximation to the binomial
The normal approximation to the binomial In order for a continuous distribution (like the normal) to be used to approximate a discrete one (like the binomial), a continuity correction should be used. There
More informationThe overall size of these chance errors is measured by their RMS HALF THE NUMBER OF TOSSES NUMBER OF HEADS MINUS 0 400 800 1200 1600 NUMBER OF TOSSES
INTRODUCTION TO CHANCE VARIABILITY WHAT DOES THE LAW OF AVERAGES SAY? 4 coins were tossed 1600 times each, and the chance error number of heads half the number of tosses was plotted against the number
More informationIndependent samples t-test. Dr. Tom Pierce Radford University
Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of
More informationSolutions to Homework 6 Statistics 302 Professor Larget
s to Homework 6 Statistics 302 Professor Larget Textbook Exercises 5.29 (Graded for Completeness) What Proportion Have College Degrees? According to the US Census Bureau, about 27.5% of US adults over
More informationStatistics courses often teach the two-sample t-test, linear regression, and analysis of variance
2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample
More informationProbability Distributions
Learning Objectives Probability Distributions Section 1: How Can We Summarize Possible Outcomes and Their Probabilities? 1. Random variable 2. Probability distributions for discrete random variables 3.
More informationNormal distribution. ) 2 /2σ. 2π σ
Normal distribution The normal distribution is the most widely known and used of all distributions. Because the normal distribution approximates many natural phenomena so well, it has developed into a
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationThe normal approximation to the binomial
The normal approximation to the binomial The binomial probability function is not useful for calculating probabilities when the number of trials n is large, as it involves multiplying a potentially very
More informationThe Binomial Probability Distribution
The Binomial Probability Distribution MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2015 Objectives After this lesson we will be able to: determine whether a probability
More informationThe Standard Normal distribution
The Standard Normal distribution 21.2 Introduction Mass-produced items should conform to a specification. Usually, a mean is aimed for but due to random errors in the production process we set a tolerance
More informationDescriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion
Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research
More informationThe Assumption(s) of Normality
The Assumption(s) of Normality Copyright 2000, 2011, J. Toby Mordkoff This is very complicated, so I ll provide two versions. At a minimum, you should know the short one. It would be great if you knew
More informationLecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions
Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Typical Inference Problem Definition of Sampling Distribution 3 Approaches to Understanding Sampling Dist. Applying 68-95-99.7 Rule
More informationPoint and Interval Estimates
Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number
More informationSolutions to Homework 3 Statistics 302 Professor Larget
s to Homework 3 Statistics 302 Professor Larget Textbook Exercises 3.20 Customized Home Pages A random sample of n = 1675 Internet users in the US in January 2010 found that 469 of them have customized
More informationChapter 5. Discrete Probability Distributions
Chapter 5. Discrete Probability Distributions Chapter Problem: Did Mendel s result from plant hybridization experiments contradicts his theory? 1. Mendel s theory says that when there are two inheritable
More informationLecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
More informationThe Math. P (x) = 5! = 1 2 3 4 5 = 120.
The Math Suppose there are n experiments, and the probability that someone gets the right answer on any given experiment is p. So in the first example above, n = 5 and p = 0.2. Let X be the number of correct
More information$2 4 40 + ( $1) = 40
THE EXPECTED VALUE FOR THE SUM OF THE DRAWS In the game of Keno there are 80 balls, numbered 1 through 80. On each play, the casino chooses 20 balls at random without replacement. Suppose you bet on the
More information5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.
The Normal Distribution C H 6A P T E R The Normal Distribution Outline 6 1 6 2 Applications of the Normal Distribution 6 3 The Central Limit Theorem 6 4 The Normal Approximation to the Binomial Distribution
More informationREPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.
REPEATED TRIALS Suppose you toss a fair coin one time. Let E be the event that the coin lands heads. We know from basic counting that p(e) = 1 since n(e) = 1 and 2 n(s) = 2. Now suppose we play a game
More informationMATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem
MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem Tony Pourmohamad Department of Mathematics De Anza College Spring 2015 Objectives By the end of this set of slides, you
More informationThe Normal distribution
The Normal distribution The normal probability distribution is the most common model for relative frequencies of a quantitative variable. Bell-shaped and described by the function f(y) = 1 2σ π e{ 1 2σ
More informationTEACHER NOTES MATH NSPIRED
Math Objectives Students will understand that normal distributions can be used to approximate binomial distributions whenever both np and n(1 p) are sufficiently large. Students will understand that when
More informationMeans, standard deviations and. and standard errors
CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard
More informationAP Stats - Probability Review
AP Stats - Probability Review Multiple Choice Identify the choice that best completes the statement or answers the question. 1. I toss a penny and observe whether it lands heads up or tails up. Suppose
More information8. THE NORMAL DISTRIBUTION
8. THE NORMAL DISTRIBUTION The normal distribution with mean μ and variance σ 2 has the following density function: The normal distribution is sometimes called a Gaussian Distribution, after its inventor,
More informationNormal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.
Normal Distribution Definition A continuous random variable has a normal distribution if its probability density e -(y -µ Y ) 2 2 / 2 σ function can be written as for < y < as Y f ( y ) = 1 σ Y 2 π Notation:
More informationSample Term Test 2A. 1. A variable X has a distribution which is described by the density curve shown below:
Sample Term Test 2A 1. A variable X has a distribution which is described by the density curve shown below: What proportion of values of X fall between 1 and 6? (A) 0.550 (B) 0.575 (C) 0.600 (D) 0.625
More informationWhy Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
More informationComparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples
Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The
More informationBetting rules and information theory
Betting rules and information theory Giulio Bottazzi LEM and CAFED Scuola Superiore Sant Anna September, 2013 Outline Simple betting in favorable games The Central Limit Theorem Optimal rules The Game
More informationLecture 11 Uncertainty
Lecture 11 Uncertainty 1. Contingent Claims and the State-Preference Model 1) Contingent Commodities and Contingent Claims Using the simple two-good model we have developed throughout this course, think
More informationComparing Means in Two Populations
Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we
More informationSection 13, Part 1 ANOVA. Analysis Of Variance
Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability
More informationLab 11. Simulations. The Concept
Lab 11 Simulations In this lab you ll learn how to create simulations to provide approximate answers to probability questions. We ll make use of a particular kind of structure, called a box model, that
More informationMidterm Review Problems
Midterm Review Problems October 19, 2013 1. Consider the following research title: Cooperation among nursery school children under two types of instruction. In this study, what is the independent variable?
More informationCHI-SQUARE: TESTING FOR GOODNESS OF FIT
CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity
More informationLecture 8. Confidence intervals and the central limit theorem
Lecture 8. Confidence intervals and the central limit theorem Mathematical Statistics and Discrete Mathematics November 25th, 2015 1 / 15 Central limit theorem Let X 1, X 2,... X n be a random sample of
More informationStatistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013
Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives
More information99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm
Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the
More informationCA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction
CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous
More informationNonparametric statistics and model selection
Chapter 5 Nonparametric statistics and model selection In Chapter, we learned about the t-test and its variations. These were designed to compare sample means, and relied heavily on assumptions of normality.
More informationConfidence intervals
Confidence intervals Today, we re going to start talking about confidence intervals. We use confidence intervals as a tool in inferential statistics. What this means is that given some sample statistics,
More informationHigh School Statistics and Probability Common Core Sample Test Version 2
High School Statistics and Probability Common Core Sample Test Version 2 Our High School Statistics and Probability sample test covers the twenty most common questions that we see targeted for this level.
More informationMinimax Strategies. Minimax Strategies. Zero Sum Games. Why Zero Sum Games? An Example. An Example
Everyone who has studied a game like poker knows the importance of mixing strategies With a bad hand, you often fold But you must bluff sometimes Lectures in Microeconomics-Charles W Upton Zero Sum Games
More informationNow we begin our discussion of exploratory data analysis.
Now we begin our discussion of exploratory data analysis. 1 Remember to keep in mind where we are in the big picture. For now, we will assume that the data we are given is a representative sample from
More informationStat 20: Intro to Probability and Statistics
Stat 20: Intro to Probability and Statistics Lecture 18: Simple Random Sampling Tessa L. Childers-Day UC Berkeley 24 July 2014 By the end of this lecture... You will be able to: Draw box models for real-world
More informationPROBABILITY AND SAMPLING DISTRIBUTIONS
PROBABILITY AND SAMPLING DISTRIBUTIONS SEEMA JAGGI AND P.K. BATRA Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 0 0 seema@iasri.res.in. Introduction The concept of probability
More informationsocscimajor yes no TOTAL female 25 35 60 male 30 27 57 TOTAL 55 62 117
Review for Final Stat 10 (1) The table below shows data for a sample of students from UCLA. (a) What percent of the sampled students are male? 57/117 (b) What proportion of sampled students are social
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationCOMMON CORE STATE STANDARDS FOR
COMMON CORE STATE STANDARDS FOR Mathematics (CCSSM) High School Statistics and Probability Mathematics High School Statistics and Probability Decisions or predictions are often based on data numbers in
More informationSummarizing and Displaying Categorical Data
Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency
More informationLecture 5 : The Poisson Distribution
Lecture 5 : The Poisson Distribution Jonathan Marchini November 10, 2008 1 Introduction Many experimental situations occur in which we observe the counts of events within a set unit of time, area, volume,
More informationSTATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI
STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members
More informationHYPOTHESIS TESTING WITH SPSS:
HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER
More informationc. Construct a boxplot for the data. Write a one sentence interpretation of your graph.
MBA/MIB 5315 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?
More informationT O P I C 1 2 Techniques and tools for data analysis Preview Introduction In chapter 3 of Statistics In A Day different combinations of numbers and types of variables are presented. We go through these
More informationHypothesis Testing for Beginners
Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easy-to-read notes
More informationof course the mean is p. That is just saying the average sample would have 82% answering
Sampling Distribution for a Proportion Start with a population, adult Americans and a binary variable, whether they believe in God. The key parameter is the population proportion p. In this case let us
More informationChapter 26: Tests of Significance
Chapter 26: Tests of Significance Procedure: 1. State the null and alternative in words and in terms of a box model. 2. Find the test statistic: z = observed EV. SE 3. Calculate the P-value: The area under
More informationChapter 23 Inferences About Means
Chapter 23 Inferences About Means Chapter 23 - Inferences About Means 391 Chapter 23 Solutions to Class Examples 1. See Class Example 1. 2. We want to know if the mean battery lifespan exceeds the 300-minute
More informationSTT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
More informationAnalyzing Portfolio Expected Loss
Analyzing Portfolio Expected Loss In this white paper we discuss the methodologies that Visible Equity employs in the calculation of portfolio expected loss. Portfolio expected loss calculations combine
More informationThe Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)
Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,
More informationTesting Research and Statistical Hypotheses
Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you
More informationIntroduction to Hypothesis Testing OPRE 6301
Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about
More informationWhat is the purpose of this document? What is in the document? How do I send Feedback?
This document is designed to help North Carolina educators teach the Common Core (Standard Course of Study). NCDPI staff are continually updating and improving these tools to better serve teachers. Statistics
More information2. Discrete random variables
2. Discrete random variables Statistics and probability: 2-1 If the chance outcome of the experiment is a number, it is called a random variable. Discrete random variable: the possible outcomes can be
More informationChapter 7 Section 7.1: Inference for the Mean of a Population
Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used
More informationMTH 140 Statistics Videos
MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative
More information