Statistics 522: Sampling and Survey Techniques. Topic 6. There are situations where this can give much better results.

Size: px
Start display at page:

Download "Statistics 522: Sampling and Survey Techniques. Topic 6. There are situations where this can give much better results."

Transcription

1 Topic Overview Statistics 522: Sampling and Survey Techniques This topic will cover Sampling with unequal probabilities Sampling one primary sampling unit One-stage sampling with replacement Topic 6 Unequal probabilities Recall π i is the probability that unit i is selected as part of the sample. Most designs we have studied so far have the π i equal. Now we consider general designs where the π i can vary with i. There are situations where this can give much better results. Example 6.1 Survey of nursing home residents in Philadelphia to determine preferences on lifesustaining treatments 294 nursing homes with a total of 37,652 beds (number of residents not known at the planning stage) Use cluster sampling Suppose we choose an SRS of the 294 nursing homes and then an SRS of 10 residents of each selected home. A nursing home with 20 beds has the same probability of being sampled as a nursing home with 1000 beds. 10 residents from the 20 bed home represent fewer people than 10 residents from 1000 bed home. 1

2 Self-weighting This procedure gives a sample that is not self-weighted. Alternatives that are self-weighted. A one-stage cluster sample Sample a fixed percentage of the residents of each selected nursing home. The two-stage cluster design The two-stage cluster design (SRS of homes, then equal proportion SRS of residents in each selected home) Gives a mathematically valid estimator SRS at first stage Three shortcomings: We would expect t i to be proportional to the number of beds in nursing home i, so estimators will have large variance (M i ). Equal percentage sampling in each selected home may be difficult to administer. Cost is not known in advance (dont know if you will get large or small homes in sample). The study They drew a sample of 57 nursing homes with probabilities proportional to the number of beds. Then they took an SRS of 30 beds (and their occupants) from a list of all beds within each selected nursing home. Properties Each bed is equally likely to be in the sample (note beds vs occupants). The cost is known before selecting the sample. The same number of interviews is taken at each nursing home. The estimators will have smaller variance 2

3 Key ideas When sampling with unequal probabilities, we deliberately vary the selection probabilities. We compensate by using weights in the estimation. The key is that we know the selection probabilities Notation The probability that psu i is in the sample is π i. The probability that psu i is selected on the first draw is ψ i. We will consider an artificial situation where n =1,soπ i = ψ i. Sampling one psu Sample size is n =1. Suppose we are interested in estimating the population total. t i is the total for psu i. To illustrate the ideas, we will assume that we know the whole population. The Example N = 4 supermarkets Size (in square meters) varies. Select n = 1 with probabilities proportional to size. Record total sales Using the data from one store we want to estimate total sales for the four stores in the population. The population Store Size ψ i t i A 100 1/16 11 B 200 2/16 20 C 300 3/16 24 D / Total

4 Weights The weights w i are the inverses of the selection probabilities ψ i. The weighted estimator of the population total is ˆt ψ = w i t i. There are four possible samples. We calculate ˆt ψ for each. The samples Sample ψ i w i t i ˆt ψ A 1/ B 2/ C 3/16 16/ D 10/16 16/ Sampling distribution of the estimate ˆt ψ Sample ψ i ˆt ψ 1 1/ / / / Mean of the sampling distribution of ˆt ψ So ˆt ψ is unbiased. This will always be true. Eˆt ψ = = 300 = t 16 Eˆt ψ = ψ i w i t i = t i Variance of the sampling distribution ˆt ψ Var(ˆt ψ )= 1 16 ( ) ( ) ( ) ( )2 = Compare with the variance for an SRS: Var(ˆt SRS )= 1 4 ( ) ( ) ( ) ( )2 =

5 Interpretation Store D is the largest and we expect it to account for a large portion of the total sales. Therefore, we give it a higher probability of being in the sample (10/16) than it would have with an SRS (1/4). If it is selected, we multiply its sales by (16/10) to estimate total sales. One-stage sampling with replacement Suppose n>1 and we sample with replacement. This implies π i =1 (1 ψ i ) n. Probability that item i is selected on the first draw is the same as the probability that item i is selected on any other draw. Sampling with replacement gives us n independent estimates of the population total, one for each unit in sample. We average these n estimates. Estimated variance is variance of the estimates divided by n Example 6.2 N = 15 classes of elementary stat M i students in class i (i = 1 to 15) Values of M i range from 20 to 100. We want a sample of 5 classes. Each student in the selected classes will fill out a questionnaire. (It is possible for the same class to be selected more than once.) Randomization There are a total of 647 students in these classes. Select 5 random numbers between 1 and 647. Think about ordering the students by class. Each random number corresponds to a student and the corresponding class will be in the sample. 5

6 This method This method is called the cumulative-size method. It is based on M 1, M 1 + M 2, M 1 + M 2 + M 3,... Analternativeistousethecumulativesumsoftheψ i and select random numbers between 0 and 1. For this example, ψ i = M i /647 Alternative Systematic sampling is often used as an alternative in this setting. The basic idea is the same. Not technically sampling with replacement Works well as systematic sampling works well. See page 186 for details. Lahiris method Involves two stages of randomization Rejection sampling: corresponds to classroom problem in Problem Set 2. Can be inefficient. See page 187 for details Estimation Theory Let Q i be the number of times unit i occurs in the sample. Then ˆt ψ = 1 n Qi t i /ψ i. The estimated variance of ˆt i is 1 Qi ( t i ˆt ψ ) 2 n(n 1) ψ i The estimate and its estimated variance are both unbiased. Choosing the selection probabilities We want small variance for our estimator. Often, t i is related to the size of the psu. We can take ψ i proportional to M i or some other measure of the size of psu i. 6

7 PPS This procedure is called sampling with probability proportional to size (pps). The formulas for the estimate and variance can be simplified for this special case. See page 190 for details See Example 6.5 on pages ψ i = M i K t i ψ i = Kȳ i Two-stage sampling with replacement Basic ideas are very similar to one-stage sampling. ψ i is the probability that psu i is selected on the first (or any) draw. We take a sample of m i ssus from each selected psu. Sampling ssu s Usually we use an SRS. Alternatives include systematic sampling any other probability sampling method Note if a psu is selected more than once, a separate independent second stage sample is required. Estimates and SE s Weights are used to make the estimators unbiased. Formulas are similar to those for one-stage. See (6.8) and (6.9) on page 192 7

8 Outline of the procedure 1. Determine the ψ i. 2. Select the n psus (with replacement). 3. Select the ssus. 4. Estimate the t for each selected psu, ˆt ψ = weight ˆt 5. The average of these is ˆt ψ. 6. SE is the standard error of these (sd/ n). Unequal probability sampling without replacement ψ i is the probability of selection on the first draw. The probability of selection on later draws depends on which units were selected on earlier draws. Estimation π i is called the inclusion probability. ( pop π i = n) π i,j is the probability that both psu i and psu j are in the sample. ( j i π i,j =(n 1)π i ) Weights (inverse of selection probability) we use π i /n in place of ψ i (with replacement) The recommended procedure is to use the Horvitz-Thompson (HT) estimator and the associated SE. (ˆt HT = sam ˆt i /π i ) See page for details. This estimator can be generalized to other designs that do not use replacement. Randomization Theory Framework is Probability sampling without replacement for the psus for the first stage Sampling at the second stage is independent of sampling at the first stage 8

9 Horvitz-Thompson Randomization theory can be used to prove the Horvitz-Thompson Theorem. Expected value of the estimator is t. Formula for the variance of the estimator The estimator ˆt HT = ˆt i /π i where the sum is over the psu s selected in the first stage. Idea behind proofs is to condition on which psus are in the sample. Study pages Model One-way random effects anova model where Y i,j = A i + ɛ i,j the A i are random variables with mean µ and variance σa 2 the ɛ i,j are random variables with mean 0 and variance σ 2. the A i and the ɛ i,j are uncorrelated The pps estimator π i = nm i /K the inclusion probability We rewrite this as a weighted estimator. where w i,j = K nm i ˆT P = K nm i ˆTi ˆt i = M i m i Yi,j ˆt P = w i,j Y i,j Take expected values to show that the estimator is unbiased. 9

10 Variance The variance can be computed. See page 211 The variance depends on which psu s are selected through the M i. The variance is smallest when psu s with the largest M i are chosen. Recall Estimate of population total is the weighted average of the ˆt i for the selected psus. The weights w i are the inverses of the probabilities of selection. Elephants A circus needed to ship its 50 elephants. They needed to estimate the total weight of the animals. It is not easy to weigh 50 elephants and they were in a hurry. They had data from three years ago. Sample NO The owner wanted to base the estimate on a sample. Dumbo had a weight equal to the average three years ago. The owner wanted to weigh Dumbo and multiply by 50. The statistician said: You have to use probability sampling and the Horvitz-Thompson estimator. They compromised: The probability of selecting Dumbo was set as 99/100. The probability of selecting each of the other elephants was 1/

11 Who was selected NO Dumbo, of course. The owner was happy and said now we can estimate the weight of the 50 elephants as 50 times Dumbos weight, 50y. The statistician said The estimate of the total weight of the 50 elephants should be Dumbos weight divided by his probability of selection. This is y/(99/100) or 100y/99. The theory behind this estimator is rigorous What if The owner asked What if the randomization had selected Jumbo the largest elephant in the herd? The statistician replied 4900y, where y is Jumbos weight. Conclusion The statistician lost his circus job and became a teacher of statistics. bad model; highly variable estimator Due to Basu (1971). 11

Statistics 522: Sampling and Survey Techniques. Topic 5. Consider sampling children in an elementary school.

Statistics 522: Sampling and Survey Techniques. Topic 5. Consider sampling children in an elementary school. Topic Overview Statistics 522: Sampling and Survey Techniques This topic will cover One-stage Cluster Sampling Two-stage Cluster Sampling Systematic Sampling Topic 5 Cluster Sampling: Basics Consider sampling

More information

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population. SAMPLING & INFERENTIAL STATISTICS Sampling is necessary to make inferences about a population. SAMPLING The group that you observe or collect data from is the sample. The group that you make generalizations

More information

Introduction to Sampling. Dr. Safaa R. Amer. Overview. for Non-Statisticians. Part II. Part I. Sample Size. Introduction.

Introduction to Sampling. Dr. Safaa R. Amer. Overview. for Non-Statisticians. Part II. Part I. Sample Size. Introduction. Introduction to Sampling for Non-Statisticians Dr. Safaa R. Amer Overview Part I Part II Introduction Census or Sample Sampling Frame Probability or non-probability sample Sampling with or without replacement

More information

1 Sufficient statistics

1 Sufficient statistics 1 Sufficient statistics A statistic is a function T = rx 1, X 2,, X n of the random sample X 1, X 2,, X n. Examples are X n = 1 n s 2 = = X i, 1 n 1 the sample mean X i X n 2, the sample variance T 1 =

More information

Multivariate Analysis of Variance (MANOVA): I. Theory

Multivariate Analysis of Variance (MANOVA): I. Theory Gregory Carey, 1998 MANOVA: I - 1 Multivariate Analysis of Variance (MANOVA): I. Theory Introduction The purpose of a t test is to assess the likelihood that the means for two groups are sampled from the

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

3. Mathematical Induction

3. Mathematical Induction 3. MATHEMATICAL INDUCTION 83 3. Mathematical Induction 3.1. First Principle of Mathematical Induction. Let P (n) be a predicate with domain of discourse (over) the natural numbers N = {0, 1,,...}. If (1)

More information

Lecture 13 - Basic Number Theory.

Lecture 13 - Basic Number Theory. Lecture 13 - Basic Number Theory. Boaz Barak March 22, 2010 Divisibility and primes Unless mentioned otherwise throughout this lecture all numbers are non-negative integers. We say that A divides B, denoted

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Comparing Alternate Designs For A Multi-Domain Cluster Sample

Comparing Alternate Designs For A Multi-Domain Cluster Sample Comparing Alternate Designs For A Multi-Domain Cluster Sample Pedro J. Saavedra, Mareena McKinley Wright and Joseph P. Riley Mareena McKinley Wright, ORC Macro, 11785 Beltsville Dr., Calverton, MD 20705

More information

SAMPLE DESIGN RESEARCH FOR THE NATIONAL NURSING HOME SURVEY

SAMPLE DESIGN RESEARCH FOR THE NATIONAL NURSING HOME SURVEY SAMPLE DESIGN RESEARCH FOR THE NATIONAL NURSING HOME SURVEY Karen E. Davis National Center for Health Statistics, 6525 Belcrest Road, Room 915, Hyattsville, MD 20782 KEY WORDS: Sample survey, cost model

More information

Abstract: We describe the beautiful LU factorization of a square matrix (or how to write Gaussian elimination in terms of matrix multiplication).

Abstract: We describe the beautiful LU factorization of a square matrix (or how to write Gaussian elimination in terms of matrix multiplication). MAT 2 (Badger, Spring 202) LU Factorization Selected Notes September 2, 202 Abstract: We describe the beautiful LU factorization of a square matrix (or how to write Gaussian elimination in terms of matrix

More information

2.3. Finding polynomial functions. An Introduction:

2.3. Finding polynomial functions. An Introduction: 2.3. Finding polynomial functions. An Introduction: As is usually the case when learning a new concept in mathematics, the new concept is the reverse of the previous one. Remember how you first learned

More information

SECTION 10-2 Mathematical Induction

SECTION 10-2 Mathematical Induction 73 0 Sequences and Series 6. Approximate e 0. using the first five terms of the series. Compare this approximation with your calculator evaluation of e 0.. 6. Approximate e 0.5 using the first five terms

More information

Geometric Series and Annuities

Geometric Series and Annuities Geometric Series and Annuities Our goal here is to calculate annuities. For example, how much money do you need to have saved for retirement so that you can withdraw a fixed amount of money each year for

More information

CHAPTER 14 NONPARAMETRIC TESTS

CHAPTER 14 NONPARAMETRIC TESTS CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

CHAPTER 7 PRESENTATION AND ANALYSIS OF THE RESEARCH FINDINGS

CHAPTER 7 PRESENTATION AND ANALYSIS OF THE RESEARCH FINDINGS CHAPTER 7 PRESENTATION AND ANALYSIS OF THE RESEARCH FINDINGS 7.1 INTRODUCTION Chapter 6 detailed the methodology that was used to determine whether educators are teaching what management accountants need

More information

Sect 6.7 - Solving Equations Using the Zero Product Rule

Sect 6.7 - Solving Equations Using the Zero Product Rule Sect 6.7 - Solving Equations Using the Zero Product Rule 116 Concept #1: Definition of a Quadratic Equation A quadratic equation is an equation that can be written in the form ax 2 + bx + c = 0 (referred

More information

26 Integers: Multiplication, Division, and Order

26 Integers: Multiplication, Division, and Order 26 Integers: Multiplication, Division, and Order Integer multiplication and division are extensions of whole number multiplication and division. In multiplying and dividing integers, the one new issue

More information

Matrices 2. Solving Square Systems of Linear Equations; Inverse Matrices

Matrices 2. Solving Square Systems of Linear Equations; Inverse Matrices Matrices 2. Solving Square Systems of Linear Equations; Inverse Matrices Solving square systems of linear equations; inverse matrices. Linear algebra is essentially about solving systems of linear equations,

More information

CONTINUED FRACTIONS AND FACTORING. Niels Lauritzen

CONTINUED FRACTIONS AND FACTORING. Niels Lauritzen CONTINUED FRACTIONS AND FACTORING Niels Lauritzen ii NIELS LAURITZEN DEPARTMENT OF MATHEMATICAL SCIENCES UNIVERSITY OF AARHUS, DENMARK EMAIL: niels@imf.au.dk URL: http://home.imf.au.dk/niels/ Contents

More information

Designing a Sampling Method for a Survey of Iowa High School Seniors

Designing a Sampling Method for a Survey of Iowa High School Seniors Designing a Sampling Method for a Survey of Iowa High School Seniors Kyle A. Hewitt and Michael D. Larsen, Iowa State University Department of Statistics, Snedecor Hall, Ames, Iowa 50011-1210, larsen@iastate.edu

More information

10/14/08 ON NUMBERS EQUAL TO THE SUM OF TWO SQUARES IN MORE THAN ONE WAY

10/14/08 ON NUMBERS EQUAL TO THE SUM OF TWO SQUARES IN MORE THAN ONE WAY 1 10/14/08 ON NUMBERS EQUAL TO THE SUM OF TWO SQUARES IN MORE THAN ONE WAY Ming-Sun Li, Kathryn Robertson, Thomas J. Osler, Abdul Hassen, Christopher S. Simons and Marcus Wright Department of Mathematics

More information

A Second Course in Mathematics Concepts for Elementary Teachers: Theory, Problems, and Solutions

A Second Course in Mathematics Concepts for Elementary Teachers: Theory, Problems, and Solutions A Second Course in Mathematics Concepts for Elementary Teachers: Theory, Problems, and Solutions Marcel B. Finan Arkansas Tech University c All Rights Reserved First Draft February 8, 2006 1 Contents 25

More information

Stat 20: Intro to Probability and Statistics

Stat 20: Intro to Probability and Statistics Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum

More information

WHERE DOES THE 10% CONDITION COME FROM?

WHERE DOES THE 10% CONDITION COME FROM? 1 WHERE DOES THE 10% CONDITION COME FROM? The text has mentioned The 10% Condition (at least) twice so far: p. 407 Bernoulli trials must be independent. If that assumption is violated, it is still okay

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES

INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES Hukum Chandra Indian Agricultural Statistics Research Institute, New Delhi-110012 1. INTRODUCTION A sample survey is a process for collecting

More information

Sampling Probability and Inference

Sampling Probability and Inference PART II Sampling Probability and Inference The second part of the book looks into the probabilistic foundation of statistical analysis, which originates in probabilistic sampling, and introduces the reader

More information

Sampling Distributions

Sampling Distributions Sampling Distributions You have seen probability distributions of various types. The normal distribution is an example of a continuous distribution that is often used for quantitative measures such as

More information

What Is an Intracluster Correlation Coefficient? Crucial Concepts for Primary Care Researchers

What Is an Intracluster Correlation Coefficient? Crucial Concepts for Primary Care Researchers What Is an Intracluster Correlation Coefficient? Crucial Concepts for Primary Care Researchers Shersten Killip, MD, MPH 1 Ziyad Mahfoud, PhD 2 Kevin Pearce, MD, MPH 1 1 Department of Family Practice and

More information

Welcome back to EDFR 6700. I m Jeff Oescher, and I ll be discussing quantitative research design with you for the next several lessons.

Welcome back to EDFR 6700. I m Jeff Oescher, and I ll be discussing quantitative research design with you for the next several lessons. Welcome back to EDFR 6700. I m Jeff Oescher, and I ll be discussing quantitative research design with you for the next several lessons. I ll follow the text somewhat loosely, discussing some chapters out

More information

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance 2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

More information

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Review Jeopardy. Blue vs. Orange. Review Jeopardy Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

13: Additional ANOVA Topics. Post hoc Comparisons

13: Additional ANOVA Topics. Post hoc Comparisons 13: Additional ANOVA Topics Post hoc Comparisons ANOVA Assumptions Assessing Group Variances When Distributional Assumptions are Severely Violated Kruskal-Wallis Test Post hoc Comparisons In the prior

More information

LOGNORMAL MODEL FOR STOCK PRICES

LOGNORMAL MODEL FOR STOCK PRICES LOGNORMAL MODEL FOR STOCK PRICES MICHAEL J. SHARPE MATHEMATICS DEPARTMENT, UCSD 1. INTRODUCTION What follows is a simple but important model that will be the basis for a later study of stock prices as

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation

More information

The Variance of Sample Variance for a Finite Population

The Variance of Sample Variance for a Finite Population The Variance of Sample Variance for a Finite Population Eungchun Cho November 11, 004 Key Words: variance of variance, variance estimator, sampling variance, randomization variance, moments Abstract The

More information

Unit 18 Determinants

Unit 18 Determinants Unit 18 Determinants Every square matrix has a number associated with it, called its determinant. In this section, we determine how to calculate this number, and also look at some of the properties of

More information

LS.6 Solution Matrices

LS.6 Solution Matrices LS.6 Solution Matrices In the literature, solutions to linear systems often are expressed using square matrices rather than vectors. You need to get used to the terminology. As before, we state the definitions

More information

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Typical Inference Problem Definition of Sampling Distribution 3 Approaches to Understanding Sampling Dist. Applying 68-95-99.7 Rule

More information

Applications of the Pythagorean Theorem

Applications of the Pythagorean Theorem 9.5 Applications of the Pythagorean Theorem 9.5 OBJECTIVE 1. Apply the Pythagorean theorem in solving problems Perhaps the most famous theorem in all of mathematics is the Pythagorean theorem. The theorem

More information

Week 3&4: Z tables and the Sampling Distribution of X

Week 3&4: Z tables and the Sampling Distribution of X Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal

More information

WRITING PROOFS. Christopher Heil Georgia Institute of Technology

WRITING PROOFS. Christopher Heil Georgia Institute of Technology WRITING PROOFS Christopher Heil Georgia Institute of Technology A theorem is just a statement of fact A proof of the theorem is a logical explanation of why the theorem is true Many theorems have this

More information

Sampling and Sampling Distributions

Sampling and Sampling Distributions Sampling and Sampling Distributions Random Sampling A sample is a group of objects or readings taken from a population for counting or measurement. We shall distinguish between two kinds of populations

More information

Testing Surface Disinfectants

Testing Surface Disinfectants Testing Surface Disinfectants This series of knowledge sharing articles is a project of the Standardized Biofilm Methods Laboratory in the CBE KSA-SM-06 Enumerating viable cells by pooling counts for several

More information

Discrete Math in Computer Science Homework 7 Solutions (Max Points: 80)

Discrete Math in Computer Science Homework 7 Solutions (Max Points: 80) Discrete Math in Computer Science Homework 7 Solutions (Max Points: 80) CS 30, Winter 2016 by Prasad Jayanti 1. (10 points) Here is the famous Monty Hall Puzzle. Suppose you are on a game show, and you

More information

6.3 Conditional Probability and Independence

6.3 Conditional Probability and Independence 222 CHAPTER 6. PROBABILITY 6.3 Conditional Probability and Independence Conditional Probability Two cubical dice each have a triangle painted on one side, a circle painted on two sides and a square painted

More information

Power and sample size in multilevel modeling

Power and sample size in multilevel modeling Snijders, Tom A.B. Power and Sample Size in Multilevel Linear Models. In: B.S. Everitt and D.C. Howell (eds.), Encyclopedia of Statistics in Behavioral Science. Volume 3, 1570 1573. Chicester (etc.): Wiley,

More information

Lecture Notes on Polynomials

Lecture Notes on Polynomials Lecture Notes on Polynomials Arne Jensen Department of Mathematical Sciences Aalborg University c 008 Introduction These lecture notes give a very short introduction to polynomials with real and complex

More information

Real Time Sampling in Patient Surveys

Real Time Sampling in Patient Surveys Real Time Sampling in Patient Surveys Ronaldo Iachan, 1 Tonja Kyle, 1 Deirdre Farrell 1 1 ICF Macro, 11785 Beltsville Drive, Suite 300 Calverton, MD 20705 Abstract The article describes a real-time sampling

More information

Factors Galore C: Prime Factorization

Factors Galore C: Prime Factorization Concept Number sense Activity 4 Factors Galore C: Prime Factorization Students will use the TI-73 calculator s ability to simplify fractions to find the prime factorization of a number. Skills Simplifying

More information

Measures of Central Tendency and Variability: Summarizing your Data for Others

Measures of Central Tendency and Variability: Summarizing your Data for Others Measures of Central Tendency and Variability: Summarizing your Data for Others 1 I. Measures of Central Tendency: -Allow us to summarize an entire data set with a single value (the midpoint). 1. Mode :

More information

Descriptive Methods Ch. 6 and 7

Descriptive Methods Ch. 6 and 7 Descriptive Methods Ch. 6 and 7 Purpose of Descriptive Research Purely descriptive research describes the characteristics or behaviors of a given population in a systematic and accurate fashion. Correlational

More information

STATISTICAL SIGNIFICANCE OF RANKING PARADOXES

STATISTICAL SIGNIFICANCE OF RANKING PARADOXES STATISTICAL SIGNIFICANCE OF RANKING PARADOXES Anna E. Bargagliotti and Raymond N. Greenwell Department of Mathematical Sciences and Department of Mathematics University of Memphis and Hofstra University

More information

Elementary Statistics and Inference. Elementary Statistics and Inference. 17 Expected Value and Standard Error. 22S:025 or 7P:025.

Elementary Statistics and Inference. Elementary Statistics and Inference. 17 Expected Value and Standard Error. 22S:025 or 7P:025. Elementary Statistics and Inference S:05 or 7P:05 Lecture Elementary Statistics and Inference S:05 or 7P:05 Chapter 7 A. The Expected Value In a chance process (probability experiment) the outcomes of

More information

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010 MONT 07N Understanding Randomness Solutions For Final Examination May, 00 Short Answer (a) (0) How are the EV and SE for the sum of n draws with replacement from a box computed? Solution: The EV is n times

More information

Rank-Based Non-Parametric Tests

Rank-Based Non-Parametric Tests Rank-Based Non-Parametric Tests Reminder: Student Instructional Rating Surveys You have until May 8 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs

More information

STRUTS: Statistical Rules of Thumb. Seattle, WA

STRUTS: Statistical Rules of Thumb. Seattle, WA STRUTS: Statistical Rules of Thumb Gerald van Belle Departments of Environmental Health and Biostatistics University ofwashington Seattle, WA 98195-4691 Steven P. Millard Probability, Statistics and Information

More information

8 Primes and Modular Arithmetic

8 Primes and Modular Arithmetic 8 Primes and Modular Arithmetic 8.1 Primes and Factors Over two millennia ago already, people all over the world were considering the properties of numbers. One of the simplest concepts is prime numbers.

More information

Chapter 20: chance error in sampling

Chapter 20: chance error in sampling Chapter 20: chance error in sampling Context 2 Overview................................................................ 3 Population and parameter..................................................... 4

More information

STAT 830 Convergence in Distribution

STAT 830 Convergence in Distribution STAT 830 Convergence in Distribution Richard Lockhart Simon Fraser University STAT 830 Fall 2011 Richard Lockhart (Simon Fraser University) STAT 830 Convergence in Distribution STAT 830 Fall 2011 1 / 31

More information

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem Time on my hands: Coin tosses. Problem Formulation: Suppose that I have

More information

GRADE 6 MATH: GROCERY SHOPPING AND THE QUILT OF A MATH TEACHER

GRADE 6 MATH: GROCERY SHOPPING AND THE QUILT OF A MATH TEACHER GRADE 6 MATH: GROCERY SHOPPING AND THE QUILT OF A MATH TEACHER UNIT OVERVIEW This unit contains a curriculum- embedded Common Core aligned task and instructional supports. The task is embedded in a 2 3

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY

EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 2015 MODULE 1 : Data collection and interpretation Time allowed: One and a half hours Candidates should answer THREE

More information

Annex 6 BEST PRACTICE EXAMPLES FOCUSING ON SAMPLE SIZE AND RELIABILITY CALCULATIONS AND SAMPLING FOR VALIDATION/VERIFICATION. (Version 01.

Annex 6 BEST PRACTICE EXAMPLES FOCUSING ON SAMPLE SIZE AND RELIABILITY CALCULATIONS AND SAMPLING FOR VALIDATION/VERIFICATION. (Version 01. Page 1 BEST PRACTICE EXAMPLES FOCUSING ON SAMPLE SIZE AND RELIABILITY CALCULATIONS AND SAMPLING FOR VALIDATION/VERIFICATION (Version 01.1) I. Introduction 1. The clean development mechanism (CDM) Executive

More information

ANALYTIC AND REPORTING GUIDELINES

ANALYTIC AND REPORTING GUIDELINES ANALYTIC AND REPORTING GUIDELINES The National Health and Nutrition Examination Survey (NHANES) Last Update: December, 2005 Last Correction, September, 2006 National Center for Health Statistics Centers

More information

Normal distribution. ) 2 /2σ. 2π σ

Normal distribution. ) 2 /2σ. 2π σ Normal distribution The normal distribution is the most widely known and used of all distributions. Because the normal distribution approximates many natural phenomena so well, it has developed into a

More information

Simple Random Sampling

Simple Random Sampling Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for

More information

1 Basic ANOVA concepts

1 Basic ANOVA concepts Math 143 ANOVA 1 Analysis of Variance (ANOVA) Recall, when we wanted to compare two population means, we used the 2-sample t procedures. Now let s expand this to compare k 3 population means. As with the

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document

More information

FRACTIONS OPERATIONS

FRACTIONS OPERATIONS FRACTIONS OPERATIONS Summary 1. Elements of a fraction... 1. Equivalent fractions... 1. Simplification of a fraction... 4. Rules for adding and subtracting fractions... 5. Multiplication rule for two fractions...

More information

Year 9 mathematics test

Year 9 mathematics test Ma KEY STAGE 3 Year 9 mathematics test Tier 6 8 Paper 1 Calculator not allowed First name Last name Class Date Please read this page, but do not open your booklet until your teacher tells you to start.

More information

You know from calculus that functions play a fundamental role in mathematics.

You know from calculus that functions play a fundamental role in mathematics. CHPTER 12 Functions You know from calculus that functions play a fundamental role in mathematics. You likely view a function as a kind of formula that describes a relationship between two (or more) quantities.

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses Michael R. Powers[ ] Temple University and Tsinghua University Thomas Y. Powers Yale University [June 2009] Abstract We propose a

More information

(Following Paper ID and Roll No. to be filled in your Answer Book) Roll No. METHODOLOGY

(Following Paper ID and Roll No. to be filled in your Answer Book) Roll No. METHODOLOGY (Following Paper ID and Roll No. to be filled in your Answer Book) Roll No. ~ n.- (SEM.J:V VEN SEMESTER THEORY EXAMINATION, 2009-2010 RESEARCH METHODOLOGY Note: The question paper contains three parts.

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Chapter 2 Portfolio Management and the Capital Asset Pricing Model

Chapter 2 Portfolio Management and the Capital Asset Pricing Model Chapter 2 Portfolio Management and the Capital Asset Pricing Model In this chapter, we explore the issue of risk management in a portfolio of assets. The main issue is how to balance a portfolio, that

More information

Basic Proof Techniques

Basic Proof Techniques Basic Proof Techniques David Ferry dsf43@truman.edu September 13, 010 1 Four Fundamental Proof Techniques When one wishes to prove the statement P Q there are four fundamental approaches. This document

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS 1. SYSTEMS OF EQUATIONS AND MATRICES 1.1. Representation of a linear system. The general system of m equations in n unknowns can be written a 11 x 1 + a 12 x 2 +

More information

Sample Size Issues for Conjoint Analysis

Sample Size Issues for Conjoint Analysis Chapter 7 Sample Size Issues for Conjoint Analysis I m about to conduct a conjoint analysis study. How large a sample size do I need? What will be the margin of error of my estimates if I use a sample

More information

Introduction to Matrix Algebra

Introduction to Matrix Algebra Psychology 7291: Multivariate Statistics (Carey) 8/27/98 Matrix Algebra - 1 Introduction to Matrix Algebra Definitions: A matrix is a collection of numbers ordered by rows and columns. It is customary

More information

9.2 Summation Notation

9.2 Summation Notation 9. Summation Notation 66 9. Summation Notation In the previous section, we introduced sequences and now we shall present notation and theorems concerning the sum of terms of a sequence. We begin with a

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Sampling strategies *

Sampling strategies * UNITED NATIONS SECRETARIAT ESA/STAT/AC.93/2 Statistics Division 03 November 2003 Expert Group Meeting to Review the Draft Handbook on Designing of Household Sample Surveys 3-5 December 2003 English only

More information

Notes on Determinant

Notes on Determinant ENGG2012B Advanced Engineering Mathematics Notes on Determinant Lecturer: Kenneth Shum Lecture 9-18/02/2013 The determinant of a system of linear equations determines whether the solution is unique, without

More information

Survey Data Analysis in Stata

Survey Data Analysis in Stata Survey Data Analysis in Stata Jeff Pitblado Associate Director, Statistical Software StataCorp LP Stata Conference DC 2009 J. Pitblado (StataCorp) Survey Data Analysis DC 2009 1 / 44 Outline 1 Types of

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,

More information

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS Systems of Equations and Matrices Representation of a linear system The general system of m equations in n unknowns can be written a x + a 2 x 2 + + a n x n b a

More information