The written Master s Examination

Size: px
Start display at page:

Download "The written Master s Examination"

Transcription

1 The written Master s Examination Option Statistics and Probability Fall Full points may be obtained for correct answers to 8 questions. Each numbered question (which may have several parts) is worth the same number of points. All answers will be graded, but the score for the examination will be the sum of the scores of your best 8 solutions. Use separate answer sheets for each question. DO NOT PUT YOUR NAME ON YOUR ANSWER SHEETS. When you have finished, insert all your answer sheets into the envelope provided, then seal and print your name on it. Any student whose answers need clarification may be required to submit to an oral examination.

2 MS Exam, Option Probability and Statistics, FALL. (STAT 4) Let ~ N( μ, σ ) X and g be a differentiable function, and E g'( X ) <. (i) Show that cov( X, g( X)) = E( g( X)( X μ)) = σ E( g'( X)). 3 (ii) Calculate EX ( ).. (STAT 4) Let X be one observation from a population with pdf x θ e f( x θ) =, < x<, < θ <. x θ ( + e ) (i) Construct a most powerful size α test to test H : θ = versus : H θ =. (ii) Construct a UMP size α test to test H : θ versus : H θ >. a a 3. (STAT 4) Suppose X,...,X n is a random sample from the exponential distribution with parameter θ>: θx θ e, x > f ( x; θ ) =,, otherwise (i) Show that Y = n X i i= follows a Gamma distribution. (ii) Show that Y is complete sufficient for θ. (iii) Derive the minimum variance unbiased estimator for θ. Justify your answer. 4. (STAT 46) A group of researchers want to determine whether there is a direct relationship between math and computer anxiety. The test scored are shown in the table for 8 students, with larger scores for indicating greater amount of the trait. Student A B C D E F G H Math Anxiety Computer Anxiety

3 MS Exam, Option Probability and Statistics, FALL (STAT 46 cont.) (a). Suppose both scores are symmetrically distributed. State your hypothesis and computer p-value, determine whether the students have the same median of computer anxiety as of math anxiety. (b). Calculate the association measure between computer anxiety as math anxiety, and test if there is any relationship between the two sets of scores. 5. (STAT 43) The purpose is to unbiasedly estimate the proportion of left-handers among school-going students in a large co-ed community school. It is known that on the whole there are 35 students enrolled in the school and 6% of these students are enrolled in the science stream. Assume stratified simple random sampling with proportional allocation for both boys-girls group classification as also for science-arts stream classification. Based on a stratified simple random sample of 35 students on the whole, the following table has been prepared : Boys Girls 5 sample left-handers in science stream 8 in arts stream 5 8 (a) For both the groups, estimate the total number of left-handers in each stream : science and arts. (b) For the science stream as a whole, find an estimate of the total number of left-handers and compute its estimated se. (c) For the arts stream as a whole, find an estimate of the proportion of left-handers and its 95% confidence interval. 6. (STAT 46) Customers arrive at a service facility according to a Poisson process with rate λ (customers/hour). Let X () t be the number of customers that have arrived up to time t. Let W, W,... be the successive arrival times of the customers. Determine: (a). E ( W X () t = ), (b). E W + () = W X t, (c). E ( W3 X () t = ). 3

4 MS Exam, Option Probability and Statistics, FALL 7. (STAT 47) Consider max x + 3x 7x3 such that x x = 3x + x x x3 9 x, x, x 3 unrestricted. (i). Write down the dual to the above linear programming problem. (ii). Write down the dual to the dual program you have obtained. 8. (STAT 47) In the last minute some doctors in New York are hectically trying to attend a conference in Los Angeles and are willing to go with connecting flights. A travel agent finds the following information: From To Number of seats available New York Chicago 5 New York Houston 7 Houston Atlanta 8 Chicago Atlanta 6 Chicago Denver Denver Los Angeles 5 Atlanta Los Angeles 4 Use Ford -Fulkerson algorithm to find the maximum number of doctors who could go to Los Angeles via these connecting flights. 4

5 MS Exam, Option Probability and Statistics, FALL 9. (STAT 473) Seven year old kids Ann, Beth, Cindy, Debbie, and Emma went on a field trip to a factory making kitchen utensils. At the end of the field trip the kids were allowed to pick either a cup or a saucer as memento. Ann opted for a saucer and the rest four opted for a cup. While coming out of the factory they noticed a guy paying $5 for a cup and a saucer. There is an ice cream shop charging $ per cone ice cream. All children are keen on getting rid of their mementos to buy ice cream recovering part of the expenses from selling their mementos. Ann is approached by all other kids with their cups to be sold. What will be considered fair value by Shapley for the saucer that Ann owns?. (STAT 48) Consider an example of the factorial design. The effects of temperature (factor, two levels are low and high, or and +) and reaction time (factor, two levels are short and long, or and +) on the percent yield of a certain chemical reaction (response Y) are studied. The experiment was replicated (n=) and the order of the eight runs was randomized. The design table and observations are listed below: Run I x x x x Average Yield Individual Observations , , , , 68.7 (a) Estimate the overall mean, main effects, and the interaction. Specify the model you used here, as well as your model assumptions. (b) Test if your estimated effects are significantly from at the significance level.5. For your reference, some critical values of the standard normal distribution are Prob(X >.96) =.5, Prob(X >.645) =.5. 5

6 MS Exam, Option Probability and Statistics, FALL. (STAT 48) Consider a regression model that relates gas mileage and weight of automobiles. Thirtyeight cars were selected, and their weights x (in units of, pounds) and fuel efficiencies MPG (miles per gallon, the response y ) were measured. (a) Given the summary statistics: x = 8. 79, y = 94. 9, x = , i i y i i i i y = , and x = , find the least-squares estimates of the regression coefficients in the simple regression model y = β + βx + ε. (b) Given SSE = ( y i yˆ i ) = which is the sum of squares due to error, along with the summary statistics in (a), construct a 95% confidence interval for β. For your reference, some critical values of t distributions are t(.5; df=38)=.4, t(.5; df=36)=.8. (c) The figure below is the residual plot (residuals versus fitted values) of the simple linear regression model. Based on that, discuss whether this model is appropriate. What other model(s) would you suggest? 6

7 7

8 Statistics 4&4 MS Exam Fall Semester. (STAT4) Let X E g'( X ) < N( μ, σ ) and g be a differentiable function, and. (i) Show that cov( X, g( X)) = E( g( X)( X μ)) = σ E( g'( X)). 3 (ii) Calculate EX ( ). Solution: (i) The first equality follows from the definition of cov( X, g( X )) immediately. To show the second equality, we have ( EgX ( ( )( X μ)) = gx ( )( x )exp dx πσ x μ) μ σ. Using integration by parts with u = g( x) and ( x μ) dv = ( x μ)exp dx σ yields that ( x μ) ( x μ) EgX ( ( )( X μ)) = σ gx ( )exp σ g'( x)exp dx + πσ σ σ ( x μ) = σ g'( x) exp πσ σ = σ Eg ( '( X)). dx 3 (ii) Note that. Let, then g'( X ) = EX ( ) = EX ( ( X μ+ μ)) = EX ( ( X μ)) + μex ( ) X and g( X) = X EX EX X EX 3 ( ) = ( ( μ)) + μ ( ) = EgX X + EX + X ( ( )( μ)) μ(( ( )) var( )) = σ E X + μ μ + σ = μ + μσ 3 ( ) ( ) 3. Remark: the equality in (i) is known as Stein s Lemma.

9 . (STAT4) Let X be one observation from a population with pdf x θ e f( x θ) =, < x<, < θ <. x θ ( + e ) (i) Construct a most powerful size α test to test H : θ = versus H : θ =. (ii) Construct a UMP size α test to test H : θ versus H : a θ >. Solution: a (i) According to N-P Lemma, the most powerful test is to reject H if x x x f( x θ = ) ( + e ) e + e = = e x x x f( x θ = ) e ( + e ) + e k. Note that e + e + e x x is an increasing function in x (by showing its derivative in x is positive), so the most powerful test is to reject H if x k '. To determine k ', implies that k ' = log(( α) / α). (ii) First, for any θ x e α = PX ( k' θ = ) = dx= x ( + e ) + e k ' θ, the likelihood ratio is > k ' x θ θ e θ = e x θ f( x θ = θ) + f( x θ = θ) + e. The derivative of the likelihood ratio is x θ x θ x θ x e θ e > x θ x θ x θ d f( x θ = θ) θ θ d + e θ θ + e e = e =. dx f ( x θ = θ) dx + e + e ( + e ) f( x θ = θ) Therefore, is an increasing function in x (MLR), and the UMP test is to f( x θ = θ ) reject H if x k. This is the same test as in (i), so k = log(( α) / α).

10 Suppose X ; :::; X n is a random sample from the exponential distribution with parameter > : e f(x; ) = x if x > ; otherwise. (i) Show that Y = nx X i follows a Gamma distribution. i= (ii) Show that Y is complete su cient for. (iii) Derive the minimum variance unbiased estimator for : Justify your answer. Solution: (i) The mgf of X is: So, the mgf of Y is Z E e tx = e tx e x dx = E e ty = n t : t This is the mgf of a Gamma distribution with = n; = =: (ii) Since Gamma belongs to the exponential family of distributions, Y is complete su cient for. (iii) E Y = = (n) n : n Z y yn e y dy Hence n Y is unbiased for : It follows from Rao-Blackwell Theorem that n Y is the minimum variance unbiased estimator for :

11 STAT 46 Problem in Fall A group of researchers want to determine wether there is a direct relationship between math and computer anxiety. The test scored are shown in the table for 8 students, with larger scores for indicating greater amount of of the trait. Student A B C D E F G H Math Anxiety Computer Anxiety (a). Suppose both scores are symmetrically distributed. State your hypothesis and computer p-value, determine wether the students have the same median of computer anxiety as of math anxiety. (b). Calculate the association measure between computer anxiety as math anxiety,.and test if there is any relationship between the two sets of scores. Solution: (a). Let D = Y X, both hypotheses: H : M D = vs. H : M D. Student A B C D E F G H X i Y i D i = Y i X i r ( D i ) Signed-rank test: T + = N r ( D i ) I {Di>} = = 7 i= p value = P { T + 7 } =.5 =.5 There is no significant difference between the medians of the two anxiety scores. (b). Use spearman s test statistic for association measure, first rank the two scores respectively Spearman s Rho test Student A B C D E F G H S i = rank(x i ) R i = rank(y i ) D i = S i R i R = 6 n i= D i n (n ) = =.94 Its p value = P (R.94) <.5.

12 Solution to Sampling Problem- Fall. Background On the whole there are N = 35 students and in a random sample with proportional allocation of 35 students, there are boys and 5 girls. Therefore, in the population, there are boys and the rest [5] are girls. Further, 6% of the students are enrolled in science stream. We assume that this 6% refers to each of the two groups : boys and girls. Therefore, in the population, we have a x table of frequency counts as follows : Boys Girls Science Stream 9 Arts Stream 8 6 TOTAL 5 We are given the sample frequency counts of the left-handers for each of the above x categories. Under proportional sampling, we have thus the following table, indicating the number of left-handers in parentheses. Boys Girls Popl. Size Sample Size Popl. Size Sample Size Science Stream (8) 9 9 () Arts Stream 8 8 (5) 6 6 (8) -- (a) (i) For boys in science stream estimated total number of left-handers = x 8 / = 8 (ii) For boys in arts stream estimated total number of left-handers = 8 x 5 / 8 = 5 (iii) For girls in science-stream estimated total number of left-handers = 9 x / 9 = (iv) For girls in arts stream estimated total number of left-handers = 6 x 8 / 6 = 8 (b) For science-stream as a whole estimated total number of left-handers = 8 + = estimated s.e. = sqrt. [ ^ x (8/)(/)/(9) + 9^ x (/9)(78/9)/(89)] =. (c) For arts stream as a whole estimated proportion of left-handers = [(5 + 8)/(8 + 6) = To compute 95% confidence interval, we need to compute estimated s.e. of the above estimate. This is given by s.e. = sqrt.[(8^)(5/8)(75/8)/(79) + (6)^(8/6)(5/6)/(59)] / 4 Finally, 95% confidence interval is computed as estimate +/-.96 times estimated s.e.

13

14

15

16

17 Stat 48 (Experimental Design) Problem: Consider an example of the factorial design. The effects of temperature (factor, two levels are low and high, or and +) and reaction time (factor, two levels are short and long, or and +) on the percent yield of a certain chemical reaction (response Y) are studied. The experiment was replicated (n=) and the order of the eight runs was randomized. The design table and observations are listed below: Run I x x x x Average Yield Individual Observations , , , , 68.7 (a) Estimate the overall mean, main effects, and the interaction. Specify the model you used here, as well as your model assumptions. (b) Test if your estimated effects are significantly from at the significance level.5. For your reference, some critical values of the standard normal distribution are Prob(X >.96) =.5, Prob(X >.645) =.5. Solution for Stat 48 (Experimental Design) Problem: (a) The estimates of overall mean: μˆ = ( )/4=6.; main effect : ˆμ = ( )/4=.4; main effect : ˆμ = ( )/4= 4.; interaction: ˆμ = ( )/4= -.4. The model we used here is Yij = μ + x iμ + xiμ + x i xiμ + ε ij, i =,,3, 4, j =,. Model assumption: ε ij are i.i.d. ~ N(, σ ). (b) The overall variance estimate 4 4 s = si = ( Yij Yi ) = i= 4( ) i= j= The estimate of the variance of an effect is s Var(effect)= = An estimated effect is significantly from at the significance level.5 if its absolute value goes beyond =. 4. Therefore, the overall mean and two main effects are significantly from at level.5, while the interaction is not significant.

18 Stat 48 (Linear Regression) Problem: Consider a regression model that relates gas mileage and weight of automobiles. Thirtyeight cars were selected, and their weights x (in units of, pounds) and fuel efficiencies MPG (miles per gallon, the response y ) were measured. i i i (a) Given the summary statistics: x = 8. 79, y = 94. 9, x = , i y = , and x 56 i y = 539., find the least-squares estimates of i the regression coefficients in the simple regression model y = β + βx + ε. (b) Given SSE = ( y i yˆ i ) = which is the sum of squares due to error, along with the summary statistics in (a), construct a 95% confidence interval for β. For your reference, some critical values of t distributions are t(.5; df=38)=.4, t(.5; df=36)=.8. (c) The figure below is the residual plot (residuals versus fitted values) of the simple linear regression model. Based on that, discuss whether this model is appropriate. What other model(s) would you suggest? Solution for Stat 48 (Linear Regression) Problem: (a) For the least-squares estimates: ˆ xi yi ( xi )( yi ) /38 β = = 8.365, xi ( xi ) /38 ˆβ = y /38 ˆβ xi /38= i

19 (b) An estimate of the standard deviation of ( ˆ MSE SSE /(38 ) s β ) = = =.663. ( x x) x ( x ) /38 i i i ˆ β β Since ~ t(38 ) s( ˆ β), a 95% confidence interval for βˆ is ˆ β ˆ ±.8 s ( β) = ±.8.663=[-9.7, -7.]. (c) The figure shows a quadratic pattern which indicates the simple regression model is not appropriate. One may want to try the model y = β + βx + β x + ε. Another possibility is to look for transformations on y that simplify the structure of the model, say ( y) = β + β x + ε. βˆ is g

Sections 2.11 and 5.8

Sections 2.11 and 5.8 Sections 211 and 58 Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I 1/25 Gesell data Let X be the age in in months a child speaks his/her first word and

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is. Some Continuous Probability Distributions CHAPTER 6: Continuous Uniform Distribution: 6. Definition: The density function of the continuous random variable X on the interval [A, B] is B A A x B f(x; A,

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

Penalized regression: Introduction

Penalized regression: Introduction Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood

More information

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) and Neural Networks( 類 神 經 網 路 ) 許 湘 伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 35 13 Examples

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. STT315 Practice Ch 5-7 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Solve the problem. 1) The length of time a traffic signal stays green (nicknamed

More information

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation

More information

Math 461 Fall 2006 Test 2 Solutions

Math 461 Fall 2006 Test 2 Solutions Math 461 Fall 2006 Test 2 Solutions Total points: 100. Do all questions. Explain all answers. No notes, books, or electronic devices. 1. [105+5 points] Assume X Exponential(λ). Justify the following two

More information

Master s Theory Exam Spring 2006

Master s Theory Exam Spring 2006 Spring 2006 This exam contains 7 questions. You should attempt them all. Each question is divided into parts to help lead you through the material. You should attempt to complete as much of each problem

More information

AP STATISTICS (Warm-Up Exercises)

AP STATISTICS (Warm-Up Exercises) AP STATISTICS (Warm-Up Exercises) 1. Describe the distribution of ages in a city: 2. Graph a box plot on your calculator for the following test scores: {90, 80, 96, 54, 80, 95, 100, 75, 87, 62, 65, 85,

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program. Statistics Module Part 3 2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

More information

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1. **BEGINNING OF EXAMINATION** 1. You are given: (i) The annual number of claims for an insured has probability function: 3 p x q q x x ( ) = ( 1 ) 3 x, x = 0,1,, 3 (ii) The prior density is π ( q) = q,

More information

Stat 704 Data Analysis I Probability Review

Stat 704 Data Analysis I Probability Review 1 / 30 Stat 704 Data Analysis I Probability Review Timothy Hanson Department of Statistics, University of South Carolina Course information 2 / 30 Logistics: Tuesday/Thursday 11:40am to 12:55pm in LeConte

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

More information

Premaster Statistics Tutorial 4 Full solutions

Premaster Statistics Tutorial 4 Full solutions Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for

More information

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015.

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015. Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment -3, Probability and Statistics, March 05. Due:-March 5, 05.. Show that the function 0 for x < x+ F (x) = 4 for x < for x

More information

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab Monte Carlo Simulation: IEOR E4703 Fall 2004 c 2004 by Martin Haugh Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab 1 Overview of Monte Carlo Simulation 1.1 Why use simulation?

More information

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010 Simulation Methods Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Simulation Methods 15.450, Fall 2010 1 / 35 Outline 1 Generating Random Numbers 2 Variance Reduction 3 Quasi-Monte

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators... MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................

More information

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Comparing Multiple Proportions, Test of Independence and Goodness of Fit Comparing Multiple Proportions, Test of Independence and Goodness of Fit Content Testing the Equality of Population Proportions for Three or More Populations Test of Independence Goodness of Fit Test 2

More information

Important Probability Distributions OPRE 6301

Important Probability Distributions OPRE 6301 Important Probability Distributions OPRE 6301 Important Distributions... Certain probability distributions occur with such regularity in real-life applications that they have been given their own names.

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly

More information

Statistics 151 Practice Midterm 1 Mike Kowalski

Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Multiple Choice (50 minutes) Instructions: 1. This is a closed book exam. 2. You may use the STAT 151 formula sheets and

More information

Two-sample hypothesis testing, II 9.07 3/16/2004

Two-sample hypothesis testing, II 9.07 3/16/2004 Two-sample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For two-sample tests of the difference in mean, things get a little confusing, here,

More information

One-Way Analysis of Variance (ANOVA) Example Problem

One-Way Analysis of Variance (ANOVA) Example Problem One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density

Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density HW MATH 461/561 Lecture Notes 15 1 Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density and marginal densities f(x, y), (x, y) Λ X,Y f X (x), x Λ X,

More information

Statistical Functions in Excel

Statistical Functions in Excel Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.

More information

Quadratic forms Cochran s theorem, degrees of freedom, and all that

Quadratic forms Cochran s theorem, degrees of freedom, and all that Quadratic forms Cochran s theorem, degrees of freedom, and all that Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 1, Slide 1 Why We Care Cochran s theorem tells us

More information

GLM I An Introduction to Generalized Linear Models

GLM I An Introduction to Generalized Linear Models GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

Basics of Statistical Machine Learning

Basics of Statistical Machine Learning CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Modern machine learning is rooted in statistics. You will find many familiar

More information

1 Another method of estimation: least squares

1 Another method of estimation: least squares 1 Another method of estimation: least squares erm: -estim.tex, Dec8, 009: 6 p.m. (draft - typos/writos likely exist) Corrections, comments, suggestions welcome. 1.1 Least squares in general Assume Y i

More information

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics Analysis of Data Claudia J. Stanny PSY 67 Research Design Organizing Data Files in SPSS All data for one subject entered on the same line Identification data Between-subjects manipulations: variable to

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice! Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!) Part A - Multiple Choice Indicate the best choice

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Lecture 8: Gamma regression

Lecture 8: Gamma regression Lecture 8: Gamma regression Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Models with constant coefficient of variation Gamma regression: estimation and testing

More information

Chapter 6: Point Estimation. Fall 2011. - Probability & Statistics

Chapter 6: Point Estimation. Fall 2011. - Probability & Statistics STAT355 Chapter 6: Point Estimation Fall 2011 Chapter Fall 2011 6: Point1 Estimat / 18 Chap 6 - Point Estimation 1 6.1 Some general Concepts of Point Estimation Point Estimate Unbiasedness Principle of

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

SAS Certificate Applied Statistics and SAS Programming

SAS Certificate Applied Statistics and SAS Programming SAS Certificate Applied Statistics and SAS Programming SAS Certificate Applied Statistics and Advanced SAS Programming Brigham Young University Department of Statistics offers an Applied Statistics and

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

More information

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different

More information

Module 2 Probability and Statistics

Module 2 Probability and Statistics Module 2 Probability and Statistics BASIC CONCEPTS Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The standard deviation of a standard normal distribution

More information

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl Dept of Information Science j.nerbonne@rug.nl October 1, 2010 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic

More information

STAT 830 Convergence in Distribution

STAT 830 Convergence in Distribution STAT 830 Convergence in Distribution Richard Lockhart Simon Fraser University STAT 830 Fall 2011 Richard Lockhart (Simon Fraser University) STAT 830 Convergence in Distribution STAT 830 Fall 2011 1 / 31

More information

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous

More information

August 2012 EXAMINATIONS Solution Part I

August 2012 EXAMINATIONS Solution Part I August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,

More information

NOV - 30211/II. 1. Let f(z) = sin z, z C. Then f(z) : 3. Let the sequence {a n } be given. (A) is bounded in the complex plane

NOV - 30211/II. 1. Let f(z) = sin z, z C. Then f(z) : 3. Let the sequence {a n } be given. (A) is bounded in the complex plane Mathematical Sciences Paper II Time Allowed : 75 Minutes] [Maximum Marks : 100 Note : This Paper contains Fifty (50) multiple choice questions. Each question carries Two () marks. Attempt All questions.

More information

Lecture Notes 1. Brief Review of Basic Probability

Lecture Notes 1. Brief Review of Basic Probability Probability Review Lecture Notes Brief Review of Basic Probability I assume you know basic probability. Chapters -3 are a review. I will assume you have read and understood Chapters -3. Here is a very

More information

Statistical Models in R

Statistical Models in R Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Structure of models in R Model Assessment (Part IA) Anova

More information

Tests of Hypotheses Using Statistics

Tests of Hypotheses Using Statistics Tests of Hypotheses Using Statistics Adam Massey and Steven J. Miller Mathematics Department Brown University Providence, RI 0292 Abstract We present the various methods of hypothesis testing that one

More information

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination February 14 th, 2014.

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination February 14 th, 2014. University of Ljubljana Doctoral Programme in Statistics ethodology of Statistical Research Written examination February 14 th, 2014 Name and surname: ID number: Instructions Read carefully the wording

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis One-Factor Experiments CS 147: Computer Systems Performance Analysis One-Factor Experiments 1 / 42 Overview Introduction Overview Overview Introduction Finding

More information

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

More information

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4 STATISTICS 8, FINAL EXAM NAME: KEY Seat Number: Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4 Make sure you have 8 pages. You will be provided with a table as well, as a separate

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression Data Mining and Data Warehousing Henryk Maciejewski Data Mining Predictive modelling: regression Algorithms for Predictive Modelling Contents Regression Classification Auxiliary topics: Estimation of prediction

More information

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model Assumptions Assumptions of linear models Apply to response variable within each group if predictor categorical Apply to error terms from linear model check by analysing residuals Normality Homogeneity

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

Chapter 4 and 5 solutions

Chapter 4 and 5 solutions Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,

More information

One-Way Analysis of Variance

One-Way Analysis of Variance One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We

More information

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g.,

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., CHAPTER 13 Nonparametric and Distribution-Free Statistics Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., 2 tests for goodness of fit and independence).

More information

Lecture 7: Continuous Random Variables

Lecture 7: Continuous Random Variables Lecture 7: Continuous Random Variables 21 September 2005 1 Our First Continuous Random Variable The back of the lecture hall is roughly 10 meters across. Suppose it were exactly 10 meters, and consider

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

SIMON FRASER UNIVERSITY

SIMON FRASER UNIVERSITY SIMON FRASER UNIVERSITY BUEC 333: Statistics for Business and Economics. MIDTERM EXAM: PART I Instructor: Alex Jameson Appiah February. 27, 1996. Time: 50 mins. Name: ------------------------------------------------------

More information

Section 5.1 Continuous Random Variables: Introduction

Section 5.1 Continuous Random Variables: Introduction Section 5. Continuous Random Variables: Introduction Not all random variables are discrete. For example:. Waiting times for anything (train, arrival of customer, production of mrna molecule from gene,

More information

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Non-Parametric Tests (I)

Non-Parametric Tests (I) Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent

More information

8 6 X 2 Test for a Variance or Standard Deviation

8 6 X 2 Test for a Variance or Standard Deviation Section 8 6 x 2 Test for a Variance or Standard Deviation 437 This test uses the P-value method. Therefore, it is not necessary to enter a significance level. 1. Select MegaStat>Hypothesis Tests>Proportion

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

University of Chicago Graduate School of Business. Business 41000: Business Statistics

University of Chicago Graduate School of Business. Business 41000: Business Statistics Name: University of Chicago Graduate School of Business Business 41000: Business Statistics Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper for the formulas. 2. Throughout

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

5. Linear Regression

5. Linear Regression 5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4

More information

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

CHI-SQUARE: TESTING FOR GOODNESS OF FIT CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

Factors affecting online sales

Factors affecting online sales Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4

More information

Notes on Applied Linear Regression

Notes on Applied Linear Regression Notes on Applied Linear Regression Jamie DeCoster Department of Social Psychology Free University Amsterdam Van der Boechorststraat 1 1081 BT Amsterdam The Netherlands phone: +31 (0)20 444-8935 email:

More information

MATHEMATICAL METHODS OF STATISTICS

MATHEMATICAL METHODS OF STATISTICS MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS

More information

Simple Linear Regression

Simple Linear Regression STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals (9.3) Conditions for inference (9.1) Want More Stats??? If you have enjoyed learning how to analyze

More information

Math 201: Statistics November 30, 2006

Math 201: Statistics November 30, 2006 Math 201: Statistics November 30, 2006 Fall 2006 MidTerm #2 Closed book & notes; only an A4-size formula sheet and a calculator allowed; 90 mins. No questions accepted! Instructions: There are eleven pages

More information

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:

More information

Principle of Data Reduction

Principle of Data Reduction Chapter 6 Principle of Data Reduction 6.1 Introduction An experimenter uses the information in a sample X 1,..., X n to make inferences about an unknown parameter θ. If the sample size n is large, then

More information