The written Master s Examination
|
|
- Barnard Lynch
- 7 years ago
- Views:
Transcription
1 The written Master s Examination Option Statistics and Probability Fall Full points may be obtained for correct answers to 8 questions. Each numbered question (which may have several parts) is worth the same number of points. All answers will be graded, but the score for the examination will be the sum of the scores of your best 8 solutions. Use separate answer sheets for each question. DO NOT PUT YOUR NAME ON YOUR ANSWER SHEETS. When you have finished, insert all your answer sheets into the envelope provided, then seal and print your name on it. Any student whose answers need clarification may be required to submit to an oral examination.
2 MS Exam, Option Probability and Statistics, FALL. (STAT 4) Let ~ N( μ, σ ) X and g be a differentiable function, and E g'( X ) <. (i) Show that cov( X, g( X)) = E( g( X)( X μ)) = σ E( g'( X)). 3 (ii) Calculate EX ( ).. (STAT 4) Let X be one observation from a population with pdf x θ e f( x θ) =, < x<, < θ <. x θ ( + e ) (i) Construct a most powerful size α test to test H : θ = versus : H θ =. (ii) Construct a UMP size α test to test H : θ versus : H θ >. a a 3. (STAT 4) Suppose X,...,X n is a random sample from the exponential distribution with parameter θ>: θx θ e, x > f ( x; θ ) =,, otherwise (i) Show that Y = n X i i= follows a Gamma distribution. (ii) Show that Y is complete sufficient for θ. (iii) Derive the minimum variance unbiased estimator for θ. Justify your answer. 4. (STAT 46) A group of researchers want to determine whether there is a direct relationship between math and computer anxiety. The test scored are shown in the table for 8 students, with larger scores for indicating greater amount of the trait. Student A B C D E F G H Math Anxiety Computer Anxiety
3 MS Exam, Option Probability and Statistics, FALL (STAT 46 cont.) (a). Suppose both scores are symmetrically distributed. State your hypothesis and computer p-value, determine whether the students have the same median of computer anxiety as of math anxiety. (b). Calculate the association measure between computer anxiety as math anxiety, and test if there is any relationship between the two sets of scores. 5. (STAT 43) The purpose is to unbiasedly estimate the proportion of left-handers among school-going students in a large co-ed community school. It is known that on the whole there are 35 students enrolled in the school and 6% of these students are enrolled in the science stream. Assume stratified simple random sampling with proportional allocation for both boys-girls group classification as also for science-arts stream classification. Based on a stratified simple random sample of 35 students on the whole, the following table has been prepared : Boys Girls 5 sample left-handers in science stream 8 in arts stream 5 8 (a) For both the groups, estimate the total number of left-handers in each stream : science and arts. (b) For the science stream as a whole, find an estimate of the total number of left-handers and compute its estimated se. (c) For the arts stream as a whole, find an estimate of the proportion of left-handers and its 95% confidence interval. 6. (STAT 46) Customers arrive at a service facility according to a Poisson process with rate λ (customers/hour). Let X () t be the number of customers that have arrived up to time t. Let W, W,... be the successive arrival times of the customers. Determine: (a). E ( W X () t = ), (b). E W + () = W X t, (c). E ( W3 X () t = ). 3
4 MS Exam, Option Probability and Statistics, FALL 7. (STAT 47) Consider max x + 3x 7x3 such that x x = 3x + x x x3 9 x, x, x 3 unrestricted. (i). Write down the dual to the above linear programming problem. (ii). Write down the dual to the dual program you have obtained. 8. (STAT 47) In the last minute some doctors in New York are hectically trying to attend a conference in Los Angeles and are willing to go with connecting flights. A travel agent finds the following information: From To Number of seats available New York Chicago 5 New York Houston 7 Houston Atlanta 8 Chicago Atlanta 6 Chicago Denver Denver Los Angeles 5 Atlanta Los Angeles 4 Use Ford -Fulkerson algorithm to find the maximum number of doctors who could go to Los Angeles via these connecting flights. 4
5 MS Exam, Option Probability and Statistics, FALL 9. (STAT 473) Seven year old kids Ann, Beth, Cindy, Debbie, and Emma went on a field trip to a factory making kitchen utensils. At the end of the field trip the kids were allowed to pick either a cup or a saucer as memento. Ann opted for a saucer and the rest four opted for a cup. While coming out of the factory they noticed a guy paying $5 for a cup and a saucer. There is an ice cream shop charging $ per cone ice cream. All children are keen on getting rid of their mementos to buy ice cream recovering part of the expenses from selling their mementos. Ann is approached by all other kids with their cups to be sold. What will be considered fair value by Shapley for the saucer that Ann owns?. (STAT 48) Consider an example of the factorial design. The effects of temperature (factor, two levels are low and high, or and +) and reaction time (factor, two levels are short and long, or and +) on the percent yield of a certain chemical reaction (response Y) are studied. The experiment was replicated (n=) and the order of the eight runs was randomized. The design table and observations are listed below: Run I x x x x Average Yield Individual Observations , , , , 68.7 (a) Estimate the overall mean, main effects, and the interaction. Specify the model you used here, as well as your model assumptions. (b) Test if your estimated effects are significantly from at the significance level.5. For your reference, some critical values of the standard normal distribution are Prob(X >.96) =.5, Prob(X >.645) =.5. 5
6 MS Exam, Option Probability and Statistics, FALL. (STAT 48) Consider a regression model that relates gas mileage and weight of automobiles. Thirtyeight cars were selected, and their weights x (in units of, pounds) and fuel efficiencies MPG (miles per gallon, the response y ) were measured. (a) Given the summary statistics: x = 8. 79, y = 94. 9, x = , i i y i i i i y = , and x = , find the least-squares estimates of the regression coefficients in the simple regression model y = β + βx + ε. (b) Given SSE = ( y i yˆ i ) = which is the sum of squares due to error, along with the summary statistics in (a), construct a 95% confidence interval for β. For your reference, some critical values of t distributions are t(.5; df=38)=.4, t(.5; df=36)=.8. (c) The figure below is the residual plot (residuals versus fitted values) of the simple linear regression model. Based on that, discuss whether this model is appropriate. What other model(s) would you suggest? 6
7 7
8 Statistics 4&4 MS Exam Fall Semester. (STAT4) Let X E g'( X ) < N( μ, σ ) and g be a differentiable function, and. (i) Show that cov( X, g( X)) = E( g( X)( X μ)) = σ E( g'( X)). 3 (ii) Calculate EX ( ). Solution: (i) The first equality follows from the definition of cov( X, g( X )) immediately. To show the second equality, we have ( EgX ( ( )( X μ)) = gx ( )( x )exp dx πσ x μ) μ σ. Using integration by parts with u = g( x) and ( x μ) dv = ( x μ)exp dx σ yields that ( x μ) ( x μ) EgX ( ( )( X μ)) = σ gx ( )exp σ g'( x)exp dx + πσ σ σ ( x μ) = σ g'( x) exp πσ σ = σ Eg ( '( X)). dx 3 (ii) Note that. Let, then g'( X ) = EX ( ) = EX ( ( X μ+ μ)) = EX ( ( X μ)) + μex ( ) X and g( X) = X EX EX X EX 3 ( ) = ( ( μ)) + μ ( ) = EgX X + EX + X ( ( )( μ)) μ(( ( )) var( )) = σ E X + μ μ + σ = μ + μσ 3 ( ) ( ) 3. Remark: the equality in (i) is known as Stein s Lemma.
9 . (STAT4) Let X be one observation from a population with pdf x θ e f( x θ) =, < x<, < θ <. x θ ( + e ) (i) Construct a most powerful size α test to test H : θ = versus H : θ =. (ii) Construct a UMP size α test to test H : θ versus H : a θ >. Solution: a (i) According to N-P Lemma, the most powerful test is to reject H if x x x f( x θ = ) ( + e ) e + e = = e x x x f( x θ = ) e ( + e ) + e k. Note that e + e + e x x is an increasing function in x (by showing its derivative in x is positive), so the most powerful test is to reject H if x k '. To determine k ', implies that k ' = log(( α) / α). (ii) First, for any θ x e α = PX ( k' θ = ) = dx= x ( + e ) + e k ' θ, the likelihood ratio is > k ' x θ θ e θ = e x θ f( x θ = θ) + f( x θ = θ) + e. The derivative of the likelihood ratio is x θ x θ x θ x e θ e > x θ x θ x θ d f( x θ = θ) θ θ d + e θ θ + e e = e =. dx f ( x θ = θ) dx + e + e ( + e ) f( x θ = θ) Therefore, is an increasing function in x (MLR), and the UMP test is to f( x θ = θ ) reject H if x k. This is the same test as in (i), so k = log(( α) / α).
10 Suppose X ; :::; X n is a random sample from the exponential distribution with parameter > : e f(x; ) = x if x > ; otherwise. (i) Show that Y = nx X i follows a Gamma distribution. i= (ii) Show that Y is complete su cient for. (iii) Derive the minimum variance unbiased estimator for : Justify your answer. Solution: (i) The mgf of X is: So, the mgf of Y is Z E e tx = e tx e x dx = E e ty = n t : t This is the mgf of a Gamma distribution with = n; = =: (ii) Since Gamma belongs to the exponential family of distributions, Y is complete su cient for. (iii) E Y = = (n) n : n Z y yn e y dy Hence n Y is unbiased for : It follows from Rao-Blackwell Theorem that n Y is the minimum variance unbiased estimator for :
11 STAT 46 Problem in Fall A group of researchers want to determine wether there is a direct relationship between math and computer anxiety. The test scored are shown in the table for 8 students, with larger scores for indicating greater amount of of the trait. Student A B C D E F G H Math Anxiety Computer Anxiety (a). Suppose both scores are symmetrically distributed. State your hypothesis and computer p-value, determine wether the students have the same median of computer anxiety as of math anxiety. (b). Calculate the association measure between computer anxiety as math anxiety,.and test if there is any relationship between the two sets of scores. Solution: (a). Let D = Y X, both hypotheses: H : M D = vs. H : M D. Student A B C D E F G H X i Y i D i = Y i X i r ( D i ) Signed-rank test: T + = N r ( D i ) I {Di>} = = 7 i= p value = P { T + 7 } =.5 =.5 There is no significant difference between the medians of the two anxiety scores. (b). Use spearman s test statistic for association measure, first rank the two scores respectively Spearman s Rho test Student A B C D E F G H S i = rank(x i ) R i = rank(y i ) D i = S i R i R = 6 n i= D i n (n ) = =.94 Its p value = P (R.94) <.5.
12 Solution to Sampling Problem- Fall. Background On the whole there are N = 35 students and in a random sample with proportional allocation of 35 students, there are boys and 5 girls. Therefore, in the population, there are boys and the rest [5] are girls. Further, 6% of the students are enrolled in science stream. We assume that this 6% refers to each of the two groups : boys and girls. Therefore, in the population, we have a x table of frequency counts as follows : Boys Girls Science Stream 9 Arts Stream 8 6 TOTAL 5 We are given the sample frequency counts of the left-handers for each of the above x categories. Under proportional sampling, we have thus the following table, indicating the number of left-handers in parentheses. Boys Girls Popl. Size Sample Size Popl. Size Sample Size Science Stream (8) 9 9 () Arts Stream 8 8 (5) 6 6 (8) -- (a) (i) For boys in science stream estimated total number of left-handers = x 8 / = 8 (ii) For boys in arts stream estimated total number of left-handers = 8 x 5 / 8 = 5 (iii) For girls in science-stream estimated total number of left-handers = 9 x / 9 = (iv) For girls in arts stream estimated total number of left-handers = 6 x 8 / 6 = 8 (b) For science-stream as a whole estimated total number of left-handers = 8 + = estimated s.e. = sqrt. [ ^ x (8/)(/)/(9) + 9^ x (/9)(78/9)/(89)] =. (c) For arts stream as a whole estimated proportion of left-handers = [(5 + 8)/(8 + 6) = To compute 95% confidence interval, we need to compute estimated s.e. of the above estimate. This is given by s.e. = sqrt.[(8^)(5/8)(75/8)/(79) + (6)^(8/6)(5/6)/(59)] / 4 Finally, 95% confidence interval is computed as estimate +/-.96 times estimated s.e.
13
14
15
16
17 Stat 48 (Experimental Design) Problem: Consider an example of the factorial design. The effects of temperature (factor, two levels are low and high, or and +) and reaction time (factor, two levels are short and long, or and +) on the percent yield of a certain chemical reaction (response Y) are studied. The experiment was replicated (n=) and the order of the eight runs was randomized. The design table and observations are listed below: Run I x x x x Average Yield Individual Observations , , , , 68.7 (a) Estimate the overall mean, main effects, and the interaction. Specify the model you used here, as well as your model assumptions. (b) Test if your estimated effects are significantly from at the significance level.5. For your reference, some critical values of the standard normal distribution are Prob(X >.96) =.5, Prob(X >.645) =.5. Solution for Stat 48 (Experimental Design) Problem: (a) The estimates of overall mean: μˆ = ( )/4=6.; main effect : ˆμ = ( )/4=.4; main effect : ˆμ = ( )/4= 4.; interaction: ˆμ = ( )/4= -.4. The model we used here is Yij = μ + x iμ + xiμ + x i xiμ + ε ij, i =,,3, 4, j =,. Model assumption: ε ij are i.i.d. ~ N(, σ ). (b) The overall variance estimate 4 4 s = si = ( Yij Yi ) = i= 4( ) i= j= The estimate of the variance of an effect is s Var(effect)= = An estimated effect is significantly from at the significance level.5 if its absolute value goes beyond =. 4. Therefore, the overall mean and two main effects are significantly from at level.5, while the interaction is not significant.
18 Stat 48 (Linear Regression) Problem: Consider a regression model that relates gas mileage and weight of automobiles. Thirtyeight cars were selected, and their weights x (in units of, pounds) and fuel efficiencies MPG (miles per gallon, the response y ) were measured. i i i (a) Given the summary statistics: x = 8. 79, y = 94. 9, x = , i y = , and x 56 i y = 539., find the least-squares estimates of i the regression coefficients in the simple regression model y = β + βx + ε. (b) Given SSE = ( y i yˆ i ) = which is the sum of squares due to error, along with the summary statistics in (a), construct a 95% confidence interval for β. For your reference, some critical values of t distributions are t(.5; df=38)=.4, t(.5; df=36)=.8. (c) The figure below is the residual plot (residuals versus fitted values) of the simple linear regression model. Based on that, discuss whether this model is appropriate. What other model(s) would you suggest? Solution for Stat 48 (Linear Regression) Problem: (a) For the least-squares estimates: ˆ xi yi ( xi )( yi ) /38 β = = 8.365, xi ( xi ) /38 ˆβ = y /38 ˆβ xi /38= i
19 (b) An estimate of the standard deviation of ( ˆ MSE SSE /(38 ) s β ) = = =.663. ( x x) x ( x ) /38 i i i ˆ β β Since ~ t(38 ) s( ˆ β), a 95% confidence interval for βˆ is ˆ β ˆ ±.8 s ( β) = ±.8.663=[-9.7, -7.]. (c) The figure shows a quadratic pattern which indicates the simple regression model is not appropriate. One may want to try the model y = β + βx + β x + ε. Another possibility is to look for transformations on y that simplify the structure of the model, say ( y) = β + β x + ε. βˆ is g
Sections 2.11 and 5.8
Sections 211 and 58 Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I 1/25 Gesell data Let X be the age in in months a child speaks his/her first word and
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationCHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.
Some Continuous Probability Distributions CHAPTER 6: Continuous Uniform Distribution: 6. Definition: The density function of the continuous random variable X on the interval [A, B] is B A A x B f(x; A,
More informationSTAT 350 Practice Final Exam Solution (Spring 2015)
PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects
More informationPenalized regression: Introduction
Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood
More informationChapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )
Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) and Neural Networks( 類 神 經 網 路 ) 許 湘 伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 35 13 Examples
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
STT315 Practice Ch 5-7 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Solve the problem. 1) The length of time a traffic signal stays green (nicknamed
More informationOutline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation
More informationMath 461 Fall 2006 Test 2 Solutions
Math 461 Fall 2006 Test 2 Solutions Total points: 100. Do all questions. Explain all answers. No notes, books, or electronic devices. 1. [105+5 points] Assume X Exponential(λ). Justify the following two
More informationMaster s Theory Exam Spring 2006
Spring 2006 This exam contains 7 questions. You should attempt them all. Each question is divided into parts to help lead you through the material. You should attempt to complete as much of each problem
More informationAP STATISTICS (Warm-Up Exercises)
AP STATISTICS (Warm-Up Exercises) 1. Describe the distribution of ages in a city: 2. Graph a box plot on your calculator for the following test scores: {90, 80, 96, 54, 80, 95, 100, 75, 87, 62, 65, 85,
More informationRegression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
More informationChapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
More information2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
More information**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.
**BEGINNING OF EXAMINATION** 1. You are given: (i) The annual number of claims for an insured has probability function: 3 p x q q x x ( ) = ( 1 ) 3 x, x = 0,1,, 3 (ii) The prior density is π ( q) = q,
More informationStat 704 Data Analysis I Probability Review
1 / 30 Stat 704 Data Analysis I Probability Review Timothy Hanson Department of Statistics, University of South Carolina Course information 2 / 30 Logistics: Tuesday/Thursday 11:40am to 12:55pm in LeConte
More informationProbability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur
Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce
More informationPremaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
More informationDepartment of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015.
Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment -3, Probability and Statistics, March 05. Due:-March 5, 05.. Show that the function 0 for x < x+ F (x) = 4 for x < for x
More informationOverview of Monte Carlo Simulation, Probability Review and Introduction to Matlab
Monte Carlo Simulation: IEOR E4703 Fall 2004 c 2004 by Martin Haugh Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab 1 Overview of Monte Carlo Simulation 1.1 Why use simulation?
More informationGenerating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010
Simulation Methods Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Simulation Methods 15.450, Fall 2010 1 / 35 Outline 1 Generating Random Numbers 2 Variance Reduction 3 Quasi-Monte
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationMATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...
MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................
More informationComparing Multiple Proportions, Test of Independence and Goodness of Fit
Comparing Multiple Proportions, Test of Independence and Goodness of Fit Content Testing the Equality of Population Proportions for Three or More Populations Test of Independence Goodness of Fit Test 2
More informationImportant Probability Distributions OPRE 6301
Important Probability Distributions OPRE 6301 Important Distributions... Certain probability distributions occur with such regularity in real-life applications that they have been given their own names.
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly
More informationStatistics 151 Practice Midterm 1 Mike Kowalski
Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Multiple Choice (50 minutes) Instructions: 1. This is a closed book exam. 2. You may use the STAT 151 formula sheets and
More informationTwo-sample hypothesis testing, II 9.07 3/16/2004
Two-sample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For two-sample tests of the difference in mean, things get a little confusing, here,
More informationOne-Way Analysis of Variance (ANOVA) Example Problem
One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationDefinition: Suppose that two random variables, either continuous or discrete, X and Y have joint density
HW MATH 461/561 Lecture Notes 15 1 Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density and marginal densities f(x, y), (x, y) Λ X,Y f X (x), x Λ X,
More informationStatistical Functions in Excel
Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.
More informationQuadratic forms Cochran s theorem, degrees of freedom, and all that
Quadratic forms Cochran s theorem, degrees of freedom, and all that Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 1, Slide 1 Why We Care Cochran s theorem tells us
More informationGLM I An Introduction to Generalized Linear Models
GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial
More informationChapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
More informationBasics of Statistical Machine Learning
CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Modern machine learning is rooted in statistics. You will find many familiar
More information1 Another method of estimation: least squares
1 Another method of estimation: least squares erm: -estim.tex, Dec8, 009: 6 p.m. (draft - typos/writos likely exist) Corrections, comments, suggestions welcome. 1.1 Least squares in general Assume Y i
More informationAnalysis of Data. Organizing Data Files in SPSS. Descriptive Statistics
Analysis of Data Claudia J. Stanny PSY 67 Research Design Organizing Data Files in SPSS All data for one subject entered on the same line Identification data Between-subjects manipulations: variable to
More informationUnivariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
More informationStatistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!
Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!) Part A - Multiple Choice Indicate the best choice
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More informationLecture 8: Gamma regression
Lecture 8: Gamma regression Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Models with constant coefficient of variation Gamma regression: estimation and testing
More informationChapter 6: Point Estimation. Fall 2011. - Probability & Statistics
STAT355 Chapter 6: Point Estimation Fall 2011 Chapter Fall 2011 6: Point1 Estimat / 18 Chap 6 - Point Estimation 1 6.1 Some general Concepts of Point Estimation Point Estimate Unbiasedness Principle of
More informationStatistics Review PSY379
Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses
More informationSAS Certificate Applied Statistics and SAS Programming
SAS Certificate Applied Statistics and SAS Programming SAS Certificate Applied Statistics and Advanced SAS Programming Brigham Young University Department of Statistics offers an Applied Statistics and
More informationMultiple Linear Regression
Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is
More informationCurriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different
More informationModule 2 Probability and Statistics
Module 2 Probability and Statistics BASIC CONCEPTS Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The standard deviation of a standard normal distribution
More informationStatistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl
Dept of Information Science j.nerbonne@rug.nl October 1, 2010 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic
More informationSTAT 830 Convergence in Distribution
STAT 830 Convergence in Distribution Richard Lockhart Simon Fraser University STAT 830 Fall 2011 Richard Lockhart (Simon Fraser University) STAT 830 Convergence in Distribution STAT 830 Fall 2011 1 / 31
More informationCA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction
CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous
More informationAugust 2012 EXAMINATIONS Solution Part I
August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,
More informationNOV - 30211/II. 1. Let f(z) = sin z, z C. Then f(z) : 3. Let the sequence {a n } be given. (A) is bounded in the complex plane
Mathematical Sciences Paper II Time Allowed : 75 Minutes] [Maximum Marks : 100 Note : This Paper contains Fifty (50) multiple choice questions. Each question carries Two () marks. Attempt All questions.
More informationLecture Notes 1. Brief Review of Basic Probability
Probability Review Lecture Notes Brief Review of Basic Probability I assume you know basic probability. Chapters -3 are a review. I will assume you have read and understood Chapters -3. Here is a very
More informationStatistical Models in R
Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Structure of models in R Model Assessment (Part IA) Anova
More informationTests of Hypotheses Using Statistics
Tests of Hypotheses Using Statistics Adam Massey and Steven J. Miller Mathematics Department Brown University Providence, RI 0292 Abstract We present the various methods of hypothesis testing that one
More informationUniversity of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination February 14 th, 2014.
University of Ljubljana Doctoral Programme in Statistics ethodology of Statistical Research Written examination February 14 th, 2014 Name and surname: ID number: Instructions Read carefully the wording
More informationCS 147: Computer Systems Performance Analysis
CS 147: Computer Systems Performance Analysis One-Factor Experiments CS 147: Computer Systems Performance Analysis One-Factor Experiments 1 / 42 Overview Introduction Overview Overview Introduction Finding
More informationChicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this
More informationSTATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4
STATISTICS 8, FINAL EXAM NAME: KEY Seat Number: Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4 Make sure you have 8 pages. You will be provided with a table as well, as a separate
More information11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
More informationGeneralized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationData Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression
Data Mining and Data Warehousing Henryk Maciejewski Data Mining Predictive modelling: regression Algorithms for Predictive Modelling Contents Regression Classification Auxiliary topics: Estimation of prediction
More informationAssumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model
Assumptions Assumptions of linear models Apply to response variable within each group if predictor categorical Apply to error terms from linear model check by analysing residuals Normality Homogeneity
More informationSection 13, Part 1 ANOVA. Analysis Of Variance
Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability
More informationChapter 4 and 5 solutions
Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,
More informationOne-Way Analysis of Variance
One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We
More informationNonparametric tests these test hypotheses that are not statements about population parameters (e.g.,
CHAPTER 13 Nonparametric and Distribution-Free Statistics Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., 2 tests for goodness of fit and independence).
More informationLecture 7: Continuous Random Variables
Lecture 7: Continuous Random Variables 21 September 2005 1 Our First Continuous Random Variable The back of the lecture hall is roughly 10 meters across. Suppose it were exactly 10 meters, and consider
More informationLeast Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
More informationSIMON FRASER UNIVERSITY
SIMON FRASER UNIVERSITY BUEC 333: Statistics for Business and Economics. MIDTERM EXAM: PART I Instructor: Alex Jameson Appiah February. 27, 1996. Time: 50 mins. Name: ------------------------------------------------------
More informationSection 5.1 Continuous Random Variables: Introduction
Section 5. Continuous Random Variables: Introduction Not all random variables are discrete. For example:. Waiting times for anything (train, arrival of customer, production of mrna molecule from gene,
More informationBNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More information" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
More informationNon-Parametric Tests (I)
Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent
More information8 6 X 2 Test for a Variance or Standard Deviation
Section 8 6 x 2 Test for a Variance or Standard Deviation 437 This test uses the P-value method. Therefore, it is not necessary to enter a significance level. 1. Select MegaStat>Hypothesis Tests>Proportion
More informationElementary Statistics
Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,
More informationIntroduction to Analysis of Variance (ANOVA) Limitations of the t-test
Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only
More informationUniversity of Chicago Graduate School of Business. Business 41000: Business Statistics
Name: University of Chicago Graduate School of Business Business 41000: Business Statistics Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper for the formulas. 2. Throughout
More informationFinal Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
More informationBusiness Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
More information5. Linear Regression
5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4
More informationCHI-SQUARE: TESTING FOR GOODNESS OF FIT
CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationSummary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
More informationStatistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
More informationFactors affecting online sales
Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4
More informationNotes on Applied Linear Regression
Notes on Applied Linear Regression Jamie DeCoster Department of Social Psychology Free University Amsterdam Van der Boechorststraat 1 1081 BT Amsterdam The Netherlands phone: +31 (0)20 444-8935 email:
More informationMATHEMATICAL METHODS OF STATISTICS
MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS
More informationSimple Linear Regression
STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals (9.3) Conditions for inference (9.1) Want More Stats??? If you have enjoyed learning how to analyze
More informationMath 201: Statistics November 30, 2006
Math 201: Statistics November 30, 2006 Fall 2006 MidTerm #2 Closed book & notes; only an A4-size formula sheet and a calculator allowed; 90 mins. No questions accepted! Instructions: There are eleven pages
More informationTesting Group Differences using T-tests, ANOVA, and Nonparametric Measures
Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:
More informationPrinciple of Data Reduction
Chapter 6 Principle of Data Reduction 6.1 Introduction An experimenter uses the information in a sample X 1,..., X n to make inferences about an unknown parameter θ. If the sample size n is large, then
More information