MAT3378 (Winter 2016)
- Audrey Mills
- 7 years ago
Assignment 3 - SOLUTIONS

The following questions will be marked: 2a), b), c), 3, 5.
Total number of points for Assignment 3: 13

Q1. (Normality)
(1) Simulate data from a normal distribution. Plot a boxplot, histogram and qq-plot. Apply the Jarque-Bera test by hand as in R-3.html. Confirm your calculations by using the function jarqueberaTest. Comment on your findings.
(2) Simulate data from a non-normal distribution (pick your favourite distribution). Plot a boxplot, histogram and qq-plot. Apply the Jarque-Bera test by hand as in R-3.html. Confirm your calculations by using the function jarqueberaTest. Comment on your findings.

Solution to Q1:
(a)
    Z1=rnorm(1000);
    library(moments); # you need this package to calculate kurtosis and skewness
    sk=skewness(Z1); kurt=kurtosis(Z1); n=length(Z1);
    test.stat=sk^2*n/6+(kurt-3)^2*n/24;
    p.value=1-pchisq(test.stat,2); # chi-square distribution with 2 degrees of freedom
    test.stat; p.value;
We should accept normality here (recall that the data come from a normal population). The p-value is [value missing]. The test does not reject normality (of course, you will have a different p-value!).
Confirmation:
    library(fBasics);
    jarqueberaTest(Z1);
Output: X-squared: [value missing], Asymptotic p Value: [value missing]
(b)
    Z1=rexp(1000);
    library(fBasics);
    jarqueberaTest(Z1);
Output: X-squared: [value missing], Asymptotic p Value: < 2.2e-16
Normality is rejected, as it should be.

Marking scheme for Q1:
This question will not be marked.

Q2. (Equality of the variance. 3 points)
(a) Simulate realizations of the H distribution with (4, 20) degrees of freedom. Store them in a vector called H. Obtain the 95% quantile. Compare with the appropriate entry in Table B.10.
(b) Simulate 4 normal vectors of size 21 with mean zero and the same variances. Apply the Hartley test. Calculate the p-value using the vector H obtained in (a).
(c) Simulate 4 normal vectors of size 21 with mean zero and different variances. Apply the Hartley test. Calculate the p-value using the vector H obtained in (a).
(d) This part is not compulsory. In this part we study the performance of the Hartley test when the sample sizes are not equal - I had this question in class.
(d-1) Simulate 4 normal vectors of size 21 with mean zero and the same variances. Calculate the value of the test statistic. Repeat it 1000 times and store the values of the test statistic in the vector HartleyTest. Use
    length(HartleyTest[HartleyTest>3.29])/1000
to get the rejection rate. Since 3.29 is the 95% quantile, the rejection rate should be around 0.05.
Simulate 4 normal vectors of sizes 21, 25, 29, 33 with mean zero and the same variances. Proceed as in (d-1).
Simulate 4 normal vectors of sizes 21, 17, 13, 9 with mean zero and the same variances. Proceed as in (d-1).
Simulate 4 normal vectors of sizes 21, 21, 17, 25 with mean zero and the same variances. Proceed as in (d-1).

Solution to Q2:
(a)
    H=NULL; no.of.rep=100000;
    n1=21;n2=21;n3=21;n4=21; # you need 4 samples of size 21
    sigma1=1;sigma2=1;sigma3=1;sigma4=1;
    for(i in 1:no.of.rep) {
      pop1=rnorm(n1,0,sigma1); pop2=rnorm(n2,0,sigma2);
      pop3=rnorm(n3,0,sigma3); pop4=rnorm(n4,0,sigma4);
      var1=var(pop1);var2=var(pop2);var3=var(pop3);var4=var(pop4);
      test.stat=max(var1,var2,var3,var4)/min(var1,var2,var3,var4);
      H=c(H,test.stat)
    }
As a result we obtain a vector H of size 100000 with realizations of the appropriate H distribution.
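The loop above grows H one element at a time, which is slow for 100000 replications. The following is a minimal sketch of the same simulation using replicate; the names hartley_stat and H_sim, the seed, and the smaller number of replications are assumptions made for illustration and are not part of the original solution.

```r
# Sketch: simulate Hartley's H statistic for 4 normal samples.
# hartley_stat and H_sim are illustrative names, not from the assignment.
set.seed(3378)  # arbitrary seed, only for reproducibility

hartley_stat <- function(sizes, sigmas) {
  # one simulated normal sample per group, reduced to its sample variance
  vars <- mapply(function(n, s) var(rnorm(n, mean = 0, sd = s)), sizes, sigmas)
  max(vars) / min(vars)  # largest variance over smallest variance
}

no.of.rep <- 20000  # fewer replications than in the solution, to keep it fast
H_sim <- replicate(no.of.rep, hartley_stat(rep(21, 4), rep(1, 4)))

# share of simulated statistics above the critical value used in (d-1)
rejection.rate <- mean(H_sim > 3.29)
print(rejection.rate)
```

Because the group sizes are an argument, the unequal-size experiments of part (d) can be run the same way, for example with hartley_stat(c(21, 25, 29, 33), rep(1, 4)) inside replicate.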
We have
    quantile(H,0.95)
for the simulated 95% quantile (the printed value was not preserved). From Table B.10 the true value of the quantile is 3.29, the critical value used in (d-1). Note: you will not get the same quantile, but you should be close to it.
(b)
    n1=21;n2=21;n3=21;n4=21;
    sigma1=1;sigma2=1;sigma3=1;sigma4=1;
    pop1=rnorm(n1,0,sigma1); pop2=rnorm(n2,0,sigma2);
    pop3=rnorm(n3,0,sigma3); pop4=rnorm(n4,0,sigma4);
    var1=var(pop1);var2=var(pop2);var3=var(pop3);var4=var(pop4);
    test.stat=max(var1,var2,var3,var4)/min(var1,var2,var3,var4);
The test statistic and the p-value are:
    test.stat;
    [1] [value missing]
    length(H[H>test.stat])/no.of.rep
    [1] [value missing]
The test does not reject equality of the variances. Note: you will get a different test statistic and p-value.
(c)
    n1=21;n2=21;n3=21;n4=21;
    sigma1=1;sigma2=2;sigma3=3;sigma4=3;
    pop1=rnorm(n1,0,sigma1); pop2=rnorm(n2,0,sigma2);
    pop3=rnorm(n3,0,sigma3); pop4=rnorm(n4,0,sigma4);
    var1=var(pop1);var2=var(pop2);var3=var(pop3);var4=var(pop4);
    test.stat=max(var1,var2,var3,var4)/min(var1,var2,var3,var4);
The test statistic and the p-value are:
    test.stat;
    [1] [value missing]
    length(H[H>test.stat])/no.of.rep
    [1] 1e-05
The test rejects equality of the variances. Note: you will get a different test statistic and p-value.

Marking scheme for Q2: 1 point for each of the parts a), b), c). Total - 3 points.

Q3. (3 points) Simulate data from a t-distribution with 4 degrees of freedom. Test for normality using the Jarque-Bera test (you can use the R command directly). If the test rejects normality, apply a transformation to get normality. Confirm normality by applying the Jarque-Bera test again.

Solution to Q3:
    Z1=rt(1000,4);
    library(fBasics);
    jarqueberaTest(Z1);
Output: X-squared: [value missing], Asymptotic p Value: < 2.2e-16
Normality is rejected. We apply the sqrt(|Y|) transform.
    Z1.1=sqrt(abs(Z1))
    par(mfrow=c(1,2)); qqnorm(Z1); qqnorm(Z1.1);
    jarqueberaTest(Z1.1);
Normality is still rejected. Another transformation.
    Z1.2=(abs(Z1))^(1/3)
    par(mfrow=c(1,2)); qqnorm(Z1); qqnorm(Z1.2);
    jarqueberaTest(Z1.2);
The p-value is [value missing]. For α = 0.01 normality is not rejected. Let's try another transformation.
    Z1.3=(abs(Z1))^(1/3.5)
    par(mfrow=c(1,2)); qqnorm(Z1); qqnorm(Z1.3);
    jarqueberaTest(Z1.3);
The p-value is [value missing]. Normality is rejected.

Marking scheme for Q3: Total 3 points.
- Maximal number of points if the test is performed for the original data and normality is accepted.
- Maximal number of points if the test is performed for the original data, normality is rejected, and a transformation is applied to get normality.
- Subtract 2 points if normality is rejected for the original data and no transformation is applied.
- Subtract 1 point if normality is rejected for the original data and a transformation is applied but not tested.

Q4. Simulate data from two normal populations with mean 0 and variances 1 and 9. Apply the Brown-Forsythe test. If the test rejects equality of the variances, apply remedial measures. Test again for equality of the variances.

Solution to Q4:
    library(lawstat); # assumed source of levene.test
    pop1=rnorm(100,0,1); n1=length(pop1); # mean 0, sd 1 (variance 1)
    pop2=rnorm(100,0,3); n2=length(pop2); # mean 0, sd 3 (variance 9)
    data<-data.frame(values=c(pop1,pop2), Treatment=c(rep(1,n1),rep(2,n2)))
    y<-data$values; x<-factor(data$Treatment);
    levene.test(y,x,location="median"); # Brown-Forsythe test
Output: data: y, Test Statistic = [value missing], p-value = [value missing]
No transformation is needed when the test does not reject. Note that rnorm takes the mean and then the standard deviation: rnorm(100,1) and rnorm(100,3) would give means 1 and 3 with both variances equal to 1, and for such data the test does not reject. With variances 1 and 9, as simulated above, the test will usually reject.

Marking scheme for Q4: This question will not be marked.

Q5. (7 points) Consider the SENIC data from Appendix C.1. The variables are explained on page [page number missing]. We would like to know if the mean length of stay (variable 2) is the same in the four geographic regions (variable 9).
(a) Write an appropriate ANOVA model for this study.
(b) Produce side-by-side boxplots. What are your observations? Compare central tendencies of the length of stay between geographic regions. Do the within-region variabilities appear to be the same?
(c) Examine by means of the Brown-Forsythe test whether or not the error variances are equal. What are your findings at α = 0.05?
(d) Based on our simple guide to transformations, which transformation on the response would be the best?
(e) Apply the transformation from (d) to the length of stay. Examine by means of the Brown-Forsythe test whether or not the error variances are equal. What are your findings at α = 0.05?
(f) Verify that the random error for the transformed response is normally distributed. What are your findings?
(g) Assume that the ANOVA model is appropriate for the transformed response. Test whether or not the mean length of stay in the transformed units is the same in the four geographic regions. Give the p-value and your conclusion.

Solution to Q5: I stored my data under the name senic.
    y<-senic$v2; x<-factor(senic$v9)
(a) Y_ij = µ_i + ε_ij, i = 1, ..., 4, j = 1, ..., n_i, with n_1 + n_2 + n_3 + n_4 = 113.
(b)
    par(mfrow=c(1,1))
    boxplot(y~x); # See the bottom of the file for the graph
There seem to be some differences between regions, especially between the first and the last region.
(c)
    y<-senic$v2; x<-factor(senic$v9)
    levene.test(y,x,location="median"); # Brown-Forsythe test
Output: data: y, Test Statistic = [value missing], p-value = [value missing]
The equality of variances is rejected.
(d)
    means=tapply(y,x,mean); stdev=tapply(y,x,sd);
    print(stdev^2/means); print(stdev/means); print(stdev/means^2);
The ratio that is most nearly constant across regions indicates the transformation: sqrt(Y) for the first ratio, log(Y) for the second, 1/Y for the third. It seems that log(Y) or 1/Y should be suitable (second and third case).
(e) For log(Y):
    logy=log(y);
    levene.test(logy,x,location="median"); # Brown-Forsythe test
Output: data: logy, Test Statistic = [value missing], p-value = [value missing]
For 1/Y:
    yinv=1/y;
    levene.test(yinv,x,location="median"); # Brown-Forsythe test
Output: data: yinv, Test Statistic = [value missing], p-value = 0.41
Both transformations lead to equality of the variances.
(f) For log(Y):
    jarqueberaTest(logy)
Output: X-squared: [value missing], Asymptotic p Value: 1.078e-09
For 1/Y:
    jarqueberaTest(yinv)
Output: X-squared: [value missing], Asymptotic p Value: [value missing]
Conclusion: 1/Y is the appropriate transform.
(g)
    summary(aov(yinv~x))
In the output, the p-value Pr(>F) for x is of order e-08 (significance code ***); the other entries of the ANOVA table were not preserved.
Conclusion: the means of the transformed variable 1/Y are different.

Marking scheme for Q5: a) - 1 point; b) - 1 point; c) - 1 point; d) - 1 point; e) - 1 point; f) - 1 point; g) - 1 point. Total: 7 points.
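The Brown-Forsythe test used in Q4 and Q5 is a one-way ANOVA on the absolute deviations of the observations from their group medians, so it can be checked by hand in base R. The sketch below is an illustration only: the data are simulated stand-ins (not the SENIC data), and the names brown_forsythe, g, bf.raw and bf.log are hypothetical.

```r
# Sketch: Brown-Forsythe test by hand, i.e. one-way ANOVA on the absolute
# deviations from the group medians. Data are simulated, NOT the SENIC data.
set.seed(2016)  # arbitrary seed, only for reproducibility

# Hypothetical stand-in: 4 groups whose spread grows with the mean (lognormal)
g <- factor(rep(1:4, each = 50))
y <- exp(rnorm(200, mean = c(1, 1.5, 2, 2.5)[as.integer(g)], sd = 0.3))

brown_forsythe <- function(y, g) {
  d <- abs(y - ave(y, g, FUN = median))  # absolute deviations from group medians
  tab <- anova(lm(d ~ g))                # one-way ANOVA on the deviations
  c(statistic = tab[["F value"]][1], p.value = tab[["Pr(>F)"]][1])
}

bf.raw <- brown_forsythe(y, g)       # original scale: unequal spreads by design
bf.log <- brown_forsythe(log(y), g)  # log scale: equal spreads by design
print(bf.raw)
print(bf.log)
```

On the simulated data the spread is proportional to the mean, which is the case where the guide in (d) points to log(Y); the test on the original scale should reject, while on the log scale it typically should not.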
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationNCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
More informationParametric and non-parametric statistical methods for the life sciences - Session I
Why nonparametric methods What test to use? Rank Tests Parametric and non-parametric statistical methods for the life sciences - Session I Liesbeth Bruckers Geert Molenberghs Interuniversity Institute
More informationStatistics Review PSY379
Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses
More informationThe t-test and Basic Inference Principles
Chapter 6 The t-test and Basic Inference Principles The t-test is used as an example of the basic principles of statistical inference. One of the simplest situations for which we might design an experiment
More informationZ-table p-values: use choice 2: normalcdf(
P-values with the Ti83/Ti84 Note: The majority of the commands used in this handout can be found under the DISTR menu which you can access by pressing [ nd ] [VARS]. You should see the following: NOTE:
More informationELEMENTARY STATISTICS
ELEMENTARY STATISTICS Study Guide Dr. Shinemin Lin Table of Contents 1. Introduction to Statistics. Descriptive Statistics 3. Probabilities and Standard Normal Distribution 4. Estimates and Sample Sizes
More informationUsing SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
More informationCHAPTER 14 NONPARAMETRIC TESTS
CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More information