Unit 14: Nonparametric Statistical Methods


1 Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 7/26/2004 Unit 14  Stat Ramón V. León 1
2 Introductory Remarks
Most methods studied so far have been based on the assumption of normally distributed data. Frequently this assumption is not valid, and the sample size may be too small to verify it. Sometimes the data are measured on an ordinal scale.
Nonparametric or distribution-free statistical methods make very few assumptions about the form of the population distribution from which the data are sampled. They are based on ranks, so they can be used on ordinal data.
We will concentrate on hypothesis tests but will also mention confidence interval procedures.
3 Inference for a Single Sample
Consider a random sample $x_1, x_2, \ldots, x_n$ from a population with unknown median $\mu$. (Recall that for nonnormal, especially skewed, distributions the median is a better measure of the center than the mean.)
$H_0: \mu = \mu_0$ vs. $H_1: \mu > \mu_0$
Example: Test whether the median household income of a population exceeds $50,000 based on a random sample of household incomes from that population.
For simplicity we sometimes present methods for one-sided tests. Modifications for two-sided tests are straightforward and are given in the textbook. Some examples in these notes are two-sided tests.
4 Sign Test for a Single Sample
Sign test for $H_0: \mu = \mu_0$ vs. $H_1: \mu > \mu_0$:
1. Count the number of $x_i$'s that exceed $\mu_0$. Denote this number by $s_+$, called the number of plus signs. Let $s_- = n - s_+$, which is the number of minus signs.
2. Reject $H_0$ if $s_+$ is large or, equivalently, if $s_-$ is small.
Test idea: Under the null hypothesis $S_+$ has a binomial distribution, $\mathrm{Bin}(n, 1/2)$, so this test is simply the test for a binomial proportion.
5 Sign Test Example
A thermostat used in an electric device is to be checked for the accuracy of its design setting of 200ºF. Ten thermostats were tested to determine their actual settings, resulting in the following data: 202.2, 203.4, 200.5, 202.5, 206.3, 198.0, 203.7, 200.8, 201.3, ...
$H_0: \mu = 200$ vs. $H_1: \mu \neq 200$
$s_+ = 8$ = number of data values > 200, so $s_- = 2$ and
two-sided P-value $= 2\sum_{i=8}^{10}\binom{10}{i}\left(\tfrac{1}{2}\right)^{10} = 2\sum_{i=0}^{2}\binom{10}{i}\left(\tfrac{1}{2}\right)^{10} = 2 \cdot \tfrac{56}{1024} \approx 0.109$
(A t test based on the mean can also be used; recall, however, that the t test assumes a normal population.)
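The binomial calculation behind the sign test can be checked with a few lines of Python. This is a minimal sketch using only the counts stated on the slide ($s_+ = 8$ out of $n = 10$); the function name `sign_test_pvalues` is an illustrative choice, not part of the course material.

```python
from math import comb

def sign_test_pvalues(s_plus, n):
    """Exact sign-test tail probabilities under S+ ~ Bin(n, 1/2)."""
    # P(S+ >= s_plus): evidence that the median exceeds the null value
    upper = sum(comb(n, i) for i in range(s_plus, n + 1)) / 2 ** n
    # P(S+ <= s_plus): evidence that the median is below the null value
    lower = sum(comb(n, i) for i in range(0, s_plus + 1)) / 2 ** n
    # Two-sided P-value: double the smaller tail, capped at 1
    two_sided = min(1.0, 2 * min(upper, lower))
    return upper, two_sided

upper, two_sided = sign_test_pvalues(8, 10)
print(round(upper, 4), round(two_sided, 4))
```

For these counts the upper tail is 56/1024, about 0.055, and doubling it gives about 0.109.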
6 Normal Approximation to Test Statistic
If the sample size is large ($n \geq 20$), the common null distribution of $S_+$ and $S_-$ is approximated by a normal distribution with
$E(S_+) = E(S_-) = np = n/2$ and $\mathrm{Var}(S_+) = \mathrm{Var}(S_-) = np(1-p) = n/4$.
Therefore we can perform a one-sided z test with
$z = \dfrac{s_+ - n/2}{\sqrt{n/4}}$
7 P-values for Sign Test Using JMP
Based on the normal approximation to the binomial ($\chi^2 = z^2$).
8 Treatment of Ties
The theory of the test assumes that the distribution of the data is continuous, so in theory ties are impossible. In practice they do occur because of rounding. A simple solution is to ignore the ties and work only with the untied observations. This does reduce the effective sample size of the test and hence its power, but the loss is not significant if there are only a few ties.
9 Confidence Interval for $\mu$
Let $x_{(1)} \leq x_{(2)} \leq \cdots \leq x_{(n)}$ be the ordered data values. Then a $(1-\alpha)$-level CI for $\mu$ is given by
$x_{(b+1)} \leq \mu \leq x_{(n-b)}$
where $b = b_{n,1-\alpha/2}$ is the lower $\alpha/2$ critical point of the $\mathrm{Bin}(n, 1/2)$ distribution.
Note: Not all confidence levels are possible because of the discreteness of the binomial distribution.
10 Thermostat Setting: Sign Confidence Interval for the Median
From Table A.1 we see that for n = 10 and p = 0.5, the lower critical point of the binomial distribution is 1 and, by symmetry, the upper critical point is 9. Setting $\alpha/2 = 0.011$, which gives $1 - \alpha = 1 - 0.022 = 0.978$, we find that
$x_{(2)} \leq \mu \leq x_{(9)}$
is a 97.8% CI for $\mu$.
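The table lookup for the sign confidence interval can be imitated in code. The sketch below is an illustration only: the function name `sign_ci` and the ten data values are hypothetical (they are not the thermostat data), and the critical point b is taken as the largest integer whose lower binomial tail does not exceed $\alpha/2$.

```python
from math import comb

def sign_ci(data, alpha):
    """Sign confidence interval [x_(b+1), x_(n-b)] for the median."""
    n = len(data)
    # b = largest integer with P(Bin(n, 1/2) <= b) <= alpha / 2
    cdf, b = 0.0, -1
    for k in range(n + 1):
        cdf += comb(n, k) / 2 ** n
        if cdf > alpha / 2:
            break
        b = k
    if b < 0:
        raise ValueError("sample too small for this confidence level")
    xs = sorted(data)
    # Achieved level is 1 - 2 * P(Bin(n, 1/2) <= b)
    tail = sum(comb(n, k) for k in range(b + 1)) / 2 ** n
    # xs is 0-indexed, so x_(b+1) is xs[b] and x_(n-b) is xs[n-b-1]
    return xs[b], xs[n - b - 1], 1 - 2 * tail

# Hypothetical sample of size 10 (not the data from the slides)
sample = [3.1, 4.7, 5.0, 5.6, 6.2, 6.8, 7.3, 7.9, 8.4, 9.9]
low, high, level = sign_ci(sample, 0.05)
print(low, high, round(level, 3))
```

With n = 10 and a nominal $\alpha$ of 0.05 this picks b = 1, so the interval runs from the 2nd to the 9th ordered value and the achieved level is about 0.978, in line with the slide.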
11 Sign Test for Matched Pairs
Drop the 3 tied pairs. Then $s_+ = 20$ and $s_- = 3$.
12 Sign Test for Matched Pairs
13 Sign Test for Matched Pairs in JMP
Pearson's p-value is not the same as the book's two-sided P-value because the book uses the continuity correction in the normal approximation to the binomial distribution; that is, the book (page 567) uses the continuity-corrected z statistic rather than the uncorrected z used by JMP. Note that the square of the uncorrected z equals Pearson's chi-square statistic.
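The difference between the two z statistics can be reproduced from the counts on slide 11. This sketch assumes that slide 13 refers to the same matched-pairs example ($s_+ = 20$, $s_- = 3$, so n = 23 after dropping ties) and that the continuity correction subtracts 1/2 from the numerator for an upper-tail test; `sign_test_z` is an illustrative name.

```python
from math import sqrt

def sign_test_z(s_plus, n, continuity=False):
    """Normal-approximation z statistic for the sign test (upper tail)."""
    correction = 0.5 if continuity else 0.0
    return (s_plus - n / 2 - correction) / sqrt(n / 4)

s_plus, s_minus = 20, 3
n = s_plus + s_minus

z_plain = sign_test_z(s_plus, n)                # no continuity correction
z_cc = sign_test_z(s_plus, n, continuity=True)  # with continuity correction

# Pearson chi-square for observed counts (20, 3) against expected (n/2, n/2)
expected = n / 2
chi2 = (s_plus - expected) ** 2 / expected + (s_minus - expected) ** 2 / expected

print(round(z_plain, 3), round(z_cc, 3), round(chi2, 3))
```

The squared uncorrected z matches the Pearson chi-square statistic, which is why a chi-square based p-value differs from one based on the corrected z.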
14 Wilcoxon Signed Rank Test
$H_0: \mu = \mu_0$ vs. $H_1: \mu \neq \mu_0$
This test is more powerful than the sign test; however, it requires the assumption that the population distribution is symmetric.
Examples 14.1 and 14.4: the thermostat setting is 200ºF.
1. Rank and order the differences in terms of their absolute values.
2. Calculate $w_+$ = sum of the ranks of the positive differences.
3. Reject $H_0$ if $w_+$ is large or small.
15 Wilcoxon Signed Rank Test in JMP
This test finds a significant difference at $\alpha = 0.05$, while the sign test did not even at $\alpha = 0.1$.
16 Normal Approximation in the Wilcoxon Signed Rank Test
For large n, the common null distribution $W$ of $W_+$ and $W_-$ can be well approximated by a normal distribution with mean and variance given by
$E(W) = \dfrac{n(n+1)}{4}$ and $\mathrm{Var}(W) = \dfrac{n(n+1)(2n+1)}{24}$.
For large samples a one-sided (greater than median) z-test uses the statistic
$z = \dfrac{w_+ - n(n+1)/4 - 1/2}{\sqrt{n(n+1)(2n+1)/24}}$
17 Importance of Symmetric Population Assumption
Here, even though $H_0$ is true, the long right-hand tail makes the positive differences tend to be larger in magnitude than the negative differences, resulting in higher ranks. This inflates $w_+$ and hence the test's type I error probability.
18 Null Distribution of the Wilcoxon Signed Rank Statistics
19 Null Distribution of the Wilcoxon Signed Rank Statistics
20 Wilcoxon Signed Rank Statistic: Treatment of Ties
There are two types of ties.
Some of the data are equal to the median. Drop these observations.
Some of the differences from the median may be tied. Use the midrank, that is, the average rank.
For example, suppose $d_1 = -1$, $d_2 = +3$, $d_3 = -3$, $d_4 = +5$. Then
$r_1 = 1$, $r_2 = r_3 = \dfrac{2+3}{2} = 2.5$, $r_4 = 4$.
With ties, Table A.10 is only approximate.
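The midrank rule can be sketched as follows. The code ranks the absolute differences, averages ranks within tied groups, and sums the ranks of the positive differences. The signs of $d_1$ and $d_3$ are assumed to be negative here (the transcription lost the minus signs), and the helper names `midranks` and `signed_rank_w_plus` are illustrative.

```python
def midranks(values):
    """Ranks 1..n of the values, averaging the ranks within tied groups."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to the end of the run of values tied with position i
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1  # mean of the ranks i+1, ..., j+1
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks

def signed_rank_w_plus(diffs):
    """Midranks of |d| and w+, after dropping zero differences."""
    nonzero = [d for d in diffs if d != 0]
    ranks = midranks([abs(d) for d in nonzero])
    w_plus = sum(r for d, r in zip(nonzero, ranks) if d > 0)
    return ranks, w_plus

# Differences from the slide, with assumed signs for d1 and d3
ranks, w_plus = signed_rank_w_plus([-1, 3, -3, 5])
print(ranks, w_plus)
```

Under the assumed signs the ranks are 1, 2.5, 2.5, 4 and $w_+ = 2.5 + 4 = 6.5$.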
21 Wilcoxon Signed Rank Test: Matched Pair Design
Example 14.5: Comparing Two Methods of Cardiac Output
Notice that we drop the three zero differences and that we average the tied ranks.
Two-sided P-values are compared for the sign test, the signed rank test, and the t-test (page 284). Notice that these tests require progressively more stringent assumptions about the population of differences.
22 JMP Calculation
23 Signed Rank Confidence Interval for the Median
24 Thermostat Setting: Wilcoxon Signed Rank Confidence Interval for Median
From Table A.10 we see that for n = 10, the upper 2.4% critical point is 47 and, by symmetry, the lower 2.4% critical point is
$\dfrac{10(10+1)}{2} - 47 = 55 - 47 = 8$.
Setting $\alpha/2 = 0.024$, and hence $1 - \alpha = 1 - 0.048 = 0.952$, we obtain a 95.2% CI for $\mu$.
25 Inferences for Two Independent Samples
One wants to show that the observations from one population tend to be larger than those from another population, based on independent random samples $x_1, x_2, \ldots, x_{n_1}$ and $y_1, y_2, \ldots, y_{n_2}$.
Examples: Treated patients tend to live longer than untreated patients. An equity fund tends to have a higher yield than a bond fund.
26 Wilcoxon-Mann-Whitney Test
Example: Time to Failure of Two Capacitor Groups
Reject for extreme values of $w_1$.
27 Stochastic Ordering of Populations
X is stochastically larger than Y if for all real numbers u,
$P(X > u) \geq P(Y > u)$, or equivalently, $P(X \leq u) = F_1(u) \leq F_2(u) = P(Y \leq u)$,
with strict inequality for at least some u. This is denoted by $X \succ Y$ or, equivalently, by $F_1 < F_2$.
28 Stochastic Ordering Special Case: Location Difference
$\theta$ is called a location parameter. Notice that $X_1 \prec X_2$ iff $\theta_1 < \theta_2$.
29 Wilcoxon-Mann-Whitney Test
$H_0: F_1 = F_2$ ($X \sim Y$)
Alternatives:
One-sided: $H_1: F_1 < F_2$ ($X \succ Y$)
Two-sided: $H_1: F_1 < F_2$ or $F_2 < F_1$ ($X \succ Y$ or $Y \succ X$)
Notice that the alternative is not $H_1: F_1 \neq F_2$. (The Kolmogorov-Smirnov test can handle this alternative.)
30 Wilcoxon Version of the Test
$H_0: F_1 = F_2$ ($X \sim Y$) vs. $H_1: F_1 < F_2$ ($X \succ Y$)
1. Rank all $N = n_1 + n_2$ observations, $x_1, \ldots, x_{n_1}$ and $y_1, \ldots, y_{n_2}$, in ascending order.
2. Sum the ranks of the x's and y's separately. Denote these sums by $w_1$ and $w_2$.
3. Reject $H_0$ if $w_1$ is large or, equivalently, if $w_2$ is small.
31 Mann-Whitney Test Version
The advantage of using the Mann-Whitney form of the test is that the same distribution applies whether we use $u_1$ or $u_2$.
P-value $= P(U \geq u_1) = P(U \leq u_2)$
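The two forms of the test can be computed side by side. This sketch assumes the usual definitions, which are not spelled out in the transcription: $u_1$ counts the pairs with $x_i > y_j$, $u_2$ counts the pairs with $x_i < y_j$, and $u_1 = w_1 - n_1(n_1+1)/2$. The data are hypothetical and contain no ties.

```python
def rank_sums_and_u(x, y):
    """Wilcoxon rank sums (w1, w2) and Mann-Whitney counts (u1, u2); no ties assumed."""
    combined = sorted(x + y)
    rank = {value: i + 1 for i, value in enumerate(combined)}
    w1 = sum(rank[v] for v in x)
    w2 = sum(rank[v] for v in y)
    # u1 = number of (x, y) pairs in which the x value is the larger one
    u1 = sum(1 for xi in x for yj in y if xi > yj)
    # u2 = number of (x, y) pairs in which the y value is the larger one
    u2 = sum(1 for xi in x for yj in y if xi < yj)
    return w1, w2, u1, u2

# Hypothetical samples (not the capacitor data)
x = [5.2, 7.1, 9.4]
y = [3.3, 4.8, 6.0, 8.5]
w1, w2, u1, u2 = rank_sums_and_u(x, y)
print(w1, w2, u1, u2)
```

Here $w_1 = 15$ and $u_1 = 15 - 3 \cdot 4/2 = 9$, and $u_1 + u_2 = n_1 n_2 = 12$, which is why a large $u_1$ and a small $u_2$ carry the same information.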
32 Null Distribution of the Wilcoxon-Mann-Whitney Test Statistic
Under the null hypothesis each of these $\binom{5}{2} = 10$ orderings has an equal chance of occurring, namely, 1/10.
33 Null Distribution of the Wilcoxon-Mann-Whitney Test Statistic
$P(W_1 \geq 8) = 0.1 + 0.1 = 0.2$ (one-sided p-value for $w_1 = 8$, with $H_1: X \succ Y$)
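The null distribution of the rank sum can be enumerated directly for small samples. The sketch below assumes sample sizes $n_1 = 2$ and $n_2 = 3$, which is consistent with the 10 equally likely orderings mentioned on the slide; `rank_sum_null_distribution` is an illustrative name.

```python
from fractions import Fraction
from itertools import combinations

def rank_sum_null_distribution(n1, n2):
    """Exact null distribution of W1: every set of n1 ranks is equally likely."""
    total_ranks = n1 + n2
    sums = [sum(c) for c in combinations(range(1, total_ranks + 1), n1)]
    return {w: Fraction(sums.count(w), len(sums)) for w in sorted(set(sums))}

dist = rank_sum_null_distribution(2, 3)
p_value = sum(prob for w, prob in dist.items() if w >= 8)
print(dist)
print(p_value)  # 1/5
```

Only the rank pairs (3, 5) and (4, 5) give a sum of at least 8, so the one-sided p-value is 2/10 = 0.2.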
34 Normal Approximation of Mann-Whitney Statistic
For large $n_1$ and $n_2$, the null distribution of U can be well approximated by a normal distribution with mean and variance given by
$E(U) = \dfrac{n_1 n_2}{2}$ and $\mathrm{Var}(U) = \dfrac{n_1 n_2 (N+1)}{12}$.
A large-sample one-sided z test ($H_1: X \succ Y$) can be based on the statistic
$z = \dfrac{u_1 - n_1 n_2/2}{\sqrt{n_1 n_2 (N+1)/12}}$
35 Treatment of Ties
A tie occurs when some x equals some y. A contribution of 1/2 is counted towards both $u_1$ and $u_2$ for each tied pair. This is equivalent to using the midrank method in computing the Wilcoxon rank sum statistic.
36 Wilcoxon-Mann-Whitney Confidence Interval
Example 14.8 shows that $[d_{(18)}, d_{(63)}] = [1.1, 14.7]$ is a 95.6% CI for the difference of the two medians of the failure times of capacitors. This example is in the book errata since Table A.11 is not detailed enough.
37 Wilcoxon-Mann-Whitney Test in JMP
The JMP output is shown both with and without the continuity correction. The book uses the continuity correction to obtain its one-sided P-value.
38 Inference for Several Independent Samples: Kruskal-Wallis Test
Note that this is a completely randomized design.
39 Kruskal-Wallis Test
$H_0: F_1 = F_2 = \cdots = F_a$ vs. $H_1: F_i < F_j$ for some $i \neq j$
The statistic kw measures the distance of the treatment average ranks from the overall average rank.
Reject $H_0$ if $kw > \chi^2_{a-1,\alpha}$.
40 Chi-Square Approximation
For large samples the distribution of KW under the null hypothesis can be approximated by the chi-square distribution with $a - 1$ degrees of freedom, so reject $H_0$ if $kw > \chi^2_{a-1,\alpha}$.
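The formula for kw did not survive the transcription, so the sketch below assumes the usual form of the statistic, $kw = \frac{12}{N(N+1)} \sum_i n_i \left(\bar{r}_i - \frac{N+1}{2}\right)^2$, where $\bar{r}_i$ is the average rank of treatment i in the pooled ranking. The data are hypothetical and have no ties.

```python
def kruskal_wallis(groups):
    """Kruskal-Wallis statistic for a list of samples (no ties assumed)."""
    pooled = sorted(v for group in groups for v in group)
    total = len(pooled)
    rank = {value: i + 1 for i, value in enumerate(pooled)}
    overall_mean_rank = (total + 1) / 2
    weighted_sq = 0.0
    for group in groups:
        mean_rank = sum(rank[v] for v in group) / len(group)
        # Squared distance of this group's average rank from the overall average
        weighted_sq += len(group) * (mean_rank - overall_mean_rank) ** 2
    return 12 / (total * (total + 1)) * weighted_sq

# Hypothetical data: three treatments with three observations each
groups = [[1.2, 2.4, 3.1], [4.5, 5.0, 6.3], [7.7, 8.1, 9.9]]
kw = kruskal_wallis(groups)
print(round(kw, 3))
```

For these completely separated groups the average ranks are 2, 5 and 8, giving kw = 7.2, which would be compared with a chi-square critical value on a - 1 = 2 degrees of freedom.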
41 Kruskal-Wallis Test Example
Reject if kw is large. $\chi^2_{3,.005} = 12.838$
42 Kruskal-Wallis Test in JMP
44 Case method is different from Unitary method. Formula method is different from Unitary method.
45 Pairwise Comparisons: Is Any Pair of Treatments Different?
One can use the Tukey method on the average ranks to make approximate pairwise comparisons. This is one of many approximate techniques where ranks are substituted for the observations in the normal theory methods.
48 Tukey's Test Applied to the Ranks Averaged
Lack of agreement with the more precise method of the example. Here the Equation method also seems to be different from the Formula and Case methods.
49 Example of Friedman's Test
Ranking is done within blocks. $\chi^2_{7,.025} = 16.013$. The Friedman P-value of .0040 can be compared with the P-value from the ANOVA table.
50 Inference for Several Matched Samples
Randomized block design: $a \geq 2$ treatment groups and $b \geq 2$ blocks.
$y_{ij}$ = observation on the ith treatment in the jth block
$F_{ij}$ = c.d.f. of the r.v. $Y_{ij}$ corresponding to the observed value $y_{ij}$
For simplicity assume $F_{ij}(y) = F(y - \theta_i - \beta_j)$, where $\theta_i$ is the "treatment effect" and $\beta_j$ is the "block effect"; i.e., we assume that there is no treatment by block interaction.
51 Friedman Test
$H_0: \theta_1 = \theta_2 = \cdots = \theta_a$ vs. $H_1: \theta_i > \theta_j$ for some $i \neq j$
The statistic fr measures the distance of the rank totals from their expected value when there is no agreement between the blocks.
Reject $H_0$ if $fr > \chi^2_{a-1,\alpha}$.
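The Friedman statistic itself is not legible in the transcription, so this sketch assumes the usual form, $fr = \frac{12}{ab(a+1)} \sum_i \left(r_i - \frac{b(a+1)}{2}\right)^2$, where $r_i$ is the rank sum of treatment i and ranking is done within each block. The table of observations is hypothetical and has no ties within a block.

```python
def friedman(table):
    """Friedman statistic; table[j][i] is treatment i in block j (no ties within a block)."""
    b = len(table)     # number of blocks
    a = len(table[0])  # number of treatments
    rank_sums = [0] * a
    for block in table:
        # Rank the a observations within this block, smallest = rank 1
        order = sorted(range(a), key=lambda i: block[i])
        for rank, treatment in enumerate(order, start=1):
            rank_sums[treatment] += rank
    expected = b * (a + 1) / 2  # expected rank sum under the null hypothesis
    fr = 12 / (a * b * (a + 1)) * sum((r - expected) ** 2 for r in rank_sums)
    return rank_sums, fr

# Hypothetical data: 4 blocks (rows) by 3 treatments (columns)
table = [
    [10, 12, 15],
    [9, 13, 14],
    [11, 12, 16],
    [8, 14, 13],
]
rank_sums, fr = friedman(table)
print(rank_sums, fr)
```

Here the rank sums are 4, 9 and 11 against an expected value of 8, so fr = 6.5, to be compared with a chi-square critical value on a - 1 = 2 degrees of freedom.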
52 Pairwise Comparisons
53 Rank Correlation Methods
The Pearson correlation coefficient measures only the degree of linear association between two variables, and inferences use the assumption of bivariate normality of the two variables.
We present two correlation coefficients that take into account only the ranks of the observations and measure the degree of monotonic (increasing or decreasing) association between two variables.
54 Motivating Example
$(x, y) = (1, e^1), (2, e^2), (3, e^3), (4, e^4), (5, e^5)$
Note that there is a perfect positive association between x and y, with $y = e^x$. The Pearson correlation coefficient is only about 0.886 because the relationship is not linear. The rank correlation coefficients we present yield a value of 1 for these data.
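The motivating example is easy to verify numerically. The sketch below computes the Pearson coefficient on the raw values and again on the ranks (which is Spearman's coefficient when there are no ties); the function names are illustrative.

```python
from math import exp, sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def ranks(values):
    """Ranks 1..n of the values (no ties assumed)."""
    ordered = sorted(values)
    return [ordered.index(v) + 1 for v in values]

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

x = [1, 2, 3, 4, 5]
y = [exp(v) for v in x]
print(round(pearson(x, y), 3), spearman(x, y))
```

Because $y = e^x$ is increasing, the ranks of x and y coincide and the rank correlation is exactly 1, while the Pearson coefficient on the raw values is noticeably below 1.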
55 Spearman's Rank Correlation Coefficient
Ranges between $-1$ and $+1$, with $r_S = -1$ when there is a perfect negative association and $r_S = +1$ when there is a perfect positive association.
56 Example (Wine Consumption and Heart Disease Deaths per 100,000)
58 Calculation of Spearman's Rho
59 Test for Association Based on Spearman's Rank Correlation Coefficient
60 Hypothesis Testing Example
$H_0$: X = Wine Consumption and Y = Heart Disease Deaths are independent vs. $H_1$: X and Y are (negatively or positively) associated.
The test statistic is $z = r_S\sqrt{n-1}$, and the two-sided P-value gives evidence of negative association.
61 JMP Calculations: Pearson Correlation
Plot of Heart Disease Deaths versus Alcohol from Wine, with the Pearson correlation. The plot is fairly linear.
62 JMP Calculations: Spearman Rank Correlation
63 Kendall's Rank Correlation Coefficient: Key Concept
Examples
Concordant pairs: (1,2), (4,9): $(1-4)(2-9) > 0$; (4,2), (3,1): $(4-3)(2-1) > 0$
Discordant pairs: (1,2), (9,1): $(1-9)(2-1) < 0$; (2,4), (3,1): $(2-3)(4-1) < 0$
Tied pairs: (1,3), (1,5): $(1-1)(3-5) = 0$; (1,4), (2,4): $(1-2)(4-4) = 0$; (1,2), (1,2): $(1-1)(2-2) = 0$
Kendall's idea is to compare the number of concordant pairs to the number of discordant pairs in bivariate data.
64 Kendall's Tau Example
(X, Y): (1, 2), (3, 4), (2, 1)
Number of pairwise comparisons: $N = \binom{n}{2} = \binom{3}{2} = 3$
Concordant pairs: (1,2) and (3,4); (3,4) and (2,1); so $N_c = 2$
Discordant pairs: (1,2) and (2,1); so $N_d = 1$
$\hat{\tau} = \dfrac{N_c - N_d}{N} = \dfrac{2 - 1}{3} = \dfrac{1}{3}$
65 Kendall's Rank Correlation Coefficient: Population Version
66 Kendall's Rank Correlation Coefficient: Sample Estimate
Let $N_c$ = number of concordant pairs in the data.
Let $N_d$ = number of discordant pairs in the data.
Let $N = \binom{n}{2}$ be the number of pairwise comparisons among the observations $(x_i, y_i)$, $i = 1, 2, \ldots, n$. Then
$\hat{\tau} = \dfrac{N_c - N_d}{N}$, with $N_c + N_d = N$, if there are no ties;
$\hat{\tau} = \dfrac{N_c - N_d}{\sqrt{(N - T_x)(N - T_y)}}$ if there are ties,
where $T_x$ and $T_y$ are corrections for the number of tied pairs.
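Counting concordant and discordant pairs is mechanical, as this sketch shows. It implements only the no-ties version of the estimate, $(N_c - N_d)/N$, and uses the three points from the Kendall's tau example; `kendall_tau` is an illustrative name.

```python
from itertools import combinations

def kendall_tau(points):
    """Concordant count, discordant count, and tau-hat = (Nc - Nd) / N (no-ties form)."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(points, 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
        # product == 0 is a tied pair and counts toward neither total
    n = len(points)
    comparisons = n * (n - 1) // 2
    return concordant, discordant, (concordant - discordant) / comparisons

nc, nd, tau = kendall_tau([(1, 2), (3, 4), (2, 1)])
print(nc, nd, round(tau, 4))
```

For the three example points this gives two concordant pairs, one discordant pair and an estimate of 1/3.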
67 Hypothesis of Independence Versus Positive Association
Wine data
68 JMP Calculations: Kendall's Rank Correlation Coefficient
69 Kendall's Coefficient of Concordance
A measure of association between several matched samples, closely related to Friedman's test statistic.
Consider a candidates (treatments) and b judges (blocks), with each judge ranking the a candidates.
If there is perfect agreement between the judges, then each candidate gets the same rank from every judge. Assuming the candidates are labeled in the order of their ranking, the rank sum for the ith candidate would be $r_i = ib$.
If the judges rank the candidates completely at random ("perfect disagreement"), then the expected rank of each candidate would be $[1 + 2 + \cdots + a]/a = [a(a+1)/2]/a = (a+1)/2$, and the expected value of every rank sum would equal $b(a+1)/2$.
70 Kendall's Coefficient of Concordance
71 Kendall's Coefficient of Concordance and Friedman's Test
72 $w = \dfrac{fr}{b(a-1)}$, with $a - 1 = 8 - 1$ for the example
73 Do You Need to Know More?
Nonparametric Statistical Methods, Second Edition, by Myles Hollander and Douglas A. Wolfe (1999), Wiley-Interscience.
74 Resampling Methods
Conventional methods are based on the sampling distribution of a statistic computed for the observed sample. The sampling distribution is derived by considering all possible samples of size n from the underlying population.
Resampling methods generate the sampling distribution of the statistic by drawing repeated samples from the observed sample itself. This eliminates the need to assume a specific functional form for the population distribution (e.g., normal).
75 Challenger Shuttle O-Ring Data
Do we have statistical evidence that cold temperature leads to more O-ring incidents? Notice that the assumptions of the two-sample t test do not hold. The original analysis omitted the zeros. Was this justified? What do we do?
76 Wrong t-test Analysis
Difference of Low mean to High mean. Notice that the assumptions of the independent-sample t-test do not hold, i.e., the data are not normal for each group.
77 Permutation Distribution of t Statistic
The tail proportion shown is also equal to the two-sided p-value. The permutation distribution is equivalent to selecting all simple random samples without replacement of size 20 from the 24 data points, labeling these High and the rest Low.
78 Comments
A randomization test is a permutation test applied to data from a randomized experiment. Randomization tests are the gold standard for establishing causality.
A permutation test considers all possible simple random samples without replacement from the set of observed data values.
The bootstrap method considers a large number of simple random samples with replacement from the set of observed data values.
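A permutation test can be sketched by enumerating every relabeling of the pooled data. The example below is an illustration under stated assumptions: it uses the difference between the Low and High means as the statistic (a simpler stand-in for the t statistic used on the slides), and the six data values are hypothetical, not the Challenger data.

```python
from itertools import combinations

def permutation_pvalue(low, high):
    """One-sided permutation P-value for mean(low) - mean(high)."""
    pooled = low + high
    n_low, n_high = len(low), len(high)
    total = sum(pooled)

    def mean_difference(low_sum):
        # Low mean minus High mean, given the sum of the values labeled Low
        return low_sum / n_low - (total - low_sum) / n_high

    observed = mean_difference(sum(low))
    at_least_as_large = relabelings = 0
    # Every choice of n_low positions to label Low is equally likely under H0
    for positions in combinations(range(len(pooled)), n_low):
        relabelings += 1
        if mean_difference(sum(pooled[i] for i in positions)) >= observed:
            at_least_as_large += 1
    return at_least_as_large / relabelings

# Hypothetical incident counts (not the data from the slides)
low_temp = [3, 2]
high_temp = [0, 1, 0, 0]
p_value = permutation_pvalue(low_temp, high_temp)
print(round(p_value, 4))
```

With these values only the observed labeling produces a difference as large as the one observed, so the P-value is 1 out of the 15 possible relabelings.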
79 Calculation of t Statistics from 10,000 Bootstrap Samples
Think of placing the 24 Challenger data values in a hat and randomly selecting 24 values with replacement from the hat, labeling the first 20 values High and the remaining 4 values Low. We repeat this process 10,000 times. For each of these 10,000 bootstrap samples we calculate the t statistic.
Out of the 10,000 t statistic values, 35 were greater than or equal to the observed value (if $s_p = 0$, t is defined to be 0). This gives a bootstrap P-value of 35/10000 = 0.0035.
80 Bootstrap Distribution of Difference Between the Means
Of the 10,000 differences of the Low mean and the High mean, 67 were greater than or equal to 1.3. This gives a bootstrap P-value of 67/10000 = 0.0067.
Conclusion: Cold weather increases the chance of O-ring problems.
81 Bootstrap Final Remarks
The JMP files that we used to generate the bootstrap samples and to calculate the statistics are available at the course web site.
There are bootstrap procedures for most types of statistical problems. All are based on resampling from the data. These methods do not assume specific functional forms for the distribution of the data (e.g., normal).
The accuracy of bootstrap procedures depends on the sample size and the number of bootstrap samples generated.
82 How Were the Bootstrap Samples Generated? (see next page)
87 Calculated Columns in JMP Samples File
95 Bootstrap Estimate of the Standard Error of the Mean
Summary: We calculate the standard deviation of the N bootstrap estimates of the mean.
96 BSE for Arbitrary Statistic
Example: The bootstrap standard error of the median is calculated by drawing a large number N of bootstrap samples from the data. For each bootstrap sample we calculate the sample median. Then we calculate the standard deviation of the N bootstrap medians.
97 Estimated Bootstrap Standard Error for t Statistics Using JMP
Note N = 10,000.
98 Bootstrap Standard Error Interpretation
Many bootstrap statistics have an approximate normal distribution.
Confidence interval interpretation:
68% of the time the bootstrap estimate (the average of the bootstrap estimates) will be within one standard error of the true parameter value.
95% of the time the bootstrap estimate (the average of the bootstrap estimates) will be within two standard errors of the true parameter value.
99 Bootstrap Confidence Intervals
Percentile Method: Median Example
1. Draw N (= 10000) bootstrap samples from the data and for each calculate the (bootstrap) sample median.
2. The 2.5th percentile of the N bootstrap sample medians will be the LCL for a 95% confidence interval.
3. The 97.5th percentile of the N bootstrap sample medians will be the UCL for a 95% confidence interval.
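The bootstrap standard error and the percentile interval for the median can be sketched with the standard library. The data, the seed, and the simple percentile rule (taking the sorted value at position round(p/100 * (N - 1))) are assumptions made for illustration; the course itself used JMP files for these calculations.

```python
import random
from statistics import median, stdev

def bootstrap_medians(data, n_boot=10000, seed=0):
    """Sample medians of n_boot resamples drawn with replacement from data."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    n = len(data)
    return [median(rng.choices(data, k=n)) for _ in range(n_boot)]

def percentile(sorted_values, p):
    """Simple percentile: the sorted value at position round(p/100 * (N - 1))."""
    index = round(p / 100 * (len(sorted_values) - 1))
    return sorted_values[index]

# Hypothetical sample (not data from the slides)
data = [12.1, 9.8, 15.3, 11.0, 10.4, 13.7, 9.1, 14.2, 12.9, 10.8]

medians = sorted(bootstrap_medians(data))
bse = stdev(medians)             # bootstrap standard error of the median
lcl = percentile(medians, 2.5)   # lower limit of the 95% percentile interval
ucl = percentile(medians, 97.5)  # upper limit of the 95% percentile interval
print(round(bse, 3), lcl, ucl)
```

Other percentile conventions (for example, interpolating between order statistics) would give slightly different limits; with N = 10,000 resamples the difference is usually small.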
100 Do You Need to Know More?
An Introduction to the Bootstrap, by Bradley Efron and Robert J. Tibshirani (1993), Chapman & Hall/CRC.
More informationModule 9: Nonparametric Tests. The Applied Research Center
Module 9: Nonparametric Tests The Applied Research Center Module 9 Overview } Nonparametric Tests } Parametric vs. Nonparametric Tests } Restrictions of Nonparametric Tests } OneSample ChiSquare Test
More informationWilcoxon Rank Sum or MannWhitney Test Chapter 7.11
STAT NonParametric tests /0/0 Here s a summary of the tests we will look at: Setting Normal test NonParametric Test One sample Onesample ttest Sign Test Wilcoxon signedrank test Matched pairs Apply
More informationResampling: Bootstrapping and Randomization. Presented by: Jenn Fortune, Alayna Gillespie, Ivana Pejakovic, and Anne Bergen
Resampling: Bootstrapping and Randomization Presented by: Jenn Fortune, Alayna Gillespie, Ivana Pejakovic, and Anne Bergen Outline 1. The logic of resampling and bootstrapping 2. Bootstrapping: confidence
More informationChapter G08 Nonparametric Statistics
G08 Nonparametric Statistics Chapter G08 Nonparametric Statistics Contents 1 Scope of the Chapter 2 2 Background to the Problems 2 2.1 Parametric and Nonparametric Hypothesis Testing......................
More informationDifference tests (2): nonparametric
NST 1B Experimental Psychology Statistics practical 3 Difference tests (): nonparametric Rudolf Cardinal & Mike Aitken 10 / 11 February 005; Department of Experimental Psychology University of Cambridge
More informationMath 62 Statistics Sample Exam Questions
Math 62 Statistics Sample Exam Questions 1. (10) Explain the difference between the distribution of a population and the sampling distribution of a statistic, such as the mean, of a sample randomly selected
More informationChapter 7. Estimates and Sample Size
Chapter 7. Estimates and Sample Size Chapter Problem: How do we interpret a poll about global warming? Pew Research Center Poll: From what you ve read and heard, is there a solid evidence that the average
More informationStandard Deviation Calculator
CSS.com Chapter 35 Standard Deviation Calculator Introduction The is a tool to calculate the standard deviation from the data, the standard error, the range, percentiles, the COV, confidence limits, or
More informationStatistical Significance and Bivariate Tests
Statistical Significance and Bivariate Tests BUS 735: Business Decision Making and Research 1 1.1 Goals Goals Specific goals: Refamiliarize ourselves with basic statistics ideas: sampling distributions,
More informationMCQ TESTING OF HYPOTHESIS
MCQ TESTING OF HYPOTHESIS MCQ 13.1 A statement about a population developed for the purpose of testing is called: (a) Hypothesis (b) Hypothesis testing (c) Level of significance (d) Teststatistic MCQ
More informationOutline of Topics. Statistical Methods I. Types of Data. Descriptive Statistics
Statistical Methods I Tamekia L. Jones, Ph.D. (tjones@cog.ufl.edu) Research Assistant Professor Children s Oncology Group Statistics & Data Center Department of Biostatistics Colleges of Medicine and Public
More informationOn Small Sample Properties of Permutation Tests: A Significance Test for Regression Models
On Small Sample Properties of Permutation Tests: A Significance Test for Regression Models Hisashi Tanizaki Graduate School of Economics Kobe University (tanizaki@kobeu.ac.p) ABSTRACT In this paper we
More informationNCSS Statistical Software. OneSample TTest
Chapter 205 Introduction This procedure provides several reports for making inference about a population mean based on a single sample. These reports include confidence intervals of the mean or median,
More informationtests whether there is an association between the outcome variable and a predictor variable. In the Assistant, you can perform a ChiSquare Test for
This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. In practice, quality professionals sometimes
More informationSPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stemandleaf plots and extensive descriptive statistics. To run the Explore procedure,
More informationSimple Linear Regression
Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression Statistical model for linear regression Estimating
More informationHypothesis testing  Steps
Hypothesis testing  Steps Steps to do a twotailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
More informationStatistics and research
Statistics and research Usaneya Perngparn Chitlada Areesantichai Drug Dependence Research Center (WHOCC for Research and Training in Drug Dependence) College of Public Health Sciences Chulolongkorn University,
More informationHence, multiplying by 12, the 95% interval for the hourly rate is (965, 1435)
Confidence Intervals for Poisson data For an observation from a Poisson distribution, we have σ 2 = λ. If we observe r events, then our estimate ˆλ = r : N(λ, λ) If r is bigger than 20, we can use this
More informationAnalysis of Questionnaires and Qualitative Data Nonparametric Tests
Analysis of Questionnaires and Qualitative Data Nonparametric Tests JERZY STEFANOWSKI Instytut Informatyki Politechnika Poznańska Lecture SE 2013, Poznań Recalling Basics Measurment Scales Four scales
More informationE205 Final: Version B
Name: Class: Date: E205 Final: Version B Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The owner of a local nightclub has recently surveyed a random
More information9.1 (a) The standard deviation of the four sample differences is given as.68. The standard error is SE (ȳ1  ȳ 2 ) = SE d  = s d n d
CHAPTER 9 Comparison of Paired Samples 9.1 (a) The standard deviation of the four sample differences is given as.68. The standard error is SE (ȳ1  ȳ 2 ) = SE d  = s d n d =.68 4 =.34. (b) H 0 : The mean
More informationRegression in SPSS. Workshop offered by the Mississippi Center for Supercomputing Research and the UM Office of Information Technology
Regression in SPSS Workshop offered by the Mississippi Center for Supercomputing Research and the UM Office of Information Technology John P. Bentley Department of Pharmacy Administration University of
More informationChi Square for Contingency Tables
2 x 2 Case Chi Square for Contingency Tables A test for p 1 = p 2 We have learned a confidence interval for p 1 p 2, the difference in the population proportions. We want a hypothesis testing procedure
More informationResearch Methods 1 Handouts, Graham Hole,COGS  version 1.0, September 2000: Page 1:
Research Methods 1 Handouts, Graham Hole,COGS  version 1.0, September 000: Page 1: NONPARAMETRIC TESTS: What are nonparametric tests? Statistical tests fall into two kinds: parametric tests assume that
More informationHypothesis Testing. Chapter Introduction
Contents 9 Hypothesis Testing 553 9.1 Introduction............................ 553 9.2 Hypothesis Test for a Mean................... 557 9.2.1 Steps in Hypothesis Testing............... 557 9.2.2 Diagrammatic
More informationConfidence Intervals for the Area Under an ROC Curve
Chapter 261 Confidence Intervals for the Area Under an ROC Curve Introduction Receiver operating characteristic (ROC) curves are used to assess the accuracy of a diagnostic test. The technique is used
More informationPermutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. JaeWan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationQUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NONPARAMETRIC TESTS
QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NONPARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.
More informationRegression analysis in the Assistant fits a model with one continuous predictor and one continuous response and can fit two types of models:
This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. The simple regression procedure in the
More informationStatistics revision. Dr. Inna Namestnikova. Statistics revision p. 1/8
Statistics revision Dr. Inna Namestnikova inna.namestnikova@brunel.ac.uk Statistics revision p. 1/8 Introduction Statistics is the science of collecting, analyzing and drawing conclusions from data. Statistics
More informationAMS7: WEEK 8. CLASS 1. Correlation Monday May 18th, 2015
AMS7: WEEK 8. CLASS 1 Correlation Monday May 18th, 2015 Type of Data and objectives of the analysis Paired sample data (Bivariate data) Determine whether there is an association between two variables This