IEMS 441 Social Network Analysis Term Paper Multiple Testing Multi-theoretical, Multi-level Hypotheses
|
|
- Veronica Lloyd
- 7 years ago
- Views:
Transcription
1 IEMS 441 Social Network Analysis Term Paper Multiple Testing Multi-theoretical, Multi-level Hypotheses Jiangtao Gou Department of Statistics, Northwestern University Instructor: Prof. Noshir Contractor March 13, Introduction Exponential random graph model has been carefully reviewed in [11] and [13], and the corresponding theory of testing multi-theoretical multilevel hypotheses has been carefully discussed in [2] and [9], and a practical R package statnet has been released [4][5]. In this term paper, I consider a set of statistical inferences simultaneously. Exponential random graph (p*) statistical models (ERGM) nested variables at various levels which are simultaneously estimated. But if I want to test several hypotheses together under a preset significant level, then multiple testing problems occur. Under the significant level (type-i error) 5%, I need to apply multiple testing procedures to guarantee that the family-wise error rate (FWER). If I test the two hypotheses separately, the FWER may not be controlled by 5% which I set previously. In most of research papers, authors did not consider all hypotheses simultaneously. Consequently, if they drew conclusions simultaneously, the type-i error might not be controlled, in other words, they would face larger risk to the false positives than they originally expected. Some researchers applied multiple testing procedures, for example, Baker and Faulkner (1991) applied Bonferroni multiple testing procedure when they studied the network in the Hollywood film industry [1], although they did not specifically mention that they actually had used Bon- 1
2 ferroni multiple testing procedure. While, classic Bonferroni multiple testing procedure is very conservative [14]. It is safe, but sometimes lack of power, say, lack of the ability to discover the true positive. In 1980 s, a lot of multiple testing procedures were constructed. Here, I introduce Hochberg procedure and Hommel procedure into social network area. Hochberg procedure is slightly less powerful than Hommel procedure, but easier to apply. In Hochberg procedure [7], at first, all p-values are ordered p (1) p (2) p (n), and the corresponding hypotheses are H (1), H (2),, H (n), then in step 1, compare p (n) with significant level α, if p (n) α, reject all hypotheses then stop, otherwise go to the next step. In step 2, compare p (n 1) with significant level α/2, if p (n 1) α/2, reject all hypotheses except H (n) then stop, otherwise go to the next step. Similarly, in step j, compare p (n i+1) with significant level α/i, if p (n i+1) α/i, reject all hypotheses from H (1) to H (n i+1) then stop, otherwise go to the next step until comparing with p (1) with significant level α/n. In Hommel procedure [8], p-values and hypotheses are ordered, then in step 1, compare p (n) with significant level α, if p (n) α, reject all hypotheses then stop, otherwise go to the next step. In step j, if p (n i+1) > α(j i + 1)/j for i = 1,, j, then accept H (n j+1), go to the next stop, otherwise reject all H (i) where p (i) α/(j 1) [3]. 2 Examples In this section, I give three examples to demonstrate how to apply multiple hypothesis procedures. 2.1 Florentine Family Marriage and Business Ties Data This is a data set of marriage and business ties among Renaissance Florentine families [10]. The two relations are business ties and marriage alliances. Vertex information includes wealth, the number of seats on the civic council, and the total number ties [5]. There are 16 vertices, with 20 links in marriage network and 15 links in business network. Both net- 2
3 works are symmetric. Figure 1: Florentine family marriage (left) and business (right) network plot with vertex size proportional to wealth [4] The model is flomarriage edges + nodecov("wealth") + nodecov("priorates") + nodecov("totalties") + edgecov(flobusiness) Table 1: MLE estimates with p-values Estimate SE p-value edges nodecov.wealth nodecov.priorates nodecov.totalties edgecov.flobusiness There are five p-values, I order them from the largest to the smallest When setting the significant level α 5%, if I apply the least significant (LS) procedure, then I directly compare all p-values with Since p (3), 3
4 Table 2: Ordered p-values p (5) nodecov.totalties p (4) nodecov.wealth p (3) nodecov.priorates p (2) edgecov.flobusiness p (1) edges p (2) and p (1) are less than 5%, I may conclude that priorates, flobusiness and edges are significant. If I apply the Hommel procedure, it is a step-up procedure, I have (1) p (5) = > 0.05 = α, go to the next step, (2) p (4) = > = α/2, go to the next step, (3) p (3) = < = 2α/3, stop, reject all p (i) s which are less than α/2 = 0.025, say, p (3), p (2) and p (1). I may conclude that priorates, flobusiness and edges are significant. If I apply the Hochberg procedure, it is a step-up procedure, I have (1) p (5) = > 0.05 = α, go to the next step, (2) p (4) = > = α/2, go to the next step, (3) p (3) = > = α/3, go to the next step, (4) p (2) = < = α/4, stop, reject p (2) and p (1). I may conclude that flobusiness and edges are significant. If I apply Bonferroni procedure, which is the most conservative procedure, then I directly compare all p-values with α/5 = Since p (2) and p (1) are less than 1%, I may conclude that flobusiness and edges are significant. Table 3: Testing Results: NS (not significant) S (significant) LS Hommel Hochberg Bonferroni p (5) nodecov.totalties NS NS NS NS p (4) nodecov.wealth NS NS NS NS p (3) nodecov.priorates S S NS NS p (2) edgecov.flobusiness S S S S p (1) edges S S S S 4
5 Bonferroni procedure is more conservative than Hochberg procedure, Hochberg procedure is more conservative than Hommel procedure, Hommel procedure is more conservative than LS procedure. Boferroni, Hochberg and Hommel procedures can control the familywise error rate (FWER) under given significant level α, but LS procedure can not. 2.2 Longitudinal networks of positive affection within a monastery In this section I consider a directed network as a case in point. In this data set, Sampson recorded the social interactions among a group of monks while resident as an experimenter on vision, and collected numerous sociometric rankings [5][12]. The whole data set includes three phases, I only use the data in phase 3, called samplk3 in R. There are 18 vertices with 56 (directed) edges. The model is flomarriage samplk3 edges + mutual + gwesp(0.2, fixed = T) The estimations are shown in Table 4. Table 4: MLE estimates with p-values Estimate SE p-value edges mutual gwesp.fixed When applying significant level 5%, the testing results are 2.3 Goodreau s Faux Mesa High School This data set shows a simulation of an in-school friendship network, which is based in the rural western US, with a student body that is largely Hispanic and Native American [5]. This is a 205-vertex network with 203 connections. There are 99 female students and 106 male students. 5
6 Figure 2: Longitudinal networks of positive affection within a monastery, phase three Table 5: Testing Results: NS (not significant) S (significant) LS Hommel Hochberg Bonferroni p (3) gwesp.fixed.0.2 S S S NS p (2) mutual S S S S p (1) edges S S S S 6
7 Grade 7 Grade 8 Grade 9 Grade 10 Grade 11 Grade 12 Figure 3: Goodreau s Faux Mesa High School network plot 7
8 The model is mesa edges + nodematch("grade", diff = T) + nodematch("race", diff = T) The estimation results are shown in Table 6 Table 6: MLE estimates with p-values Estimate SE p-value edges < nodematch.grade < nodematch.grade < nodematch.grade < nodematch.grade < nodematch.grade < nodematch.grade < nodematch.race.black -Inf NA NA nodematch.race.hisp nodematch.race.natam < nodematch.race.other -Inf NA NA nodematch.race.white Some coefficients can not be estimated, because there are too few students in these categories. When applying significant level 5%, the testing results are If I apply the least significant (LS) procedure, then I directly compare all p-values with I may conclude that all except nodematch.race.hisp are significant. If I apply the Hommel procedure, it is a step-up procedure, I have (1) p (10) = > 0.05 = α, go to the next step, (2) p (9) = < = α/2, stop, reject all p (i) s which are less than α = I may conclude that all except nodematch.race.hisp are significant. If I apply the Hochberg procedure, it is a step-up procedure, I have (1) p (10) = > 0.05 = α, go to the next step, (2) p (9) = < = 8
9 Table 7: Testing Results: NS (not significant) S (significant) LS Hommel Hochberg Bonferroni edges S S S S nodematch.grade.7 S S S S nodematch.grade.8 S S S S nodematch.grade.9 S S S S nodematch.grade.10 S S S S nodematch.grade.11 S S S S nodematch.grade.12 S S S S nodematch.race.black nodematch.race.hisp NS NS NS NS nodematch.race.natam S S S S nodematch.race.other nodematch.race.white S S S NS α/2, stop, reject all p-values between p (9) and p (1). I may conclude that all except nodematch.race.hisp and nodematch.race.white are significant. If I apply Bonferroni procedure, which is the most conservative procedure, then I directly compare all p-values with α/5 = Since p (2) and p (1) are less than 1%, I may conclude that all except nodematch.race.hisp are significant. 3 Future Work In this paper, I only consider the uncorrelated hypotheses multiple testing at first (though this assumption about the uncorrelated hypotheses may not be true in the case of network analysis). The next step is to consider the correlated hypotheses multiple testing. I think how to estimate the correlation between different hypotheses could be a problem, but I can assume a relatively safe correlation (if the null hypotheses are favored, we can assume a relatively small correlation) to form the multiple testing procedure. After some correlation corrections, multiple testing 9
10 procedures, like Hommel procedure, Hochberg procedure, can be applied into multiple testing in network hypotheses. References [1] Wayne E. Baker, Robert R. Faulkner, Role as Resource in the Hollywood Film Industy. The American Journal of Sociology, 97, 2, p [2] N. S. Contractor, S. Wasserman, K. Faust, Testing Multitheoretical Multilevel Hypotheses About Organizational Networks: An Analytic Framework and Emprical Example. Academy of Management Review, 31, 3, p [3] Alex Dmitrienko, Ajit C. Tamhane, Frank Bretz, Multiple Testing Problems in Pharmaceutical Statistics. Chapman and Hall/CRC Press. [4] Steven M. Goodreau, joint with the rest of the Statnet Development Team, Introduction to Exponetial-family Random Graph (ERG or p*) modeling with statnet. INSNA Sunbelt - St. Pete Beach, Florida. [5] Mark S. Handcock, David R. Hunter, Carter T. Butts, Steven M. Goodreau, and Martina Morris, Software Tools for the Statistical Modeling of Network Data. Version 2.6. Project home page at http: //statnet.org, URL [6] P. D. Hoff, A. E. Raftery, M. S. Handcock, Latent space approaches to social network analysis, Journal of the American Statistical Association 97, 1090C1098. [7] Yosef Hochberg, A sharper Bonferroni procedure for multiple tests of significance. Biometrika, 75, 4, p [8] G. Hommel, A Stagewise Rejective Multiple Test Procedure Based on a Modified Bonferroni Test. Biometrika, 75, 2, p [9] P. R. Monge, N. S. Contractor, Theories of communication networks. New York: Oxford University Press. 10
11 [10] John F. Padgett, Marriage and Elite Structure in Renaissance Florence, Paper delivered to the Social Science History Association. [11] G. Robins, P. Pattison, Y. Kalish, D. Lusher, Introduction to exponential random graph (p*) models for social networks. Social Networks, 29, 2, p [12] S. F. Sampson, A novitiate in a period of change: An experimental and case study of relationships, Unpublished Ph.D. dissertation, Department of Sociology, Cornell University. [13] M. Shumate, E. T. Palazzolo, Exponential Random Graph (p*) Models as a Method for Social Network Analysis in Communication Research. Communication Methods and Measures, 4, 4, p [14] A. C. Tamhane, Statistical Analysis of Designed Experiments. John Wiley and Sons, Inc. 11
Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses
Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the
More informationStatistical Analysis of Social Networks
Statistical Analysis of Social Networks Krista J. Gile University of Massachusetts, Amherst Octover 24, 2013 Collaborators: Social Network Analysis [1] Isabelle Beaudry, UMass Amherst Elena Erosheva, University
More informationPermutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
More informationHISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy
SOCIAL NETWORK ANALYSIS University of Amsterdam, Netherlands Keywords: Social networks, structuralism, cohesion, brokerage, stratification, network analysis, methods, graph theory, statistical models Contents
More informationPackage ERP. December 14, 2015
Type Package Package ERP December 14, 2015 Title Significance Analysis of Event-Related Potentials Data Version 1.1 Date 2015-12-11 Author David Causeur (Agrocampus, Rennes, France) and Ching-Fan Sheu
More informationExperimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test
Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely
More informationPackage dunn.test. January 6, 2016
Version 1.3.2 Date 2016-01-06 Package dunn.test January 6, 2016 Title Dunn's Test of Multiple Comparisons Using Rank Sums Author Alexis Dinno Maintainer Alexis Dinno
More informationTwo-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
More informationPearson's Correlation Tests
Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation
More informationTwo-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationAn Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS
The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice
More informationHypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
More informationRethinking the Cultural Context of Schooling Decisions in Disadvantaged Neighborhoods: From Deviant Subculture to Cultural Heterogeneity
Rethinking the Cultural Context of Schooling Decisions in Disadvantaged Neighborhoods: From Deviant Subculture to Cultural Heterogeneity Sociology of Education David J. Harding, University of Michigan
More informationTests for Two Proportions
Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics
More informationNon-Inferiority Tests for Two Means using Differences
Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationNon-Inferiority Tests for One Mean
Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random
More information1 Why is multiple testing a problem?
Spring 2008 - Stat C141/ Bioeng C141 - Statistics for Bioinformatics Course Website: http://www.stat.berkeley.edu/users/hhuang/141c-2008.html Section Website: http://www.stat.berkeley.edu/users/mgoldman
More informationStatistical issues in the analysis of microarray data
Statistical issues in the analysis of microarray data Daniel Gerhard Institute of Biostatistics Leibniz University of Hannover ESNATS Summerschool, Zermatt D. Gerhard (LUH) Analysis of microarray data
More informationAssociation Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
More informationBA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420
BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test
More informationSection 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)
Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis
More informationIntroduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
More informationGeneral Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.
General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n
More information3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
More informationTests for Two Survival Curves Using Cox s Proportional Hazards Model
Chapter 730 Tests for Two Survival Curves Using Cox s Proportional Hazards Model Introduction A clinical trial is often employed to test the equality of survival distributions of two treatment groups.
More informationHYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
More informationSection 13, Part 1 ANOVA. Analysis Of Variance
Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationMultiple-Comparison Procedures
Multiple-Comparison Procedures References A good review of many methods for both parametric and nonparametric multiple comparisons, planned and unplanned, and with some discussion of the philosophical
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationInference for two Population Means
Inference for two Population Means Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison October 27 November 1, 2011 Two Population Means 1 / 65 Case Study Case Study Example
More informationFalse Discovery Rates
False Discovery Rates John D. Storey Princeton University, Princeton, USA January 2010 Multiple Hypothesis Testing In hypothesis testing, statistical significance is typically based on calculations involving
More informationStatistics 2014 Scoring Guidelines
AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home
More informationClass 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
More informationVirtually There: Exploring Proximity and Homophily in a Virtual World
Virtually There: Exploring Proximity and Homophily in a Virtual World Yun Huang Dept. of Management Science and Industrial Engineering Northwestern University Evanston, IL U.S.A. yun@northwestern.edu Cuihua
More informationSample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
More informationExponential Random Graph Models for Social Network Analysis. Danny Wyatt 590AI March 6, 2009
Exponential Random Graph Models for Social Network Analysis Danny Wyatt 590AI March 6, 2009 Traditional Social Network Analysis Covered by Eytan Traditional SNA uses descriptive statistics Path lengths
More informationBA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394
BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 1. Does vigorous exercise affect concentration? In general, the time needed for people to complete
More informationISyE 2028 Basic Statistical Methods - Fall 2015 Bonus Project: Big Data Analytics Final Report: Time spent on social media
ISyE 2028 Basic Statistical Methods - Fall 2015 Bonus Project: Big Data Analytics Final Report: Time spent on social media Abstract: The growth of social media is astounding and part of that success was
More informationStatistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl
Dept of Information Science j.nerbonne@rug.nl October 1, 2010 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic
More informationLOGISTIC REGRESSION ANALYSIS
LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic
More informationChapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:
Chapter 7 Notes - Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a
More informationHow To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
More informationNon-Parametric Tests (I)
Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationFallback tests for co-primary endpoints
Research Article Received 16 April 2014, Accepted 27 January 2016 Published online 25 February 2016 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.6911 Fallback tests for co-primary
More informationVariables Control Charts
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables
More informationChapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing
Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing
More informationAP STATISTICS (Warm-Up Exercises)
AP STATISTICS (Warm-Up Exercises) 1. Describe the distribution of ages in a city: 2. Graph a box plot on your calculator for the following test scores: {90, 80, 96, 54, 80, 95, 100, 75, 87, 62, 65, 85,
More informationName: Date: Use the following to answer questions 3-4:
Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationSampling Biases in IP Topology Measurements
Sampling Biases in IP Topology Measurements Anukool Lakhina with John Byers, Mark Crovella and Peng Xie Department of Boston University Discovering the Internet topology Goal: Discover the Internet Router
More informationImpact of Skewness on Statistical Power
Modern Applied Science; Vol. 7, No. 8; 013 ISSN 1913-1844 E-ISSN 1913-185 Published by Canadian Center of Science and Education Impact of Skewness on Statistical Power Ötüken Senger 1 1 Kafkas University,
More informationNon-Inferiority Tests for Two Proportions
Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which
More informationHYPOTHESIS TESTING WITH SPSS:
HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER
More informationCorrelational Research
Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.
More informationStatistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013
Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives
More informationNovember 08, 2010. 155S8.6_3 Testing a Claim About a Standard Deviation or Variance
Chapter 8 Hypothesis Testing 8 1 Review and Preview 8 2 Basics of Hypothesis Testing 8 3 Testing a Claim about a Proportion 8 4 Testing a Claim About a Mean: σ Known 8 5 Testing a Claim About a Mean: σ
More informationMind on Statistics. Chapter 13
Mind on Statistics Chapter 13 Sections 13.1-13.2 1. Which statement is not true about hypothesis tests? A. Hypothesis tests are only valid when the sample is representative of the population for the question
More informationTests for One Proportion
Chapter 100 Tests for One Proportion Introduction The One-Sample Proportion Test is used to assess whether a population proportion (P1) is significantly different from a hypothesized value (P0). This is
More informationClocking In Facebook Hours. A Statistics Project on Who Uses Facebook More Middle School or High School?
Clocking In Facebook Hours A Statistics Project on Who Uses Facebook More Middle School or High School? Mira Mehta and Joanne Chiao May 28 th, 2010 Introduction With Today s technology, adolescents no
More informationInterpretation of Somers D under four simple models
Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms
More informationDifference of Means and ANOVA Problems
Difference of Means and Problems Dr. Tom Ilvento FREC 408 Accounting Firm Study An accounting firm specializes in auditing the financial records of large firm It is interested in evaluating its fee structure,particularly
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Sample Practice problems - chapter 12-1 and 2 proportions for inference - Z Distributions Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide
More informationQUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS
QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.
More informationFrom Reads to Differentially Expressed Genes. The statistics of differential gene expression analysis using RNA-seq data
From Reads to Differentially Expressed Genes The statistics of differential gene expression analysis using RNA-seq data experimental design data collection modeling statistical testing biological heterogeneity
More informationRedwood Building, Room T204, Stanford University School of Medicine, Stanford, CA 94305-5405.
W hittemoretxt050806.tex A Bayesian False Discovery Rate for Multiple Testing Alice S. Whittemore Department of Health Research and Policy Stanford University School of Medicine Correspondence Address:
More informationError Type, Power, Assumptions. Parametric Tests. Parametric vs. Nonparametric Tests
Error Type, Power, Assumptions Parametric vs. Nonparametric tests Type-I & -II Error Power Revisited Meeting the Normality Assumption - Outliers, Winsorizing, Trimming - Data Transformation 1 Parametric
More informationCOMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
More informationHypothesis Testing --- One Mean
Hypothesis Testing --- One Mean A hypothesis is simply a statement that something is true. Typically, there are two hypotheses in a hypothesis test: the null, and the alternative. Null Hypothesis The hypothesis
More informationNCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
More informationC. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.
Sample Multiple Choice Questions for the material since Midterm 2. Sample questions from Midterms and 2 are also representative of questions that may appear on the final exam.. A randomly selected sample
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
More informationCancer Biostatistics Workshop Science of Doing Science - Biostatistics
Cancer Biostatistics Workshop Science of Doing Science - Biostatistics Yu Shyr, PhD Jan. 18, 2008 Cancer Biostatistics Center Vanderbilt-Ingram Cancer Center Yu.Shyr@vanderbilt.edu Aims Cancer Biostatistics
More informationBivariate Statistics Session 2: Measuring Associations Chi-Square Test
Bivariate Statistics Session 2: Measuring Associations Chi-Square Test Features Of The Chi-Square Statistic The chi-square test is non-parametric. That is, it makes no assumptions about the distribution
More informationImputation of missing network data: Some simple procedures
Imputation of missing network data: Some simple procedures Mark Huisman Dept. of Psychology University of Groningen Abstract Analysis of social network data is often hampered by non-response and missing
More informationTHE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.
THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM
More informationTesting Multiple Secondary Endpoints in Confirmatory Comparative Studies -- A Regulatory Perspective
Testing Multiple Secondary Endpoints in Confirmatory Comparative Studies -- A Regulatory Perspective Lilly Yue, Ph.D. Chief, Cardiovascular and Ophthalmic Devices Branch Division of Biostatistics CDRH/FDA
More informationStat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015
Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation
More informationUncertainty quantification for the family-wise error rate in multivariate copula models
Uncertainty quantification for the family-wise error rate in multivariate copula models Thorsten Dickhaus (joint work with Taras Bodnar, Jakob Gierl and Jens Stange) University of Bremen Institute for
More informationNonparametric Two-Sample Tests. Nonparametric Tests. Sign Test
Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric
More informationTesting Hypotheses About Proportions
Chapter 11 Testing Hypotheses About Proportions Hypothesis testing method: uses data from a sample to judge whether or not a statement about a population may be true. Steps in Any Hypothesis Test 1. Determine
More informationUsing Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
More informationPrinciples of Hypothesis Testing for Public Health
Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions
More informationMinería de Datos ANALISIS DE UN SET DE DATOS.! Visualization Techniques! Combined Graph! Charts and Pies! Search for specific functions
Minería de Datos ANALISIS DE UN SET DE DATOS! Visualization Techniques! Combined Graph! Charts and Pies! Search for specific functions Data Mining on the DAG ü When working with large datasets, annotation
More informationSCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
More informationStudents' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)
Cairo University Faculty of Economics and Political Science Statistics Department English Section Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study) Prepared
More informationAnalysis of Variance ANOVA
Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.
More informationWHAT IS A JOURNAL CLUB?
WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club
More informationEvaluating the Effect of Teacher Degree Level on Educational Performance Dan D. Goldhaber Dominic J. Brewer
Evaluating the Effect of Teacher Degree Level Evaluating the Effect of Teacher Degree Level on Educational Performance Dan D. Goldhaber Dominic J. Brewer About the Authors Dr. Dan D. Goldhaber is a Research
More informationMaster s Thesis. PERFORMANCE OF BETA-BINOMIAL SGoF MULTITESTING METHOD UNDER DEPENDENCE: A SIMULATION STUDY
Master s Thesis PERFORMANCE OF BETA-BINOMIAL SGoF MULTITESTING METHOD UNDER DEPENDENCE: A SIMULATION STUDY AUTHOR: Irene Castro Conde DIRECTOR: Jacobo de Uña Álvarez Master in Statistical Techniques University
More informationNonparametric Statistics
Nonparametric Statistics J. Lozano University of Goettingen Department of Genetic Epidemiology Interdisciplinary PhD Program in Applied Statistics & Empirical Methods Graduate Seminar in Applied Statistics
More informationChapter 4: Statistical Hypothesis Testing
Chapter 4: Statistical Hypothesis Testing Christophe Hurlin November 20, 2015 Christophe Hurlin () Advanced Econometrics - Master ESA November 20, 2015 1 / 225 Section 1 Introduction Christophe Hurlin
More informationComparing Means in Two Populations
Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we
More information