The purpose of Statistics is to ANSWER QUESTIONS USING DATA Know more about your data and you can choose what statistical method...

Size: px
Start display at page:

Download "The purpose of Statistics is to ANSWER QUESTIONS USING DATA Know more about your data and you can choose what statistical method..."

Transcription

1 The purpose of Statistics is to ANSWER QUESTIONS USING DATA Know the type of question and you can choose what type of statistics... Aim: DESCRIBE Type of question: What's going on? Examples: How many chapters do novels have? What possibilities are there for body temperature after a meal with or without chilli? What sort of relationship might the amount of sleep a student gets have with their grades? What sorts of things might be related to whether a person does volunteer work? Type of Statistics: Descriptive statistics: graphs and basic numbers Aim: DECIDE Type of question: Yes or no? Examples: Is the median number of chapters in a novel 20? Is your body temperature higher after a meal if it has chilli in it? Does getting more sleep affect a students grades? Are women more likely to participate in volunteer work than men? Type of Statistics: Hypothesis tests (p-values) Aim: ESTIMATE Type of question: What's this number? Examples: What is the median number of chapters in a novel? How much higher is your body temperature after a chilli meal compared to one without? On average, how much of an effect does 30 minutes more sleep have on a students grades? How much more (or less) likely is a woman to participate in volunteer work than a man? Type of Statistics: Confidence intervals Aim: PREDICT / EXPLAIN Type of question: What's the formula? Examples: How can I explain a person s body temperature after a meal using their temperature before and the chilli content of the meal? How can I calculate a student s grade based on their number of hours of sleep during semester? How can I use a person s gender, age, income and religion to predict their chances of participating in volunteer work? Type of Statistics: Modelling and regression The purpose of Statistics is to ANSWER QUESTIONS USING DATA Know more about your data and you can choose what statistical method... HOW THE DATA IS COLLECTED what is done to the subjects? when is information recorded? how are the subjects chosen? HOW MUCH DATA lots of things recorded per subject? lots of subjects? missing data? VARIABLES IN THE DATA how to measure? what type? defining groups or measurements? what distribution? By Dr David Butler 2012 The University of Adelaide 1

2 DATA ENTRY gender = M 1 age = 18 gender = M 2 age = 25 gender = F 3 age = 19 chilli = Y temp = 37 chilli = N temp = 36 chilli = Y temp = 38 BECOMES... gender age chilli temp 1 M 18 Y 37 2 M 25 N 36 3 F 19 Y 38 4 F 21 N 35 Statisticians say: "PLEASE make it consistent!" TYPES OF VARIABLES (things you record) Numerical / Quantitative / Scale (numbers: how far apart has meaning) o Continuous (measured) o Discrete (counted) Categorical / Qualitative (words: how far apart has no meaning) o Nominal (names: more or less has no meaning) o Ordinal (ordered: more or less has meaning) DISTRIBUTIONS OF VARIABLES (how the possible values are spread out) Approximately normal parametric tests will be fine Skewed or worse non-parametric tests might be better WHAT EXPLANATORY VARIABLES DEFINE: Independent Groups chilli = Y temp = 38 chilli = N 1 chilli = Y 2 3 temp = 36 chilli = N 4 BECOMES... chilli temp 1 Y 38 2 Y 36 3 N 37 4 N 36 temp = 37 temp = 36 OR (chilli = Y) temp = 38 1 (chilli = Y) temp = 36 2 (chilli = Y) temp (chilli = N) temp Repeated Measures (matched pairs) (chilli = N) temp = 37 (chilli = Y) temp = 37 3 (chilli = N) temp = 37 (chilli = Y) temp = 37 4 BECOMES (chilli = N) temp = 36 (chilli = N) temp = 35 By Dr David Butler 2012 The University of Adelaide 2

3 HOW HYPOTHESIS TESTING WORKS A hypothesis test is designed to DECIDE the answer to a YES OR NO question using DATA. This is how to do a hypothesis test: Have a yes-or-no question. Collect data. Calculate a test statistic. Figure out the distribution if you assume a particular answer. Calculate a p-value. Decide the answer based on the p-value. This is what a hypothesis test means: It tells you if your data is likely or unlikely given a particular situation (the null hypothesis ). A low p-value means your data is unlikely and you don t believe you re in that situation. A high p-value means your data is likely and you do believe you could be in that situation. HOW CONFIDENCE INTERVALS WORK A confidence interval is designed to give a RANGE of possible answers for a WHAT S THE NUMBER? question, using DATA from a sample. This is how to find a confidence interval: Have a what s the number? question. Collect data. Choose a matching hypothesis test. Work backwards to calculate two ends. The confidence interval is between these two values. This is what a confidence interval means: The values in the CI would be retained with a matching hypothesis test. The values in the CI have a high chance of producing data like yours. The values in the CI are those you are happy to believe based on your data. By Dr David Butler 2012 The University of Adelaide 3

4 HOW REGRESSION WORKS Regression is a method designed to create a FORMULA that uses some information to PREDICT/EXPLAIN an outcome, using DATA. This is how to perform regression: Have a what s the formula? question. Collect data. Look at the pattern usually with a scatterplot to choose a formula. Get a computer to calculate the numbers and p-values. Check the p-values. Choose your final formula. This is what regression means: It tells you a formula for how an outcome varies based on other information. It does NOT tell you if some things CAUSE others, only how to calculate them as accurately as possible. The computer output will tell you p-values and confidence intervals to answer other types of questions. More details: DESCRIBING A RELATIONSHIP: o Scatterplot describes relationship and helps choose a good formula o Correlation coefficient (r) measures how strong a linear relationship is. Ranges from -1 (perfect negative) to 0 (no relationship) to 1 (perfect positive). Ignores how steep the slope is, only says how close to a line. FINDING AND INTERPRETING THE FORMULA: o Computer program will use the data to find the numbers that make the formula fit best. o The coefficient says how much the outcome changes (on average) for a change of 1 in the explanatory variable. LOOKING AT P-VALUES: o The p-value that goes with the F-statistic in the ANOVA table tells you whether all the variables at once have a relationship with the outcome. Low p-value means the relationship is significant. o The p-value for each coefficient tells you whether that explanatory variable appears to have a relationship with the outcome. Low p-value means the effect is significant. LOOKING AT CONFIDENCE INTERVALS: o The confidence interval that goes with an explanatory variable tells you how large or small the real effect could be. NOTE: Regression has assumptions that must be checked in order to use it properly, especially if you plan to use the p-values and confidence intervals. By Dr David Butler 2012 The University of Adelaide 4

5 Turning a research question into a statistical question. ORIGINAL QUESTION: Concept Concept Concept ABOUT ONE CONCEPT ABOUT RELATIONSHIPS BETWEEN CONCEPTS TYPE OF QUESTION: DESCRIBE what s going on? DECIDE yes or no? ESTIMATE what s this number? PREDICT/EXPLAIN what s the formula? TYPES OF VARIABLES: Concept BECOMES... OR WHAT EXPLANATORY VARIABLES DEFINE: Independent Groups OR Repeated Measures (matched pairs) DISTRIBUTION OF OUTCOME VARIABLE: OR OR Note: This probably doesn t matter if you have a lot of data. STATISTICAL QUESTION: eg: DESCRIBE eg: DECIDE Independent Groups Note: In the list below, the outcome variables are usually assumed to be normal. By Dr David Butler 2012 The University of Adelaide 5

6 Statistical methods for statistical questions DESCRIBE: Numbers: Mean & standard deviation ( Graphs: Histogram / Boxplot. DECIDE: Is the mean equal to #? one sample t-test. median & IQR) Is the median equal to #? sign test. ESTIMATE: What is the mean? confidence interval for a mean. DESCRIBE: Numbers: Table of percentages or proportions. Graphs: Histogram. DECIDE: Is this percentage equal to #? z-test for a single proportion. Are percentages distributed according to #, #, #? chi-squared test for goodness of fit. ESTIMATE: What is this percentage? confidence interval for a proportion. (2 categories) Independent Groups DESCRIBE: Numbers: Means & standard deviations for each group ( medians & IQRs for each category). Graphs: Histograms on same scale / side-by-side boxplots. DECIDE: Are the means equal? unpaired t-test ( Mann- Whitney U-test or Wilcoxon rank-sum test). ESTIMATE: What is the difference between the means? confidence interval for the difference in means. (2 categories) Repeated Measures DESCRIBE: Numbers: Mean & standard deviation of differences between measurements. Graphs: Histogram of the differences between measurements. DECIDE: Is there a mean difference? paired t-test ( Wilcoxon signed ranks test). ESTIMATE: What is the mean difference? confidence interval for the mean difference. (any# categories) Independent Groups DESCRIBE: Numbers: Mean & standard deviation of each group. Graphs: Histograms/boxplots on the same scale. Line graph showing mean of each group. DECIDE: Are the means equal? one-way analysis of variance ANOVA with post-hoc t-tests ( Kruskal-Wallis test). ESTIMATE: What are the differences between means? confidence intervals for each difference in means. By Dr David Butler 2012 The University of Adelaide 6

7 Statistical methods for statistical questions (any# categories) Repeated Measures DESCRIBE: Graphs: Line graph for each subject showing changing value of variable. DECIDE: On average, does the value change for each person across categories? repeated measures ANOVA with post-hoc paired t-tests / mixed effects regression. ESTIMATE: What are the mean differences between categories? confidence intervals for mean differences. (2 categories) Independent Groups (2 categories) DESCRIBE: Numbers: Two-way table of counts or %s. Odds ratios. Graphs: Histogram for each explanatory category. DECIDE: Is the outcome just as likely for both explanatory categories?, Are the two variables associated? chisquared test for independence (small amount of data: Fisher s exact test). ESTIMATE: How much more likely is the outcome in this category? confidence interval for difference in proportions, confidence interval for odds ratio. (2 categories) Repeated Measures (2 categories) DESCRIBE: Numbers: Two-way table of counts or %s. Graphs: Histogram for each explanatory category. DECIDE: Is the outcome just as likely for both explanatory categories? McNemar s test. ESTIMATE: How much more likely is the outcome in one category compared to the other? confidence interval for difference in proportions. (any# categories) Independent Groups (any# categories) DESCRIBE: Numbers: Two-way table of counts or %. Graphs: Histogram for each explanatory category. DECIDE: Do the percentages in the outcome change across the explanatory categories?, Are the two variables associated? chi-squared test for independence. (any# categories) Repeated Measures (2 categories) DESCRIBE: Numbers: Two-way table of counts or %. Graphs: Histogram for each explanatory category. DECIDE: Do the percentages in the outcome change across the explanatory categories?, Are the two variables associated? Cochrane s Q-test. By Dr David Butler 2012 The University of Adelaide 7

8 Statistical methods for statistical questions DESCRIBE: Numbers: Correlation coefficient (R) Graphs: Scatterplot. DECIDE: Does a relationship exist? linear regression: t-test on coefficient. ESTIMATE: How much does the output variable change when the explanatory variable changes? linear regression: confidence interval for slope. PREDICT: How can you calculate the output knowing the explanatory variable? linear regression formula: y = β 0 + β 1 x. NOTE: May need to do a nonlinear regression if the scatterplot indicates a curved sort of relationship. (2 categories) DESCRIBE: Numbers: Mean & standard deviation for each category of the outcome. Graphs: Histograms/boxplots on the same scale. DECIDE: Does the numerical variable have an effect on the chances of the outcome? unpaired t-test using the outcome to define the two groups. ESTIMATE: How much does a change in the numerical variable affect the chances of the outcome? logistic regression: confidence interval for odds ratio. PREDICT: How can you calculate the chances of the outcome knowing the value of the explanatory variable? logistic regression formula: log(odds of y) = β 0 + β 1 x. (any# categories) Independent Groups Time to event Possible missing data! DESCRIBE: Numbers: Proportion reaching event at certain time (eg 5- year survival), median times to reach event. Graphs: Kaplan-Meier curve showing survival percentages. DECIDE: Is the time to reach the event the same in all groups? survival analysis: log-rank test. ESTIMATE: What is the difference in proportions reaching the end point at this particular time? confidence interval for the difference in proportions. How much more at risk of the event is this group than this group? Cox regression: confidence interval for relative hazard. By Dr David Butler 2012 The University of Adelaide 8

9 Statistical methods for statistical questions DESCRIBE: Graphs: Scatterplot for each explanatory variable with the outcome variable. Numbers: multiple linear regression: R 2 value DECIDE: Does a relationship exist with any of the variables at all? multiple linear regression: F-test. Does a relationship exist with this varable, taking into account the others? multiple linear regression: t-test on one coefficient. ESTIMATE: How much does the output variable change when this explanatory variable changes? multiple linear regression: confidence interval for one slope. PREDICT: How can you calculate the output knowing the explanatory variables? multiple linear regression formula: y = β 0 + β 1 x 1 + β 2 x 2. NOTE: This can be done for many explanatory variables. (any# categories) Independent Groups DESCRIBE: Graphs: Scatterplot of both numerical variables for each category. Numbers: multiple regression: R 2 value DECIDE: See above for multiple regression. ESTIMATE: See above for multiple regression. PREDICT: See above for multiple regression. NOTE: This can be done for many explanatory variables of both types. (any# categories) Independent Groups (any# categories) Independent Groups DESCRIBE: Graphs: Histogram for each combination of explanatory categories. Line graph showing mean of each group. DECIDE: Does a relationship exist with any of the variables at all? two-way ANOVA: F-test. Does a relationship exist with this varable, taking into account the others? two-way ANOVA: F-test for one effect. Note: both can also answered with multiple regression (see above). PREDICT: How can you calculate the output knowing the explanatory variables? multiple linear regression formula: y = β 0 + β 1 x 1 + β 2 x 2. By Dr David Butler 2012 The University of Adelaide 9

10 (any# categories) Independent Groups (any# categories) Independent Groups (any# categories) Repeated Measures (2 categories) DESCRIBE: Graphs: Histogram for each combination of explanatory categories. DECIDE: Does a relationship exist with any of the variables at all? multiple logistic regression: chi-squared test for covariates. Does a relationship exist with this varable, taking into account the others? multiple logistic regression: Wald test. ESTIMATE: How much does the chance of the outcome change when this explanatory variable changes? multiple logistic regression: confidence interval for odds ratio. PREDICT: How can you calculate the chances of the outcome knowing the explanatory variables? multiple logistic regression formula: log(odds of y) = β 0 + β 1 x 1 + β 2 x 2. NOTE: This can be done with many explanatory variables even if some of them are numerical. DESCRIBE: Numbers: multiple linear regression: R 2 value DECIDE: Does a relationship exist with any of the variables at all? mixed effects regression: F-test. Does a relationship exist with this varable, taking into account the others? mixed effects linear regression: t- test on one coefficient. ESTIMATE: How much does the output variable change when this explanatory variable changes? mixed effects regression: confidence interval for one coefficient. PREDICT: How can you calculate the output knowing the explanatory variables? mixed effects regression formula. NOTE: mixed effects may also be called random effects. NOTE: This can be done for many explanatory variables, of both types, and with a mixture of repeated-measures and independentgroups DECIDE: Does one variable change the way the other affects the outcome? multiple linear regression: t-test on the interaction effect. ESTIMATE: How much does the second variable change the effect of the first on the outcome? multiple linear regression: confidence interval for the interaction effect. PREDICT: How can you calculate the output knowing the explanatory variables? multiple linear regression formula: y = β 0 + β 1 x 1 + β 2 x 2 + β 12 x 1 x 2. By Dr David Butler 2012 The University of Adelaide 10

11 (any# categories) Independent Groups DESCRIBE: Graphs: Scatterplot for each category, showing line of best fit in each case. DECIDE: Does one variable change the way the other affects the outcome? Analysis of Covariance (ANCOVA) / multiple linear regression: t-test on the interaction effect. ESTIMATE: How much does the second variable change the effect of the first on the outcome? multiple linear regression: confidence interval for the interaction effect. PREDICT: How can you calculate the output knowing the explanatory variables? multiple linear regression formula: y = β 0 + β 1 x 1 + β 2 x 2 + β 12 x 1 x 2. NOTE: This can be done for many explanatory variables of both types. ANCOVA refers specifically to the case where the interaction variable is categorical. NOTE: There are many other methods dealing with more specific and difficult questions including (but definitely not limited to): Does this variable affect the variance of the outcome? F-test for two variances Do these variables affect this categorical outcome (which has several categories)? Multinomial regression Does the data come from a normal distribution? Investigate normal quantile-quantile plot; Shapiro-Wilk test To what degree do these two measuring systems agree? Intraclass correlation coefficient What is the best cut-off for this measurement in order to say someone needs medical attention? ROC analysis Do all these measurements vary together so that they could be considered as measuring some smaller number of underlying concepts? Factor analysis / Principal Component Analysis Can the subjects be grouped into a few similar groups based on the similarity in their measurements? Cluster analysis and so on... By Dr David Butler 2012 The University of Adelaide 11

12 By Dr David Butler 2012 The University of Adelaide 12

13 By Dr David Butler 2012 The University of Adelaide 13

14 By Dr David Butler 2012 The University of Adelaide 14

15 By Dr David Butler 2012 The University of Adelaide 15

16 By Dr David Butler 2012 The University of Adelaide 16

17 By Dr David Butler 2012 The University of Adelaide 17

18 By Dr David Butler 2012 The University of Adelaide 18

19 By Dr David Butler 2012 The University of Adelaide 19

20 By Dr David Butler 2012 The University of Adelaide 20

21 By Dr David Butler 2012 The University of Adelaide 21

22 By Dr David Butler 2012 The University of Adelaide 22

23 By Dr David Butler 2012 The University of Adelaide 23

24 By Dr David Butler 2012 The University of Adelaide 24

25 By Dr David Butler 2012 The University of Adelaide 25

26 By Dr David Butler 2012 The University of Adelaide 26

27 By Dr David Butler 2012 The University of Adelaide 27

28 By Dr David Butler 2012 The University of Adelaide 28

29 SAMPLE SIZE CALCULATIONS FOR HYPOTHESIS TESTS: The following five things affect the sample size you need: 1. Which hypothesis test you plan to use Hypothesis test based on categorical outcomes (as opposed to numerical outcomes) BIGGER sample size Hypothesis test uses independent groups (as opposed to repeated measures) BIGGER sample size 2. Size of the difference you are looking for Most hypothesis tests concern the differences between means or percentages. The difference you would like to see is often called: Clinically significant difference Practically significant difference Choosing how big this difference is requires KNOWLEDGE OF YOUR AREA OF RESEARCH. Looking for a SMALL DIFFERENCE BIGGER sample size 3. Variability of the results HIGH VARIABILITY means many options for what could happen in a sample of a particular size eg: for the CHI-SQUARED TEST very high or very low expected percentage low variability medium expected percentage high variability eg: for t-tests or ANOVA large standard deviation high variability You usually get this information from previous research or a pilot study. HIGH VARIABILITY BIGGER sample size By Dr David Butler 2012 The University of Adelaide 29

30 4. Significance level The cut-off for saying when a p-value is significant. Usually 5%. Also known as α (alpha) or the Type I Error rate. LOW SIGNIFICANCE LEVEL BIGGER sample size 5. Power The probability of getting a significant result if in fact there IS a difference in the population. Usually you set this at 80%. The opposite of Type II Error rate (also known as β (beta)). HIGH POWER BIGGER sample size [ Note that a high dropout rate also increases sample size ] FOR CONFIDENCE INTERVALS: Confidence intervals are related to hypothesis tests, so the considerations above are used for confidence intervals too. NOTE: Significance level = 100% - Confidence Level (so for a 95% confidence interval, the significance level is 5%) NOTE: The difference you are looking for is half the width of the confidence interval. Also known as the margin of error. FOR REGRESSION: Rule of thumb: at least 10 times as many subjects as there are explanatory variables. Proper calculations are based on the t-tests involved to see if slope is significant. X1 X1 Y X2 Y X2 X3 X4 X5 At least 2 10 = 20 At least 5 10 = 50 By Dr David Butler 2012 The University of Adelaide 30

31 SOME TERMINOLOGY: Type I Error: NO difference in the population BUT there IS a difference in the sample (also known as significance level or alpha α) Type II Error: There IS a difference in the population BUT there is NO difference in the sample (also known as beta β, or the opposite of power) PERFORMING THE CALCULATIONS : Russ Lenth s has created a comprehensive suite of online calculators: You need all the information mentioned above in order to use the calculators. There are also simple formulas for the t-tests and chi-squared tests in Chapter 36 of Medical Statistics at a Glance by Aviva Petrie and Caroline Sabin You need all the information mentioned above in order to use the formulas. By Dr David Butler 2012 The University of Adelaide 31

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Come scegliere un test statistico

Come scegliere un test statistico Come scegliere un test statistico Estratto dal Capitolo 37 of Intuitive Biostatistics (ISBN 0-19-508607-4) by Harvey Motulsky. Copyright 1995 by Oxfd University Press Inc. (disponibile in Iinternet) Table

More information

Statistics for Sports Medicine

Statistics for Sports Medicine Statistics for Sports Medicine Suzanne Hecht, MD University of Minnesota (suzanne.hecht@gmail.com) Fellow s Research Conference July 2012: Philadelphia GOALS Try not to bore you to death!! Try to teach

More information

MTH 140 Statistics Videos

MTH 140 Statistics Videos MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative

More information

Biostatistics: Types of Data Analysis

Biostatistics: Types of Data Analysis Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS

More information

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Statistics in Medicine Research Lecture Series CSMC Fall 2014 Catherine Bresee, MS Senior Biostatistician Biostatistics & Bioinformatics Research Institute Statistics in Medicine Research Lecture Series CSMC Fall 2014 Overview Review concept of statistical power

More information

Statistical tests for SPSS

Statistical tests for SPSS Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

The Statistics Tutor s Quick Guide to

The Statistics Tutor s Quick Guide to statstutor community project encouraging academics to share statistics support resources All stcp resources are released under a Creative Commons licence The Statistics Tutor s Quick Guide to Stcp-marshallowen-7

More information

UNIVERSITY OF NAIROBI

UNIVERSITY OF NAIROBI UNIVERSITY OF NAIROBI MASTERS IN PROJECT PLANNING AND MANAGEMENT NAME: SARU CAROLYNN ELIZABETH REGISTRATION NO: L50/61646/2013 COURSE CODE: LDP 603 COURSE TITLE: RESEARCH METHODS LECTURER: GAKUU CHRISTOPHER

More information

List of Examples. Examples 319

List of Examples. Examples 319 Examples 319 List of Examples DiMaggio and Mantle. 6 Weed seeds. 6, 23, 37, 38 Vole reproduction. 7, 24, 37 Wooly bear caterpillar cocoons. 7 Homophone confusion and Alzheimer s disease. 8 Gear tooth strength.

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade Statistics Quiz Correlation and Regression -- ANSWERS 1. Temperature and air pollution are known to be correlated. We collect data from two laboratories, in Boston and Montreal. Boston makes their measurements

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

International Statistical Institute, 56th Session, 2007: Phil Everson

International Statistical Institute, 56th Session, 2007: Phil Everson Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: peverso1@swarthmore.edu 1. Introduction

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities. Consider a study in which How many subjects? The importance of sample size calculations Office of Research Protections Brown Bag Series KB Boomer, Ph.D. Director, boomer@stat.psu.edu A researcher conducts

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

T O P I C 1 2 Techniques and tools for data analysis Preview Introduction In chapter 3 of Statistics In A Day different combinations of numbers and types of variables are presented. We go through these

More information

Difference tests (2): nonparametric

Difference tests (2): nonparametric NST 1B Experimental Psychology Statistics practical 3 Difference tests (): nonparametric Rudolf Cardinal & Mike Aitken 10 / 11 February 005; Department of Experimental Psychology University of Cambridge

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY Before we begin: Turn on the sound on your computer. There is audio to accompany this presentation. Audio will accompany most of the online

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application

More information

Ordinal Regression. Chapter

Ordinal Regression. Chapter Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise

More information

Simple Predictive Analytics Curtis Seare

Simple Predictive Analytics Curtis Seare Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences. 1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

More information

Simple Linear Regression

Simple Linear Regression STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals (9.3) Conditions for inference (9.1) Want More Stats??? If you have enjoyed learning how to analyze

More information

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.) Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.) Logistic regression generalizes methods for 2-way tables Adds capability studying several predictors, but Limited to

More information

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION SOLUTIONS 1. a. To calculate the mean, we just add up all 7 values, and divide by 7. In Xi i= 1 fancy

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Data Analysis, Research Study Design and the IRB

Data Analysis, Research Study Design and the IRB Minding the p-values p and Quartiles: Data Analysis, Research Study Design and the IRB Don Allensworth-Davies, MSc Research Manager, Data Coordinating Center Boston University School of Public Health IRB

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information

Comparing Means in Two Populations

Comparing Means in Two Populations Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

ANALYSING LIKERT SCALE/TYPE DATA, ORDINAL LOGISTIC REGRESSION EXAMPLE IN R.

ANALYSING LIKERT SCALE/TYPE DATA, ORDINAL LOGISTIC REGRESSION EXAMPLE IN R. ANALYSING LIKERT SCALE/TYPE DATA, ORDINAL LOGISTIC REGRESSION EXAMPLE IN R. 1. Motivation. Likert items are used to measure respondents attitudes to a particular question or statement. One must recall

More information

Lecture 2. Summarizing the Sample

Lecture 2. Summarizing the Sample Lecture 2 Summarizing the Sample WARNING: Today s lecture may bore some of you It s (sort of) not my fault I m required to teach you about what we re going to cover today. I ll try to make it as exciting

More information

MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS. Biostatistics

MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS. Biostatistics MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS title- course code: Program name: Contingency Tables and Log Linear Models Level Biostatistics Hours/week Ther. Recite. Lab. Others Total Master of Sci.

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Analyzing Research Data Using Excel

Analyzing Research Data Using Excel Analyzing Research Data Using Excel Fraser Health Authority, 2012 The Fraser Health Authority ( FH ) authorizes the use, reproduction and/or modification of this publication for purposes other than commercial

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name: Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

More information

Research Methods & Experimental Design

Research Methods & Experimental Design Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

More information

Data analysis process

Data analysis process Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis

More information

STATISTICAL ANALYSIS WITH EXCEL COURSE OUTLINE

STATISTICAL ANALYSIS WITH EXCEL COURSE OUTLINE STATISTICAL ANALYSIS WITH EXCEL COURSE OUTLINE Perhaps Microsoft has taken pains to hide some of the most powerful tools in Excel. These add-ins tools work on top of Excel, extending its power and abilities

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-

More information

Section 3 Part 1. Relationships between two numerical variables

Section 3 Part 1. Relationships between two numerical variables Section 3 Part 1 Relationships between two numerical variables 1 Relationship between two variables The summary statistics covered in the previous lessons are appropriate for describing a single variable.

More information

COMPARING DATA ANALYSIS TECHNIQUES FOR EVALUATION DESIGNS WITH NON -NORMAL POFULP_TIOKS Elaine S. Jeffers, University of Maryland, Eastern Shore*

COMPARING DATA ANALYSIS TECHNIQUES FOR EVALUATION DESIGNS WITH NON -NORMAL POFULP_TIOKS Elaine S. Jeffers, University of Maryland, Eastern Shore* COMPARING DATA ANALYSIS TECHNIQUES FOR EVALUATION DESIGNS WITH NON -NORMAL POFULP_TIOKS Elaine S. Jeffers, University of Maryland, Eastern Shore* The data collection phases for evaluation designs may involve

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)

More information

SPSS Guide How-to, Tips, Tricks & Statistical Techniques

SPSS Guide How-to, Tips, Tricks & Statistical Techniques SPSS Guide How-to, Tips, Tricks & Statistical Techniques Support for the course Research Methodology for IB Also useful for your BSc or MSc thesis March 2014 Dr. Marijke Leliveld Jacob Wiebenga, MSc CONTENT

More information

LOGIT AND PROBIT ANALYSIS

LOGIT AND PROBIT ANALYSIS LOGIT AND PROBIT ANALYSIS A.K. Vasisht I.A.S.R.I., Library Avenue, New Delhi 110 012 amitvasisht@iasri.res.in In dummy regression variable models, it is assumed implicitly that the dependent variable Y

More information

Statistics E100 Fall 2013 Practice Midterm I - A Solutions

Statistics E100 Fall 2013 Practice Midterm I - A Solutions STATISTICS E100 FALL 2013 PRACTICE MIDTERM I - A SOLUTIONS PAGE 1 OF 5 Statistics E100 Fall 2013 Practice Midterm I - A Solutions 1. (16 points total) Below is the histogram for the number of medals won

More information

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES

MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES MASTER COURSE SYLLABUS-PROTOTYPE THE PSYCHOLOGY DEPARTMENT VALUES ACADEMIC FREEDOM AND THUS OFFERS THIS MASTER SYLLABUS-PROTOTYPE ONLY AS A GUIDE. THE INSTRUCTORS ARE FREE TO ADAPT THEIR COURSE SYLLABI

More information

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:

More information

Statistics 151 Practice Midterm 1 Mike Kowalski

Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Multiple Choice (50 minutes) Instructions: 1. This is a closed book exam. 2. You may use the STAT 151 formula sheets and

More information

The correlation coefficient

The correlation coefficient The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

More information

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different

More information