(a) Write a paragraph summarizing the key characteristics of this study. In particular, identify:
|
|
- Calvin Peters
- 7 years ago
- Views:
Transcription
1 Data Collection: (a) Write a paragraph summarizing the key characteristics of this study. In particular, identify: Observational units and sample size The explanatory variable and response variable (as set up in the research conjecture) and classify each as quantitative or categorical The type of study (observational or randomized experiment). Justify your answer. A brief description of the sample used in the study In particular, was this a random sample selected from a larger population? >> The observational units are 40 student volunteers. The sample size is 40: student volunteers at University College, London. This was not a random sample, the students volunteered for the study. The response variable is brain size, and the explanatory variable is number of Facebook friends; both are quantitative. This is an observational study; the explanatory variable was not imposed by the researchers. Descriptive Statistics: (b) In describing scatterplots, our goal is to describe the association between the two variables. To do so, we focus on three characteristics: direction, strength, and form. Is the direction of the association as expected by the researchers? State specifically what the researchers believed the relationship would be. Would you consider the association strong? (In other words, does knowing a person s number of Facebook friends help you to accurately predict their brain density in this region?) Explain. Does the pattern of the association seem reasonably well modeled by a line? (Or, is there curvature or some other prominent pattern in the plot?) 1
2 >> The scatterplot appears linear, with a fairly weak positive correlation. Yes, the direction of the association is in the direction expected by the researchers; they conjectured that Facebook use swells the size of brain regions devoted to tracking and maintaining relationships, and therefore that there is a positive correlation between the density of those areas and the number of Facebook friends an individual has. The dots are not very concentrated along a specific line and show a weak but present positive association; I wouldn t feel very comfortable predicting someone s brain density in this region because there appears to be quite a range of brain densities for the same number of Facebook friends. good Though the scatterplot isn t obviously concentrated along a specific line, the pattern of association would be best modeled by a line as opposed to a curve. Numerical Summary: (c) Report the value of this correlation coefficient. Does the correlation coefficient indicate a positive or negative linear association here? Does it seem strong (r close to 1 or -1) or weak (r close to 0)? >> r=.366. This correlation coefficient is closer to zero than it is to 1; its positive value indicates a positive linear association. ok Interpreting the Least Squares Regression Equation: Include screen capture of scatterplot with regression line superimposed. 2
3 (d) Identify the values of the intercept and the slope, and (following the example) provide a detailed (statistical) interpretation of both the slope and intercept coefficients of the least squares regression line (red) in the Facebook/Brain Density study context. Hint: Remember that the explanatory variable here is how many hundred Facebook friends the person reports. Slope value: >> Interpretation: >> This slope coefficient of.2009 is the predicted change in the value of the brain density (of this certain region) associated with a oneunit increase in the number of hundreds of Facebook friends an observational unit has. The predicted increase in brain density (of this brain area) associated with each additional one hundred friends is.2009 arbitrary units ; how much denser we expect a person's grey matter to be if the number of Facebook friends is one hundred friends greater. Intercept value: >> Interpretation: >> This intercept coefficient of (-.7404) is the predicted value of the brain density (of this certain region) when the number of how many hundreds of Facebook friends the observational unit (student) has is zero.this model predicts that a person with zero Facebook friends has brain density (in that region of the brain) of Which doesn t make sense for this value, but appears to model the majority of the data well. Statistical Inference - Simulation: (e) If our research conjecture is that more Facebook friends will tend to correspond to larger brain density values, what does this imply about the value of the population slope? State this, in symbols, as the alternative hypothesis. >> Ha: >0 (f) Did you obtain the same regression line as the researchers (in red)? Is the association still positive in the simulated scatterplot? Is the relationship stronger or weaker than the one observed by these researchers (how are you deciding)? >> No, the new line is not the same, nor positive as our original regression line; the blue shuffled regression line displays a negative correlation coefficient of The relationship is weaker than the one observed by these researchers because itclarify is closer to zero than our previous value of.366, reflecting a weaker association between the explanatory and response variables; the closer to 1 (or -1), the stronger the association. (g) Did you obtain the same (blue) regression line the second time? >> No; r=.040, noting an extremely weak but regardless, positive association, different from the first shuffled regression line. (h) Notice that the two sample slope values you generated have been added to the dotplot on the right. Suppose you repeat these shuffling 1000 times, where do you think this distribution will be centered? Why? >> I predict the distribution will be centered around zero because with the null distribution we are assuming the null hypothesis is true, and that there is ZERO association between brain density and number of Facebook friends. (in population) 3
4 (i) Press the Shuffle Y-values button 8 more times. Describe the pattern you are seeing in how the regression lines on the scatterplot vary from sample to sample (what pattern is there to the blue lines on the scatterplot) as if to someone who couldn't see the same picture. >> Sample to sample, the shuffled regression lines vary from having positive slopes to negative slopes, but always intersect at the same point on the scatterplot. The blue lines do not display very slanted slopes and are always relatively close together; they are not as extreme in direction as the original red regression line. Does the observed regression line for this study (in red) appear to be extreme (compare to the simulated blue lines)? >> The observed regression line does not lie within the overlay of the blue regression lines and is definitely more extreme than the blue regression lines, but not distinctly so. (j) Describe the shape, center (mean), and spread (standard deviation) of the dotplot of the resulting null distribution of sample slopes. Including: what are the observational units and variable of this distribution? Also, make sure you describe what the standard deviation is measuring (Hint: What is the variable?). >> The null distribution is normal and symmetric, centered around 0.00 (mean), with a standard deviation of.086 as shown by the small amount of variability and short spread. The standard deviation is measuring the variability in the calculated slopes by all 1000 shuffles, specifically how similar the slopes of the simulated null distribution are to one another, and in relation to the center of the distribution, all under the assumption there is no association between grey matter density and number of Facebook friends. The observational units are the 1000 shuffles of 40 cards with their corresponding values (the values indicate the intersection of the density value and the number of Facebook friends value on the scatterplot), and the variable is the slope of each resulting regression line. (k) Include a screen capture of your output (null distribution and p-value) and provide a one-sentence summary of this p-value in this context. Paste a copy of the null distribution of sample slopes with shading here. 4
5 p-value interpretation>> If I shuffle a set of 40 cards with values representing the intersections? of the density value and number of Facebook friends many, many times, I would expect to see a resulting slope of.2009 (for the regression line) or more extreme (greater) in only.7% of shuffles. Assuming the null hypothesis to be true - 1/2 (l) What conclusion will you draw from this p-value? >> This p-value of.007 is less than.05 and therefore statistically significant; I can reject the null hypothesis that states there is NO association between number of Facebook friends and grey matter density in regions of the brain responsible for social perception and associative memory. in favor of? (m) Using the standard deviation you found in (j), how many standard deviations is the observed sample slope (0.2) from the hypothesized slope (0.0)? Show your work. >> ( )/.086= The observed sample slope is standard deviations from the hypothesized slope. Statistical Inference Theoretical Model (n) Does the theoretical p-value provide a reasonable approximation to the simulated p-value? >> The theoretical p-value provides a reasonable approximation to the simulated p-value; both are statistically significant, less than.05. What two values are you comparing? -1/2 Include a copy of the overlaid t-distribution and your Regression Table here. 5
6 (o) How does the SE value corresponding to the friends row compare to the standard deviation of the slopes found in (j)? >> The SE value for friends is.0830, which is less than but very close to the standard deviation I found in J,.086. (p) How does this t-value for the friends row compare to what you found in (m)? >> This t-value for friends is 2.42, which is also very similar to what I found in M, which represents how many standard deviations the observed statistic is from the mean. (q) How does the (two-sided) p-value for the friends row of the Regression Table output compare to the one-sided p-value you estimated in (k)? >>The two sided p-value for the friends row of the regression table is.0204, which is greater than the one sided p-value I estimated in K, but still statistically significant (less than.05 and very small) regardless. why is it larger? (r) Using the Two Standard Deviation method, use the observed slope and the standard error reported from the Regression Table output (or the dotplot of simulated slopes) to approximate a 95% confidence interval for the population slope. Show your work and write a one-sentence interpretation of your interval. Calculation >>.2+2(.0830)= (.083)=.034 Confidence Interval: (.034,.366) 6
7 Interpretation >> I m 95% confident that in the population, each additional one hundred Facebook friends is associated with an increase in arbitrary units on average between.034 and.366 in brain matter density. Conclusions: (s) Write a few sentences summarizing your analysis for this research study, in particular: What did you learn about the relationship between Facebook friends and brain density measurements for this sample from the scatterplot and the correlation coefficient about the sample data? State the hypotheses that we were testing about the population in words (Hint: Back in part e). Write a sentence interpreting the one-sided p-value you found in the context of the Facebook study (What is it the probability of?). What are your final conclusions about the relationship between Facebook friends and brain density for this population and how do they follow from your p-value? What is your estimate of the population slope, make sure you include an interpretation of what we mean by 'slope' in this context? What population are you willing to generalize these results to? (Briefly justify.) Are you able to draw a cause-and-effect conclusion between Facebook friends and brain density in this population? (Briefly justify.) What concerns do you have about how these data were collected and how that might impact the conclusions you can draw form this study? (Your answer should discuss at least one potential non-sampling error.) >> According to the statistically significant p-value and t-statistics calculated in this study, I learned that there may be a positive relationship between the number of stated Facebook friends and the density of certain areas in the brain responsible for social perception and associative memory, or at least that in this group of students more distinction on sample vs. population/ First just talk about the population? I can reject the assumption that there is NO association between the two. The observed correlation coefficient of.366 does not show a strong association, but regardless shows a statistically significant association between the observed variables. In testing the population, we hypothesized (our null) that there is no association between number of Facebook friends and brain density; our alternative hypothesis stated that there IS an association, and that the association is positive; that with an increase in Facebook friends comes an increase in brain density of those specific regions. My repeated p-value interpretation: If I shuffle a set of 40 cards with values representing the intersections of the density value and number of Facebook friends many, many times, (null hypothesis) I would expect to see a resulting slope of.2009 or more extreme in only.7% of shuffles. Essentially, the probability of obtaining results as extreme as these is highly unlikely (only.7%) under the assumption the null hypothesis is true. I cannot draw causation because this sample is not random, nor is random assignment used. There is potential for confounding variables in this study, primarily the idea that those who volunteer may in general be more sociable, outgoing people, with naturally denser brain regions responsible for sociability. This isn t a confounding variable, it s a source of sampling bias As well, the students self-reported the number of friends, and may have stated numbers greater than accurate.!!! Mixed up your random sampling/random assignment and consequences a bit too much -1/2 Keep in Mind: There is a difference between the strength of a relationship and the strength of the evidence against the null hypothesis 7
8 Application: (a) Create the scatterplot and calculate the least squares line. Also include the Parameter Estimates output. Include a copy of scatterplot and regression line Include a copy of Regression Table output (b) How do does the regression line compare to the follow-up data for 40 subjects that you examined in this lab? Comparison >> The two lines are very similar in slope and intercept while just eye-balling them on the scatterplots. As well, the equations model each other very closely: Original study: x Latter study: x (c) From the Regression Table output, how do the standard error for the slope, t Ratio, and p-value for the slope compare to the original analysis on the follow-up data? Explain why these changes make sense based on what is different between the two data sets. 8
9 >> The SE in this latter study is smaller than in the original due to the greater sample size; the t-stat is greater, and the p-value is smaller, both indicating stronger evidence against the null hypothesis, also due to the greater sample size. a bit more on why the larger sample size does this? -1/2 Extra Credit: Why do you think this method of "look at a bunch of regions to see which are significant" is often criticized? (Hint: If I run lots of tests of significance using a 5% level of significance, how often will I reject the null hypothesis when I really shouldn't have?) >> When using this method of looking at all the regions individually to see which are statistically significant it is easier to reject the null hypothesis more often than not because a 5% level of significance is not that much when there is a great potential for outliers; the individual sets of data, if extreme, have a greater possibility to skew the p-value in a direction that favors the alternative hypothesis, which might not and probably doesn t represent the overall data. More than if you run 100 tests like this, 5 will reject the null hypothesis when it shouldn t +1/2 9
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationMTH 140 Statistics Videos
MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative
More informationChapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
More informationCALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
More informationChapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
More informationSouth Carolina College- and Career-Ready (SCCCR) Probability and Statistics
South Carolina College- and Career-Ready (SCCCR) Probability and Statistics South Carolina College- and Career-Ready Mathematical Process Standards The South Carolina College- and Career-Ready (SCCCR)
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationThe Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces
The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces Or: How I Learned to Stop Worrying and Love the Ball Comment [DP1]: Titles, headings, and figure/table captions
More informationSTAT 350 Practice Final Exam Solution (Spring 2015)
PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationSimple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
More informationStatistics 2014 Scoring Guidelines
AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home
More informationCHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression
Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the
More informationRelationships Between Two Variables: Scatterplots and Correlation
Relationships Between Two Variables: Scatterplots and Correlation Example: Consider the population of cars manufactured in the U.S. What is the relationship (1) between engine size and horsepower? (2)
More informationRecall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationHow To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
More informationRegression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
More informationChapter 7 Section 7.1: Inference for the Mean of a Population
Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used
More informationStatistics courses often teach the two-sample t-test, linear regression, and analysis of variance
2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample
More informationCOMMON CORE STATE STANDARDS FOR
COMMON CORE STATE STANDARDS FOR Mathematics (CCSSM) High School Statistics and Probability Mathematics High School Statistics and Probability Decisions or predictions are often based on data numbers in
More informationX X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
More information2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
More information1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression
More informationChapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
More informationChapter 23. Inferences for Regression
Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily
More informationExercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More informationThis unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.
Algebra I Overview View unit yearlong overview here Many of the concepts presented in Algebra I are progressions of concepts that were introduced in grades 6 through 8. The content presented in this course
More informationDESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
More informationWEB APPENDIX. Calculating Beta Coefficients. b Beta Rise Run Y 7.1 1 8.92 X 10.0 0.0 16.0 10.0 1.6
WEB APPENDIX 8A Calculating Beta Coefficients The CAPM is an ex ante model, which means that all of the variables represent before-thefact, expected values. In particular, the beta coefficient used in
More informationHYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
More informationSimple Linear Regression
STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals (9.3) Conditions for inference (9.1) Want More Stats??? If you have enjoyed learning how to analyze
More informationSample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
More informationUnivariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
More information" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
More informationAssociation Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
More informationIntroduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
More informationLecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation
Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation Display and Summarize Correlation for Direction and Strength Properties of Correlation Regression Line Cengage
More informationThe correlation coefficient
The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative
More informationLAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE
LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-
More informationSummary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
More informationSection 14 Simple Linear Regression: Introduction to Least Squares Regression
Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship
More informationModule 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
More informationCorrelation and Regression
Correlation and Regression Scatterplots Correlation Explanatory and response variables Simple linear regression General Principles of Data Analysis First plot the data, then add numerical summaries Look
More informationHomework 8 Solutions
Math 17, Section 2 Spring 2011 Homework 8 Solutions Assignment Chapter 7: 7.36, 7.40 Chapter 8: 8.14, 8.16, 8.28, 8.36 (a-d), 8.38, 8.62 Chapter 9: 9.4, 9.14 Chapter 7 7.36] a) A scatterplot is given below.
More informationMultiple Regression: What Is It?
Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in
More informationCorrelation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2
Lesson 4 Part 1 Relationships between two numerical variables 1 Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables
More informationRegression and Correlation
Regression and Correlation Topics Covered: Dependent and independent variables. Scatter diagram. Correlation coefficient. Linear Regression line. by Dr.I.Namestnikova 1 Introduction Regression analysis
More informationWhat is the purpose of this document? What is in the document? How do I send Feedback?
This document is designed to help North Carolina educators teach the Common Core (Standard Course of Study). NCDPI staff are continually updating and improving these tools to better serve teachers. Statistics
More informationSection 3 Part 1. Relationships between two numerical variables
Section 3 Part 1 Relationships between two numerical variables 1 Relationship between two variables The summary statistics covered in the previous lessons are appropriate for describing a single variable.
More informationHYPOTHESIS TESTING WITH SPSS:
HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER
More informationStepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection
Chapter 311 Introduction Often, theory and experience give only general direction as to which of a pool of candidate variables (including transformed variables) should be included in the regression model.
More informationChapter 7. One-way ANOVA
Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks
More informationNCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
More informationPearson's Correlation Tests
Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation
More informationFactors affecting online sales
Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4
More informationAP STATISTICS REVIEW (YMS Chapters 1-8)
AP STATISTICS REVIEW (YMS Chapters 1-8) Exploring Data (Chapter 1) Categorical Data nominal scale, names e.g. male/female or eye color or breeds of dogs Quantitative Data rational scale (can +,,, with
More informationPremaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
More informationCorrelational Research
Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.
More informationhp calculators HP 50g Trend Lines The STAT menu Trend Lines Practice predicting the future using trend lines
The STAT menu Trend Lines Practice predicting the future using trend lines The STAT menu The Statistics menu is accessed from the ORANGE shifted function of the 5 key by pressing Ù. When pressed, a CHOOSE
More informationFinal Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
More informationThe Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)
Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,
More informationCorrelation key concepts:
CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)
More information3. Data Analysis, Statistics, and Probability
3. Data Analysis, Statistics, and Probability Data and probability sense provides students with tools to understand information and uncertainty. Students ask questions and gather and use data to answer
More informationCourse Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.
SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed
More informationCORRELATION ANALYSIS
CORRELATION ANALYSIS Learning Objectives Understand how correlation can be used to demonstrate a relationship between two factors. Know how to perform a correlation analysis and calculate the coefficient
More information2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationPart 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
More informationInteraction between quantitative predictors
Interaction between quantitative predictors In a first-order model like the ones we have discussed, the association between E(y) and a predictor x j does not depend on the value of the other predictors
More informationChapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
More informationChicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this
More informationSPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
More informationHypothesis Testing for Beginners
Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easy-to-read notes
More informationLesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
More informationDescriptive Statistics
Descriptive Statistics Descriptive statistics consist of methods for organizing and summarizing data. It includes the construction of graphs, charts and tables, as well various descriptive measures such
More informationCorrelational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots
Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship
More informationModule 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
More informationAugust 2012 EXAMINATIONS Solution Part I
August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,
More information12: Analysis of Variance. Introduction
1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider
More informationUsing R for Linear Regression
Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional
More informationRegression III: Advanced Methods
Lecture 4: Transformations Regression III: Advanced Methods William G. Jacoby Michigan State University Goals of the lecture The Ladder of Roots and Powers Changing the shape of distributions Transforming
More informationTwo-sample inference: Continuous data
Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As
More informationLinear Models in STATA and ANOVA
Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationp ˆ (sample mean and sample
Chapter 6: Confidence Intervals and Hypothesis Testing When analyzing data, we can t just accept the sample mean or sample proportion as the official mean or proportion. When we estimate the statistics
More informationDiagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
More informationInternational Statistical Institute, 56th Session, 2007: Phil Everson
Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: peverso1@swarthmore.edu 1. Introduction
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationThis curriculum is part of the Educational Program of Studies of the Rahway Public Schools. ACKNOWLEDGMENTS
CURRICULUM FOR STATISTICS & PROBABILITY GRADES 11 & 12 This curriculum is part of the Educational Program of Studies of the Rahway Public Schools. ACKNOWLEDGMENTS Christine H. Salcito, Director of Curriculum
More informationIndependent samples t-test. Dr. Tom Pierce Radford University
Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of
More informationABSORBENCY OF PAPER TOWELS
ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationFormula for linear models. Prediction, extrapolation, significance test against zero slope.
Formula for linear models. Prediction, extrapolation, significance test against zero slope. Last time, we looked the linear regression formula. It s the line that fits the data best. The Pearson correlation
More informationCopyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5
Using Your TI-83/84 Calculator: Linear Correlation and Regression Elementary Statistics Dr. Laura Schultz This handout describes how to use your calculator for various linear correlation and regression
More informationWhat Does the Normal Distribution Sound Like?
What Does the Normal Distribution Sound Like? Ananda Jayawardhana Pittsburg State University ananda@pittstate.edu Published: June 2013 Overview of Lesson In this activity, students conduct an investigation
More information