In last class, we learned statistical inference for population mean. Meaning. The population mean. The sample mean
|
|
- Posy Franklin
- 7 years ago
- Views:
Transcription
1 RECALL: In last class, we learned statistical inference for population mean. Problem. Notation Populati on Notation X σ Meaning The population mean The sample mean The population standard deviation s The sample standard deviation n The sample size
2 RECALL: Point estimation. (sample mean X ) Distribution of X Confidence Interval One-sample z-interval (population SD is known) One-sample t-interval (only sample SD is known) X ± t Remark: 1. T-interval needs normal assumption. * 2. t n 1, which is related to n-1 and C%, can be obtained from t-table. * n 1 s n
3 RECALL: Hypothesis Testing about μ Null Hypothesis H 0 vs. Alternative Hypothesis H A H 0 : Z-Test (population SD is known) Test statistic: P-value: vs. Alternative Hypothesis H A H A : H A : H A : (two-sided) (one-sided) (one-sided) P-value formula H A : (two-sided) P-value=2P(Z> z ) H A : (one-sided) P-value=P(Z>z) H A : (one-sided) P-value=P(Z<z)
4 RECALL: Hypothesis Testing about μ Null Hypothesis H 0 vs. Alternative Hypothesis H A H A : (two-sided) H 0 : vs. H A : (one-sided) H A : (one-sided) T-Test (sample SD s is known) Test statistic: X µ 0 t = s n P-value:(df=n-1) Alternative Hypothesis H A P-value formula H A : (two-sided) Two-tail prob. of t H A : (one-sided) One-tail prob. of t H A : (one-sided) One-tail prob. of t
5 RECALL: TI commands (under STAT TESTS): T-interval: use 8: T Interval T-Test: use 2:T-Test Decisions: If p-value< alpha level, reject H 0, and we say the test is statistically significant at this alpha level); If p-value>alpha level, fail to reject H 0, and we say the test is not statistically significant at this alpha level); Errors: Type I error: decide to reject H 0, but actually H 0 is true; Type II error: decide to retain H 0, but actually H 0 is false; P(Type I error)=alpha level.
6 Exploring Relationship Between Variables Chapter 7: Scatterplots, Association, and Correlation Chapter 8: Linear Regression
7 WHERE ARE WE GOING? People might ask the following questions in the real life: 1. Is the price of sneakers related to how long they last? 2. Is smoking related to lung cancer? 3. Do baseball teams that score more runs sell more tickets to their games? Chapter 7 will look at relationships between two quantitative variables X and Y. Scatterplot Correlation
8 TERM 1: SCATTERPLOTS Is the price of sneakers related to how long they last? Following table shows some data collected for sneakers: Years Price($) Price This is an example of scatterplot. x-axis represents variable years and y-axis represents prices.
9 TERM 1: SCATTERPLOT Scatterplots may be the most common and most effective display for paired data. Scatterplots are the best way to start observing the relationship and the ideal way to picture associations between two quantitative variables Price X-axis: Years, Explanatory variable which explains or influences changes in the other variable. Y-axis: Price, Response variable which measures an outcome of a study.
10 TERM 1: SCATTERPLOTS How do we describe the scatterplot? Or, What information about the relationship of the two variables can we get by looking at the scatterplot? Please look at the scatterplot of the sneakers example, and think about what can you tell about the relationship of years and price Price We are going to describe the relationship from four different aspects. 1) Direction 2) Form 3) Strength 4) Unusual features
11 TERM 1: SCATTERPLOT Scatterplot Look for direction: What s my design positive, negative or neither? Negative A pattern like this that runs from the upper left to the lower right is said to be negative. Y variable decreases as the X variable increases. Positive A pattern running the other way is called positive. Y variable increases as X variable increases. Y Y X Scatterplot X
12 TERM 1: SCATTERPLOT The example in the text shows a negative association between central pressure and maximum wind speed As the central pressure increases, the maximum wind speed decreases
13 TERM 1: SCATTERPLOTS Look for Form: straight, curved or something exotic, or no pattern? Scatterplot Scatterplot Scatterplot Y Y Y X X X Straight line, linear Curved No pattern In this part, we are more interested in the linear pattern.
14 TERM 1: SCATTERPLOTS Look for strength: how much scatter? Or, how strong the relationship is? Strong: the points appear tightly clustered in a single stream. Scatterplot Scatterplot Scatterplot Y Y Y X X Weak: the X swarm of points seem to form a vague cloud through which we can barely discern any trend or pattern Scatterplot Y X
15 TERM 1: SCATTERPLOTS Look for the Unusual Features: Are there outliers or subgroups? Scatterplot Scatterplot Y Y The point circled is a potential outlier X There are two clusters. X
16 TERM 1: SCATTERPLOT-ROLES FOR VARIABLES It is important to determine which of the two quantitative variables goes on the x-axis and which on the y-axis. Slide 1-16 This determination is made based on the roles played by the variables. When the roles are clear, the explanatory or predictor variable goes on the x-axis, and the response variable goes on the y-axis.
17 TERM 1: SCATTERPLOTS Summary A Scatterplot shows the relationship between two quantitative variables measured on the same individual. The variable that is designated the X variable is called the explanatory variable The variable that is designated the Y variable is called the response variable Always plot the explanatory variable on the horizontal (x) axis Always plot the response variable on the vertical (y) axis In examining scatterplots, look for an overall pattern showing the form, direction and strength of the relationship Look also for outliers or other deviations from this pattern
18 TERM 1: SCATTERPLOT Example: Fast food is often considered unhealthy because much of it is high in fat. Are fat and calories related? Here are the fat and calories contents of several brands of burgers. Analyze the association between fat content and calories. Fat(g) Calories Calorie Fat Comment on the scatterplot: 1) Direction Positive 2) Form Roughly linear 3) Strength Moderately strong 4) Unusual features No.
19 TERM 2: CORRELATION From scatterplots, we can look for the relationship between two quantitative variables and whether the relationship is strong or weak. But how strong is it? Correlation coefficient (or simply correlation) is a quantitative measure of linear relationship (association) between two quantitative variables. Finding the correlation coefficient, denoted by r, by hand: ( x x)( y y) r = ( n 1) s x s y s s Where x and y are standard deviations for X and Y respectively. Remarks: Before you use correlation, you must check several conditions: Quantitative Variables Condition Straight Enough Condition Outlier Condition
20 TERM 2: CORRELATION (Revisit the calories example) Here are the fat and calories contents of several brands of burgers. X: Fat(g) Y: Calories What is the correlation coefficient of x (fat) and y (calories)? Solution: Deviations in x Deviations in y Product 20-35= =-180 (-15)*(-180)= = =-10 (-5)*(-10)= = = 0 0*0= = =-20 1*(-20)= = = 50 5*50= = = 90 5*90= = = 70 9*70=630 Add up the products: (-20) =4060 Correlation r=4060/{(7-1)*7.98*89.81}=0.9442
21 TERM 2: CORRELATION
22 CORRELATION PROPERTIES The sign of a correlation coefficient gives the direction of the linear association. Positive sign Positive linear association Negative sign Negative linear association Correlation is always between -1 and +1. Correlation can be exactly equal to -1 or +1, but these values are unusual in real data because they mean that all the data points fall exactly on a single straight line. A correlation near zero corresponds to a weak linear association. Example: The correlation between fat and calories as indicates a strong positive linear association between them. Slide 1-22
23 y y TERM 2: CORRELATION Cautions about correlation: Quantitative Variables Condition: Correlation applies only to quantitative variables. Straight Enough Condition: Correlation measures the strength only of the linear association r= x -2 r= x Outlier Condition: Outliers can distort the correlation dramatically. y With the outlier: r=0.795 Without the outlier: r= x
24 TERM 2: CORRELATION Correlation Causation Fast food is often considered unhealthy because much of it is high in fat. Are fat and calories related? Based on the fat and calories contents of several brands of burgers, the correlation between them is r= Which conclusion is most accurate? A. More fat in the burgers causes higher calories B. The burgers containing more fat tend to have higher calories Comment: Even though A sounds all right, it is not the conclusion can be derived/explained by the correlation. Correlation is an objective story teller of the linear association between two variables. It can t tell the causation.
25 CORRELATION PROPERTIES (CONT.) Correlation treats x and y symmetrically: The correlation of x with y is the same as the correlation of y with x. Slide 1-25 Correlation has no units. Correlation is not affected by shifting and rescaling of either variable. Correlation depends only on the z-scores, and they are unaffected by changes in center or scale. i.e. corr(ax+b,cy+d)=corr(x,y) where a,b,c,d are constants.
26 TERM 2: CORRELATION Example: Here are several scatterplots. The calculated correlations are , , and Which is which? (a) (b) Y Y X X (c) (d) Y Y X X
27 QUESTION: CAN WE DO MORE? Scatterplot and correlation are useful tolls helping us to learn the (linear) association between two quantitative variables. Can we answer the following question: Fast food is often considered unhealthy because much of it is high in fat. What is the calorie content of a kind of fast food with 28g fat? 700 If we want to estimate a unknown value based on the 650 known values, this is called a 600 prediction. Calorie Fat One way to do the prediction is by constructing a linear model.
28 TERM 3: LINEAR MODEL Let s look at the burger example again. Fat(g) Calories BURGERS CALORIES FAT The red line does not go through all the points, but it can summarize the general pattern with only a couple of parameters: Calories = a+b*fat. This model can be used to predict the Calories based on the fat contain. Explanatory Var: Fat Response Var: Calories
29 TERM 3: LINEAR MODEL BURGERS Predicted value: we call the estimate made from a model the predicted value, denoted as. Residual: The difference between the observed value and its associated predicted value is called the residual. The line of best fit is the line for which the sum of the squared residuals is smallest. And it s called the least squares line. ŷ CALORIES residual Prediction FAT
30 TERM 3: LINEAR MODEL
31 TERM 3: LINEAR MODEL X: Fat(g) Y: Calories Q1: Please construct a linear regression model to predict the calories based on fat. Fat: Calories: Correlation: r= Slope: Intercept: Linear model: Q2: What is the predicted calorie when the fat is 30g? When x=30, Q3: What is the residual for the burger with 30g fat? When x=30, the residual is CALORIES BURGERS = x FAT
32 TERM 3: LINEAR MODEL Remarks: Since regression and correlation are closely related, we need to check the same conditions for regressions as we did for correlations: Quantitative Variables Condition Straight Enough Condition Outlier Condition
33 TERM 3: LINEAR MODEL (PARAMETERS) We write a and b for the slope and intercept of the line. They are called the coefficients of the linear model. The coefficient b is the slope, which tells us how rapidly the predicted value ( ŷ ) changes with respect to x. As the value of x increases by 1 unit, the predicted value of y will be increased by b units. The coefficient a is the intercept, which tells where the line hits (intercepts) the y-axis. In other words, the intercept a is the predicted value of y when x=0
34 Intercept and Slope (examples) Fast food is often considered unhealthy because much of it is high in fat. Are fat and calories related? Here are the fat and calories contents of several brands of burgers. To analyze the association between fat content and calories, the equation of the regression model is: Predicted calories= *fat For this linear equation, slope=10.63, intercept= Q1: What does the slope mean? A1: An increase in fat of 1 gram is associated with an increase in calories of Q2: If the fat increases by 2 grams, how many more calories are expected to be contained in the burger? A2: 2*10.63=21.26 Q3: What does the intercept mean here? A3: Theoretically, it means: when the burger contains no fat at all, the amount of calories is
35 TERM 4: RESIDUAL PLOT After you construct the linear model, you have to check whether the linear model makes sense or not. Residual plot can be used to check the appropriateness of the linear model. Residual plot is the scatterplot of the residuals versus the x- values. If a linear model is appropriate, then the residual plot shouldn t have any interesting features, like a direction or shape. It should stretch horizontally, with about the same amount of scatter throughout. It should show no bends, and it should have no outliers. Residuals X
36 TERM 4: RESIDUAL SCATTERPLOT Now, let s try to diagnose the model for the calorie and fat example. Fat(g): x Calories: y Predicted calories: Residual: Residual plot residuals fat x
37 TERM 4: RESIDUAL PLOT Example: Tell what each of the residual plots below indicates about the appropriateness of the linear model that was fit to the data. (a) (b) (c) y y y x x x3
38 TI for correlation and regression equation The first time you do this: Press 2 nd, CATALOG (above 0) Scroll down to DiagnosticOn Press ENTER, ENTER Read Done Your calculator will remember this setting even when turned off Enter predictor (x) values in L1 Enter response (y) values in L2 Pairs must line up There must be the same number of predictor and response values Press STAT, > (to CALC) Scroll down to 8:LinReg(a+bx), press ENTER, ENTER Read intercept a, slope b and correlation r at the screen
39 IMPORTANT NOTES: Take-home quiz is due on Monday. No late submission will be accepted. Keep the ID assignment and bring it to class on Monday. Sample exam will be handed out on Monday. We will discuss the questions on Wednesday. Suggested Problem Set 4 will be collected on next Thursday. Final exam will be on next Thursday. 2 hours in class. Please prepare one page A4 size cheat sheet (one-sided) on your own. Formula sheet will not be provided in final exam. Cheat sheet will be collected together with the final exam.
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationChapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
More informationHow Does My TI-84 Do That
How Does My TI-84 Do That A guide to using the TI-84 for statistics Austin Peay State University Clarksville, Tennessee How Does My TI-84 Do That A guide to using the TI-84 for statistics Table of Contents
More informationRelationships Between Two Variables: Scatterplots and Correlation
Relationships Between Two Variables: Scatterplots and Correlation Example: Consider the population of cars manufactured in the U.S. What is the relationship (1) between engine size and horsepower? (2)
More informationLecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation
Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation Display and Summarize Correlation for Direction and Strength Properties of Correlation Regression Line Cengage
More informationChapter 23. Inferences for Regression
Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily
More informationch12 practice test SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
ch12 practice test 1) The null hypothesis that x and y are is H0: = 0. 1) 2) When a two-sided significance test about a population slope has a P-value below 0.05, the 95% confidence interval for A) does
More informationExercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
More information2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
More informationChapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
More informationScatter Plot, Correlation, and Regression on the TI-83/84
Scatter Plot, Correlation, and Regression on the TI-83/84 Summary: When you have a set of (x,y) data points and want to find the best equation to describe them, you are performing a regression. This page
More informationTHE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.
THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM
More informationChapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationSection 14 Simple Linear Regression: Introduction to Least Squares Regression
Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More informationUnivariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
More informationHypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
More informationCopyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5
Using Your TI-83/84 Calculator: Linear Correlation and Regression Elementary Statistics Dr. Laura Schultz This handout describes how to use your calculator for various linear correlation and regression
More informationUSING A TI-83 OR TI-84 SERIES GRAPHING CALCULATOR IN AN INTRODUCTORY STATISTICS CLASS
USING A TI-83 OR TI-84 SERIES GRAPHING CALCULATOR IN AN INTRODUCTORY STATISTICS CLASS W. SCOTT STREET, IV DEPARTMENT OF STATISTICAL SCIENCES & OPERATIONS RESEARCH VIRGINIA COMMONWEALTH UNIVERSITY Table
More informationX X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
More informationSTAT 350 Practice Final Exam Solution (Spring 2015)
PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects
More informationLesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
More informationThe Dummy s Guide to Data Analysis Using SPSS
The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests
More information2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationIntroduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
More informationElements of a graph. Click on the links below to jump directly to the relevant section
Click on the links below to jump directly to the relevant section Elements of a graph Linear equations and their graphs What is slope? Slope and y-intercept in the equation of a line Comparing lines on
More informationMTH 140 Statistics Videos
MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationCopyright 2013 by Laura Schultz. All rights reserved. Page 1 of 7
Using Your TI-83/84/89 Calculator: Linear Correlation and Regression Dr. Laura Schultz Statistics I This handout describes how to use your calculator for various linear correlation and regression applications.
More informationLecture 13/Chapter 10 Relationships between Measurement (Quantitative) Variables
Lecture 13/Chapter 10 Relationships between Measurement (Quantitative) Variables Scatterplot; Roles of Variables 3 Features of Relationship Correlation Regression Definition Scatterplot displays relationship
More information1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationHomework 11. Part 1. Name: Score: / null
Name: Score: / Homework 11 Part 1 null 1 For which of the following correlations would the data points be clustered most closely around a straight line? A. r = 0.50 B. r = -0.80 C. r = 0.10 D. There is
More informationStatistics 2014 Scoring Guidelines
AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home
More informationExample: Boats and Manatees
Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant
More informationCHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression
Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the
More informationHYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
More informationYou buy a TV for $1000 and pay it off with $100 every week. The table below shows the amount of money you sll owe every week. Week 1 2 3 4 5 6 7 8 9
Warm Up: You buy a TV for $1000 and pay it off with $100 every week. The table below shows the amount of money you sll owe every week Week 1 2 3 4 5 6 7 8 9 Money Owed 900 800 700 600 500 400 300 200 100
More informationGood luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:
Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours
More informationAn analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression
Chapter 9 Simple Linear Regression An analysis appropriate for a quantitative outcome and a single quantitative explanatory variable. 9.1 The model behind linear regression When we are examining the relationship
More informationStatistics courses often teach the two-sample t-test, linear regression, and analysis of variance
2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample
More informationFactors affecting online sales
Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4
More informationBusiness Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
More informationOnce saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.
1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis
More informationDESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
More informationt Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon
t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com www.excelmasterseries.com
More informationCorrelation key concepts:
CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)
More informationKSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management
KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To
More information" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
More informationCorrelation and Regression
Correlation and Regression Scatterplots Correlation Explanatory and response variables Simple linear regression General Principles of Data Analysis First plot the data, then add numerical summaries Look
More informationFinal Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
More informationLesson 9 Hypothesis Testing
Lesson 9 Hypothesis Testing Outline Logic for Hypothesis Testing Critical Value Alpha (α) -level.05 -level.01 One-Tail versus Two-Tail Tests -critical values for both alpha levels Logic for Hypothesis
More informationThe correlation coefficient
The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative
More information2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.
Math 1530-017 Exam 1 February 19, 2009 Name Student Number E There are five possible responses to each of the following multiple choice questions. There is only on BEST answer. Be sure to read all possible
More informationStatistics for Sports Medicine
Statistics for Sports Medicine Suzanne Hecht, MD University of Minnesota (suzanne.hecht@gmail.com) Fellow s Research Conference July 2012: Philadelphia GOALS Try not to bore you to death!! Try to teach
More informationCURVE FITTING LEAST SQUARES APPROXIMATION
CURVE FITTING LEAST SQUARES APPROXIMATION Data analysis and curve fitting: Imagine that we are studying a physical system involving two quantities: x and y Also suppose that we expect a linear relationship
More informationYou have data! What s next?
You have data! What s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511 Spring 2014 Part 1:! Research Questions Part 1:! Research Questions Write down > 2 things you thought were
More information11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
More informationCorrelation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2
Lesson 4 Part 1 Relationships between two numerical variables 1 Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables
More informationIntroduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationCorrelational Research
Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.
More informationNCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
More informationModule 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
More informationSPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
More informationSIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.
SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation
More informationEstimation of σ 2, the variance of ɛ
Estimation of σ 2, the variance of ɛ The variance of the errors σ 2 indicates how much observations deviate from the fitted surface. If σ 2 is small, parameters β 0, β 1,..., β k will be reliably estimated
More informationHYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
More informationModule 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
More informationComparing Means in Two Populations
Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we
More informationUsing R for Linear Regression
Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional
More informationRegression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
More informationLinear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares
Linear Regression Chapter 5 Regression Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). We can then predict the average response for all subjects
More informationStat 412/512 CASE INFLUENCE STATISTICS. Charlotte Wickham. stat512.cwick.co.nz. Feb 2 2015
Stat 412/512 CASE INFLUENCE STATISTICS Feb 2 2015 Charlotte Wickham stat512.cwick.co.nz Regression in your field See website. You may complete this assignment in pairs. Find a journal article in your field
More informationHomework 8 Solutions
Math 17, Section 2 Spring 2011 Homework 8 Solutions Assignment Chapter 7: 7.36, 7.40 Chapter 8: 8.14, 8.16, 8.28, 8.36 (a-d), 8.38, 8.62 Chapter 9: 9.4, 9.14 Chapter 7 7.36] a) A scatterplot is given below.
More informationMULTIPLE REGRESSION EXAMPLE
MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if
More informationTwo-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
More informationPractice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.
Practice problems for Homework 1 - confidence intervals and hypothesis testing. Read sections 10..3 and 10.3 of the text. Solve the practice problems below. Open the Homework Assignment 1 and solve the
More informationCourse Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics
Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This
More informationLinear Models in STATA and ANOVA
Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples
More information1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ
STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material
More informationHow To Write A Data Analysis
Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction
More informationCourse Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.
SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed
More informationInference for two Population Means
Inference for two Population Means Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison October 27 November 1, 2011 Two Population Means 1 / 65 Case Study Case Study Example
More informationCHAPTER 14 NONPARAMETRIC TESTS
CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences
More informationCALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
More informationCorrelation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers
Correlation Greg C Elvers What Is Correlation? Correlation is a descriptive statistic that tells you if two variables are related to each other E.g. Is your related to how much you study? When two variables
More informationPlot the following two points on a graph and draw the line that passes through those two points. Find the rise, run and slope of that line.
Objective # 6 Finding the slope of a line Material: page 117 to 121 Homework: worksheet NOTE: When we say line... we mean straight line! Slope of a line: It is a number that represents the slant of a line
More informationThe Wilcoxon Rank-Sum Test
1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We
More informationDescriptive statistics; Correlation and regression
Descriptive statistics; and regression Patrick Breheny September 16 Patrick Breheny STA 580: Biostatistics I 1/59 Tables and figures Descriptive statistics Histograms Numerical summaries Percentiles Human
More information17. SIMPLE LINEAR REGRESSION II
17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.
More informationResults from the 2014 AP Statistics Exam. Jessica Utts, University of California, Irvine Chief Reader, AP Statistics jutts@uci.edu
Results from the 2014 AP Statistics Exam Jessica Utts, University of California, Irvine Chief Reader, AP Statistics jutts@uci.edu The six free-response questions Question #1: Extracurricular activities
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationUsing Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
More informationInteraction between quantitative predictors
Interaction between quantitative predictors In a first-order model like the ones we have discussed, the association between E(y) and a predictor x j does not depend on the value of the other predictors
More information