SPSS Workshop. Day 2 Data Analysis

Similar documents
Scatter Plots with Error Bars

Summarizing and Displaying Categorical Data

Using SPSS, Chapter 2: Descriptive Statistics

Diagrams and Graphs of Statistical Data

Describing, Exploring, and Comparing Data

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

The Chi-Square Test. STAT E-50 Introduction to Statistics

MTH 140 Statistics Videos

IBM SPSS Statistics for Beginners for Windows

SPSS Manual for Introductory Applied Statistics: A Variable Approach

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

Exercise 1.12 (Pg )

Chapter 23. Inferences for Regression

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Exploratory data analysis (Chapter 2) Fall 2011

MEASURES OF VARIATION

MBA 611 STATISTICS AND QUANTITATIVE METHODS

The Dummy s Guide to Data Analysis Using SPSS

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures

Chapter 5 Analysis of variance SPSS Analysis of variance

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.


Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Table of Contents. Preface

An introduction to IBM SPSS Statistics

Introduction Course in SPSS - Evening 1

Exploratory Data Analysis

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Data exploration with Microsoft Excel: analysing more than one variable

An SPSS companion book. Basic Practice of Statistics

Chapter 2: Frequency Distributions and Graphs

MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Projects Involving Statistics (& SPSS)

January 26, 2009 The Faculty Center for Teaching and Learning

Module 2: Introduction to Quantitative Data Analysis

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

4. Descriptive Statistics: Measures of Variability and Central Tendency

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red.

CHARTS AND GRAPHS INTRODUCTION USING SPSS TO DRAW GRAPHS SPSS GRAPH OPTIONS CAG08

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Practice#1(chapter1,2) Name

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

STATISTICAL ANALYSIS WITH EXCEL COURSE OUTLINE

Additional sources Compilation of sources:

Statistics Chapter 2

SPSS TUTORIAL & EXERCISE BOOK

Name: Date: Use the following to answer questions 2-3:

Using Excel for descriptive statistics

2013 MBA Jump Start Program. Statistics Module Part 3

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

Variables. Exploratory Data Analysis

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

Lecture 1: Review and Exploratory Data Analysis (EDA)

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Introduction to Quantitative Methods

Directions for Frequency Tables, Histograms, and Frequency Bar Charts

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout

a) Find the five point summary for the home runs of the National League teams. b) What is the mean number of home runs by the American League teams?

Gestation Period as a function of Lifespan

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Foundation of Quantitative Data Analysis

Measurement & Data Analysis. On the importance of math & measurement. Steps Involved in Doing Scientific Research. Measurement

DATA INTERPRETATION AND STATISTICS

Mind on Statistics. Chapter 2

Descriptive Statistics

Demographics of Atlanta, Georgia:

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Fairfield Public Schools

UNDERSTANDING THE TWO-WAY ANOVA

Chapter 7 Section 7.1: Inference for the Mean of a Population

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Part 1: Background - Graphing

2 Describing, Exploring, and

II. DISTRIBUTIONS distribution normal distribution. standard scores

SPSS Tests for Versions 9 to 13

Tutorial for proteome data analysis using the Perseus software platform

Analyzing Research Data Using Excel

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

Statistical tests for SPSS

Data representation and analysis in Excel

Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab

Creating Charts in Microsoft Excel A supplement to Chapter 5 of Quantitative Approaches in Business Studies

Describing and presenting data

SPSS Explore procedure

Factors affecting online sales

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

Is it statistically significant? The chi-square test

GETTING YOUR DATA INTO SPSS

Dongfeng Li. Autumn 2010

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Data exploration with Microsoft Excel: univariate analysis

Transcription:

SPSS Workshop Day 2 Data Analysis

Outline Descriptive Statistics Types of data Graphical Summaries For Categorical Variables For Quantitative Variables Contingency Tables Hypothesis Testing One Sample t-test Two Sample t-test Sample Size/Power Analysis

Descriptive Statistics 5-number summary Minimum- minimum value in your dataset Q1-25th percentile (25% of the data is below this value) Median- middle value of your data (50th percentile: 50% of the data is below this value) Q3-75th percentile (75% of the data is below this value) Maximum- maximum value in your dataset Mean- average value of your all your data points Standard deviation- the average distance each observation falls from the mean Variance- average of the squared deviations; explains the variation of the data about the mean

To SPSS: Open gssnet.sav ->Analyze->Descriptive Statistics ->Descriptives ->Analyze->Descriptive Statistics->Frequencies (you can get more descriptive statistics here also)

Types of Data Variable- any characteristic that is recorded for subjects in a study Categorical- if each observation belongs to one of a set of categories Quantitative- if observations on it take numerical values that represent different magnitudes of the variable Discrete- if its possible values form a set of separate numbers, such as 0, 1, 2, Continuous- if its possible values form an interval

Other Valuable Terminology Parameter- a numerical summary of the population Statistic- a numerical summary of a sample taken from the population Frequency table- a listing of possible values for a variable, together with the number of observations for each value Relative frequency- proportions and percentages

Graphical Summaries for Categorical Variables Pie chart- a circle having a slice of the pie for each category. The size of a slice corresponds to the percentage of observations in the category Bar chart- displays a vertical bar for each category. The height of the bar is the percentage of observations in the category

To SPSS: Still in gssnet.sav For the pie chart: ->Graphs->Pie->Summaries of groups of cases->define slices by netcat->click OK For labels: ->Double click on the chart ->Elements->Show data labels- >choose labels

SPSS continued For the bar chart: ->Graphs->Bar->Simple ->Category axis: netcat Again, we can choose which labels to appear on the chart by double clicking.

Graphical Summaries for Quantitative Variables Dot plot- shows a dot for each observation, placed just above the value on the number line for that observation. Stem-and-Leaf Plot- each observation is represented by a stem and a leaf. Usually the stem consists of all digits except the final one, which is the leaf. Histogram- a graph that uses bars to portray the frequencies or the relative frequencies of the possible outcomes. Scatterplot- display for two variables. It uses the horizontal axis for the explanatory variable (x) and the vertical axis for the response variable (y).

To SPSS: Open marathon.sav Histogram: ->Analyze->Descriptive Statistics->Frequencies ->Charts->Histogram (you can also put a normal curve on the histogram to see how the shape of your data compares to the normal distribution)

SPSS continued: Scatterplots: ->Graphs->Scatter/dot.. ->Simple Scatter->Define ->Choose (continuous) variables

Other Useful Plots Time plot- charts each observation, on the vertical scale, against the time it was measured, on the horizontal scale Box plot- constructed from the 5-number summary

To SPSS: Box plots: ->Graphs->Boxplot->Simple ->variable (continuous) ->category axis (categorical) (You can also use boxplots in order to visually compare different groups on a quantitative variable, i.e. age by gender)

Contingency Tables/Cross Tabs A contingency table is a display for two categorical variables. Its rows list the categories of one variable and its columns list the categories of the other variable. Each entry in the table is the frequency of cases in the sample with certain outcomes of the two variables The process of taking a data file and finding the frequencies for the cells of a contingency table is referred to as cross-tabulation of the data

Example 2 x 2 contingency table: Binge Drinking by Gender Binge Drinker Non-binge Drinker Total Male 1908 2017 3925 Female 2854 4125 6979 Total 4762 6142 10904

Chi-squared Test for Independence The chi-squared test is a hypothesis test to see whether two categorical variables are independent of one another. We will look to see if the p-value <.05 (Reject the null hypothesis) If so, then our variables are not independent of one another

To SPSS: ->Analyze->Descriptive Statistics->Crosstabs You can also request a chisquared test for independence: ->Click on Statistics ->Check Chi-square

Interpreting P-values We compare the calculated p-value to a pre-specified value (usually.05), if the calculated p-value is less than.05 then there is significant evidence to reject the null hypothesis.

One-sample t-test Does the population mean differ from hypothesized value? Different alternative hypotheses (SPSS only does two-sided hypothesis test)

Examples Does anorexia therapy induce a positive mean weight change? Is the amount of Coke dispensed into a can 12 oz.? Do radio advertisements increase the average daily sales of hamburgers?

To SPSS: Is the mean age of marathon runners greater than 30? ->Analyze->Compare means - >One sample t-test ->test value = 30

Interpreting the p-value With a p-value less than.05, there is a significant difference between the mean age of our sample and the specified test value of 30.

Two-sample t-test (Independent samples) Does one population mean differ from another population mean? Different alternative hypotheses

Examples Do women tend to spend more time on housework than men? Do men and women watch the same amount of television in a day?

To SPSS: Are the male runners older than the female runners? ->Analyze->Independent Samples t-test ->test variable (continuous) ->grouping variable (categorical)

Interpreting the p-value With a p-value less than.05, there is a significant difference between the mean completion time for males and females.

Paired t-test (matched pairs/dependent samples) Does the population mean change for two different treatments (before & after)? Different alternative hypotheses

Examples Does the use of a cell phone impact driver reaction time? (matched pairs) Does exercise help blood pressure? (before & after)

To SPSS: Open endorph.sav Do the beta endorphin levels differ before and after running a half-marathon? ->Analyze->Compare means ->Paired samples t-test ->Paired variables (before & after)

Interpreting the p-value With a p-value less than.05, there is a significant difference between beta endorphin levels before and after running a halfmarathon.

Determining Sample Size Power- the ability to reject the null hypothesis when it is false If a certain level of power is desired, use power analysis to determine the required sample size