SPSS for Exploratory Data Analysis Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav)

Advertisement


Advertisement
Similar documents
Using SPSS, Chapter 2: Descriptive Statistics

SPSS Manual for Introductory Applied Statistics: A Variable Approach

IBM SPSS Statistics for Beginners for Windows

Directions for Frequency Tables, Histograms, and Frequency Bar Charts

Scatter Plots with Error Bars

Summarizing and Displaying Categorical Data

4. Descriptive Statistics: Measures of Variability and Central Tendency

SPSS TUTORIAL & EXERCISE BOOK

Data exploration with Microsoft Excel: analysing more than one variable

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

Diagrams and Graphs of Statistical Data

MTH 140 Statistics Videos

Describing, Exploring, and Comparing Data

When to use Excel. When NOT to use Excel 9/24/2014

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

SPSS Tests for Versions 9 to 13

An SPSS companion book. Basic Practice of Statistics

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

An Introduction to SPSS. Workshop Session conducted by: Dr. Cyndi Garvan Grace-Anne Jackman

An introduction to IBM SPSS Statistics

INTRODUCTION TO SPSS FOR WINDOWS Version 19.0

Introduction Course in SPSS - Evening 1

Foundation of Quantitative Data Analysis

Introduction to Statistics with SPSS (15.0) Version 2.3 (public)

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Exercise 1.12 (Pg )

Table of Contents. Preface

4 Other useful features on the course web page. 5 Accessing SAS

Chapter 5 Analysis of variance SPSS Analysis of variance

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red.

Data Analysis Tools. Tools for Summarizing Data

Chapter 23. Inferences for Regression

SPSS Introduction. Yi Li

Directions for using SPSS

GETTING YOUR DATA INTO SPSS

Lecture 1: Review and Exploratory Data Analysis (EDA)

Creating an Excel XY (Scatter) Plot

Data exploration with Microsoft Excel: univariate analysis

Exploratory data analysis (Chapter 2) Fall 2011

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

The Chi-Square Test. STAT E-50 Introduction to Statistics

Simple Linear Regression in SPSS STAT 314

Chapter 4 Displaying and Describing Categorical Data

Descriptive and Inferential Statistics

Learning SPSS: Data and EDA

January 26, 2009 The Faculty Center for Teaching and Learning

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Simple Linear Regression, Scatterplots, and Bivariate Correlation

To create a histogram, you must organize the data in two columns on the worksheet. These columns must contain the following data:

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Excel Charts & Graphs

General instructions for the content of all StatTools assignments and the use of StatTools:

5. Correlation. Open HeightWeight.sav. Take a moment to review the data file.

Appendix 2.1 Tabular and Graphical Methods Using Excel

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Introduction to StatsDirect, 11/05/2012 1

Selected Introductory Statistical and Data Manipulation Procedures. Gordon & Johnson 2005

Northumberland Knowledge

Projects Involving Statistics (& SPSS)

Chapter 3. The Normal Distribution

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

One-Way ANOVA using SPSS SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

Psych. Research 1 Guide to SPSS 11.0

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

SPSS Notes (SPSS version 15.0)

Variables. Exploratory Data Analysis

1: Introduction to SPSS

Data analysis process

Using Excel for descriptive statistics

STATISTICAL ANALYSIS WITH EXCEL COURSE OUTLINE

Exploratory Data Analysis

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.

SECTION 2-1: OVERVIEW SECTION 2-2: FREQUENCY DISTRIBUTIONS

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Appendix III: SPSS Preliminary

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.


Variables and Data A variable contains data about anything we measure. For example; age or gender of the participants or their score on a test.

Descriptive Statistics

STC: Descriptive Statistics in Excel Running Descriptive and Correlational Analysis in Excel 2013

Instructions for SPSS 21

Microsoft Excel. Qi Wei

Getting started in Excel

CHARTS AND GRAPHS INTRODUCTION USING SPSS TO DRAW GRAPHS SPSS GRAPH OPTIONS CAG08

Simple Predictive Analytics Curtis Seare

Introduction to Exploratory Data Analysis

7. Comparing Means Using t-tests.

How to Use a Data Spreadsheet: Excel

IBM SPSS Direct Marketing 23

Independent t- Test (Comparing Two Means)

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller

2 Describing, Exploring, and

Statistics 101 Homework 2

Advertisement
Transcription:

Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav) Organize and Display One Quantitative Variable (Descriptive Statistics, Boxplot & Histogram) 1. Move the mouse pointer to Analyze, click the left button of the mouse and move through the following menu selections: Analyze Descriptive Statistics Explore (To perform Exploratory Analysis) 2. In the Explore dialog box, click and select the variable (weight) to be studied. Click the variable to be selected from the list of variables on the left for analysis and click the select button, the button with an arrow shape in it, to select the variable into Dependent List box. A. Chang 1

3. In the Explore dialog box, click Plots button. In the Explore: Plots dialog box, check the Histogram and Normality plots with tests, if they are needed, and click on Continue button. If Sig. value (or p-value) in the normality test table is less than.05, it implies that data may not be from normally distributed population. The values.200 and.236 are p-values calculated based on tow different tests. WEIGHT Tests of Normality Kolmogorov-Smirnov a Shapiro-Wilk Statistic df Sig. Statistic df Sig..127 22.200*.938 22.236 *. This is a lower bound of the true significance. a. Lilliefors Significance Correction The p-values are both greater than 0.05. The distribution which the data was sampled from is not significantly different from normal, at 5% level of significance. 4. In the Explore dialog box, click on OK button. The SPSS will put the results, histogram, stemplot and descriptive statistics such as mean, standard deviation, confidence interval for mean, in the OUTPUT window. 5. If, in the Explore dialog box, one click on Statistics button and check on Percentiles box and click on Continue, SPSS will produce quartiles (25 th and 75 th percentiles are good for getting 5-number summary for the data), and some special percentiles for the active data set. 6. If one wishes to explore the quantitative variable for separate categories of a qualitative variable, select that qualitative variable and put it in the Factor List and click OK. (See the example in last three pages of this document.) A. Chang 2

Organize and Display One Qualitative(Categorical) Variable (Pie or bar charts) 1. Move the mouse pointer on Analyze, click the left button of the mouse and move through the following menu selections: Analyze Descriptive Statistics Frequencies 2. In the Frequencies dialog box, click and select the variable (sex) to be studied. Click the variable to be selected for analysis from the list of variables on the left and click the select button, the button with a dark triangular shape in it, to select the variable into A. Chang 3

3. In the Frequencies dialog box, click Charts button, if one wishes to display chart. In the Frequencies: Charts dialog box check on the desired chart and select either Frequencies or Percentages to be displayed and click Continue button. 4. In the Frequencies dialog box, click on OK button. The SPSS will put the results, frequency distribution table and bar chart (if checked), in the OUTPUT window. sex Valid Female Male Total Cumulative Frequency Percent Valid Percent Percent 9 40.9 40.9 40.9 13 59.1 59.1 100.0 22 100.0 100.0 14 sex 12 10 8 6 4 Frequency 2 0 Female Male sex A. Chang 4

Examine Relation Between Two Quantitative Variables by Chart (Scatter Plot) 1. Click and move through the following menu selections: Graphs Legacy Dialogs Scatter 2. In Scatterplot dialog box, click the Simple option and click Define button. 3. In Simple Scatterplot dialog box, select the two variables (height and weight) to be studied. If one wishes to build a regression model for predicting height using weight variable, usually choose height variable for Y Axis (as response variable) and choose weight variable for X Axis (as explanatory variable). One can select sex variable for the Set Markers by: field to make scatter plot display scatter dots with different color for different sex. A. Chang 5

4. In the Simple Scatterplot dialog box, click on OK button. The SPSS will put the scatter plot in the OUTPUT window. The following scatter plot is based on the data in studentp.sav file with the first case dropped, since the height information for the first case is incorrect. 76 74 72 70 68 66 64 5. One can double click on the any part of the chart in the SPSS output window to bring up the chart editor for editing the scatter plot. A fitted line can be added to the chart using the Chart option in the chart editor menu bar. 62 HEIGHT 60 58 100 200 300 WEIGHT A. Chang 6

Examine Relation Between Two Qualitative Variables (Contingency Table & Cluster Bar Chart) 1. Click and move through the following menu selections: Analyze Descriptive Statistics Crosstabs 2. In Crosstabs dialog box, select the categorical variables for Row variable and Column variable, and click OK button. A. Chang 7

3. In Crosstabs dialog box, click on Cells button to specify whether or not to display the percentage information and then click Continue button to go back to Crosstabs dialog box. 4. One can also check on Display clustered bar charts option in the Crosstabs dialog box to display clustered bar chart with only frequency (count) information. If percentages are needed be displayed, go through Graphs options. 5. In the Crosstabs dialog box, click on OK button. The SPSS will put a contingency table and also a clustered bar chart in the OUTPUT window if the clustered bar chart box is checked. A. Chang 8

6. The Chi-square test and other statistics can be done by clicking on Statistics button and choose the desired option. To perform a chi-square test, click on Statistics button and check the Chi-square box. The chi-square test results will be displayed in SPSS output window after clicking on OK from the Crosstabs diaglog box.. A. Chang 9

Clustered Bar Chart (Make a separate cluster bar chart) 1. Move the mouse pointer on Graphs, click the left button of the mouse and move through the following menu selections: Graphs Legacy Dialogs Bar 2. In Bar Charts dialog box, click the Clustered option. Check the Data in Chart Are option in Summaries for groups of cases and click Define button. A. Chang 10

Select the two variables (sex and coin) to be studied. One can select sex variable for the Category Axis and coin variable for Clusters and check of % of cases and click OK. (Sometimes, percentage information is better for understanding the data.) 70 60 50 40 30 20 cion * Chart editor can be used to modify the chart and change the color or pattern in the chart. To activate the chart editor, one can simply double click any part of the chart in the SPSS Output window. Percent 10 0 Female Male T H sex A. Chang 11

Examine Relation Between One Quantitative Variable with One Qualitative Factor Variable (Side-by-side boxplot, descriptive measures for sub-categories.) 1. Move the mouse pointer on Analyze, click the left button of the mouse and move through the following menu selections: Analyze Descriptive Statistics Explore (To perform Exploratory Analysis) 2. In the Explore dialog box, click and select the variables (weight) and (sex) to be studied. Click the variable to be selected (weight) from the list of variables on the left for analysis and click the select button, the button with a dark triangular shape in it, to select the variable into Dependent List box. Select the sex variable in Factor List box to observe the difference between the weights from both genders. A. Chang 12

3. In the Explore dialog box, click Plots button. In the Explore: Plots dialog box, check the Histogram and Normality plots with tests, if they are needed, and click on Continue button. If Sig. value (or p-value) in the normality test table is less than.05, it implies that data may not be from normally distributed population. 4. In the Explore dialog box, click on OK button. The SPSS will put the results, histograms, stemplots, descriptive statistics, and side-by-side boxplot in the OUTPUT window. Descriptives weight sex Female Mean 95% Confidence Interval for Mean Lower Bound Upper Bound Statistic Std. Error 134.89 7.983 116.48 153.30 Male 5% Trimmed Mean Median Variance Std. Deviation Minimum Maximum Range Interquartile Range Skewness Kurtosis Mean 95% Confidence Interval for Mean Lower Bound Upper Bound 133.99 135.00 573.611 23.950 106 180 74 38.705.717.173 1.400 192.85 9.035 173.16 212.53 5% Trimmed Mean Median Variance Std. Deviation Minimum Maximum Range Interquartile Range Skewness Kurtosis 190.11 185.00 1061.141 32.575 150 285 135 25 1.990.616 5.449 1.191 A. Chang 13

* Tests of normality for weight variable, one for male and one for female. weight sex Female Male Tests of Normality Kolmogorov-Smirnov a Shapiro-Wilk Statistic df Sig. Statistic df Sig..165 9.200*.945 9.633 *. This is a lower bound of the true significance. a. Lilliefors Significance Correction.243 13.035.812 13.010 * Two histograms for weights, one for male and one for female. Histogram Histogram for sex= Female for sex= Male 2.0 7 6 1.5 5 Frequency 1.0 Frequency 4 3 0.5 2 1 0.0 Mean = 134.89 Std. Dev. = 23.95 N = 9 0 Mean = 192.85 Std. Dev. = 32.575 N = 13 100 120 140 160 180 150 180 210 240 270 300 weight weight * Side-by-side box plot for comparing weight between male and female students. 300 22 200 7 100 WEIGHT 0 N = 9 Female 13 Male sex A. Chang 14