Descriptive Analysis

Similar documents

II. DISTRIBUTIONS distribution normal distribution. standard scores

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Additional sources Compilation of sources:

1-3 id id no. of respondents respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

Using SPSS, Chapter 2: Descriptive Statistics

Association Between Variables

SPSS Tests for Versions 9 to 13

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

Descriptive Statistics

Foundation of Quantitative Data Analysis

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Fairfield Public Schools

Analysing Questionnaires using Minitab (for SPSS queries contact -)

DATA INTERPRETATION AND STATISTICS

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Descriptive Statistics

Study Guide for the Final Exam

Projects Involving Statistics (& SPSS)

Final Exam Practice Problem Answers

Independent t- Test (Comparing Two Means)

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Statistical tests for SPSS

The Chi-Square Test. STAT E-50 Introduction to Statistics

Introduction to Quantitative Methods

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

An introduction to IBM SPSS Statistics

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

UNDERSTANDING THE TWO-WAY ANOVA

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Research Methods & Experimental Design

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

Lesson 4 Measures of Central Tendency

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Using Excel for inferential statistics

Module 2: Introduction to Quantitative Data Analysis

A and B This represents the probability that both events A and B occur. This can be calculated using the multiplication rules of probability.

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

3.4 Statistical inference for 2 populations based on two samples

Statistics Review PSY379

Chapter 7 Section 7.1: Inference for the Mean of a Population

Biostatistics: Types of Data Analysis

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

SPSS Explore procedure

Diagrams and Graphs of Statistical Data

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

How To Write A Data Analysis

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

TRAINING PROGRAM INFORMATICS

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

NCSS Statistical Software

An introduction to using Microsoft Excel for quantitative data analysis

03 The full syllabus. 03 The full syllabus continued. For more information visit PAPER C03 FUNDAMENTALS OF BUSINESS MATHEMATICS

SPSS TUTORIAL & EXERCISE BOOK

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

Dongfeng Li. Autumn 2010

Quantitative Methods for Finance

Survey Research Data Analysis

How To Check For Differences In The One Way Anova

An Introduction to Statistics using Microsoft Excel. Dan Remenyi George Onofrei Joe English

Data exploration with Microsoft Excel: univariate analysis

MBA 611 STATISTICS AND QUANTITATIVE METHODS

HYPOTHESIS TESTING WITH SPSS:

CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

1 Nonparametric Statistics

Introduction to Statistics and Quantitative Research Methods

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

First-year Statistics for Psychology Students Through Worked Examples

Comparing Means in Two Populations

Chapter 3 RANDOM VARIATE GENERATION

Intro to Parametric & Nonparametric Statistics

MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES

Two Related Samples t Test

Normality Testing in Excel

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Statistics 2014 Scoring Guidelines

Data exploration with Microsoft Excel: analysing more than one variable

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

Permutation Tests for Comparing Two Populations

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

The Dummy s Guide to Data Analysis Using SPSS

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!

Testing differences in proportions

Contingency Tables and the Chi Square Statistic. Interpreting Computer Printouts and Constructing Tables

Two Correlated Proportions (McNemar Test)

Simple Linear Regression Inference

Rank-Based Non-Parametric Tests

WHAT IS A JOURNAL CLUB?

Skewed Data and Non-parametric Methods

Transcription:

Research Methods William G. Zikmund Basic Data Analysis: Descriptive Statistics Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging, ordering, and manipulating data to generate descriptive information

Type of Measurement Type of descriptive analysis Nominal Two categories More than two categories Frequency table Proportion (percentage) Frequency table Category proportions (percentages) Mode Type of Measurement Type of descriptive analysis Ordinal Rank order Median

Type of Measurement Type of descriptive analysis Interval Arithmetic mean Type of Measurement Type of descriptive analysis Ratio Index numbers Geometric mean Harmonic mean

Tabulation Tabulation - Orderly arrangement of data in a table or other summary format Frequency table Percentages Frequency Table The arrangement of statistical data in a rowand-column format that exhibits the count of responses or observations for each category assigned to a variable

Central Tendency Measure of Central Measure of Type of Scale Tendency Dispersion Nominal Mode None Ordinal Median Percentile Interval or ratio Mean Standard deviation Cross-Tabulation A technique for organizing data by groups, categories, or classes, thus facilitating comparisons; a joint frequency distribution of observations on two or more sets of variables Contingency table- The results of a crosstabulation of two variables, such as survey questions

Cross-Tabulation Analyze data by groups or categories Compare differences Contingency table Percentage cross-tabulations Base The number of respondents or observations (in a row or column) used as a basis for computing percentages

Elaboration and Refinement Moderator variable A third variable that, when introduced into an analysis, alters or has a contingent effect on the relationship between an independent variable and a dependent variable. Spurious relationship An apparent relationship between two variables that is not authentic. Quadrant Analysis Two rating scales 4 quadrants two-dimensional table Importance- Performance Analysis)

Data Transformation Data conversion Changing the original form of the data to a new format More appropriate data analysis New variables Data Transformation Summative Score VAR1 + VAR2 + VAR 3

Collapsing a Five-Point Scale Strongly Agree Agree Neither Agree nor Disagree Disagree Strongly Disagree Strongly Agree/Agree Neither Agree nor Disagree Disagree/Strongly Disagree Index Numbers Score or observation recalibrated to indicate how it relates to a base number CPI - Consumer Price Index

Calculating Rank Order Ordinal data Brand preferences Tables Bannerheads for columns Studheads for rows

Pie charts Line graphs Bar charts Vertical Horizontal Charts and Graphs

Line Graph Bar Graph 90 80 70 60 50 40 30 20 10 0 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North

WebSurveyor Bar Chart How did you find your last job? Temporary agency 1.5 % 643 Networking 213 print ad 179 Online recruitment site 112 Placement firm 18 Temporary agency Placement firm 9.6 % Online recruitment site 15.4 % print ad 18.3 % Networking 55.2 % 0 100 200 300 400 500 600 700 SPSS SAS SYSTAT Microsoft Excel WebSurveyor Computer Programs

Microsoft Excel -Data Analysis The Paste Function Provides Numerous Statistical Operations

Computer Programs Box and whisker plots Interquartile range - midspread Outlier

Interpretation The process of making pertinent inferences and drawing conclusions concerning the meaning and implications of a research investigation Research Methods William G. Zikmund Univariate Statistics

Univariate Statistics Test of statistical significance Hypothesis testing one variable at a time Hypothesis Unproven proposition Supposition that tentatively explains certain facts or phenomena Assumption about nature of the world

Hypothesis An unproven proposition or supposition that tentatively explains certain facts or phenomena Null hypothesis Alternative hypothesis Null Hypothesis Statement about the status quo No difference

Alternative Hypothesis Statement that indicates the opposite of the null hypothesis Significance Level Critical probability in choosing between the null hypothesis and the alternative hypothesis

Significance Level Critical Probability Confidence Level Alpha Probability Level selected is typically.05 or.01 Too low to warrant support for the null hypothesis The null hypothesis that the mean is equal to 3.0: H o : µ 3.0

The alternative hypothesis that the mean does not equal to 3.0: H : µ 1 3.0 A Sampling Distribution µ3.0 x

A Sampling Distribution α.025 α.025 µ3.0 x A Sampling Distribution LOWER LIMIT µ3.0 UPPER LIMIT

Critical values of µ Critical value - upper limit µ + ZS X or µ + Z 3.0 + 1.96 1.5 225 S n Critical values of µ 3.0 + 3.0 + 3.196 1.96( 0.1).196

Critical values of µ Critical value - lower limit µ - ZS X or µ - 3.0-1.96 1.5 225 Z S n Critical values of µ 3.0 1.96 3.0 2.804.196 ( 0.1)

Region of Rejection LOWER LIMIT µ3.0 UPPER LIMIT Hypothesis Test µ 3.0 2.804 µ3.0 3.196 3.78

Type I and Type II Errors Accept null Reject null Null is true Correct- no error Type I error Null is false Type II error Correct- no error Type I and Type II Errors in Hypothesis Testing State of Null Hypothesis Decision in the Population Accept Ho Reject Ho Ho is true Correct--no error Type I error Ho is false Type II error Correct--no error

Calculating Z obs x z obs sx Alternate Way of Testing the Hypothesis X Z obs S X

Alternate Way of Testing the Hypothesis Z obs 3. 78 µ 3.78 S. 1 X 0.78 7. 8.1 3.0 Choosing the Appropriate Statistical Technique Type of question to be answered Number of variables Univariate Bivariate Multivariate Scale of measurement

PARAMETRIC STATISTICS NONPARAMETRIC STATISTICS t-distribution Symmetrical, bell-shaped distribution Mean of zero and a unit standard deviation Shape influenced by degrees of freedom

Degrees of Freedom Abbreviated d.f. Number of observations Number of constraints Confidence Interval Estimate Using the t-distribution or µ X ±. l. t c S X Upper limit X + Lower limit X t t c. l. c. l. S n S n

Confidence Interval Estimate Using the t-distribution µ X t c.l. S X S n population mean sample mean critical value of t at a specified confidence level standard error of the mean sample standard deviation sample size Confidence Interval Estimate Using the t-distribution µ X S n 17 X ± t cl s x 3.7 2.66

upper limit 3.7 + 2.12(2.66 17) 5.07 Lower limit 3.7 2.12 (2.66 17 ) 2.33

Hypothesis Test Using the t-distribution Univariate Hypothesis Test Utilizing the t-distribution Suppose that a production manager believes the average number of defective assemblies each day to be 20. The factory records the number of defective assemblies for each of the 25 days it was opened in a given month. The mean X was calculated to be 22, and the standard deviation, S,to be 5.

H 0 : µ 20 H 1 : µ 20 S X S n

Univariate Hypothesis Test Utilizing the t-distribution The researcher desired a 95 percent confidence, and the significance level becomes.05.the researcher must then find the upper and lower limits of the confidence interval to determine the region of rejection. Thus, the value of t is needed. For 24 degrees of freedom (n-1, 25-1), the t-value is 2.064. Lower limit: µ t c S 20 2.064 5/ 20 2.064( 1) 17.936. l. X 25

Upper limit: µ + t c. l. S + X 20 2.0645/ 20+ 2.0641 ( ) 20.064 25 Univariate Hypothesis Test t-test t obs X µ S X 22 20 1 2 1 2

Testing a Hypothesis about a Distribution Chi-Square test Test for significance in the analysis of frequency distributions Compare observed frequencies with expected frequencies Goodness of Fit Chi-Square Test x O i E i E i

Chi-Square Test x² chi-square statistics O i observed frequency in the i th cell E i expected frequency on the i th cell Chi-Square Test Estimation for Expected Number for Each Cell ij i j

Chi-Square Test Estimation for Expected Number for Each Cell R i total observed frequency in the i th row C j total observed frequency in the j th column n sample size Univariate Hypothesis Test Chi-square Example X 2 ( ) 2 O E ( O E ) 1 E 1 1 + 2 E 2 2 2

Univariate Hypothesis Test Chi-square Example X ( ) 2 60 50 ( 40 50) 2 + 50 4 50 2 Hypothesis Test of a Proportion π is the population proportion p is the sample proportion π is estimated with p

Hypothesis Test of a Proportion H 0 : π. 5 H 1 : π ¹. 5 S p ( 0.6)( 0.4) 100.24 100.0024. 04899

Zobs p π. 6.5 S.04899 p.1 2. 04.04899 Hypothesis Test of a Proportion: Another Example n 1,200 p.20 S p S p S p S p pq n (.2)(.8) 1200.16 1200.000133 S p.0115

Hypothesis Test of a Proportion: Another Example n 1,200 p.20 S p S p S p S p pq n (.2)(.8) 1200.16 1200.000133 S p.0115 p π Z S Hypothesis Test of a Proportion: Another Example p.20.15 Z.0115.05 Z.0115 Z 4.348 The Z value exceeds 1.96,so the null hypothesis should be rejected at Indeed it is significant beyond the.001 the.05 level.