DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.



Similar documents
Descriptive Statistics

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Descriptive Statistics and Measurement Scales

Means, standard deviations and. and standard errors

Statistics. Measurement. Scales of Measurement 7/18/2012

The correlation coefficient

MEASURES OF VARIATION

II. DISTRIBUTIONS distribution normal distribution. standard scores

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

Module 3: Correlation and Covariance

Correlation key concepts:

Section 3 Part 1. Relationships between two numerical variables

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Introduction to Quantitative Methods

Introduction; Descriptive & Univariate Statistics

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo)

Descriptive statistics parameters: Measures of centrality

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

Exercise 1.12 (Pg )

Lesson 4 Measures of Central Tendency

Measurement & Data Analysis. On the importance of math & measurement. Steps Involved in Doing Scientific Research. Measurement

Statistics Review PSY379

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

Study Guide for the Final Exam

DATA COLLECTION AND ANALYSIS

Chapter 2 Statistical Foundations: Descriptive Statistics

Foundation of Quantitative Data Analysis

CALCULATIONS & STATISTICS

Measures of Central Tendency and Variability: Summarizing your Data for Others

Additional sources Compilation of sources:

6.4 Normal Distribution

How To Write A Data Analysis

Data Exploration Data Visualization

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

MBA 611 STATISTICS AND QUANTITATIVE METHODS

DATA INTERPRETATION AND STATISTICS

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

UNIVERSITY OF NAIROBI

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

Describing Data: Measures of Central Tendency and Dispersion

Using Excel for inferential statistics

Midterm Review Problems

Diagrams and Graphs of Statistical Data

Northumberland Knowledge

Exploratory Data Analysis. Psychology 3256

Summarizing and Displaying Categorical Data

Algebra I Vocabulary Cards

Interpreting Data in Normal Distributions

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data

Expression. Variable Equation Polynomial Monomial Add. Area. Volume Surface Space Length Width. Probability. Chance Random Likely Possibility Odds

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

MEASURES OF CENTER AND SPREAD MEASURES OF CENTER 11/20/2014. What is a measure of center? a value at the center or middle of a data set

Measurement with Ratios

Introduction to Statistics and Quantitative Research Methods

Math 1. Month Essential Questions Concepts/Skills/Standards Content Assessment Areas of Interaction

Simple linear regression

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

Simple Regression Theory II 2010 Samuel L. Baker

DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Common Core Unit Summary Grades 6 to 8

Chapter 2: Frequency Distributions and Graphs

Biggar High School Mathematics Department. National 5 Learning Intentions & Success Criteria: Assessing My Progress

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

Mathematics. Mathematical Practices

Describing and presenting data

Univariate Regression

Scatter Plots with Error Bars

Geostatistics Exploratory Analysis

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab


Name: Date: Use the following to answer questions 2-3:

January 26, 2009 The Faculty Center for Teaching and Learning

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Homework 11. Part 1. Name: Score: / null

Common Tools for Displaying and Communicating Data for Process Improvement

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

GeoGebra. 10 lessons. Gerrit Stols

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

Section 1.3 Exercises (Solutions)

Standard Deviation Estimator

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Transcription:

DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize, summarize & describe the data Inferential To determine reliability of the data

RELATIONSHIPS SCALES OF MEASURMENT Nominal Scale Only use those statistical procedures that rely on counting -- the number (N) in the sample. Ordinal Scale Same as nominal scale Can use statistics that indicate points below which certain percentages of the cases fall.

RELATIONSHIPS SCALES OF Interval Scale MEASURMENT Any of the above plus procedures that include adding. Ratio Scale Any statistical procedure is acceptable.

MEASUREMENT SUMMARY Measurement Characteristics Scoring Types Examples Nominal Lowest level -- used to classify variables into two or more categories. Cases placed in the same category must be equivalent. The categories must be exhaustive -- all persons or items must fit into one of the categories. Counting N in sample Labels or # s No relation between # s N of sample Mode Range Football player jerseys 48 not better than 36 Race Gender Must also be mutually exclusive -- one person or item can't fit more than one category.

MEASUREMENT SUMMARY Measurement Characteristics Scoring Types Examples Ordinal Numbers only used to indicate the rank order of cases of a variable. Cannot measure or evaluate the difference in value between each case. No mathematical or statistical operations (you can't add label 1 to label 2, etc.). Points below which certain % falls. Size of distance between intervals unknown. Order of objects with respect to an attribute. Frequency distribution Median Quartile deviation Spearman rho coefficient of correlation Hardness of metal Personnel evaluations of performance

MEASUREMENT SUMMARY Measurement Characteristics Scoring Types Examples Interval Has all of the above characteristics. Added requirement of equal distances or intervals between labels -- represent equal distances in the variables of your study. = intervals w/ arbitrary origin No true zero Adding Mean Standard deviation Variance Pearson product moment coefficient of correlation Temperature difference Footcandle levels in lighting IQ s

MEASUREMENT SUMMARY Measurement Characteristics Scoring Types Examples Ratio Has all of above features plus an absolute zero point. Enables you to multiple and divide scale numbers to create ratios between labels. Equal intervals Multiply Divide All types Income ranges. Number of years of school. Age in years. Yardstick or architect s scale.

FREQUENCY DISTRIBUTIONS The arrangement of the scores from lowest to highest. Implies a general shape to the data because of the shape of the distribution.

FREQUENCY DISTRIBUTIONS The easiest way for you to do summary statistics is with a dedicated statistical package. With small data sets, you can do most data manipulation for summary statistics with a spreadsheet.

HISTOGRAMS & POLYGONS: GENERAL RULES On horizontal axis, lay out lowest scores to highest -- left to right. Lay out frequencies on vertical axis -- from 0 up to highest frequency.

HISTOGRAMS & POLYGONS: GENERAL RULES Place a point at center of score/frequency intersection. Construct either a histogram or polygon.

HISTOGRAMS & POLYGONS: Histogram or polygon. GENERAL RULES

MEASURES OF CENTRAL TENDANCY Used to summarize data through a single number that can represent the whole set of scores. Types: mode, median, mode, mean

MEASURES OF CENTRAL TENDANCY Mode The value or number that occurs most frequently in the distribution. Two modes are bi-modal; three or more are tri-model or multi-modal. Very stable and there can be more than one mode. Only appropriate measure for nominal scales.

MEASURES OF CENTRAL Median TENDANCY The point in the distribution below which 50% of the scores lie. Scores must be placed in rank order from lowest to highest first. The median can fall between the upper limit and lower limit of a score. Can fall on the border line between scores.

MEASURES OF CENTRAL TENDANCY Median (continued) The median is an ordinal statistic because it is based on rank. Can be used on interval and ratio data but the interval characteristic of the data is not used. Only time the median is really useful is when there are extreme scores in the distribution.

MEASURES OF CENTRAL Mean TENDANCY The arithmetic average -- sum of all the scores divided by the N. Most stable measure of central tendency and is more precise than the median or mode. Can be used with interval and ratio scales.

MEASURES OF CENTRAL TENDANCY Mean (continued) Can calculate the Mean for a distribution of scores or for a frequency distribution. Best indicator of combined performance whereas the median is the best indicator of typical performance.

DISTRIBUTION SHAPES - The mean and median are the same. If a single mode, it falls at the same location as the mean and median. SYMMETRICAL

DISTRIBUTION SHAPES - When distributions are skewed the values of central tendency differ. Determine skewness by comparing the mean & median without drawing a histogram or polygon. SKEWED

DISTRIBUTION SHAPES - POSITIVE SKEW The mean is always greater than the median & the median is usually greater than the mode. Skew is to the left.

DISTRIBUTION SHAPES - NEGATIVE SKEW The mean is always smaller than the median & the median is usually smaller than the mode. Skew is to the right.

DISTRIBUTION SHAPES - NORMAL CURVE A symmetrical curve with the same number of scores above & below the mean. Same as symmetrical. Most scores are concentrated around the mean. Approximately 68% of the cases are within +/- 1 SD unit from the mean.

VARIABILITY MEASURES Range Difference between the highest and lowest scores. Determine by subtraction. Is an unreliable index of variability because it is derived from only two scores.

VARIABILITY MEASURES Quartile deviation Half the difference between the upper and lower quartiles in a distribution. The 75th percentile & the 25th percentile. Provides a measure of one-half of the range of scores within which lie the middle 50% of the scores. It is an ordinal scale statistic and is used with the median (which means that it is not often used unless there are extreme scores).

VARIABILITY MEASURES Variance Based on the mean. Considers the size and location of individual scores. Variance & standard deviation are based on the deviation score which is the difference between a raw score & the mean. The sum of the deviation scores of a distribution are always zero because the scores above the mean are always positive while the scores below the mean are always negative.

VARIABILITY MEASURES Standard Deviation SD is the square root of variance Is used to summarize data in the same units as the original data. Most commonly used statistic for variability. It is the square root of the mean of the squared deviation scores.

STANDARD SCORES z-scores The distance of a score from the mean in standard deviation units. Scores with the same numerical value as the mean will have a z-score of zero. Used to compare one set of scores to another -- example two exams and S's performance on the exams. Use of z-scores requires use of negative values and fractions. Overcome by using Z-scores.

Z-scores STANDARD SCORES Obtained by multiplying the z-score by 10 and adding 50 to the result. Used to compare scores in different distributions. Allows descriptions in whole numbers. A type of standard score. Does not alter the shape of the original distribution.

CORRELATION Used to describe the relationship between pairs of scores. Shows the extent to which a change in one variable is associated with change in another variable.

Scattergrams CORRELATION Used to show correlation. One variable on each axis (horizontal and vertical). Plot scattergrams to see both direction & strength of a relationship. Direction shows positive or negative relationship. Scores for independent variable on horizontal axis & dependent variable on vertical axis.

Lower left to upper right Positive relationship Low scores on one variable associated with low scores on other High on one high on other. CORRELATION

CORRELATION Upper left to lower right Negative relationship. High on one, low on the other variable.

CORRELATION Narrow dot band High strength. Straight line shows strong relationship between variables.

CORRELATION Scattered dot band Low strength. Relatively weak relationship between variables.

CORRELATION Prediction of one variable from another can occur with strong relationships Positive and negative equally important. The higher the correlation between variables in either a positive or negative direction, the more accurate the prediction.

CORRELATION COEFFICIENTS Range from -1.00 to +1.00. -1.00 = perfect negative relationship. +1.00 = perfect positive relationship. 0.00 (midpoint) = no relationship at all.

CORRELATION COEFFICIENTS Correlation coefficients near unity indicate high degree of relationship. Make accurate prediction about one variable from info about another variable. Desirable to have +/- 0.90 and above. Again, negative & positive both equally good for prediction.

PEARSONS R (PRODUCT MOMENT CORRELATION) Used with either interval or ratio scales. Defined as the mean of z-score products of two variables. Most common method for correlation. Same statistical family as mean.

PEARSONS R (PRODUCT MOMENT CORRELATION) Assumes a linear relationship between the two variables. (Straight line fit between scores of the two variables). If curvilinear, must use other methods.

SPEARMAN RHO Used with rank order data; ordinal scales. Part of the same statistical family as median. Ranges from -1.00 to +1.00 (same as Pearsons R).

SOURCES OF INFO See your bibliography for the class!