Quantitative Data Analysis
|
|
|
- Ursula Welch
- 10 years ago
- Views:
Transcription
1 Quantitative Data nalysis -. Quantitative Data nalysis analysis-quant-xi-1 ( version 0.7, 1/4/05 ) Code: analysis-quant Daniel K. Schneider, TECF, University of Geneva Menu 1. Scales and "data assumptions" 2 2. The principle of statistical analysis 5 3. Stages of statistical analysis 6 4. Data preparation and composite scale making 7 5. Overview on statistical methods and coefficients Crosstabulation Simple analysis of variance Regression nalysis and Pearson Correlations Exploratory Multi-variate nalysis 22
2 Quantitative Data nalysis - 1. Scales and "data assumptions" 1. Scales and "data assumptions" analysis-quant-xi Types of quantitative measures (scales) Types of measures Description Examples nominal or category enumeration of categories ordinal ordered scales 1st, 2nd, 3rd interval or quantitative or "scale" (in SPSS) male, female district, district B, software widget, widget B measure with an interval 1, 10, 5, 6 (on a scale from 1-10) 180cm, 160cm, 170cm For each type of measure or combinations of types of measure you will have to use different analysis techniques. For interval variables you have a bigger choice of statistical techniques. Therefore scales like (1) strongly agree, (2) agree, (3) somewhat agree, etc. usually are treated as interval variables.
3 Quantitative Data nalysis - 1. Scales and "data assumptions" 1.2 Data assumptions analysis-quant-xi-3 not only you have to adapt your analysis techniques to types of measures but they also (roughly) should respect other data assumptions.. Linearity Example: Most popular statistical methods for interval data assume linear relationships: In the following example the relationship is non-linear: students that show weak daily computer use have bad grades, but so do they ones that show very strong use. Popular measures like the Pearson s r will "not work", i.e. you will have a very weak correlation and therefore miss this non-linear relationship student grades (average) daily use of computers
4 Quantitative Data nalysis - 1. Scales and "data assumptions" B. Normal distribution analysis-quant-xi-4 Most methods for interval data also require "normal distribution" If you have data with "extreme cases" and/or data that is skewed, some individuals will have much more "weight" than the others. Hypothetical example: The "red" student who uses the computer for very long hours will determine a positive correlation and positive regression rate, whereas the "black" ones suggest an inexistent correlation. Mean use of computers does not represent "typical" usage. The "green" student however, will not have a major impact on the result, since the other data are well distributed along the 2 axis. In this second case the "mean" represents a "typical" student. student grades (average) student grades (average) weekly use of computers weekly use of computers
5 Quantitative Data nalysis - 2. The principle of statistical analysis 2. The principle of statistical analysis The goal of statistical analysis is quite simple: find structure in the data analysis-quant-xi-5 DT = STRUCTURE + NON-STRUCTURE DT = EXPLINED VRINCE + NOT EXPLINED VRINCE Example: Simple regression analysis DT = predicted regression line + residuals in other words: regression analysis tries to find a line that will maximize prediction and minimize residuals y=student grades (average) not predicted (error, residuals) predicted (explained) x=weekly use of computers
6 Quantitative Data nalysis - 3. Stages of statistical analysis 3. Stages of statistical analysis analysis-quant-xi-6 Note: With statistical data analysis programs you easily can do several steps in one operation. 1. Clean your data Make very sure that your data are correct (e.g. check data transcription) Make very sure that missing values (e.g. not answered questions in a survey) are clearly identified as missing data 2. Gain knowledge about your data Make lists of data (for small data sets only!) Produce descriptive statistics, e.g. means, standard-deviations, minima, maxima for each variable Produce graphics, e.g. histograms or box plot that show the distribution 3. Produce composed scales E.g. create a single variable from a set of questions 4. Make graphics or tables that show relationships E.g. Scatter plots for interval data (as in our previous examples) or crosstabulations 5. Calculate coefficients that measure the strength and the structure of a relation Strength examples: Cramer s V for crosstabulations, or Pearson s R for interval data Structure examples: regression coefficient, tables of means in analysis of variance 6. Calculate coefficients that describe the percentage of variance explained E.g. R2 in a regression analysis 7. Compute significance level, i.e. find out if you have to right to interpret the relation E.g. Chi-2 for crosstabs, Fisher s F in regression analysis
7 Quantitative Data nalysis - 4. Data preparation and composite scale making 4. Data preparation and composite scale making analysis-quant-xi Statistics programs and data preparation Statistics programs If available, plan to use a real statistics program like SPSS or Statistica Good freeware: WinIDMS (statistical analysis require the use of a command language) url: Freeware for advanced statistics and data visualization: R (needs good IT skills!) url: Using programs like Excel will make you loose time only use such programs for simple descriptive statistics ok if the main thrust of your thesis does not involve any kind of serious data analysis Data preparation Enter the data ssign a number to each response item (planned when you design the questionnaire) Enter a clear code for missing values (no response), e.g. -1 Make sure that your data set is complete and free of errors Some simple descriptive statistics (minima, maxima, missing values, etc.) can help Learn how to document the data in your statistics program Enter labels for variables, labels for responses items, display instructions (e.g. decimal points to show) Define data-types (interval, ordinal or nominal)
8 Quantitative Data nalysis - 4. Data preparation and composite scale making 4.2 Composite scales (indicators) analysis-quant-xi-8 Basics: Most scales are made by simply adding the values from different items (sometimes called "Lickert scales") Eliminate items that have a high number of non responses Make sure to take into account missing values (non responses) when you add up the responses from the different items real statistics program (SPSS) does that for you Make sure when you create your questionnaire that all items use the same range of response item, else you will need to standardize!! Quality of a scale: gain: use a published set of items to measure a variable (if available) if you do, you can avoid making long justifications! Sensitivity: questionnaire scores discriminate e.g. if exploratory research has shown higher degree of presence in one kind of learning environment than in an other one, results of presence questionnaire should demonstrate this. Reliability: internal consistency is high Intercorrelation between items (alpha) is high Validity: results obtained with the questionnaire can be tied to other measures e.g. were similar to results obtained by other tools (e.g. in depth interviews), e.g. results are correlated with similar variables.
9 Quantitative Data nalysis - 4. Data preparation and composite scale making analysis-quant-xi-9 Exemple 4-1: The COLLES surveys url: The Constructivist On-Line Learning Environment Surveys include one to measure preferred (or ideal) experience in a teaching unit. It includes 24 statements measuring 6 dimensions. We only show the first two (4 questions concerning relevance and 4 questions concerning reflection). Note that in the real questionnaire you do not show labels like "Items concerning relevance" or "response codes". Statements lmost Never Seldom Sometimes Often response codes Items concerning relevance a. my learning focuses on issues that interest me. O O O O O b. what I learn is important for my prof. practice as a trainer. O O O O O c. I learn how to improve my professional practice as a trainer. O O O O O d. what I learn connects well with my prof. practice as a trainer. O O O O O Items concerning Reflection I think critically about how I learn. O O O O O I think critically about my own ideas. O O O O O I think critically about other students' ideas. O O O O O I think critically about ideas in the readings. O O O O O lmost lways
10 Quantitative Data nalysis - 4. Data preparation and composite scale making lgorithm to compute each scale: analysis-quant-xi-10 for each individual add response codes and divide by number of items or use a "means" function in your software package: relevance = mean (a, b, c, d) Examples: Individual who answered a=sometimes, b=often, c=almost always, d= often gives: ( ) / 4 = 4 Missing values (again) Make sure that you do not add "missing values" Individual B who answered a=sometimes, b=often, c=almost always, d=missing gives: ( ) / 3 = 4 and certainly NOT: ( ) / 4 or ( ) / 4!!
11 Quantitative Data nalysis - 5. Overview on statistical methods and coefficients 5. Overview on statistical methods and coefficients analysis-quant-xi Descriptive statistics Descriptive statistics are not very interesting in most cases (unless they are used to compare different cases in comparative systems designs) Therefore, do not fill up pages of your thesis with tons of Excel diagrams!! Some popular summary statistics for interval variables Mean Median: the data point that is in the middle of "low" and "high" values Standard deviation: the mean deviation from the mean, i.e. how far a typical data point is away from the mean. High and Low value: extremes a both end Quartiles: same thing as median for 1/4 intervals
12 Quantitative Data nalysis - 5. Overview on statistical methods and coefficients 5.2 Which data analysis for which data types? analysis-quant-xi-12 Popular bi-variate analysis Independent (explaining) variable X Quantitative (interval) Dependant variable Y Qualitative (nominal or ordinal) Transform X into a qualitative Quantitative Correlation and Regression variable and see below Qualitative nalysis of variance Crosstabulations Popular multi-variate anaylsis Independent (explaining) variable X Quantitative Quantitative (interval) Factor nalysis, multiple regresstion, SEM, Cluster nalysis, Dependant variable Y Qualitative (nominal or ordinal) Transform X into a qualitative variable and see below Qualitative nova Multidimensional scaling etc.
13 Quantitative Data nalysis - 5. Overview on statistical methods and coefficients 5.3 Types of statistical coefficients: analysis-quant-xi-13 First of all make sure that the coefficient you use is more or less appropriate for you data The big four: 1. Strength of a relation Coefficients usually range from -1 (total negative relationship) to +1 (total positive relationship). 0 means no relationship. 2. Structure (tendency) of a relation 3. Percentage of variance explained 4. Signification level of your model Gives that chance that you are in fact gambling Typically in the social sciences a sig. level lower than 5% (0.05) is acceptable Do not interpret data that is above! These four are mathematically connected: E.g. Signification is not just dependent on the size of your sample, but also on the strength of a relation.
14 Quantitative Data nalysis - 6. Crosstabulation 6. Crosstabulation analysis-quant-xi-14 Crosstabulation is a popular technique to study relationships between normal (categorical) or ordinal variables Computing the percentages (probabilities) See the example on the next slides 1. For each value of the explaining (independent) variable compute de percentages Usually the X variable is put on top (i.e. its values show in columns). If you don t you have to compute percentages across lines! Remember this: you want to know the probability (percentage) that a value of X leads to a value of Y 2. Compare (interpret) percentages across the dependant (to be explained) variable Statistical association coefficients (there are many!) Phi is a chi-square based measure of association and is usually used for 2x2 tables The Contingency Coefficient (Pearson's C). The contingency coefficient is an adjustment to phi, intended to adapt it to tables larger than 2-by-2. Somers' d is a popular coefficient for ordinal measures (both X and Y). Two variants: symmetric and Y dependant on X (but less the other way round). Statistical significance tests Pearson's chi-square is by far the most common. If simply "chi-square" is mentioned, it is probably Pearson's chi-square. This statistic is used to text the hypothesis of no association of columns and rows in tabular data. It can be used with nominal data.
15 Quantitative Data nalysis - 6. Crosstabulation analysis-quant-xi-15 Exemple 6-1: Crosstabulation vez-vous reçu une formation à l'informatique?* Créer des documents pour afficher en classe X= vez-vous reçu une formation à l'informatique? Non Oui The probability that computer training ("oui") leads to superior usage of the computer to prepare documents is very weak (you can see this by comparing the % line by line. Statistics: Pearson Chi-Square = 1.15 with a signification=.562 This means that the likelihood of results being random is > 50% and you have to reject relationship Contingency coefficient = 0.115, significance =.562 Not only is the relationship very weak (but it can t be interpreted) Total Régulièrement Effectif Y= Utilisez-vous % dans X 44.4% 58.4% 57.0% l ordinateur pour Occasionnellement Effectif créer des documents pour afficher % dans X 44.4% 27.3% 29.1% en classe? 2 Jamais Effectif % dans X 11.1% 14.3% 14.0% Total Effectif % dans X 100.0% 100.0% 100.0%
16 Quantitative Data nalysis - 6. Crosstabulation analysis-quant-xi-16 Exemple 6-2: Crosstabulation: Pour l'élève, le recours aux ressources de réseau favorise l'autonomie dans l'apprentissage * Rechercher des informations sur Internet 0 Régulièrement Y= Rechercher des 1 Occasionnellement informations sur Internet 2 Jamais Total X= Pour l'élève, le recours aux ressources de réseau favorise l'autonomie dans l'apprentissage 0 Tout à fait en désaccord 1 Plutôt en désaccord 2 Plutôt en accord 3 Tout à fait en accord Total Count % within X.0% 18.2% 19.6% 42.3% 25.6% Count % within X 33.3% 63.6% 50.0% 42.3% 48.8% Count % within X 66.7% 18.2% 30.4% 15.4% 25.6% Count % within X 100.0% 100.0% 100.0% 100.0% 100.0% We have a weak significant relationship: the more teachers agree that students will increase learning autonomy from using Internet resources, the more they will let students do so. Statistics: Directional Ordinal by Ordinal Measures with Somer s D Values Somer s D Significance Symmetric Y = Rechercher des informations sur Internet Dependent
17 Quantitative Data nalysis - 7. Simple analysis of variance 7. Simple analysis of variance analysis-quant-xi-17 nalysis of variance (and it s multi-variate variant nova) are the favorite tools of the experimentalists. X is an experimental condition (therefore a nominal variable) and Y usually is an interval variable. E.g. Does presence or absence of ICT usage influence grades? You can show that X has an influence on Y if means achieved by different groups (e.g. ICT vs. non-ict users) are significantly different. Significance improves when: means of the X groups are different (the further apart the better) variance inside X groups is low (certainly lower than the overall variance)
18 Quantitative Data nalysis - 7. Simple analysis of variance Exemple 7-1: Differences between teachers and teacher students analysis-quant-xi-18 Population 1 Etudiant(e) LME 2 Enseignant(e) du primaire Total COP2 Fréquence des COP1 Fréquence de activités d'exploration différentes manières à l'extérieur de la de travailler des élèves classe Mean N Std. Deviation Mean N Std. Deviation Mean N Std. Deviation COP3 Fréquence des travaux individuels des élèves COP1, COP2, COP3 sont des indicateurs composé allant de 0 (peu) et 2 (beaucoup) The difference for COP2 is not significant (see next slide) Standard deviations within groups are rather high (in particular for students), which is a bad thing: it means that among students they are highly different.
19 Quantitative Data nalysis - 7. Simple analysis of variance nova Table and measures of associations analysis-quant-xi-19 Var_COP1 Fréquence de différentes manières de travailler des élèves * Population_bis Population Var_COP2 Fréquence des activités d'exploration à l'extérieur de la classe * Population_bis Population Var_COP3 Fréquence des travaux individuels des élèves * Population_bis Population Measures of ssociation associations are week and explained variance very weak Sum of Squares df Mean Square F Sig. Between Groups Within Groups Total Between Groups Within Groups Total Between Groups Within Groups Total Eta Eta Squared Var_COP1 Fréquence de différentes manières de travailler des élèves * Population Var_COP2 Fréquence des activités d'exploration à l'extérieur de la classe * Population Var_COP3 Fréquence des travaux individuels des élèves * Population
20 Quantitative Data nalysis - 8. Regression nalysis and Pearson Correlations 8. Regression nalysis and Pearson Correlations analysis-quant-xi-20 Exemple 8-1: Does teacher age explain exploratory activities outside the classroom? Independant variable: GE Dependent variable: Fréquence des activités d'exploration à l'extérieur de la classe Model Summary R Model Coefficients R Square djusted R Square ll this means: We have a week relation (.316) between age and exploratory activities. It is significant (.027) Formally the relation is: exploration scale = * GE (roughly: only people over 99 are predicted a top score of 2) Std. Error of the Estimate Pearson Correlation Sig. (1-tailed) N Coefficients Stand. coeff. t Sig. Correlations B Std. Error Beta Zero-order (Constant) GE ge Dependent Variable: Var_COP2 Fréquence des activités d'exploration à l'extérieur de la classe
21 Quantitative Data nalysis - 8. Regression nalysis and Pearson Correlations analysis-quant-xi-21 Here is a scatter plot of this relation No need for statistical coefficients to see that the relation is rather week and why the prediction states that it takes a 100 years... :) Fréquence des activités d'exploration à l'extérieur de la classe Fréquence des activités d'exploration à l'extérieur de la classe = * GE R-Square = Linear Regression with 95.00% Mean Prediction Interval ge
22 Quantitative Data nalysis - 9. Exploratory Multi-variate nalysis 9. Exploratory Multi-variate nalysis analysis-quant-xi-22 There many techniques, here we just introduce cluster analysis, e.g. Factor nalysis (principal components) or Discriminant analysis are missing here 9.1 Cluster nalysis Cluster analysis or classification refers to a set of multivariate methods for grouping elements (subjects or variables) from some finite set into clusters of similar elements (subjects or variables). There 2 different kinds: hierarchical cluster analysis and K-means cluster. Typical examples: Classify teachers into 4 to 6 different groups regarding ICT usage Exemple 9-1: Gonzalez classification of teachers hierarchical analysis allow to identify 6 major types of teachers Type 1 : l enseignant convaincu Type 2 : les enseignants actifs Type 3 : les enseignants motivés ne disposant pas d un environnement favorable Type 4 : les enseignants volontaires, mais faibles dans le domaine des technologies Type 5 : l enseignant techniquement fort mais peu actif en TIC Type 6 : l enseignant à l aise malgré un niveau moyen de maîtrise
23 Quantitative Data nalysis - 9. Exploratory Multi-variate nalysis Dendogram (tree diagram of the population) analysis-quant-xi-23 vertical ordering is of no importance case 9 and case 11 are very close case 18 and case 12 are quite close cluster number case 1 and case 6 are very different distance of cases (horizonal)
24 Quantitative Data nalysis - 9. Exploratory Multi-variate nalysis Statistics of a subset of the 36 variables used for analysis: analysis-quant-xi-24 Final note: confirmatory multivariate analysis (e.g. structural equation modelling) is not even mentionnend in this document
January 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
The Dummy s Guide to Data Analysis Using SPSS
The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests
SPSS TUTORIAL & EXERCISE BOOK
UNIVERSITY OF MISKOLC Faculty of Economics Institute of Business Information and Methods Department of Business Statistics and Economic Forecasting PETRA PETROVICS SPSS TUTORIAL & EXERCISE BOOK FOR BUSINESS
Lecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
Data exploration with Microsoft Excel: analysing more than one variable
Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected]
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected] Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:
Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
IBM SPSS Statistics for Beginners for Windows
ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning
IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
SPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
Simple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
How to Get More Value from Your Survey Data
Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2
Working with SPSS. A Step-by-Step Guide For Prof PJ s ComS 171 students
Working with SPSS A Step-by-Step Guide For Prof PJ s ComS 171 students Contents Prep the Excel file for SPSS... 2 Prep the Excel file for the online survey:... 2 Make a master file... 2 Clean the data
Diagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models
Factor Analysis Principal components factor analysis Use of extracted factors in multivariate dependency models 2 KEY CONCEPTS ***** Factor Analysis Interdependency technique Assumptions of factor analysis
An introduction to IBM SPSS Statistics
An introduction to IBM SPSS Statistics Contents 1 Introduction... 1 2 Entering your data... 2 3 Preparing your data for analysis... 10 4 Exploring your data: univariate analysis... 14 5 Generating descriptive
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics
Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This
The Chi-Square Test. STAT E-50 Introduction to Statistics
STAT -50 Introduction to Statistics The Chi-Square Test The Chi-square test is a nonparametric test that is used to compare experimental results with theoretical models. That is, we will be comparing observed
Chapter 23. Inferences for Regression
Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily
Working with data: Data analyses April 8, 2014
Working with data: Data analyses April 8, 2014 Housekeeping notes This webinar will be recorded, and will be available on the Centre s website as an educational resource The slides have been sent to participants
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
Introduction to Statistics with SPSS (15.0) Version 2.3 (public)
Babraham Bioinformatics Introduction to Statistics with SPSS (15.0) Version 2.3 (public) Introduction to Statistics with SPSS 2 Table of contents Introduction... 3 Chapter 1: Opening SPSS for the first
DATA INTERPRETATION AND STATISTICS
PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE
LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE
LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-
Bivariate Statistics Session 2: Measuring Associations Chi-Square Test
Bivariate Statistics Session 2: Measuring Associations Chi-Square Test Features Of The Chi-Square Statistic The chi-square test is non-parametric. That is, it makes no assumptions about the distribution
Statistics. Measurement. Scales of Measurement 7/18/2012
Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce
Independent t- Test (Comparing Two Means)
Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent
4. Descriptive Statistics: Measures of Variability and Central Tendency
4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and
DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
Research Methods & Experimental Design
Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
PELLISSIPPI STATE COMMUNITY COLLEGE MASTER SYLLABUS INTRODUCTION TO STATISTICS MATH 2050
PELLISSIPPI STATE COMMUNITY COLLEGE MASTER SYLLABUS INTRODUCTION TO STATISTICS MATH 2050 Class Hours: 2.0 Credit Hours: 3.0 Laboratory Hours: 2.0 Date Revised: Fall 2013 Catalog Course Description: Descriptive
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 [email protected] 1. Descriptive Statistics Statistics
Data analysis process
Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation
SPSS Tests for Versions 9 to 13
SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list
Using SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19
PREFACE xi 1 INTRODUCTION 1 1.1 Overview 1 1.2 Definition 1 1.3 Preparation 2 1.3.1 Overview 2 1.3.2 Accessing Tabular Data 3 1.3.3 Accessing Unstructured Data 3 1.3.4 Understanding the Variables and Observations
Directions for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar
business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel
An introduction to using Microsoft Excel for quantitative data analysis
Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
A Basic Guide to Analyzing Individual Scores Data with SPSS
A Basic Guide to Analyzing Individual Scores Data with SPSS Step 1. Clean the data file Open the Excel file with your data. You may get the following message: If you get this message, click yes. Delete
Data Analysis for Marketing Research - Using SPSS
North South University, School of Business MKT 63 Marketing Research Instructor: Mahmood Hussain, PhD Data Analysis for Marketing Research - Using SPSS Introduction In this part of the class, we will learn
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Overview of Factor Analysis
Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,
WHAT IS A JOURNAL CLUB?
WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club
Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools
Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................
Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini
NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building
IBM SPSS Statistics 20 Part 1: Descriptive Statistics
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 1: Descriptive Statistics Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013
Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives
Instructions for SPSS 21
1 Instructions for SPSS 21 1 Introduction... 2 1.1 Opening the SPSS program... 2 1.2 General... 2 2 Data inputting and processing... 2 2.1 Manual input and data processing... 2 2.2 Saving data... 3 2.3
STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance
Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
Name: Date: Use the following to answer questions 2-3:
Name: Date: 1. A study is conducted on students taking a statistics class. Several variables are recorded in the survey. Identify each variable as categorical or quantitative. A) Type of car the student
Elements of statistics (MATH0487-1)
Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -
Foundation of Quantitative Data Analysis
Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1
430 Statistics and Financial Mathematics for Business
Prescription: 430 Statistics and Financial Mathematics for Business Elective prescription Level 4 Credit 20 Version 2 Aim Students will be able to summarise, analyse, interpret and present data, make predictions
Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:
Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:
UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST
UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
Exploratory Data Analysis
Exploratory Data Analysis Johannes Schauer [email protected] Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
Association Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
Introduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
MULTIPLE REGRESSION EXAMPLE
MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if
Study Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
Chapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
Multiple Regression. Page 24
Multiple Regression Multiple regression is an extension of simple (bi-variate) regression. The goal of multiple regression is to enable a researcher to assess the relationship between a dependent (predicted)
Exploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)
1 Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) I. Authors should report effect sizes in the manuscript and tables when reporting
Introduction to Statistics and Quantitative Research Methods
Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.
COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
The correlation coefficient
The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative
Simple Linear Regression, Scatterplots, and Bivariate Correlation
1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.
Linear Models in STATA and ANOVA
Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples
International Statistical Institute, 56th Session, 2007: Phil Everson
Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: [email protected] 1. Introduction
Calculating the Probability of Returning a Loan with Binary Probability Models
Calculating the Probability of Returning a Loan with Binary Probability Models Associate Professor PhD Julian VASILEV (e-mail: [email protected]) Varna University of Economics, Bulgaria ABSTRACT The
Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)
Cairo University Faculty of Economics and Political Science Statistics Department English Section Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study) Prepared
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.
How to Make APA Format Tables Using Microsoft Word
How to Make APA Format Tables Using Microsoft Word 1 I. Tables vs. Figures - See APA Publication Manual p. 147-175 for additional details - Tables consist of words and numbers where spatial relationships
Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots
Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th
Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th Standard 3: Data Analysis, Statistics, and Probability 6 th Prepared Graduates: 1. Solve problems and make decisions that depend on un
TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2
About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing
An analysis method for a quantitative outcome and two categorical explanatory variables.
Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that
Projects Involving Statistics (& SPSS)
Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different
CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V
CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V Chapters 13 and 14 introduced and explained the use of a set of statistical tools that researchers use to measure
Analyzing Research Data Using Excel
Analyzing Research Data Using Excel Fraser Health Authority, 2012 The Fraser Health Authority ( FH ) authorizes the use, reproduction and/or modification of this publication for purposes other than commercial
