Navigator. Statistics

Size: px
Start display at page:

Download "Navigator. Statistics"

Transcription

1 Navigator IBM A SPSS A Statistics (A Vade Mecum for SPSS 19 on Windows7) 1

2 2

3 Navigator IBM A SPSS A Statistics (A Vade Mecum for SPSS 19 on Windows7) 3

4 Preface This is an elementary guide intended for navigating IBM SPSS v19 on the Windows platform. Little or no previous knowledge of SPSS is assumed. The guide is to be treated as an interface to manipulate tools available from IBM SPSS before embarking on statistical applications and their interpretations. Brief comments on data properties and necessary requirements for applying specific statistical techniques have been outlined in the appendix. September 2011 Royal Holloway University of London Further Readings: Frequently Applied Statistical Techniques Multivariate Statistical Analysis P Pal P Pal 4

5 Contents Working with Input Editor 4 Data View 4 Variable View 5 Data Manipulations 6 Pre-processing of Raw Data 7 Compute 8 Re-Code 10 Case Selection 12 Simple Condition; Conjugate Condition 13 Split File 15 Working with Output Editor 17 Controlling Output 18 Export Output to another Application 18 Appendix Basic Statistics 22 Uncertainties Frequently Applied Statistical Techniques Mathematical Models 5

6 NAVIGATOR IBM SPSS 19 Working with the Input When SPSS is opened, the first window that appears on the screen is the Input Editor (Data View). At the top of this Editor, two bars appear: standard menu bar and standard tool bar. The main body consists of a spreadsheet with rows and columns. Like most applications packages under Windows OS, there are arrows on the right hand and bottom margins of the SPSS spreadsheet for scrolling it up, down or side ways as required. At the bottom (left hand corner) two tabbed buttons are located e,g Data View and Variable View/ The standard menu bar shows the following menu items: Item Used for Usage File opening, saving, printing files (extensive) Edit copy, paste, editing files (extensive) View controlling the appearance of software (less extensive) Data organizing data file, code labelling of data values (extensive) Transform recode, compute, defining new variables (extensive) Analyze statistical applications (extensive) Graphs drawing different types of graphs (extensive) Add-ons plug-in additional applications (less frequent) Utilities accessing command languages (less frequent) Window activating a particular window (seldom) Help accessing SPSS help facilities on applications (extensive) Exhibit 1 The Input Editor (Data View) 6

7 The tool bar below the standard menu bar contains tools which are used as short cut tools to perform some of the standard menu functions. The specific function by the individual icon may be displayed by anchoring the mouse pointer on it. Exhibit 1 shows 5 sets of data have been entered in 3 columns. By default, each value has 2 decimal places as the values are entered. The three active columns containing values are automatically labelled VAR00001, VAR00002, VAR00003 as values are typed in. The remaining columns remain passive with blank cells. These columns labelled Var (faintly displayed). By clicking on the Variable View tab, the Input Editor is opened showing the variables. Exhibit 2. Input Editor Variable View This window shows the list of variables entered in the Input Editor (Data View) and information about the types and format including names and other properties of individual variables, together with their character width, decimal places etc. In the above Exhibit, notable elements are: 3 variables with their names, their type (i.e. numeric or string type; here it is numeric type), width (8 character width), decimals (2 decimal places), alignment of data values (right alignment). 7

8 Manipulation of Input data Different techniques are available for the manipulation of input data. In the following sections, a selection of frequently used manipulation methods is outlined. These include: Changing variable names, Defining data properties, Labelling code values, Increase or decrease decimal places. Change Variable names: Steps to Change Variable names (header labels) Anchor mouse pointer to a required cell in the Name column (Col 1 of the Variable View Editor) Type the desired name; Repeat process to all the variable cells as required. Variable names must begin with a letter, have up to 8 characters (without space), Data manipulation: Usually the width of data values should not require any change. However, default decimal places of 2, may need changing. Steps to change decimal places in the data values Anchor mouse pointer to a cell in the Decimal column, (shows the up and down arrow scroll button appears). Click Up arrow to increase or Down arrow to decrease decimal places in the data values. Steps to change value labels Values in some variables may consist of discrete numeric codes representing some class names. These codes may be given their appropriate value labels. Click on Data from the Menu bar and Choose Define Data Properties (The variable scanning window appears) Exhibit 3 Variable scanning window 8

9 Steps (contd) Click on the desired variable from the list and transfer to the right box Click on Continue; next window opens (see Exhibit below) Click on the Variable appearing on the left. Anchor mouse pointer in the Label box (top right hand corner) and type an elaborate label as desired; Type in the labels in the boxes next to each Code value. Finally OK. Exhibit 4 Value edit window Pre-processing of raw data. Pre-processing of raw data may consist of (i) direct arithmetic calculation, or (ii) by application of an in-built function e.g. Log10 or Ln function or exponential or trigonometric functions or (iii) application of a statistical probability function e.g. CDF (cumulative distribution function) Direct calculation may be based on a formula or an equation. The formula expression may contain variables (entered as input data) and constants. A variety of in-built functions are offered for transformation of raw data values. Any specific function may be selected from a list of functions. Data values contained selected variable(s) from the input list are used as the argument(s) of the selected function. 9

10 (a) Pre-processing by Direct calculation using an expression Here use constant numeric values, data variables, and operator symbols are used from the numeric key pad of the Compute dialog box Example 1: Calculate AGE from the raw data set containing the Month and Year of birth (to 2011) This is to be calculated with reference to the year Suggested formula is AGE = (2011 Year) + (12 Month)/12 Here AGE is the target variable in which the computed result values go and Year and Month variables are provided as raw data input variables (See Exhibit 3) Exhibit 5 Compute Dialog box. The operators (e.g. +, -, *, / operators), parantheses, and numeric constants are available from the key pad displayed by the Compute box, the input variables list appears on the left hand side pane. Steps to compute a new variable Transform Compute (Compute dialog box appears) Enter desired name in the Target box Enter the desired formula to compute in the Numeric Expression box Finally OK 10

11 Note: In the Transform --- Compute process (SPSS 19), when the entries are made from the key board, these entries are recorded as a syntax document. This syntax document appears after the final OK is clicked. The sequence is illustrated with the following simple computational example. Example: Conversion of Centigrade values to Fahrenheit scale. Formula Fahrenheit = (5/9)*Centigrade + 32 In the Compute process, we chose Fahrenheit as the Target variable and in the Function box, we type the formula by picking the appropriate symbols and operators and finally OK. The syntax representation appears on the screen. This is shown in the Exhibit. Exhibit 6 Syntax file on the foreground and Data file on the background. The Syntax representation may be cancelled and the results (in this case the Fahrenheit scale values) appear as a new variable data as follows. 11

12 .Exhibit 7 The results after pre-processing are shown as the target variable. (b) Computation by using a function For computation of any in-built function on the data values, select required function from the functions group list (top half of the box) and select any procedure function from special variables group (bottom half). Example 2: (1 ) Compute Log10 of AGE and (2 ) natural log of AGE The formulas for these computations are as follows: Log_AGE = Lg10 (AGE) LnAGE = LN (AGE) Here LOG_AGE and LNAGE are the Target variables for Log10 and natural log transformations respectively, and LG10 and Ln are the corresponding functions to be applied to the variable AGE. Steps Transform Compute (Compute dialog box appears) Enter desired name in the Target box Enter the desired function in Numeric Expression box Enter required argument from the variable list Finally OK. (For entering a function, click on the up arrow button at the bottom right hand corner of the key pad. The function is displayed which requires appropriate arguments.) 12

13 From the input variable list highlight the appropriate variable and enter as the required argument.) Exhibit 8 (Using Functions on Variables) These pre-processed results (after applying chosen functions) appear in the Input Editor as new Variables with the name that was given as the Target Variable in the Compute process. Exhibit 9.(Input Editor displaying Computed Values) 13

14 Re-Code Re-code tool is useful to hold new specified (re-coded) values from an existing variable. Re-code mechanism allows to form a categorical variable from an existing variable (old) containing continuous data. Steps to re-code Transform Re-code (select into different variable option) (Re-code dialog box appears) Exhibit 8 (Re-code Dialog box) Steps (Contd). Select the variable to be re-coded from the input variable list Type an output variable in the Output variable box and click Change Click Old and New values button (Old and New values box appears) Exhibit 10. (Old and New values Dialog box) In this box three range selection options are available (i) Range through (ii) Range Lowest through ---- Value (iii) Range Value ---- through Highest 14

15 Steps (Contd Enter the limits of each range in appropriate blank boxes Type a desired value in the New value box, and click ADD (this New value is displayed in Old New box Continue (Return to the previous dialog box) Finally OK. Exhibit 11 The re-coded values appear in a new column in the Input Editor. Exhibit 11 15

16 Selection of a sub-set of data It is often necessary to analyse values contained in a variable by sub-groups. The subsets are to be specified in terms of some categorical variable. Two variables are in this process. Analysis of data is achieved by the Split file tool which organises the output by the specified sub-groupings. This selects only the desired sub-set from the full data file. The selection may be based on simple conditions or compound conditions. (a) Case selection with simple condition Example 3. Given a sample of candidates with their gender and marital status, select (a) the female candidates (b) the female candidates who are married. Calculate (a) the average age of the candidates (b) the average age of male and (c) average age of female candidates. Gender Maritals Age Codes: Gender male = female = Maritals Unmarried = Married = Select female cases only (i.e. Gender = 2) Steps Data Choose Select Cases (the select cases window appears) This window shows the list of all the variables entered (left hand pane) and All cases selected as default Choose the If condition satisfied option by clicking in the adjacent circle to open the next window (see Exhibit) 16

17 Finally OK Exhibit 12 (Select Cases Conditions Dialog box). The exhibit shows all cases are selected. Exhibit 13 (Condition applied) Steps (contd). Choose appropriate variable (for the given Example, this is Gender) Transfer to the right hand side box. 17

18 Paste = symbol, followed by 2, ( i.e. Gender = 2) Continue (return to the previous window) This window now shows the instruction next to the If Condition satisfied option, Finally OK. The results of the condition are applied in the Input Editor by selecting the selected cases and de-selecting the remaining cases. (see left hand margin and Filter_$ col). Exhibit 14 (Input Editor showing the selection with a simple condition) (b) Case selection with compound condition With the sample example Select cases for female and married candidates. This is a compound condition case. In order to apply the condition, two variables are to be sought (i) gender from which gender = 2 condition is to be employed and (ii) maritals from which maritals = 2 is to be employed. The two separate conditions are combined by the & conjunction. See the Exhibit below. 18

19 Exhibit 15 (Selection with conjugate conditions) As a result, the Input Editor displays all cases which satisfy the compound condition and crosses out the remaining cases. See the margin and filter $ column of the Editor. Exhibit 16 (Input Editor showing the sub-set selected with conjugate condition) 19

20 Split Files It is often necessary to analyse data from a data file (full set) by different categories (sub-sets). This is performed by specifying the sub-sets with appropriate defining variable which contains the category codes. Example: Given a sample of 10 male and female candidates with their ages. Calculate (a) the average age of the candidates (b) the average age of male and (c) average age of female candidates. Note: (b) and (c) are split file cases Steps to Split file From the Data menu select Split File (Split File box appears) Choose Organize Output by groups and then OK Note: Data file is sorted by the grouping variable Exhibit 17 20

21 Results (a) Average of all candidates Exhibit 18 Average age of all the candidates calculated Average age is (b) Average age of Male and Female candidates Exhibit 20 (The results on the top window) 21

22 Appendix Measurement scales Majority of statistical analysis involves simple arithmetic operations on the data values. However the types of admissible operations depend on the measurement scale of the data. Measurement scales of the data values are of the following types Nominal Discrete Non-parametric Ordinal/Rank/Likert Discrete Non-parametric Interval/Ratio Continuous Parametric Relative strength of 3 measurement scales Weak < > Strong Nominal Ordinal/Rank/Likert Interval/Ratio Transformation from Interval/Ratio to Ordinal is possible by grouping Interval scale values in discrete groups within ranges defined by researchers. (cf. Re-code application). Transformation from Ordinal scale to Interval scale is done by applying various transformation on the Ordinal scale data, e.g Log transform, Square root or Inverse transform. Admissible Statistical Analyses In general, the non-parametric variables (Nominal or Ordinal/Rank scale) define categories. Hence these variables are called categorical or classification or grouping or factor variables, containing the so-called non-metric or discrete data. Parametric variables contain actual measurements or observations which represent metric or continuous data. Various statistical techniques that are applied on the data are dependent on the measurement scale of the data values. A list of admissible statistical techniques is given below. Non-parametric Parametric Nominal Ordinal Ratio/Interval Mode Median Summary parameters Frequency Frequency Mean, Variance, Non-parametric tests Non-parametric tests Skewness, Kurtosis Contingency coeff Spearmans correlation Inferential Statistics ANOVA MANOVA Pearsons Correlations Linear and non-linear regressions 22

23 Summary statistics Statistical parameters are computed with a view to describing (or summarizing) the properties of sampled data. Each parameter is a single-valued quantity. In most cases, these parameters are computed for continuous scale data, with a few for discrete data. A selection of these parameters is given below. Definitions and Formulas of Summary Parameters for a sample Mode: Mode is the most frequently occurring value in a collection of observations. There may be more than one mode in a set of observations. (Nominal) Median: Median is the value in the middle value in a range of observations. There are exactly same number of observations greater than and less than the median value With N observed values, if N is odd, the median value is in the position [N+1]/2, and if N is even, median is the average of the values in position N/2 and N/ (Ordinal/Rank) Arithmetic Mean X : The (sample) mean of a sample of N observations consisting of X 1, X 2,...X i values, is given by (Interval/Ratio) X 1 N N X i i1 1 Range: Range is the difference between the maximum and minimum values of N observations, i.e. Range = X max - X min 2 Variance v: The variance of N observation represents the spread of the observed data set about the mean and is expressed as (Interval/Ratio) v N i1 X i X N Standard Deviation s: Standard deviation is given by the square root of variance s = v (Interval/Ratio) 23

24 Skewness Sk: Skewness represents features (presence or absence) of symmetry of the distribution of the data set about the mean and is given by S k X i X N * s (Interval/Ratio) Sk zero Sk -ve Sk +ve Distribution Symmetrical Left skewed Right skewed Kurtosis Kr: Kurtosis represents the peakedness of a distribution. (Interval/Ratio) K r 4 X i X i 7 4 N * s 24

25 Summary statistics contd. Quartiles: Quartiles divide a data set into quarters (4 equal parts) The first quartile, Q1, is the median of the portion of data set that lies at or below the median of the entire data set The second quartile, Q2 is the median of the entire data set The third quartile Q3, is the median of the portion of data set that lies at or above the median of the entire data set. Note: The entire data set is arranged in ascending order for quartile evaluation. Five number summary: Five number summary is the list of five summary measures including the minimum, Q1, Q2 (median), Q3 and maximum of the data set. Note: The entire data set is arranged in ascending order for five number evaluation. A Box and Whisker plot gives a graphic representation of a data set based on the Five number summary. Q1 Median Q3 Also the B and W plot gives a visual understanding of the skewness of the distribution of the data set. The equal areas of the two polygons on either side of the Median indicates a symmetrical (skew = 0) distribution of the dta. Percentiles: A percentile of a specific value K (Kth Percentile) is the value of the data at or below which lies the K percent of observations. 25

26 Uncertainties associated with sample data Confidence Interval CI A confidence interval is a range if values, w a to w b, which is expected to include a statistical parameter θ (i.e. w a θ w b ). A sampling distribution is associated with a statistical parameter (e,g, mean, correlation coefficient, regression coefficient and so on) calculated from a sample data set and the Central Limit Theorem states that this distribution is normal with the well known bell shape and two tails (areas of error) on either side. By specifying a confidence level α (traditionally α is set at 95%) with the sample parameter, it is necessary to exclude the regions of error. This region is 1 α taking both sides (two tail) of the distribution. In other words, the area of error on each side is (1 α)/2the area of exclusion is usually looked up from a table showing values at desired error (A t-distribution table). a) Calculation of CI for a mean (m) Find the value for the error region (look up t0.025 from t-distribution table). This value is 1.96 at 95% confidence level, or at 5% error level. Multiply the standard error (SE) of the mean by this value which gives the required interval. Note: the SE of a mean m is (s/ N), where s is the standard deviation of m and N the sample size Thus the CI of a sample mean m is CI if m = m ± t * SE of m i.e. = m ± 1.96* (s/ N) b) Calculation of CI for a proportion from a dichotomous set of values Consider a sample of size N with dichotomous values. The proportion p of values with one dichotomy sub-set is f 1 /N and the other sub-set is 1- p. Calculate the standard error SE of a proportion (p). This is given by SE of a proportion = The CI of a sample proportion p is p ( 1 p) N CI of p = p± t * SE of p at 95%level p ( 1 p) = p± t * N 26

27 Inferential Statistics The key objective of statistical analysis with a set of data is to draw some valid inference from it. Two computational steps are involved in deriving such inference. (a) Calculation of an appropriate test measure of the relevant hypothesis and (b) Estimation of the confidence interval. Inferential statistics comprises a wide range of tests which enable researchers to draw inferences from their sample data set. If an inference is to be drawn on a full population of data with a sample, certain conditions are to satisfied. Associated with every statistical test, there are two basic components. These components basically specify the conditions for an appropriate inference. These include i) A measurement requirement and ii) A model. Parametric and Non-parametric Tests: A parametric tests is a test which is to be applied on a set of data measured on Interval or Ratio scale (continuous) or on counts (discrete). It is generally assumed that the data conform to some distribution e.g. Normal or Poisson etc. A non-parametric test applied to a data set on Ordinal scale (e.g. ranked data) or on Nominal scale. In non-parametric tests, conformity of the data to any distribution is not assumed. Power Efficiency of Parametric and Non-Parametric Tests In general, inference of a more general type are drawn from the application of statistical tests which demand weaker assumption about the data. However, the test of null hypothesis through these applications is less powerful for any particular sample of size N (say) under consideration. This true of any Non-Parametric tests compared with Parametric test. Thus when two different tests are applied Test A and Test B) to two samples of two sizes Na and Nb where Nb > Na, then test B may be more powerful than test A This provides a scope for test B over test A. Efficiency and Sample size The extent of increase in the sample size to make a test (test B say) as powerful as another test (test A say) is linked with the power efficiency. This is defined in terms of the sample size used the respective tests 27

28 Power efficiency of test B = Na/Nb Using the above formula, it is easy to calculate the sample size of test B in order to raise its power to level of test A. Considering the parametric and non-parametric tests for comparing means of two groups (i.e. students t-test and Mann-Whitney test), the power efficiency of Mann- Whitney test is 95% relative to the t-test. Similarly among the parametric and non-parametric version of analysis of variance (ANOVA) the power efficiency of Kruskal-Wallis (non-parametric) test is again 95% relative to F-test (parametric). Same level of efficiency may be achieved for two types of tests (parametric and nonparametric) by drawing appropriate size for each. For example, a researcher needs to have a sample size of 20 cases for a Mann Whitney test to be able to reject Ho with the same level of confidence for every 19 cases for a t-test. Similarly a sample of size 20 is needed in a Kruskal-Wallis test to be able to reject Ho for every 19 cases for application of the F-test. 28

29 Hypothesis testing steps A no difference statement is set up as the starter (Ho Null hypothesis). The implicit aim of a null hypothesis is to negate what you wish to demonstrate with your sample data set. Apply appropriate statistical technique, which produces the test statistic value, degrees of freedom (df), and probability p (sig-value) Compare the test statistic p (sig) value with the p value of the sampled distribution If the p value is low (conventionally < 0.05), reject Ho with residual uncertainty proportional to p or else accept Ho. Significance level (p): The dichotomy p < 0.05 yes no Reject Ho Accept Ho Common Errors The common errors in arriving at a decision about Ho are of two types: Type I Error To reject Ho, when in fact it is true. Probability associated with Type I error = alpha Type II Error To accept Ho when in fact it is false Probability associated with Type II error = beta The two probabilities alpha and beta are inversely related, if alpha is increased. Beta is decreased. These errors originate from the sample size N of the data set. The errors are reduced by by increasing the sample size N. 29

30 Specific Parametric Tests. Simple ANOVA applications ANOVA (One Way or Univariate) Data specification: 2 variables - one Test Variable which includes all test scores (Interval/Ratio scale) and one categorical variable (Ordinal or Nominal scale) which defines the groups (sub-sets) of data on the test variables. The key quantity that is computed in ANOVA is the F-statistic, together with the degrees of freedom and the sig-value. The F statistic (a single valued result) is the ratio of the between group (b-g) variation and the within-group (w-g) variation present in the test variable. F = SS SS bg wg Both the numerator and the denominator are single-valued scaler quantities for ANOVA applications and hence the F-statistic turns out to be a single-valued quantity. Simple MANOVA applications GLM Multiple Analysis of Variance (MANOVA) The MANOVA technique is applied to analyse the variances with 2 or more test variables (contrast to MANOVA). These test variables containing continuous data are treated as the dependent variables and the categorical variable(s) as independent variable(s). MANOVA computes the effects of the categorical (factor) variable on the dependent variables through the F-statistic. Normality assumptions on the test variables are not stringent. In Manova with multiple test variables, the between group (b-g) and within group (wg) variations are not single values scaler quantities but matrices. These matrices are known as SSCP(bg) and SSCP(wg) matrices respectively. 30

31 The latent roots from the matrix represent the so-called eigenvalues. The F-statistic calculation from these eigenvalues proceeds along the route as follows. SSCP SSCP b-g (Matrix)/SSCP w-g (Matrix) b-g (Matr ix)/s SCP w- g(mat rix) Computes eigenvalues Computes Wilks The following equations relate the eigenvalues (s) with the Wilk's and the F- statsistic. 1 Wilks = i 1 F = n1 n p 1 p 1 2 where n 1 and n 2 represent the number of cases in p groups. Other commonly used Manova test indices are also computed from the eigenvalues. These are given below. Pillais Trace =1 i i Hotelling Trace = i Roy s max root = i i i max 1 max 31

32 Specific MANOVA Applications Variability of 2 or more test variables with 1 or more factor variables Data Specification: (i) 2 or more test variables (Continuous) and (ii) 1 or more categorical (factor) variables (discrete). The discrete variable may include 2 or more levels Repeated measure MANOVA (Between groups) Data Specification: Sub-sets of Test variables (Interval/Ratio scale) are to be entered in a 2-dimensional matrix format. The first element of the matrix represents the main factor and the second element denotes the factor levels (the levels indicate time repetition). Repeated measure MANOVA (mixed Within-Between groups) Sub-sets of Test variable (Interval/Ratio scale) is to be entered in a 1-dimensional matrix form with factor levels (the levels indicate time repetition) BI-VARIATE RELATIONSHIPS Pearsons Product-Moment Correlation (Parametric) Data specification Measurement scale 2 or more variables Ratio/Interval scale Example Correlation Matrix 3 variables: A B C Correlation coefficients appear as a Matrix AA AB AC BA BB BC CA CB CC AB = BA; AC = CA; BC = CB Range of Correlation coefficient r -1 r +1 Qualitative description of strength of r Range Relationship +/-0.1 to +/- 0.3 weak +/-0.31 to +/- 0.5 medium +/-0.51 to +/- 0.7 moderate +/-0.71 to +/- 1.0 strong 32

33 MATHEMATICAL MODELS An aspect of rationality is the possibility of going beyond purely intuitive judgements to the use of structural models that provide methods of aggregating intuitive judgements. P Suppes Lucien Stern Professor of Philosophy Stanford University California. General Linear Model Family Regression Analysis GLM Discriminant Analysis Factor Analyis Principal Component Analysis GLM Prediction Model Simple Linear Regression A single linear regression is formed with one dependent variable y and one independent variable x 1, and the equation is reduced to the form: y = constant + b 1 * x 1 The coefficient b 1 which represents the slope of the regression line and the constant is the intercept of the regression line with the vertical axis The coefficient may have a positive (+ve) or a negative (-ve) value. A +ve coefficient means that as the independent variable x increases, the dependent variable y also increases and a -ve coefficient means that as x increases y decreases. The magnitude of the coefficient value gives the measure of steepness of the regression line. 33

34 Multiple Linear Regression Regression is an example of GLM structures with the objective of prediction In Multiple regression, a linear equation is constructed by least square fitting using multiple independent (explanatory) variables X1, X2, X3 and one dependent variable Y.. The equation is expressed as follows: Y = constant + b 1 * X 1 + b 2 * X 2 + b 3 * X In the above equation, the coefficients (b 1, b 2 etc) associated with the independent variables are partial regression coefficients. Each coefficient or parameter should be regarded as an estimator of the measure of influence of a particular independent variable on the dependent variable. These coefficients may be +ve or -ve Data Specification Usually all the variables (independent and dependent) contain continuous (Interval/Ratio scale) data. In many regression applications, some of the independent variables may contain Ordinal /Rank scale data. In special applications (e.g. regression used as a classification model), the dependent variable may contain categorical data (Nominal or Ordinal/Rank scale) Terms and Definitions Partial Regression Coefficient - The coefficient associated with each independent variable (e.g. b1, b2 etc) is called the partial regression coefficient. These are the measures of influence of the independent variables on the dependent variable. These are unstandardised coefficients Standardised Coefficients - Owing to the presence of different scales of values in the independent variables, it is necessary to transform the unstandardised coefficients to produce the standardised coefficient which gives the measure of relative influence of each regression coefficient in comparable units. This is given by S x = b S y where is the standardised coefficient which is related to b, the unstandardised coefficient for a particular independent variable X and the dependent variable Y, and Sx and Sy are the respective standard deviations. Coefficient of Multiple Determination R 2 - This is the measure of the proportion of the variation in the dependent variable explained by the independent riables 34

35 The Ballantine for Multiple Regression The Ballantine model provides a diagrammatic representation of the influence of the explanatory variables on the dependent variable. The variables involved (dependent and independents) in the multivariate regression analysis are depicted as circles. which mutually overlap. The overlap sections of these circles represent the extent of partial contributions by the independent variables to the dependent variable and among themselves. A Y B X1 X2 C D Three circles (above) represent 3 variables Y, X1 and X2. There are 4 overlap regions shown by A, B, C, and D. 'A' represents the overlap between X1 and Y (independent of X2) equivalent to b1 'B' represents the overlap between X2 and Y (independent of X1) equivalent to b2 'C' represents the overlap between X1, X2 and Y The ratio of the sum 'A' + 'B' + 'C' to the total area of the 'Y' circle is equivalent to R2, the coefficient of multiple determination. 'D' represents the overlap between X1 and X2 (independent of Y) is equivalent to partial correlation of X1 and X2. 35

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

An introduction to IBM SPSS Statistics

An introduction to IBM SPSS Statistics An introduction to IBM SPSS Statistics Contents 1 Introduction... 1 2 Entering your data... 2 3 Preparing your data for analysis... 10 4 Exploring your data: univariate analysis... 14 5 Generating descriptive

More information

SAS Analyst for Windows Tutorial

SAS Analyst for Windows Tutorial Updated: August 2012 Table of Contents Section 1: Introduction... 3 1.1 About this Document... 3 1.2 Introduction to Version 8 of SAS... 3 Section 2: An Overview of SAS V.8 for Windows... 3 2.1 Navigating

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

Using SPSS, Chapter 2: Descriptive Statistics

Using SPSS, Chapter 2: Descriptive Statistics 1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

More information

Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools. Tools for Summarizing Data Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

January 26, 2009 The Faculty Center for Teaching and Learning

January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i

More information

Data exploration with Microsoft Excel: analysing more than one variable

Data exploration with Microsoft Excel: analysing more than one variable Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

IBM SPSS Statistics 20 Part 1: Descriptive Statistics CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 1: Descriptive Statistics Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

Multivariate Analysis of Variance (MANOVA)

Multivariate Analysis of Variance (MANOVA) Chapter 415 Multivariate Analysis of Variance (MANOVA) Introduction Multivariate analysis of variance (MANOVA) is an extension of common analysis of variance (ANOVA). In ANOVA, differences among various

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

IBM SPSS Statistics for Beginners for Windows

IBM SPSS Statistics for Beginners for Windows ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning

More information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

Directions for using SPSS

Directions for using SPSS Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

More information

Introduction Course in SPSS - Evening 1

Introduction Course in SPSS - Evening 1 ETH Zürich Seminar für Statistik Introduction Course in SPSS - Evening 1 Seminar für Statistik, ETH Zürich All data used during the course can be downloaded from the following ftp server: ftp://stat.ethz.ch/u/sfs/spsskurs/

More information

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Descriptive Analysis

Descriptive Analysis Research Methods William G. Zikmund Basic Data Analysis: Descriptive Statistics Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging,

More information

Instructions for SPSS 21

Instructions for SPSS 21 1 Instructions for SPSS 21 1 Introduction... 2 1.1 Opening the SPSS program... 2 1.2 General... 2 2 Data inputting and processing... 2 2.1 Manual input and data processing... 2 2.2 Saving data... 3 2.3

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

S P S S Statistical Package for the Social Sciences

S P S S Statistical Package for the Social Sciences S P S S Statistical Package for the Social Sciences Data Entry Data Management Basic Descriptive Statistics Jamie Lynn Marincic Leanne Hicks Survey, Statistics, and Psychometrics Core Facility (SSP) July

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Simple Linear Regression, Scatterplots, and Bivariate Correlation 1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.

More information

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application

More information

An introduction to using Microsoft Excel for quantitative data analysis

An introduction to using Microsoft Excel for quantitative data analysis Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing

More information

An Introduction to SPSS. Workshop Session conducted by: Dr. Cyndi Garvan Grace-Anne Jackman

An Introduction to SPSS. Workshop Session conducted by: Dr. Cyndi Garvan Grace-Anne Jackman An Introduction to SPSS Workshop Session conducted by: Dr. Cyndi Garvan Grace-Anne Jackman Topics to be Covered Starting and Entering SPSS Main Features of SPSS Entering and Saving Data in SPSS Importing

More information

Scatter Plots with Error Bars

Scatter Plots with Error Bars Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate 1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,

More information

An analysis method for a quantitative outcome and two categorical explanatory variables.

An analysis method for a quantitative outcome and two categorical explanatory variables. Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce

More information

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Excel Tutorial Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Working with Data Entering and Formatting Data Before entering data

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

ADD-INS: ENHANCING EXCEL

ADD-INS: ENHANCING EXCEL CHAPTER 9 ADD-INS: ENHANCING EXCEL This chapter discusses the following topics: WHAT CAN AN ADD-IN DO? WHY USE AN ADD-IN (AND NOT JUST EXCEL MACROS/PROGRAMS)? ADD INS INSTALLED WITH EXCEL OTHER ADD-INS

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

SPSS Tests for Versions 9 to 13

SPSS Tests for Versions 9 to 13 SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list

More information

Using Excel for Statistics Tips and Warnings

Using Excel for Statistics Tips and Warnings Using Excel for Statistics Tips and Warnings November 2000 University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID Contents 1. Introduction 3 1.1 Data Entry and

More information

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 [email protected] 1. Descriptive Statistics Statistics

More information

SPSS TUTORIAL & EXERCISE BOOK

SPSS TUTORIAL & EXERCISE BOOK UNIVERSITY OF MISKOLC Faculty of Economics Institute of Business Information and Methods Department of Business Statistics and Economic Forecasting PETRA PETROVICS SPSS TUTORIAL & EXERCISE BOOK FOR BUSINESS

More information

Data analysis process

Data analysis process Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis

More information

SECTION 2-1: OVERVIEW SECTION 2-2: FREQUENCY DISTRIBUTIONS

SECTION 2-1: OVERVIEW SECTION 2-2: FREQUENCY DISTRIBUTIONS SECTION 2-1: OVERVIEW Chapter 2 Describing, Exploring and Comparing Data 19 In this chapter, we will use the capabilities of Excel to help us look more carefully at sets of data. We can do this by re-organizing

More information

Microsoft Excel 2010 Part 3: Advanced Excel

Microsoft Excel 2010 Part 3: Advanced Excel CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES Microsoft Excel 2010 Part 3: Advanced Excel Winter 2015, Version 1.0 Table of Contents Introduction...2 Sorting Data...2 Sorting

More information

Two Correlated Proportions (McNemar Test)

Two Correlated Proportions (McNemar Test) Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences. 1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

More information

SPSS for Simple Analysis

SPSS for Simple Analysis STC: SPSS for Simple Analysis1 SPSS for Simple Analysis STC: SPSS for Simple Analysis2 Background Information IBM SPSS Statistics is a software package used for statistical analysis, data management, and

More information

SPSS Manual for Introductory Applied Statistics: A Variable Approach

SPSS Manual for Introductory Applied Statistics: A Variable Approach SPSS Manual for Introductory Applied Statistics: A Variable Approach John Gabrosek Department of Statistics Grand Valley State University Allendale, MI USA August 2013 2 Copyright 2013 John Gabrosek. All

More information

Data exploration with Microsoft Excel: univariate analysis

Data exploration with Microsoft Excel: univariate analysis Data exploration with Microsoft Excel: univariate analysis Contents 1 Introduction... 1 2 Exploring a variable s frequency distribution... 2 3 Calculating measures of central tendency... 16 4 Calculating

More information

SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout

SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout Analyzing Data SPSS Resources 1. See website (readings) for SPSS tutorial & Stats handout Don t have your own copy of SPSS? 1. Use the libraries to analyze your data 2. Download a trial version of SPSS

More information

Multiple Regression. Page 24

Multiple Regression. Page 24 Multiple Regression Multiple regression is an extension of simple (bi-variate) regression. The goal of multiple regression is to enable a researcher to assess the relationship between a dependent (predicted)

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Introduction to the TI-Nspire CX

Introduction to the TI-Nspire CX Introduction to the TI-Nspire CX Activity Overview: In this activity, you will become familiar with the layout of the TI-Nspire CX. Step 1: Locate the Touchpad. The Touchpad is used to navigate the cursor

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected]

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected] Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:

More information

Appendix III: SPSS Preliminary

Appendix III: SPSS Preliminary Appendix III: SPSS Preliminary SPSS is a statistical software package that provides a number of tools needed for the analytical process planning, data collection, data access and management, analysis,

More information

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

There are six different windows that can be opened when using SPSS. The following will give a description of each of them. SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:

More information

Gamma Distribution Fitting

Gamma Distribution Fitting Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics

More information

Mixed 2 x 3 ANOVA. Notes

Mixed 2 x 3 ANOVA. Notes Mixed 2 x 3 ANOVA This section explains how to perform an ANOVA when one of the variables takes the form of repeated measures and the other variable is between-subjects that is, independent groups of participants

More information

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

More information

Describing, Exploring, and Comparing Data

Describing, Exploring, and Comparing Data 24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter

More information

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express

More information

Simple Predictive Analytics Curtis Seare

Simple Predictive Analytics Curtis Seare Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Descriptive and Inferential Statistics

Descriptive and Inferential Statistics General Sir John Kotelawala Defence University Workshop on Descriptive and Inferential Statistics Faculty of Research and Development 14 th May 2013 1. Introduction to Statistics 1.1 What is Statistics?

More information

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the

More information

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs Using Excel Jeffrey L. Rummel Emory University Goizueta Business School BBA Seminar Jeffrey L. Rummel BBA Seminar 1 / 54 Excel Calculations of Descriptive Statistics Single Variable Graphs Relationships

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Using R for Linear Regression

Using R for Linear Regression Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional

More information

IBM SPSS Direct Marketing 23

IBM SPSS Direct Marketing 23 IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release

More information

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques

More information

Psych. Research 1 Guide to SPSS 11.0

Psych. Research 1 Guide to SPSS 11.0 SPSS GUIDE 1 Psych. Research 1 Guide to SPSS 11.0 I. What is SPSS: SPSS (Statistical Package for the Social Sciences) is a data management and analysis program. It allows us to store and analyze very large

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

SPSS INSTRUCTION CHAPTER 1

SPSS INSTRUCTION CHAPTER 1 SPSS INSTRUCTION CHAPTER 1 Performing the data manipulations described in Section 1.4 of the chapter require minimal computations, easily handled with a pencil, sheet of paper, and a calculator. However,

More information

LOGISTIC REGRESSION ANALYSIS

LOGISTIC REGRESSION ANALYSIS LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

Big Ideas in Mathematics

Big Ideas in Mathematics Big Ideas in Mathematics which are important to all mathematics learning. (Adapted from the NCTM Curriculum Focal Points, 2006) The Mathematics Big Ideas are organized using the PA Mathematics Standards

More information

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions. Algebra I Overview View unit yearlong overview here Many of the concepts presented in Algebra I are progressions of concepts that were introduced in grades 6 through 8. The content presented in this course

More information