Save this PDF as:

Size: px
Start display at page:

## Transcription

1 Table of Contents Preface Chapter 1: Introduction 1-1 Opening an SPSS Data File Viewing the SPSS Screens... 3 o Data View o Variable View o Output View 1-3 Reading Non-SPSS Files... 6 o Convert From Excel to SPSS o Convert From Text to SPSS 1-4 Data View in SPSS Variable View in SPSS Chapter Tables (1.1) One Proportion (1.5) Summarizing Variables (3.1) Cross-Tabulation Tables (4.1) Descriptive Statistics (6.1) o Descriptives o Descriptives with Percentiles, Interquartile Ranges, and Confidence Intervals 2-6 Independent T-Tests (6.3) Chapter Paired Samples T-Test (7.2) One Sample T-Test (7.3) Chi-Square Test of Association (8.2) ANOVA (9.3) Scatterplot (10.1) o Line of Best Fit o Correlation 3-6 Linear Regression Chapter 4: Appendix 4-1 Bar Chart Boxplot Histogram Pie Chart Side-By-Side Boxplots Chi Squared Goodness of Fit Test Chi Squared Tests o Linear Trend Tests o McNemar Test o Relative Risk i

2 Preface Welcome to your SPSS Statistics 21 Manual which is a companion to Introduction to Statistical Investigations. The manual is designed with simple commands for an operation first, then an example, and then the output. The examples use databases in SPSS (.sav); your professor will tell you where to find these files. The names of the databases have been italicized. To simplify the text, we have also italicized all buttons that should be clicked, and we have put quotations around all labels. We hope you find the manual user friendly. Be aware of special notes (see the icon) that are sure to be useful. We have included helpful screen shots. Use the Table of Contents to navigate the manual. Chapter sections are arranged alphabetically (where possible) for your convenience. The manual provides help for basic graphics and hypothesis testing, but is not all inclusive. Feel free to explore SPSS on your own to find other features. Last Revised: September 2013 iii

3 Chapter 1: Introduction to SPSS 1 Chapter 1: Introduction to SPSS 1-1 Opening an SPSS Data File Viewing the SPSS Screens... 3 o Data View o Variable View o Output View 1-3 Reading Non-SPSS Files... 6 o Convert From Excel to SPSS o Convert From Text to SPSS 1-4 Data View in SPSS Variable View in SPSS... 10

4 2 Chapter 1: Introduction to SPSS 1-1 Opening an SPSS Data File Use if you want to open a SPSS data file (*.sav). Commands: 1. Open SPSS Statistics In the SPSS Statistics 21 window, check the bubble next to Open an existing data source. 3. Select OK. 4. Find and double click the file you wish to open. Note: SPSS data files are *.sav files. If your data is not in a *.sav file see 1-3 Reading Non- SPSS Files.

5 Chapter 1: Introduction to SPSS Viewing the SPSS Screens Use if you are unfamiliar with the SPSS screens. Data View After opening a SPSS data file, there are two possible ways to view the data (Data View & Variable View). In the bottom left corner of the screen there is a tab labeled Data View. Highlight the Data View tab by clicking on it. Data View displays the data in a spreadsheet. For more information see 1-4 Data Entry in SPSS. Note: Each row is a different subject or individual, while each column is a different variable. Example: Look at row 5 in Data View using Animal Sleep Data.sav. Looking at row 5, this subject is a mountain beaver who has a body weight of 1.35 kg, brain weight of 8.10 grams, non-dreaming sleep of 8.4 hours, and a dreaming sleep of 2.8 hours. Notice the columns are the different variables (species, body weight, brain weight, non-dreaming sleep, and dreaming sleep).

6 4 Chapter 1: Introduction to SPSS Variable View In the bottom of the screen there is also a tab labeled Variable View (next to Data View). Highlight the Variable View tab by clicking on it. Variable View allows for editing and labeling of the variables. For more information see 1-5 Variable View in SPSS. Output View After running a test or creating a graph, an *Output1 [Document1]-SPSS Statistics Viewer window will appear and will show your results. The column of the title outputs on the left side of the window is like a table of contents for your results. Simply double click the desired title and it will show up in the Output screen. (See figure at right.) To save the Output, select File>Save. Next to File name: type an appropriate name for the data file. Note: Make sure to save both the Output window and the SPSS data file separately because they are separate files. To copy the output into Microsoft Word, right click the item of interest and select Copy. Open Microsoft Word and paste the item into the document.

7 Chapter 1: Introduction to SPSS 5 This window appears after running descriptive statistics. Notice the Output on the left side of the box.

8 6 Chapter 1: Introduction to SPSS 1-3 Reading Non-SPSS Files Convert from Excel (.xls) to SPSS (.sav) Use if you want to convert a file from Excel (.xls) to SPSS (.sav). Commands: 1. Make sure your desired Excel file is saved and closed. Note: Some Excel files also contain explanatory sentences about the dataset. Make sure these sentences are deleted (not including the variable names or string values). You want the variable names to be in the first row of the spreadsheet. 2. Open SPSS Statistics In the SPSS Statistics 21 window, select Cancel at the bottom. 4. Click File>Read Text Data in the top left corner. 5. At the bottom of the Open Data window, click the down arrow for Files of type. 6. Select Excel (*.xls, *.xlsx, *.xlsm). 7. Find the Excel file you saved earlier and double-click it. 8. In the Opening Excel Data Source window, make sure Read variable names from the first row of data is checked.

9 Chapter 1: Introduction to SPSS 7 9. Select OK. 10. Your results should now be in SPSS. Don t forget to save it as a SPSS (*.sav) file by going to File>Save As. Convert from Text (.txt) to SPSS (.sav) Use if you want to convert a file from.txt to SPSS (.sav)

10 8 Chapter 1: Introduction to SPSS Commands: 1. Make sure your desired.txt file is saved and closed. Note: Some.txt files also contain explanatory sentences about the dataset. Make sure these sentences are deleted (not including the variable names or string values). You want the variable names to be at the top of the document. 2. Open SPSS Statistics In the SPSS Statistics 21 window, select Cancel at the bottom. 4. Click File>Read Text Data in the top left corner. 5. At the bottom of the Open Data window, click the down arrow for Files of type. 6. Select Text (*.txt, *.dat). 7. Find the.txt file you saved earlier and double-click it. 8. In the Text Import Wizard - Step 1 of 6 check the desired bubbles (the default is usually correct most.txt files have a No checked for Does you text file match a predefined format? ). Select Next >. Note: In the following six steps there will be a Text file or Data Preview that shows what the data will look like once it is converted to SPSS. 9. In the Text Import Wizard Step 2 of 6 check the desired bubbles (the default is usually correct most.txt files are Delimited, variable names are at the top of the file). Select Next >. 10. In the Text Import Wizard Step 3 of 6 check the desired bubbles (the default is usually correct most.txt files have the first case of data beginning at line 2, each line represents a case, and you will want to import all of the cases). Select Next>. 11. In the Text Import Wizard Step 4 of 6 check the desired bubbles (the default is usually correct most.txt files have a Tab between variables, the text qualifier is None ). Select Next>. 12. In the Text Import Wizard Step 5 of 6 change the desired information about the variables. Select Next>. 13. In the Text Import Wizard Step 6 of 6 check the desired bubbles (the default is usually correct most.txt files do not save the file for future use, would not like to paste the syntax). Select Finish. 14. Your results should now be in SPSS. Don t forget to save it as a SPSS (*.sav) file by going to File>Save As. Note: When converting a file from Fathom to SPSS, you need to save it as a.txt file. Go through the following steps: 1. Open your Fathom work. 2. Click (once) your brown Collection box so that the boarder is highlighted. 3. Select File>Export Collection in the top left corner. 4. Next to File name save the file as a text file (.txt). 5. Then follow the above instructions to get the.txt file to SPSS.

11 Chapter 1: Introduction to SPSS Data View in SPSS Use if you have raw data to enter into SPSS. Data View Commands: 1. Open SPSS Statistics In the SPSS Statistics 21 window, check the bubble next to Type in data. 3. Select OK. 4. In the bottom left corner of the screen make sure the Data View tab is highlighted. 5. Type the data into the cells. Note: Each row is a different subject or individual, while each column is a different variable.

12 10 Chapter 1: Introduction to SPSS 1-5 Variable View in SPSS Use if you have variables to change in SPSS Variable View Commands: 1. Select Variable View on the bottom of the screen. 2. To change the type of the variable, select Numeric under the Type column. Select the button next to Numeric. Pick the appropriate variable type. Select OK. Note: The most popular variable types are numeric (numbers) and string (words). However, string variables cannot be used for many tests and graphs. A solution to this problem is to code your categorical variable numerically and connect each number with a value label. This is done for the variable dog type below. The process is described more completely later in this section. 3. To change the number of characters allowed in a cell in Data View, select the number under Width. Then click the up and down arrows to get the desired number. 4. To change the number of decimal places in a cell in Data View for numeric variables, select the number under Decimal. Then click the up and down arrows to get the desired number. 5. To label the variables, type your preferred variable name in the Label cell. This label will appear when running tests and descriptive statistics instead of the actual variable name as shown in Data View. Note: Labeling variables can be extremely important. When using datasets with many variables similarly named (i.e. Research_ Question_1, Research_Question_2, Research_ Question_3) running tests will display only Research_Que, Research_Que, Research_Que which is difficult to understand. Labels will make this much easier to comprehend in tests without changing the actual name of the variable. Also, labels can contain spaces, whereas variable names cannot. 6. To add a Value to a variable, Select the button next under Values. Enter the assigned number next to Value:. Enter the assigned label next to Label:. Select Add. Continue to add labels until all the values are assigned. Select OK. Note: Assigning a value to a variable is very common when using SPSS 21. This feature allows for string values to appear in tests and descriptive statistics, but for numeric values to appear in Data View. See the example below. 7. To increase the horizontal length of a variable s column, select the number under Columns. Then click the up and down arrows to get the desired number.

13 Chapter 1: Introduction to SPSS To align the information to the left, right, or center of the cell, select the word under Align. Then select the drop down arrow and select the desired alignment. 9. To change the measure of the variable (scale, nominal, ordinal), select the word under Measure. Then select the drop down arrow and select the desired measure. Example: Enter data into SPSS (using both Data View & Variable View) using the following information: A dog owner has 4 dogs, which include 2 Chihuahuas and 2 Great Danes. The owner wants to look at how much the dogs weigh (in pounds) and the amount of food the dogs consume per day (in cups). The first Chihuahua weighs 4 pounds and eats.5 cups of food per day. The second Chihuahua weighs 2 pounds and eats.25 cups of food per day. The first Great Dane weighs 120 pounds and eats 2.5 cups of food per day. The second Great Dane weights 200 pounds and eats 4 cups of food per day. Also, label Chihuahua as 1 and Great Dane as 2 (so the variables are not string). This is Data View. Notice in the figure above, Chihuahua has a value of 1, and Great Dane has a value of 2.

14 12 Chapter 1: Introduction to SPSS The figure above shows how to change the values of the Dog_Type variable. Notice also that the labels have been changed for the Dog_Type and Dog_Weight variables. The effects of these changes are shown in the two figures below. Both of these images come from running descriptive statistics. Notice instead of 1 or 2, "Chihuahua and Great Dane are shown, and instead of Dog_Type or Dog_Weight, Type of Dog and Weight of Dog are shown.

15 Chapter 1: Introduction to SPSS 13 Computing a New Variable Using Mathematical Operations Use if you want to create a new variable as a result of mathematical or conditional operations on existing variables. Commands: 1. Transform>Compute Variable. 2. Type the name of the new variable under Target Variable: (make sure the new name has no spaces in it). 3. Select and drag the variable(s) of interest under Numeric Expression:. Also indicate the mathematical relationship between the target variable and the existing variable(s) using the calculator pad or the computer keyboard. 4. Select OK. Example: Calculate the average golf score of the four rounds using GolfScores.sav. This shows the process. Notice under Numeric Expression: there is an equation to get the average score of the four rounds, which was created by the above calculator pad and the computer keyboard.

16 14 Chapter 1: Introduction to SPSS This shows that the average scores for the four rounds are now recoded into a new column (AverageScore). For example, Tiger had an average score of 68=(( )/4).

17 Chapter 2: Descriptive Statistics 15 Chapter Tables (1.1) One Proportion (1.5) Summarizing Variables (3.1) Cross-Tabulation Tables (4.1) Comparing Statistics (6.1) o Descriptives o Descriptives with Percentiles, Interquartile Ranges, and Confidence Intervals 2-6 Independent T-Tests (6.3)... 33

18 16 Chapter 2: Descriptive Statistics 2-1 Tables Use if variable is categorical. Commands: 1. Analyze>Descriptive Statistics>Frequencies. 2. Select and drag the variable of interest into the Variable(s): box. 3. Make sure Display frequency tables is checked. 4. Select OK. Example: Explore the variable Shop from the data set Coffee Data.sav. We will create a table of frequencies (counts) and percentages of students who frequent the different coffee shops. Output: This box shows there are 177 students in our sample and there is no missing data.

19 Chapter 2: Descriptive Statistics 17 This box shows the desired table. The Frequency column shows the number of students who preferred a certain coffee shop. The Percent column shows the percent of students who chose a certain shop out of the total number of students in the sample (including the missing data). The Valid Percent column shows the percent of students who chose a certain shop out of the number of students in the sample (without the missing data). If there is no missing data then the Percent and Valid Percent will be exactly the same. The Cumulative Percent column shows the percent of students who chose that shop and the shop(s) already listed from the Valid Percent column. For example, = 73.4 percent of students chose Lemonjello s or JP s.

20 18 Chapter 2: Descriptive Statistics 2-2 One Proportion Use if variable is categorical and binary. Note: To use this method, you will need to create a new document with two variables. Both variables need to be numeric. The first variable will have the two values for the categorical variable. In Variable View under Values, add value labels for these two values. This will make your output more readable. The second variable will be the observed count of each category. Commands: 1. Data>Weight Cases 2. Select Weight cases by and send the frequency/count to the Frequency Variable: box. OK. 3. Analyze>Nonparametric Tests>Chi-Square 4. The Test Variable List: is the categorical variable. 5. In the Expected Values box, select All categories equal if the population proportion is 50%. If the population proportion is not 50%, select Values. Type in one value at a time in the Value box, starting with the hypothesized proportion of the first group, then adding the corresponding proportion for the second group. (The first group is the group with the smallest numeric value in Variable View. These two proportions must have a sum of 100.) 6. Select OK.

21 Chapter 2: Descriptive Statistics 19 Example: Analyze if the proportion of females at Hope College is different from the national average of 57% female students in college. Use the data set Hope Student Survey 2008.sav. You will need to find the counts of males and females to create your new spreadsheet. The null and alternative hypotheses you will test are as follows. H 0 : The proportion of Hope College females is the same as the national average of 57%. H a : The proportion of Hope College females is different from the national average of 57%. First, create a new SPSS Data Sheet where the first variable is gender with values 0 and 1. In variable view under Values, enter value labels of female and male for the two values.

22 20 Chapter 2: Descriptive Statistics The second variable is the observed count. In the Hope Spring Survey 2008 data set, you will need to find the sum of each gender (Analyze>Descriptive Statistics>Frequencies, under Statistics check Sum ; the first box Statistics of the output gives the observed counts for each gender in the Sum row.) In the new SPSS Data Sheet, enter each count value, as shown below. Next, follow the aforementioned Commands to display the desired Output shown below. Output: This table displays the observed counts of females and males in our sample of Hope College students in the first column. The second column shows the counts we would have expected to see if the proportion of females at Hope were the same as the national average. The residual is the distance between the observed count and the expected count. This table displays the p-value of our test of significance (see the circle). The footnote a under the table tells you that all of the cells have expected cell counts of at least 5. The Chi-Square can have at most 20% of the cells in the table with an Expected Count less than 5.

23 Chapter 2: Descriptive Statistics 21 P-value: p =.695 >.05, fail to reject H 0. Conclusion: We do not have enough evidence to show that the proportion of Hope College females is significantly different from the national average of 57% females enrolled in college. Note: SPSS will always give the two-sided p-value. If you would like to get the one-sided p- value, divide the SPSS two-sided p-value in half (assuming that the observed proportion is in the direction of the alternative hypothesis).

24 22 Chapter 2: Descriptive Statistics 2-3 Summarizing Quantitative Variables Use if variable is quantitative. Commands: 1. Analyze>Descriptive Statistics>Frequencies. 2. Drag the appropriate quantitative variable from the list to the Variables(s): box. 3. Select Statistics. 4. Check the desired numeric summaries. To get the Quartiles or Percentiles, check the desired boxes under Percentile Values. When checking Percentile(s): enter the value in the box then select Add. To get the Mean, Median, Mode, or Sum, check the desired boxes under Central Tendency. To get the Standard Deviation, Variance, Range, Maximum, Minimum, or S.E. Mean, check the desired boxes under Dispersion. To get the Skewness or Kurtosis, check the desired boxes under Distribution. 5. Select Continue. 6. To create a histogram, select Charts and make sure Histograms: is selected. Check With normal curve if desired. 7. Select OK. Note: This histogram does not allow all the options to edit it or modify it. See Appendix: 4-3 Histograms on page 60 for more chart options.

25 Chapter 2: Descriptive Statistics 23 Example: Explore the variable egg length from the data set Cuckoo Eggs.sav. We will find the mean, median, mode, standard deviation, minimum, maximum, and the 20 th percentile. We will also create a histogram with a normal curve superimposed for the egg lengths. Output: This box gives the mean ( ), median (22.65), mode (23.05), standard deviation ( ), minimum (19.85), maximum (25.05), and the 20 th percentile ( ) for the egg lengths.

26 24 Chapter 2: Descriptive Statistics The box above gives information about individual egg lengths. The Valid column includes all the different egg lengths in the dataset. The Frequency column shows the number of eggs that have a certain egg length. The Percent column shows the percent of eggs that have a certain egg length out of the total number of eggs in the sample (including the missing data). The Valid Percent column shows the percent of eggs that have a certain egg length out of the number of eggs in the sample (without the missing data). If there is no missing data then the Percent and Valid Percent will be exactly the same. The Cumulative Percent column shows the percent of eggs that have a certain length or any length(s) less than it from the Valid Percent column. For example = 4.0 percent of eggs have a length of or less. Histogram of the egg lengths with a normal curve superimposed.

27 Chapter 2: Descriptive Statistics Cross-Tabulation Table Use if both independent and dependent variables are categorical. Commands: 1. Analyze>Descriptive Statistics>Crosstabs. 2. Drag the independent (categorical, explanatory) variable into the Row(s): box. 3. Drag the dependent (categorical, response) variable into the Column(s): box. 4. Under Cells Check Row in the Percentages box. 5. Select Continue. 6. Select OK. Example: Explore the relationship between the variables gender and penalties from the data set Penalties Data.sav. We will create a cross-tabulation table of gender predicting penalties. Output: This box shows there are 100 students in our sample and there is no missing data.

28 26 Chapter 2: Descriptive Statistics This box shows the desired cross-tabulation table. Notice the circle shows that 16 males received a penalty. The percent of males who received a penalty is 16 out of the total 46 males, which is 34.8%.

29 Chapter 2: Descriptive Statistics 27 Descriptives 2-5 Comparing Descriptive Statistics Use if the dependent variable is quantitative or ordinal and the independent variable is categorical. Commands: 1. Analyze>Compare Means>Means. 2. Under Dependent List: enter the dependent (quantitative, response) variable. 3. Under Independent List: enter the independent (categorical, explanatory) variable. 4. Under Options drag any of the desired actions (mean, standard error of mean, median, grouped median, sum, minimum, maximum, range, number of cases, first, last, standard deviation, variance, kurtosis, standard error of kurtosis, harmonic mean, geometric mean, percent of total sum, percent of total n, etc.) from under Statistics: to under Cell Statistics: 5. Select Continue. 6. Select OK in the Means window. Example: Explore the variable time, which measures time spent in a bathroom, from the data set Restrooms & Gender.sav. We will compare the mean, range, and standard deviation for males and females (variable: gender).

30 28 Chapter 2: Descriptive Statistics Output: This box shows there are 97 participants that do not have missing data. There are 5 participants that have missing time data with a total of 102 participants. This box shows the desired descriptive statistics for both males and females so comparisons can be performed. The mean bathroom time for females is seconds, while the mean bathroom time for males is seconds. The range bathroom time for females is 265 seconds, while the range bathroom time for males is 387 seconds. The standard deviation bathroom time for females is seconds, while the standard deviation bathroom time for males is

31 Chapter 2: Descriptive Statistics 29 Descriptives with Percentiles, Interquartile Range, & Confidence Intervals Use if the dependent variable is quantitative or ordinal and the independent variable is categorical. Note: This method has a similar output as the Descriptives method, but this method will give percentiles, interquartile range, and confidence intervals. Commands: 1. Analyze>Descriptive Statistics>Explore. 2. Under Dependent List: enter the dependent (quantitative, response) variable. 3. Under Factor List: enter the independent (categorical, explanatory) variable. 4. Select Statistics Check Descriptives to get the mean, 95% confidence interval for mean (upper and lower bound), 5% trimmed mean, median, variance, standard deviation, minimum, maximum, range, interquartile range, skewness, and kurtosis. Check Percentiles to get the 5 th, 10 th, 25 th, 50 th, 75 th, 90 th, and 95 th percentile. 5. Select Continue. 6. Select OK. Example: Once again we will explore the variable time from the data set Restrooms & Gender.sav. We will compare the mean, range, standard deviation, and the 50 th percentile for males and females.

32 30 Chapter 2: Descriptive Statistics Output: This box shows there are 46 female and 51 male participants. There are 5 females that have missing time data with a total of 51 male and 51 female participants. This box shows the descriptive statistics for both males and females so comparisons can be performed. The mean bathroom times for females ( seconds) and males ( seconds) are given along with 95% confidence intervals for these values. Many other descriptive statistics are provided as well.

33 Chapter 2: Descriptive Statistics 31 This box shows different percentiles of male and female time in the bathroom. The two rows (Weighted Average and Tukey s Hinges) calculate percentiles slightly differently, but either one can be used. The 50 th percentile for females is seconds in the bathroom, while the 50 th percentile for males is 82 seconds in the bathroom. This is the female stem-and-leaf plot. Notice the stem width is 100. This is the male stem-and-leaf plot. Notice the stem width is 100.

34 32 Chapter 2: Descriptive Statistics Shown here is a side-by-side boxplot of the amount of time spent in the bathroom for males and females. The open circles represent potential outliers using the 1.5 IQR rule, and the asterisks represent outliers using the 3 IQR rule.

35 Chapter 2: Descriptive Statistics 33 Independent Samples T Test 2-6 Independent T Tests Use if there is a binary categorical explanatory variable and a quantitative or ordinal response variable and the two groups being compared are independent (i.e. compare the means of two independent groups). Commands: 1. Analyze>Compare Means>Independent-Samples T Test 2. The Test Variable is the dependent (quantitative, response) variable. 3. The Grouping Variable is the independent (categorical, explanatory) variable. 4. Define Groups to indicate to SPSS which two groups are being compared. In the box Group1, type in the variable value (as it appears in Data View ) for the first group (i.e.: 0 ). In the box Group 2, type in the variable value (as it appears in Data View ) for the second group (i.e.: 1 ). Select Continue. 5. Select OK. Example: Use the variables sleep hours and gender from the data set Hope Student Survey 2008.sav to analyze whether or not the mean hours of sleep per night for females (µ F ) is significantly different from the mean hours of sleep per night for males (µ M ). We will test the following hypotheses: H 0 : The mean sleep hours of males is the same as the mean sleep hours of females. (µ M = µ F ) H a : The mean sleep hours of males is different than the mean sleep hours of females. (µ M µ F )

36 34 Chapter 2: Descriptive Statistics Output: The above table gives the sample size ( N ), mean, standard deviation, and standard error mean for the variable sleep hours for each gender. It shows that there is not much of a difference in the average hours of sleep per night between females and males. The p-value (.677) for the test (see the red circle) is in the Sig. (2-tailed) column. Sig. (short for significance) is what SPSS calls the p-value. Also notice the Equal variances not assumed row. The p-value will not always be the same in both rows (though it is in this case). Always use the equal variances not assumed row as it is generally accepted in statistical practice that it is safer not to assume equal variances. Notice the confidence interval (see the blue circle). The confidence interval contains zero, meaning zero is a possible difference in mean sleep hours. This makes sense because our difference in means is not significant. The confidence interval gives similar information as the p-value: because the confidence interval contains zero, the p-value is non-significant. P-value: p =.677 >.05, fail to reject H 0. Conclusion: We do not have enough evidence to show there is a significant difference in average amount of hours of sleep per night obtained by males and females. Note: SPSS will always give the two-sided p-value. To get the one-sided p-value (assuming that the sample mean of Group 1 is larger than Group 2 and the alternative hypothesis is such that the population mean of Group 1 is larger than Group 2), divide the SPSS two-sided p-value in half.

37 Chapter 3: Tests of Significance 35 Chapter Paired Samples T-Test (7.2) One Sample T-Test (7.3) Chi-Square Test of Association (8.2) Scatterplot (10.1) o Line of Best Fit o Correlation 3-6 Linear Regression (10.4) ANOVA (9.3)... 43

38 36 Chapter 3: Tests of Significance 3-1 Paired Samples T-Test Use if there is a quantitative or ordinal response variable and a binary categorical explanatory variable that has a 1-1 correspondence between the two groups (i.e. compare the means where the two groups in the sample are paired, such as married couples and comparing the mean of the men to the mean of the women). Commands: 1. Analyze>Compare Means>Paired-Samples T Test 2. Put one of the variables into the Variable1 column and the other into the Variable2 column in the first row. 3. Select OK. Note: The test will calculate Variable1 - Variable2. Note: If you have a 1-1 correspondence between the two groups, you may need to restructure your data so that each row corresponds to a pair of individuals instead of a single individual. Example: Analyze if there is a significant difference on average between the variables pulse sitting and pulse standing in the data set Pulse Data.sav. Since these pulse rates come form the same individual, there is dependency between the two pulse rate measures. We will test the following hypotheses to answer the question: On average, is there a significant difference between one s sitting pulse rate (µ Sit ) and one s standing pulse rate (µ Stand )? H 0 : The mean pulse rate sitting is the same as the mean pulse rate standing. (µ Sit = µ Stand ) H a : The mean pulse rate sitting is different from the mean pulse rate standing. (µ Sit µ Stand )

39 Chapter 3: Tests of Significance 37 Output: This table gives mean pulse rate, sample size N (which should be the same because the sample is the same for each variable), standard deviation, and standard error mean for each group. This box gives the correlation (.866, a strong correlation) between pulse rate sitting and pulse rate standing. The correlation is highly significant (p <.001) (see the circle). Although correlation is not the main purpose of the paired samples t test, it can give helpful information. The difference in the means is -6 (Pulse Sitting - Pulse Standing). The p-value for the difference in means is.005 (see the blue circle). Notice the confidence interval (see the red circle). The confidence interval does not contain zero, meaning zero is not a possible difference in mean pulse rates. This corresponds to the p- value from our test of significance. Our p-value leads us to reject the null hypothesis of no difference between the mean pulse rates. P-value: p =.005, reject H 0. Conclusion: We have enough evidence to show that the mean pulse rate while sitting is significantly different from the mean pulse rate while standing. On average, the standing pulse rate is 6 beats more per minute than the sitting pulse rate. Note: SPSS will always give the two-sided p-value. To get the one-sided p-value (assuming that the sample mean of Group 1 is larger than Group 2 and the alternative hypothesis is such that the population mean of Group 1 is larger than Group 2), divide the SPSS two-sided p-value by two. Note: To perform the paired samples t-test you can also create a new variable as the difference between your two quantitative variables (see 1-5 Variable View in SPSS, section entitled Computing a New Variable Using Mathematical Operations) and perform a one-sample T-Test on the newly created difference (see 2-6 Independent T-Tests).

40 38 Chapter 3: Tests of Significance 3-2 One Sample T-Test Use if there is a quantitative or ordinal variable and a given population parameter (i.e. compare a mean to a population mean). Commands: 1. Analyze>Compare Means>One-Sample T Test 2. The Test Variable is the variable of interest. 3. Type in the Test Value (the population mean that you wish to compare your sample to). 4. Select OK. Example: Use the variable StudyHrs from the data set Hope Student Survey 2008.sav to analyze whether or not the mean study hours per week (µ) of Hope College students is the same as or different from the recommended 30 hours per week for all full-time college students. We will test the following hypotheses: H 0 : The average study hours per week of Hope students is 30 hours. (µ = 30) H a : The average study hours per week of Hope students is not 30 hours. (µ 30) Output: The above table gives the sample size ( N ), mean standard deviation, and standard error mean of study hours for Hope students.

41 Chapter 3: Tests of Significance 39 The p-value (<.001) for the test (see the blue circle) is in the Sig. (2-tailed) column. Sig. (short for significance) is what SPSS calls the p-value. Notice the confidence interval (see the red circle). It is the confidence interval for the difference of means (Sample Mean Test Value). If you want the confidence interval for the true mean, add the test value to the lower and upper ends of the confidence interval. Also notice that because the confidence interval does not contain zero (zero is not a possible difference in means), the p-value is significant. If it had contained zero (zero is a possible difference in means), the p-value would have been non-significant. For this example, the confidence interval would have a lower bound of 30+( )= and an upper bound of 30+( )= P-value: Since p < <.05, reject H 0. Conclusion: We have enough evidence to show that the average study hours per week of Hope students are significantly different from the recommended 30 hours per week. Note: SPSS will always give the two-sided p-value. If you would like to get the one-sided p- value, divide the SPSS two-sided p-value in half (assuming that the sample mean is in the direction of the alternative hypothesis).

42 40 Chapter 3: Tests of Significance 3-3 Chi Square Test of Association Use if both independent and dependent variables are categorical and the groups being compared are independent. Commands: 1. Analyze>Descriptive Statistics>Crosstabs 2. The Row(s) is the independent (explanatory) variable. 3. The Column(s) is the dependent (response) variable. 4. Under Statistics check Chi-square. Select Continue. 5. Under Cells check Expected in the Counts box. Check Row in the Percentage box. Select Continue. 6. Select OK. Example: Analyze the association between the variables gender and in Greek life from the data set Hope Student Survey 2008.sav. We will test whether or not the proportion of males involved in Greek life is different from the proportion of females involved in Greek life at Hope College. Our null and alternative hypotheses are as follows. H 0 : The proportion of males in Greek life is the same as the proportion of females in Greek life. H a : The proportion of males in Greek life is different from the proportion of females in Greek life.

43 Chapter 3: Tests of Significance 41 Output: This box gives the variables and the sample size (N) of the valid, missing, and total data. This box gives the cross-tabulation table. We see that 35 females ( Count ) are in Greek life. Because Row percentages were selected, we know that 19.1% of Hope College females are in Greek life (see the blue circle). Also, 25 males ( Count ) are in Greek life, which is 18.9% of Hope College males (see the red circle). The Expected Count is the number of each gender we would expect to see involved in Greek life if the proportions of students in Greek life were the same for both genders.

44 42 Chapter 3: Tests of Significance This box gives us the p-value of.967 (see the circle). The p-value is in the Pearson Chi- Square row and Asymp. Sig. (2-sided) column. The footnote a under the table tells you that all of the cells have expected cell counts of at least 5. The Chi-Square can have at most 20% of the cells in the table with an Expected Count less than 5. (The Expected Count can be seen in the cross-tabulation table.) P-value: p =.967 >.05, fail to reject H 0. Conclusion: We do not have enough evidence to show that the proportion of males in Greek life (18.9%) is significantly different from the proportion of females in Greek life (19.1%).

45 Chapter 3: Tests of Significance ANOVA Use if there is a multiple category explanatory variable and a quantitative or ordinal response variable and the multiple groups being compared are independent (i.e. compare the means of multiple independent groups). Commands: 1. Analyze>Compare Means>One-Way ANOVA 2. The Dependent List is the dependent (response) variable. 3. The Factor is the independent (explanatory) variable. 4. Under Post-Hoc you can check either Tukey or Tamhane s T2 to do post-hoc comparisons of the group means. Select Continue. Note: Tukey can only be used when the standard deviations are within a factor of two of each other. 5. Under Options you can check Descriptive to get means and standard deviations within each group. Select Continue. 6. Select OK. Example: Analyze the variable egg length by host bird from the data set Cuckoo Eggs.sav. We will compare the means of cuckoo egg lengths between the five different host birds. Our null and alternative hypotheses are as follows. H 0 : The mean egg length for each host bird is equal. H a : At least one host bird has a different mean egg length from the other host birds.

46 44 Chapter 3: Tests of Significance Output: This box gives the sample sizes (N), the means, the standard deviations, and the confidence intervals of the means of each group. (In this example, the group is the type of host bird.) Because the p-value (Sig) is <.001 (see the circle), we have evidence that at least one of the group means is different from the others. Since there is evidence of a difference and the standard deviations (from the first table) are within a factor of two, (check if largest and smallest are within a factor of 2, if so then all are within a factor of two: /21.13 = < 2) we can use the Tukey post-hoc comparison. The mean squares are also given, along with the F statistic: /.814 = Note: If the p-value is not significant, then a post-hoc comparison is unnecessary. If the standard deviations are not within a factor of two of each other, but the p-value is significant, we should use the Tamhane s T2 post-hoc comparison. P-value: p <.001 <.05, reject H 0. Conclusion: We have evidence that at least one host bird has a different mean egg length from the other host birds.

47 Chapter 3: Tests of Significance 45 This box shows the results of the Tukey post-hoc analysis. It shows the difference in means between every possible group. The statistically significant differences are starred. From this, we see the Wren has a significantly different egg length than the Tree Pipit (p <.001), Hedge Sparrow (p <.001), Robin (p <.001), and the Pied Wagtail (p <.001) (see the red circle). However, none of the other groups are significantly different from each other. The confidence interval is given for the difference in means between the two groups. Thus, in cases where it does not include zero there is evidence of a statistically significant difference in means (see the blue circle), which coincide with the p-value. If the confidence interval contains zero (meaning zero is a possible difference in means), the p-value is non-significant.

48 46 Chapter 3: Tests of Significance This box summarizes the post-hoc test by identifying homogenous subsets of means. The different subsets are grouped so that each column contains a set of means that does not differ significantly from the other means in the column and does differ significantly from the means outside of its column. In this example, the mean egg length in the Wren nest is significantly different than the mean egg lengths of all the other host birds; and the Robin, Pied Wagtail, Tree Pipit, and Hedge Sparrow do not have significantly different mean egg lengths from each other.

49 Chapter 3: Tests of Significance Scatterplot Use if both variables are quantitative. Commands: 1. Graphs>Chart Builder. 2. From the Gallery tab, select Scatter/Dot. 3. Drag the first icon (Simple Scatter) into the white space labeled Drag a Gallery chart here Drag the appropriate quantitative variables from the Variables: list into the boxes for the X-Axis? and Y-Axis? 5. To change the range of the axes use the Element Properties dialog box and look at the Edit properties of:. Select X-Axis1 (Point1) or Y-Axis1 (Point1). Then under Scale Range change the Minimum or Maximum by unchecking the Automatic box and typing in your own Custom values. Select Apply. 6. Select OK. Note: Make sure the quantitative variables for both the X-axis and Y-axis are correctly identified by SPSS as scale variables or you will not be able to create a line of best fit on your scatterplot. To check this, right click on each of the variable names in the Chart Builder window under Variables:. Make sure there is a dot next to Scale.

50 48 Chapter 3: Tests of Significance Line of Best Fit Commands: 1. After creating the scatterplot, double click on the scatterplot graph in the Output window to get the Chart Editor window. 2. Elements>Fit Line at Total. 3. Make sure Linear is checked under Fit Method in the Properties window. Select Close. 4. File>Close in the Chart Editor window. Example: Explore the relationship between the variables weight and height from the data set Body Fat Data.sav. We will create a scatterplot of these variables with the line of best fit. Output: The scatterplot shows a moderately strong positive relationship between weight and height of individuals. The correlation coefficient, r, is *Residual Plot: For instructions on how to do a residual plot see page 51.

51 Chapter 3: Tests of Significance 49 Correlation Use if both variables are quantitative. Commands: 1. Analyze>Correlate>Bivariate 2. Select the two variables of interest. 3. Make sure Pearson is checked in the Correlation Coefficients box. 4. Select OK. *For instructions on how to do a residual plot see page 51. For instructions on how to do a scatter plot see page 47.

52 50 Chapter 3: Tests of Significance Example: Analyze the correlation (ρ) between the variables body length and body width, measurements taken from a sample of Dover sole, from the data set Fish.sav. H 0 : There is no relationship between the body length and the body width of Dover sole. (ρ = 0) H a : There is a relationship between the body length and the body width of Dover sole. (ρ 0) Output: The correlation between length and width is (see blue circles), a strong correlation, for this sample size (N) of 100. The p-value is < (see red circles). This shows that the correlation is significantly different from 0. Note: The p-value for testing correlation is the same as testing for slope in a simple (one explanatory variable) linear regression model. P-value: p <.001 <.05, reject H 0. Conclusion: We have enough evidence to show a significant relationship between the body length and the body width of Dover sole. The correlation was.831.

53 Chapter 3: Tests of Significance Linear Regression Use if both independent and dependent variables are quantitative or ordinal. Commands: 1. Analyze>Regression>Linear 2. Select the Dependent (response) and Independent(s) (explanatory) variables. 3. Under Statistics you can check Confidence intervals to have confidence intervals put on the beta coefficient estimates. Select Continue. 4. Under Save you can check the box for Unstandardized under Residuals in order to be able to create a residual plot later. Select Continue. 5. Select OK. Example: Analyze the linear relationship between heart rate and body temperature from the data set Body Temp & Heart Rate.sav. We will test the following hypotheses to answer the question: is heart rate a good predictor of body temperature? H 0 : There is no linear relationship between heart rate and body temperature. (β = 0) H a : There is a linear relationship between heart rate and body temperature. (β 0) Output: This box simply shows the variables used in the model: Variables Entered is the independent variable, and the dependent variable is shown at the bottom

54 52 Chapter 3: Tests of Significance This box indicates that the correlation coefficient, r, (column R) is.254, and the r 2 value (column R Square) is.064. This box gives the p-value (see the circle) for the significance of the correlation coefficient, r. When there is only one independent variable, the p-value for testing the correlation coefficient, r, will be the same as the p-value for testing the slope of the independent variable (see below). In the B column of the Unstandardized Coefficients section, we find the y-intercept of and the slope of for the least squares regression line (see the red circle). Thus, Body Temperature = *HeartRate. To use this equation, plug in the heart rate to estimate/predict the body temperature. For example, to estimate/predict the body temperature of a person with a heart rate of 70 beats per minute, plug in 70 for HeartRate : Body Temperature = *70 = F. The p-value for the test of the intercept being significantly different than 0 is p <.001, which is a fairly useless test. The p-value for the test of the slope β (and correlation) being significantly different than 0 is p =.004 (see the black circle). Note: Recall that the p-value for testing correlation is the same as the p-value for testing slope in a simple (one explanatory variable) linear regression model.

55 Chapter 3: Tests of Significance 53 The 95% upper and lower confidence bounds are given for both the slope and the intercept estimates (see the blue circle). Neither of which contain zero, meaning zero is not a possible value for the intercept or for the slope, coinciding with the p-values both being significant. If the confidence intervals had included zero, meaning zero is a possible value for the intercept or for the slope, the p-value would have been non-significant. P-value: Since p =.004 <.05, reject H 0. Conclusion: We have enough evidence to show a significant linear relationship between heart rate and body temperature. (Evidence of β 0) Creating a Residual Plot Creating a residual plot is an additional step in SPSS. You should have requested Unstandardized Residuals be Saved (see Command 4 above) when running your analysis. If you look at the SPSS spreadsheet (DataView) you will see a new column (variable) called Res_1. This column gives the residual for each individual in your dataset. You can then create a residual plot by creating a scatter plot (Sections 3-6 and 3-6 of this manual) of the residuals (Res_1) vs. your explanatory variable.

56 54 Chapter 3: Tests of Significance Note: Other more sophisticated residual plots are available by using the Plots dialog when running the Linear Regression.

57 Chapter 4: Appendix 55 Appendix 4-1 Bar Chart Boxplot Histogram Pie Chart Side-By-Side Boxplots Chi Squared Goodness of Fit Test Chi Squared Tests o Linear Trend Test o McNemar Test o Relative Risk

58 56 Chapter 4: Appendix 4-1 Bar Chart Use if variable is categorical. Commands: 1. Graphs>Chart Builder. 2. From the Gallery tab, select Bar. 3. Drag the first icon (Simple Bar) into the white space that says Drag a Gallery chart here Drag the appropriate categorical variable from the Variables: list into the X-Axis? box. 5. To change the y-axis to percentages instead of counts, use the Element Properties dialog box, shown on the left side of the image to the right. Under Statistics use the pull down arrow to select Percentage (?). Select Apply. 6. Select OK. 7. To add labels, right click the bar chart in the Output window. Select Edit Content>In Separate Window. In the Chart Editor window, right click a bar in the bar chart. Select Show Data Labels. In the Properties box, drag the desired labels from the Not Displayed: box to the Displayed: box. Select Apply then Close in the Properties box. Select File>Close in the Chart Editor window. 8. To move the data labels on the bar chart, right click the bar chart in the Output window. Select Edit Content>In Separate Window. Click and drag the labels to the appropriate spots. Select File>Close in the Chart Editor window.

### Using SPSS, Chapter 2: Descriptive Statistics

1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

### SPSS for Exploratory Data Analysis Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav)

Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav) Organize and Display One Quantitative Variable (Descriptive Statistics, Boxplot & Histogram) 1. Move the mouse pointer

### Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

### A Guide for a Selection of SPSS Functions

A Guide for a Selection of SPSS Functions IBM SPSS Statistics 19 Compiled by Beth Gaedy, Math Specialist, Viterbo University - 2012 Using documents prepared by Drs. Sheldon Lee, Marcus Saegrove, Jennifer

### Introduction Course in SPSS - Evening 1

ETH Zürich Seminar für Statistik Introduction Course in SPSS - Evening 1 Seminar für Statistik, ETH Zürich All data used during the course can be downloaded from the following ftp server: ftp://stat.ethz.ch/u/sfs/spsskurs/

### IBM SPSS Statistics for Beginners for Windows

ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning

### 4. Descriptive Statistics: Measures of Variability and Central Tendency

4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and

### Technology Step-by-Step Using StatCrunch

Technology Step-by-Step Using StatCrunch Section 1.3 Simple Random Sampling 1. Select Data, highlight Simulate Data, then highlight Discrete Uniform. 2. Fill in the following window with the appropriate

### Directions for using SPSS

Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

### Introduction to SPSS 16.0

Introduction to SPSS 16.0 Edited by Emily Blumenthal Center for Social Science Computation and Research 110 Savery Hall University of Washington Seattle, WA 98195 USA (206) 543-8110 November 2010 http://julius.csscr.washington.edu/pdf/spss.pdf

### SPSS Manual for Introductory Applied Statistics: A Variable Approach

SPSS Manual for Introductory Applied Statistics: A Variable Approach John Gabrosek Department of Statistics Grand Valley State University Allendale, MI USA August 2013 2 Copyright 2013 John Gabrosek. All

### Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce

### An SPSS companion book. Basic Practice of Statistics

An SPSS companion book to Basic Practice of Statistics SPSS is owned by IBM. 6 th Edition. Basic Practice of Statistics 6 th Edition by David S. Moore, William I. Notz, Michael A. Flinger. Published by

### January 26, 2009 The Faculty Center for Teaching and Learning

THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i

### An introduction to IBM SPSS Statistics

An introduction to IBM SPSS Statistics Contents 1 Introduction... 1 2 Entering your data... 2 3 Preparing your data for analysis... 10 4 Exploring your data: univariate analysis... 14 5 Generating descriptive

### An Introduction to SPSS. Workshop Session conducted by: Dr. Cyndi Garvan Grace-Anne Jackman

An Introduction to SPSS Workshop Session conducted by: Dr. Cyndi Garvan Grace-Anne Jackman Topics to be Covered Starting and Entering SPSS Main Features of SPSS Entering and Saving Data in SPSS Importing

### There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet

### Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

### Describing, Exploring, and Comparing Data

24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter

### Exercise 1.12 (Pg. 22-23)

Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

### The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

### Data analysis process

Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis

### 3: Graphing Data. Objectives

3: Graphing Data Objectives Create histograms, box plots, stem-and-leaf plots, pie charts, bar graphs, scatterplots, and line graphs Edit graphs using the Chart Editor Use chart templates SPSS has the

### For example, enter the following data in three COLUMNS in a new View window.

Statistics with Statview - 18 Paired t-test A paired t-test compares two groups of measurements when the data in the two groups are in some way paired between the groups (e.g., before and after on the

### Module 9: Nonparametric Tests. The Applied Research Center

Module 9: Nonparametric Tests The Applied Research Center Module 9 Overview } Nonparametric Tests } Parametric vs. Nonparametric Tests } Restrictions of Nonparametric Tests } One-Sample Chi-Square Test

### SPSS Explore procedure

SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

### GETTING YOUR DATA INTO SPSS

GETTING YOUR DATA INTO SPSS UNIVERSITY OF GUELPH LUCIA COSTANZO lcostanz@uoguelph.ca REVISED SEPTEMBER 2011 CONTENTS Getting your Data into SPSS... 0 SPSS availability... 3 Data for SPSS Sessions... 4

### Simple Linear Regression in SPSS STAT 314

Simple Linear Regression in SPSS STAT 314 1. Ten Corvettes between 1 and 6 years old were randomly selected from last year s sales records in Virginia Beach, Virginia. The following data were obtained,

### Frequency distributions, central tendency & variability. Displaying data

Frequency distributions, central tendency & variability Displaying data Software SPSS Excel/Numbers/Google sheets Social Science Statistics website (socscistatistics.com) Creating and SPSS file Open the

### Using SPSS 20, Handout 3: Producing graphs:

Research Skills 1: Using SPSS 20: Handout 3, Producing graphs: Page 1: Using SPSS 20, Handout 3: Producing graphs: In this handout I'm going to show you how to use SPSS to produce various types of graph.

### Once saved, if the file was zipped you will need to unzip it.

1 Commands in SPSS 1.1 Dowloading data from the web The data I post on my webpage will be either in a zipped directory containing a few files or just in one file containing data. Please learn how to unzip

### Learning SPSS: Data and EDA

Chapter 5 Learning SPSS: Data and EDA An introduction to SPSS with emphasis on EDA. SPSS (now called PASW Statistics, but still referred to in this document as SPSS) is a perfectly adequate tool for entering

### Linear Regression in SPSS

Linear Regression in SPSS Data: mangunkill.sav Goals: Examine relation between number of handguns registered (nhandgun) and number of man killed (mankill) checking Predict number of man killed using number

### Data exploration with Microsoft Excel: analysing more than one variable

Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical

### Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:

### Using Excel for descriptive statistics

FACT SHEET Using Excel for descriptive statistics Introduction Biologists no longer routinely plot graphs by hand or rely on calculators to carry out difficult and tedious statistical calculations. These

### SPSS TUTORIAL & EXERCISE BOOK

UNIVERSITY OF MISKOLC Faculty of Economics Institute of Business Information and Methods Department of Business Statistics and Economic Forecasting PETRA PETROVICS SPSS TUTORIAL & EXERCISE BOOK FOR BUSINESS

### SPSS Reference Manual: A gentle overview

SPSS Reference Manual: A gentle overview Table of Contents 1. Introduction to SPSS 4 2. Data Analysis and Statistical Concepts 13 Concept 1 Measurements of Central Tendency 13 Concept 2 Measurements of

### Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

### Introduction to Statistics with SPSS (15.0) Version 2.3 (public)

Babraham Bioinformatics Introduction to Statistics with SPSS (15.0) Version 2.3 (public) Introduction to Statistics with SPSS 2 Table of contents Introduction... 3 Chapter 1: Opening SPSS for the first

### Variables and Data A variable contains data about anything we measure. For example; age or gender of the participants or their score on a test.

The Analysis of Research Data The design of any project will determine what sort of statistical tests you should perform on your data and how successful the data analysis will be. For example if you decide

### F. Farrokhyar, MPhil, PhD, PDoc

Learning objectives Descriptive Statistics F. Farrokhyar, MPhil, PhD, PDoc To recognize different types of variables To learn how to appropriately explore your data How to display data using graphs How

### Using Excel for Statistics Tips and Warnings

Using Excel for Statistics Tips and Warnings November 2000 University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID Contents 1. Introduction 3 1.1 Data Entry and

### Using SPSS version 14 Joel Elliott, Jennifer Burnaford, Stacey Weiss

Using SPSS version 14 Joel Elliott, Jennifer Burnaford, Stacey Weiss SPSS is a program that is very easy to learn and is also very powerful. This manual is designed to introduce you to the program however,

### Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

### Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

Using Excel Jeffrey L. Rummel Emory University Goizueta Business School BBA Seminar Jeffrey L. Rummel BBA Seminar 1 / 54 Excel Calculations of Descriptive Statistics Single Variable Graphs Relationships

### Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

### SPSS: Descriptive and Inferential Statistics. For Windows

For Windows August 2012 Table of Contents Section 1: Summarizing Data...3 1.1 Descriptive Statistics...3 Section 2: Inferential Statistics... 10 2.1 Chi-Square Test... 10 2.2 T tests... 11 2.3 Correlation...

### Simple Linear Regression, Scatterplots, and Bivariate Correlation

1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.

### Excel Charts & Graphs

MAX 201 Spring 2008 Assignment #6: Charts & Graphs; Modifying Data Due at the beginning of class on March 18 th Introduction This assignment introduces the charting and graphing capabilities of SPSS and

### MTH 140 Statistics Videos

MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative

### SPSS (Statistical Package for the Social Sciences)

SPSS (Statistical Package for the Social Sciences) What is SPSS? SPSS stands for Statistical Package for the Social Sciences The SPSS home-page is: www.spss.com 2 What can you do with SPSS? Run Frequencies

### Statistical Analysis Using Gnumeric

Statistical Analysis Using Gnumeric There are many software packages that will analyse data. For casual analysis, a spreadsheet may be an appropriate tool. Popular spreadsheets include Microsoft Excel,

### business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

### Using Microsoft Excel to Plot and Analyze Kinetic Data

Entering and Formatting Data Using Microsoft Excel to Plot and Analyze Kinetic Data Open Excel. Set up the spreadsheet page (Sheet 1) so that anyone who reads it will understand the page (Figure 1). Type

### Appendix 2.1 Tabular and Graphical Methods Using Excel

Appendix 2.1 Tabular and Graphical Methods Using Excel 1 Appendix 2.1 Tabular and Graphical Methods Using Excel The instructions in this section begin by describing the entry of data into an Excel spreadsheet.

### Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

### DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics

### KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

### Summarizing and Displaying Categorical Data

Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency

### When to use Excel. When NOT to use Excel 9/24/2014

Analyzing Quantitative Assessment Data with Excel October 2, 2014 Jeremy Penn, Ph.D. Director When to use Excel You want to quickly summarize or analyze your assessment data You want to create basic visual

### How to Use a Data Spreadsheet: Excel

How to Use a Data Spreadsheet: Excel One does not necessarily have special statistical software to perform statistical analyses. Microsoft Office Excel can be used to run statistical procedures. Although

### GeoGebra Statistics and Probability

GeoGebra Statistics and Probability Project Maths Development Team 2013 www.projectmaths.ie Page 1 of 24 Index Activity Topic Page 1 Introduction GeoGebra Statistics 3 2 To calculate the Sum, Mean, Count,

Using Your TI-NSpire Calculator: Descriptive Statistics Dr. Laura Schultz Statistics I This handout is intended to get you started using your TI-Nspire graphing calculator for statistical applications.

### Simple Predictive Analytics Curtis Seare

Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use

### How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

### Directions for Frequency Tables, Histograms, and Frequency Bar Charts

Directions for Frequency Tables, Histograms, and Frequency Bar Charts Frequency Distribution Quantitative Ungrouped Data Dataset: Frequency_Distributions_Graphs-Quantitative.sav 1. Open the dataset containing

### Spreadsheets and Laboratory Data Analysis: Excel 2003 Version (Excel 2007 is only slightly different)

Spreadsheets and Laboratory Data Analysis: Excel 2003 Version (Excel 2007 is only slightly different) Spreadsheets are computer programs that allow the user to enter and manipulate numbers. They are capable

### The Chi-Square Test. STAT E-50 Introduction to Statistics

STAT -50 Introduction to Statistics The Chi-Square Test The Chi-square test is a nonparametric test that is used to compare experimental results with theoretical models. That is, we will be comparing observed

### Data exploration with Microsoft Excel: univariate analysis

Data exploration with Microsoft Excel: univariate analysis Contents 1 Introduction... 1 2 Exploring a variable s frequency distribution... 2 3 Calculating measures of central tendency... 16 4 Calculating

### An introduction to using Microsoft Excel for quantitative data analysis

Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing

### Introduction to StatsDirect, 11/05/2012 1

INTRODUCTION TO STATSDIRECT PART 1... 2 INTRODUCTION... 2 Why Use StatsDirect... 2 ACCESSING STATSDIRECT FOR WINDOWS XP... 4 DATA ENTRY... 5 Missing Data... 6 Opening an Excel Workbook... 6 Moving around

### Instructions for SPSS 21

1 Instructions for SPSS 21 1 Introduction... 2 1.1 Opening the SPSS program... 2 1.2 General... 2 2 Data inputting and processing... 2 2.1 Manual input and data processing... 2 2.2 Saving data... 3 2.3

### A Picture Really Is Worth a Thousand Words

4 A Picture Really Is Worth a Thousand Words Difficulty Scale (pretty easy, but not a cinch) What you ll learn about in this chapter Why a picture is really worth a thousand words How to create a histogram

### 1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

### TIPS FOR DOING STATISTICS IN EXCEL

TIPS FOR DOING STATISTICS IN EXCEL Before you begin, make sure that you have the DATA ANALYSIS pack running on your machine. It comes with Excel. Here s how to check if you have it, and what to do if you

### Psych. Research 1 Guide to SPSS 11.0

SPSS GUIDE 1 Psych. Research 1 Guide to SPSS 11.0 I. What is SPSS: SPSS (Statistical Package for the Social Sciences) is a data management and analysis program. It allows us to store and analyze very large

### Drawing a histogram using Excel

Drawing a histogram using Excel STEP 1: Examine the data to decide how many class intervals you need and what the class boundaries should be. (In an assignment you may be told what class boundaries to

### Using Excel for Analyzing Survey Questionnaires Jennifer Leahy

University of Wisconsin-Extension Cooperative Extension Madison, Wisconsin PD &E Program Development & Evaluation Using Excel for Analyzing Survey Questionnaires Jennifer Leahy G3658-14 Introduction You

### Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

### SECTION 2-1: OVERVIEW SECTION 2-2: FREQUENCY DISTRIBUTIONS

SECTION 2-1: OVERVIEW Chapter 2 Describing, Exploring and Comparing Data 19 In this chapter, we will use the capabilities of Excel to help us look more carefully at sets of data. We can do this by re-organizing

### mean, median, mode, variance, standard deviation, skewness, and kurtosis.

Quantitative Methods Assignment 2 Part I. Descriptive Statistics 1. EXCEL Download the Alabama homicide data to use in this short lab and save it to your flashdrive or to the desktop. Pick one of the variables

### Chapter 5. Regression

Topics covered in this chapter: Chapter 5. Regression Adding a Regression Line to a Scatterplot Regression Lines and Influential Observations Finding the Least Squares Regression Model Adding a Regression

### Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; hflores@rice.edu, and Dr. J.A. Dobelman

Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; hflores@rice.edu, and Dr. J.A. Dobelman Statistics lab will be mainly focused on applying what you have learned in class with

### Main Effects and Interactions

Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly

### SAS Analyst for Windows Tutorial

Updated: August 2012 Table of Contents Section 1: Introduction... 3 1.1 About this Document... 3 1.2 Introduction to Version 8 of SAS... 3 Section 2: An Overview of SAS V.8 for Windows... 3 2.1 Navigating

### NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

### 4 Other useful features on the course web page. 5 Accessing SAS

1 Using SAS outside of ITCs Statistical Methods and Computing, 22S:30/105 Instructor: Cowles Lab 1 Jan 31, 2014 You can access SAS from off campus by using the ITC Virtual Desktop Go to https://virtualdesktopuiowaedu

### MINITAB ASSISTANT WHITE PAPER

MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

### Scatter Plots with Error Bars

Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each

### Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

### Charting LibQUAL+(TM) Data. Jeff Stark Training & Development Services Texas A&M University Libraries Texas A&M University

Charting LibQUAL+(TM) Data Jeff Stark Training & Development Services Texas A&M University Libraries Texas A&M University Revised March 2004 The directions in this handout are written to be used with SPSS

### STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

### Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This