Scatterplots: Basics, enhancements, problems and solutions Peter L. Flom, Peter Flom Consulting, New York, NY
|
|
|
- Morgan Thomas
- 9 years ago
- Views:
Transcription
1 ABSTRACT Scatterplots: Basics, enhancements, problems and solutions Peter L. Flom, Peter Flom Consulting, New York, NY The scatter plot is a basic tool for presenting information on two continuous variables. While the basic plot is good in many situations, enhancements can increase its utility. I also go over tools to deal with the problem of overplotting. INTRODUCTION In this paper, I discuss scatter plots. I start with a very basic example, and then illustrate some enhancements. Next, I show some problems that can occur, and illustrate some solutions. With the new SG procedures, introduced in SAS R 9.2, SAS allows us to make good scatter plots relatively easily. However, there are many options, and applying them well is not always obvious. And, for specialized purposes, PROC SGRENDER can produce highly customized graphics, but its use is not entirely straightforward. BASIC SCATTER PLOTS AND ENHANCEMENTS SIMPLE SCATTER PLOTS WITH PROC SGPLOT The PROC for basic scatter plots is PROC SGPLOT. Rather than list a lot of the options and syntax for this PROC (all of which can be looked up) I will give some specific examples. As a starting example, let s plot unemployment rate and infant mortality for each of the 50 states plus the District of Columbia. This code: proc sgplot data = UnempIM; *STARTS THE PROC; scatter x = Unemployment y = InfantMortality; *CREATES A PLOT, NOTE THE USE OF X = AND Y =; Produces this scatter plot: Figure 1: Most basic scatter plot 1
2 ENHANCING THE SCATTER PLOT WITH PROC SGPLOT Next, we should probably make the labels on the axes clearer: proc sgplot data = UnempIM; xaxis label = "Unemployment (%)"; *THIS SHOULD BE SELF EXPLANATORY, THERE ARE OTHER AXIS OPTIONS AS WELL; yaxis label = "Infant Mortality (%)"; scatter x = Unemployment y = InfantMortality; This creates figure 2. Figure 2: Axes fixed The scatter plot isn t bad, but we can easily include more information. One thing we might want to do is add a smoothed line for the relationship between the two variables; in fact, we might want more than one, with different amounts of smoothing. One kind of smoothed line is loess. Another is linear regression. We can add loess lines and a regression line as follows: proc sgplot data = UnempIM; xaxis label = "Unemployment (%)"; yaxis label = "Infant Mortality (%)"; scatter x = Unemployment y = InfantMortality; loess x = Unemployment y = InfantMortality/nomarkers; loess x = Unemployment y = InfantMortality/nomarkers; reg x = Unemployment y = InfantMortality; *LOESS WORKS ON THE SAME DATA AS SCATTER, SMOOTH CAN BE ADJUSTED. SAS FROM PLOTTING EACH POINT 3 TIMES. REG PLOTS A LINEAR REGRESSION LINE, IT DOES NOT NEED NOMARKERS; NOMARKERS PREVENTS 2
3 Figure 3: Scatter plot with loess lines That bumpy smooth is too bumpy, so let s delete it. And we might also want to add an ellipse around the points; by default, the ellipse statement creates a prediction ellipse, that is, an ellipse for predicting a new point. It also approximates a region that contains 95% of the population: Figure 4 is produced by the following code: proc sgplot data = UnempIM; xaxis label = "Unemployment (%)"; yaxis label = "Infant Mortality (%)"; scatter x = Unemployment y = InfantMortality; loess x = Unemployment y = InfantMortality/nomarkers; reg x = Unemployment y = InfantMortality ; ellipse x = Unemployment y = InfantMortality; 3
4 Figure 4: Scatter plot with loess lines and ellipse That s all simple enough, and certainly adds information. But we can add more; we can look at the distribution of each variable separately and plot these in the margins. This requires use of the graph template language. MORE COMPLEX ENHANCEMENTS WITH THE GRAPH TEMPLATE LANGUAGE (GTL) The GTL allows very fine control over every aspect of a graph. It is also the language that SAS uses to create graphics. To use the GTL, you begin with a PROC TEMPLATE. You then use PROC SGRENDER to render (plot) that template. An advantage of this is that, once you have created a template, you can very easily use it with different data sets. A very good reference that includes many starting templates is Kuhfeld (1). In addition, the GTL Users Guide and GTL Reference Manual are quite useful. There are a great many possible options, and I will cover only a few. Suppose we want to produce a graph such as that shown in 5 4
5 The SAS System Figure 5: Scatter plot with density plots I think this is a pretty sophisticated graph. There s a lot of information and it s reasonably clear what that information is. That is, it includes both information on the univariate distributions as well as the bivariate distribution. To produce this figure, we first create a template PROC TEMPLATE. proc template; *STARTS PROC TEMPLATE; define statgraph scatdens2; *DEFINES A GRAPH TO BE CALL SCATDENS; begingraph; *BEGIN DEFINING THE GRAPH; entrytitle "Scatter plot with density plots"; *CREATE A TITLE; layout lattice/columns = 2 rows = 2 columnweights = (.8.2) rowweights = (.8.2) columndatarange = union rowdatarange = union; *LAYOUT LATTICE/COLUMNS = 2 ROWS = 2 SETS UP A GRID, OR LATTICE, OF GRAPHS; *COLUMNWEIGHTS AND ROWWEIGHTS SETS THE RELATIVE SIZE OF THE INDIVIDUAL COLUMNS AND ROWS; columnaxes; columnaxis /label = Unemployment (%) griddisplay = on; columnaxis /label = griddisplay = on; endcolumnaxes; *COLUMNAXES SETS THE CHARACTERISTICS OF COLUMNS; *THE SECOND ONE HAS NO LABEL (NONE WOULD FIT) rowaxes; rowaxis /label = Infant Mortality (%) griddisplay = on; rowaxis /label = griddisplay = on; endrowaxes; layout overlay; *STARTS THE ACTUAL GRAPHING OF DOTS AND SUCH; scatter plot x = unemployment y = infantmortality; *GRAPHS THE DOTS; loessplot x = unemployment y = infantmortality/nomarkers; loessplot x = unemployment y = infantmortality/smooth = 1 nomarkers; ellipse x = unemployment y = infantmortality/type = predicted; endlayout; 5
6 densityplot infantmortality/orient = horizontal; densityplot unemployment; endlayout; endgraph; end; Then we render it with PROC SGRENDER. proc sgrender data = UnempIM template = scatdens2; *NOW WE RENDER THE TEMPLATE WE CREATED; Rick Wicklin of SAS pointed out that, for people who dont like to program, the %sgdesign macro brings up a GUI interface that allows you to create the second image using drag-and-drop and menus. For details and examples, see (2), I have not used this feature. OVERPLOTTING Although scatter plots are very useful, they can have problems. The most important of these is overplotting which occurs when more than one observation has the same values. The proper solution depends on the type and amount of overplotting. In some cases, overplotting is due to the discrete nature of the way the data are recorded; for example, when asked their weights and heights, people respond with weights in pounds (or kilograms) and heights in feet and inches (or centimeters), rounded to the nearest unit, or sometimes even to the nearest multiple of 5. In other cases, there is so much data that overplotting occurs even when the data are recorded accurately to several decimal places. In the first situation, one excellent solution is jittering or adding small amounts of random noise to the data. The proper amount to add is partly a matter of trial and error; you want enough jitter so that the overlap is gone, but not so much that the data move to another category. In the latter case, there are various solutions. If the data set is not enormous, changing the plotting character or its size may be enough. If there is an enormous number of points, then we can change to a parallel box plot. A DATA SET Here I create a data set of actual heights and weights (realht and realwt), rounded to the nearest inch and pound (ht and wt). I also jitter these (jitht and jitwt). data htwt; do i = 1 to 10000; realht = rannor( )*3 + 66; realwt = realht*2 + realht**2* *rannor( ); ht = round(realht,1); wt = round(realwt,1); jitht = ht+rannor( ); jitwt = wt+rannor( ); output; end; MODERATE OVERPLOTTING DUE TO DISCRETIZATION If we have a data set of 500 people with rounded height and weight, the plot will not show all the points clearly; see figure 6 6
7 Figure 6: Scatter plot with moderate overplotting Here, simply jittering the data works well; see figure 7. Figure 7: Scatter plot with jittering However, if we have 10,000 points, jittering is not enough, see figure 8. 7
8 The SAS System Figure 8: Scatter plot with jittering, but N =
9 We can change the plotting character and its size with the following program: proc sgplot data = htwt; scatter x = jitht y = jitwt/ markerattrs = (size = 2 symbol = circlefilled); producing figure 9 Figure 9: Scatter plot with small symbols, N = An alternative is to abandon the scatter plot and use parallel boxplots: proc sgplot data = htwt; vbox wt/category = ht spread; *THE SPREAD OPTION PREVENTS OVERLAP; This produces figure 10 9
10 Figure 10: Parallel boxplot, N = One problem with this is that it does not give any indication of the density of height. Using GTL, we can add a barchart: proc template; define statgraph fancybox; begingraph; entrytitle "Box plot w/histogram"; layout lattice/rows = 2 columns = 1 order = columnmajor rowweights = (.8.2); columnaxes; columnaxis /griddisplay = on; columnaxis /label = griddisplay = on; endcolumnaxes; boxplot x = ht y = wt; endlayout; endgraph; end; barchart x = ht y = wt; proc sgrender data = htwt template = fancybox; This produces figure 12 10
11 The SAS System Figure 11: Parallel boxplot, N = This will look better if we externalize the axes: proc template; define statgraph fancybox2; begingraph; entrytitle "Box plot w/barchart"; layout lattice/rows = 2 columns = 1 rowweights = (.8.2) rowdatarange=union rowgutter=3px coldatarange = union; rowaxes; rowaxis / griddisplay=on label="weight" rowaxis / griddisplay=on label="%" endrowaxes; boxplot x = ht y = wt; barchart x = ht y = wt; columnaxes; columnaxis /griddisplay = on; endcolumnaxes; endlayout; endgraph; end; to get figure 12, below. 11
12 The SAS System Figure 12: Parallel boxplot, N = SUMMARY Scatter plots are a very valuable graphical tool. The SG procedures in SAS allow many scatter plots to be produced easily, and the graph template language allows very fine control over all aspects of a graph. REFERENCES [1] Kuhfeld, W. Statistical Graphics in SAS: An Introduction to the Graphical Template Language, SAS Press, Cary, NC, [2] CONTACT INFORMATION Peter L. Flom 515 West End Ave Apt 8C New York, NY [email protected] (917) SAS R and all other SAS Institute Inc., product or service names are registered trademarks ore trademarks of SAS Institute Inc., in the USA and other countries. R indicates USA registration. Other brand names and product names are registered trademarks or trademarks of their respective companies. 12
A Simplified Approach to Add Shading to your Graph using GTL
ABSTRACT Paper RF-011-2014 A Simplified Approach to Add Shading to your Graph using GTL Jennifer L Yen, Abbott Laboratories, Abbott Park, IL You have mastered the skill of using SAS annotate facility along
SAS 9.3 ODS Graphics. Getting Started with Business and Statistical Graphics. SAS Documentation
SAS 9.3 ODS Graphics Getting Started with Business and Statistical Graphics SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2011. SAS 9.3 ODS Graphics:
Describing, Exploring, and Comparing Data
24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter
Using PROC SGPLOT for Quick High Quality Graphs
Using PROC SGPLOT for Quick High Quality Graphs Lora D. Delwiche, University of California, Davis, CA Susan J. Slaughter, Avocet Solutions, Davis, CA ABSTRACT New with SAS 9.2, ODS Graphics introduces
Scatter Plots with Error Bars
Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each
Scientific Graphing in Excel 2010
Scientific Graphing in Excel 2010 When you start Excel, you will see the screen below. Various parts of the display are labelled in red, with arrows, to define the terms used in the remainder of this overview.
containing Kendall correlations; and the OUTH = option will create a data set containing Hoeffding statistics.
Getting Correlations Using PROC CORR Correlation analysis provides a method to measure the strength of a linear relationship between two numeric variables. PROC CORR can be used to compute Pearson product-moment
Plotting Against Cancer: Creating Oncology Plots Using SAS
ABSTRACT Paper SAS130-2014 Plotting Against Cancer: Creating Oncology Plots Using SAS Debpriya Sarkar, SAS Institute Inc., Pune, India Graphs in oncology studies are essential for getting more insight
5 Correlation and Data Exploration
5 Correlation and Data Exploration Correlation In Unit 3, we did some correlation analyses of data from studies related to the acquisition order and acquisition difficulty of English morphemes by both
SAS/GRAPH 9.2 ODS Graphics Editor. User s Guide
SAS/GRAPH 9.2 ODS Graphics Editor User s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2009. SAS/GRAPH 9.2: ODS Graphics Editor User's Guide. Cary, NC: SAS
Grade level: secondary Subject: mathematics Time required: 45 to 90 minutes
TI-Nspire Activity: Paint Can Dimensions By: Patsy Fagan and Angela Halsted Activity Overview Problem 1 explores the relationship between height and volume of a right cylinder, the height and surface area,
Summary of important mathematical operations and formulas (from first tutorial):
EXCEL Intermediate Tutorial Summary of important mathematical operations and formulas (from first tutorial): Operation Key Addition + Subtraction - Multiplication * Division / Exponential ^ To enter a
Dealing with Data in Excel 2010
Dealing with Data in Excel 2010 Excel provides the ability to do computations and graphing of data. Here we provide the basics and some advanced capabilities available in Excel that are useful for dealing
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Getting to know the data An important first step before performing any kind of statistical analysis is to familiarize
The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA
Paper 156-2010 The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA Abstract JMP has a rich set of visual displays that can help you see the information
Excel -- Creating Charts
Excel -- Creating Charts The saying goes, A picture is worth a thousand words, and so true. Professional looking charts give visual enhancement to your statistics, fiscal reports or presentation. Excel
Relationships Between Two Variables: Scatterplots and Correlation
Relationships Between Two Variables: Scatterplots and Correlation Example: Consider the population of cars manufactured in the U.S. What is the relationship (1) between engine size and horsepower? (2)
Updates to Graphing with Excel
Updates to Graphing with Excel NCC has recently upgraded to a new version of the Microsoft Office suite of programs. As such, many of the directions in the Biology Student Handbook for how to graph with
Graphing Quadratic Functions
Problem 1 The Parabola Examine the data in L 1 and L to the right. Let L 1 be the x- value and L be the y-values for a graph. 1. How are the x and y-values related? What pattern do you see? To enter the
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
Counting the Ways to Count in SAS. Imelda C. Go, South Carolina Department of Education, Columbia, SC
Paper CC 14 Counting the Ways to Count in SAS Imelda C. Go, South Carolina Department of Education, Columbia, SC ABSTRACT This paper first takes the reader through a progression of ways to count in SAS.
Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.
1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis
AP Physics 1 and 2 Lab Investigations
AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks
Box-and-Whisker Plots with The SAS System David Shannon, Amadeus Software Limited
Box-and-Whisker Plots with The SAS System David Shannon, Amadeus Software Limited Abstract One regularly used graphical method of presenting data is the box-and-whisker plot. Whilst the vast majority of
This activity will show you how to draw graphs of algebraic functions in Excel.
This activity will show you how to draw graphs of algebraic functions in Excel. Open a new Excel workbook. This is Excel in Office 2007. You may not have used this version before but it is very much the
Building a Better Dashboard Using Base SAS Software
PharmaSUG 2016 Paper AD12 Building a Better Dashboard Using Base SAS Software Kirk Paul Lafler, Software Intelligence Corporation, Spring Valley, California Roger Muller, Data-To-Events, Indianapolis,
CC03 PRODUCING SIMPLE AND QUICK GRAPHS WITH PROC GPLOT
1 CC03 PRODUCING SIMPLE AND QUICK GRAPHS WITH PROC GPLOT Sheng Zhang, Xingshu Zhu, Shuping Zhang, Weifeng Xu, Jane Liao, and Amy Gillespie Merck and Co. Inc, Upper Gwynedd, PA Abstract PROC GPLOT is a
There are six different windows that can be opened when using SPSS. The following will give a description of each of them.
SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet
What Does the Normal Distribution Sound Like?
What Does the Normal Distribution Sound Like? Ananda Jayawardhana Pittsburg State University [email protected] Published: June 2013 Overview of Lesson In this activity, students conduct an investigation
MULTIPLE REGRESSION EXAMPLE
MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if
SPSS Introduction. Yi Li
SPSS Introduction Yi Li Note: The report is based on the websites below http://glimo.vub.ac.be/downloads/eng_spss_basic.pdf http://academic.udayton.edu/gregelvers/psy216/spss http://www.nursing.ucdenver.edu/pdf/factoranalysishowto.pdf
Each function call carries out a single task associated with drawing the graph.
Chapter 3 Graphics with R 3.1 Low-Level Graphics R has extensive facilities for producing graphs. There are both low- and high-level graphics facilities. The low-level graphics facilities provide basic
Formulas, Functions and Charts
Formulas, Functions and Charts :: 167 8 Formulas, Functions and Charts 8.1 INTRODUCTION In this leson you can enter formula and functions and perform mathematical calcualtions. You will also be able to
Descriptive Statistics
Descriptive Statistics Descriptive statistics consist of methods for organizing and summarizing data. It includes the construction of graphs, charts and tables, as well various descriptive measures such
Using SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
Innovative Techniques and Tools to Detect Data Quality Problems
Paper DM05 Innovative Techniques and Tools to Detect Data Quality Problems Hong Qi and Allan Glaser Merck & Co., Inc., Upper Gwynnedd, PA ABSTRACT High quality data are essential for accurate and meaningful
Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation
Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation Display and Summarize Correlation for Direction and Strength Properties of Correlation Regression Line Cengage
Getting started with qplot
Chapter 2 Getting started with qplot 2.1 Introduction In this chapter, you will learn to make a wide variety of plots with your first ggplot2 function, qplot(), short for quick plot. qplot makes it easy
Data Visualization Tips and Techniques for Effective Communication
ABSTRACT PharmaSUG 2013 - Paper DG10 Data Visualization Tips and Techniques for Effective Communication LeRoy Bessler Bessler Consulting and Research, Mequon, Milwaukee, Wisconsin, USA This tutorial presents
Session 7 Bivariate Data and Analysis
Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares
FREE FALL. Introduction. Reference Young and Freedman, University Physics, 12 th Edition: Chapter 2, section 2.5
Physics 161 FREE FALL Introduction This experiment is designed to study the motion of an object that is accelerated by the force of gravity. It also serves as an introduction to the data analysis capabilities
Introduction Course in SPSS - Evening 1
ETH Zürich Seminar für Statistik Introduction Course in SPSS - Evening 1 Seminar für Statistik, ETH Zürich All data used during the course can be downloaded from the following ftp server: ftp://stat.ethz.ch/u/sfs/spsskurs/
A Determination of g, the Acceleration Due to Gravity, from Newton's Laws of Motion
A Determination of g, the Acceleration Due to Gravity, from Newton's Laws of Motion Objective In the experiment you will determine the cart acceleration, a, and the friction force, f, experimentally for
Examples of Data Representation using Tables, Graphs and Charts
Examples of Data Representation using Tables, Graphs and Charts This document discusses how to properly display numerical data. It discusses the differences between tables and graphs and it discusses various
Graphing in SAS Software
Graphing in SAS Software Prepared by International SAS Training and Consulting Destiny Corporation 100 Great Meadow Rd Suite 601 - Wethersfield, CT 06109-2379 Phone: (860) 721-1684 - 1-800-7TRAINING Fax:
HSPA 10 CSI Investigation Height and Foot Length: An Exercise in Graphing
HSPA 10 CSI Investigation Height and Foot Length: An Exercise in Graphing In this activity, you will play the role of crime scene investigator. The remains of two individuals have recently been found trapped
Graphing in excel on the Mac
Graphing in excel on the Mac Quick Reference for people who just need a reminder The easiest thing is to have a single series, with y data in the column to the left of the x- data. Select the data and
Microsoft Excel 2010 Charts and Graphs
Microsoft Excel 2010 Charts and Graphs Email: [email protected] Web Page: http://training.health.ufl.edu Microsoft Excel 2010: Charts and Graphs 2.0 hours Topics include data groupings; creating
Embedded Special Characters Kiran Karidi, Mahipal Vanam, and Sridhar Dodlapati
PharmaSUG2010 - Paper CC19 Embedded Special Characters Kiran Karidi, Mahipal Vanam, and Sridhar Dodlapati ABSTRACT When the report generated from the clinical trial data requires to show lot of information
Chapter 32 Histograms and Bar Charts. Chapter Table of Contents VARIABLES...470 METHOD...471 OUTPUT...472 REFERENCES...474
Chapter 32 Histograms and Bar Charts Chapter Table of Contents VARIABLES...470 METHOD...471 OUTPUT...472 REFERENCES...474 467 Part 3. Introduction 468 Chapter 32 Histograms and Bar Charts Bar charts are
Using Excel for Handling, Graphing, and Analyzing Scientific Data:
Using Excel for Handling, Graphing, and Analyzing Scientific Data: A Resource for Science and Mathematics Students Scott A. Sinex Barbara A. Gage Department of Physical Sciences and Engineering Prince
SPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,
FRICTION, WORK, AND THE INCLINED PLANE
FRICTION, WORK, AND THE INCLINED PLANE Objective: To measure the coefficient of static and inetic friction between a bloc and an inclined plane and to examine the relationship between the plane s angle
2) The three categories of forecasting models are time series, quantitative, and qualitative. 2)
Exam Name TRUE/FALSE. Write 'T' if the statement is true and 'F' if the statement is false. 1) Regression is always a superior forecasting method to exponential smoothing, so regression should be used
Scatter Chart. Segmented Bar Chart. Overlay Chart
Data Visualization Using Java and VRML Lingxiao Li, Art Barnes, SAS Institute Inc., Cary, NC ABSTRACT Java and VRML (Virtual Reality Modeling Language) are tools with tremendous potential for creating
Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:
Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:
Swimmer Plot: Tell a Graphical Story of Your Time to Response Data Using PROC SGPLOT Stacey D. Phillips, Inventiv Health Clinical, Princeton, NJ
PharmaSUG 2014 - Paper DG07 Swimmer Plot: Tell a Graphical Story of Your Time to Response Data Using PROC SGPLOT Stacey D. Phillips, Inventiv Health Clinical, Princeton, NJ ABSTRACT ODS Statistical Graphics
Probability Distributions
CHAPTER 5 Probability Distributions CHAPTER OUTLINE 5.1 Probability Distribution of a Discrete Random Variable 5.2 Mean and Standard Deviation of a Probability Distribution 5.3 The Binomial Distribution
Activity 6 Graphing Linear Equations
Activity 6 Graphing Linear Equations TEACHER NOTES Topic Area: Algebra NCTM Standard: Represent and analyze mathematical situations and structures using algebraic symbols Objective: The student will be
Part 1: Background - Graphing
Department of Physics and Geology Graphing Astronomy 1401 Equipment Needed Qty Computer with Data Studio Software 1 1.1 Graphing Part 1: Background - Graphing In science it is very important to find and
Getting Correct Results from PROC REG
Getting Correct Results from PROC REG Nathaniel Derby, Statis Pro Data Analytics, Seattle, WA ABSTRACT PROC REG, SAS s implementation of linear regression, is often used to fit a line without checking
Chapter 9 Descriptive Statistics for Bivariate Data
9.1 Introduction 215 Chapter 9 Descriptive Statistics for Bivariate Data 9.1 Introduction We discussed univariate data description (methods used to eplore the distribution of the values of a single variable)
CHAPTER 1: SPREADSHEET BASICS. AMZN Stock Prices Date Price 2003 54.43 2004 34.13 2005 39.86 2006 38.09 2007 89.15 2008 69.58
1. Suppose that at the beginning of October 2003 you purchased shares in Amazon.com (NASDAQ: AMZN). It is now five years later and you decide to evaluate your holdings to see if you have done well with
CHARTS AND GRAPHS INTRODUCTION USING SPSS TO DRAW GRAPHS SPSS GRAPH OPTIONS CAG08
CHARTS AND GRAPHS INTRODUCTION SPSS and Excel each contain a number of options for producing what are sometimes known as business graphics - i.e. statistical charts and diagrams. This handout explores
EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.
EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. Excel is powerful tool and can make your life easier if you are proficient in using it. You will need to use Excel to complete most of your
Excel Tutorial. Bio 150B Excel Tutorial 1
Bio 15B Excel Tutorial 1 Excel Tutorial As part of your laboratory write-ups and reports during this semester you will be required to collect and present data in an appropriate format. To organize and
Describing Relationships between Two Variables
Describing Relationships between Two Variables Up until now, we have dealt, for the most part, with just one variable at a time. This variable, when measured on many different subjects or objects, took
AP * Statistics Review. Descriptive Statistics
AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production
Creating Charts in Microsoft Excel A supplement to Chapter 5 of Quantitative Approaches in Business Studies
Creating Charts in Microsoft Excel A supplement to Chapter 5 of Quantitative Approaches in Business Studies Components of a Chart 1 Chart types 2 Data tables 4 The Chart Wizard 5 Column Charts 7 Line charts
In this chapter, you will learn to use cost-volume-profit analysis.
2.0 Chapter Introduction In this chapter, you will learn to use cost-volume-profit analysis. Assumptions. When you acquire supplies or services, you normally expect to pay a smaller price per unit as the
Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2
Lesson 4 Part 1 Relationships between two numerical variables 1 Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables
Your Resume Selling Yourself Using SAS
Your Resume Selling Yourself Using SAS Rebecca Ottesen, California Polytechnic State University, San Luis Obispo, CA Susan J. Slaughter, Avocet Solutions, Davis CA ABSTRACT These days selling yourself
Introduction to Exploratory Data Analysis
Introduction to Exploratory Data Analysis A SpaceStat Software Tutorial Copyright 2013, BioMedware, Inc. (www.biomedware.com). All rights reserved. SpaceStat and BioMedware are trademarks of BioMedware,
Lab 1: The metric system measurement of length and weight
Lab 1: The metric system measurement of length and weight Introduction The scientific community and the majority of nations throughout the world use the metric system to record quantities such as length,
Diagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
Microsoft Excel 2010 Part 3: Advanced Excel
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES Microsoft Excel 2010 Part 3: Advanced Excel Winter 2015, Version 1.0 Table of Contents Introduction...2 Sorting Data...2 Sorting
Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data
Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable
SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout
Analyzing Data SPSS Resources 1. See website (readings) for SPSS tutorial & Stats handout Don t have your own copy of SPSS? 1. Use the libraries to analyze your data 2. Download a trial version of SPSS
Chapter 4 Creating Charts and Graphs
Calc Guide Chapter 4 OpenOffice.org Copyright This document is Copyright 2006 by its contributors as listed in the section titled Authors. You can distribute it and/or modify it under the terms of either
Statgraphics Getting started
Statgraphics Getting started The aim of this exercise is to introduce you to some of the basic features of the Statgraphics software. Starting Statgraphics 1. Log in to your PC, using the usual procedure
Unit 9 Describing Relationships in Scatter Plots and Line Graphs
Unit 9 Describing Relationships in Scatter Plots and Line Graphs Objectives: To construct and interpret a scatter plot or line graph for two quantitative variables To recognize linear relationships, non-linear
7 Time series analysis
7 Time series analysis In Chapters 16, 17, 33 36 in Zuur, Ieno and Smith (2007), various time series techniques are discussed. Applying these methods in Brodgar is straightforward, and most choices are
Lesson 3.2.1 Using Lines to Make Predictions
STATWAY INSTRUCTOR NOTES i INSTRUCTOR SPECIFIC MATERIAL IS INDENTED AND APPEARS IN GREY ESTIMATED TIME 50 minutes MATERIALS REQUIRED Overhead or electronic display of scatterplots in lesson BRIEF DESCRIPTION
Homework 8 Solutions
Math 17, Section 2 Spring 2011 Homework 8 Solutions Assignment Chapter 7: 7.36, 7.40 Chapter 8: 8.14, 8.16, 8.28, 8.36 (a-d), 8.38, 8.62 Chapter 9: 9.4, 9.14 Chapter 7 7.36] a) A scatterplot is given below.
Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering
Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques
Dynamic Dashboards Using Base-SAS Software
Paper TT-01-2015 Dynamic Dashboards Using Base-SAS Software Kirk Paul Lafler, Software Intelligence Corporation, Spring Valley, California Abstract Dynamic interactive visual displays known as dashboards
. 58 58 60 62 64 66 68 70 72 74 76 78 Father s height (inches)
PEARSON S FATHER-SON DATA The following scatter diagram shows the heights of 1,0 fathers and their full-grown sons, in England, circa 1900 There is one dot for each father-son pair Heights of fathers and
Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab
1 Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab I m sure you ve wondered about the absorbency of paper towel brands as you ve quickly tried to mop up spilled soda from
Point Biserial Correlation Tests
Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable
Years after 2000. US Student to Teacher Ratio 0 16.048 1 15.893 2 15.900 3 15.900 4 15.800 5 15.657 6 15.540
To complete this technology assignment, you should already have created a scatter plot for your data on your calculator and/or in Excel. You could do this with any two columns of data, but for demonstration
Excel Chart Best Practices
Excel Chart Best Practices Robert Balik, Western Michigan University Abstract Virtually everyone in personal financial planning, student, teacher or practitioner, uses charts. The charts convey information
Intermediate PowerPoint
Intermediate PowerPoint Charts and Templates By: Jim Waddell Last modified: January 2002 Topics to be covered: Creating Charts 2 Creating the chart. 2 Line Charts and Scatter Plots 4 Making a Line Chart.
R Graphics Cookbook. Chang O'REILLY. Winston. Tokyo. Beijing Cambridge. Farnham Koln Sebastopol
R Graphics Cookbook Winston Chang Beijing Cambridge Farnham Koln Sebastopol O'REILLY Tokyo Table of Contents Preface ix 1. R Basics 1 1.1. Installing a Package 1 1.2. Loading a Package 2 1.3. Loading a
Curve Fitting, Loglog Plots, and Semilog Plots 1
Curve Fitting, Loglog Plots, and Semilog Plots 1 In this MATLAB exercise, you will learn how to plot data and how to fit lines to your data. Suppose you are measuring the height h of a seedling as it grows.
Pearson's Correlation Tests
Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation
Data analysis and regression in Stata
Data analysis and regression in Stata This handout shows how the weekly beer sales series might be analyzed with Stata (the software package now used for teaching stats at Kellogg), for purposes of comparing
Chapter 27 Using Predictor Variables. Chapter Table of Contents
Chapter 27 Using Predictor Variables Chapter Table of Contents LINEAR TREND...1329 TIME TREND CURVES...1330 REGRESSORS...1332 ADJUSTMENTS...1334 DYNAMIC REGRESSOR...1335 INTERVENTIONS...1339 TheInterventionSpecificationWindow...1339
Chapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
Plotting: Customizing the Graph
Plotting: Customizing the Graph Data Plots: General Tips Making a Data Plot Active Within a graph layer, only one data plot can be active. A data plot must be set active before you can use the Data Selector
