Exploring All of Your Options



Similar documents
BANA6037 Data Visualization Fall Semester 2014 (14FS) / First Half Session Section 001 S 9:00a- 12:50p Lindner 107

Quantitative vs. Categorical Data: A Difference Worth Knowing Stephen Few April 2005

Dashboard Design for Rich and Rapid Monitoring

Examples of Data Representation using Tables, Graphs and Charts

Business Intelligence. In God we trust, all others bring data. - W. Edwards Deming. Business Intelligence Moving from data to action

Data Visualization Basics for Students

Performance Dashboard Tutorial

Visualization Quick Guide

Plots, Curve-Fitting, and Data Modeling in Microsoft Excel

Numbers as pictures: Examples of data visualization from the Business Employment Dynamics program. October 2009

Data Visualization. BUS 230: Business and Economic Research and Communication

Quantitative Displays for Combining Time-Series and Part-to-Whole Relationships

Performance evaluation

This file contains 2 years of our interlibrary loan transactions downloaded from ILLiad. 70,000+ rows, multiple fields = an ideal file for pivot

Part 2: Data Visualization How to communicate complex ideas with simple, efficient and accurate data graphics

TABLEAU COURSE CONTENT. Presented By 3S Business Corporation Inc Call us at : Mail us at : info@3sbc.com

Sample Table. Columns. Column 1 Column 2 Column 3 Row 1 Cell 1 Cell 2 Cell 3 Row 2 Cell 4 Cell 5 Cell 6 Row 3 Cell 7 Cell 8 Cell 9.

Data representation and analysis in Excel

CSU, Fresno - Institutional Research, Assessment and Planning - Dmitri Rogulkin

Based on Chapter 11, Excel 2007 Dashboards & Reports (Alexander) and Create Dynamic Charts in Microsoft Office Excel 2007 and Beyond (Scheck)

Infographics in the Classroom: Using Data Visualization to Engage in Scientific Practices

Gestation Period as a function of Lifespan

Basic Excel Handbook

Graphic Design Basics. Shannon B. Neely. Pacific Northwest National Laboratory Graphics and Multimedia Design Group

Updates to Graphing with Excel

Data Visualization for Mobile Devices with OBI 11g. Collaborate 14

Introduction to Geographical Data Visualization

Choosing Colors for Data Visualization Maureen Stone January 17, 2006

CREATING EXCEL PIVOT TABLES AND PIVOT CHARTS FOR LIBRARY QUESTIONNAIRE RESULTS

Excel Guide for Finite Mathematics and Applied Calculus

How to make a line graph using Excel 2007

Dealing with Data in Excel 2010

How does the Review Program work?

Exercise 1: How to Record and Present Your Data Graphically Using Excel Dr. Chris Paradise, edited by Steven J. Price

Visualizing Multidimensional Data Through Time Stephen Few July 2005

Online Marketing Module COMP. Certified Online Marketing Professional. v2.0

HOW TO USE DATA VISUALIZATION TO WIN OVER YOUR AUDIENCE

BI SURVEY. The world s largest survey of business intelligence software users

Creating a Tableau Data Visualization on Cincinnati Crime By Jeffrey A. Shaffer

30 Chants for Better Charts

Common Mistakes in Data Presentation Stephen Few September 4, 2004

Break-even analysis. On page 256 of It s the Business textbook, the authors refer to an alternative approach to drawing a break-even chart.

MetroBoston DataCommon Training

5 Best Practices for Mobile Business Intelligence

Get The Picture: Visualizing Financial Data part 1

MARKETING BENCHMARKS 7,000+ from. Businesses

How To Trade With Properly Scaled Static Charts

8 Killer Tips: How to Use Facebook for Event Marketing

4. Are you satisfied with the outcome? Why or why not? Offer a solution and make a new graph (Figure 2).

Chapter 4 Creating Charts and Graphs

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015

Intro to Excel spreadsheets

Excel -- Creating Charts

Report Card Template Navigating Techniques

Drawing a histogram using Excel

Public Health Activities and Services Tracking (PHAST) Interactive Data Visualization Tool User Manual

2. Creating Bar Graphs with Excel 2007

Wednesday June 16, :30 am 3:30 pm

Effective Big Data Visualization

Identity Guide. HHMI Identity Guidelines V 1.2 1

Table of Contents. What is ProSite? What is Behance? How do ProSite & Behance work together? Get Started in 6 Easy Steps.

Sometimes We Must Raise Our Voices

2 Introduction to Nintex Workflow

Spreadsheet software for linear regression analysis

Expert Color Choices for Presenting Data

/ / TABLEAU TRAINING DURATION 30hrs

Summary of important mathematical operations and formulas (from first tutorial):

Creating an with Constant Contact. A step-by-step guide

Symantec Identity Guidelines. Version 3 - March 2012

2014 TALEND IDENTITY GUIDELINES

BANA6037 Data Visualization Fall Semester 2015 (15FS) / First Half Session Section 001 S 9:00a-12:50p Lindner 107

Introduction to Dashboards in Excel Craig W. Abbey Director of Institutional Analysis Academic Planning and Budget University at Buffalo

Best Practices for Dashboard Design with SAP BusinessObjects Design Studio

WEBFOCUS QUICK DATA FOR EXCEL

Exploratory data analysis (Chapter 2) Fall 2011

Getting the Most Out of SAS Visual Analytics: Design Tips for Creating More Stunning Reports

United States Department of Agriculture (USDA) Agricultural Marketing Service (AMS) Livestock and Grain Market News (LGMN)

Applying a circular load. Immediate and consolidation settlement. Deformed contours. Query points and query lines. Graph query.

Plotting Data with Microsoft Excel

Table of Contents Find the story within your data

2015 Marketing Guidelines Parallels IP Holdings GmbH. All rights reserved. Terms of Use Privacy Policy

CyI DOCTORAL THESIS TEMPLATE 1

The Point Cloud Library Logo

DIGITAL DESIGN APPLICATIONS Word Exam REVIEW

Create Charts in Excel

DATA VISUALISATION. A practical guide to producing effective visualisations for research communication

Executive Dashboard. User Guide

A Picture Really Is Worth a Thousand Words

Creating an with Constant Contact. A step-by-step guide

Poster Preparation & Presentation. Updated 4/27/2015, JC

Microsoft Office Excel 2007 Key Features. Office of Enterprise Development and Support Applications Support Group

DATA VISUALIZATION 101: HOW TO DESIGN CHARTS AND GRAPHS

Graphing Information

Tableau Data Visualization Cookbook

QUICK START GUIDE FOR CLUB WEBSITES

WORKING TOGETHER FOR SUCCESS

Scientific Graphing in Excel 2010

Transcription:

LÛCRUM, Inc. White Paper Exploring All of Your Options Data Visualization Written by: Jeffrey A. Shaffer February 21, 2013

ABOUT THE AUTHOR Jeffrey A. Shaffer is the Vice President of Information Technology and Analytics at Unifund. Mr. Shaffer joined Unifund in 1996 and has been instrumental in the creation and development of the complex systems, analytics and business intelligence platform at Unifund. Mr. Shaffer holds a BM and MM degree from the University of Cincinnati and an MBA from Xavier University where he was the winner of the 2006 Graduate Student Scholarly Project in Research. Mr. Shaffer has attended the Harvard Business School's Executive Education Program, is a Certified Manager of Quality and Organizational Excellence through the American Society for Quality, a Certified Project Management Professional through the Project Management Institute and has completed Six Sigma Green Belt and Black Belt training with the Xavier Consulting Group. Mr. Shaffer is also Adjunct Assistant Professor at the University of Cincinnati in the Carl H. Lindner College of Business teaching Data Visualization in the Graduate Course series for Data Analytics. He is also a regular speaker at business intelligence conferences and symposiums on the topic of data visualization, writes for the data visualization blog at MakingDataMeaningful.com for LÛCRUM, Inc. and was a finalist in the 2011 Tableau Interactive Visualization Competition. For more from Mr. Shaffer visit www.makingdatameaningful.com or visit his blog at www.highvizability.com.

Exploring All of Your Options: Data Visualization I recently read a blog post by Naomi Robbins from Forbes entitled Line Charts are Not Always the Best Way to Show Time Series. For those of you who are not familiar with Naomi Robbins you should add her to your reading list because she has some terrific posts regarding data visualization practices (http://blogs.forbes.com/naomirobbins/). This was an interesting post regarding her recent internet usage on a trip using a mifi for internet access. She writes: Many blogs and books suggest line charts or area charts similar to this one for time series but they don t work in this case since interpolation does not make sense. It is true about blogs and books suggesting line charts for time series data. In fact, when teaching data visualization at the University of Cincinnati I always reinforce to my students that time series data is best as a line chart. This is because we, as readers, typically understand time when plotted on the x- axis and we typically want to see a trend over time. This is the biggest advantage of a line chart as it shows trend over time better than any other chart type. Time series data is in two categories. In statistics we refer to this as continuous and discrete. Time series data that is continuous is where we have an observation at every point in time, for example an electrocardiogram at a hospital monitoring a patient s vital signs. Time series data that is discrete is where the observation is at spaced intervals, frequently evenly spaced intervals, for example monthly sales revenue. One might conclude from this that line charts would work better for the continuous data and a bar chart would be better for the discrete data, however, even discrete time series trends are typically displayed better as a line chart. Again, this is because the biggest advantage of a line chart is that it shows a trend over time very well. The example used by Naomi Robbins is her daily internet usage for what appears to be one monthly billing cycle, from December 7 th to January 6 th. I agree with her that the original area graph provided is not the best way to visualize this data. However, in this case a simple line chart does work well, even for the discrete time series data. Consider her redesign vs. a simple line chart.

In this case the line chart shows a clear trend for the period as well as the zero values. Although the zero values are not ideal, they are not distracting from the overall message here. The trend is clear as well, we have an increase up through December 9 th followed by a decline, with very light use on December 14 th. I would not consider this data as the exception to break the rule of line charts and further, since many users would have data over a the full billing cycle, a line would be better for covering a full month of days rather than displaying 30 bars each month. I m sure the application developer would rather avoid using special rules for showing different data in different ways. There are situations with certain datasets where a bar chart might be better than a line chart. One example would be fragmented dates where there are large gaps of time or unequal time intervals. However, even in these cases a line chart can be adapted to fit the needs of the data. Stephen Few has a terrific example of this on his blog in a redesign of Time Magazine vs. Newsweek (http://www.perceptualedge.com/example14.php). In this example he uses the lines spread out over their appropriate interval and connects unknown points with dotted lines. Another example is when there are lots of zero values in the data, and if you plot it as a line chart then there would simply be a line at zero along the x- axis for the majority of the data. In this case, it would be better to just plot the point that has the data and not all of the zeros. Consider this example below which one of my students created (as part of a dashboard) showing rainfall in inches over a 4- year period of time on Halloween. Why is a line chart difficult in this example? The primary reason is because there is no trend to show here, and 3 of the 4 values are zero. In this case plotting that single point would be better. A bar chart would have worked too or a simple dot for the point. In this case I chose a lollipop chart for the single point of data. A few design points to mention as well. I adjusted the scale because in the original the.75 inches was plotted as a very large number when in fact it is not that much rain for the day and was really just light rain during trick or treating. Another redesign point to mention here is that there is no reason to list 9 points on the y- axis to give the reader the ability to approximate 1 point of actual data, so in this case it s better to simply plot that data point and remove the y- axis completely.

Naomi continues her discussion in another blog post entitled How to Position Y- Axis Labels in Graphs, and in this post discusses the various placements of the y- axis labels. She offers four comparisons for the data usage chart and explains why the y- axis title is positioned vertically. See her examples here on her blog: http://blogs- images.forbes.com/naomirobbins/files/2013/02/yaxispositions.jpg. She offers some comparisons of the chart explaining that the vertical placement of the y- axis title is better than some other alternatives and based on the alternatives she offered I would agree. However, I don t think we ve considered all of the available options. There are simple alternatives that would avoid this problem and when creating data visualizations we should not feel like we are boxed in or stuck with making bad choices. I will use her bar chart design to make some comparisons and try to demonstrate a number of different options. It appears that Naomi was keeping the overall size of her chart the same, so I will do the same and keep each chart s overall footprint exactly the same. In the first comparison below I am comparing her design to the design I used above. Simply adding a subtitle at the top denoting that the units are in gigabytes eliminates the need for the y- axis title completely. Another option is to add in GB after Usage in the main title. In this next redesign, I chose to drop the word usage from the y- axis title. You could also provide a very detailed description in the y- axis title if necessary and still keep it vertical by breaking it up on more than one line, although in this case I don t think this is necessary. Also a better title for the chart might help.

Another interesting point is that Naomi chose to remove the redundant information on the y- axis, the label GB over and over again, yet on the x- axis this remains with Dec. Each date has Dec being used one after the other. In her four- chart redesign this causes an issue because when using the day and month together the x- axis labels will auto rotate in Excel when the space is limited, which should be avoided. Therefore she contends that rotating the y- axis label is better. However, using any of the options above would avoid this issue. Even if it could not be avoided we still have other alternatives. There are even options as we cross over months, for example if this data happened to start on December 28 th instead of December 7 th.

After having walked through all of these steps, I think a very simple line chart is the best way to visualize this data. Even with the discrete dates over a very short period of time and the two days of zero value, the simple line chart serves the purpose of displaying the trend to the reader. Finally, I would like to point out some finer design details that you may or may not have noticed. If you look back through the charts that I presented you will notice that I avoided black text. This was done purposefully. When black text is used it s as if you are starting a music concert at the loudest point. The only place to go from there requires another use of emphasis, for example, bold, italics or a larger font size. You will have much more control over the reader s attention if you widen your range. In this case, by using gray as the default it will give you the ability to drive attention in more ways. By changing your default font color to a medium to dark gray (along with gridlines, axis lines, titles, etc.) then you will have more options when you want to drive attention to a specific area of your visualization or maybe a single data point.

In this example, the only emphasis added to the single data label is by using a black font color contrasting the rest of the visualization that is using gray. The black font color here stands out almost as much as the title that is in a much larger font size and bold. This final example is taken from a Site Statistics chart in WordPress. This is a great example of where they used a design very similar to Naomi s redesign. In this case, a line chart would have produced zero values for 12 of the 15 days and the technique used in this case, the same technique as Naomi s recommendation, achieved a much better solution. Also notice the very clean graph with many of the same features described above, as well as a band in gray for the weekend days. This graph is very well done. I hope that you find these tips helpful and happy charting!

LÛCRUM, Incorporated 2013 Contact Us Today: Erin Macdonald Marketing Associate emacdonald@lucruminc.com 513.241.5949 x234 LÛCRUM, Inc. 7755 Montgomery Road, Suite 160 Cincinnati, Ohio 45236 LÛCRUM creates long- term relationships with its clients to build Business Intelligence and Analytics Solutions that increase profitability, drive operational efficiency and maximize decision- making capabilities. Visit our website: www.lucruminc.com Like us on Facebook: www.facebook.com/lucruminc Follow us on Twitter: @LucrumInc Connect on LinkedIn.