Effective Visualization Techniques for Data Discovery and Analysis



Similar documents
Visualization Quick Guide

Data Visualization Techniques

Data Visualization Techniques

White Paper. Data Visualization Techniques. From Basics to Big Data With SAS Visual Analytics

Top 5 best practices for creating effective dashboards. and the 7 mistakes you don t want to make

HOW TO USE DATA VISUALIZATION TO WIN OVER YOUR AUDIENCE

CSU, Fresno - Institutional Research, Assessment and Planning - Dmitri Rogulkin

GRAPHING DATA FOR DECISION-MAKING

Visualizing Historical Agricultural Data: The Current State of the Art Irwin Anolik (USDA National Agricultural Statistics Service)

visualization pitfalls (and how to avoid them)

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015

Data Visualization & Dashboard Design Best Practices and Tips

5 Tips for Creating Compelling Dashboards

Numbers as pictures: Examples of data visualization from the Business Employment Dynamics program. October 2009

IBM Cognos Insight. Independently explore, visualize, model and share insights without IT assistance. Highlights. IBM Software Business Analytics

top 5 best practices for creating effective campaign dashboards and the 7 mistakes you don t want to make

Principles of Data Visualization

Best Practices. Create a Better VoC Report. Three Data Visualization Techniques to Get Your Reports Noticed

3D Interactive Information Visualization: Guidelines from experience and analysis of applications

Empower Individuals and Teams with Agile Data Visualizations in the Cloud

Quantitative Displays for Combining Time-Series and Part-to-Whole Relationships

Interactive Voting System. IVS-Basic IVS-Professional 4.4

Build Your First Web-based Report Using the SAS 9.2 Business Intelligence Clients

Integrating SAS with JMP to Build an Interactive Application

White Paper. Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics

TEXT-FILLED STACKED AREA GRAPHS Martin Kraus

Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER

SAS BI Dashboard 3.1. User s Guide

MetroBoston DataCommon Training

Executive Dashboard Cookbook

PharmaSUG Paper DG06

Get The Picture: Visualizing Financial Data part 1

THE ULTIMATE GUIDE TO DATA VISUALIZATION

an introduction to VISUALIZING DATA by joel laumans

Designing a Poster using MS-PowerPoint

JMP Training. jmp.com/training

SAS VISUAL ANALYTICS AN OVERVIEW OF POWERFUL DISCOVERY, ANALYSIS AND REPORTING

TIBCO Spotfire Network Analytics 1.1. User s Manual

The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA

Chapter 6: Constructing and Interpreting Graphic Displays of Behavioral Data

SAS Information Delivery Portal: Organize your Organization's Reporting

DATA VISUALIZATION: When Data Speaks Business PRODUCT ANALYSIS REPORT IBM COGNOS BUSINESS INTELLIGENCE. Technology Evaluation Centers

Your Resume Selling Yourself Using SAS

MicroStrategy Analytics Express User Guide

Security visualisation

SYNTHESIO PRODUCT GUIDE THE EXPERIENCE RELEASE

A Visualization is Worth a Thousand Tables: How IBM Business Analytics Lets Users See Big Data

Microsoft Visio 2010 Business Intelligence

Better planning and forecasting with IBM Predictive Analytics

STEPHEN FEW SHOW ME THE NUMBERS

Visualizing Multidimensional Data Through Time Stephen Few July 2005

Whitepaper. Building Effective Dashboards: Measuring for Success.

Data Analytics: Exploiting the Data Warehouse

IM 2701 Multimedia Design and Web Development BSc in Business Information Systems. Chapter 01 Introduction to Multimedia Design

Effective Big Data Visualization

Motion Charts: Telling Stories with Statistics

Qlik s Associative Model

SAS BI Dashboard 4.4. User's Guide Second Edition. SAS Documentation

Unleash your intuition

How to Use Boards for Competitive Intelligence

Expert Color Choices for Presenting Data

Dashboard Design for Rich and Rapid Monitoring

Winning with an Intuitive Business Intelligence Solution for Midsize Companies

Business white paper. The road to strategic website design The Optimost Web Optimization Maturity Model

Five Tips for Presenting Data Analyses: Telling a Good Story with Data

Visualizing Clinical Trial Data Matt Becker, SAS Institute

2012 VISUAL ART STANDARDS GRADES K-1-2

Get to the Point HOW GOOD DATA VISUALIZATION IMPROVES BUSINESS DECISIONS

Bazaarvoice Intelligence reference guide

Building Capacity to Use Infographic Tools

Table of Contents Find the story within your data

Visualizing Data from Government Census and Surveys: Plans for the Future

Choosing Colors for Data Visualization Maureen Stone January 17, 2006

Create Mobile, Compelling Dashboards with Trusted Business Warehouse Data

THE ULTIMATE BEST PRACTICES

Data Visualization Handbook

ElegantJ BI. White Paper. Achieve a Complete Business Picture with a Business Intelligence (BI) Dashboard

An Introduction to Design Thinking PROCESS GUIDE

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

Dashboard Reporting Business Intelligence

IBM Cognos TM1 Executive Viewer Fast self-service analytics

Choosing a successful structure for your visualization

The Clear Path to Business Intelligence

Qlik Sense Enterprise

The main imovie window is divided into six major parts.

Market Research with Social Media

Best Practices for Dashboard Design with SAP BusinessObjects Design Studio

Transcription:

WHITE PAPER Effective Visualization Techniques for Data Discovery and Analysis Chuck Pirrello, SAS Institute, Cary, NC

Table of Contents Abstract... 1 Introduction... 1 Visual Analytics... 1 Static Graphs... 2 Guidelines For Creating Static Graphs:... 2 It s all about the data... 2 2-D or 3?... 2 Plan your color scheme... 4 Conserve ink... 4 Use the right type of graph... 4 How many is too many?... 4 Keep time axes honest... 6 Interactive Graphs... 7 Supporting Analytics... 9 Presentation... 10 Conclusion... 11 Recommended Reading... 11 Contact Information... 11 i

Abstract One of the best ways to understand data is through data visualization. The art of presenting data visually has matured from static graphs and dashboard widgets to interactive, animated graphs. Adding advanced analytics to support these graphs provides a competitive edge to companies by helping them better explore and understand their data, predict potential outcomes and decide with confidence. This paper discusses the power of visualization for analyzing data and spotting trends you can act on. Effective visual techniques show how users can better explore and understand their data, discover trends and patterns, and communicate their findings to a non-technical, non-analytical audience. Introduction Most information researchers agree that the volume of electronically stored data is doubling every 18 months. Some say it s happening even faster. Regardless of the actual time frame, data is being generated faster than it can be consumed and digested. Companies and individuals are constantly being challenged to make decisions more quickly with less time to analyze data that is required to support their conclusions and proposed courses of action. This presentation offers best practices for analyzing and displaying data graphically, using techniques that enhance clarity and shorten the path to discovery. Visual Analytics The components of visual analytics are interactive graphics and advanced analytics. Those elements combine to create statistical models that allow analysts to synthesize large amounts of data, detect trends and patterns, and help discover important and unexpected content. The resulting insights permit analysts to make high-quality human judgments more quickly. Visual analytics is the science of analytical reasoning facilitated by interactive visual interfaces [and] seeks to marry techniques from information visualization with techniques from computational transformation and analysis of data. The computer finds patterns in the information and organizes it in ways that are meant to be revealing to the analyst. The analyst supplies his or her knowledge in ways that help the computer refine and organize information more appropriately. Working together, they are much more powerful than each one working separately. (Illuminating the Path: The Research and Development Agenda for Visual Analytics, National Visualization and Analytics Center, 2004.) 1

Static Graphs Static graphs are appropriate when a snapshot or series of snapshots can convey the intended information. Care should be taken not to overload the graph with too much decoration that can conceal the data. Conversely, not showing enough information in a graph also can dilute its effectiveness. The failure of poorly prepared graphs was painfully and tragically evident with the Challenger O-ring incident in 1986, which resulted in the explosion of Space Shuttle Challenger shortly after liftoff. Thirteen charts were prepared to make the case [about the danger posed by the O-rings], but the charts were unconvincing. The engineers correctly identified the O-ring failure at low temperatures, but the displays chosen to present the evidence did not adequately show the cause of the failure and overcome the bias of decision makers. Displays obscured the data, and the wrong decision to launch was made. The consequences were tragic. (Illuminating the Path: The Research and Development Agenda for Visual Analytics, National Visualization and Analytics Center, 2004.) Guidelines For Creating Static Graphs: It s all about the data All too often, users get hung up on the bling factor in creating graphs. They think that if they wow the viewer with elaborate, attention-getting images, it will help them see the results more clearly. In fact, overblown images often obscure the meaning of the data. Graphs should bring the data to life, not conceal it. 2-D or 3? Most people use a third dimension to dress up a graph rather than to show another variable. Such things as 3-D bars with shadows do nothing more than add a visual effect that, while possibly more pleasing to the eye, contributes no new information. Avoid using 3-D graphs merely to imitate depth or shadows. As a rule, try first to create the visual in 2-D. If all the information can be clearly conveyed, stay with that design. Sometimes a third dimension is used to represent a third variable. However, you can often convey the same information on a 2-D chart by varying the color or size of the objects shown. 2

If you feel you must use 3-D, do so only with interactive graphs, not static ones. The viewer should be able to rotate the graph to see all the information, even though you risk concealing some information by revealing other information. Figure 1. Adding a third dimension frequently adds little value for the viewer. In the top image here, the 3-D effect is misleading. The values are the same for both graphs. The second graph is merely displayed in two dimensions. 3

Plan your color scheme Beauty may be in the eye of the beholder, but there are some tried and true eyepleasing colors that appeal to most people and don t act as a distraction. For guidelines on color schemes, go to http://colorbrewer2.org/. Conserve ink Too much ink can conceal the actual data. Use light gray lines for axes and tick marks, and don t show unnecessary tick marks on graphs. As a basic guideline, use muted colors for all graphic elements other than the data and its graphical representation (i.e., bars, lines, etc.). Use the right type of graph When comparing the results among several entities, bar graphs typically provide the best visual. Bar graphs convey differences mainly through the differences in length, color or size of the bars. Care should be taken not to overload bar graphs with too many colors or varied bar widths for these attributes make it more difficult to see the relative length of the bars. Line graphs are the best choice for data that is typically viewed over time, because they allow viewers to see how values have progressed and where each one may be headed. How many is too many? One common mistake is trying to put all the data into one graph, resulting in a graph that is too cluttered to make any sense. However, there is also a danger in creating too many graphs that must be viewed one at a time. This approach risks taxing the viewer s memory, making it difficult for him or her to grasp important details and comparisons. One way to solve this dilemma is by creating a display that includes several small graphs. Small multiples, or trellis displays, use a series of small graphs along the same x and y scales to present information in such a way that the viewer can read the data from graph to graph in a continuous fashion. Such a display allows the viewer to consume more information and facilitates comparisons. 4

Small multiples should exhibit the following characteristics: 1 Individual graphs differ only in terms of the data that they display. Each graph displays a subset of a single larger data set. Graphs are identical in shape, type and size, and they share the same categorical and quantitative scales. Graphs can be arranged horizontally, vertically or in a matrix. Graphs are sequenced in a meaningful order. Figure 2. A small-multiples graph facilitates comparisons. 1 Now You See It: Simple Visualization Techniques for Quantitative Analysis, Stephen Few, Analytics Press, 2009.) 5

Keep time axes honest A frequent misuse of time axes is to skew the tick marks to exaggerate or downplay the significance of the data. Figure 3. The top graph shows a more gradual increase over time. The second graph misleads the viewer into thinking the rise over time is much more dramatic by changing midstream from yearly increments to five-year increments. 6

Interactive Graphs While static graphs are commonly used to convey findings that result from analysis, they do not usually yield new insights. Interactive graphs are much more likely than static graphs to provide insights that lead to discovery. Static graphs can quickly become cluttered when large amounts of information are required for understanding and decision-making. Interactive graphs let analysts spoon feed the audience, revealing details progressively instead of all at once so that the information becomes more digestible. Visual representations should invite the user and audience to explore the data, but exploration is not possible if they cannot interact with the data. The ability to isolate and reorganize information appropriately leads to better understanding of trends and anomalies, and engages them in the analytic reasoning process. It is through these interactions that the analyst achieves insight. Interaction takes four basic forms: selecting desired data, reorganizing data, zooming and data filtering. 1. In an interactive graph, selecting an object in one graph results in a selection in a linked graph, as seen here. Figure 4. Selecting bars in the bar graph on the left causes the distribution graph on the right to reflect the selection. 2. Reorganizing data means letting the user redefine or reorient the data. This type of interaction can include the x and y axes, the legend, and any other dimensions the graph is capable of displaying. 3. Zooming allows the user to enlarge any portion of a graph to see more detail. 7

4. Filtering allows the user to narrow the view of the data, as in the following graph. Figure 5. The filter on the left governs what data is displayed in the graph on the right. Interactive graphs should be created with the same visual guidelines as static graphs. Their interactions and results should be displayed immediately to provide the analyst with the ability to navigate through data seamlessly, making it easier to spot trends and patterns. The ability to see data unfold before one s eyes can provide insights that static data simply cannot convey. 8

Supporting Analytics Visuals that help users analyze their data transcend the capabilities of traditional graphs. They are navigational vehicles that let users peer into their data. Those that provide built-in methods for synthesizing and processing data offer analysts a deeper understanding of what the data represents and how future results might unfold. For example, the ability to segment data based on similar results over time, or to predict potential outcomes by varying a number of inputs that contribute to a desired output (known as Monte Carlo simulation), enables analysts to move beyond merely dissecting data to gaining valuable insights into cause-and-effect relationships. A better understanding of root causes can facilitate decisions that are more likely to achieve desired results. Figure 6. The Prediction Profiler in JMP shows the effect of selected inputs on the desired output. The display in Figure 6 uses Monte Carlo simulation to predict the potential outcomes of targeted values. The windows marked with a 1 represent each variable that contributes to the desired result: profit (2). A user can adjust the values of each input variable (1) by clicking and dragging the vertical red dotted line or by simply entering the value (shown in red below each variable display box.). As the user changes the input variables, the desired result is recalculated and displayed immediately. To use the Monte Carlo simulation, the user marks each variable as random or fixed, selects the distribution method (Normal is shown in this example), enters the number of iterations (5,000 or more) to be performed, and clicks on the SIMULATE button. The display window (4) will show the results of the simulation. In this example, values above the red line in display window 4 indicate profit, and values below the line indicate loss. The rate shows the percentage of the simulated outcomes that resulted in a loss (i.e., the potential percentage of transactions that will not generate a profit). The value of this interactive graph is the immediacy with which scenarios can be generated and displayed. Its speed in showing results makes it an invaluable tool in the decision-making process. 9

Presentation The ultimate goal of visual analytics is not creation of the graphics themselves, but the resulting insights that inform decisions. The ultimate value of a graph is not how well it is designed, but rather how well it conveys a call to action. There are three questions an analyst must ask when preparing to present findings: 1. Do I have a set of visuals that clearly show my findings? That is, can a viewer with no more knowledge than what is presented here understand what the visuals show? 2. Is the level of detail in my graph appropriate for my audience? Showing too much detail might overwhelm viewers, while showing too little could fail to convince them. One benefit of an interactive graph is that the analyst can start with the minimum amount of data and reveal more as needed by altering the graph, filtering and zooming. 3. Do I have a clear, concise narrative to accompany my visuals? This question is often overlooked. All presenters need to be good storytellers. It s the only way their work can come alive and convince the audience of the merits of their conclusion, persuading the audience to take action. The narrative must be thorough, but not too detailed. It must be appropriate for the audience, and it should always start with a statement of why the audience should listen. This statement should include enough information about the presenter to establish her credibility, if necessary. A valuable guide to storytelling is The Story Factor: Inspiration, Influence, and Persuasion Through the Art of Storytelling, by Annette Simmons. Figure 7. This bubble plot shows fluctuations in home prices over time. When animated, it shows how prices changed between 1975 and 2009. 10

Animated graphs require a good narration to be well understood. Once put in motion, the visuals need to reveal insights or guide viewers to areas of greatest interest. In the example shown in Figure 7, each bubble represents a state and the rise and fall of housing prices there over time. When put in motion, each bubble rises over time, showing the pace and direction of price fluctuations. Conclusion Properly prepared graphs promote understanding of large amounts of data. Graphs that allow for user interaction and that are supported by underlying analytics offer even better understanding and insight. Care must be taken not to clutter graphs with unnecessary decoration, which can conceal the data that they are intended to surface. The analyst must match the data to be analyzed with the appropriate graph type. However, even well designed, interactive graphs can fail to achieve their goal if they are not accompanied by compelling narratives that not only explain, but also illuminate. In addition to good analytical skills, data analysts must also hone their design and storytelling skills to effectively create and deliver their findings. Recommended Reading Graphic design any and all books by Stephen Few and Edward Tufte. Color themes www.colorbrewer.org Storytelling The Story Factor: Inspiration, Influence, and Persuasion Through the Art of Storytelling, 2001, by Annette Simmons. Contact Information Your comments and questions are valued and encouraged. Contact the author at: Name: Chuck Pirrello Enterprise: Product Manager, Visual Analytics, SAS Work phone: 919-531-4918 E-mail: Charles.Pirrello@sas.com Web: www.jmp.com 11

SAS Institute Inc. World Headquarters +1 919 677 8000 To contact your local JMP office, please visit: www.jmp.com... SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. Copyright 2010, SAS Institute Inc. All rights reserved. 104502_S56266.0510