UNDERSTANDING HUMAN FACTORS OF BIG DATA VISUALIZATION



Similar documents
What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

Visualization and Feature Extraction, FLOW Spring School 2016 Prof. Dr. Tino Weinkauf. Flow Visualization. Image-Based Methods (integration-based)

Information visualization examples

Interactive Visual Data Analysis in the Times of Big Data

Good Scientific Visualization Practices + Python

Visualization Quick Guide

Introduction to Geographical Data Visualization

Enhanced LIC Pencil Filter

3D Data Visualization / Casey Reas

Visibility optimization for data visualization: A Survey of Issues and Techniques

The Information Processing model

Table of Contents Find the story within your data

Multivariate data visualization using shadow

9.63 Laboratory in Cognitive Science. Interaction: memory experiment

Introduction to Flow Visualization

3D Interactive Information Visualization: Guidelines from experience and analysis of applications

An Overview of Tests of Cognitive Spatial Ability

Teaching Expert Thinking (by Wendy Adams and Carl Wieman, CU-SEI and UBC-CWSEI, and Dan Schwartz, Stanford School of Education)

Multi-Attribute Glyphs on Venn and Euler Diagrams to Represent Data and Aid Visual Decoding

Skill acquisition. Skill acquisition: Closed loop theory Feedback guides learning a motor skill. Problems. Motor learning practice

an introduction to VISUALIZING DATA by joel laumans

Interpretive Report of WMS IV Testing

Information Visualization Multivariate Data Visualization Krešimir Matković

Chapter 3 - Multidimensional Information Visualization II

Chapter 7: Memory. Memory

Visualizing Uncertainty: Computer Science Perspective

Visualizing Multidimensional Data Through Time Stephen Few July 2005

Spatial Data Analysis

Table of contents. Overview... Page 2. Minimum size requirements... Page 8. Prohibited usage... Page 9. Symbol-only variations... Page 10.

Data, Measurements, Features

Ultra-High Resolution Digital Mosaics

Non-Stop Optical Illusions A Teacher s Guide to the Empire State Plaza Art Collection

VARIANCE REDUCTION TECHNIQUES FOR IMPLICIT MONTE CARLO SIMULATIONS

Google is an outstanding example of aesthetic and minimalist design. Its interface is as simple as possible. Unnecessary features and hyperlinks are

pm4dev, 2015 management for development series Project Schedule Management PROJECT MANAGEMENT FOR DEVELOPMENT ORGANIZATIONS

Supporting Employee Success. A Tool to Plan Accommodations that Support Success at Work

Corel Painter 8 Liquid Ink Visual Guide

DATA VISUALIZATION 101: HOW TO DESIGN CHARTS AND GRAPHS

The Art and Science of Teaching the Common Core State Standards

Choosing Colors for Data Visualization Maureen Stone January 17, 2006

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015

CS171 Visualization. The Visualization Alphabet: Marks and Channels. Alexander Lex [xkcd]

Frequency, definition Modifiability, existence of multiple operations & strategies

STRAND: Number and Operations Algebra Geometry Measurement Data Analysis and Probability STANDARD:

Visualization of Multivariate Data. Dr. Yan Liu Department of Biomedical, Industrial and Human Factors Engineering Wright State University

PRIMING OF POP-OUT AND CONSCIOUS PERCEPTION

Expert Color Choices for Presenting Data

Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall

Applications of Dynamic Representation Technologies in Multimedia Electronic Map

PART IV Performance oriented design, Performance testing, Performance tuning & Performance solutions. Outline. Performance oriented design

Big Data in Pictures: Data Visualization

Self-Critique: Retail Planogram Interface

Taking Inverse Graphics Seriously

Cognitive and Organizational Challenges of Big Data in Cyber Defense

Interactive Graphic Design Using Automatic Presentation Knowledge

SECURETRAY ADVERTISING COPY. g u i d e l i n e s & b e s t p r ac t i c e s

STEPHEN FEW SHOW ME THE NUMBERS

What constitutes good visualization research?

Data Visualization Handbook

Autodesk SketchBook. for Windows Tablet

McAFEE IDENTITY. October 2011

The key to successful web design is planning. Creating a wireframe can be part of this process.

Visualization in 4D Construction Management Software: A Review of Standards and Guidelines

Dobbin Day - User Guide

Learning and cognitive effects of acquired brain injury caused by meningitis or septicaemia

USDA Web Standards and Style Guide. Version 2.0

Truck Drivers' Behaviors and Rational Driving Assistance

Geocoding in Law Enforcement Final Report

Perception of Light and Color

How To Run Statistical Tests in Excel

visualization pitfalls (and how to avoid them)

A Comparative Study of Visualization Techniques for Data Mining

Process Intelligence: An Exciting New Frontier for Business Intelligence

Learning Design Principles for Power Point Presentations

Buckets: Visualizing NBA Shot Data CPSC 547 Project Proposal

Math. So we would say that the volume of this cube is: cubic units.

Digital Audio and Video Data

DESIGN THINKING FOR VISUALIZATION

3 Introduction to HCI - Human Factor Theories - Mental Models and Problem Solving. Mental models are useful example: the bath tub faucets

USING SELF-ORGANIZING MAPS FOR INFORMATION VISUALIZATION AND KNOWLEDGE DISCOVERY IN COMPLEX GEOSPATIAL DATASETS

Curriculum Map. Discipline: Computer Science Course: C++

COMP Visualization. Lecture 11 Interacting with Visualizations

Innovative Information Visualization of Electronic Health Record Data: a Systematic Review

ARTIFICIAL INTELLIGENCE METHODS IN EARLY MANUFACTURING TIME ESTIMATION

Principles of Instruction. Teaching Letter-Sound Associations by Rebecca Felton, PhD. Introduction

Alaska Link-16 Coverage Analysis and Network Modeling using STK

ACE: After Effects CS6

9. Text & Documents. Visualizing and Searching Documents. Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08

How strategic testing of your campaigns can yield better results. Wendy Griffin Marketing Coordinator, Campaigner

Transcription:

UNDERSTANDING HUMAN FACTORS OF BIG DATA VISUALIZATION Mark A. Livingston, Ph.D. Jonathan W. Decker Zhuming Ai, Ph.D. United States Naval Research Laboratory Washington, DC TTC Big Data Symposium Rosslyn, VA 17-18 June 214

Research Interests 1 How much data can we show before operator overload? Which multivariate visualization (MVV) is best for which task? Can users discover patterns without explicit representations? Does multivariate visualization solve big data visualization? Ultimate goal: explain why a visualization representation works (or doesn t)

MVV Techniques 2 Data driven Spots Oriented Slivers Attribute Blocks Color denotes variable Intensity encodes value Orientation denotes variable Intensity encodes value Position denotes variable Heat map encodes value

MVV Techniques Brush Strokes Color Blending Dimensional Stacking 3 Stroke properties encode variable Intensity, length, width, hue, and orientation encode value Hue encodes variable Weighted average of hues encodes value Boxes encode variable Discretized hue encodes value range; extensions use heat maps

Past Comparisons of MVVs Techniques Task(s) Findings Citation Gridded icons, Jittered grid icons, Brush strokes, Line integral convolution, Image-guided streamlines, Grid-seeded streamlines Localize critical values, Identify critical point type, Advect particle Image-guided streamlines 1.5 standard deviations (SD) below mean error and 1. SD below mean response time (RT) for advection; also 1. SD below mean error and RT for identifying critical point type; LIC 1. SD below mean error and RT for critical point localization [Laidlaw et al., 25] 4 Color weaving, color blending Read sequence of data values Weaving better by 4-11 percentage points [Hagh-Shenas et al., 26] Multi-layer texture synthesis, Brush strokes Localize critical values Texture synthesis better by 7.5 percentage points [Tang et al., 26] DDS, Oriented slivers, Brush strokes, Color blending, Stick figures Localize critical values #1 DDS: 39-58% of mean error of other techniques; #2 Slivers: 44-65%; of mean error of others; Monitor sensitivity [Livingston et al., VDA 211] DDS, Oriented slivers, Brush strokes, Color blending, Dimensional Stacking, baseline Detect greatest trends #1 Baseline: 48-57% of mean error of MVVs; #2 DDS: 61-72%; of mean error of remaining MVVs; Users fooled by extreme value, better if target was near distraction [Livingston et al., TVCG 211] DDS, Oriented slivers, Brush strokes, Color blending, Attribute blocks, baseline Detect greatest trends Attribute blocks better than dimensional stacking in previous test; main effect of distance to distraction [Livingston et al., VDA 212] DDS, Oriented slivers, Attribute blocks, baseline Detect multi-way overlap MVVs better than baseline; DDS best among MVVs; inconclusive evidence for density and for density gain as key perceptual feature to predict error with MVVs [Livingston et al., TVCG 212] DDS, Oriented slivers, Attribute blocks (3x2), Oriented DDS, Attribute blocks (6x6) Detect multi-way overlap Slivers and Attribute (3x2) best; new techniques were fast but inaccurate; confirmed density gain as key perceptual feature to predict error [Livingston et al., VDA 213]

Discussion Perceptual and Cognitive Interpretations

1. Use salient cues to direct exogenous attention Exogenous attention may be directed by varying color or texture in MVV. Attribute blocks Data-driven spots Oriented slivers

Relationship of Density to Error 2.5 3.5 ODDS 2. 1.6 1.5 1.2 1..8.5.4..8 1 1.2 1.4..9 1.1 1.3 1.5 1.7 Temporal AB R=-.35, p<.73 3. 2. 2. R=-.49, p<.14 Attribute R=-.38, p<.56 2. DDS Slivers R=-.45, p<.2 1.6 1.2.8.4..9 1.1 1.3 1.5 1.7 R=-.42, p<.3 7 2.5 2. 1.5 1.6 1.2.8 1..5..4.6.8 1 1.2 1.4.4..4.6.8 1 1.2 1.4

Earth Mover s Distance Results 8 Error 3 2.5 2 1.5 1.5 Attribute blocks Data-driven Spots Oriented Slivers maxemd vs error meanemd vs error 2 4 6 EMD Error.8.7.6.5.4.3.2.1.5 1 1.5 EMD Error 1.2 1.8.6.4.2 5 1 15 EMD Response Time (s) 2 16 12 8 4 maxemd vs time meanemd vs time 2 4 6 EMD Response Time (s) 2 18 16 14 12 1 8 6 4 2.5 1 EMD Response Time (s) 16 12 8 4 maxemd vs time meanemd vs time 5 1 15 EMD

Sensitivity to Monitor Settings 9 Error (normalized).2.18.16.14.12.1.8.6.4 Slivers BrushStrokes DataSpots ColorBlend StickGlyph StickFigures Significant difference Significant difference Significant difference.2. Lights on Factory settings Lights off Factory settings Lights on Altered settings

Edge Strength 1 Error 1 Data-driven spots Error 1 Oriented Slivers.8.8.6.6.4.4.2.2.5.6.7.8.9 1 Mean edge strength.5.6.7.8.9 1 Mean edge strength

Luminance, Target Size, & Value 11 Error 3 By target size 91 Error 3 By target value 4 2.5 2 61 31 2.5 2 5 6 1.5 1.5 1 1.5.5.5.1.15.2 Standard deviation of luminance.5.1.15.2 Standard deviation of luminance

2. Provide strong grouping cues for information Working memory has a limited capacity, but the contents can be facts or concepts. Knowledge structures, such as patterns in a visualization that may be associated with data properties, can be retrieved from long term memory through cues that give rise to similar encoding. Attribute blocks Data-driven spots Oriented slivers Temporal AB Oriented DDS User Error (Field value [-6]) 1.6 1.4 1.2 1.8.6.4.2 Slivers Attribute DDSpots Temporal ODDSpots

Trend Detection Study 13 Baseline visualization had least error Some techniques had ambiguous representations Needed two variables to complete task

3. Provide strong grouping cues to facilitate chunking Reduce competition for working memory capacity by helping user to group pieces of information into larger chunks. Assist this by perceptual cues. 2 Error Answers Time Workload Error (Target Layers) and Number of Changes to Answer 1.5 1.5 DDSpots Slivers Attribute Baseline 7 5 3 2 1 Time (sec) and Workload (TLX)

Critical Point Study Results 15 Error: Data driven Spots and Oriented Slivers best F(4,56)=9.8364, p=. Outliers not removed 7 for Z>4 or 24 for Z>3 Time: Data driven Spots and Color Blending best Intuitiveness? Giving up? F(4,56)=34.763, p=. Workload (TLX) User confidence? F(4,56)=4.9599, p=.2 Need only a single variable to complete task

Overlap Detection Task 16 1.6 32 Error (variables present) 1.2.8 24 16 Response Time (sec).4 8 Slivers Attribute DDSpots Temporal ODDSpots

4. Organize information for knowledge structures Help user recall mental models, which involve possibilities and projection; these are in turn similar to sense-making. This should help a user understand data and turn it into actionable information. Unfortunately, tasks in most user studies of multivariate visualization are quite low-level and performed by novice users. Laidlaw et al. (25) found that image-guided streamline placement assisted in advection of particles and identification of critical point type. This follows from the leverage point, since streamlines guide even novice users to the proper mental model of flow through the field, which is critical to each task. Tasks in Multivariate Visualization User Studies Identify critical point type Advect particle Localize critical values Read sequence of data values Detect greatest trends Detect multi-way overlap

Overlap Detection Task 18 Data-driven Spots Oriented slivers Attribute blocks Juxtaposed layers Error (layers).9.42.47 1.54 Time (sec) 7.33 9.62 9.74 46.72 2 Error Answers Time Workload Error (Target Layers) and Number of Changes to Answer 1.5 1.5 DDSpots Slivers Attribute Baseline Time (sec) and Workload (TLX)

5. Structure information to provide strong retrieval cues Structure information to provide strong retrieval cues for mental models to help analogical reasoning. Enable user to apply prior knowledge to new data patterns.

6. Develop training regimes for implicit learning Statistical regularities may be learned implicitly; thus, visualizations could prime users to recognize certain data relationships through trained visual patterns that may be recognized..2 Returning New Participants Error (pct).1 JuxtMap Strokes DDSpots ABlocks Blending Slivers

Trend Detection Follow up Study 21 Improved some multivariate techniques; failed to improve others Theories regarding saturation, total intensity, and color maps Techniques that did not integrate variables tightly considered for further studies

Future Directions Gather more data! New tasks for user studies More applied tasks? Beginning to unlock the meaning of various components of multivariate visualizations for the difficulty they provide in accomplishing visual analysis tasks Data driven spots: density and color difference, then edge strength Oriented slivers: edge strength variation, then density Attribute blocks: color variation, then intensity variation 22

Thank you! 23 Work supported by NRL Base Program Thanks to anonymous subjects and reviewers of papers