Multi-Dimensional Data Visualization. Slides courtesy of Chris North



Similar documents
Cours de Visualisation d'information InfoVis Lecture. Multivariate Data Sets

Information Visualization Multivariate Data Visualization Krešimir Matković

Diagrams and Graphs of Statistical Data

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

Visualization Techniques in Data Mining

Chapter 3 - Multidimensional Information Visualization II

Iris Sample Data Set. Basic Visualization Techniques: Charts, Graphs and Maps. Summary Statistics. Frequency and Mode

COM CO P 5318 Da t Da a t Explora Explor t a ion and Analysis y Chapte Chapt r e 3

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

CS171 Visualization. The Visualization Alphabet: Marks and Channels. Alexander Lex [xkcd]

Data Exploration Data Visualization

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

TIES443. Lecture 9: Visualization. Lecture 9. Course webpage: November 17, 2006

An example. Visualization? An example. Scientific Visualization. This talk. Information Visualization & Visual Analytics. 30 items, 30 x 3 values

Visualization of Multivariate Data. Dr. Yan Liu Department of Biomedical, Industrial and Human Factors Engineering Wright State University

The Value of Visualization 2

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015

A Comparative Study of Visualization Techniques for Data Mining

Information visualization examples

Visualization Quick Guide

Data Visualization. or Graphical Data Presentation. Jerzy Stefanowski Instytut Informatyki

COMP Visualization. Lecture 11 Interacting with Visualizations

Data Exploration and Preprocessing. Data Mining and Text Mining (UIC Politecnico di Milano)

Graphical Representation of Multivariate Data

Data Visualization Handbook

an introduction to VISUALIZING DATA by joel laumans

Clustering & Visualization

Chapter 3 - Multidimensional Information Visualization II

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

3D Interactive Information Visualization: Guidelines from experience and analysis of applications

Time Series Data Visualization

Data Visualization - A Very Rough Guide

Choosing a successful structure for your visualization

Scatterplot Layout for High-dimensional Data Visualization

Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller

Hierarchical Data Visualization

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Visualization methods for patent data

Visualizing High-density Clusters in Multidimensional Data

Week 1. Exploratory Data Analysis

Exploratory Data Analysis with MATLAB

Visual Data Mining with Pixel-oriented Visualization Techniques

Recognition. Sanja Fidler CSC420: Intro to Image Understanding 1 / 28

9. Text & Documents. Visualizing and Searching Documents. Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08

Exploratory Spatial Data Analysis

Visualization Software

Data Visualization Principles: Interaction, Filtering, Aggregation

TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:

Microsoft Business Intelligence Visualization Comparisons by Tool

The course: An Introduction to Information Visualization Techniques for Exploring Large Database

Big Data in Pictures: Data Visualization

MTH 140 Statistics Videos

Variable: characteristic that varies from one individual to another in the population

MetroBoston DataCommon Training

Data Visualization Techniques

By LaBRI INRIA Information Visualization Team

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures

A Survey on Multivariate Data Visualization

Information Visualization

Lecture 2. Summarizing the Sample

?????? Data Analytics

Visual Data Mining. Motivation. Why Visual Data Mining. Integration of visualization and data mining : Chidroop Madhavarapu CSE 591:Visual Analytics

Visualizations. Cyclical data. Comparison. What would you like to show? Composition. Simple share of total. Relative and absolute differences matter

Introduction to Multivariate Analysis

Visibility optimization for data visualization: A Survey of Issues and Techniques

Hierarchy and Tree Visualization

Data Visualization in R

Raising the Bar (Chart)

Exercise 1.12 (Pg )

High-Dimensional Visualizations

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Business Intelligence and Process Modelling

Human-Computer Interaction

High Dimensional Data Visualization

Independence Diagrams: A Technique for Visual Data Mining

Summarizing and Displaying Categorical Data

1. Then f has a relative maximum at x = c if f(c) f(x) for all values of x in some

Data Visualization Techniques

Visualization and Visual Analytics

TEXT-FILLED STACKED AREA GRAPHS Martin Kraus

Data Preprocessing. Week 2

Statistics Chapter 2

Microsoft Excel 2010 Pivot Tables

Analyzing The Role Of Dimension Arrangement For Data Visualization in Radviz

Visual Mining of E-Customer Behavior Using Pixel Bar Charts

Graph/Network Visualization

Criteria for Evaluating Visual EDA Tools

Transcription:

Multi-Dimensional Data Visualization Slides courtesy of Chris North

What is the Cleveland s ranking for quantitative data among the visual variables: Angle, area, length, position, color

Where are we?! Tabular (multi-dimensional) " Spatial & Temporal " 1D / 2D " 3D " Networks " Trees " Graphs " Text & Documents # Fundamentals / Eval " Navigation strategies " Overview strategies " Interaction techniques " Design " Development " Evaluation

The Simple Stuff Univariate Bivariate Trivariate

Univariate Dot plot Bar chart (item vs. attribute) Tukey box plot Histogram

Bivariate Scatterplot

Trivariate 3D scatterplot, spin plot 2D plot + size (or color )

Multi-Dimensional Data Each attribute defines a dimension Small # of dimensions easy Data mapping, Cleveland s rules What about many dimensional data? n-d What does 10-D space look like?

Map n-d space onto 2-D screen Visual representations: Complex glyphs E.g. star glyphs, faces, embedded visualization, Multiple views E.g. plot matrices, brushing histograms, Spotfire, Non-orthogonal axes E.g. Parallel coords, star coords, Tabular layout Interactions: E.g. TableLens, Dynamic Queries Brushing & Linking Selecting for details,

10 Parameters: Glyphs: Chernoff Faces Head Eccentricity Eye Eccentricity Pupil Size Eyebrow Slope Nose Size Mouth Vertical Offset Eye Spacing Eye Size Mouth Width Mouth Openness http://hesketh.com/schampeo/projects/faces/chernoff.html

Glyphs: Stars d7 d1 d2 d6 d5 d4 d3 http://hal.inria.fr/docs/00/78/15/04/ PDF/Fuchs_2013_EOA.pdf

Multiple Views with Brushing-and-linking Robert Kosara: brush and linking, Parallel Coordinates: http://vimeo.com/13437693

All pairs of attributes Brushing and linking Scatterplot Matrix http://noppa5.pc.helsinki.fi/koe/3d3.html

on steroids

Different Arrangements of Axes Axes are good Lays out all points in a single space position is 1 st in Cleveland s rules Uniform treatment of dimensions Space > 3D? Must trash orthogonality

Parallel Coordinates Inselberg, Multidimensional detective (parallel coordinates)

Parallel Coordinates Forget about Cartesian orthogonal axes (0,1,-1,2)= x y z w 0 0 0 0

Star Plot 1 8 2 7 3 6 5 4 Parallel Coordinates with axes arranged radially

Star Coordinates Kandogan, Star Coordinates

Star Coordinates Cartesian d1 P=(v1, v2) Star Coordinates P=(v1,v2,v3,v4,v5,v6,v7,v8) d1 p d8 v2 v3 d2 v4 v1 v2 d2 d7 v1 p v5 d3 v8 v6 d6 Mapping: Items dots Σ attribute vectors (x,y) d5 v7 d4

Analysis

Table Lens Rao, Table Lens http://www.ramanarao.com/papers/ tablelens-chi94.pdf

Spenke, FOCUS FOCUS / InfoZoom Finding correlations between the measurements and the occurrence of a thrombosis

VisDB & Pixel Bar Charts Keim, VisDB

Comparison of Techniques

Comparison of Techniques ParCood: <1000 items, <20 attrs» Relate between adjacent attr pairs StarCoord: <1,000,000 items, <20 attrs» Interaction intensive TableLens: similar to par-coords» more items with aggregation» Relate 1:m attrs (sorting), short learn time Visdb: 100,000 items with 10 attrs» Items*attrs = screenspace, long learn time, must query Spotfire : <1,000,000 items, <10 attrs (DQ many)» Filtering, short learn time

Scaling up further Beyond 20 dimensions? 1. Interaction E.g. Offload some dims to Dynamic Query sliders, 2. Reduce dimensionality of the data E.g. Multi-dimensional scaling (MDS) later 3. Visualize features of the dimensions, instead of the data E.g. rank-by-feature Tableau

Seo, et al. Rank-by-Feature

Combining multiple data types Multi-Dimensional: PathBubbles: UMBC

Demographics data: Exercise Multi-Dimensional attributes (multiple measures over time) Geospatial map Static visual representation, no interaction (must visually relate all attributes to geospatial) Design choices?

Your solutions

1. Small Multiples Multiple views: 1 or more attribute / map Scalability issues: perception display 1976

2. Embedded Visualizations Complex glyphs: For each location, show vis of all attributes

2D vs. 3D

3. 2D vs. 3D Occlusion shadow changes the colors? Chart junk Patterns -> area; perspectives

Texts? 2D vs. 3D: text? Feiner, Augmented reality

Mental modal vs. Features VIS externalization media

Stereo, tracking, larger FOV? http://www.youtube.com/watch?v=i1x4-g6wbtu#t=29 Linsen group, the effect of stereoscopic immersive environment on projection-based multi-dimensional data visualization. IV 2013. Tasks: 1. count the cluster; 2/3. find closest cluster to a specific point/cluster; 4. detect the densest cluster; 5. find the most distant cluster pair; 6. count the outliers; 7. find the closest cluster to select group of points; 8. name all of the pairs of overlapping clusters

Time series Van wijk and van Selow 99, # of people inside and amount of power used over the course of each day for one full year.

2.5D Petra Isenberg, Hybrid-image vis (2013) Today: Alex Garbarino (pathway analysis)

2.5D Seokhee Hong

Multi-Dimensional Functions

Multi-Dimensional Functions y = f(x 1, x 2, x 3,, x n ) Continuous: E.g. y = x 1 3 + 2x 2 2-9x 3 Discrete: x i are uniformly sampled in a bounded region E.g. x i = [0,1,2,,100] E.g. measured density in a 3D material under range of pressures and room temperatures.

Independent variables Dependent variables Variables

Relations: Relations vs. Functions R(A, B, C, D, E, F) All dependent variables (1 ind.var.?) Sparse points in multi-d dep.var. space Functions: well behaved relations R(A, B, C, D, E, F, Y) : Y=f(A, B, C, D, E, F) Many independent variables Defined at every point in multi-d ind.var. space ( onto ) Huge scale: 6D with 10 samples/d = 1,000,000 data points

Relation or function? Function Function Relation Neither Function nor Relation?

Multi-D Relation Visualizations Don t work well for multi-d functions Example: Parallel coords 5D func sampled on 1-9 for all ind.vars.

Typically want to encode ind.vars. as spatial attrs

1-D: Easy b = f(a) a x b y b a

2-D: Easy c = f(a, b) Height field: a x b y c z c b a

2-D: Easy c = f(a, b) b Heat map: a x b y c color a c

3-D: Hard d = f(a, b, c) Color volume: a x b y c z d color b c a What s inside?

4D: Really Hard y = f(x 1, x 2, x 3, x 4,, x n ) What does a 5D space look like? Approaches: Positioning: Recursive pattern (Keim) Scalable aggregation: immens (Liu) Nested coordinate frames (Worlds within Worlds) Slicing (HyperSlice) Radial Focus+Context (PolarEyez, Sanjini) What we talked about before: filtering overview

Position on paper Keim, Recursive pattern: a technique for visualization very large amounts of data

Aggregation: Binned plots Liu, Jiang and Heer, immens: Real-time visual querying of big data big : one million or more data cases Bin: adjacent intervals over a continuous range Scalability issues: perception interaction. Novelty: scalability should be limited by the chosen resolution of the visualized data, not the number of records.

Hierarchical Axes 1D view of 3D function: (Mihalisin et al.) f(x 1, x 2, x 3 ) x3 x2 x1

5D 9 samp/d as in TableLens

5D 9 sample/d

Nested Coordinate Frames Feiner, Worlds within Worlds

Slicing Van Wijk, HyperSlice

Slicing

Radial Focus+Context Jayaraman, PolarEyez infovis.cs.vt.edu x2 x1 x3 x4 x5 -x5 -x4 -x2 -x1

Comparison Hierarchical axes (Mihalisin): Nested coordinate frames (Worlds in Worlds) Slicing (HyperSlice): Radial Focus+Context (PolarEyez)

Comparison Hierarchical axes (Mihalisin): < 6d by 10 samples, ALL slices, view 2d at a time Nested coordinate frames (Worlds in Worlds) < 5-8d, continuous, no overview, 3d hardware Slicing (HyperSlice): < 10d by 100 samples, 2d slices Radial Focus+Context (PolarEyez) < 10d by 1000 samples, overview, all D uniform, rays

Review (9/17)

Review (9/17) Many ways to look at tables; multi-d data; what are they? Glyph (face; star glyphs; ) Coordinates (star coordinates; parallel coordinates) Table lens

Upcoming (9/17) Homework 2 (out tonight, due 9/29) Proposal due (10/20) Title of your final report (begin with the title of the project first) Meeting with the client / advisors Data abstraction Task abstraction Initial design Mandatory: talk to Jian about your project topic next Monday / Wednesday after class 2:30-3:30pm. You MUST have a client.