Data Visualization - A Very Rough Guide



Similar documents
Multivariate Data Visualization

DATA VISUALIZATION. Lecture 1 Introduction. Lin Lu llu@sdu.edu.cn

Iris Sample Data Set. Basic Visualization Techniques: Charts, Graphs and Maps. Summary Statistics. Frequency and Mode

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

Data Exploration Data Visualization

The Value of Visualization 2

USING SELF-ORGANIZING MAPS FOR INFORMATION VISUALIZATION AND KNOWLEDGE DISCOVERY IN COMPLEX GEOSPATIAL DATASETS

COM CO P 5318 Da t Da a t Explora Explor t a ion and Analysis y Chapte Chapt r e 3

Spatio-Temporal Mapping -A Technique for Overview Visualization of Time-Series Datasets-

Data Visualization. or Graphical Data Presentation. Jerzy Stefanowski Instytut Informatyki

Chapter 3 - Multidimensional Information Visualization II

VISUALIZATION OF GEOSPATIAL METADATA FOR SELECTING GEOGRAPHIC DATASETS

Graphical Representation of Multivariate Data

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

Information Visualization Multivariate Data Visualization Krešimir Matković

VisCMD: Visualizing Cloud Modeling Data

Multi-Dimensional Data Visualization. Slides courtesy of Chris North

Operator-Centric Design Patterns for Information Visualization Software

A Framework for the Visualization of Multidimensional and Multivariate Data

Hypervariate Information Visualization

Information Visualization. Ronald Peikert SciVis Information Visualization 10-1

Data Exploration and Preprocessing. Data Mining and Text Mining (UIC Politecnico di Milano)

Exploratory Visualization of Data with Variable Quality

Visualization methods for patent data

Big Data: Rethinking Text Visualization

TIES443. Lecture 9: Visualization. Lecture 9. Course webpage: November 17, 2006

Visualization Techniques in Data Mining

The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA

How To Create A Data Visualization

Data Mining and Visualization

An example. Visualization? An example. Scientific Visualization. This talk. Information Visualization & Visual Analytics. 30 items, 30 x 3 values

2 Visual Analytics. 2.1 Application of Visual Analytics

Visualization for eresearch: Past, Present and Future

2D, 3D and High-Dimensional Data and Information Visualization

CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler

. Address the following issues in your solution:

Geovisual Analytics Exploring and analyzing large spatial and multivariate data. Prof Mikael Jern & Civ IngTobias Åström.

Cours de Visualisation d'information InfoVis Lecture. Multivariate Data Sets

Visualization and Astronomy

Introduction to Visualization with VTK and ParaView

Data. Visualization process. Segmented, adapted, chosen data. Sampled or simulated (original) data. Display Data (geometry information)

Visualization of Multivariate Data. Dr. Yan Liu Department of Biomedical, Industrial and Human Factors Engineering Wright State University

COC131 Data Mining - Clustering

CourseVis: Externalising Student Information to Facilitate Instructors in Distance Learning

Flexible delivery of visualization software and services

COSC 6344 Visualization

Clutter Reduction in Multi-Dimensional Data Visualization Using Dimension. reordering.

High-Performance Visualization of Geographic Data

Offshore Wind Farm Layout Design A Systems Engineering Approach. B. J. Gribben, N. Williams, D. Ranford Frazer-Nash Consultancy

is in plane V. However, it may be more convenient to introduce a plane coordinate system in V.

Interactive Visual Data Analysis in the Times of Big Data

Interactive Information Visualization using Graphics Hardware Študentská vedecká konferencia 2006

Analyzing The Role Of Dimension Arrangement For Data Visualization in Radviz

On History of Information Visualization

MATH1231 Algebra, 2015 Chapter 7: Linear maps

Hydrogeological Data Visualization

Visualisatie BMT. Introduction, visualization, visualization pipeline. Arjan Kok Huub van de Wetering

DataPA OpenAnalytics End User Training

Introduction to Principal Component Analysis: Stock Market Values

Exploratory Data Analysis with MATLAB

VisIVO, a VO-Enabled tool for Scientific Visualization and Data Analysis: Overview and Demo

A SIMULATOR FOR LOAD BALANCING ANALYSIS IN DISTRIBUTED SYSTEMS

Domain Analysis: A Technique to Design A User-Centered Visualization Framework

Visualization à la Unix TM

Norwegian Satellite Earth Observation Database for Marine and Polar Research USE CASES

VISUALIZING HIERARCHICAL DATA. Graham Wills SPSS Inc.,

Exploratory Visualization of Multivariate Data with Variable Quality

Visualizing High-density Clusters in Multidimensional Data

Visual Data Mining : the case of VITAMIN System and other software

LET S GO BACK TO THE VERY FIRST HISTORICAL KNOWN EXAMPLES OF INFORMATION VISUALIZATIONS

BIG DATA VISUALIZATION. Team Impossible Peter Vilim, Sruthi Mayuram Krithivasan, Matt Burrough, and Ismini Lourentzou

Visual Data Exploration Techniques for System Administration. Tam Weng Seng

Microsoft Business Intelligence Visualization Comparisons by Tool

The VisuLab : an Instrument for Interactive, Comparative Visualization

Integrating Cluster Formation and Cluster Evaluation in Interactive Visual Analysis

MEng, BSc Applied Computer Science

Why are we teaching you VisIt?

Spreadsheet software for linear regression analysis

Linking Scientific and Information Visualization with Interactive 3D Scatterplots

Design and Deployment of Specialized Visualizations for Weather-Sensitive Electric Distribution Operations

Atomic Force Microscope and Magnetic Force Microscope Background Information

Computer Graphics and Visualization in a Computational Science Program

Interactive Data Mining and Visualization

BiCluster Viewer: A Visualization Tool for Analyzing Gene Expression Data

Data Visualization. Principles and Practice. Second Edition. Alexandru Telea

Facts about Visualization Pipelines, applicable to VisIt and ParaView

Information Literacy Program

Prefetching for Visual Data Exploration

A Short Introduction on Data Visualization. Guoning Chen

By LaBRI INRIA Information Visualization Team

Visualizing Data: Scalable Interactivity

MEng, BSc Computer Science with Artificial Intelligence

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining

Visualizations for High Dimensional Data Mining - Table Visualizations

Information Visualization WS 2013/14 11 Visual Analytics

HDDVis: An Interactive Tool for High Dimensional Data Visualization

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015

GeoGebra. 10 lessons. Gerrit Stols

Transcription:

Data Visualization - A Very Rough Guide Ken Brodlie University of Leeds 1

What is This Thing Called Visualization? Visualization Use of computersupported, interactive, visual representations of data to amplify cognition (Card, McKinlay, Shneiderman) Born as a discipline in 1987 with publication of NSF Report Now widely used in computational science and engineering Vis5D 2

Visualization Twin Subjects Visualization Twin Subjects Scientific Visualization Visualization of physical data Information Visualization Visualization of abstract data Ozone layer around earth Automobile web site - visualizing links 3

Scientific Visualization Another Characterisation Focus is on visualizing an entity measured in a multi-dimensional space 1D 2D 3D Occasionally nd Underlying field is recreated from the sampled data Relationship between variables well understood some independent, some dependent http://pacific.commerce.ubc.ca/xr/plot.html Image from D. Bartz and M. Meissner 4

Scientific Visualization Model Scientific Visualization Model Visualization represented as pipeline: Read in data data model visualize render Build model of underlying entity Construct a visualization in terms of geometry Render geometry as image Realised as modular visualization environment IRIS Explorer IBM Open Visualization Data Explorer (DX) AVS 5

Extending the SciVis Model Extending the SciVis Model The dataflow model has proved extremely flexible Provides basis of collaborative visualization Implemented in IRIS Explorer as the COVISA toolkit Extensible User code introduced as module in pipeline allows computational steering data model visualize render collaborative server internet render control simulate visualize render 6

An e-science Demonstrator An e-science Demonstrator Emergency scenario: release of toxic chemical Simulation launched on Grid resource, steered from desktop using IRIS Explorer Collaborators linked in remotely using COVISA toolkit Dispersion of pollutant studied under varying wind directions A collaborator links in over the network 7

Other Metaphors Other Metaphors Other user interface metaphors have been suggested Spreadsheet interface becoming popular.. Allows audit trail of visualizations Jankun-Kelly and Ma 8

Information Visualization Information Visualization Focus is on visualizing set of observations that are multi-variate Example of iris data set 150 observations of 4 variables (length, width of petal and sepal) Techniques aim to display relationships between variables 9

Dataflow for Information Visualization Again we can express as a dataflow but emphasis now is on data itself rather than underlying entity First step is to form the data into a table of observations, each observation being a set of values of the variables Then we apply a visualization technique as before data data table observations 1 2 visualize A.... variables B.... render C.... 10

Multivariate Visualization Multivariate Visualization Techniques designed for any number of variables Glyph techniques Parallel co-ordinates Scatter plot matrices Pixel-based techniques Software: Xmdvtool Matthew Ward Acknowledgement: Many of images in following slides taken from Ward s work..and also IRIS Explorer! 11

Glyph Techniques Glyph Techniques Star plots Each observation represented as a star Each spike represents a variable Length of spike indicates the value Variety of possible glyphs Chernoff faces Crime in Detroit 12

Parallel Co-ordinates Parallel Co-ordinates Each variate represented as vertical axis Axes laid out uniformly Observation represented as a polyline traversing all M axes, crossing each axis at the observed value of the variate Detroit homicide data (7 variables,13 observations) 13

Scatter Plot Matrices Scatter Plot Matrices Matrix of 2D scatter plots Each plot shows projection of data onto a 2D subspace of the variates Order M 2 plots 14

The Screen Space Problem The Screen Space Problem All techniques, sooner or later, run out of screen space Parallel coordinates Usable for up to 150 variates Unworkable greater than 250 variates Remote sensing: 5 variates, 16,384 observations) 15

Brushing as a Solution Brushing as a Solution Brushing selects a restricted range of one or more variables Selection then highlighted 16

Clustering as a Solution Clustering as a Solution Success has been achieved through clustering of observations Hierarchical parallel co-ordinates Cluster by similarity Display using translucency and proximity-based colour 17

Hierarchical Parallel Coordinates 18

Reduction of Dimensionality of Variate Space Reduce number of variables, preserve information Principal Component Analysis Transform to new coordinate system Hard to interpret Hierarchical reduction of variate space Cluster variables where distance between observations is typically small Choose representative for each cluster 19

Using a Dataflow System for Information Visualization IRIS Explorer used to visualize data from BMW Five variables displayed using spatial arrangement for three, colour and object type for others Notice the clusters More later.. Kraus & Ertl 20

Scientific Visualization Information Visualization Scientific Visualization Focus is on visualizing an entity measured in a multi-dimensional space Underlying field is recreated from the sampled data Relationship between variables well understood Information Visualization Focus is on visualizing set of observations that are multi-variate There is no underlying field it is the data itself we want to visualize The relationship between variables is not well understood 21