Visibility optimization for data visualization: A Survey of Issues and Techniques



Similar documents
Data Visualization Techniques

Data Visualization Techniques

Diagrams and Graphs of Statistical Data

Information visualization examples

DESIGN OF CLUSTER OF SIP SERVER BY LOAD BALANCER

The basics of storytelling through numbers

The Comparisons. Grade Levels Comparisons. Focal PSSM K-8. Points PSSM CCSS 9-12 PSSM CCSS. Color Coding Legend. Not Identified in the Grade Band

HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER

Summarizing and Displaying Categorical Data

Visualization Quick Guide

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015

Perception of Light and Color

Iris Sample Data Set. Basic Visualization Techniques: Charts, Graphs and Maps. Summary Statistics. Frequency and Mode

Problem of the Month Through the Grapevine

Multivariate data visualization using shadow

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

Time Series Data Visualization

Choosing Colors for Data Visualization Maureen Stone January 17, 2006

Bernice E. Rogowitz and Holly E. Rushmeier IBM TJ Watson Research Center, P.O. Box 704, Yorktown Heights, NY USA

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Colour Image Segmentation Technique for Screen Printing

GEO-VISUALIZATION SUPPORT FOR MULTIDIMENSIONAL CLUSTERING

Expert Color Choices for Presenting Data

Data Visualization Handbook

Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall

Simultaneous Gamma Correction and Registration in the Frequency Domain

All Visualizations Documentation

Video compression: Performance of available codec software

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Data Visualization Best Practice. Sophie Sparkes Data Analyst

Quantitative Displays for Combining Time-Series and Part-to-Whole Relationships

DHL Data Mining Project. Customer Segmentation with Clustering

The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA

Mining Signatures in Healthcare Data Based on Event Sequences and its Applications

SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL

Introduction to Computer Graphics

How To Filter Spam Image From A Picture By Color Or Color

Correlation and Regression

Visualization Techniques in Data Mining

Part 2: Data Visualization How to communicate complex ideas with simple, efficient and accurate data graphics

Character Image Patterns as Big Data

Visualization of large data sets using MDS combined with LVQ.

Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller

Good Scientific Visualization Practices + Python

Using Data Mining for Mobile Communication Clustering and Characterization

DATA VISUALIZATION GABRIEL PARODI STUDY MATERIAL: PRINCIPLES OF GEOGRAPHIC INFORMATION SYSTEMS AN INTRODUCTORY TEXTBOOK CHAPTER 7

Environmental Remote Sensing GEOG 2021

Visualization methods for patent data

A Scientific Visualization Schema Incorporating Perceptual Concepts

Analysis of Preview Behavior in E-Book System

an introduction to VISUALIZING DATA by joel laumans

Multi-Attribute Glyphs on Venn and Euler Diagrams to Represent Data and Aid Visual Decoding

Flexible Web Visualization for Alert-Based Network Security Analytics

Using Lexical Similarity in Handwritten Word Recognition

Choosing a successful structure for your visualization

Exploratory data analysis (Chapter 2) Fall 2011

REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING

Extend Table Lens for High-Dimensional Data Visualization and Classification Mining

Information Visualization Multivariate Data Visualization Krešimir Matković

Northumberland Knowledge

Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies

The Value of Visualization 2

Taking Inverse Graphics Seriously

Multidimensional Data Visualization Tools

TEXT-FILLED STACKED AREA GRAPHS Martin Kraus

Business Intelligence and Process Modelling

Prentice Hall Mathematics: Course Correlated to: Arizona Academic Standards for Mathematics (Grades 6)

STAT355 - Probability & Statistics

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

Multi-Dimensional Data Visualization. Slides courtesy of Chris North

Data Exploration Data Visualization

2695 P a g e. IV Semester M.Tech (DCN) SJCIT Chickballapur Karnataka India

LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE.

Algebra 1 Course Information

Data representation and analysis in Excel

CS 325 Computer Graphics

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow

Visualization Software

The Which Blair Project : A Quick Visual Method for Evaluating Perceptual Color Maps

EFFECTIVE DATA RECOVERY FOR CONSTRUCTIVE CLOUD PLATFORM

How To Use Statgraphics Centurion Xvii (Version 17) On A Computer Or A Computer (For Free)

Visualization of Multivariate Data. Dr. Yan Liu Department of Biomedical, Industrial and Human Factors Engineering Wright State University

Face Model Fitting on Low Resolution Images

Tracking and Recognition in Sports Videos

SSIM Technique for Comparison of Images

Glencoe. correlated to SOUTH CAROLINA MATH CURRICULUM STANDARDS GRADE 6 3-3, , , 4-9

How To Create A Data Visualization

Transcription:

Visibility optimization for data visualization: A Survey of Issues and Techniques Ch Harika, Dr.Supreethi K.P Student, M.Tech, Assistant Professor College of Engineering, Jawaharlal Nehru Technological University, Hyderabad, Andhra Pradesh, India harikachukka24@gmail.com, supreethi.pujari@gmail.com Abstract Techniques of visualization use colors to represent categorical differences to the user. Color appearance changes with size of the object being viewed. While using colors, the structures which are more prominent as in position suppress small non uniform groups or inhomogeneous groups. Without maintaining sufficient distance among the colors they are difficult to be discriminated. Class visibility is used to measure the color palette utility and color optimization algorithm based on class visibility makes the differences clearly visible to users. 1. Introduction Data visualization is a visual representation or graphical representation of data. Graphical representation of data assists the users to recognize patterns, to detect trends, and to identify relationships hidden in data sets. Information that has been abstracted in some semantic form including attributes or variables for the unit of information like temporal data, persistent data can also be created as a study of visual representation. There are few concepts for the selection of visualization to our data. We should understand the data we are trying to visualize, along with its size and cardinality. One must know what we are trying to visualize and what kind of information we want to communicate. The visual that conveys the information in the best and simplest form to the audience must be used. [3] Data can be represented in different forms of graphs like line graphs, bar graphs, Scatter plots, pie charts etc. Line graphs show the relationship of one variable to another. These are also useful to compare multiple items over the same time period. Bar Graphs are used for comparing the quantities of different categories or groups. Scatter Plot is a good way to visualize relationships in data. Pie Charts are used to compare the parts of whole. This is most effective when there are limited components and when text and percentages are included.[3] Data visualization concept is mostly useful for representing the categorical differences and these categorical differences can be world climate, vegetation class changes etc...selecting categorical colors is challenging problem in data visualization. To choose proper categorical color we use visualization packages as a guidance to choose color. Choosing the categorical data without considering the above aspects can results in misleading of interpretation of data. The problems that generally occur during the selection of categorical colors are: The structures which are having more power and influence over others or more prominent as in position suppress the small inhomogeneous groups. Without maintaining sufficient distances among the colors they are difficult to be discriminated. Color appearance also changes based on spatial variations of data in an image because of visual sensitivity of humans. Color design also forms a crucial part in visual aesthetics. [9] The problems that occur during selection of categorical colors can be addressed by class visibility concept. By using this class visibility we measure the utility of color palette to make the categorical colors clear to the users. Visual saliency will make the will make the more prominent structures and small inhomogeneous groups clearly visible to users considering spatial variations. The visual saliency estimation relies on spatial averages, color averages. Apart from using the visual saliency, the colors do not correctly label the categorical groups like narrow areas with high visual sharpness. Point saliency is also used to measure the intensity of single class in pixels. The class visibility and point saliency both together will make the categorical groups differentiate. Visibility based optimization is used to improve the color of categorical data visualization. Visual based algorithm will increase the intensity of small groups and reduce the weight of large groups. Optimization procedures will maintain distinguishable distances between the colors. [1] A Typical method of visualizing data sets involves mapping data attributes to visual features like shape, size, spatial location, and orientation. Color is the important frequently used feature. The general rule of thumb for computer generated images is not more than five to seven colors. [6], previous works has addressed to increase

the total number of colors available by taking three aspects under consideration during selecting: 1. Color distance, 2.Linear separation, and 3.Color category. Also the results have showed that the selected group of colors will provide good differentiation between the data elements during data visualization [6]. 2. Methods of Data Visualization Different methods are available for visualization of data based on the type of data. Data can be: [2] Univariate Bivariate Multivariate 1.1 Univariate It is a measurement of single quantitative variable. It is Characterised distribution which is represented using following methods. Histograms: Pie charts 1.2 Bivariate This constitutes a paired sample of two quantitative variables. Variables are related and are represented using the following methods. Scatter Plots Line Graphs 1.3 Multivariate This is a multi dimensional representation of data and is represented using following methods

Icon Based Pixel Based Dynamic Parallel Coordinate System.[2][8] 3. Effective color palettes 3.1 Color Selection Principle The colors are selected by using the principal of order and separation. Order will arrange the colors in sequential order and separation wants all the colors used to make the differences must be different. There are many approaches that were proposed to realize order and separation for the colors that are isoluminant [2][4].In some situations many users can select their favourite color but they do not know the contrast of categorical groups. So, this made to improve the color mapping for users. 3.2 Size/Frequency Dependency Visual sensitivity of the human visual system differs in spatial frequencies of data items in an image. 3.3 Feature driven attention model This is a 2 stage process in the feature integration theory. First stage: Preattentive primitives like color, lightness are detected and are separately encoded into feature maps. These stages are performed rapidly in parallel. Second stage: This is a slow serial conjective search. This will integrate or combine the future maps into single topographical saliency map. [1] 3.4 Drawbacks The colors appearances changes with size of the object being viewed. The interaction between the size and the color appearance is a critical design factor. By considering 5 colors of different hues of the same luminance. By using these 5 colors in both line graph and bar graph, the difference can be easily distinguished as shown in figure 1. [10], the colors used in first graph are clearly visible than the colors used in second graph.

In Figure 2 the shaded colors are easily distinguish in the table but difficult to distinguish in the small color picker. [10] Fig 2 In Figure 3 unlike the above 2 diagrams a similar blending process occurs in human visual system which is called as effect spreading While seeing that colors the human eye blends at small areas of the colors that are close together even though the same colors are used in both bars.[10] Fig 3 In Figure 4, the structures which are more prominent suppress small inhomogeneous groups.[1]

4. Research Goals Our optimization tries to make the categorical differences clearly visible to the users by increasing the intensity of small groups and by reducing the weights of large groups.[1] That means optimization tries to increase the contrast of small regions and reduce it for large regions which will allow to produce image with balanced colors, according to the associate size of the categorical groups. [1] 5. Class Visibility and point saliency This is the measure of how the color and spatial distribution of each class affect the intensity of human visual system. Class visibility is estimated by weighting its point saliency with its structural weights. The structural weights are calculated by the contributions of neighbouring points (Derived rather than measure). [1][9][10] Unlike the conventional filtering we only collect the contributions of points whose labels are the same as the centre points. By preventing or excluding the contributions of points of different labels, we can avoid losing categorical differences. Point saliency will exactly weight the contribution of categorical color at a single data point. The point saliency is measured as the color difference from the average color of neighbouring data points. Color difference,, is measured by the CIE76 metric, which is the Euclidean distance in CIE Lab.[1] Our class visibility defined for each data point, is estimated by weighting its point saliency with its structural weights.

The structural weights, effectively captures the difficulty of a user to observe a specific class at narrow local regions. 6. Color Optimization 6.1 Mean class visibility We need a representative measure of each class. The mean class visibility of each class is computed by the average of the data points that belong to its category. 6.2 Cost Function Cost function is defined for each data point is to minimize differences among the class visibility levels in its local surroundings. Where, M=Number of classes V m =Mean class visibility of class M X=Color Vector T=Target Vector 6.3 Optimization The final resulting cost functional with the constraints to optimize is: Where, B(X) =Bounding constraint R(X) = Color discrimination constraints (colors should maintain sufficient perceptual distances). 7. Conclusion The color optimization algorithm based on visual sensitivity of human makes the categorical differences clearly visible in graphical representation of data. Also to make more prominent structures do not suppress small inhomogeneous group. 8. References [1] Sungkil Lee, Mike sips, and Hans-Peter Saidel Perceptually Driven Visibility Optimization For Categorical Data Visualization" IEEE Transactions on visualization and computer graphics,volume 19, No:10,pp 1746-1757, 2013. [2] P. Robertson and J. O Callaghan, The Generation of Color Sequences for Univariate and Bivariate Mapping, IEEE Computer Graphics and Applications, vol. 6, no. 2, pp. 24-32, Feb. 1986. [3] SAS the power to know: Data visualization techniques from basics to big data with SAS visual analytics by Justin Choy, Varsha Chawla, Lisa Whiteman, Proceedings of SAS Global Forum 2012 and SAS Global Forum 2011.

[4] H. Levkowitz and G.T. Herman, Color Scales for Image Data, IEEE Computer Graphics and Applications, vol. 12, no. 1, pp. 72-80, Jan. 1992. [5] M. Kawai, K. Uchikawa, and H. Ujike, Influence of Color Category on Visual Search, Proc. Ann. Meeting Assoc. for Research in Vision and Ophthalmology, p. 2991, 1995. [6]. C. G. Healey, Choosing Effective Colours for Data Visualization, Proc. IEEE Conf. Visualization,pp.263-271,1996. [7] L. Itti, C. Koch, and E. Niebur, A Model of Saliency-Based Visual Attention for Rapid Scene Analysis, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 11, pp. 1254-1259, Nov. 1998 [8] D.A. Keim, Designing Pixel-Oriented Visualization Techniques: Theory and Applications, IEEE Trans. Visualization and Computer Graphics, vol. 6, no. 1, pp. 59-78, Jan.-Mar. 2000. [9] L. Wang, J. Giesen, K.T. McDonnell, P. Zolliker, and K. Mueller, Color Design for Illustrative Visualization, IEEE Trans.Visualization and Computer Graphics, vol. 14, no. 6, pp. 1739-1754,Nov./Dec. 2008. [10] M. Stone, In Color Perception, Size Matters, IEEE Computer Graphics and applications,vol.32,no.2,pp.8-13,mar./apr.20.