Information Visualization and Visual Analytics Pekka Wartiainen University of Jyväskylä pekka.wartiainen@jyu.fi 23.4.2014
Outline Objectives Introduction Visual Analytics Information Visualization Our research Summary
Learning objectives To understand the definition of visual analytics. To be aware with visual analytics approach in problem solving. To understand the basics of data visualization.
Motivation Raw data has no value in itself, only the extracted information has value Time and money are wasted and opportunities are lost Success depends on availability of the right information Visual analytics aims at making data and information processing transparent Visual analytics combines the strengths of humans and computers
An historical perspective on visual analytics Early visual analytics: exploratory data analysis Visual data exploration and visual data mining First book of visual analytics: Illuminating the Path, 2005 Some earlier systems exhibited the characteristics of visual analytics CoCo system for improving silicon chips, 1990
Past few years VisMaster is an European Coordination Action Project Web-page: Book: URL: Visual-Analytics.EU URL: Mastering the information age - solving problems with visual analytics YouTube video: URL: Inria - Vismaster, visual analytics
Visual analytics Definition Visual analytics combines automated analysis techniques with interactive visualisations for an effective understanding, reasoning and decision making on the basis of very large and complex datasets. Timeline
Application of visual analytics First application area was security Many major application areas physics, astronomy, medicine, climate,... Example: business intelligence Financial market generates large amounts of data on a daily basis > extremely high data volumes over the years More than 300 million VISA credit card transactions per day Multiple perspectives and assumptions for analysis history, current situation, monitoring, forecasting, recurring situations
Visual analytics Coordinated Graph Visualization Visual support for the simulation of climate models provided by CGV, a highly interactive graph visualization system.
Visual analytics NFlowVis Analysis of a distributed network attack.
The visual analytics process Process model of visual analytics.
Building blocks of visual analytics research Visual analytics integrates science and technology from many disciplines.
Evaluation Evaluation include techniques, methods, modes and theories as well as software tools Challenge: often processing data from the real world Evaluation involves users, tasks and data Especially in the industry, the domain expert has the best knowledge > Empirical evaluation Evaluation criteria, e.g.: effectiveness efficiency user satisfaction Importance of documentation is emphasized
Infrastructure Visual analytics is both user-driven and data-driven Current challenges: lack of interaction and dynamic data Limitations of traditional data bases Need for: Old fashioned architectural reference model Big data solutions Fast imprecise answers with progressive refinement Incremental re-computation, either in the data (e.g., some data has been changed) or in the analysis parameters Steering the computation towards data regions that are of higher interest to the user.
Data management Why? The big opportunity of the Information Age Many obstacles need to be overcome Heterogeneity of data sources Different data types Data streams Working under pressure Time consuming activities Data management ensures data consistency and standards
Data management VA aspects Data and semantic integration Utilizing known processing methods Data Warehousing, OLAP and Data Mining Data reduction and abstraction Data quality is crucial (cf. GIGO model) Visual techniques for exploring data
Space and time In large systems, space and time are essential > complexity increases Space and time are more than just numbers Specific properties: Dependencies between observations Uncertainty Scale Time Spatial approaches: Cartography, GIS, Geovisualization Representation of time: visualization of time-related data and time itself Interactive visualizations Big data cases dimension reduction
Space and time OECD explorer Allows to explore regional statistics data from OECD URL:Organisation for Economic Cooperation and Development
Data mining Humans are required in the data analysis process New tools and methodologies are necessary to help experts extract relevant information Limitations in KDD process and visualizations Combination of multidisciplinary approaches Pattern identification methods Spatio-temporal data mining Many software have been developed
Perception and cognitive aspects visualization The human is at the heart of visual analytics human interaction, analysis, intuition, problem solving and visual perception. Distinction between high and low-level vision Humans do not have to remember everything but extract visual clues from the environment Pre-attentive processing makes items pop out the display automatically.
Data visualization Fast and understandable way to present data to a user Data mining methods as pre-processing tools Many visualization methods existing JFreeChart Google Charts Remember how not to use visualization techniques Dynamic behavior of the data sets special requirements Data visualization is part of information visualization
GUI design Visual analytics has high demand for GUI Scalable and interactive interface General guidelines for different purposes Windows, OS X, Android,... Online solutions Define target group before designing the GUI Multidisciplinary research groups Personalized user roles
Common interaction select : mark data items of interest, possible followed by another operation, explore : show some other data e.g., panning, zoom, resampling, reconfigure : rearrange the data spatially e.g., sort, change attribute assigned to axis, rotate (3D), slide, encode : change visual appearance e.g., change type of representation (view), adjust colour/size/shape, abstract/elaborate : show more or less detail e.g., details on demand, tooltips, geometric zoom, filter : select or show data matching certain conditions, connect : highlight related data items e.g., brushing (selection shown in multiple views).
Using colors Powerful element in visualization Wrong usage of colors is disturbing Color Usage Research Lab NASA Ames research center Ready made color palettes are solid alternatives
Visual analytics in energy production Application area: BFB boiler burning biomass Co-operation with VTT, department of chemistry, and private companies Funded by Regional Council of Central Finland Time-series data measured from the different parts of the process Context-sensitive framework approach Matlab routines with Java GUI
People included into process The human context of visual analytics.
Summary Visual analytics for multidisciplinary research problems Visualization, data analysis, user interaction Highly interactive interfaces The whole process should be taken into account Many challenges still existing, especially with big and dynamic data Humans are part of the process
References D. Keim, J. Kohlhammer, G. Ellis ja F. Mansmann, Mastering the Information Age: Solving Problems with Visual Analytics, Eurographics Association, Germany, 2010. P. Järvinen, K. Puolamäki, P. Siltanen ja M. Ylikerälä, Visual Analytics, Technical report, VTT, Finland, 2009. P. Wartiainen, T. Kärkkäinen, A. Heimbürger, ja S. Äyrämö. Context-sensitive approach to dynamic visual analytics of energy production processes. In 22th European-Japanese Conference on Information Modelling and Knowledge Bases. MATFYZPRESS - Univerzity Karlovy, 2012.