Pixel-oriented Database Visualizations
|
|
|
- Joella Randall
- 9 years ago
- Views:
Transcription
1 SIGMOD RECORD special issue on Information Visualization, Dec Pixel-oriented Database Visualizations Daniel A. Keim Institute for Computer Science, University of Munich Oettingenstr. 67, D Munich, Germany Phone (+49) , Fax (+49) Abstract In this paper, we provide an overview of several pixel-oriented visualization techniques which have been developed over the last years to support an effective querying and exploration of large databases. Pixel-oriented techniques use each pixel of the display to visualize one data value and therefore allow the visualization of the largest amount of data possible. The techniques may be divided into query-independent techniques which directly visualize the data (or a certain portion of it) and query-dependent techniques which visualize the relevance of the data with respect to a specific query. An example for the class of query-independent techniques is the recursive pattern technique which is based on a generic recursive scheme generalizing a wide range of pixel-oriented arrangements for visualizing large databases. Examples for the class of query-dependent techniques are the generalized spiral and circle-segments techniques, which visualize the distances with respect to a database query and arrange the most relevant data items in the center of the display. Keywords: 1. Introduction Visualizing Large Databases, Visual Data Mining, Visualizing Multidimensional and Multivariate Data Visualization of data which have some inherent two- or threedimensional semantics has been done even before computers were used to create visualizations (see for example the wellknown books of Tufte [Tuf 83, Tuf 90]). Since computers are used to create visualizations, many novel visualization techniques have been developed and existing techniques have been extended to work for larger data sets and make the displays interactive. For most of the data stored in databases, however, there is no standard mapping into the Cartesian coordinate system, since the data has no inherent two- or three-dimensional semantics. In general, relational databases can be seen as multidimensional data sets with the attributes of the database corresponding to the dimensions of the multidimensional data set. There are a number of well-known techniques for visualizing multidimensional data sets. Those techniques may be classified into geometric projection techniques, iconic display techniques, hierarchical techniques, graph-based techniques, pixel-oriented techniques, dynamic techniques, and combinations hereof. An overview of many of those techniques is presented in [Kei 96]. The research also resulted in data exploration and analysis systems which implement some of the mentioned techniques. Examples include statistical data analysis packages such as S Plus / Trellis [BCW 88], XGobi [SCB 92], and Data Desk [Vel 92], visualization oriented systems such as ExVis [GPW 89], XmdvTool [Ward 94], and IBM s Parallel Visual Explorer, as well as database oriented systems such as TreeViz [Shn 92], the Information Visualization and Exploration Environment (IVEE) [AW 95], the VisDB system [KK 95] and SGI s MineSet TM [SGI 96]. In this article, we provide an overview of the pixel-oriented visualization techniques. The basic idea of pixel-oriented techniques is to map each data value to a colored pixel and present the data values belonging to one attribute in separate windows (cf. Figure 1). Since in general our techniques use only one pixel per data value, the techniques allow us to visualize the largest amount of data, which is possible on current displays (up to about 1,000,000 data values). If each data value is represented by one pixel, the main question is how to arrange the pixels on the screen. Our pixel-oriented techniques use different arrangements for different purposes. If a user wants to visualize a large data set, the user may use a query-independent visualization technique which sorts the data according to some attribute(s) and uses a screen-filling pattern to arrange the data values on the display. The queryindependent visualization techniques are especially useful for data with a natural ordering according to one attribute (e.g., time series data). However, if there is no natural ordering of the data and the main goal is an interactive exploration of the database, the user will be more interested in feedback to some query. In this case, the user may turn to the query-dependent visualization techniques which visualize the relevance of the data items with respect to a query. Instead of directly mapping the data values to color, the query-dependent visualization techniques calculate the distances between data and query values, combine the distances for each data item into an overall distance, and visualize the distances for the attributes and the overall distance sorted according to the overall distance.
2 pixels belonging to one data item attribute 1 attribute 2 attribute 3 attribute 4 attribute 5 attribute 6 Figure 1: Arrangement of Windows for Data with Six Attributes The arrangement of the data items centers the most relevant data items in the middle of the window, and less relevant data items are arranged to the outside of the window. In the rest of this paper, we briefly introduce three of our pixel-oriented visual database exploration techniques. Section 2 describes a query-independent techniques and section 3 two query-dependent techniques. In section 4, we briefly discuss the implementation of our techniques in the VisDB system and in section 5, we summarize our approach and point out some of the open problems for future work. 2. Query-Independent Visualization Techniques As already mentioned, the basic idea of our visualization techniques is to present as many data values as possible at the same time with the number of data values being only limited by the number of pixels of the display. In dealing with arbitrary multidimensional data without any 2D- or 3D-semantics, one major problem is to find meaningful arrangements of the pixels on the screen. Even if the data has a natural ordering according to one attribute (e.g., time series data), there are many possibilities for arranging the data. One straightforward possibility is to arrange the data items from left to right in a line-by-line fashion. Another possibility is to arrange the data items top-down in a column-by-column fashion. If these arrangements are done pixelwise, in general, the resulting visualizations do not provide useful results [KKA 95]. More useful are techniques which provide a better clustering of closely related data items such as space-filling curves (e.g., the well-known curves by Peano & Hilbert [Pea 90, Hil 91] or Morton [Mor 66] ). For data mining even more important are techniques which provide nice clustering properties as well as an arrangement which is semantically meaningful. An example for a technique which provides these properties is the recursive pattern technique. The recursive pattern technique is based on a generic recursive scheme which allows the user to influence the arrangement of data items. The basic arrangement is based on a simple back and forth movement: First, a certain number of elements is arranged from left to right, then below backwards from right to left, then again forward from left to right, and so on. The same basic arrangement is done on all recursion levels with the only difference that the basic elements which are arranged on level i are the patterns resulting from level(i-1)- arrangements. Let w i be the number of elements arranged in the left-right direction on recursion level i and h i be the number of rows on recursion level i. On recursion level i (i 1), the algorithm draws w i level(i-1)-patterns h i times alternately to the right and to the left. The pattern on recursion level i consists of w i h i level(i-1)-patterns, and the maximum number of pixels that can be presented on recursion level k is k given by w i h. An example for a recursive pattern i = 1 i visualization of a database containing the 100 stocks of the FAZ index (Frankfurt Stock Index) from 20 years of stock price data (altogether 532,900 data values) is presented in Figure 4. For the details on the recursive pattern technique the reader is referred to [KKA 95]. Note that for the query-independent techniques, it is not mandatory that the data has some natural ordering. In searching for dependencies among attributes, one might sort the data according to one attribute and use our visualization technique for examining the dependencies of the other attributes. Consider, for example, a large database of personal data. If one wants to find dependencies between the parameter sales (of a person) and other attributes such as salary, age, and travel expenses, one might sort the data according to the sales parameter and visually examine the dependencies of the other attributes. 3. Query-Dependent Visualization Techniques The query-independent visualization techniques visualize the attribute values by directly mapping them to color. The idea of the query-dependent visualization techniques is to visualize the data in the context of a specific user query to give the users feedback on their queries and direct their search. Instead of directly mapping attribute values to colors, the distances of attribute values to the query are mapped to colors. Since the focus of the query-dependent techniques is on the relevance of the data items with respect to the query, different arrangements of the pixels are appropriate. In developing querydependent techniques, we experimented with several arrangements such as the left-right or top-down arrangements and compared the resulting visualizations. We found, that for visualizing the result for a database query it is most natural to present the data items with highest relevance to the query (e.g., data items fulfilling the query) in the center of the display. Our first approach described in [Kei 94, KK 94] arranges the data items with lower relevances in a rectangular spiral shape around the center. The techniques presented in this paper the generalized-spiral and the circle-segments techniques are generalizations of those techniques. In case of the generalized-spiral technique, the original spiral arrangement [KK 94] is extended to a generic snake- or hilbert-like form, of which the user may choose the height (cf. Figure 2). As in case of the query-independent visualization techniques, a separate visualization for each of the selection predicates (attributes) is generated (cf. Figure 1). An additional subwindow shows the overall distances. In all subwindows, the pixels for each data item are placed at the same position as the overall distance for the data item in the overall
3 Attr. 8 Attr. 1 Attr. 7 Attr. 2 height Figure 2: Generalized-Spiral Technique Attr. 6 Attr. 3 distance subwindow. By relating corresponding regions in the different windows, the user is able to perceive data characteristics such as multidimensional clusters or correlations. Additionally, the separate subwindows for each of the selection predicates provide important feedback to the user, e.g. on the restrictiveness of each of the selection predicates and on single exceptional data items. Note that the original spiral technique is now the special case of the generalized-spiral technique with a height of one pixel. The advantage of the generalized-spiral technique is that the degree of clustering is higher. For details on the different variants of the generalized-spiral technique and a first examination of their effectiveness the reader is referred to [Kei 95]. A second query-dependent technique is the circle-segments technique. The basic idea of the circle-segments visualization technique is to display the distances for the attributes as segments of a circle (cf. Figure 3). If the data consists of k attributes, the circle is partitioned into k segments, each representing one the distances for one attribute. Inside the segments, the distance values belonging to one attribute are arranged from the center of the circle to the outside in a back and forth manner orthogonal to the line that halves the segment. An example for a circle-segments visualization of 50 stock price developments from the FAZ index database used in the example above is presented in Figure 5. For the details on the circlesegments technique the reader is referred to [AKK 96]. 4. The VisDB System Several pixel-oriented visualization techniques including the techniques described in the previous subsections are implemented as part of the VisDB system [KK 95]. In addition to our pixel-oriented techniques, the VisDB system also supports the parallel coordinates technique developed by Inselberg & Dimsdale [Ins 81, ID 90] and the stick figure technique developed by Picket & Grinstein from the University of Massachusetts, Lowell. The VisDB system is implemented in C++/MOTIF and runs under X-Windows on HP 7xx machines. The system consists of an interactive interface which is divided into the visualization portion and the query specification portion (for the query-dependent techniques). The query specification portion provides a slider-based direct-interaction interface which allows an intuitive specification of queries [Kei 94, KKS 94]. Different types of sliders are available for different data types. Other options which support the data exploration process are the possibility to focus on certain colors and the possibility to get the data values corresponding to a pixel of the display. The current version of the VisDB system is main memory based and allows interactive query-dependent visualizations of very large databases. For databases with less than 100,000 data values, the recalculations can be considered to be truly interactive; for larger databases, the time is still in the range of a few seconds (for 1,000,000 data values, for example, the response time is about 20 seconds). When interfacing with current commercial database systems, however, performance problems arise since no access to partial results of a query is available, no support for incrementally changing queries is provided, and no multidimensional data structures are used for fast secondary storage access. We are currently working on improving the performance in directly interfacing to a database system. In the future, we plan to implement the VisDB system on a parallel machine which will be able to support interactive query modifications even for larger amounts of data. The VisDB system has been successfully used in several application areas including a financial application where the system has been used to analyze multidimensional timedependent data, a CAD database project where the system has been used to improve the similarity search, as well as a molecular biology project where the system has been used to find possible docking regions by identifying sets of surface points with distinct characteristics [Kei 94]. Currently, we explore several other data sets including a large database of geographical data, a large environmental database, and a NASA earth observation database. A first evaluation and comparison of multiple visualization techniques using different data sets and queries is reported in [KK 96]. 5. Conclusions Attr. 5 Attr. 4 Figure 3: The Circle-Segments Technique for 8-dimensional Data Pixel-oriented visualization techniques which use each pixel of the display to visualize one data value provide a valuable help in exploring very large databases. The techniques described in this paper allow users to get a visual overview of large data sets and supports them in finding correlations, functional dependencies, and clusters. Our query-independent techniques directly visualize the attribute values by arranging
4 the values according to some screen-filing curve. In addition, the recursive pattern technique allows the user to control the arrangement of the data values, providing the possibility to generate more meaningful visualizations. Our query-dependent techniques visualize the data attributes in the context of a specific query and provide visual feedback in querying the database. The techniques are especially helpful for interactively exploring large databases. We believe that the different techniques are useful for different data exploration tasks and for different stages of the data exploration process. At this point, we want to stress that our visualization techniques are not designed to replace or substitute methods which have been developed in statistics or knowledge discovery. Also, we do not claim that our techniques are in general better than those techniques. Data visualization, statistics, and knowledge discovery have their advantages and we view them as being complementary to each other. Statistical analysis may, for example, be used to validate the hypotheses generated by the visualizations and vice versa. Future integrated tools for exploratory data analysis should therefore include not only statistical methods and knowledge discovery techniques but also data visualization techniques such as our pixel-oriented visualizations. Acknowledgments Developing a large number of diverse visualization techniques and implementing a complex system such as the VisDB system can not be done by a single person. My thanks goes to all my colleagues and students who contributed to the VisDB system, especially Thomas Seidl who implemented the first prototype of the system, Juraj Porada who implemented most of the current version, Mihael Ankerst who developed and implemented the recursive pattern and circlesegments techniques, and Professor Dr. Kriegel who provided the inspiring environment for doing the research reported in this paper. References [AKK 96] [AW 95] [BCW 88] [GPW 89] [Hil 91] [ID 90] Ankerst M., Keim D. A., Kriegel H.-P.: Circle-Segments: A Technique for Visually Exploring Large Multidimensional Data Sets, Visualization 96, Hot Topic Session, San Francisco CA, Ahlberg C., Wistrand E.: IVEE: An Information Visualization and Exploration Environment, Proc. Int. Symposium on Information Visualization, Atlanta, GA, 1995, pp Becker R., Chambers J. M., Wilks A. R.: The New S Language, Wadsworth & Brooks/Cole Advanced Books and Software, Pacific Grove, CA., Grinstein G, Pickett R., Williams M. G.: EXVIS: An Exploratory Visualization Environment, Proc. Graphics Interface 89, London, Ontario, Canada, Hilbert D.: Über stetige Abbildung einer Line auf ein Flächenstück, Math. Annalen, Vol. 38, 1891, pp Inselberg A., Dimsdale B.: Parallel Coordinates: A Tool for Visualizing Multi-Dimensional Geometry, Visualization 90, San Francisco, CA., 1990, pp [Ins 81] [Kei 94] [Kei 95] [Kei 96] [KK 94] [KK 95] [KK 96] [KKA 95] [KKS 94] [Mor 66] [Pea 90] [PG 88] [SCB 92] [SGI 96] [Shn 92] [Tuf 83] [Tuf 90] [Vel 92] [Ward 94] Inselberg A.: N-Dimensional Graphics Part I: Lines & Hyperplanes, IBM LA Science Center Report, # G , Keim D. A.: Visual Support for Query Specification and Data Mining, Ph.D. Dissertation, University of Munich, July 1994, Shaker-Publishing Company, Aachen, Germany, 1995, ISBN Keim D. A.: Enhancing the Visual Clustering of Query-dependent Databases Visualization Techniques using Screen-Filling Curves, Proc. Int. Workshop on Database Issues in Visualization, Atlanta, GA, Keim D. A.: Databases and Visualization, Tutorial, ACM SIGMOD Int. Conf. on Management of Data, Montreal, Canada, 1996, p A postscript version of the tutorial notes is available under Keim D. A., Kriegel H.-P.: VisDB: Database Exploration using Multidimensional Visualization, Computer Graphics & Applications, 1994, pp Keim D. A., Kriegel H.-P.: VisDB: A System for Visualizing Large Databases, System Demonstration, Proc. ACM SIGMOD Int. Conf. on Management of Data, San Jose, CA, 1995, p Keim D. A., Kriegel H.-P.: Visualization Techniques for Mining Large Databases: A Comparison, Trans. on Knowledge and Data Engineering, Dec. 1996, to appear. Keim D. A., Kriegel H.-P., Ankerst M.: Recursive Pattern: A Technique for Visualizing Very Large Amounts of Data, Proc. Visualization 95, Atlanta, GA, 1995, pp Keim D. A., Kriegel H.-P., Seidl t.: Supporting Data Mining of Large Databases by Visual Feedback Queries, Proc. 10th Int. Conf. on Data Engineering, Houston, TX, 1994, pp Morton.: A Computer Oriented Geodetic Data Base and a New Technique in File Sequencing, IBM Ltd. Ottawa, Canada, Peano G.: Sur une courbe qui remplit toute une aire plaine, Math. Annalen, Vol. 36, 1890, pp Pickett R. M., Grinstein G. G.: Iconographic Displays for Visualizing Multidimensional Data, Proc. IEEE Conf. on Systems, Man and Cybernetics, IEEE Press, Piscataway, NJ, 1988, pp Swayne D.F., Cook D., Buja A.: User s Manual for XGobi, a Dynamic Graphics Program for Data Analysis, Bellcore Technical Memorandum, Database Mining and Visualization Group: Mine- Set TM : A System for High-End Data Mining and Visualization, Silicon Graphics Inc., VLDB 1996, p Shneiderman B.: Tree Visualization with Treemaps: A 2-D Space-filling Approach, ACM Trans. on Graphics, Vol. 11, No. 1, 1992, pp Tufte E. R.: The Visual Display of Quantitative Information, Graphics Press, Cheshire, CT, Tufte E. R.: Envisioning Information, Graphics Press, Cheshire, CT, Velleman P. F: Data Desk 4.2: Data Description, Ithaca, NY, Ward M. O.: XmdvTool: Integrating Multiple Methods for Visualizing Multivariate Data, Visualization 94, Washington, DC, 1994, pp
5 Figure 4: Recursive Pattern Visualization of 100 Stocks from Jan. 74 to Apr. 95 (about 530,000 Data Values) (cf. [KKA 95]) Figure 5: Circle-Segments Visualization of 50 Stocks from Jan. 74 to Apr. 95 (about 265,000 Data Values) (cf. [AKK 96])
Enhancing the Visual Clustering of Query-dependent Database Visualization Techniques using Screen-Filling Curves
Enhancing the Visual Clustering of Query-dependent Database Visualization Techniques using Screen-Filling Curves Extended Abstract Daniel A. Keim Institute for Computer Science, University of Munich Leopoldstr.
Visual Data Mining with Pixel-oriented Visualization Techniques
Visual Data Mining with Pixel-oriented Visualization Techniques Mihael Ankerst The Boeing Company P.O. Box 3707 MC 7L-70, Seattle, WA 98124 [email protected] Abstract Pixel-oriented visualization
Pushing the limit in Visual Data Exploration: Techniques and Applications
Pushing the limit in Visual Data Exploration: Techniques and Applications Daniel A. Keim 1, Christian Panse 1, Jörn Schneidewind 1, Mike Sips 1, Ming C. Hao 2, and Umeshwar Dayal 2 1 University of Konstanz,
Visualization Techniques in Data Mining
Tecniche di Apprendimento Automatico per Applicazioni di Data Mining Visualization Techniques in Data Mining Prof. Pier Luca Lanzi Laurea in Ingegneria Informatica Politecnico di Milano Polo di Milano
Visualization of Multivariate Data. Dr. Yan Liu Department of Biomedical, Industrial and Human Factors Engineering Wright State University
Visualization of Multivariate Data Dr. Yan Liu Department of Biomedical, Industrial and Human Factors Engineering Wright State University Introduction Multivariate (Multidimensional) Visualization Visualization
Extend Table Lens for High-Dimensional Data Visualization and Classification Mining
Extend Table Lens for High-Dimensional Data Visualization and Classification Mining CPSC 533c, Information Visualization Course Project, Term 2 2003 Fengdong Du [email protected] University of British Columbia
Test Data Sets for Evaluating Data Visualization Techniques
Test Data Sets for Evaluating Data Visualization Techniques Daniel A. Keim Institute for Computer Science, University of Munich, Leopoldstr. 11B, 80802 Munich, Germany, [email protected]
A Comparative Study of Visualization Techniques for Data Mining
A Comparative Study of Visualization Techniques for Data Mining A Thesis Submitted To The School of Computer Science and Software Engineering Monash University By Robert Redpath In fulfilment of the requirements
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
High Dimensional Data Visualization
High Dimensional Data Visualization Sándor Kromesch, Sándor Juhász Department of Automation and Applied Informatics, Budapest University of Technology and Economics, Budapest, Hungary Tel.: +36-1-463-3969;
A Survey on Multivariate Data Visualization
A Survey on Multivariate Data Visualization Winnie Wing-Yi Chan Department of Computer Science and Engineering Hong Kong University of Science and Technology Clear Water Bay, Kowloon, Hong Kong June 2006
Clustering & Visualization
Chapter 5 Clustering & Visualization Clustering in high-dimensional databases is an important problem and there are a number of different clustering paradigms which are applicable to high-dimensional data.
Chapter 3 - Multidimensional Information Visualization II
Chapter 3 - Multidimensional Information Visualization II Concepts for visualizing univariate to hypervariate data Vorlesung Informationsvisualisierung Prof. Dr. Florian Alt, WS 2013/14 Konzept und Folien
Visual Techniques for Exploring Databases. Overview
Visual Techniques for Exploring Databases Daniel A. Keim Institute for Computer Science, University of Halle-Wittenberg Overview 1. Introduction...2 2. Data Preprocessing Techniques...9 3. Visual Data
Visual Mining of E-Customer Behavior Using Pixel Bar Charts
Visual Mining of E-Customer Behavior Using Pixel Bar Charts Ming C. Hao, Julian Ladisch*, Umeshwar Dayal, Meichun Hsu, Adrian Krug Hewlett Packard Research Laboratories, Palo Alto, CA. (ming_hao, dayal)@hpl.hp.com;
Information Visualization and Visual Data Mining
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 7, NO. 1, JANUARY-MARCH 2002 100 Information Visualization and Visual Data Mining Daniel A. Keim Abstract Never before in history data has
Analyzing The Role Of Dimension Arrangement For Data Visualization in Radviz
Analyzing The Role Of Dimension Arrangement For Data Visualization in Radviz Luigi Di Caro 1, Vanessa Frias-Martinez 2, and Enrique Frias-Martinez 2 1 Department of Computer Science, Universita di Torino,
Scatterplot Layout for High-dimensional Data Visualization
Noname manuscript No. (will be inserted by the editor) Scatterplot Layout for High-dimensional Data Visualization Yunzhu Zheng Haruka Suematsu Takayuki Itoh Ryohei Fujimaki Satoshi Morinaga Yoshinobu Kawahara
The course: An Introduction to Information Visualization Techniques for Exploring Large Database
The course: An Introduction to Information Visualization Techniques for Exploring Large Database Jing Yang Fall 2006 www.cs.uncc.edu/~jyang13 1 Multi-dimensional Data Visualization 2 1 Parallel Coordinates
Robust Outlier Detection Technique in Data Mining: A Univariate Approach
Robust Outlier Detection Technique in Data Mining: A Univariate Approach Singh Vijendra and Pathak Shivani Faculty of Engineering and Technology Mody Institute of Technology and Science Lakshmangarh, Sikar,
Reviewing Data Visualization: an Analytical Taxonomical Study
Reviewing Data Visualization: an Analytical Taxonomical Study José F. Rodrigues Jr., Agma J. M. Traina, Maria Cristina F. de Oliveira, Caetano Traina Jr. Computer Science Department Computer and Mathematical
NeuroVis: exploring interaction techniques by combining dimensional stacking and pixelization to visualize multidimensional multivariate data
NeuroVis: exploring interaction techniques by combining dimensional stacking and pixelization to visualize multidimensional multivariate data Category: Research ABSTRACT Applying a dimensional stacking
Visualization methods for patent data
Visualization methods for patent data Treparel 2013 Dr. Anton Heijs (CTO & Founder) Delft, The Netherlands Introduction Treparel can provide advanced visualizations for patent data. This document describes
Pixel bar charts: a visualization technique for very large multi-attributes data sets{
(2001) 0000, 00 ± 00 ã 2001 Palgrave. All rights reserved 1473 ± 8716 $15.00 www.palgrave-journals.com/ivs Pixel bar charts: a visualization technique for very large multi-attributes data sets{ Daniel
The Eyes Have It: A Task by Data Type Taxonomy for Information Visualizations. Ben Shneiderman, 1996
The Eyes Have It: A Task by Data Type Taxonomy for Information Visualizations Ben Shneiderman, 1996 Background the growth of computing + graphic user interface 1987 scientific visualization 1989 information
Data Mining and Database Systems: Where is the Intersection?
Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: [email protected] 1 Introduction The promise of decision support systems is to exploit enterprise
Big Data: Rethinking Text Visualization
Big Data: Rethinking Text Visualization Dr. Anton Heijs [email protected] Treparel April 8, 2013 Abstract In this white paper we discuss text visualization approaches and how these are important
Independence Diagrams: A Technique for Visual Data Mining
From: KDD-98 Proceedings. Copyright 1998, AAAI (www.aaai.org). All rights reserved. Independence Diagrams: A Technique for Visual Data Mining Stefan Berchtold H. V. Jagadish AT&T Laboratories AT&T Laboratories
Cours de Visualisation d'information InfoVis Lecture. Multivariate Data Sets
Cours de Visualisation d'information InfoVis Lecture Multivariate Data Sets Frédéric Vernier Maître de conférence / Lecturer Univ. Paris Sud Inspired from CS 7450 - John Stasko CS 5764 - Chris North Data
Integration of Cluster Analysis and Visualization Techniques for Visual Data Analysis
Integration of Cluster Analysis and Visualization Techniques for Visual Data Analysis M. Kreuseler, T. Nocke, H. Schumann, Institute of Computer Graphics University of Rostock, D-18059 Rostock, Germany
Information Visualization Multivariate Data Visualization Krešimir Matković
Information Visualization Multivariate Data Visualization Krešimir Matković Vienna University of Technology, VRVis Research Center, Vienna Multivariable >3D Data Tables have so many variables that orthogonal
PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY
QÜESTIIÓ, vol. 25, 3, p. 509-520, 2001 PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY GEORGES HÉBRAIL We present in this paper the main applications of data mining techniques at Electricité de France,
SuperViz: An Interactive Visualization of Super-Peer P2P Network
SuperViz: An Interactive Visualization of Super-Peer P2P Network Anthony (Peiqun) Yu [email protected] Abstract: The Efficient Clustered Super-Peer P2P network is a novel P2P architecture, which overcomes
Specific Usage of Visual Data Analysis Techniques
Specific Usage of Visual Data Analysis Techniques Snezana Savoska 1 and Suzana Loskovska 2 1 Faculty of Administration and Management of Information systems, Partizanska bb, 7000, Bitola, Republic of Macedonia
Static Data Mining Algorithm with Progressive Approach for Mining Knowledge
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 85-93 Research India Publications http://www.ripublication.com Static Data Mining Algorithm with Progressive
Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
TEXT-FILLED STACKED AREA GRAPHS Martin Kraus
Martin Kraus Text can add a significant amount of detail and value to an information visualization. In particular, it can integrate more of the data that a visualization is based on, and it can also integrate
Interactive Data Mining and Visualization
Interactive Data Mining and Visualization Zhitao Qiu Abstract: Interactive analysis introduces dynamic changes in Visualization. On another hand, advanced visualization can provide different perspectives
Identifying erroneous data using outlier detection techniques
Identifying erroneous data using outlier detection techniques Wei Zhuang 1, Yunqing Zhang 2 and J. Fred Grassle 2 1 Department of Computer Science, Rutgers, the State University of New Jersey, Piscataway,
An example. Visualization? An example. Scientific Visualization. This talk. Information Visualization & Visual Analytics. 30 items, 30 x 3 values
Information Visualization & Visual Analytics Jack van Wijk Technische Universiteit Eindhoven An example y 30 items, 30 x 3 values I-science for Astronomy, October 13-17, 2008 Lorentz center, Leiden x An
Visibility optimization for data visualization: A Survey of Issues and Techniques
Visibility optimization for data visualization: A Survey of Issues and Techniques Ch Harika, Dr.Supreethi K.P Student, M.Tech, Assistant Professor College of Engineering, Jawaharlal Nehru Technological
Utilizing spatial information systems for non-spatial-data analysis
Jointly published by Akadémiai Kiadó, Budapest Scientometrics, and Kluwer Academic Publishers, Dordrecht Vol. 51, No. 3 (2001) 563 571 Utilizing spatial information systems for non-spatial-data analysis
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data K. Mumtaz 1 Assistant Professor, Department of MCA Vivekanandha Institute of Information and Management Studies, Tiruchengode,
Space-filling Techniques in Visualizing Output from Computer Based Economic Models
Space-filling Techniques in Visualizing Output from Computer Based Economic Models Richard Webber a, Ric D. Herbert b and Wei Jiang bc a National ICT Australia Limited, Locked Bag 9013, Alexandria, NSW
Dynamic Data in terms of Data Mining Streams
International Journal of Computer Science and Software Engineering Volume 2, Number 1 (2015), pp. 1-6 International Research Publication House http://www.irphouse.com Dynamic Data in terms of Data Mining
Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics
Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Zhao Wenbin 1, Zhao Zhengxu 2 1 School of Instrument Science and Engineering, Southeast University, Nanjing, Jiangsu
Map-like Wikipedia Visualization. Pang Cheong Iao. Master of Science in Software Engineering
Map-like Wikipedia Visualization by Pang Cheong Iao Master of Science in Software Engineering 2011 Faculty of Science and Technology University of Macau Map-like Wikipedia Visualization by Pang Cheong
Selection of the Suitable Parameter Value for ISOMAP
1034 JOURNAL OF SOFTWARE, VOL. 6, NO. 6, JUNE 2011 Selection of the Suitable Parameter Value for ISOMAP Li Jing and Chao Shao School of Computer and Information Engineering, Henan University of Economics
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar What is data exploration? A preliminary exploration of the data to better understand its characteristics.
Information Visualization of Attributed Relational Data
Information Visualization of Attributed Relational Data Mao Lin Huang Department of Computer Systems Faculty of Information Technology University of Technology, Sydney PO Box 123 Broadway, NSW 2007 Australia
Intinno: A Web Integrated Digital Library and Learning Content Management System
Intinno: A Web Integrated Digital Library and Learning Content Management System Synopsis of the Thesis to be submitted in Partial Fulfillment of the Requirements for the Award of the Degree of Master
High-Dimensional Visualizations
High-Dimensional Visualizations Georges Grinstein 1, 2, Marjan Trutschl 1, Urška Cvek 1 1 Institute for Visualization and Perception Research University of Massachusetts Lowell and 2 AnVil Informatics,
Oracle8i Spatial: Experiences with Extensible Databases
Oracle8i Spatial: Experiences with Extensible Databases Siva Ravada and Jayant Sharma Spatial Products Division Oracle Corporation One Oracle Drive Nashua NH-03062 {sravada,jsharma}@us.oracle.com 1 Introduction
Interactive Exploration of Decision Tree Results
Interactive Exploration of Decision Tree Results 1 IRISA Campus de Beaulieu F35042 Rennes Cedex, France (email: pnguyenk,[email protected]) 2 INRIA Futurs L.R.I., University Paris-Sud F91405 ORSAY Cedex,
Spatio-Temporal Mapping -A Technique for Overview Visualization of Time-Series Datasets-
Progress in NUCLEAR SCIENCE and TECHNOLOGY, Vol. 2, pp.603-608 (2011) ARTICLE Spatio-Temporal Mapping -A Technique for Overview Visualization of Time-Series Datasets- Hiroko Nakamura MIYAMURA 1,*, Sachiko
TOP-DOWN DATA ANALYSIS WITH TREEMAPS
TOP-DOWN DATA ANALYSIS WITH TREEMAPS Martijn Tennekes, Edwin de Jonge Statistics Netherlands (CBS), P.0.Box 4481, 6401 CZ Heerlen, The Netherlands [email protected], [email protected] Keywords: Abstract:
Title Page. Cooperative Classification: A Visualization-Based Approach of Combining the Strengths of the User and the Computer
Title Page Cooperative Classification: A Visualization-Based Approach of Combining the Strengths of the User and the Computer Contact Information Martin Ester, Institute for Computer Science, University
Test Case Design Using Classification Trees and the Classification-Tree Editor CTE
Quality Week 1995 Test Case Design Using Classification Trees and the Classification-Tree Editor CTE Matthias Grochtmann Joachim Wegener Klaus Grimm Daimler-Benz AG Research and Technology Alt-Moabit 96a
Tensor Factorization for Multi-Relational Learning
Tensor Factorization for Multi-Relational Learning Maximilian Nickel 1 and Volker Tresp 2 1 Ludwig Maximilian University, Oettingenstr. 67, Munich, Germany [email protected] 2 Siemens AG, Corporate
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
A Comparative Study of clustering algorithms Using weka tools
A Comparative Study of clustering algorithms Using weka tools Bharat Chaudhari 1, Manan Parikh 2 1,2 MECSE, KITRC KALOL ABSTRACT Data clustering is a process of putting similar data into groups. A clustering
Analysis of Internet Topologies
Analysis of Internet Topologies Ljiljana Trajković [email protected] Communication Networks Laboratory http://www.ensc.sfu.ca/cnl School of Engineering Science Simon Fraser University, Vancouver, British
Visualizing Repertory Grid Data for Formative Assessment
Visualizing Repertory Grid Data for Formative Assessment Kostas Pantazos 1, Ravi Vatrapu 1, 2 and Abid Hussain 1 1 Computational Social Science Laboratory (CSSL) Department of IT Management, Copenhagen
Visual Discovery in Multivariate Binary Data
Visual Discovery in Multivariate Binary Data Boris Kovalerchuk a*, Florian Delizy a, Logan Riggs a, Evgenii Vityaev b a Dept. of Computer Science, Central Washington University, Ellensburg, WA, 9896-7520,
DATA MINING - SELECTED TOPICS
DATA MINING - SELECTED TOPICS Peter Brezany Institute for Software Science University of Vienna E-mail : [email protected] 1 MINING SPATIAL DATABASES 2 Spatial Database Systems SDBSs offer spatial
HierarchyMap: A Novel Approach to Treemap Visualization of Hierarchical Data
P a g e 77 Vol. 9 Issue 5 (Ver 2.0), January 2010 Global Journal of Computer Science and Technology HierarchyMap: A Novel Approach to Treemap Visualization of Hierarchical Data Abstract- The HierarchyMap
Visualization of large data sets using MDS combined with LVQ.
Visualization of large data sets using MDS combined with LVQ. Antoine Naud and Włodzisław Duch Department of Informatics, Nicholas Copernicus University, Grudziądzka 5, 87-100 Toruń, Poland. www.phys.uni.torun.pl/kmk
Analysis of Internet Topologies: A Historical View
Analysis of Internet Topologies: A Historical View Mohamadreza Najiminaini, Laxmi Subedi, and Ljiljana Trajković Communication Networks Laboratory http://www.ensc.sfu.ca/cnl Simon Fraser University Vancouver,
Agenda. TreeMaps. What is a Treemap? Basics
Agenda TreeMaps What is a Treemap? Treemap Basics Original Treemap Algorithm (Slice-and-dice layout) Issues for Treemaps Cushion Treemaps Squarified Treemaps Ordered Treemaps Quantum Treemaps Other Treemaps
On History of Information Visualization
On History of Information Visualization Mária Kmeťová Department of Mathematics, Constantine the Philosopher University in Nitra, Tr. A. Hlinku 1, Nitra, Slovakia [email protected] Keywords: Abstract: abstract
Information Visualisation and Visual Analytics for Governance and Policy Modelling
Information Visualisation and Visual Analytics for Governance and Policy Modelling Jörn Kohlhammer 1, Tobias Ruppert 1, James Davey 1, Florian Mansmann 2, Daniel Keim 2 1 Fraunhofer IGD, Fraunhoferstr.
