Tie Visualization in NodeXL

Size: px
Start display at page:

Download "Tie Visualization in NodeXL"

Transcription

1 Tie Visualization in NodeXL Nick Gramsky ngramsky at cs.umd.edu CMSC 838C Social Computing University of Maryland College Park Abstract: The ability to visualize a network as it varies over time has become a challenge for researchers despite the rapid growth of visualization techniques. Node-link approaches are no more attempted than matrix, plotting or graphing methods. The ability to classify and visualize the difference between ties, especially reciprocity, has become an increasingly interesting topic in recent years. Very few attempts have bridged the visualization of the evolution of ties over time. In this paper we present a Tie Visualization extension to the Node-Link visualization tool NodeXL. The Tie Visualization extension classifies reciprocated and non-reciprocated ties between nodes in a network. It then uses color to distinguish between the different relationships, producing a single node-link representation of the network that captures the evolution over a set period of time. I. Introduction In recent years, visualization techniques for the analysis of network evolution have lagged behind the rapid growth of social media and electronic databases. Attempts vary in approach from graphs, matrices and even node- link diagrams. Node- Link diagrams typically portray evolution through the use of multiple images, either juxtaposed or shown in sequence through the use of a slider. Matrix or graph approaches benefit from the fact that they are able to show this evolution through one single image. The advantage is that the end user does not have to reply on memory to piece together the different images to comprehend what changed over the temporal period. Few node- link approaches have captured the evolution through a single image. Almost equally behind in progress is the classification, identification and visualization of the difference between the different ties within a social network. Researchers are interested in the identification of different relationships between nodes within a network as this can help identify the dynamics between the actors within the network. Few attempts take into consideration time or how the relationship may vary over time. Increased interest in recent years has been the identification of reciprocated relationships. In this paper we present a method to classify relationships based on how the level of general interaction and reciprocity changes over time. This dimension of evolution is then visualized with node- link diagrams. The necessary calculations to classify and visualize are accomplished using NodeXL. Color is used as the distinguishing factor after relationships have been classified. Using this technique, one single image portrays the evolution of the network over a predefined time period rather than multiple over. The purpose of this method is to distinguish between different types of relationships and help identify which relationships are flourishing or fading over a period of time. This technique it then applied to existing and new datasets in an effort to show the validity of such classification and visualization methods. The rest of the paper proceeds as follows: Section 2 will provide a related works section and compare previous approaches with the one introduced in this paper. Section 3 will outline the approach and methods used. The integration with NodeXL to make the calculation and visualization possible is discussed in section 4. Analysis and sample networks will be discussed in section 5. We conclude with section 6 where we discuss challenges with this work and future plans. II. Related Work

2 The Strength of Ties [4] is arguably the most influential work regarding social ties between people in a social network. Since the publication of this work, researchers have sought to further this research in many ways. An indication of the strength of this work may be found in the almost 19,000 citations of this work (according to Google Scholar) since it s publication in While the work in this paper does not identify strong or weak ties, one of the main motivations of this work is the desire to classify the difference between a strengthening or weakening tie and then visualize it in the context of the network structure. Other attempts have worked to quantify ties in social networks. [2] defined a framework to quantify the different levels of strengths of relationships. This was accomplished by classifying Facebook relationships on 70 different numerical indicators. Time between interactions was used in a few of these dimensions, though it was never taken into account over the entire life span or a set period of time. [5] succeeded in quantifying the change in the interactions of people as well as the entire network in general by evaluating an network of a large university over the course of a year. Neither of these, however, provided any visualization techniques. Very few attempts have been made to quantify reciprocity and ever fewer have made attempts to visualize the ties based on reciprocity. [9] was able to assign reciprocity indices to individuals based on the amount of time between interactions. This was accomplished by aggregating the behaviors of every relationship for each node. This index does not, however, provide any indication if the level of reciprocity is increasing or decreasing nor does it distinguish one relationship from another. [1] studied patterns of reciprocity and actually visualized these patterns but did so with bar charts. For her Master s Thesis, [Sankaranarayanan] used reciprocity as the means to visualize the ties within a blogging site. Different shades of color were used to distinguish between reciprocated and non- reciprocated edges. This work used novel approaches to show network structure by using flower petals instead of node- link diagrams, however the basis of the coloring scheme was based on an index showing how much two nodes reciprocated one another, not how reciprocity varied over time. Attempts to show network evolution over time vary in approach. [3] used matrices to show how degree and other network traits vary over time, however interactions between nodes required a second view and were limited to one slice at a time. [7] visualized the differing levels of general interaction in the Ben Shneiderman collection through the use of bar graphs and hierarchial clustering techniques and was effective in showing how relationships vary over time based on general interaction. Both of these methods, however, do not portray the structure of the social network, requiring one to visualize the network in a separate utility if insight to the network structure is needed. [6] succeeded in showing the evolution of a social network over time in a node- link diagram using NodeXL. This approach however, merely showed the emergence and vacation of nodes and edges over discrete periods of time and did not take into account varying attributes of relationships between the nodes. III. Methods The aim of this work is to accomplish two goals: 1) Classify ties between nodes and 2) visualize them in a single node- link diagram over a pre- defined temporal period. Two types of classification methods are sought after in this work: classifying ties based on the variance of general interaction and the variance of reciprocity. We define variance as the identification of a relationship increasing, decreasing, or remaining stable in one of the two categories over a period of time. Indices will be calculated for each variance and will be used as the basis for the coloring of the edges in the node- link diagram. As two nodes can both vary in the amount of general interactions and/or reciprocity between one another, we are only able to visualize one dimension at a time in the node- link diagram. We now briefly discuss the methods to calculate each index. A. General Activity Index The general activity index is used to quantify how the general interaction between two nodes varies over time. General activity is defined as any interaction between two nodes and is independent of who initiates the interaction. Interactions could be an being sent in an network, a reply or comment in a blog site, Twitter mention or re- tweet in a Twitter network, etc. The general activity Index is designed to show if these interactions are flourishing, remaining stable or dying in nature. Such classification could be used to infer a change in relationship status between the two nodes. For example a decrease in Facebook activity over a long period of time could

3 be a possible indication of a failing friendship. An increase in activity between one node and another node of high betweenness centrality might be an indication of a node becoming more powerful in a terrorist or workplace network. The index is a simple calculation computed as follows: For each relationship all interactions between both nodes that compose the tie are gathered in a list and ordered by time. The delta between each interaction is calculated resulting in N- 1 deltas for N interactions. The deltas are plotted sequentially as Cartesian points with each interaction plotted along the X- axis and the value of the delta as the y- value. from the first interaction between the two nodes until the very last interaction of every node in the entire network. This assumes the relationship remains in existence from the start of the relationship until the last known time period of the network. In an effort not to bias the amount of time between the last interaction and the end of the total time for the network, if the time between the last interaction and the end of the network is greater than the average of all deltas, the value is used. If it is less than the average, it is discarded. The reasoning is as follows: If two people each other every day at 8AM and the network sample goes from 7AM Mon 9AM Sun, the last delta between each node will still be 24 hours but the time between the last interaction and the end of the network will be one hour. If that one- hour period is used in the calculation the slope will indicate a slight increase in activity. However, if the two nodes ed every day except for the weekends, the last delta would be 24 hours but the tail would show an interaction lapse of 49 hours. In the former example we use this long tail as the relationship has faded in the time period of the network as it is entered. We account for issues of this nature without incorrectly biasing potential short interactions. The index for the relationship is the slope of the trend line for the plotted data. A simple linear regression using least squares is used to calculate the slope of the trend line for the series of deltas. Slopes positive in nature indicate a decrease in activity, negative in nature indicates an increase in activity and those around 0indicate a stable relationship. As we calculate a slope using least squares, at least 3 interactions (or 2 deltas) are needed in order to classify a tie. Figure 1 provides a visualization of this calculation. Fig. 1 Visualization of the calculation of the General Activity Index. B. Reciprocity Index Similar to the general activity index, the reciprocity index is calculated using a simple linear regression but the data used for the plots is substantially different. As we are dealing with reciprocal relationships, these relationships must be ones where the nodes reciprocate interactions between one another. Thus it is not as simple as one node ing another in an network, the other node must return the or reciprocate the interaction in some form. Similar to the general activity index we will use deltas between interactions but here we will use the difference in the number of unreciprocated interactions between each reciprocation as the value. We will simply look at the number of one- sided interactions between a reciprocated interaction. That number will be the delta we will use in the least squares calculation. 3 interactions will not necessarily guarantee we can calculate an index unlike the previous index. Here we need 3 reciprocated interactions. One should note that the length of time between interactions is not taken in account as the index is calculated. Figure 2 visualizes how this index is calculated. As NodeXL is used to implement these methods, we now turn to a discussion of how this is accomplished. IV. Implementation The calculation of the indices and the coloring methods were accomplishing by extending NodeXL. NodeXL is a free add- on to the Microsoft Excel program. It provides templates that visualize

4 Fig. 2 Visualization of the calculation of the Reciprocity Index. network data into node- link diagrams. Index calculation is made possible through the addition of a time- series tab, a calculation method and classifier dialogue for users to navigate as they classify and visualize the ties in their network data. Figure 3 shows the enhanced NodeXL ribbon with buttons to navigate new features. Users must walk through a 3- step process to obtain the visualization: Input, calculation and visualization. Upon opening the new version of NodeXL users select the Create Time Series button and are presented with a tab to enter a time series for the network. Data is to be entered as a list of edges with timestamps for each edge. Timestamps are interactions between the nodes themselves. For an example in a Twitter dataset we could enter Tweets where a user mentions or Re- Tweets another Tweet. Each pair of users listed in the tweet would create an edge between the tweeter program assumes the data is from a directed graph and thus like edges (reversed in nature) share values. Thus the edge A- B which indicates edge A initiated an interaction with B will share the same values as B- A. After calculating the indices the user then has the option to color the edges according to their classification. The Color Ties button brings up the dialogue seen in Figure 4. The user has the option of assigning colors to edges based on Reciprocity Index or General Activity Index. For each index, color is assigned to one of 3 classifications: Increasing, Stable or Decreasing. Assignments to each classification are accomplished via the range slider at the bottom. Users can ignore the stable entity and color the graph with a binary representation by ignoring the stable label and coloring everything as increasing or decreasing by setting both range sliders to 0. Fig. 4 Coloring dialogue added to NodeXL to allow the user to classify and color nodes after indices are calculated. Fig. 3 Additional Controls added to NodeXL Ribbon (highlighted in red box) and the user mentioned in the tweet and the timestamp would be the time the tweet was created. After creating the time series the user would select Create Indices and the General Activity Index, Reciprocity Index and weight (number of interactions) of each edge is calculated. The V. Results Several datasets were visualized with the Tie Visualization extension during the testing phase of the software. Figure 5 presents a view of the NON dataset from November 2005 through August In this visualization blue indicates edges that are decreasing in general activity where red edges are those that are increasing inactivity and grey are those considered to be stable. Initial glances

5 Fig. 5 NON network of blog replies. Blue indicate fading relationships between users, grey are stable relationships and the single red tie is the only considered to be flourishing. indicate that the network as a whole is slowing fizzling out as almost half of the relationships appear to be slowly dying. The exception lies in a single relationship that is thriving. Perhaps a more intriguing visualization can be found using date from the VAST 2008 Cell Phone [10] mini- challenge. Figure 6 has a binary coloring of that dataset. Edges are filtered such that only edges of weight 3 or more are shown. This filtering, in essence, not only removes relationships with low frequency but also provides a list of edges that guarantee a general activity index. Again black edges are fading and lime edges are flourishing. What is interesting to note in this visualization are the hub nodes, or the ones with a high betweenness centrality. The edges emanating from these edges are all black, indicating that the characters that were essentially the glue in this network have allowed their relationships with others to fade. Previous analyses of this network show that key roles of actors were switched to nodes of different identities to throw off the authorities. More recent data visualized with this approach includes the social network behind the Occupy Wall Street movement. For the month of November all Tweets that contained the #occupywallstreet hashtag on Twitter were gathered and archived in a database. At 1:30AM on November 15 the New York City Police Department conducted a raid on Liberty Square in an effort to clean the park, thereby disrupting the movement and its populous. Figures 6 and 7 are the Twitter social networks behind the OWS movement from 10PM November 14 4AM November 15. Figure 7 visualizes the general activity index and figure 8 visualizes the reciprocity index. The graph is again filtered much like the VAST network to only show edges of weight 3 or higher. One should note that the daily volume of Tweets increased 1000% from the previous week during the 24- hour period starting at the point of the raid. It is interesting to note how activity declined over this period. Investigation into the political nature of the hub nodes indicate they are in favor of the OWS movement. Perhaps the timing of early morning hours police activity is the cause behind a weakened social network despite one of its more critical periods. Clearly people were tweeting as indicated by the volume of tweets, but the interactions between key figures is not flourishing Fig VAST Cell Phone Challenge. Hub nodes are all fading. This is a relic of key figures of the network changing identities near the end of a 10-day activity.

6 Fig. 8 Social network showing evolution of general activity from #occupywallstreet hashtag on Twitter from 10PM 11/14 4AM 11/15. Black edges are those that are fading and lime edges are the few that are thriving. in general through this time. Reciprocity is, however, increasing where calculations are possible. Only a few ties are subject to the calculation (as figure 7 would suggest), yet those who are talking to/mentioning each are doing so, they are do so more often. Further research could look at these traits and see if similar patterns are found in protesting movements as well as seeing if successful movements contain different behaviors compared to unsuccessful movements. VI. Challenges / Future work Despite the ability to visualize networks and calculate the different indices for networks with ease this effort has room for improvement. Most noticeably with this extension to NodeXL is the performance. Calculating the indices for a network with thousands of entries in a time series is computationally challenging. Representing a network through a time series expands the size of the network and space needed. Where NodeXL can easily work with a hundred or so nodes with ease doing so over time can be a challenge. If every link of a network has 10 distinct time points or interactions, the space is now 10 times the size of the same static network. Calculation of large networks such as the #occupywallstreet network took well over 30 minutes. Efforts to filter out nodes that do not factor into the calculations prior to computation or parallel processing can be explored. Aside from technical issues that can make the tool unbearable at times there exist other areas where NodeXL can be enhanced to further the tie visualization capabilities. Currently the time series of the network defines the temporal period that the network is visualized over. A time slider that re- calculates the indices and re- colors the edges as a user navigates with the slider could help better identify critical moments within the network. Currently one must chose a time period and explore the network as it is entered. This feature, however, greatly depends on the ability to quickly calculate the indices for each edge.

7 Fig. 6 Social network showing evolution of general activity from #occupywallstreet hashtag on Twitter from 10PM 11/14 4AM 11/15. Black edges are those that are fading and lime edges are the few that are thriving. The current method to color the ties upon calculations is in need of tuning. The slider present should be accompanied with a histogram to aid the user in defining the breaking point between increasing, stable and decreasing points. Additionally NodeXL has the ability to vary color over a range of values for edges or nodes. The built- in feature does not work with general activity or reciprocity indices as the indices can be skewed towards one side or the other, providing false coloring schemes. The ability to refine the gradual transition of color form increasing <- > decreasing might remove the need to distinctly classify relationships. Though accurate in the ability to identify a change in reciprocity between two nodes, it is unclear how valuable such a metric or visualization method is. No clear insights were gained as the datasets presented in this paper were analyzed. Further investigation into other datasets are needed before a determination can be made. Furthermore the results presented in this paper were made solely by the author of the paper. Additionally this work could benefit from a user study to effectively evaluate the value of these methods. Such a user study should investigate the usability of the additional dialogues and methods added as well as the ability for users to provide insights to datasets through a controlled experiment. This approach and methods discussed in this paper were shared with a leading expert in the field of information visualization. He confirmed the academic field has an increased interested in investigating and visualizing reciprocity and this attempt, though limited does offer merit. Lastly this paper discusses ways to classify and visualize edges in a link- node diagram in an effort to identify varying relationships. Similar index calculations and coloring techniques could extend to nodes as well. [9] s reciprocity index could possibly be used to classify and color different nodes in a network. VII. Conclusion

8 This paper presents a method for classifying and visualizing tie in a social network through NodeXL. We extended the current NodeXL software by allowing the entry of a time series network. From there we classified if networks were increasing, remaining stable or decreasing in either general activity or reciprocity. Using NodeXL users can color the edges to gain a further insight of how the network has evolved over time through one static image. [9] Zhang, H.; Dantu, R.; Cangussu, J. W. Quantifying Reciprocity in Social Networks. CSE (4). [S.l.]: IEEE Computer Society p [10] References [1] Garlaschelli, D., and Loffredo, M., Patterns of link reciprocity in directed networks. Physics Review Letters, 93, [2] Gilbert, E. and Karahalios, K., Predicting Tie Strength With Social Media. In Proc. of CHI, [3] Gove, R., Gramsky, N., Kirby, R., Sefer, E., Sopan, A., Dunne, C., Shneiderman, B. and Taieb- Maimon, M., NetVisia: Heat map & matrix visualization of dynamic social network statistics & content, Proc. IEEE Conference on Social Computing, IEEE Press, Piscataway, NJ (October 2011). [4] Granovetter, M. S., The strength of weak ties. American Journal of Sociology, 78: , [5] Kossinets, G., Watts, D., Empirical analysis of an evolving social network. Science, 311:88 90, [6] Khurana, U., Nguyen, V., Cheng, H., Ahn, J., Chen, X., Shneiderman, B., Visual analysis of temporal trends in social networks using edge color coding and metric timelines, Proc. IEEE Conference on Social Computing, IEEE Press, Piscataway, NJ (October 2011). [7] Perer, A., Shneiderman, B., and Oard, D. W., Using rhythms of relationships to understand e- mail archives. J. Am. Soc. Inf. Sci. Technol., 57(14): , [8] Sankaranarayanan, K. Visualizing Reciprocity In an Online Community To Motivate Participation. Masters Thesis, University of Saskatchewan, Saskatoon. 141 p.

Welcome to the topic on creating key performance indicators in SAP Business One, release 9.1 version for SAP HANA.

Welcome to the topic on creating key performance indicators in SAP Business One, release 9.1 version for SAP HANA. Welcome to the topic on creating key performance indicators in SAP Business One, release 9.1 version for SAP HANA. 1 In this topic, you will learn how to: Use Key Performance Indicators (also known as

More information

3D Interactive Information Visualization: Guidelines from experience and analysis of applications

3D Interactive Information Visualization: Guidelines from experience and analysis of applications 3D Interactive Information Visualization: Guidelines from experience and analysis of applications Richard Brath Visible Decisions Inc., 200 Front St. W. #2203, Toronto, Canada, [email protected] 1. EXPERT

More information

Data Visualization Techniques

Data Visualization Techniques Data Visualization Techniques From Basics to Big Data with SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Generating the Best Visualizations for Your Data... 2 The

More information

Data representation and analysis in Excel

Data representation and analysis in Excel Page 1 Data representation and analysis in Excel Let s Get Started! This course will teach you how to analyze data and make charts in Excel so that the data may be represented in a visual way that reflects

More information

A Tutorial on dynamic networks. By Clement Levallois, Erasmus University Rotterdam

A Tutorial on dynamic networks. By Clement Levallois, Erasmus University Rotterdam A Tutorial on dynamic networks By, Erasmus University Rotterdam V 1.0-2013 Bio notes Education in economics, management, history of science (Ph.D.) Since 2008, turned to digital methods for research. data

More information

Excel -- Creating Charts

Excel -- Creating Charts Excel -- Creating Charts The saying goes, A picture is worth a thousand words, and so true. Professional looking charts give visual enhancement to your statistics, fiscal reports or presentation. Excel

More information

SAP Business Intelligence ( BI ) Financial and Budget Reporting. 7.0 Edition. (Best Seller At Least 43 copies Sold)

SAP Business Intelligence ( BI ) Financial and Budget Reporting. 7.0 Edition. (Best Seller At Least 43 copies Sold) SAP Business Intelligence ( BI ) Financial and Budget Reporting 7.0 Edition (Best Seller At Least 43 copies Sold) November 2011 Table of Contents Log In... 3 Initial Variable Screen... 5 Multiple / Single

More information

IBM SPSS Direct Marketing 23

IBM SPSS Direct Marketing 23 IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release

More information

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation

More information

Analysis of Stock Symbol Co occurrences in Financial Articles

Analysis of Stock Symbol Co occurrences in Financial Articles Analysis of Stock Symbol Co occurrences in Financial Articles Gregory Kramida ([email protected]) Introduction Stock market prices are influenced by a wide variety of factors. Undoubtedly, market news

More information

Visualization Techniques in Data Mining

Visualization Techniques in Data Mining Tecniche di Apprendimento Automatico per Applicazioni di Data Mining Visualization Techniques in Data Mining Prof. Pier Luca Lanzi Laurea in Ingegneria Informatica Politecnico di Milano Polo di Milano

More information

Network-Based Tools for the Visualization and Analysis of Domain Models

Network-Based Tools for the Visualization and Analysis of Domain Models Network-Based Tools for the Visualization and Analysis of Domain Models Paper presented as the annual meeting of the American Educational Research Association, Philadelphia, PA Hua Wei April 2014 Visualizing

More information

Network Analysis For Sustainability Management

Network Analysis For Sustainability Management Network Analysis For Sustainability Management 1 Cátia Vaz 1º Summer Course in E4SD Outline Motivation Networks representation Structural network analysis Behavior network analysis 2 Networks Over the

More information

Introduction to Exploratory Data Analysis

Introduction to Exploratory Data Analysis Introduction to Exploratory Data Analysis A SpaceStat Software Tutorial Copyright 2013, BioMedware, Inc. (www.biomedware.com). All rights reserved. SpaceStat and BioMedware are trademarks of BioMedware,

More information

Tracking Project Progress

Tracking Project Progress L E S S O N 2 Tracking Project Progress Suggested lesson time 45-55 minutes Lesson objectives To begin tracking an active project, you will: a b c Modify the environment for tracking. You will use the

More information

IBM SPSS Direct Marketing 22

IBM SPSS Direct Marketing 22 IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release

More information

Demographics of Atlanta, Georgia:

Demographics of Atlanta, Georgia: Demographics of Atlanta, Georgia: A Visual Analysis of the 2000 and 2010 Census Data 36-315 Final Project Rachel Cohen, Kathryn McKeough, Minnar Xie & David Zimmerman Ethnicities of Atlanta Figure 1: From

More information

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015 Principles of Data Visualization for Exploratory Data Analysis Renee M. P. Teate SYS 6023 Cognitive Systems Engineering April 28, 2015 Introduction Exploratory Data Analysis (EDA) is the phase of analysis

More information

Visualization Quick Guide

Visualization Quick Guide Visualization Quick Guide A best practice guide to help you find the right visualization for your data WHAT IS DOMO? Domo is a new form of business intelligence (BI) unlike anything before an executive

More information

Predict the Popularity of YouTube Videos Using Early View Data

Predict the Popularity of YouTube Videos Using Early View Data 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

SAS Analyst for Windows Tutorial

SAS Analyst for Windows Tutorial Updated: August 2012 Table of Contents Section 1: Introduction... 3 1.1 About this Document... 3 1.2 Introduction to Version 8 of SAS... 3 Section 2: An Overview of SAS V.8 for Windows... 3 2.1 Navigating

More information

Cluster Analysis for Evaluating Trading Strategies 1

Cluster Analysis for Evaluating Trading Strategies 1 CONTRIBUTORS Jeff Bacidore Managing Director, Head of Algorithmic Trading, ITG, Inc. [email protected] +1.212.588.4327 Kathryn Berkow Quantitative Analyst, Algorithmic Trading, ITG, Inc. [email protected]

More information

Analyzing Evolution of Network Attributes and Content with NetFlow: A Network Evolution Visualization Tool

Analyzing Evolution of Network Attributes and Content with NetFlow: A Network Evolution Visualization Tool Analyzing Evolution of Network Attributes and Content with NetFlow: A Network Evolution Visualization Tool Robert Gove 1,2,3, Nick Gramsky 1, Rose Kirby 1, Emre Sefer 1 1 Department of Computer Science

More information

Final Project Report

Final Project Report CPSC545 by Introduction to Data Mining Prof. Martin Schultz & Prof. Mark Gerstein Student Name: Yu Kor Hugo Lam Student ID : 904907866 Due Date : May 7, 2007 Introduction Final Project Report Pseudogenes

More information

White Paper April 2006

White Paper April 2006 White Paper April 2006 Table of Contents 1. Executive Summary...4 1.1 Scorecards...4 1.2 Alerts...4 1.3 Data Collection Agents...4 1.4 Self Tuning Caching System...4 2. Business Intelligence Model...5

More information

NakeDB: Database Schema Visualization

NakeDB: Database Schema Visualization NAKEDB: DATABASE SCHEMA VISUALIZATION, APRIL 2008 1 NakeDB: Database Schema Visualization Luis Miguel Cortés-Peña, Yi Han, Neil Pradhan, Romain Rigaux Abstract Current database schema visualization tools

More information

Visualization methods for patent data

Visualization methods for patent data Visualization methods for patent data Treparel 2013 Dr. Anton Heijs (CTO & Founder) Delft, The Netherlands Introduction Treparel can provide advanced visualizations for patent data. This document describes

More information

Anomaly Detection in Predictive Maintenance Time Alignment and Visualization

Anomaly Detection in Predictive Maintenance Time Alignment and Visualization Anomaly Detection in Predictive Maintenance Time Alignment and Visualization Phil Winters Rosaria Silipo [email protected] [email protected] Copyright 2015 by KNIME.com AG all rights reserved

More information

MultiExperiment Viewer Quickstart Guide

MultiExperiment Viewer Quickstart Guide MultiExperiment Viewer Quickstart Guide Table of Contents: I. Preface - 2 II. Installing MeV - 2 III. Opening a Data Set - 2 IV. Filtering - 6 V. Clustering a. HCL - 8 b. K-means - 11 VI. Modules a. T-test

More information

Assignment 2: Animated Transitions Due: Oct 12 Mon, 11:59pm, 2015 (midnight)

Assignment 2: Animated Transitions Due: Oct 12 Mon, 11:59pm, 2015 (midnight) 1 Assignment 2: Animated Transitions Due: Oct 12 Mon, 11:59pm, 2015 (midnight) Overview One of the things that make a visualization look polished is to add animation (animated transition) between each

More information

Interactive Data Visualization Program to Analyze Word Count Frequencies Over Time

Interactive Data Visualization Program to Analyze Word Count Frequencies Over Time Interactive Data Visualization Program to Analyze Word Count Frequencies Over Time Aryn Grause March 8, 2011 1 Objective The goal of this project is to build an interactive software tool which will produce

More information

Heat Map Explorer Getting Started Guide

Heat Map Explorer Getting Started Guide You have made a smart decision in choosing Lab Escape s Heat Map Explorer. Over the next 30 minutes this guide will show you how to analyze your data visually. Your investment in learning to leverage heat

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

Design Considerations for a Visualization and Simulation Tool for CBMS Data

Design Considerations for a Visualization and Simulation Tool for CBMS Data Design Considerations for a Visualization and Simulation Tool for CBMS Data Nelson Marcos 1,*, Gerardo Largoza 2, Briane Paul Samson 3, Johnn Jelvin S. Base 4, Lawrence Patrick C. Calulo 5, Bervyn S. Co

More information

Capturing Meaningful Competitive Intelligence from the Social Media Movement

Capturing Meaningful Competitive Intelligence from the Social Media Movement Capturing Meaningful Competitive Intelligence from the Social Media Movement Social media has evolved from a creative marketing medium and networking resource to a goldmine for robust competitive intelligence

More information

Microsoft Office Project Standard 2007 Project Professional 2007. April 2006. February 2006

Microsoft Office Project Standard 2007 Project Professional 2007. April 2006. February 2006 Microsoft Office Project Standard 2007 Project Professional 2007 April 2006 February 2006 February 2006 Table of Contents Overview of Microsoft Office Project Standard 2007 and Office Project Professional

More information

Executive Dashboard. User Guide

Executive Dashboard. User Guide Executive Dashboard User Guide 2 Contents Executive Dashboard Overview 3 Naming conventions 3 Getting started 4 Welcome to Socialbakers Executive Dashboard! 4 Comparison View 5 Setting up a comparison

More information

Component visualization methods for large legacy software in C/C++

Component visualization methods for large legacy software in C/C++ Annales Mathematicae et Informaticae 44 (2015) pp. 23 33 http://ami.ektf.hu Component visualization methods for large legacy software in C/C++ Máté Cserép a, Dániel Krupp b a Eötvös Loránd University [email protected]

More information

Visualizing Relationships and Connections in Complex Data Using Network Diagrams in SAS Visual Analytics

Visualizing Relationships and Connections in Complex Data Using Network Diagrams in SAS Visual Analytics Paper 3323-2015 Visualizing Relationships and Connections in Complex Data Using Network Diagrams in SAS Visual Analytics ABSTRACT Stephen Overton, Ben Zenick, Zencos Consulting Network diagrams in SAS

More information

Twitter and Natural Disasters Peter Ney

Twitter and Natural Disasters Peter Ney Twitter and Natural Disasters Peter Ney Introduction The growing popularity of mobile computing and social media has created new opportunities to incorporate social media data into crisis response. In

More information

A simple three dimensional Column bar chart can be produced from the following example spreadsheet. Note that cell A1 is left blank.

A simple three dimensional Column bar chart can be produced from the following example spreadsheet. Note that cell A1 is left blank. Department of Library Services Creating Charts in Excel 2007 www.library.dmu.ac.uk Using the Microsoft Excel 2007 chart creation system you can quickly produce professional looking charts. This help sheet

More information

Innovative Information Visualization of Electronic Health Record Data: a Systematic Review

Innovative Information Visualization of Electronic Health Record Data: a Systematic Review Innovative Information Visualization of Electronic Health Record Data: a Systematic Review Vivian West, David Borland, W. Ed Hammond February 5, 2015 Outline Background Objective Methods & Criteria Analysis

More information

Interactive Excel Spreadsheets:

Interactive Excel Spreadsheets: Interactive Excel Spreadsheets: Constructing Visualization Tools to Enhance Your Learner-centered Math and Science Classroom Scott A. Sinex Department of Physical Sciences and Engineering Prince George

More information

NodeXL for Network analysis Demo/hands-on at NICAR 2012, St Louis, Feb 24. Peter Aldhous, San Francisco Bureau Chief. peter@peteraldhous.

NodeXL for Network analysis Demo/hands-on at NICAR 2012, St Louis, Feb 24. Peter Aldhous, San Francisco Bureau Chief. peter@peteraldhous. NodeXL for Network analysis Demo/hands-on at NICAR 2012, St Louis, Feb 24 Peter Aldhous, San Francisco Bureau Chief [email protected] NodeXL is a template for Microsoft Excel 2007 and 2010, which

More information

Hierarchical Data Visualization. Ai Nakatani IAT 814 February 21, 2007

Hierarchical Data Visualization. Ai Nakatani IAT 814 February 21, 2007 Hierarchical Data Visualization Ai Nakatani IAT 814 February 21, 2007 Introduction Hierarchical Data Directory structure Genealogy trees Biological taxonomy Business structure Project structure Challenges

More information

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network , pp.273-284 http://dx.doi.org/10.14257/ijdta.2015.8.5.24 Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network Gengxin Sun 1, Sheng Bin 2 and

More information

an introduction to VISUALIZING DATA by joel laumans

an introduction to VISUALIZING DATA by joel laumans an introduction to VISUALIZING DATA by joel laumans an introduction to VISUALIZING DATA iii AN INTRODUCTION TO VISUALIZING DATA by Joel Laumans Table of Contents 1 Introduction 1 Definition Purpose 2 Data

More information

Sonatype CLM Server - Dashboard. Sonatype CLM Server - Dashboard

Sonatype CLM Server - Dashboard. Sonatype CLM Server - Dashboard Sonatype CLM Server - Dashboard i Sonatype CLM Server - Dashboard Sonatype CLM Server - Dashboard ii Contents 1 Introduction 1 2 Accessing the Dashboard 3 3 Viewing CLM Data in the Dashboard 4 3.1 Filters............................................

More information

"Excel with Excel 2013: Pivoting with Pivot Tables" by Venu Gopalakrishna Remani. October 28, 2014

Excel with Excel 2013: Pivoting with Pivot Tables by Venu Gopalakrishna Remani. October 28, 2014 Teaching Excellence and Innovation 1 Pivot table Pivot table does calculations with criteria Data should be arranged as : Field names in the first rows, records in rows No blank rows or blank columns should

More information

Executive Dashboard Cookbook

Executive Dashboard Cookbook Executive Dashboard Cookbook Rev: 2011-08-16 Sitecore CMS 6.5 Executive Dashboard Cookbook A Marketers Guide to the Executive Insight Dashboard Table of Contents Chapter 1 Introduction... 3 1.1 Overview...

More information

Daily Traffic Control Log

Daily Traffic Control Log Daily Traffic Control Log User Instructions Name: FAP&A940/3.2 Property of Ford Motor Company GIS: 37.01 S+3T Proprietary Printed December 2012. This Instruction manual has been written to accompany the

More information

Microsoft Project 2007 Level 1: Creating Project Tasks

Microsoft Project 2007 Level 1: Creating Project Tasks Microsoft Project 2007 Level 1: Creating Project Tasks By Robin Peers Robin Peers, 2008 ABOUT THIS CLASS Regardless of job title, most of us have needed to act as a project manager, at one time or another.

More information

SuperViz: An Interactive Visualization of Super-Peer P2P Network

SuperViz: An Interactive Visualization of Super-Peer P2P Network SuperViz: An Interactive Visualization of Super-Peer P2P Network Anthony (Peiqun) Yu [email protected] Abstract: The Efficient Clustered Super-Peer P2P network is a novel P2P architecture, which overcomes

More information

Data Visualization Techniques

Data Visualization Techniques Data Visualization Techniques From Basics to Big Data with SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Generating the Best Visualizations for Your Data... 2 The

More information

Microsoft Excel 2010 Charts and Graphs

Microsoft Excel 2010 Charts and Graphs Microsoft Excel 2010 Charts and Graphs Email: [email protected] Web Page: http://training.health.ufl.edu Microsoft Excel 2010: Charts and Graphs 2.0 hours Topics include data groupings; creating

More information

Formulas, Functions and Charts

Formulas, Functions and Charts Formulas, Functions and Charts :: 167 8 Formulas, Functions and Charts 8.1 INTRODUCTION In this leson you can enter formula and functions and perform mathematical calcualtions. You will also be able to

More information

Dealing with Data in Excel 2010

Dealing with Data in Excel 2010 Dealing with Data in Excel 2010 Excel provides the ability to do computations and graphing of data. Here we provide the basics and some advanced capabilities available in Excel that are useful for dealing

More information

Visualizing the Top 400 Universities

Visualizing the Top 400 Universities Int'l Conf. e-learning, e-bus., EIS, and e-gov. EEE'15 81 Visualizing the Top 400 Universities Salwa Aljehane 1, Reem Alshahrani 1, and Maha Thafar 1 [email protected], [email protected], [email protected]

More information

SPSS Manual for Introductory Applied Statistics: A Variable Approach

SPSS Manual for Introductory Applied Statistics: A Variable Approach SPSS Manual for Introductory Applied Statistics: A Variable Approach John Gabrosek Department of Statistics Grand Valley State University Allendale, MI USA August 2013 2 Copyright 2013 John Gabrosek. All

More information

Describe the process of parallelization as it relates to problem solving.

Describe the process of parallelization as it relates to problem solving. Level 2 (recommended for grades 6 9) Computer Science and Community Middle school/junior high school students begin using computational thinking as a problem-solving tool. They begin to appreciate the

More information

Department of Information Technology. Microsoft Outlook 2013. Outlook 101 Basic Functions

Department of Information Technology. Microsoft Outlook 2013. Outlook 101 Basic Functions Department of Information Technology Microsoft Outlook 2013 Outlook 101 Basic Functions August 2013 Outlook 101_Basic Functions070713.doc Outlook 101: Basic Functions Page 2 Table of Contents Table of

More information

Microsoft Office Excel 2007 Key Features. Office of Enterprise Development and Support Applications Support Group

Microsoft Office Excel 2007 Key Features. Office of Enterprise Development and Support Applications Support Group Microsoft Office Excel 2007 Key Features Office of Enterprise Development and Support Applications Support Group 2011 TABLE OF CONTENTS Office of Enterprise Development & Support Acknowledgment. 3 Introduction.

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Creating Bar Charts and Pie Charts Excel 2010 Tutorial (small revisions 1/20/14)

Creating Bar Charts and Pie Charts Excel 2010 Tutorial (small revisions 1/20/14) Creating Bar Charts and Pie Charts Excel 2010 Tutorial (small revisions 1/20/14) Excel file for use with this tutorial GraphTutorData.xlsx File Location http://faculty.ung.edu/kmelton/data/graphtutordata.xlsx

More information

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers 60 Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative

More information

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions What is Visualization? Information Visualization An Overview Jonathan I. Maletic, Ph.D. Computer Science Kent State University Visualize/Visualization: To form a mental image or vision of [some

More information

ABOUT THIS DOCUMENT ABOUT CHARTS/COMMON TERMINOLOGY

ABOUT THIS DOCUMENT ABOUT CHARTS/COMMON TERMINOLOGY A. Introduction B. Common Terminology C. Introduction to Chart Types D. Creating a Chart in FileMaker E. About Quick Charts 1. Quick Chart Behavior When Based on Sort Order F. Chart Examples 1. Charting

More information

Petrel TIPS&TRICKS from SCM

Petrel TIPS&TRICKS from SCM Petrel TIPS&TRICKS from SCM Knowledge Worth Sharing Histograms and SGS Modeling Histograms are used daily for interpretation, quality control, and modeling in Petrel. This TIPS&TRICKS document briefly

More information

Computer Training Centre University College Cork

Computer Training Centre University College Cork Computer Training Centre University College Cork Project 2013 Table of Contents What's new in Project 2013... 1 Manual scheduling... 1 Graphical Reports... 1 Trace task paths... 1 Easier view customization...

More information

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 Yannis Stavrakas Vassilis Plachouras IMIS / RC ATHENA Athens, Greece {yannis, vplachouras}@imis.athena-innovation.gr Abstract.

More information

A CRF-based approach to find stock price correlation with company-related Twitter sentiment

A CRF-based approach to find stock price correlation with company-related Twitter sentiment POLITECNICO DI MILANO Scuola di Ingegneria dell Informazione POLO TERRITORIALE DI COMO Master of Science in Computer Engineering A CRF-based approach to find stock price correlation with company-related

More information

Dynamics CRM for Outlook Basics

Dynamics CRM for Outlook Basics Dynamics CRM for Outlook Basics Microsoft Dynamics CRM April, 2015 Contents Welcome to the CRM for Outlook Basics guide... 1 Meet CRM for Outlook.... 2 A new, but comfortably familiar face................................................................

More information

ECS 235A Project - NVD Visualization Using TreeMaps

ECS 235A Project - NVD Visualization Using TreeMaps ECS 235A Project - NVD Visualization Using TreeMaps Kevin Griffin Email: [email protected] December 12, 2013 1 Introduction The National Vulnerability Database (NVD) is a continuously updated United

More information

Best Practices for Dashboard Design with SAP BusinessObjects Design Studio

Best Practices for Dashboard Design with SAP BusinessObjects Design Studio Ingo Hilgefort, SAP Mentor February 2015 Agenda Best Practices on Dashboard Design Performance BEST PRACTICES FOR DASHBOARD DESIGN WITH SAP BUSINESSOBJECTS DESIGN STUDIO DASHBOARD DESIGN What is a dashboard

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

World Trade Analysis

World Trade Analysis World Trade Analysis Brendan Fruin [email protected] Introduction With the vast amount of data being collected and made publicly available, individuals from all walks of life have been able to provide

More information

Visual Structure Analysis of Flow Charts in Patent Images

Visual Structure Analysis of Flow Charts in Patent Images Visual Structure Analysis of Flow Charts in Patent Images Roland Mörzinger, René Schuster, András Horti, and Georg Thallinger JOANNEUM RESEARCH Forschungsgesellschaft mbh DIGITAL - Institute for Information

More information

MicroStrategy Desktop

MicroStrategy Desktop MicroStrategy Desktop Quick Start Guide MicroStrategy Desktop is designed to enable business professionals like you to explore data, simply and without needing direct support from IT. 1 Import data from

More information

Excel 2007 Basic knowledge

Excel 2007 Basic knowledge Ribbon menu The Ribbon menu system with tabs for various Excel commands. This Ribbon system replaces the traditional menus used with Excel 2003. Above the Ribbon in the upper-left corner is the Microsoft

More information

KPN SMS mail. Send SMS as fast as e-mail!

KPN SMS mail. Send SMS as fast as e-mail! KPN SMS mail Send SMS as fast as e-mail! Quick start Start using KPN SMS mail in 5 steps If you want to install and use KPN SMS mail quickly, without reading the user guide, follow the next five steps.

More information

8 TIPS FOR MAKING THE MOST OF GOOGLE ANALYTICS. Brought to you by Geary LSF and Orbital Informatics

8 TIPS FOR MAKING THE MOST OF GOOGLE ANALYTICS. Brought to you by Geary LSF and Orbital Informatics 8 TIPS FOR MAKING THE MOST OF GOOGLE ANALYTICS Brought to you by Geary LSF and Orbital Informatics TABLE OF CONTENTS 3 5 7 8 9 10 11 12 13 14 15 Introduction 8 Tips for Google Analytics Don t let Google

More information

The Kinetics of Enzyme Reactions

The Kinetics of Enzyme Reactions The Kinetics of Enzyme Reactions This activity will introduce you to the chemical kinetics of enzyme-mediated biochemical reactions using an interactive Excel spreadsheet or Excelet. A summarized chemical

More information

GeoGebra Statistics and Probability

GeoGebra Statistics and Probability GeoGebra Statistics and Probability Project Maths Development Team 2013 www.projectmaths.ie Page 1 of 24 Index Activity Topic Page 1 Introduction GeoGebra Statistics 3 2 To calculate the Sum, Mean, Count,

More information

Introduction to Project Management

Introduction to Project Management L E S S O N 1 Introduction to Project Management Suggested lesson time 50-60 minutes Lesson objectives To be able to identify the steps involved in project planning, you will: a b c Plan a project. You

More information

Numeracy and mathematics Experiences and outcomes

Numeracy and mathematics Experiences and outcomes Numeracy and mathematics Experiences and outcomes My learning in mathematics enables me to: develop a secure understanding of the concepts, principles and processes of mathematics and apply these in different

More information

Create an Excel BI report and share on SharePoint 2013

Create an Excel BI report and share on SharePoint 2013 2013 Create an Excel BI report and share on SharePoint 2013 Hands-On Lab Lab Manual This document is provided as-is. Information and views expressed in this document, including URL and other Internet Web

More information

A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic

A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic Report prepared for Brandon Slama Department of Health Management and Informatics University of Missouri, Columbia

More information

SOCIAL ENGAGEMENT BENCHMARK REPORT THE SALESFORCE MARKETING CLOUD. Metrics from 3+ Million Twitter* Messages Sent Through Our Platform

SOCIAL ENGAGEMENT BENCHMARK REPORT THE SALESFORCE MARKETING CLOUD. Metrics from 3+ Million Twitter* Messages Sent Through Our Platform THE SALESFORCE MARKETING CLOUD SOCIAL ENGAGEMENT BENCHMARK REPORT Metrics from 3+ Million Twitter* Messages Sent Through Our Platform *All trademarks, service marks, and trade names are the property of

More information

Data Visualization Handbook

Data Visualization Handbook SAP Lumira Data Visualization Handbook www.saplumira.com 1 Table of Content 3 Introduction 20 Ranking 4 Know Your Purpose 23 Part-to-Whole 5 Know Your Data 25 Distribution 9 Crafting Your Message 29 Correlation

More information

SPSS: Getting Started. For Windows

SPSS: Getting Started. For Windows For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 Introduction to SPSS Tutorials... 3 1.2 Introduction to SPSS... 3 1.3 Overview of SPSS for Windows... 3 Section 2: Entering

More information

NHA. User Guide, Version 1.0. Production Tool

NHA. User Guide, Version 1.0. Production Tool NHA User Guide, Version 1.0 Production Tool Welcome to the National Health Accounts Production Tool National Health Accounts (NHA) is an internationally standardized methodology that tracks public and

More information

Adroit Research NVivo10 Workshop Notes

Adroit Research NVivo10 Workshop Notes Adroit Research NVivo10 Workshop Notes GENERAL Create a new project My training project. *nvp file is stored in My Documents (default). Three views navigation, list and detail view. Detail view by default

More information

NetBeans Profiler is an

NetBeans Profiler is an NetBeans Profiler Exploring the NetBeans Profiler From Installation to a Practical Profiling Example* Gregg Sporar* NetBeans Profiler is an optional feature of the NetBeans IDE. It is a powerful tool that

More information

Evaluating Web Site Structure A Set of Techniques

Evaluating Web Site Structure A Set of Techniques Introduction Evaluating Web Site Structure A Set of Techniques K. Frederickson-Mele, Michael D. Levi, and Frederick G. Conrad U.S. Department of Labor, Bureau of Labor Statistics Washington, DC As the

More information