Université Pierre et Marie Curie, LIP6, ComplexNetworks team. 4 place Jussieu, Paris, France. Phone: +33 (0)

Size: px
Start display at page:

Download "Université Pierre et Marie Curie, LIP6, ComplexNetworks team. 4 place Jussieu, 75005 Paris, France. Phone: +33 (0)1 44 27 88 88"

Transcription

1 Title: Name: Affil./Addr.: Gephi Sébastien Heymann Université Pierre et Marie Curie, LIP6, ComplexNetworks team 4 place Jussieu, Paris, France Phone: +33 (0) sebastien.heymann@lip6.fr Gephi Synonyms exploratory network analysis, network visualization, visual analytics, open source Glossary API: an Application Programming Interface is an interface for software components to communicate with each other with a clear separation of components. Homophily: tendency to link to similar others. Layout: algorithm which calculates the position of elements in a graphic space. Raster image: image encoded by a two-dimensional matrix of pixels. Shortest path: minimal distance between two nodes of a network. Sparkline: small intense, simple, word-sized graphic with typographic resolution. Subgraph: graph whose nodes and edges connecting these nodes are a subset of a graph. Vector image: images encoded by a set of geometrical functions.

2 Definition 2 Gephi was created in 2008 by Mathieu Bastian, Sébastien Heymann, and Mathieu Jacomy, and extended by Eduardo Ramos Ibañez, Cezary Bartosiak, Julian Bilcke, Patrick McSweeney, André Panisson, Jérémy Subtil, Helder Suzuki, Martin Skurla, and Antonio Patriarca. It is suitable for the analysis of all kind of complex networks, although it is mostly used for social network analysis. It is distributed using a dual licensing scheme under the GNU General Public License (GNU GPL) v3 and the Common Development and Distribution License (CDDL) v1. Gephi can be used as a stand-alone application for the desktop, and as Java library for embedding some of the features in third-party programs. It scales to 10,000 nodes and edges with 1GB RAM and 1 CPU, and up to 1 million nodes and edges with 32GB RAM and 8 CPUs. It runs on Linux, Windows, and Mac OS X. It is written in Java 6 and OpenGL 1.2. Introduction Gephi is an open source software for the visual exploration of networks (also called graphs). A network is made of a set of entities, called the nodes, and a set of relationships between entities, called the edges. While various softwares exist to visualize and analyse networks, Gephi is particularly suited for networks with node attributes. Attributes are key-value pairs associated to each node or each edge. For example, individuals of a social network may have attributes such as gender, language, and age. [ Video introducing Gephi: ] Gephi users interact with the visualization in real-time to position the nodes in a two or three dimensional space using layout algorithms, or by manually moving nodes (see Fig.1). They use node attributes to change the color and size of the nodes, in order to find groups and individuals. The goal is to study the correlation of node attributes

3 3 Fig. 1. Overview of Gephi 0.8. and network structure by using visual patterns. Classic metrics of social network analysis, such as node degree or betweenness centrality measures, can be computed and used in the visualization as well (see Fig.2). The network can also be filtered based on attributes. Fig. 2. Network visualization example. Node size is proportional to the betweenness centrality value of the node.

4 4 Gephi is not limited to social networks. Any kind of network can be analysed, like the internet topology (i.e. connections between machines), peer-to-peer file-sharing networks, biological networks, on-line social networks (e.g. Twitter, Facebook), communication (e.g. ) and financial networks, but also semantic networks, organizational networks and more. Gephi aims at covering the entire process from data importing to aesthetics refinements and interaction. Data can be imported and exported in various file formats, and can be retrieved from databases. Once the visual exploration is over, the user refine aesthetics and export graphics in vector file formats to ensure readability and quality publishing on print and interactive graphics. This project is supported by an international community, which is lead by the French non-profit corporation called the Gephi Consortium. Key Points The strengths of Gephi are real-time visual feedback, performance, modularity, and its community. The Gephi user interface is focused on the creation of network visuals in realtime. The key innovation is to ease the interactions with the network. The user can literally play with the visual representation of the network. By playing, we mean experimenting various visual configurations for the purpose of seeing the outcome of any action instantaneously. This is made possible with the following features. The user apply layout algorithms to shape the network structure in 2-D or 3-D, for instance using force-directed layouts. Such algorithms calculate the layout of a network using repulsive forces between all nodes, but also using attractive forces between nodes which are adjacent. Each layout iteration calculates the forces applied on each node, and updates each node position. The visualization is refreshed at each iteration, therefore providing

5 5 real-time feedback for users. Some layouts are implemented with no stopping condition. The user can therefore tweak the layout parameters in real-time, until they decide to stop its execution. Interactions while calculating layout is made technically possible by using multi-threading processing, and the GPU for rendering the visualization. Gephi is stable and can scale enough to load networks of up to 1 million nodes and edges. The rendering engine is able to handle large networks and yet guarantees responsiveness. The minimum technical requirements of the software makes small networks actionable on low configurations such as netbooks. large networks of around a million of nodes and edges can also be analysed on visualization servers. In addition of interactive exploration of large networks, Gephi provides efficient implementations of classic metrics used in Social Network Analysis, including Betweenness Centrality, Clustering Coefficient, PageRank or Louvain Modularity for community detection. Gephi is a stand-alone application, built with Java SE 6 on top of the NetBeans Platform, which is a software for creating applications (see Fig.3). An installer makes Gephi available on all platforms having the Java Virtual Machine running. A graphic card with OpenGL 1.2 is required. The features are extensible through Java plug-ins which use the Gephi APIs. For instance, the OpenOrd layout algorithm is a plug-in which implements the Layout API. Such source code structure makes the software maintainable. A version of the software without the user interface is also distributed under the name of Gephi Toolkit. It is used as a Java library to create novel applications on desktop or on server. Finally, key benefits are provided by the Gephi community: members answer questions on the forum and fix the most common bugs. They organize meet-ups in their cities, and provide training seminars to newcomers.

6 6 Fig. 3. Architecture of Gephi (left) and Gephi Toolkit( right). Historical Background Gephi is a software developed since It was primarily created to enable researchers in social sciences to study the Web at Fondation Maison des Sciences de l Homme in Paris, France. Today, the Gephi Consortium aims at creating a sustainable software and technical ecosystem, driven by an international open-source community which shares common interests in networks and complex systems. Since the begining, an non-profit organization called Association Gephi provided a legal entity to support, protect and promote the Gephi project. Hosted alternatively by Association WebAtlas, Linkfluence SAS and SciencesPo Medialab, the initial contributors Mathieu Bastian, Sebastien Heymann and Mathieu Jacomy have progressively set up an international community of users and contributors. They notably participated in the Google Summer of Code program each year since 2009, and won the Oracle 2010 Duke s Choice Award for best Innovative Technical Data Visualization. They launched the Gephi Consortium in 2011, which is a non-profit corporation created to join the efforts of industrials, laboratories and civil society in building Gephi. Created under the French law of July 1st, 1901, it is governed by a board of directors. The Gephi Consortium makes an R&D effort to build generic and reusable parts of Gephi, improves the competitive technology at low costs, and creates standards to ensure interoperability.

7 7 Research partners include Inria, Sciences-Po Medialab, Fondation Maison des Sciences de l Homme TIC-Migrations, UPMC-CNRS LIP6 ComplexNetworks, Université de Technologie de Compiègne COSTECH, ISI Foundation, Indiana University Center for Complex Networks and Systems Research, and Stanford Mapping the Republic of Letters. Private parters include Quid Inc, Linkfluence SAS, and Neo Technology Inc. Features Input/output data formats File input While many file formats exist to encode network data, Gephi supports the most common ones: CSV, GDF, GEXF, GML, GraphML, Graphviz DOT, Pajek NET, Tulip TLP, Ucinet DL, XGMML, and RDF. The later is available throught the Semantic Web plugin, developed by the Inria WIMMICS team. A spreadsheet importer helps users to model tabular data as a network. File Output Data can be saved in Gephi sessions, but also in other formats like node and edge lists, GDF, GEXF, GraphML, Pajek NET, Ucinet DL, and CXF. Therefore, users can migrate data from one software to another in order to benefit from their specific features. Graphical Output Network visuals can be exported in PDF or SVG for printing. Designers can edit them using third-party tools. Raster graphics such as PNG is also available, as long as the Tikz format for embedding figures in LateX documents. Interactive graphics exporters

8 8 are available as plug-ins, such as Microsoft Seadragon graphics, and KMZ for exporting nodes with geographical coordinates. Data exported in GEXF with visual attributes (i.e. node position, color, size) can be re-used in browser-based viewers like Sigma.js. GEXF The file format working group of the Gephi Consortium created the Graph Exchange XML Format (GEXF), which is the standard used in Gephi to encode network data. This format is an XML language for describing network nodes and edges, attributes, hierarchies and their temporal evolution. GEXF is an improvement compared to GraphML for the capability to encode dynamic networks. The namespaces allow anyone to extend the format for specific purposes, for instance the addition of application data, without disrupting other applications. The libraries made by the community enable the reading and writing of GEXF files in C++, R, Python, Java, Perl and Javascript. They facilitate the adoption of the format, and improve interoperability between Gephi and third-party tools. The GEXF format is also supported by other softwares such as NetworkX, Tulip, and GraphStream, and by on-line services like Issuecrawler. Databases Gephi can retrieve data from relational databases such as MySQL, SQL Server, PostgreSQL, SQLite and Teradata. The community creates plug-ins to support graph databases such as Neo4j, OrientDB, and InfiniteGraph. Streaming Real-world structures are constantly changing, and file formats are not suitable to exchange such type of dynamic data. A lot of well-established on-line systems already stream data to its users using a streaming API. Twitter for example defined a Streaming API to allow near real-time access to its data. Inspired by the GraphStream Java

9 9 Library, the Graph Streaming API of Gephi provides a unified framework for streaming network events in a JSON format, like the addition, the modification and the removal of nodes and edges over time. A client can receive data from a master, but the specifications allow more flexibility: clients can interact with the master by pushing data to it. In the case of two Gephi instances connected through this API, a change in a network at the master s Gephi should cause a change in the client s Gephi, and a change at the client s Gephi will cause it to send requests to the master to update its network accordingly. Both instances work in a distributed mode. Different people could therefore work in a collaborative mode to study a network. [ Video of the Graph Streaming in action: v=7sw_fdiy0sg ] Layouts Layouts are algorithms which position the nodes in the 2-D or 3-D graphic space. Choosing the right layout and tuning its parameters requires skills at the crossing of art and science. The readability of network visualizations is indeed both a matter of individual perception, knowledge on the data, and analytic skills. Layouts are used to help navigate in the network. The various patterns created emphasis different properties of the structure of networks. Force-directed algorithms Gephi provides layouts of the class called force-directed algorithms. These layouts rely on a physical metaphor to position the nodes according to the position of the others. Roughly speaking, connected nodes tend to be closer, while disconnected nodes tend to be further. They are usually described as spring embedders [Kobourov, 2012] due to the way the forces are computed. Choosing a layout is a trade-off between the capability of

10 10 the algorithm to handle the given data set, the user time constraint, and the structural properties to be emphasised. Layouts may take edge weight into account in calculating forces. They may prevent node to overlap, thus increasing the readability. Finally, some implementations can run faster on multi-core CPUs. The following table provides the technical capabilities of available layouts: Table 1. Layouts technical capabilities. The number of nodes and time complexity gives an order of magnitude. layout # nodes time complexity edge weight node overlap multi-cpu Fruchterman-Reingold 1 to 1,000 O(N 2 ) no prevent no ForceAtlas 1 to 10k O(N 2 ) yes prevent plug-in ForceAtlas 2 1 to 1m O(N log(n)) yes prevent no OpenOrd 100 to 1m O(N log(n)) yes cluttered native Yifan Hu Multilevel 100 to 100k O(N log(n)) no cluttered no Fruchterman-Reingold This layout [Fruchterman, 1991] simulates the graph as a system of mass particles. The nodes are the mass particles and the edges are springs between the particles. The algorithms try to minimize the energy of this physical system. It has become a standard but remains very slow (see Fig.4). ForceAtlas ForceAtlas is the home-brew layout of Gephi. It is made to lay out real-world networks, which have the following properties: scale-free distribution of node degree, and smallworld effect (i.e. small distance between all nodes). It is focused on readability but it is slow (see Fig.5).

11 11 Fig. 4. Coappearance network of characters in the novel Les Miserables [Knuth, 1993], laid out by Fruchterman-Reingold. Fig. 5. Coappearance network of characters in the novel Les Miserables [Knuth, 1993], laid out by ForceAtlas. ForceAtlas 2 Improved version of the ForceAtlas to handle large networks while keeping a good readability. Nodes repulsion is approximated with a Barnes-Hut calculation [Barnes, 1986], which therefore reduces the algorithm complexity. It replaces the attraction and repulsion forces of Force Atlas by a scaling parameter (see Fig.6). [ Video of the layout on a grid: ]

12 12 Fig. 6. Coappearance network of characters in the novel Les Miserables [Knuth, 1993], laid out by ForceAtlas 2. OpenOrd It is one of the few force-directed layout algorithms that can scale to over 1 million nodes, making it ideal for large graphs [Martin, 2011]. However, small graphs (i.e. hundreds or less nodes) do not always end up looking good. The algorithm is originally based on Frutcherman-Reingold and works with a fixed number of iterations controlled via a simulated annealing type schedule (liquid, expansion, cool-down, crunch, and simmer). Long edges are cut to allow clusters to separate. This algorithm expects undirected weighted graphs and aims at better distinguishing clusters. It can be run in parallel on multiple processors to speed up computing. It stops automatically (see Fig.7). [ Video of the layout on a grid: ] Yifan Hu Multilevel It is a very fast algorithm with a good quality on large graphs. It combines a forcedirected model with a graph coarsening technique to reduce the complexity [Hu, 2005]. The repulsive forces on one node from a cluster of distant nodes are approximated by a Barnes-Hut calculation, which treats them as one super-node. It stops automatically (see Fig.8).

13 13 Fig. 7. Coappearance network of characters in the novel Les Miserables [Knuth, 1993], laid out by OpenOrg. Fig. 8. Coappearance network of characters in the novel Les Miserables [Knuth, 1993], laid out by Yifan Hu Multilevel. [ Video of the layout on a grid: ] Other layouts Circular It draws nodes in a circle ordered by any node attribute. It is useful to show a distribution of nodes with their links (see Fig.9).

14 14 Fig. 9. Coappearance network of characters in the novel Les Miserables [Knuth, 1993], laid out by the Circular layout. Radial Axis It is provided with the Circular Layout plug-in. It groups nodes and draws the groups in axes (or spars) radiating outwards from a central circle. Groups are generated using a metric (degree, betweenness centrality...) or an attribute. It is useful to study homophily by showing distributions of nodes inside groups with their links (see Fig.10). Fig. 10. Coappearance network of characters in the novel Les Miserables [Knuth, 1993], laid out by the Radial Axis layout.

15 15 Geographical The GeoLayout uses latitude and longitude coordinates to set nodes position on the graphic space. Several projections are available, including Mercator which is used by Google Maps and other on-line services. Graphviz binding All Graphviz layouts are made available through a Gephi plug-in. Metrics Gephi provides classic statistics to the study of social networks. Network metrics are statistics related to the whole network. Node metrics are statistics related to each node. Edge metrics are statistics related to each edge. Network metrics Diameter It is the maximal distance between all pairs of nodes [Brandes, 2001]. Density It is a measure of how close the network is to complete. A complete graph has all possible edges and density equal to 1. Louvain Modularity It is a non-overlapping community detection algorithm based on modularity optimization able to run on large networks [Blondel, 2008]. Intuitively, it shows how the network divides naturally into groups of nodes with dense connections within groups and sparser connections between groups.

16 16 Number of Connected Components Connected Components are subgraphs in which a path exists between all pairs of nodes, and no path exists from a node of the subgraph to a node not in the subgraph [Tarjan, 1972]. Clustering Coefficient The Watts-Strogatz clustering coefficient, when applied to a single node, is a measure of how complete the neighborhood of a node is. When applied to an entire network, it is the average clustering coefficient over all of the nodes in the network [Latapy, 2008]. Node metrics Degree Centrality The degree of a node is the number of edges that are adjacent to that node. Betweenness Centrality It measures how often a node appears on shortest paths between nodes in the network [Brandes, 2001]. Closeness Centrality It is the average distance from a given node to all other nodes in the network [Brandes, 2001]. Eigenvector Centrality Node importance in a network based on a node s connections. A node is central to the extent that the node is connected to others who are central.

17 17 PageRank Importance of a Web page within the network considering the probability that a user reaches this page based on the hyperlinks. It is a variant of the Eigenvector Centrality [Page, 1999]. HITS Hyperlink-Induced Topic Search (HITS) is a link analysis algorithm that rates Web pages, developed by Jon Kleinberg [Kleinberg, 1999]. The HITS metric determines two values for a page: its authority, which estimates the value of the content of the page, and its hub value, which estimates the value of its links to other pages. Edge metrics Average Path Length The average distance between all pairs of nodes. Connected nodes have distance 1. The diameter is the longest distance between any two nodes in the network (i.e. how far apart are the two most distant nodes) [Brandes, 2001]. Dynamic metrics Some metrics can be computed over time: the number of nodes, the number of edges, the average degree, and the clustering coefficient. Methods Gephi takes its roots in the Exploratory Data Analysis field of research. Promoted by John Tukey in the book Exploratory Data Analysis (1977) to visualize data sets and statistical results, this approach emphasis the importance of curiosity and serendipity (i.e. discoveries made while searching for something else) to data analysis. As John

18 18 Tukey says, the greatest value of a picture is when it forces us to notice what we never expected to see. Main benefit is the generation of novel questions and research hypotheses. As depicted by Ben Fry in Computational Information Design (2004), one has to acquire and clean data, filter and compute statistics on it, represent and interact with it. But this process involves many back-and-forth between the different steps of data analysis.visualizing the data may indeed reveal the need to acquire more data, or filter it in another way ; interacting with it may requires to change visual variables and aesthetics. Gephi is designed to facilitate this non-linear process. In particular, Gephi is focused on the visualization of the network, the real-time interaction with the data (e.g. node grouping, filtering, use of statistical results in the visualization), and the building of a visual language [Bertin, 1999]. This language makes use of circles and lines, colors and sized to create informative visuals, which aim at being the network equivalent of geographical maps [Boyack, 2005]. Visualization Gephi is focused on the creation of node-link diagrams, which are graphics of dots joined by lines as a representation of nodes (the dots) and edges (the lines). Users interact with the visualization to explore the network structure and raise hypotheses based on the visual patterns. Beyond layouts, the mapping of data attributes and visual attributes allows to set node color and size, label color and size, edge color and thickness. Interaction techniques available in Gephi includes zooming and panning, node selection, node dragging, and tools like node painter and shortest path discovery (see Fig.11).

19 19 Fig. 11. Visualization window. Interoperability With the support of various file formats, Gephi can exchange data with at least these network analysis tools: Cytoscape, CuttleFish, GraphStream, Graphviz, GUESS, I- Graph, JUNG, Network Workbench, NodeXL, Pajek, Sonivis, Tulip, UCINET, and Visone. Special options Filters A key aspect of social network analysis is the identification of groups and the study of the connections between them. For instance, the study of homophily in networks relies on the correlation between the linkage structure and node attributes. One can ask do people who like the same film tend to connect more with each other, and less with

20 20 the rest of the network? The discovery of relevant groups is made easier with filters, which are conditions on nodes or edges applied to view a subgraph. Gephi provides a user interface to create filter queries based on metrics and attributes. For examples: Show only the nodes with degree between 38 and 125. Show only the nodes with gender attribute equal to female. Show only the nodes connected to a given node, and the relations between them. Filters can be combined using boolean operators to create complex queries. A scripting plug-in in Python enables the creation of scripts in a similar fashion as the GUESS software do. Network spreadsheet The network can be seen as a list of nodes and edges. A node table and an edge table are available in the Data Laboratory. Users can add nodes and edges, create or delete attributes. Each table have different features for searching, sorting, and editing data, like the merging of nodes, and the removal of duplicates. Vector graphics maker The publication of visual results requires to control rendering details and aesthetics, especially for printing. The user can tune the rendering of nodes and edges, and see the result before exporting it in a vector file format, either SVG or PDF (see Fig.12 and 13). Timeline Dynamic networks are networks which evolve over time with the addition and removal of nodes and edges. They have been the subject of increasing interest, given their potential as a theoretical model and their promising applications. Following this trend,

21 21 Fig. 12. Interface of the vector graphics maker. Fig. 13. Examples of aesthetics improvements: fonts (left), colors and sizes (right). Gephi has incorporated tools to study dynamic networks. From a visualization perspective, a critical tool is the Timeline component, which allows users to select pertinent time intervals to display and explore the corresponding network. The timeline component features a sparkline chart in the background of the interval selection drawer. This feature helps users to focus on particular periods of the evolution of the dynamic network, like bursts of connections or changes in network density or other simple metrics.

22 The timeline animation enable the selected time frame to slide as the corresponding network is being displayed on the screen, like a movie player (see Fig.14). 22 Fig. 14. Timeline component, where the sparkline shows the number of edges over time. The selected period is from December to April. Software updates When fixes are deployed, the users are notified in the Gephi interface, where they can apply them in a few clicks. Gephi automatically gets the list of plug-ins available from the Gephi Plugin portal. Documentation Information Website: Wiki: Video introduction: Help Forum: (5000+ posts, topics, 800+ active members) Mailing-lists: gephi-plugins@lists.gephi.org, gephi-dev@lists.gephi.org Individual contact: contact@gephi.org News Blog:

23 23 Twitter: Facebook: Video channel: Key Applications Web analysis: the e-diasporas Atlas The Digital Diasporas Atlas project aims at mapping and analyzing the occupation (in a quasi-geopolitical sense) of digital territories by the connected migrants [Diminescu, 2008]. In the context of the ediasporas Atlas, the network serves primarily to allow formulation of research hypotheses. Networks serve as an embodiment of the construction of an interpretation of data. They thus all have a heuristic function, their interpretation being an aspect of visual analytics. Gephi has been used to visualise and interpret the structuring and distribution of actors in migrant-community networks on the Web. By handling the network, by observing its evolution (timeline), by visualizing the place and the connections of a given website, by identifying clusters, by filtering the data by categories, in brief by interpreting the graph, the researcher produces various representations (or views) of the corpus that allow him to formulate hypothesis of research that will be supported (or not) by other online/offline fieldwork investigations. [Diminescu, 2011] Social media analysis: Truthy Truthy is a system to analyze and visualize the diffusion of information on Twitter. It evaluates thousands of tweets an hour to identify new and emerging bursts of activity around memes of various flavors. The data and statistics provided by Truthy are

24 24 designed to aid in the study of social epidemics: How do memes propagate through the Twittersphere? What causes a burst of popularity? [Ratkiewicz, 2011] This system helps users to identify suspicious memes which might deliver disinformation or propaganda. These memes are firstly selected by the system by analysing the diffusion network of the information (i.e. retweets and mentions). Then users classify the meme by exploring the related statistics, timeline and visualizations of the diffusion network. The Gephi Toolkit is used for rendering the visualizations. Dynamic network analysis: Face-to-Face Contact Patterns Describing and understanding contacts between children at school would help quantify the transmission opportunities of respiratory infections and identify situations within schools where the risk of transmission is higher. The measurements were carried out in a French school (612 years children). Data were collected on the time-resolved face-toface proximity of children and teachers. [Stehlé, 2011] The dynamical evolution of the contacts was visualized using Gephi. The video is available on Future Directions The Gephi Consortium identifies strategic needs from the industry and research, create standards to ensure interoperability, and organize the contributors to produce generic and reusable parts of Gephi. A stable Gephi 1.0 is under study, in parallel with developments to include Dynamic Network Analysis and improvements on visualization capabilities using shader techniques on GPU, and customizable renderers for information visualization research. A web marketplace is currently developed to facilitate the exchange of services be-

25 tween members of the community, like professional training, consulting and private development. 25 Cross-References Visualization of Networks, Visualization of Large Networks, Data Mining, Large Networks Analysis of, Mapping Online Networks, Network Representations of Complex Data, Temporal Networks, Formats, Linked Open Data, References [Barnes and Hut (1986)] Barnes J, Hut P (1986) A hierarchical O(N log N) force-calculation algorithm. In: Nature (ISSN ), vol. 324, Dec. 4, 1986, pp [Bastian et al. (2009)] Bastian M, Heymann S, Jacomy M (2009) Gephi: an open source software for exploring and manipulating networks. In: Proceedings of the Third International AAAI Conference on Weblogs and Social Media (ICWSM 09), in American Journal of Sociology, pp [Bertin J (1999)] Bertin J (1999) Sémiologie graphique: les diagrammes, les réseaux, les cartes. Editions de l Ecole des Hautes Etudes en Sciences Sociales [Blondel et al. (2008)] Blondel V, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks, In: Journal of Statistical Mechanics: Theory and Experiment 2008 (10), P1000 [Boyack et al. (2005)] Boyack K W, Klavans R, Brner K (2005) Mapping the backbone of science. In: Scientometrics 64(3), pp [Brandes U (2001)] Brandes U (2001) A faster algorithm for betweenness centrality, In: Journal of Mathematical Sociology, vol. 25, pp [Diminescu D (2008)] Diminescu D (2008) The Connected Migrant: an Epistemological Manifest. In: Social Sciences Information, vol 47 [Diminescu et al. (2011)] Diminescu D, Bourgeois M, Renault M, Jacomy M (2011) Digital Diasporas Atlas Exploration and Cartography of Diasporas in Digital Networks. In: Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 11)

26 26 [Fruchterman and Reingold (1991)] Fruchterman T M J, Reingold E M (1991) Graph Drawing by Force-Directed Placement. In: Software: Practice and Experience, 21(11) [Fry B (2004)] Fry B (2004) Computational Information Design. Ph.D. Thesis [Hu Y F (2005)] Hu Y F (2005) Efficient and high quality force-directed graph drawing. In: The Mathematica Journal, 10 (37-71) [Kleinberg J (1999)] Kleinberg J (1999) Authoritative sources in a hyperlinked environment. In: Journal of the ACM 46 (5): [Kobourov S G (to appear in 2012)] Kobourov S G (to appear in 2012) Force-Directed Drawing Algorithms. In: Handbook of Graph Drawing and Visualization, CRC Press [Knuth D E (1993)] Knuth D E (1993) The Stanford GraphBase: A Platform for Combinatorial Computing. Addison-Wesley, Reading, MA [Latapy M (2008)] Latapy M (2008) Main-memory Triangle Computations for Very Large (Sparse (Power-Law)) Graphs. In: Theoretical Computer Science (TCS) 407 (1-3), pp [Martin et al. (2011)] Martin S, Brown W M, Klavans R, Boyack K (2011) OpenOrd: An Open-Source Toolbox for Large Graph Layout. In: SPIE Conference on Visualization and Data Analysis (VDA) [Page et al. (1999)] Page L, Brin S, Motwani R, Winograd T (1999) The PageRank citation ranking: Bringing order to the Web. Technical Report. Stanford InfoLab. [Ratkiewicz et al. (2011)] Ratkiewicz J, Conover M, Meiss M, Gonçalves B, Patil S, Flammini A, Menczer F (2011) Truthy: mapping the spread of astroturf in microblog streams, In: Proceedings of the 20th international conference companion on World wide web (WWW 11) [Stehlé et al. (2011)] Stehlé J, Voirin N, Barrat A, Cattuto C, Isella L, Pinton J-F, Quaggiotto M, Van den Broeck W, Rgis C, Lina B, Vanhems P (2011) High-Resolution Measurements of Face-to-Face Contact Patterns in a Primary School. In: PLoS One, August 16, [Tarjan R (1972)] Tarjan R (1972) Depth-First Search and Linear Graph Algorithms. In: SIAM Journal on Computing 1 (2): [Tukey J (1977)] Tukey J (1977) Exploratory Data Analysis, 1 edn., Addison-Wesley Recommended Reading [Conway and White (2012)] Conway D, White J M (2012) Machine Learning for Hackers. Chapter Analyzing Social Graphs, Visualizing the Clustered Twitter Network with Gephi

27 27 [Bulik-Sullivan and Sullivan (2012)] Bulik-Sullivan B, Sullivan P (2012) The authorship network of genome-wide association studies. In: Nature Genetics 44, 113 [De Maeyer J (2010)] De Maeyer J (2010) Methods for mapping hyperlink networks: Examining the environment of Belgian news websites. In: 11th International Symposium on Online Journalism [Helmond and Weltevrede (2012)] Helmond A, Weltevrede E (2012) Where do bloggers blog? Platform transitions within the historical Dutch blogosphere. In: First Monday, vol 17, number 2 [Kelly et al. (2012)] Kelly J, Barash V, Alexanyan K, Etling B, Faris R, Gasser U, Palfrey J (2012) Mapping Russian Twitter. In: Berkman Center Research Publication No [Latour et al. (2012)] Latour B, Jensen P, Venturini T, Grauwin S, Boullier D (2012) The Whole is Always Smaller Than Its Parts. In: British Journal of Sociology [Oldham et al. (2012)] Oldham P, Hall S, Burton G (2012) Synthetic Biology: Mapping the Scientific Landscape. In: PLoS ONE 7(4): e [Teng et al. (2011)] Teng C-Y, Lin Y-R, Adamic L (2011) Recipe recommendation using ingredient networks [Barabasi A-L (2003)] Barabasi A-L (2003) Linked: How Everything Is Connected to Everything Else and What It Means for Business, Science, and Everyday Life. Plume [Börner K (2010)] Börner K (2010) Atlas of Science: Visualizing What We Know. The MIT Press [Easley and Kleinberg (2010)] Easley D, Kleinberg J (2010) Networks, Crowds, and Markets: Reasoning About a Highly Connected World. Cambridge University Press [Newman et al. (2006)] Newman M, Barabasi A-L, Watts D J (2006) The Structure and Dynamics of Networks. Princeton University Press [Watts D J (2003)] Watts D J (2003) Six Degrees: The Science of a Connected Age. W. W. Norton & Company

Gephi Tutorial Quick Start

Gephi Tutorial Quick Start Gephi Tutorial Welcome to this introduction tutorial. It will guide you to the basic steps of network visualization and manipulation in Gephi. Gephi version 0.7alpha2 was used to do this tutorial. Get

More information

Visual Analysis of Complex Networks for Business Intelligence with Gephi

Visual Analysis of Complex Networks for Business Intelligence with Gephi Visual Analysis of Complex Networks for Business Intelligence with Gephi Sébastien Heymann, Bénédicte Le Grand LIP6 - CNRS - Université Pierre et Marie Curie, CRI - Université Paris 1 Panthéon Sorbonne

More information

Sébastien Heymann <seb@gephi.org>

Sébastien Heymann <seb@gephi.org> Sébastien Heymann Exploratory Network Analysis 1 see the network 1st graph viz tool: Pajek (1996) Vladimir Batagelj, Andrej Mrvar 2 interact in real time Gephi prototype (2008) group,

More information

A Tutorial on dynamic networks. By Clement Levallois, Erasmus University Rotterdam

A Tutorial on dynamic networks. By Clement Levallois, Erasmus University Rotterdam A Tutorial on dynamic networks By, Erasmus University Rotterdam V 1.0-2013 Bio notes Education in economics, management, history of science (Ph.D.) Since 2008, turned to digital methods for research. data

More information

A comparative study of social network analysis tools

A comparative study of social network analysis tools Membre de Membre de A comparative study of social network analysis tools David Combe, Christine Largeron, Előd Egyed-Zsigmond and Mathias Géry International Workshop on Web Intelligence and Virtual Enterprises

More information

Graph Visualization Tools: A Comparative Analysis

Graph Visualization Tools: A Comparative Analysis Graph Visualization Tools: A Comparative Analysis Fariha Majeed 1, Dr. Saif-ur-Rahman 2 1,2 Shaheed Zulfikar Ali Bhutto Institute of Science and Technology (SZABIST) Karachi, Pakistan 1 majeed.fariha@gmail.com

More information

Network Metrics, Planar Graphs, and Software Tools. Based on materials by Lala Adamic, UMichigan

Network Metrics, Planar Graphs, and Software Tools. Based on materials by Lala Adamic, UMichigan Network Metrics, Planar Graphs, and Software Tools Based on materials by Lala Adamic, UMichigan Network Metrics: Bowtie Model of the Web n The Web is a directed graph: n webpages link to other webpages

More information

Graph/Network Visualization

Graph/Network Visualization Graph/Network Visualization Data model: graph structures (relations, knowledge) and networks. Applications: Telecommunication systems, Internet and WWW, Retailers distribution networks knowledge representation

More information

Exploratory Facebook Social Network Analysis

Exploratory Facebook Social Network Analysis Exploratory Facebook Social Network Analysis with Gephi Thomas Plotkowiak twitterresearcher.wordpress.com Process 1. Import Data with Netviz 2. Gephi 1. Open 2. Layout 3. Ranking (Degree) 4. Statistics

More information

Network VisualizationS

Network VisualizationS Network VisualizationS When do they make sense? Where to start? Clement Levallois, Assist. Prof. EMLYON Business School v. 1.1, January 2014 Bio notes Education in economics, management, history of science

More information

An Introduction to KeyLines and Network Visualization

An Introduction to KeyLines and Network Visualization An Introduction to KeyLines and Network Visualization 1. What is KeyLines?... 2 2. Benefits of network visualization... 2 3. Benefits of KeyLines... 3 4. KeyLines architecture... 3 5. Uses of network visualization...

More information

ForceAtlas2, A Graph Layout Algorithm for Handy Network Visualization

ForceAtlas2, A Graph Layout Algorithm for Handy Network Visualization 1 ForceAtlas2, A Graph Layout Algorithm for Handy Network Visualization Mathieu Jacomy, Non member Sebastien Heymann, Non member Tommaso Venturini, Non member and Mathieu Bastian, Non member MM. Jacomy

More information

By LaBRI INRIA Information Visualization Team

By LaBRI INRIA Information Visualization Team By LaBRI INRIA Information Visualization Team Tulip 2011 version 3.5.0 Tulip is an information visualization framework dedicated to the analysis and visualization of data. Tulip aims to provide the developer

More information

Sisense. Product Highlights. www.sisense.com

Sisense. Product Highlights. www.sisense.com Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze

More information

Comparative Analysis Report:

Comparative Analysis Report: Comparative Analysis Report: Visualization Tools & Platforms By Annabel Weiner, Erol Basusta, Leah Wilkinson, and Quenton Oakes Table of Contents Executive Summary Introduction Assessment Criteria Publishability

More information

Semantic Search in Portals using Ontologies

Semantic Search in Portals using Ontologies Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br

More information

Visualizing Data: Scalable Interactivity

Visualizing Data: Scalable Interactivity Visualizing Data: Scalable Interactivity The best data visualizations illustrate hidden information and structure contained in a data set. As access to large data sets has grown, so has the need for interactive

More information

Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics

Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Zhao Wenbin 1, Zhao Zhengxu 2 1 School of Instrument Science and Engineering, Southeast University, Nanjing, Jiangsu

More information

Practical Graph Mining with R. 5. Link Analysis

Practical Graph Mining with R. 5. Link Analysis Practical Graph Mining with R 5. Link Analysis Outline Link Analysis Concepts Metrics for Analyzing Networks PageRank HITS Link Prediction 2 Link Analysis Concepts Link A relationship between two entities

More information

an introduction to VISUALIZING DATA by joel laumans

an introduction to VISUALIZING DATA by joel laumans an introduction to VISUALIZING DATA by joel laumans an introduction to VISUALIZING DATA iii AN INTRODUCTION TO VISUALIZING DATA by Joel Laumans Table of Contents 1 Introduction 1 Definition Purpose 2 Data

More information

OpenText Information Hub (ihub) 3.1 and 3.1.1

OpenText Information Hub (ihub) 3.1 and 3.1.1 OpenText Information Hub (ihub) 3.1 and 3.1.1 OpenText Information Hub (ihub) 3.1.1 meets the growing demand for analytics-powered applications that deliver data and empower employees and customers to

More information

Jing Yang Spring 2010

Jing Yang Spring 2010 Information Visualization Jing Yang Spring 2010 1 InfoVis Programming 2 1 Outline Look at increasing higher-level tools 2D graphics API Graphicial User Interface (GUI) toolkits Visualization framework

More information

Visualization Method of Trajectory Data Based on GML, KML

Visualization Method of Trajectory Data Based on GML, KML Visualization Method of Trajectory Data Based on GML, KML Junhuai Li, Jinqin Wang, Lei Yu, Rui Qi, and Jing Zhang School of Computer Science & Engineering, Xi'an University of Technology, Xi'an 710048,

More information

Temporal Visualization and Analysis of Social Networks

Temporal Visualization and Analysis of Social Networks Temporal Visualization and Analysis of Social Networks Peter A. Gloor*, Rob Laubacher MIT {pgloor,rjl}@mit.edu Yan Zhao, Scott B.C. Dynes *Dartmouth {yan.zhao,sdynes}@dartmouth.edu Abstract This paper

More information

Expanding the CASEsim Framework to Facilitate Load Balancing of Social Network Simulations

Expanding the CASEsim Framework to Facilitate Load Balancing of Social Network Simulations Expanding the CASEsim Framework to Facilitate Load Balancing of Social Network Simulations Amara Keller, Martin Kelly, Aaron Todd 4 June 2010 Abstract This research has two components, both involving the

More information

Gephi Tutorial Visualization

Gephi Tutorial Visualization Gephi Tutorial Welcome to this Gephi tutorial. It will guide you to the basic and advanced visualization settings in Gephi. The selection and interaction with tools will also be introduced. Follow the

More information

PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.

PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS. PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS Project Project Title Area of Abstract No Specialization 1. Software

More information

SCADE System 17.0. Technical Data Sheet. System Requirements Analysis. Technical Data Sheet SCADE System 17.0 1

SCADE System 17.0. Technical Data Sheet. System Requirements Analysis. Technical Data Sheet SCADE System 17.0 1 SCADE System 17.0 SCADE System is the product line of the ANSYS Embedded software family of products and solutions that empowers users with a systems design environment for use on systems with high dependability

More information

A Performance Evaluation of Open Source Graph Databases. Robert McColl David Ediger Jason Poovey Dan Campbell David A. Bader

A Performance Evaluation of Open Source Graph Databases. Robert McColl David Ediger Jason Poovey Dan Campbell David A. Bader A Performance Evaluation of Open Source Graph Databases Robert McColl David Ediger Jason Poovey Dan Campbell David A. Bader Overview Motivation Options Evaluation Results Lessons Learned Moving Forward

More information

Network-Based Tools for the Visualization and Analysis of Domain Models

Network-Based Tools for the Visualization and Analysis of Domain Models Network-Based Tools for the Visualization and Analysis of Domain Models Paper presented as the annual meeting of the American Educational Research Association, Philadelphia, PA Hua Wei April 2014 Visualizing

More information

TIBCO Spotfire Network Analytics 1.1. User s Manual

TIBCO Spotfire Network Analytics 1.1. User s Manual TIBCO Spotfire Network Analytics 1.1 User s Manual Revision date: 26 January 2009 Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED TIBCO

More information

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Ernst van Waning Senior Sales Engineer May 28, 2010 Agenda SPSS, an IBM Company SPSS Statistics User-driven product

More information

Visualizing a Neo4j Graph Database with KeyLines

Visualizing a Neo4j Graph Database with KeyLines Visualizing a Neo4j Graph Database with KeyLines Introduction 2! What is a graph database? 2! What is Neo4j? 2! Why visualize Neo4j? 3! Visualization Architecture 4! Benefits of the KeyLines/Neo4j architecture

More information

Self-adaptive e-learning Website for Mathematics

Self-adaptive e-learning Website for Mathematics Self-adaptive e-learning Website for Mathematics Akira Nakamura Abstract Keyword searching and browsing on learning website is ultimate self-adaptive learning. Our e-learning website KIT Mathematics Navigation

More information

IC05 Introduction on Networks &Visualization Nov. 2009. <mathieu.bastian@gmail.com>

IC05 Introduction on Networks &Visualization Nov. 2009. <mathieu.bastian@gmail.com> IC05 Introduction on Networks &Visualization Nov. 2009 Overview 1. Networks Introduction Networks across disciplines Properties Models 2. Visualization InfoVis Data exploration

More information

WebSphere Business Monitor

WebSphere Business Monitor WebSphere Business Monitor Dashboards 2010 IBM Corporation This presentation should provide an overview of the dashboard widgets for use with WebSphere Business Monitor. WBPM_Monitor_Dashboards.ppt Page

More information

Create Cool Lumira Visualization Extensions with SAP Web IDE Dong Pan SAP PM and RIG Analytics Henry Kam Senior Product Manager, Developer Ecosystem

Create Cool Lumira Visualization Extensions with SAP Web IDE Dong Pan SAP PM and RIG Analytics Henry Kam Senior Product Manager, Developer Ecosystem Create Cool Lumira Visualization Extensions with SAP Web IDE Dong Pan SAP PM and RIG Analytics Henry Kam Senior Product Manager, Developer Ecosystem 2015 SAP SE or an SAP affiliate company. All rights

More information

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network , pp.273-284 http://dx.doi.org/10.14257/ijdta.2015.8.5.24 Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network Gengxin Sun 1, Sheng Bin 2 and

More information

Understanding Data: A Comparison of Information Visualization Tools and Techniques

Understanding Data: A Comparison of Information Visualization Tools and Techniques Understanding Data: A Comparison of Information Visualization Tools and Techniques Prashanth Vajjhala Abstract - This paper seeks to evaluate data analysis from an information visualization point of view.

More information

MicroStrategy Analytics Express User Guide

MicroStrategy Analytics Express User Guide MicroStrategy Analytics Express User Guide Analyzing Data with MicroStrategy Analytics Express Version: 4.0 Document Number: 09770040 CONTENTS 1. Getting Started with MicroStrategy Analytics Express Introduction...

More information

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS Stacey Franklin Jones, D.Sc. ProTech Global Solutions Annapolis, MD Abstract The use of Social Media as a resource to characterize

More information

Ovation Operator Workstation for Microsoft Windows Operating System Data Sheet

Ovation Operator Workstation for Microsoft Windows Operating System Data Sheet Ovation Operator Workstation for Microsoft Windows Operating System Features Delivers full multi-tasking operation Accesses up to 200,000 dynamic points Secure standard operating desktop environment Intuitive

More information

Dong-Joo Kang* Dong-Kyun Kang** Balho H. Kim***

Dong-Joo Kang* Dong-Kyun Kang** Balho H. Kim*** Visualization Issues of Mass Data for Efficient HMI Design on Control System in Electric Power Industry Visualization in Computerized Operation & Simulation Tools Dong-Joo Kang* Dong-Kyun Kang** Balho

More information

MEng, BSc Applied Computer Science

MEng, BSc Applied Computer Science School of Computing FACULTY OF ENGINEERING MEng, BSc Applied Computer Science Year 1 COMP1212 Computer Processor Effective programming depends on understanding not only how to give a machine instructions

More information

Visualizing the Top 400 Universities

Visualizing the Top 400 Universities Int'l Conf. e-learning, e-bus., EIS, and e-gov. EEE'15 81 Visualizing the Top 400 Universities Salwa Aljehane 1, Reem Alshahrani 1, and Maha Thafar 1 saljehan@kent.edu, ralshahr@kent.edu, mthafar@kent.edu

More information

Algorithms for representing network centrality, groups and density and clustered graph representation

Algorithms for representing network centrality, groups and density and clustered graph representation COSIN IST 2001 33555 COevolution and Self-organization In dynamical Networks Algorithms for representing network centrality, groups and density and clustered graph representation Deliverable Number: D06

More information

Outlines. Business Intelligence. What Is Business Intelligence? Data mining life cycle

Outlines. Business Intelligence. What Is Business Intelligence? Data mining life cycle Outlines Business Intelligence Lecture 15 Why integrate BI into your smart client application? Integrating Mining into your application Integrating into your application What Is Business Intelligence?

More information

Technical Specification. Solutions created by knowledge and needs

Technical Specification. Solutions created by knowledge and needs Technical Specification Solutions created by knowledge and needs The industrial control and alarm management system that integrates video, voice and data Technical overview Process Architechture OPC-OCI

More information

Operationalise Predictive Analytics

Operationalise Predictive Analytics Operationalise Predictive Analytics Publish SPSS, Excel and R reports online Predict online using SPSS and R models Access models and reports via Android app Organise people and content into projects Monitor

More information

Recent Large Graph Visualization Tools : A Review

Recent Large Graph Visualization Tools : A Review 159 Recent Large Graph Visualization Tools : A Review Sorn Jarukasemratana Tsuyoshi Murata Large graph visualization tools are important instruments for researchers to understand large graph data sets.

More information

CRGroup Whitepaper: Digging through the Data. www.crgroup.com. Reporting Options in Microsoft Dynamics GP

CRGroup Whitepaper: Digging through the Data. www.crgroup.com. Reporting Options in Microsoft Dynamics GP CRGroup Whitepaper: Digging through the Data Reporting Options in Microsoft Dynamics GP The objective of this paper is to provide greater insight on each of the reporting options available to you within

More information

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers 60 Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative

More information

Open Source Visualization with OpenGraphiti. Thibault Reuille (@ThibaultReuille) thibault@opendns.com. Andrew Hay (@andrewsmhay) ahay@opendns.

Open Source Visualization with OpenGraphiti. Thibault Reuille (@ThibaultReuille) thibault@opendns.com. Andrew Hay (@andrewsmhay) ahay@opendns. Open Source Visualization with OpenGraphiti Thibault Reuille (@ThibaultReuille) thibault@opendns.com Andrew Hay (@andrewsmhay) ahay@opendns.com Introduction Humans have different ways of efficiently digesting

More information

EdgeLap: Identifying and discovering features from overlapping sets in networks

EdgeLap: Identifying and discovering features from overlapping sets in networks Project Title: EdgeLap: Identifying and discovering features from overlapping sets in networks Names and Email Addresses: Jessica Wong (jhmwong@cs.ubc.ca) Aria Hahn (hahnaria@gmail.com) Sarah Perez (karatezeus21@gmail.com)

More information

Big Data: Rethinking Text Visualization

Big Data: Rethinking Text Visualization Big Data: Rethinking Text Visualization Dr. Anton Heijs anton.heijs@treparel.com Treparel April 8, 2013 Abstract In this white paper we discuss text visualization approaches and how these are important

More information

PROVIDING INSIGHT FOR OPERATIONAL SUCCESS

PROVIDING INSIGHT FOR OPERATIONAL SUCCESS idashboards for Operations Management PROVIDING INSIGHT FOR OPERATIONAL SUCCESS idashboards helped Precoat move from manual data mining and paper reports to a system that allows us to identify best practices

More information

Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies

Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies Somesh S Chavadi 1, Dr. Asha T 2 1 PG Student, 2 Professor, Department of Computer Science and Engineering,

More information

INSPIRE Dashboard. Technical scenario

INSPIRE Dashboard. Technical scenario INSPIRE Dashboard Technical scenario Technical scenarios #1 : GeoNetwork catalogue (include CSW harvester) + custom dashboard #2 : SOLR + Banana dashboard + CSW harvester #3 : EU GeoPortal +? #4 :? + EEA

More information

BIG DATA IN BUSINESS ENVIRONMENT

BIG DATA IN BUSINESS ENVIRONMENT Scientific Bulletin Economic Sciences, Volume 14/ Issue 1 BIG DATA IN BUSINESS ENVIRONMENT Logica BANICA 1, Alina HAGIU 2 1 Faculty of Economics, University of Pitesti, Romania olga.banica@upit.ro 2 Faculty

More information

Advanced analytics at your hands

Advanced analytics at your hands 2.3 Advanced analytics at your hands Neural Designer is the most powerful predictive analytics software. It uses innovative neural networks techniques to provide data scientists with results in a way previously

More information

CiteSeer x in the Cloud

CiteSeer x in the Cloud Published in the 2nd USENIX Workshop on Hot Topics in Cloud Computing 2010 CiteSeer x in the Cloud Pradeep B. Teregowda Pennsylvania State University C. Lee Giles Pennsylvania State University Bhuvan Urgaonkar

More information

TEXT-FILLED STACKED AREA GRAPHS Martin Kraus

TEXT-FILLED STACKED AREA GRAPHS Martin Kraus Martin Kraus Text can add a significant amount of detail and value to an information visualization. In particular, it can integrate more of the data that a visualization is based on, and it can also integrate

More information

Can Twitter Predict Royal Baby's Name?

Can Twitter Predict Royal Baby's Name? Summary Can Twitter Predict Royal Baby's Name? Bohdan Pavlyshenko Ivan Franko Lviv National University,Ukraine, b.pavlyshenko@gmail.com In this paper, we analyze the existence of possible correlation between

More information

1. INTERFACE ENHANCEMENTS 2. REPORTING ENHANCEMENTS

1. INTERFACE ENHANCEMENTS 2. REPORTING ENHANCEMENTS W E L C O M E T O M O N I T O R I N G H E A V E N NEW THINGS ABOUT PANDORA FMS 5.0 A new version of Pandora FMS full of enhancements is about to hit the market. Pandora FMS 5.0 will be released by the

More information

Fogbeam Vision Series - The Modern Intranet

Fogbeam Vision Series - The Modern Intranet Fogbeam Labs Cut Through The Information Fog http://www.fogbeam.com Fogbeam Vision Series - The Modern Intranet Where It All Started Intranets began to appear as a venue for collaboration and knowledge

More information

SAS BI Dashboard 4.4. User's Guide Second Edition. SAS Documentation

SAS BI Dashboard 4.4. User's Guide Second Edition. SAS Documentation SAS BI Dashboard 4.4 User's Guide Second Edition SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2013. SAS BI Dashboard 4.4: User's Guide, Second

More information

Knowledge Discovery from patents using KMX Text Analytics

Knowledge Discovery from patents using KMX Text Analytics Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs anton.heijs@treparel.com Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers

More information

Web Data Visualization

Web Data Visualization Web Data Visualization Department of Communication PhD Student Workshop Web Mining for Communication Research April 22-25, 2014 http://weblab.com.cityu.edu.hk/blog/project/workshops Jie Qin & Hexin Chen

More information

IMPLEMENTING HEALTHCARE DASHBOARDS FOR OPERATIONAL SUCCESS

IMPLEMENTING HEALTHCARE DASHBOARDS FOR OPERATIONAL SUCCESS idashboards for Healthcare IMPLEMENTING HEALTHCARE DASHBOARDS FOR OPERATIONAL SUCCESS idashboards gives me access to real-time actionable data from all areas of the hospital. Internally, the adoption rate

More information

What's new in gvsig Desktop 2.0

What's new in gvsig Desktop 2.0 What's new in gvsig Desktop 2.0 What are the novelties? 2.0 1.12 Migrating and building... Some examples... Please pardon our appearance during construction Pie and bar chart legends Table in layout 1.12

More information

MEng, BSc Computer Science with Artificial Intelligence

MEng, BSc Computer Science with Artificial Intelligence School of Computing FACULTY OF ENGINEERING MEng, BSc Computer Science with Artificial Intelligence Year 1 COMP1212 Computer Processor Effective programming depends on understanding not only how to give

More information

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 Yannis Stavrakas Vassilis Plachouras IMIS / RC ATHENA Athens, Greece {yannis, vplachouras}@imis.athena-innovation.gr Abstract.

More information

Bringing Big Data Modelling into the Hands of Domain Experts

Bringing Big Data Modelling into the Hands of Domain Experts Bringing Big Data Modelling into the Hands of Domain Experts David Willingham Senior Application Engineer MathWorks david.willingham@mathworks.com.au 2015 The MathWorks, Inc. 1 Data is the sword of the

More information

SuperViz: An Interactive Visualization of Super-Peer P2P Network

SuperViz: An Interactive Visualization of Super-Peer P2P Network SuperViz: An Interactive Visualization of Super-Peer P2P Network Anthony (Peiqun) Yu pqyu@cs.ubc.ca Abstract: The Efficient Clustered Super-Peer P2P network is a novel P2P architecture, which overcomes

More information

9. Text & Documents. Visualizing and Searching Documents. Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08

9. Text & Documents. Visualizing and Searching Documents. Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08 9. Text & Documents Visualizing and Searching Documents Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08 Slide 1 / 37 Outline Characteristics of text data Detecting patterns SeeSoft

More information

Using Networks to Visualize and Understand Participation on SourceForge.net

Using Networks to Visualize and Understand Participation on SourceForge.net Nathan Oostendorp; Mailbox #200 SI708 Networks Theory and Application Final Project Report Using Networks to Visualize and Understand Participation on SourceForge.net SourceForge.net is an online repository

More information

<Insert Picture Here> Java, the language for the future

<Insert Picture Here> Java, the language for the future 1 Java, the language for the future Adam Messinger Vice President of Development The following is intended to outline our general product direction. It is intended for information

More information

Community-Aware Prediction of Virality Timing Using Big Data of Social Cascades

Community-Aware Prediction of Virality Timing Using Big Data of Social Cascades 1 Community-Aware Prediction of Virality Timing Using Big Data of Social Cascades Alvin Junus, Ming Cheung, James She and Zhanming Jie HKUST-NIE Social Media Lab, Hong Kong University of Science and Technology

More information

SAS BI Dashboard 4.3. User's Guide. SAS Documentation

SAS BI Dashboard 4.3. User's Guide. SAS Documentation SAS BI Dashboard 4.3 User's Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2010. SAS BI Dashboard 4.3: User s Guide. Cary, NC: SAS Institute

More information

Protein Protein Interaction Networks

Protein Protein Interaction Networks Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics

More information

Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014

Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014 Beyond Watson: Predictive Analytics and Big Data U.S. National Security Agency Research Directorate - R6 Technical Report February 3, 2014 300 years before Watson there was Euler! The first (Jeopardy!)

More information

Distributed Dynamic Load Balancing for Iterative-Stencil Applications

Distributed Dynamic Load Balancing for Iterative-Stencil Applications Distributed Dynamic Load Balancing for Iterative-Stencil Applications G. Dethier 1, P. Marchot 2 and P.A. de Marneffe 1 1 EECS Department, University of Liege, Belgium 2 Chemical Engineering Department,

More information

Advanced Visualizations Tools for CERN Institutional Data

Advanced Visualizations Tools for CERN Institutional Data Advanced Visualizations Tools for CERN Institutional Data September 2013 Author: Alberto Rodríguez Peón Supervisor(s): Jiří Kunčar CERN openlab Summer Student Report 2013 Project Specification The aim

More information

PROVIDING INSIGHT FOR OPERATIONAL SUCCESS

PROVIDING INSIGHT FOR OPERATIONAL SUCCESS idashboards for Financial Services PROVIDING INSIGHT FOR OPERATIONAL SUCCESS We had a huge leap in account openings once staff could see how their sales compared to other staff and branches. They now have

More information

Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access

Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix Jennifer Clegg, SAS Institute Inc., Cary, NC Eric Hill, SAS Institute Inc., Cary, NC ABSTRACT Release 2.1 of SAS

More information

Component visualization methods for large legacy software in C/C++

Component visualization methods for large legacy software in C/C++ Annales Mathematicae et Informaticae 44 (2015) pp. 23 33 http://ami.ektf.hu Component visualization methods for large legacy software in C/C++ Máté Cserép a, Dániel Krupp b a Eötvös Loránd University mcserep@caesar.elte.hu

More information

What s new in TIBCO Spotfire 6.5

What s new in TIBCO Spotfire 6.5 What s new in TIBCO Spotfire 6.5 Contents Introduction... 3 TIBCO Spotfire Analyst... 3 Location Analytics... 3 Support for adding new map layer from WMS Server... 3 Map projections systems support...

More information

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent

More information

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions What is Visualization? Information Visualization An Overview Jonathan I. Maletic, Ph.D. Computer Science Kent State University Visualize/Visualization: To form a mental image or vision of [some

More information

Product Guide. Sawmill Analytics, Swindon SN4 9LZ UK sales@sawmill.co.uk tel: +44 845 250 4470

Product Guide. Sawmill Analytics, Swindon SN4 9LZ UK sales@sawmill.co.uk tel: +44 845 250 4470 Product Guide What is Sawmill Sawmill is a highly sophisticated and flexible analysis and reporting tool. It can read text log files from over 800 different sources and analyse their content. Once analyzed

More information

Welcome to the second half ofour orientation on Spotfire Administration.

Welcome to the second half ofour orientation on Spotfire Administration. Welcome to the second half ofour orientation on Spotfire Administration. In this presentation, I ll give a quick overview of the products that can be used to enhance a Spotfire environment: TIBCO Metrics,

More information

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many

More information

WHITE PAPER. Domo Advanced Architecture

WHITE PAPER. Domo Advanced Architecture WHITE PAPER Domo Advanced Architecture Overview There are several questions that any architect or technology advisor may ask about a new system during the evaluation process: How will it fit into our organization

More information

Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis

Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis 9/3/2013 Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis Seton Hall University, South Orange, New Jersey http://www.shu.edu/go/dava Visualization and

More information

NakeDB: Database Schema Visualization

NakeDB: Database Schema Visualization NAKEDB: DATABASE SCHEMA VISUALIZATION, APRIL 2008 1 NakeDB: Database Schema Visualization Luis Miguel Cortés-Peña, Yi Han, Neil Pradhan, Romain Rigaux Abstract Current database schema visualization tools

More information

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence Introduction to Oracle Business Intelligence Standard Edition One Mike Donohue Senior Manager, Product Management Oracle Business Intelligence The following is intended to outline our general product direction.

More information

2007 to 2010 SharePoint Migration - Take Time to Reorganize

2007 to 2010 SharePoint Migration - Take Time to Reorganize 2007 to 2010 SharePoint Migration - Take Time to Reorganize by Mark Klinchin CTO, MetaVis Technologies May 2010 Phone: (610)-717-0413 Email: info@metavistech.com Website: www.metavistech.com Introduction

More information

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study Oracle Big Data Spatial & Graph Social Network Analysis - Case Study Mark Rittman, CTO, Rittman Mead OTN EMEA Tour, May 2016 info@rittmanmead.com www.rittmanmead.com @rittmanmead About the Speaker Mark

More information

Security visualisation

Security visualisation Security visualisation This thesis provides a guideline of how to generate a visual representation of a given dataset and use visualisation in the evaluation of known security vulnerabilities by Marco

More information

Information Visualization of Attributed Relational Data

Information Visualization of Attributed Relational Data Information Visualization of Attributed Relational Data Mao Lin Huang Department of Computer Systems Faculty of Information Technology University of Technology, Sydney PO Box 123 Broadway, NSW 2007 Australia

More information