Visualizing Structured Data About Music

Size: px
Start display at page:

Download "Visualizing Structured Data About Music"

Transcription

1 Visualizing Structured Data About Music the k-pie network layout algorithm and applications to music data Kurt Jacobson Centre for Digital Music Queen Mary University of London Mile End, London, UK E1 4NS ABSTRACT In an effort to move towards intuitive visual interfaces for faceted browsing of structured data about music, we develop a visualization technique called k-pie. Derived from a network visualization technique know as k-cores decomposition, k-pie layout accounts for the semantic labels or colors associated with each vertex. Vertices of a graph are arranged in a 2 dimensional circle where slices in the circle correspond to a specific vertex label and the most connected vertices are found in the center of the visualization. We describe the k-pie algorithm and demonstrate how it can be useful in the context of Semantic Web technologies. Keywords k-pie layout, k-cores decomposition, complex networks, Semantic Web, music, visualization 1. INTRODUCTION Just as music is an important part of every culture, music is an important part of on-line culture. With the ubiquity of digital music on-line, we see an opportunity to realize the promise of Semantic Web technologies and bring exciting and innovative music experiences to the end user. Significant headway has already been made in developing ontologies for the music domain [1], re-describing on-line music-related data in a structured way [2], and automatically inter-linking music collections to the Semantic Web [3]. These efforts are helping to create a Web of structured data about music - a Global Music Graph - that can be leveraged for music discovery, intelligent music recommendation, metadata augmentation, digital rights management, and many other applications. We are now interested in making this powerful infrastructure palatable and accessible to the expert and non-expert users alike. In this work we develop an algorithm for visualizing large amounts of structured data. We call this algorithm k-pie. The algorithm is essentially an extension of the graph layout algorithm know as k-cores decomposition developed in [4]. However k-pie differs in that it accounts for vertex labels or color. Vertices of a graph are arranged in a 2 dimensional circle where slices in the circle correspond to a specific vertex label and the most connected vertices are found in the center of the visualization. WebSci 2009 Athens, Greece Mark Sandler Centre for Digital Music Queen Mary University of London Mile End, London, UK E1 4NS [email protected] By applying concepts from complex networks research to semantic graphs k-pie creates meaningful visualizations of the relationships between hundreds of thousands of individuals. By assigning an object property or a set of object properties as edges and a set of object classes or individuals as labels, k-pie visualizations can be created by parsing results from the SPARQL query language [5]. The rest of the paper is organized as follows. In Section 2 we briefly review some related work. In Section 3 we describe the k-pie algorithm and show how it can be used in the context of structured data. In Section 4 we show some of the applications of the k-pie method including the visualization of Myspace music artists and visualization of influence between classical composers. Finally in Section 5 we provide a discussion and ideas for future work. 2. BACKGROUND As mentioned above, the k-pie algorithm is a rather intuitive extension of the network visualization algorithm k-core decomposition developed by Alvarez-Hamelin et. al in [4]. The k-core decomposition algorithm begins with a k-core analysis on a given graph structure and places vertices in a 2 dimensional space using a pair of polar coordinates - a radius related to the shellness of a given vertex and an angle related to the cluster of that vertex after k-cores analysis. We will review these concepts again in Section 3.1. We are most interested in visualizing data about music artists. We describe this data using Music Ontology terms and other supporting Web onotolgies following the work of Raimond [1]. We are interested in utilizing the k-pie visualization algorithm as a tool for interacting with structured data on the Semantic Web [6]. There exists a growing number of tools for visualizing and browsing structured data [7]. 3. ALGORITHM To describe the k-pie algorithm first we will provide some definitions and concepts associated with the k-core decomposition algorithm. 3.1 Definitions Let us consider a graph G = (V, E) where V = n vertices and E = e edges. As described in [8], a k-core is defined as follows: Definition 1. A subgraph H = (C, E C) induced by the set C V is a k-core or a core of order k iff v C :

2 shellness and its semantic label. Each vertex i is positioned using a pair of polar coordinates (ρ i, α i). The radius ρ i depends on the shellness of i and that of its neighbors while the angle α i depends on the label associated with i. In the resulting visualization, k-shells are presented as layers of concentric circles with the innermost circle corresponding to the vertices with the highest shellness. Then vertices sharing the same label are positioned within in a certain angular range. This creates pie-like slices of vertices sharing the same label across the k-shell layers. Hence the name of the algorithm k-pie. Figure 1: The k-core decomposition for a small graph. Each closed line contains the set of vertices belonging to a given k-core, and the color of the vertices distinguish different k-shells. degree H(v) k, and H is the maximum subgraph with this property. Definition 2. A vertex i has a shellness c if it belongs to the c-core but not to the (c+1)-core. We denote the shellness of vertex i by c i. Definition 3. A shell C c is composed of all vertices whose shellness is c. The maximum value c such that C c is not empty is denoted c max. The k-core is then the union of all C c with c k. It is important to note the distinction between a k-core and a k-shell - a k-shell implies a certain range of vertex degrees while a k-core only implies a lower limit to vertex degree. We are more interested in k-shells for our visualization algorithm. In the k-core analysis all vertices of a connected graph belong to the 1-core. In Figure 3.1 this is indicated by the largest line encircling the entire graph. Then, all vertices of degree d < 2 are recursively cut out. In Figure 3.1 these are the blue vertices and they constitute the 1-shell. All other vertices maintain a degree of d 2 after pruning the blue vertices, and are not eliminated in this step. The remaining vertices form the 2-core, enclosed by another dotted line. In the next step vertices with degree d < 3 are pruned revealing the 3- core. Note that c max = 3 in the graph in Figure 3.1 as after pruning no vertex has a degree d > 3. Also note that the coloring of the vertices in the Figure indicate their shellness. In addition to the above definitions, let us also consider that each vertex i has a label associated with it l i and L is the set of all labels found in the graph G and s is the number of distinct labels found in L. 3.2 Layout equations The k-pie visualization algorithm positions vertices in 2 dimensional space. The position of each vertex depends on its The calculation of ρ i is as follows: ɛ ρ i = (1 ɛ)(c max c i) + V cj c i(i) X j V cj c i (i) (c max c i) where V cj c i is the set of neighbors of i having shellness c j greater or equal to c i. The parameter ɛ is a tuning parameter to control the possibility of rings overlapping. Then, the angle α i is calculated as follows: α i = 2π X L m n + N 1 m<l i Lli 2n, 2π L «l i n where L is an ordered list of the labels in the graph and L m is the number of vertices with the label m, N is a normal distribution of mean L l i and width 2π L l i. Assuming L 2n n is an ordered list of labels, referring to m < l i allows us to allocate the appropriate portion of the angular space to a given label. 3.3 Color and size of vertices and edges The size of each vertex in the visualization corresponds to the logarithm of its degree. Generally, the larger (higher degree) vertices be nearest the center of the visualization although it is possible for a higher degree vertex to exist in one of the more outer shells. The coloring of the vertices is related to the label associated with each vertex. Each distinct label found in L is given a unique color. The drawing of edges can be considered optional. In fact drawing all the edges in a larger graph will result in an unintelligible tangle. With no edges drawn at all, the resulting visualization is still useful. A homogeneously randomly sampled fraction of edges can be drawn. This approach does not add to computational cost significantly and is used in [4]. A more computationally expensive approach is to only draw edges with a higher betweenness centrality - those edges which are found more often in the shortest paths between a pair of vertices [9]. 3.4 Complexity The k-pie algorithm has a complexity that is nearly identical to that of k-core decomposition. If we assume no re-ordering of L we can index our list of labels for the angular calculation in O(s n) where s is the number of labels. Generally s will be small compared to n the number of vertices. The k-core (1) (2)

3 decomposition takes time O(n + e) - O(n) to build a list of vertex s degeree and O(e) to perform the pruning in the recursive decomposition step where e is the number of edges. So our total time complexity for k-pie is O(s n + n + e) or simply O(n + e) if the number of distinct labels s is small. 4. APPLICATION The k-pie algorithm could be applied to most any graph-like data that includes vertex labels of some kind. Here we will discuss the application that motivated our development of k-pie - visualization of the Myspace artist network. First discussed in [2] the Myspace artist network consists of a subset of the Myspace social network that includes only users who specify that they are artists. A network is constructed from the directed top friend relationship between pairs of artists. This relation is chosen over the general undirected friend relation because it is assumed to be more meaningful and salient in terms of musical similarity. That is to say, a top friend relationship is more likely to imply some musical relationship between two artists - collaboration, co-membership, or stylistic influence. The data set from [2] was republished as structured data in RDF and a live wrapper service 1 was created to dereference Myspace-related URIs on demand. We want to create a k-pie visualization where the vertices are Myspace music artists, the vertices labels are the musical genre labels specified by the artists on Myspace, and the edges are top friend relationships to other artists. We can fetch this data using the SPARQL query language and the endpoint containing out data set 2. The query is as follows PREFIX myspo:< SELECT?from?to?fromLabel?toLabel FROM < WHERE{ {?from myspo:topfriend?to. } OPTIONAL{?from myspo:genrelabel?fromlabel.?to myspo:genrelabel?tolabel} } After execution the from and to fields will specify our directed edge list for constructing our graph. The fromlabel and tolabel will contain the labels l i for each vertex i. We specify these triple patterns as OPTIONAL to include vertices that have no genre label specified. We can see the results of this query and visualization in Figure 2. Each vertex represents a music artist and the color of each vertex indicates the primary genre label associated with that artist. This data set contains n = 15, 019 vertices (music artists) and e = 114, 606 edges (top friend relations) and s = 106 labels (genre). Notice that a few highly 1 available at 2 available at connected music artists gravitate towards the center belonging to the highest k-shells around c max = 29. However the shells immediately lower than c max are mostly empty. This behavior is indicative of scale-free networks where the cumulative degree distribution follows a power law decay - P c(d) d (α 1) [10]. Also note that the highly connected vertices in the center k-shells constitute a rich club where the music artists with the highest degree values are also connected to each other [11]. We can also get a sense for what musical genres dominate this sample of the Myspace artist network. We can see that Hip Hop (yellow) and Rap (bright green) account for nearly half of all the genre labels in the data set. We also see that the rich club in the center of the visualization includes vertices with genre labels Hip Hop, Rap, Soul, Reggae, and Hardcore - a somewhat surprising addition to the list. Note that the genre label appears as text only for those genre labels that are associated with more than 1% of the vertices in the network. The data set actually includes 106 unique genre labels and therefore the visualization contains 106 distinct colors. However, it can be exceedingly difficult for the viewer to accurately distinguish between so many colors therefore text labels are used in favor of a color legend. The vertices are drawn to be translucent so the viewer can get a better sense vertex concentration where vertices fall on top of one another. In this visualization only 0.2% of the network edges are chosen uniformly at random and included in the visualization. The remainder of the edges are ignored. The k-pie algorithm has also been applied to a data set of classical composers and the network of influence between composers. 3 This data set is essentially a re-publication of the data about classical composer influence compiled by Charles H. Smith in the 1990 s for the Classical Music Navigator (CMN) website [12]. Following the Linked Data guidelines for publishing structured data on the Web [13], this data set contains links to the DBpedia data set. 4 We can follow these links to obtain additional data about the composers. Of the 444 composers in the CMN data set, we have obtained links to DBpedia for 245. The k-pie algorithm can be used to visualize the data using, for example, rdf:type or dbpedia:birthplace, as the vertex label parameter. A small proof-of-concept software package has been developed to visualize this small data set DISCUSSION AND FUTURE WORK The k-pie layout algorithm allows for the creation of useful and intuitive visualizations of large collections of labeled network data. This algorithm can be useful in the context of the Semantic Web and combined with technologies like SPARQL to create visualizations of moderately large sets of structured data. A quick inspection of the visualization gives the viewer a sense of the make up of the network in terms of labels and a sense of what labels are associated with the most connected vertices. However there are some difficulties with the k-pie algorithm in its current state. Perhaps most notably, there is no clear method for ordering the list of vertices labels L found in the 3 available at:

4 Figure 2: k-pie layout visualization of a sample of the Myspace artist network where the slices and vertex colors correspond to genre labels. Note that genre label text is only printed for genre labels that constitute > 1% of vertex labels. target graph. In the current implementation, L is naively ordered as the labels appear when constructing the graph - the label that is found first appears as the first label in L. The ordering of L dictates the ordering of the slices in the visualization. It would be preferable if this ordering had some meaning or justification. One option would be to build an adjacency matrix for the labels in L and approach this as a circular layout problem and use a method like spectral re-ordering [14] to organize the labels. This would have the effect of placing the most connected labels in the original graph closer together in the re-ordered list. Another short coming of the k-pie algorithm is that it allows for each vertex to be associated with only one label. This is often at odds with the complexity of real world entities and their descriptions on the Semantic Web, which by design, allows for an entity to be associated with many labels. Even in our Myspace music artist genre application, we naively select only the first genre label associated with a given artist assuming it is the most important. According to the interface design of Myspace, an artist may have between 0 and 3 genre labels. We simply ignore the additional labels in the visualization. To the best knowledge of the authors, there is no clear way to address multiple labels in the context of the k-pie layout algorithm. As can be seen in 2 nodes tend to bunch together in large graphs. Although the viewer can clearly see what portion of the vertices are associated with different labels and which vertices are the most connected in the graph, much of the detail is lost. For this reason we purpose k-pie is most appropriate for generating global over views of medium to large data sets and would be most effective as one aspect of a multi-view interactive data exploration experience. Because the complexity of k-pie is relatively low, it could be incorporated into a faceted browsing interface where the k- pie layout is re-caclulated as the browsing context changes. In future development we hope to use k-pie visualizations in a read-write tool for RDF data following the principles outlined in [15]. Currently k-pie requires the construction of ad hoc SPARQL queries to visualize data sets published on the Semantic Web. The original aim was to develop a more general framework for visualizing structured data and Linked Data [13]. Unfortunately the current work falls short of this aim and focuses instead on a basic network visualization algorithm and its application to a specific Semantic Web data set. Creating a more general framework for visualizing and interacting with Linked Data and structured data on the Semantic Web is a primary goal of future work. 6. WEB RESOURCES The following Web resources supplement the material presented in this work: An open-source Java implementation of k-pie available by svn checkout at KPieLayout

5 Fields-Jacobson Myspace data available via SPARQL at Myspace RDF wrapper service available at Classical Music Navigator data set available at Classical Music Universe data browser available at 7. ACKNOWLEDGEMENTS The authors would like to acknowledge Ben Fields at Goldsmiths University of London for his assistance in collecting the original Myspace data set. This work is supported as a part of the OMRAS2 project, EPSRC grants EP/E02274X/1 and EP/E017614/1. A survey of measurements, Advances In Physics, vol. 56, p. 167, [Online]. Available: doi: / [12] The classical music navigator. [Online]. Available: smithch/music/ [13] C. Bizer, R. Cyganiak, and T. Heath. How to publish linked data on the web. [Online]. Available: [14] D. J. Higham, G. Kalna, and M. Kibble, Spectral clustering and its use in bioinformatics, J. Comput. Appl. Math., vol. 204, no. 1, pp , [Online]. Available: [15] T. Berners-Lee, J. Hollenbach, K. Lu, J. Presbrey, E. P. d ommeaux, and m.c. schraefel, Tabulator redux: Writing into the semantic web, [Online]. Available: 8. REFERENCES [1] Y. Raimond, A distributed music information system, Ph.D. dissertation, Queen Mary University of London, [2] K. Jacobson and M. Sandler, Musically meaningful or just noise, an analysis of on-line artist networks, in Proc. of CMMR, 2008, pp [3] Y. Raimond, C. Sutton, and M. Sandler, Automatic interlinking of music datasets on the semantic web, [4] J. I. Alvarez-Hamelin, L. Dall Asta, A. Barrat, and A. Vespignani, k-core decomposition: a tool for the visualization of large scale networks, CANADA, p. 41, [Online]. Available: cs/ [5] SPARQL query language for RDF, W3C recommendation, [Online]. Available: [6] T. Burners-Lee, J. Hendler, and O. Lassila, The semantic web, Scientific American, vol. 284, no. 5, May [Online]. Available: Scientific%20American %20Feature%20Article %20The%20Semantic%20Web %20May% pdf [7] S. Kushro and A. Tjoa, Fulfilling the needs of a metadata creator and analyst- an investigation of rdf browsing and visualization tools, Canadian Semantic Web, pp , [Online]. Available: 6 [8] V. Batagelj and M. Zaversnik, Generalized cores, [Online]. Available: cs/ [9] L. C. Freeman, A set of measures of centrality based on betweenness, Sociometry, vol. 40, no. 1, pp , [10] M. E. J. Newman, The structure and function of complex networks, SIAM Review, vol. 45, p. 167, [Online]. Available: cond-mat/ [11] L. F. Costa, F. A. Rodrigues, G. Travieso, and P. R. V. Boas, Characterization of complex networks:

Handling the Complexity of RDF Data: Combining List and Graph Visualization

Handling the Complexity of RDF Data: Combining List and Graph Visualization Handling the Complexity of RDF Data: Combining List and Graph Visualization Philipp Heim and Jürgen Ziegler (University of Duisburg-Essen, Germany philipp.heim, [email protected]) Abstract: An

More information

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS Natarajan Meghanathan Jackson State University, 1400 Lynch St, Jackson, MS, USA [email protected]

More information

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.0, [email protected] Chapter 06: Network analysis Version: April 8, 04 / 3 Contents Chapter

More information

HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy

HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy SOCIAL NETWORK ANALYSIS University of Amsterdam, Netherlands Keywords: Social networks, structuralism, cohesion, brokerage, stratification, network analysis, methods, graph theory, statistical models Contents

More information

Part 2: Community Detection

Part 2: Community Detection Chapter 8: Graph Data Part 2: Community Detection Based on Leskovec, Rajaraman, Ullman 2014: Mining of Massive Datasets Big Data Management and Analytics Outline Community Detection - Social networks -

More information

DISCOVERING RESUME INFORMATION USING LINKED DATA

DISCOVERING RESUME INFORMATION USING LINKED DATA DISCOVERING RESUME INFORMATION USING LINKED DATA Ujjal Marjit 1, Kumar Sharma 2 and Utpal Biswas 3 1 C.I.R.M, University Kalyani, Kalyani (West Bengal) India [email protected] 2 Department of Computer

More information

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network , pp.273-284 http://dx.doi.org/10.14257/ijdta.2015.8.5.24 Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network Gengxin Sun 1, Sheng Bin 2 and

More information

HadoopSPARQL : A Hadoop-based Engine for Multiple SPARQL Query Answering

HadoopSPARQL : A Hadoop-based Engine for Multiple SPARQL Query Answering HadoopSPARQL : A Hadoop-based Engine for Multiple SPARQL Query Answering Chang Liu 1 Jun Qu 1 Guilin Qi 2 Haofen Wang 1 Yong Yu 1 1 Shanghai Jiaotong University, China {liuchang,qujun51319, whfcarter,yyu}@apex.sjtu.edu.cn

More information

9. Text & Documents. Visualizing and Searching Documents. Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08

9. Text & Documents. Visualizing and Searching Documents. Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08 9. Text & Documents Visualizing and Searching Documents Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08 Slide 1 / 37 Outline Characteristics of text data Detecting patterns SeeSoft

More information

Link Prediction in Social Networks

Link Prediction in Social Networks CS378 Data Mining Final Project Report Dustin Ho : dsh544 Eric Shrewsberry : eas2389 Link Prediction in Social Networks 1. Introduction Social networks are becoming increasingly more prevalent in the daily

More information

Graph Security Testing

Graph Security Testing JOURNAL OF APPLIED COMPUTER SCIENCE Vol. 23 No. 1 (2015), pp. 29-45 Graph Security Testing Tomasz Gieniusz 1, Robert Lewoń 1, Michał Małafiejski 1 1 Gdańsk University of Technology, Poland Department of

More information

Protein Protein Interaction Networks

Protein Protein Interaction Networks Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics

More information

Class One: Degree Sequences

Class One: Degree Sequences Class One: Degree Sequences For our purposes a graph is a just a bunch of points, called vertices, together with lines or curves, called edges, joining certain pairs of vertices. Three small examples of

More information

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis. Contents. Introduction. Maarten van Steen. Version: April 28, 2014

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis. Contents. Introduction. Maarten van Steen. Version: April 28, 2014 Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R.0, [email protected] Chapter 0: Version: April 8, 0 / Contents Chapter Description 0: Introduction

More information

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02) Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we

More information

Component visualization methods for large legacy software in C/C++

Component visualization methods for large legacy software in C/C++ Annales Mathematicae et Informaticae 44 (2015) pp. 23 33 http://ami.ektf.hu Component visualization methods for large legacy software in C/C++ Máté Cserép a, Dániel Krupp b a Eötvös Loránd University [email protected]

More information

Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014

Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014 Beyond Watson: Predictive Analytics and Big Data U.S. National Security Agency Research Directorate - R6 Technical Report February 3, 2014 300 years before Watson there was Euler! The first (Jeopardy!)

More information

Subgraph Patterns: Network Motifs and Graphlets. Pedro Ribeiro

Subgraph Patterns: Network Motifs and Graphlets. Pedro Ribeiro Subgraph Patterns: Network Motifs and Graphlets Pedro Ribeiro Analyzing Complex Networks We have been talking about extracting information from networks Some possible tasks: General Patterns Ex: scale-free,

More information

Statistical and computational challenges in networks and cybersecurity

Statistical and computational challenges in networks and cybersecurity Statistical and computational challenges in networks and cybersecurity Hugh Chipman Acadia University June 12, 2015 Statistical and computational challenges in networks and cybersecurity May 4-8, 2015,

More information

SPARQL: Un Lenguaje de Consulta para la Web

SPARQL: Un Lenguaje de Consulta para la Web SPARQL: Un Lenguaje de Consulta para la Web Semántica Marcelo Arenas Pontificia Universidad Católica de Chile y Centro de Investigación de la Web M. Arenas SPARQL: Un Lenguaje de Consulta para la Web Semántica

More information

A Semantic web approach for e-learning platforms

A Semantic web approach for e-learning platforms A Semantic web approach for e-learning platforms Miguel B. Alves 1 1 Laboratório de Sistemas de Informação, ESTG-IPVC 4900-348 Viana do Castelo. [email protected] Abstract. When lecturers publish contents

More information

Distributed Computing over Communication Networks: Maximal Independent Set

Distributed Computing over Communication Networks: Maximal Independent Set Distributed Computing over Communication Networks: Maximal Independent Set What is a MIS? MIS An independent set (IS) of an undirected graph is a subset U of nodes such that no two nodes in U are adjacent.

More information

City Data Pipeline. A System for Making Open Data Useful for Cities. [email protected]

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at City Data Pipeline A System for Making Open Data Useful for Cities Stefan Bischof 1,2, Axel Polleres 1, and Simon Sperl 1 1 Siemens AG Österreich, Siemensstraße 90, 1211 Vienna, Austria {bischof.stefan,axel.polleres,simon.sperl}@siemens.com

More information

Social Media Mining. Graph Essentials

Social Media Mining. Graph Essentials Graph Essentials Graph Basics Measures Graph and Essentials Metrics 2 2 Nodes and Edges A network is a graph nodes, actors, or vertices (plural of vertex) Connections, edges or ties Edge Node Measures

More information

Practical Graph Mining with R. 5. Link Analysis

Practical Graph Mining with R. 5. Link Analysis Practical Graph Mining with R 5. Link Analysis Outline Link Analysis Concepts Metrics for Analyzing Networks PageRank HITS Link Prediction 2 Link Analysis Concepts Link A relationship between two entities

More information

On the k-path cover problem for cacti

On the k-path cover problem for cacti On the k-path cover problem for cacti Zemin Jin and Xueliang Li Center for Combinatorics and LPMC Nankai University Tianjin 300071, P.R. China [email protected], [email protected] Abstract In this paper we

More information

SCAN: A Structural Clustering Algorithm for Networks

SCAN: A Structural Clustering Algorithm for Networks SCAN: A Structural Clustering Algorithm for Networks Xiaowei Xu, Nurcan Yuruk, Zhidan Feng (University of Arkansas at Little Rock) Thomas A. J. Schweiger (Acxiom Corporation) Networks scaling: #edges connected

More information

Course on Social Network Analysis Graphs and Networks

Course on Social Network Analysis Graphs and Networks Course on Social Network Analysis Graphs and Networks Vladimir Batagelj University of Ljubljana Slovenia V. Batagelj: Social Network Analysis / Graphs and Networks 1 Outline 1 Graph...............................

More information

COUNTING INDEPENDENT SETS IN SOME CLASSES OF (ALMOST) REGULAR GRAPHS

COUNTING INDEPENDENT SETS IN SOME CLASSES OF (ALMOST) REGULAR GRAPHS COUNTING INDEPENDENT SETS IN SOME CLASSES OF (ALMOST) REGULAR GRAPHS Alexander Burstein Department of Mathematics Howard University Washington, DC 259, USA [email protected] Sergey Kitaev Mathematics

More information

Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing. October 29th, 2015

Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing. October 29th, 2015 E6893 Big Data Analytics Lecture 8: Spark Streams and Graph Computing (I) Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing

More information

Hierarchical Data Visualization

Hierarchical Data Visualization Hierarchical Data Visualization 1 Hierarchical Data Hierarchical data emphasize the subordinate or membership relations between data items. Organizational Chart Classifications / Taxonomies (Species and

More information

Introduction to Graph Mining

Introduction to Graph Mining Introduction to Graph Mining What is a graph? A graph G = (V,E) is a set of vertices V and a set (possibly empty) E of pairs of vertices e 1 = (v 1, v 2 ), where e 1 E and v 1, v 2 V. Edges may contain

More information

Self-adaptive e-learning Website for Mathematics

Self-adaptive e-learning Website for Mathematics Self-adaptive e-learning Website for Mathematics Akira Nakamura Abstract Keyword searching and browsing on learning website is ultimate self-adaptive learning. Our e-learning website KIT Mathematics Navigation

More information

Graph Mining and Social Network Analysis

Graph Mining and Social Network Analysis Graph Mining and Social Network Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann

More information

High-dimensional labeled data analysis with Gabriel graphs

High-dimensional labeled data analysis with Gabriel graphs High-dimensional labeled data analysis with Gabriel graphs Michaël Aupetit CEA - DAM Département Analyse Surveillance Environnement BP 12-91680 - Bruyères-Le-Châtel, France Abstract. We propose the use

More information

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

LinkZoo: A linked data platform for collaborative management of heterogeneous resources LinkZoo: A linked data platform for collaborative management of heterogeneous resources Marios Meimaris, George Alexiou, George Papastefanatos Institute for the Management of Information Systems, Research

More information

Cluster Analysis for Evaluating Trading Strategies 1

Cluster Analysis for Evaluating Trading Strategies 1 CONTRIBUTORS Jeff Bacidore Managing Director, Head of Algorithmic Trading, ITG, Inc. [email protected] +1.212.588.4327 Kathryn Berkow Quantitative Analyst, Algorithmic Trading, ITG, Inc. [email protected]

More information

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining Data Mining Cluster Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 8 by Tan, Steinbach, Kumar 1 What is Cluster Analysis? Finding groups of objects such that the objects in a group will

More information

! E6893 Big Data Analytics Lecture 9:! Linked Big Data Graph Computing (I)

! E6893 Big Data Analytics Lecture 9:! Linked Big Data Graph Computing (I) ! E6893 Big Data Analytics Lecture 9:! Linked Big Data Graph Computing (I) Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science Mgr., Dept. of Network Science and

More information

KEYWORD SEARCH OVER PROBABILISTIC RDF GRAPHS

KEYWORD SEARCH OVER PROBABILISTIC RDF GRAPHS ABSTRACT KEYWORD SEARCH OVER PROBABILISTIC RDF GRAPHS In many real applications, RDF (Resource Description Framework) has been widely used as a W3C standard to describe data in the Semantic Web. In practice,

More information

Socio-semantic network data visualization

Socio-semantic network data visualization Socio-semantic network data visualization Alexey Drutsa 1,2, Konstantin Yavorskiy 1 1 Witology [email protected], [email protected] http://www.witology.com 2 Moscow State University,

More information

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003 Graph models for the Web and the Internet Elias Koutsoupias University of Athens and UCLA Crete, July 2003 Outline of the lecture Small world phenomenon The shape of the Web graph Searching and navigation

More information

How To Find Local Affinity Patterns In Big Data

How To Find Local Affinity Patterns In Big Data Detection of local affinity patterns in big data Andrea Marinoni, Paolo Gamba Department of Electronics, University of Pavia, Italy Abstract Mining information in Big Data requires to design a new class

More information

Graph theoretic approach to analyze amino acid network

Graph theoretic approach to analyze amino acid network Int. J. Adv. Appl. Math. and Mech. 2(3) (2015) 31-37 (ISSN: 2347-2529) Journal homepage: www.ijaamm.com International Journal of Advances in Applied Mathematics and Mechanics Graph theoretic approach to

More information

Using RDF Metadata To Enable Access Control on the Social Semantic Web

Using RDF Metadata To Enable Access Control on the Social Semantic Web Using RDF Metadata To Enable Access Control on the Social Semantic Web James Hollenbach, Joe Presbrey, and Tim Berners-Lee Decentralized Information Group, MIT CSAIL, 32 Vassar Street, Cambridge, MA, USA,

More information

Mining the Web of Linked Data with RapidMiner

Mining the Web of Linked Data with RapidMiner Mining the Web of Linked Data with RapidMiner Petar Ristoski, Christian Bizer, and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {petar.ristoski,heiko,chris}@informatik.uni-mannheim.de

More information

Technical paper review. Program visualization and explanation for novice C programmers by Matthew Heinsen Egan and Chris McDonald.

Technical paper review. Program visualization and explanation for novice C programmers by Matthew Heinsen Egan and Chris McDonald. Technical paper review Program visualization and explanation for novice C programmers by Matthew Heinsen Egan and Chris McDonald Garvit Pahal Indian Institute of Technology, Kanpur October 28, 2014 Garvit

More information

Types of Degrees in Bipolar Fuzzy Graphs

Types of Degrees in Bipolar Fuzzy Graphs pplied Mathematical Sciences, Vol. 7, 2013, no. 98, 4857-4866 HIKRI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2013.37389 Types of Degrees in Bipolar Fuzzy Graphs Basheer hamed Mohideen Department

More information

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks Melike Şah, Wendy Hall and David C De Roure Intelligence, Agents and Multimedia Group,

More information

A Survey on: Efficient and Customizable Data Partitioning for Distributed Big RDF Data Processing using hadoop in Cloud.

A Survey on: Efficient and Customizable Data Partitioning for Distributed Big RDF Data Processing using hadoop in Cloud. A Survey on: Efficient and Customizable Data Partitioning for Distributed Big RDF Data Processing using hadoop in Cloud. Tejas Bharat Thorat Prof.RanjanaR.Badre Computer Engineering Department Computer

More information

Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks

Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks Imre Varga Abstract In this paper I propose a novel method to model real online social networks where the growing

More information

NETZCOPE - a tool to analyze and display complex R&D collaboration networks

NETZCOPE - a tool to analyze and display complex R&D collaboration networks The Task Concepts from Spectral Graph Theory EU R&D Network Analysis Netzcope Screenshots NETZCOPE - a tool to analyze and display complex R&D collaboration networks L. Streit & O. Strogan BiBoS, Univ.

More information

THE SEMANTIC WEB AND IT`S APPLICATIONS

THE SEMANTIC WEB AND IT`S APPLICATIONS 15-16 September 2011, BULGARIA 1 Proceedings of the International Conference on Information Technologies (InfoTech-2011) 15-16 September 2011, Bulgaria THE SEMANTIC WEB AND IT`S APPLICATIONS Dimitar Vuldzhev

More information

Mining Social Network Graphs

Mining Social Network Graphs Mining Social Network Graphs Debapriyo Majumdar Data Mining Fall 2014 Indian Statistical Institute Kolkata November 13, 17, 2014 Social Network No introduc+on required Really? We s7ll need to understand

More information

A Brief Introduction to Property Testing

A Brief Introduction to Property Testing A Brief Introduction to Property Testing Oded Goldreich Abstract. This short article provides a brief description of the main issues that underly the study of property testing. It is meant to serve as

More information

Character Image Patterns as Big Data

Character Image Patterns as Big Data 22 International Conference on Frontiers in Handwriting Recognition Character Image Patterns as Big Data Seiichi Uchida, Ryosuke Ishida, Akira Yoshida, Wenjie Cai, Yaokai Feng Kyushu University, Fukuoka,

More information

Predict the Popularity of YouTube Videos Using Early View Data

Predict the Popularity of YouTube Videos Using Early View Data 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Triangulation by Ear Clipping

Triangulation by Ear Clipping Triangulation by Ear Clipping David Eberly Geometric Tools, LLC http://www.geometrictools.com/ Copyright c 1998-2016. All Rights Reserved. Created: November 18, 2002 Last Modified: August 16, 2015 Contents

More information

Developing Web and Mobile Dashboards with Oracle ADF

Developing Web and Mobile Dashboards with Oracle ADF Developing Web and Mobile Dashboards with Oracle ADF In this lab you ll build a web dashboard that displays data from the database in meaningful ways. You are going to leverage Oracle ADF the Oracle Application

More information

The use of Semantic Web Technologies in Spatial Decision Support Systems

The use of Semantic Web Technologies in Spatial Decision Support Systems The use of Semantic Web Technologies in Spatial Decision Support Systems Adam Iwaniak Jaromar Łukowicz Iwona Kaczmarek Marek Strzelecki The INSPIRE Conference 2013, 23-27 June Wroclaw University of Environmental

More information

Temporal Visualization and Analysis of Social Networks

Temporal Visualization and Analysis of Social Networks Temporal Visualization and Analysis of Social Networks Peter A. Gloor*, Rob Laubacher MIT {pgloor,rjl}@mit.edu Yan Zhao, Scott B.C. Dynes *Dartmouth {yan.zhao,sdynes}@dartmouth.edu Abstract This paper

More information

Secure Semantic Web Service Using SAML

Secure Semantic Web Service Using SAML Secure Semantic Web Service Using SAML JOO-YOUNG LEE and KI-YOUNG MOON Information Security Department Electronics and Telecommunications Research Institute 161 Gajeong-dong, Yuseong-gu, Daejeon KOREA

More information

TIBCO Spotfire Network Analytics 1.1. User s Manual

TIBCO Spotfire Network Analytics 1.1. User s Manual TIBCO Spotfire Network Analytics 1.1 User s Manual Revision date: 26 January 2009 Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED TIBCO

More information

Network Metrics, Planar Graphs, and Software Tools. Based on materials by Lala Adamic, UMichigan

Network Metrics, Planar Graphs, and Software Tools. Based on materials by Lala Adamic, UMichigan Network Metrics, Planar Graphs, and Software Tools Based on materials by Lala Adamic, UMichigan Network Metrics: Bowtie Model of the Web n The Web is a directed graph: n webpages link to other webpages

More information

Social Media Mining. Network Measures

Social Media Mining. Network Measures Klout Measures and Metrics 22 Why Do We Need Measures? Who are the central figures (influential individuals) in the network? What interaction patterns are common in friends? Who are the like-minded users

More information

Graph/Network Visualization

Graph/Network Visualization Graph/Network Visualization Data model: graph structures (relations, knowledge) and networks. Applications: Telecommunication systems, Internet and WWW, Retailers distribution networks knowledge representation

More information

12 The Semantic Web and RDF

12 The Semantic Web and RDF MSc in Communication Sciences 2011-12 Program in Technologies for Human Communication Davide Eynard nternet Technology 12 The Semantic Web and RDF 2 n the previous episodes... A (video) summary: Michael

More information

technische universiteit eindhoven WIS & Engineering Geert-Jan Houben

technische universiteit eindhoven WIS & Engineering Geert-Jan Houben WIS & Engineering Geert-Jan Houben Contents Web Information System (WIS) Evolution in Web data WIS Engineering Languages for Web data XML (context only!) RDF XML Querying: XQuery (context only!) RDFS SPARQL

More information

SGL: Stata graph library for network analysis

SGL: Stata graph library for network analysis SGL: Stata graph library for network analysis Hirotaka Miura Federal Reserve Bank of San Francisco Stata Conference Chicago 2011 The views presented here are my own and do not necessarily represent the

More information

excellent graph matching capabilities with global graph analytic operations, via an interface that researchers can use to plug in their own

excellent graph matching capabilities with global graph analytic operations, via an interface that researchers can use to plug in their own Steve Reinhardt 2 The urika developers are extending SPARQL s excellent graph matching capabilities with global graph analytic operations, via an interface that researchers can use to plug in their own

More information

Network-Based Tools for the Visualization and Analysis of Domain Models

Network-Based Tools for the Visualization and Analysis of Domain Models Network-Based Tools for the Visualization and Analysis of Domain Models Paper presented as the annual meeting of the American Educational Research Association, Philadelphia, PA Hua Wei April 2014 Visualizing

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany [email protected],

More information

A MEDIATION LAYER FOR HETEROGENEOUS XML SCHEMAS

A MEDIATION LAYER FOR HETEROGENEOUS XML SCHEMAS A MEDIATION LAYER FOR HETEROGENEOUS XML SCHEMAS Abdelsalam Almarimi 1, Jaroslav Pokorny 2 Abstract This paper describes an approach for mediation of heterogeneous XML schemas. Such an approach is proposed

More information

TEXT-FILLED STACKED AREA GRAPHS Martin Kraus

TEXT-FILLED STACKED AREA GRAPHS Martin Kraus Martin Kraus Text can add a significant amount of detail and value to an information visualization. In particular, it can integrate more of the data that a visualization is based on, and it can also integrate

More information

Approximated Distributed Minimum Vertex Cover Algorithms for Bounded Degree Graphs

Approximated Distributed Minimum Vertex Cover Algorithms for Bounded Degree Graphs Approximated Distributed Minimum Vertex Cover Algorithms for Bounded Degree Graphs Yong Zhang 1.2, Francis Y.L. Chin 2, and Hing-Fung Ting 2 1 College of Mathematics and Computer Science, Hebei University,

More information

A scalable multilevel algorithm for graph clustering and community structure detection

A scalable multilevel algorithm for graph clustering and community structure detection A scalable multilevel algorithm for graph clustering and community structure detection Hristo N. Djidjev 1 Los Alamos National Laboratory, Los Alamos, NM 87545 Abstract. One of the most useful measures

More information

Towards a reference architecture for Semantic Web applications

Towards a reference architecture for Semantic Web applications Towards a reference architecture for Semantic Web applications Benjamin Heitmann 1, Conor Hayes 1, and Eyal Oren 2 1 [email protected] Digital Enterprise Research Institute National University

More information

Cycle transversals in bounded degree graphs

Cycle transversals in bounded degree graphs Electronic Notes in Discrete Mathematics 35 (2009) 189 195 www.elsevier.com/locate/endm Cycle transversals in bounded degree graphs M. Groshaus a,2,3 P. Hell b,3 S. Klein c,1,3 L. T. Nogueira d,1,3 F.

More information

Generating Visualizations From RDF Graphs

Generating Visualizations From RDF Graphs Generating Visualizations From RDF Graphs Zhuo Ma U4618536 Supervisor: Tom Gedeon, Armin Haller COMP8715 Computing Project Australian National University Semester 1, 2015 29 May, 2015 1 ACKNOWLEDGEMENT

More information

Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Abstract Keywords Introduction

Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Abstract Keywords Introduction Serendipity a platform to discover and visualize Open OER Data from OpenCourseWare repositories Nelson Piedra, Jorge López, Janneth Chicaiza, Universidad Técnica Particular de Loja, Ecuador [email protected],

More information

USE OF EIGENVALUES AND EIGENVECTORS TO ANALYZE BIPARTIVITY OF NETWORK GRAPHS

USE OF EIGENVALUES AND EIGENVECTORS TO ANALYZE BIPARTIVITY OF NETWORK GRAPHS USE OF EIGENVALUES AND EIGENVECTORS TO ANALYZE BIPARTIVITY OF NETWORK GRAPHS Natarajan Meghanathan Jackson State University, 1400 Lynch St, Jackson, MS, USA [email protected] ABSTRACT This

More information

Exponential time algorithms for graph coloring

Exponential time algorithms for graph coloring Exponential time algorithms for graph coloring Uriel Feige Lecture notes, March 14, 2011 1 Introduction Let [n] denote the set {1,..., k}. A k-labeling of vertices of a graph G(V, E) is a function V [k].

More information

Network Algorithms for Homeland Security

Network Algorithms for Homeland Security Network Algorithms for Homeland Security Mark Goldberg and Malik Magdon-Ismail Rensselaer Polytechnic Institute September 27, 2004. Collaborators J. Baumes, M. Krishmamoorthy, N. Preston, W. Wallace. Partially

More information

RDF y SPARQL: Dos componentes básicos para la Web de datos

RDF y SPARQL: Dos componentes básicos para la Web de datos RDF y SPARQL: Dos componentes básicos para la Web de datos Marcelo Arenas PUC Chile & University of Oxford M. Arenas RDF y SPARQL: Dos componentes básicos para la Web de datos Valladolid 2013 1 / 61 Semantic

More information

Cray: Enabling Real-Time Discovery in Big Data

Cray: Enabling Real-Time Discovery in Big Data Cray: Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects

More information

Data Mining and Database Systems: Where is the Intersection?

Data Mining and Database Systems: Where is the Intersection? Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: [email protected] 1 Introduction The promise of decision support systems is to exploit enterprise

More information

Greedy Routing on Hidden Metric Spaces as a Foundation of Scalable Routing Architectures

Greedy Routing on Hidden Metric Spaces as a Foundation of Scalable Routing Architectures Greedy Routing on Hidden Metric Spaces as a Foundation of Scalable Routing Architectures Dmitri Krioukov, kc claffy, and Kevin Fall CAIDA/UCSD, and Intel Research, Berkeley Problem High-level Routing is

More information

KNOWLEDGE-BASED VISUALIZATION

KNOWLEDGE-BASED VISUALIZATION UNIVERSITÀ DEGLI STUDI DI ROMA TOR VERGATA DIPARTIMENTO DI INFORMATICA SISTEMI E PRODUZIONE Dottorato di Ricerca Informatica e Ingegneria dell Automazione Ciclo XXIV KNOWLEDGE-BASED VISUALIZATION SYSTEMS

More information

Visual Data Mining with Pixel-oriented Visualization Techniques

Visual Data Mining with Pixel-oriented Visualization Techniques Visual Data Mining with Pixel-oriented Visualization Techniques Mihael Ankerst The Boeing Company P.O. Box 3707 MC 7L-70, Seattle, WA 98124 [email protected] Abstract Pixel-oriented visualization

More information