Visualization of Social Networks in Stata by Multi-dimensional Scaling
|
|
|
- Nancy Hudson
- 9 years ago
- Views:
Transcription
1 Visualization of Social Networks in Stata by Multi-dimensional Scaling Rense Corten Department of Sociology/ICS Utrecht University The Netherlands April 12, Introduction Social network analysis (SNA) studies patterns of interaction between social entities (Wasserman and Faust, 1994; Scott, 2000). In the past few decades, SNA has emerged as a major research paradigm in the social sciences (including economics), and has also attracted attention in other fields (Newman et al., 2006). While dedicated software for SNA exists (e.g., UCInet (Borgatti et al., 1999) or Pajek (Bagatelj and Mrvar, 2009)), Stata currently lacks readily available facilities for SNA. In this paper, we illustrate how methods for SNA can be developed in Stata, using network visualization as an example. Visualization is one of the oldest methods in SNA and is still one of its most important and widely applied tools to uncover patterns of relations (Freeman, 2000). We describe a procedure for network visualization using Stata s built-in procedures for multidimensional scaling and describe an implementation as a Stata command. While we believe that network visualization in itself can be highly useful, the example also illustrates how SNA problems can be handled in Stata more generally. 2 Methods 2.1 Some terminology Network visualization is concerned with showing binary relations between entities. Adopting the terminology of graph theory, we refer to these entities as vertices. Relations between vertices may be directed, if they can be understood as flowing from one vertex to another, or undirected if no such direction can be identified. We refer to directed relations as arcs and to undirected relations as edges. A typical representation of a network of relations is an adjacency matrix, as shown in Figure 1 for a network of 10 vertices. In this matrix, every cell represents a relation from a vertex (row) to another vertex (column); for undirected networks, this matrix is symmetric. Vertices that have no edges or arcs are called isolates. The number of edges connected to a vertex is called the degree of the vertex. Lastly, the distance between two vertices is defined as the shortest path between them. If there is no path between two isolates, we define the distance between them as infinite. I thank Jeroen Weesie for highly useful comments and suggestions. This paper is currently under review at the Stata Journal. 1
2 Figure 1: An adjacency matrix, N = 10 col 1 col Figure 2: Edgelist based on the adjacency matrix in Figure Data structure One particular obstacle in analyzing network data in conventional statistics packages such as Stata is the specific structure of relational data. Whereas in conventional datasets, one line in the data typically represents an individual entity, observations in relational datasets represent relations between entities. Weassumethat dataareavailableasalist ofedgesorarcs. That is, foranetworkofk relations, we have a k 2 data matrix, in which every row represents an edge (if the network is undirected) or arc (if the network is directed) between two vertices in the cells. The use of edgelist and arclists is an often more economical way to store network data than an adjacency matrix, especially for networks that are relatively sparse. We extend the traditional edgelist and arclist formats by allowing the use of missing values. We use missing values to include isolates in the list (Fig. 2). In Figure 1, vertex 10 is isolated; in Figure 2, its vertex number appears in one column but is accompanied by a missing value in the other column.the order of appearance might be reversed. Thus, a network consisting of k edges and N vertices of which h isolates can be represented by a (k +h) 2 matrix. 2.3 Procedure The main task in network visualization is to determine the positions of the vertices in a (typically two-dimensional) graphical layout. Obviously, the optimal placement of vertices depends on the purpose of the analysis, but it is often desirable to place vertices that have a central position in the network also centrally in the graphic and to represent larger distance in the network by a larger distance in the two-dimensional graph. Various algorithms have been proposed to solve this problem, of which those by Kamada and Kawai (1989) and Fruchterman and Reingold (1991) are probably most widely used. Instead, we use multidimensional scaling (MDS) to compute 2
3 coordinates for the vertices, which has the advantage of being available in Stata by default. The use of MDS for network visualization has a long history in SNA and was first used as such by Laumann and Guttman (1966). Assuming that we have a relational dataset formatted as an edgelist, we propose to visualize the network by the following procedure: 1. Reshape the data into an adjacency matrix; 2. Compute the matrix of shortest paths (the distance matrix); 3. Compute coordinates for the vertices, arranged on a circle, in a random order; 4. Compute coordinates for the vertices by mds, using the modern method, and using the coordinates circle layout obtained in the previous step as starting positions; 5. Draw the graphic by combining the twoway plot types pcspike or pcarrow and scatter. In our implementation, steps 1 3 are performed in Mata. The calculation of the distance matrix (step 2) involves calculating higher powers of the adjacency matrix, and can be rather time-consuming for larger networks. More efficient procedures for obtaining distances in a network are feasible, but not implemented in our example. We choose Stata s iterative modern mds method for step 4 because it allows for the specification of starting positions and appears to provide better results in our tests. In particular, the modern method performs better than the classic method with regard the placement of vertices that have identical distances to all other vertices (for example, vertices in the periphery of a star ). Experimentation furthermore suggests that starting with a circular layout provides the best results. 1 3 Implementation: the netplot command Syntax: netplot var1 var2 [ if ][ in ] [, type(mds circle) label arrows iterations(#) ] The command netplot produces a graphical representation of a network stored as an extended edgelist or arclist in var1 and var2. type(string) specifies the type of layout. Valid values are: circle: vertices are arranged on a circle mds: positions of vertices are calculated using multidimensional scaling. This is the default; omitting type() is equivalent to specifying type(mds). label specifies that vertices are to be labeled using their identifiers in var1 and var2 arrows specifies that arrows rather than lines are drawn between vertices. Arrows run from the vertex in var1 to the vertex in var2. This option is useful for arclists that represent directed relations. iterations(#) specifies the maximum number of iterations in the multidimensional scaling procedure. The default is iterate(1000). 4 Examples To illustrate the process outlined above we use a well-known dataset known as Padgett s Florentine Families, which contains information on relations among 16 families in 15th century Florence, Italy (Padgett and Ansell, 1993). The part of the data we use represent marital relations between the families. These relations are by nature undirected. The data are described below: 1 Internally, our program issues the command mdsmat distance matrix, noplot method(modern) initialize(from(circle matrix)) iterate(#). 3
4 .. desc Contains data from padgett_marital02_undir.dta obs: 21 Padgett marital data with undirected ties vars: 2 22 Jan :37 size: 588 (99.9% of memory free) (_dta has notes) storage display value variable name type format label variable label from str12 %12s family 1 name to str12 %12s family 2 name Sorted by: from to. list, sepby(from) from to 1. Pucci 2. Albizzi Guadagni 3. Albizzi Medici 4. Barbadori Medici 5. Bischeri Guadagni 6. Bischeri Peruzzi 7. Bischeri Strozzi 8. Castellani Barbadori 9. Castellani Strozzi 10. Ginori Albizzi 11. Guadagni Lamberteschi 12. Medici Acciaiuoli 13. Medici Salviati 14. Medici Tornabuoni 15. Pazzi Salviati 16. Peruzzi Castellani 17. Peruzzi Strozzi 18. Ridolfi Medici 19. Ridolfi Tornabuoni 20. Strozzi Ridolfi 21. Tornabuoni Guadagni. Note that the data are in this case formatted as strings, simply using the family names as identifiers for the vertices of the network. The first example shows the most basic usage of netplot, using the command netplot from to, which produces a network plot of the data based as resulting from multidimensional scaling (Fig. 3). In many analyses it is useful to be able to identify specific vertices in the network, which is facilitated by adding labels to the plot using the option label (Fig. 4). We can now observe that this network has a cohesive core formed by the Medici, Ridolfi and Tornabuoni families, and that the isolated vertex is the Pucci family. The rotation of the graph and the different location of the isolated vertex as compared to Figure 3 is caused by the randomized initial positions used in the 4
5 MDS procedure. 2 Sometimes it is not necessary to have the relatively complicated plot as produced by multidimensional scaling. Then, a simple view on the data can be produced by the circle option (Fig. 5). For our final example with these data, we assume that the data are directed. That is, we assume that each line in the data represents a relation from the vertex in from to the vertex in to. Imagine, for instance, that the data now represent whether a family has ever sold goods to another family. Such situations can be visualized using the option arrows, which draws arrows instead of lines between the vertices (Fig. 6). The graph in this example was slightly adjusted afterwards using the Graph Editor, by reducing the sizes of the markers in order to make the arrowheads better visible. As a final example, we draw a plot of a somewhat larger network of 100 vertices. The data for this example were simulated using the preferential attachment algorithm proposed by Albert and Barabási (1999) to construct the network (Fig. 7). 3 This example highlights two limitations of netplot. First, as Figure 7 shows, vertex placement can be suboptimal: several vertices (such those labeled 63, 58, and 68) in the figure are placed too far from neighboring vertices, leading to many crossings of edges and an overall unpleasing result. The reason is that, in this particular tree-like network structure, there are many vertices (such as those mentioned) that have the exact same distance to all other vertices, which makes placement by MDS difficult. Second (not visible in the figure), the procedure becomes considerably more time-consuming with this number of vertices. We discuss this issue in more detail in the next section. 5 Performance To get of rough idea of the performance of netplot in terms of computation time we conduct two simulated tests. First, we draw plots of networks of increasing network size, keeping network density constant at.5. Note that this leads to an exponentially increasing number of edges in the network. To draw the plots, we use neplot without any options. The input networks are randomly generated Erdös-Rényi graphs (Erdös and Rényi, 1959). 4 For the second test, we again draw plots with increasing network size, but keep average degree constant rather than density. This implies a linear increase in the number of edges in the network. We use an average degree of 3. In both test, we look at networks sizes ranging from 5 to 100 in steps of 5, and keep track of the time needed to draw a graph over 10 iterations per network size. The results are shown in Figure 8. The figure indicates that average time increases quadratically with network size, although more strongly with constant density than with constant degree. Closer analysis (not reported) of the running time of the different components of the program reveals that the computation of coordinates using MDS (after computation of distances) is the most time-consuming step in the procedure. 6 Discussion In this paper we have demonstrated how network data can be visualized in Stata, using the built-in techniques for multidimensional scaling and graphics. This method often produces useful results, although not always for all networks. Moreover, for larger networks the time needed to compute vertex coordinates can become a nuisance, especially when several trials are needed to achieve a 2 The placement of labels outside the plotregion is part of the default behavior of twoway scatter, which is used by netplot. This can be easily adjusted afterwards. 3 The actual simulation was conducted in Mata. The function in which the Albert-Barabasi algorithm is implemented is part of a larger library of functions for network analysis under development by the author. 4 The tests were run in Stata 11 SE on a PC with a 2.66GHz dual core processor and 1Gb of memory, and running the Microsoft Windows XP 32-bit operating system. 5
6 Figure 3: Example: Marital relations between Florentine families with vertex placement by MDS Pucci Ginori Lamberteschi Albizzi Guadagni Acciaiuoli Tornabuoni Bischeri Ridolfi Medici Salviati Pazzi Peruzzi Strozzi Barbadori Castellani Figure 4: Example: Marital relations between Florentine families with vertex placement by MDS and labels added 6
7 Castellani Bischeri Barbadori Ginori Albizzi Guadagni Acciaiuoli Lamberteschi Tornabuoni Medici Strozzi Pazzi Salviati Peruzzi Pucci Ridolfi Figure 5: Example: Marital relations between Florentine families with circular vertex placement and labels Pucci Ginori Lamberteschi Albizzi Guadagni Acciaiuoli Tornabuoni Bischeri Ridolfi Medici Salviati Pazzi Peruzzi Strozzi Barbadori Castellani Figure 6: Example: Marital relations between Florentine families as directed relations with vertex placement by MDS and labels 7
8 Figure 7: Example: A simulated network of 100 vertices with vertex placement by MDS Time (seconds) N of vertices Density =.5 Average degree = 3 Figure 8: Average computation time by network size 8
9 satisfying result. As a workaround for this problem, the number of iterations may be limited using the iterations() option. Visual results could likely be improved by using different vertex placement algorithms than MDS. Good candidates are the often used spring embedding algorithms by Kamada and Kawai (1989) and Fruchterman and Reingold (1991). Given the program architecture of netplot, these methods could be added relatively easily, and implementing them would be an obvious target for future development. It is, however, not clear how the Kamada-Kawai and Fruchterman-Reingold algorithms compare to MDS in terms of computation time. Another approach to improving efficiency is to use more efficient methods for computing distances in the network. The simple approach currently implemented, based on repeated matrix squaring, computes quite some unneeded information in the process. More efficient algorithms for computing shortest paths exist (i.e., Cormen et al., 2001) and might be implemented in the future. The introduction of Mata with Stata 9 has made matrix programming more effective and more accessible for the average user. This opens up further possibilities for the development of SNA methods in Stata. Especiallythe fact thatmata canbe usedinteractivelymakesit easiertouse the alternative data structures representing networks common in SNA. The quickly growing interest in social networks in and outside the social sciences certainly justifies the further development of SNA methods for Stata. References Albert, R. and A.-L. Barabási (1999). Emergence of scaling in random networks. Science 286, Bagatelj, V. and A. Mrvar (2009). Pajek. Borgatti, S. P., M. G. Everett, and L. C. Freeman (1999). Ucinet 5. Cormen, T. H., C. E. Leiserson, R. L. Rivest, and C. Stein (2001). Introduction to Algorithm Design (Second ed.). Cambridge, MA: MIT Press. Erdös, P. and A. Rényi (1959). On random graphs i. Publicationes Mathematicae Debrecen 6, Freeman, L. C. (2000). Visualizing social networks. Journal of Social Structure 1(1). Fruchterman, T. M. J. and E. M. Reingold (1991). Graph drawing by force-directed placement. Software-Practice and Experience 21(1 1), Kamada, T. and S. Kawai(1989). An algoritm for drawing general undirected graphs. Information Processing Letters 31, Laumann, E. O. and L. Guttman (1966). The relative associational contiguity of occupations in an urban setting. American Sociological Review 31(2), Newman, M., A.-L. Barabási, and D. Watts (Eds.) (2006). The Structure and Dynamics of Networks. Princeton, NJ: Princeton University Press. Padgett, J. F. and C. K. Ansell (1993). Robust action and the rise of the medici, The American Journal of Sociology 98(6), Scott, J. (2000). Social Network Analysis: A Handbook (second ed.). London: Sage. Wasserman, S. and K. Faust (1994). Social Network Analysis: Methods and Applications. Cambridge: Cambridge University Press. 9
SGL: Stata graph library for network analysis
SGL: Stata graph library for network analysis Hirotaka Miura Federal Reserve Bank of San Francisco Stata Conference Chicago 2011 The views presented here are my own and do not necessarily represent the
HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy
SOCIAL NETWORK ANALYSIS University of Amsterdam, Netherlands Keywords: Social networks, structuralism, cohesion, brokerage, stratification, network analysis, methods, graph theory, statistical models Contents
Analyzing Big Data: Social Choice & Measurement
Analyzing Big Data: Social Choice & Measurement John W. Patty Elizabeth Maggie Penn August 30, 2014 Social scientists commonly construct indices to reduce and summarize complex, multidimensional data.
A comparative study of social network analysis tools
Membre de Membre de A comparative study of social network analysis tools David Combe, Christine Largeron, Előd Egyed-Zsigmond and Mathias Géry International Workshop on Web Intelligence and Virtual Enterprises
Analyzing Big Data: Social Choice & Measurement
Analyzing Big Data: Social Choice & Measurement John W. Patty Elizabeth Maggie Penn September 27, 2014 Social scientists commonly construct indices to reduce and summarize complex, multidimensional data.
Examining graduate committee faculty compositions- A social network analysis example. Kathryn Shirley and Kelly D. Bradley. University of Kentucky
Examining graduate committee faculty compositions- A social network analysis example Kathryn Shirley and Kelly D. Bradley University of Kentucky Graduate committee social network analysis 1 Abstract Social
Temporal Visualization and Analysis of Social Networks
Temporal Visualization and Analysis of Social Networks Peter A. Gloor*, Rob Laubacher MIT {pgloor,rjl}@mit.edu Yan Zhao, Scott B.C. Dynes *Dartmouth {yan.zhao,sdynes}@dartmouth.edu Abstract This paper
Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics
Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Zhao Wenbin 1, Zhao Zhengxu 2 1 School of Instrument Science and Engineering, Southeast University, Nanjing, Jiangsu
Social network and smoking: a pilotstudyamonghigh-schoolstudents
2013 Italian Stata Users Group meeting Social network and smoking: a pilotstudyamonghigh-schoolstudents Bruno Federico Firenze, 14 novembre 2013 The social pattern of smoking Smoking is an important risk
Follow links for Class Use and other Permissions. For more information send email to: [email protected]
COPYRIGHT NOTICE: Matthew O. Jackson: Social and Economic Networks is published by Princeton University Press and copyrighted, 2008, by Princeton University Press. All rights reserved. No part of this
TIBCO Spotfire Network Analytics 1.1. User s Manual
TIBCO Spotfire Network Analytics 1.1 User s Manual Revision date: 26 January 2009 Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED TIBCO
Graph/Network Visualization
Graph/Network Visualization Data model: graph structures (relations, knowledge) and networks. Applications: Telecommunication systems, Internet and WWW, Retailers distribution networks knowledge representation
Social Network Analysis: Lecture 4-Centrality Measures
Social Network Analysis: Lecture 4-Centrality Measures Donglei Du ([email protected]) Faculty of Business Administration, University of New Brunswick, NB Canada Fredericton E3B 9Y2 Donglei Du (UNB) Social Network
Knowledge diffusion and complex networks: a model of hightech geographical industrial clusters
Knowledge diffusion and complex networks: a model of hightech geographical industrial clusters Agustí Canals a a Information and Communication Sciences Department, Universitat Oberta de Catalunya, Barcelona,
An Introduction to APGL
An Introduction to APGL Charanpal Dhanjal February 2012 Abstract Another Python Graph Library (APGL) is a graph library written using pure Python, NumPy and SciPy. Users new to the library can gain an
DATA ANALYSIS II. Matrix Algorithms
DATA ANALYSIS II Matrix Algorithms Similarity Matrix Given a dataset D = {x i }, i=1,..,n consisting of n points in R d, let A denote the n n symmetric similarity matrix between the points, given as where
Fast Sequential Summation Algorithms Using Augmented Data Structures
Fast Sequential Summation Algorithms Using Augmented Data Structures Vadim Stadnik [email protected] Abstract This paper provides an introduction to the design of augmented data structures that offer
Data Structures. Chapter 8
Chapter 8 Data Structures Computer has to process lots and lots of data. To systematically process those data efficiently, those data are organized as a whole, appropriate for the application, called a
Excel -- Creating Charts
Excel -- Creating Charts The saying goes, A picture is worth a thousand words, and so true. Professional looking charts give visual enhancement to your statistics, fiscal reports or presentation. Excel
Scientific Graphing in Excel 2010
Scientific Graphing in Excel 2010 When you start Excel, you will see the screen below. Various parts of the display are labelled in red, with arrows, to define the terms used in the remainder of this overview.
CSE 326, Data Structures. Sample Final Exam. Problem Max Points Score 1 14 (2x7) 2 18 (3x6) 3 4 4 7 5 9 6 16 7 8 8 4 9 8 10 4 Total 92.
Name: Email ID: CSE 326, Data Structures Section: Sample Final Exam Instructions: The exam is closed book, closed notes. Unless otherwise stated, N denotes the number of elements in the data structure
Network Metrics, Planar Graphs, and Software Tools. Based on materials by Lala Adamic, UMichigan
Network Metrics, Planar Graphs, and Software Tools Based on materials by Lala Adamic, UMichigan Network Metrics: Bowtie Model of the Web n The Web is a directed graph: n webpages link to other webpages
GRID SEARCHING Novel way of Searching 2D Array
GRID SEARCHING Novel way of Searching 2D Array Rehan Guha Institute of Engineering & Management Kolkata, India Abstract: Linear/Sequential searching is the basic search algorithm used in data structures.
Introduction to the TI-Nspire CX
Introduction to the TI-Nspire CX Activity Overview: In this activity, you will become familiar with the layout of the TI-Nspire CX. Step 1: Locate the Touchpad. The Touchpad is used to navigate the cursor
Equivalence Concepts for Social Networks
Equivalence Concepts for Social Networks Tom A.B. Snijders University of Oxford March 26, 2009 c Tom A.B. Snijders (University of Oxford) Equivalences in networks March 26, 2009 1 / 40 Outline Structural
How to do a Business Network Analysis
How to do a Business Network Analysis by Graham Durant-Law Copyright HolisTech 2006-2007 Information and Knowledge Management Society 1 Format for the Evening Presentation (7:00 pm to 7:40 pm) Essential
UCINET Quick Start Guide
UCINET Quick Start Guide This guide provides a quick introduction to UCINET. It assumes that the software has been installed with the data in the folder C:\Program Files\Analytic Technologies\Ucinet 6\DataFiles
Formulas, Functions and Charts
Formulas, Functions and Charts :: 167 8 Formulas, Functions and Charts 8.1 INTRODUCTION In this leson you can enter formula and functions and perform mathematical calcualtions. You will also be able to
Microsoft Excel 2010 Part 3: Advanced Excel
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES Microsoft Excel 2010 Part 3: Advanced Excel Winter 2015, Version 1.0 Table of Contents Introduction...2 Sorting Data...2 Sorting
Linear Programming for Optimization. Mark A. Schulze, Ph.D. Perceptive Scientific Instruments, Inc.
1. Introduction Linear Programming for Optimization Mark A. Schulze, Ph.D. Perceptive Scientific Instruments, Inc. 1.1 Definition Linear programming is the name of a branch of applied mathematics that
Lab 11. Simulations. The Concept
Lab 11 Simulations In this lab you ll learn how to create simulations to provide approximate answers to probability questions. We ll make use of a particular kind of structure, called a box model, that
The number of marks is given in brackets [ ] at the end of each question or part question. The total number of marks for this paper is 72.
ADVANCED SUBSIDIARY GCE UNIT 4736/01 MATHEMATICS Decision Mathematics 1 THURSDAY 14 JUNE 2007 Afternoon Additional Materials: Answer Booklet (8 pages) List of Formulae (MF1) Time: 1 hour 30 minutes INSTRUCTIONS
Visualization of Communication Patterns in Collaborative Innovation Networks. Analysis of some W3C working groups
Visualization of Communication Patterns in Collaborative Innovation Networks Analysis of some W3C working groups Peter A. Gloor 1,2, Rob Laubacher 1, Scott B.C. Dynes 2, Yan Zhao 3 1 MIT Center for Coordination
Social Media Mining. Graph Essentials
Graph Essentials Graph Basics Measures Graph and Essentials Metrics 2 2 Nodes and Edges A network is a graph nodes, actors, or vertices (plural of vertex) Connections, edges or ties Edge Node Measures
A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION
A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS Kyoungjin Park Alper Yilmaz Photogrammetric and Computer Vision Lab Ohio State University [email protected] [email protected] ABSTRACT Depending
Introduction to social network analysis
Introduction to social network analysis Paola Tubaro University of Greenwich, London 26 March 2012 Introduction to social network analysis Introduction Introducing SNA Rise of online social networking
Matrix Multiplication
Matrix Multiplication CPS343 Parallel and High Performance Computing Spring 2016 CPS343 (Parallel and HPC) Matrix Multiplication Spring 2016 1 / 32 Outline 1 Matrix operations Importance Dense and sparse
ALGEBRA. sequence, term, nth term, consecutive, rule, relationship, generate, predict, continue increase, decrease finite, infinite
ALGEBRA Pupils should be taught to: Generate and describe sequences As outcomes, Year 7 pupils should, for example: Use, read and write, spelling correctly: sequence, term, nth term, consecutive, rule,
5. A full binary tree with n leaves contains [A] n nodes. [B] log n 2 nodes. [C] 2n 1 nodes. [D] n 2 nodes.
1. The advantage of.. is that they solve the problem if sequential storage representation. But disadvantage in that is they are sequential lists. [A] Lists [B] Linked Lists [A] Trees [A] Queues 2. The
Data Structure [Question Bank]
Unit I (Analysis of Algorithms) 1. What are algorithms and how they are useful? 2. Describe the factor on best algorithms depends on? 3. Differentiate: Correct & Incorrect Algorithms? 4. Write short note:
A Review And Evaluations Of Shortest Path Algorithms
A Review And Evaluations Of Shortest Path Algorithms Kairanbay Magzhan, Hajar Mat Jani Abstract: Nowadays, in computer networks, the routing is based on the shortest path problem. This will help in minimizing
Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary
Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:
Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014
Beyond Watson: Predictive Analytics and Big Data U.S. National Security Agency Research Directorate - R6 Technical Report February 3, 2014 300 years before Watson there was Euler! The first (Jeopardy!)
Eigenvector-like measures of centrality for asymmetric relations
Social Networks 23 (2001) 191 201 Eigenvector-like measures of centrality for asymmetric relations Phillip Bonacich, Paulette Lloyd Department of Sociology, University of California at Los Angeles, 2201
2 SYSTEM DESCRIPTION TECHNIQUES
2 SYSTEM DESCRIPTION TECHNIQUES 2.1 INTRODUCTION Graphical representation of any process is always better and more meaningful than its representation in words. Moreover, it is very difficult to arrange
Getting started with the Stata
Getting started with the Stata 1. Begin by going to a Columbia Computer Labs. 2. Getting started Your first Stata session. Begin by starting Stata on your computer. Using a PC: 1. Click on start menu 2.
System Interconnect Architectures. Goals and Analysis. Network Properties and Routing. Terminology - 2. Terminology - 1
System Interconnect Architectures CSCI 8150 Advanced Computer Architecture Hwang, Chapter 2 Program and Network Properties 2.4 System Interconnect Architectures Direct networks for static connections Indirect
KaleidaGraph Quick Start Guide
KaleidaGraph Quick Start Guide This document is a hands-on guide that walks you through the use of KaleidaGraph. You will probably want to print this guide and then start your exploration of the product.
A greedy algorithm for the DNA sequencing by hybridization with positive and negative errors and information about repetitions
BULLETIN OF THE POLISH ACADEMY OF SCIENCES TECHNICAL SCIENCES, Vol. 59, No. 1, 2011 DOI: 10.2478/v10175-011-0015-0 Varia A greedy algorithm for the DNA sequencing by hybridization with positive and negative
Network-Based Tools for the Visualization and Analysis of Domain Models
Network-Based Tools for the Visualization and Analysis of Domain Models Paper presented as the annual meeting of the American Educational Research Association, Philadelphia, PA Hua Wei April 2014 Visualizing
Pennsylvania System of School Assessment
Pennsylvania System of School Assessment The Assessment Anchors, as defined by the Eligible Content, are organized into cohesive blueprints, each structured with a common labeling system that can be read
Memory Systems. Static Random Access Memory (SRAM) Cell
Memory Systems This chapter begins the discussion of memory systems from the implementation of a single bit. The architecture of memory chips is then constructed using arrays of bit implementations coupled
Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data
CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear
The KaleidaGraph Guide to Curve Fitting
The KaleidaGraph Guide to Curve Fitting Contents Chapter 1 Curve Fitting Overview 1.1 Purpose of Curve Fitting... 5 1.2 Types of Curve Fits... 5 Least Squares Curve Fits... 5 Nonlinear Curve Fits... 6
Tools and Techniques for Social Network Analysis
Tools and Techniques for Social Network Analysis Pajek Program for Analysis and Visualization of Large Networks Pajek: What is it Pajek is a program, for Windows and Linux (via Wine) Developers: Vladimir
This activity will show you how to draw graphs of algebraic functions in Excel.
This activity will show you how to draw graphs of algebraic functions in Excel. Open a new Excel workbook. This is Excel in Office 2007. You may not have used this version before but it is very much the
Krishna Institute of Engineering & Technology, Ghaziabad Department of Computer Application MCA-213 : DATA STRUCTURES USING C
Tutorial#1 Q 1:- Explain the terms data, elementary item, entity, primary key, domain, attribute and information? Also give examples in support of your answer? Q 2:- What is a Data Type? Differentiate
Merging Labels, Letters, and Envelopes Word 2013
Merging Labels, Letters, and Envelopes Word 2013 Merging... 1 Types of Merges... 1 The Merging Process... 2 Labels - A Page of the Same... 2 Labels - A Blank Page... 3 Creating Custom Labels... 3 Merged
Open Source Software Developer and Project Networks
Open Source Software Developer and Project Networks Matthew Van Antwerp and Greg Madey University of Notre Dame {mvanantw,gmadey}@cse.nd.edu Abstract. This paper outlines complex network concepts and how
Handling the Complexity of RDF Data: Combining List and Graph Visualization
Handling the Complexity of RDF Data: Combining List and Graph Visualization Philipp Heim and Jürgen Ziegler (University of Duisburg-Essen, Germany philipp.heim, [email protected]) Abstract: An
Graph Visualization Tools: A Comparative Analysis
Graph Visualization Tools: A Comparative Analysis Fariha Majeed 1, Dr. Saif-ur-Rahman 2 1,2 Shaheed Zulfikar Ali Bhutto Institute of Science and Technology (SZABIST) Karachi, Pakistan 1 [email protected]
ultra fast SOM using CUDA
ultra fast SOM using CUDA SOM (Self-Organizing Map) is one of the most popular artificial neural network algorithms in the unsupervised learning category. Sijo Mathew Preetha Joy Sibi Rajendra Manoj A
Arrangements And Duality
Arrangements And Duality 3.1 Introduction 3 Point configurations are tbe most basic structure we study in computational geometry. But what about configurations of more complicated shapes? For example,
Summary of important mathematical operations and formulas (from first tutorial):
EXCEL Intermediate Tutorial Summary of important mathematical operations and formulas (from first tutorial): Operation Key Addition + Subtraction - Multiplication * Division / Exponential ^ To enter a
Basics of STATA. 1 Data les. 2 Loading data into STATA
Basics of STATA This handout is intended as an introduction to STATA. STATA is available on the PCs in the computer lab as well as on the Unix system. Throughout, bold type will refer to STATA commands,
Linear Programming. Solving LP Models Using MS Excel, 18
SUPPLEMENT TO CHAPTER SIX Linear Programming SUPPLEMENT OUTLINE Introduction, 2 Linear Programming Models, 2 Model Formulation, 4 Graphical Linear Programming, 5 Outline of Graphical Procedure, 5 Plotting
What is SNA? A sociogram showing ties
Case Western Reserve University School of Medicine Social Network Analysis: Nuts & Bolts Papp KK 1, Zhang GQ 2 1 Director, Program Evaluation, CTSC, 2 Professor, Electrical Engineering and Computer Science,
Distance Degree Sequences for Network Analysis
Universität Konstanz Computer & Information Science Algorithmics Group 15 Mar 2005 based on Palmer, Gibbons, and Faloutsos: ANF A Fast and Scalable Tool for Data Mining in Massive Graphs, SIGKDD 02. Motivation
Create Charts in Excel
Create Charts in Excel Table of Contents OVERVIEW OF CHARTING... 1 AVAILABLE CHART TYPES... 2 PIE CHARTS... 2 BAR CHARTS... 3 CREATING CHARTS IN EXCEL... 3 CREATE A CHART... 3 HOW TO CHANGE THE LOCATION
Polynomial and Rational Functions
Polynomial and Rational Functions Quadratic Functions Overview of Objectives, students should be able to: 1. Recognize the characteristics of parabolas. 2. Find the intercepts a. x intercepts by solving
Exponential Growth and Modeling
Exponential Growth and Modeling Is it Really a Small World After All? I. ASSESSSMENT TASK OVERVIEW & PURPOSE: Students will apply their knowledge of functions and regressions to compare the U.S. population
A Non-Linear Schema Theorem for Genetic Algorithms
A Non-Linear Schema Theorem for Genetic Algorithms William A Greene Computer Science Department University of New Orleans New Orleans, LA 70148 bill@csunoedu 504-280-6755 Abstract We generalize Holland
Asset Track Getting Started Guide. An Introduction to Asset Track
Asset Track Getting Started Guide An Introduction to Asset Track Contents Introducing Asset Track... 3 Overview... 3 A Quick Start... 6 Quick Start Option 1... 6 Getting to Configuration... 7 Changing
Socio-semantic network data visualization
Socio-semantic network data visualization Alexey Drutsa 1,2, Konstantin Yavorskiy 1 1 Witology [email protected], [email protected] http://www.witology.com 2 Moscow State University,
Linear Models in STATA and ANOVA
Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples
Evolving Networks with Distance Preferences
Evolving Networks with Distance Preferences Juergen Jost M. P. Joy SFI WORKING PAPER: 2002-07-030 SFI Working Papers contain accounts of scientific work of the author(s) and do not necessarily represent
Using Pivot Tables in Microsoft Excel 2003
Using Pivot Tables in Microsoft Excel 2003 Introduction A Pivot Table is the name Excel gives to what is more commonly known as a cross-tabulation table. Such tables can be one, two or three-dimensional
The mathematics of networks
The mathematics of networks M. E. J. Newman Center for the Study of Complex Systems, University of Michigan, Ann Arbor, MI 48109 1040 In much of economic theory it is assumed that economic agents interact,
CALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
JustClust User Manual
JustClust User Manual Contents 1. Installing JustClust 2. Running JustClust 3. Basic Usage of JustClust 3.1. Creating a Network 3.2. Clustering a Network 3.3. Applying a Layout 3.4. Saving and Loading
An approach of detecting structure emergence of regional complex network of entrepreneurs: simulation experiment of college student start-ups
An approach of detecting structure emergence of regional complex network of entrepreneurs: simulation experiment of college student start-ups Abstract Yan Shen 1, Bao Wu 2* 3 1 Hangzhou Normal University,
PRODUCT INFORMATION. Insight+ Uses and Features
PRODUCT INFORMATION Insight+ Traditionally, CAE NVH data and results have been presented as plots, graphs and numbers. But, noise and vibration must be experienced to fully comprehend its effects on vehicle
SHARP BOUNDS FOR THE SUM OF THE SQUARES OF THE DEGREES OF A GRAPH
31 Kragujevac J. Math. 25 (2003) 31 49. SHARP BOUNDS FOR THE SUM OF THE SQUARES OF THE DEGREES OF A GRAPH Kinkar Ch. Das Department of Mathematics, Indian Institute of Technology, Kharagpur 721302, W.B.,
Learning Module 4 - Thermal Fluid Analysis Note: LM4 is still in progress. This version contains only 3 tutorials.
Learning Module 4 - Thermal Fluid Analysis Note: LM4 is still in progress. This version contains only 3 tutorials. Attachment C1. SolidWorks-Specific FEM Tutorial 1... 2 Attachment C2. SolidWorks-Specific
Chapter 6: Graph Theory
Chapter 6: Graph Theory Graph theory deals with routing and network problems and if it is possible to find a best route, whether that means the least expensive, least amount of time or the least distance.
Quickstart for Desktop Version
Quickstart for Desktop Version What is GeoGebra? Dynamic Mathematics Software in one easy-to-use package For learning and teaching at all levels of education Joins interactive 2D and 3D geometry, algebra,
Offline 1-Minesweeper is NP-complete
Offline 1-Minesweeper is NP-complete James D. Fix Brandon McPhail May 24 Abstract We use Minesweeper to illustrate NP-completeness proofs, arguments that establish the hardness of solving certain problems.
Intermediate PowerPoint
Intermediate PowerPoint Charts and Templates By: Jim Waddell Last modified: January 2002 Topics to be covered: Creating Charts 2 Creating the chart. 2 Line Charts and Scatter Plots 4 Making a Line Chart.
Simplified External memory Algorithms for Planar DAGs. July 2004
Simplified External Memory Algorithms for Planar DAGs Lars Arge Duke University Laura Toma Bowdoin College July 2004 Graph Problems Graph G = (V, E) with V vertices and E edges DAG: directed acyclic graph
Glencoe. correlated to SOUTH CAROLINA MATH CURRICULUM STANDARDS GRADE 6 3-3, 5-8 8-4, 8-7 1-6, 4-9
Glencoe correlated to SOUTH CAROLINA MATH CURRICULUM STANDARDS GRADE 6 STANDARDS 6-8 Number and Operations (NO) Standard I. Understand numbers, ways of representing numbers, relationships among numbers,
Big Data Analysis. Rajen D. Shah (Statistical Laboratory, University of Cambridge) joint work with Nicolai Meinshausen (Seminar für Statistik, ETH
Big Data Analysis Rajen D Shah (Statistical Laboratory, University of Cambridge) joint work with Nicolai Meinshausen (Seminar für Statistik, ETH Zürich) University of Cambridge Mathematical Sciences Showcase
6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
Midterm Practice Problems
6.042/8.062J Mathematics for Computer Science October 2, 200 Tom Leighton, Marten van Dijk, and Brooke Cowan Midterm Practice Problems Problem. [0 points] In problem set you showed that the nand operator
Scheduling. Getting Started. Scheduling 79
Scheduling 9 Scheduling An event planner has to juggle many workers completing different tasks, some of which must be completed before others can begin. For example, the banquet tables would need to be
