Science of Science Research and Tools Tutorial #09 of 12
|
|
|
- Coral Grant
- 10 years ago
- Views:
Transcription
1 Science of Science Research and Tools Tutorial #09 of 12 Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information Visualization Laboratory, Director School of Library and Information Science Indiana University, Bloomington, IN With special thanks to Kevin W. Boyack, Micah Linnemeier, Russell J. Duhon, Patrick Phillips, Joseph Biberstine, Chintan Tank Nianli Ma, Hanning Guo, Mark A. Price, Angela M. Zoss, and Scott Weingart Invited by Robin M. Wagner, Ph.D., M.S. Chief Reporting Branch, Division of Information Services Office of Research Information Systems, Office of Extramural Research Office of the Director, National Institutes of Health Suite 4090, 6705 Rockledge Drive, Bethesda, MD a-noon, July 21, Tutorials in 12 Days at NIH Overview 1. Science of Science Research 1 st Week 2. Information Visualization 3. CIShell Powered Tools: Network Workbench and Science of Science Tool 4. Temporal Analysis Burst Detection 5. Geospatial Analysis and Mapping 6. Topical Analysis & Mapping 7. Tree Analysis and Visualization 8. Network Analysis 9. Large Network Analysis 10. Using the Scholarly Database at IU 11. VIVO National Researcher Networking 12. Future Developments 2 nd Week 3 rd Week 4 th Week 2
2 12 Tutorials in 12 Days at NIH Overview [#09] Large Network Analysis and Visualization General Overview Designing Effective Network Visualizations Sci2-Reading and Modeling Networks Sci2-Analysing Large Networks Sci2-Visualizing Large Networks and Distributions Outlook Exercise: Identify Promising Large Network Analyses of NIH Data Recommended Reading NWB Team (2009) Network Workbench Tool, User Manual 1.0.0, Börner, Katy, Sanyal, Soma and Vespignani, Alessandro (2007). Network Science. In Blaise Cronin (Ed.), ARIST, Information Today, Inc./American Society for Information Science and Technology, Medford, NJ, Volume 41, Chapter 12, pp [#09] Large Network Analysis and Visualization General Overview Designing Effective Network Visualizations Sci2-Reading and Modeling Networks Sci2-Analysing Large Networks Sci2-Visualizing Large Networks and Distributions Outlook Exercise: Identify Promising Large Network Analyses of NIH Data 4
3 Large Networks More than 10,000 nodes. Neither all nodes nor all edges can be shown at once. Sometimes, there are more nodes than pixels. Examples of large networks Communication networks: Internet, telephone network, wireless network. Network applications: The World Wide Web, interactions Transportation network/road maps Relationships between objects in a data base: Function/module dependency graphs Knowledge bases 5 Amsterdam RealTime project, WIRED Magazine, Issue March
4 Direct Manipulation Modify focusing parameters while continuously provide visual feedback and update display (fast computer response). Conditioning: filter, set background variables and display foreground parameters Identification: highlight, color, shape code Parameter control: line thickness, length, color legend, time slider, and animation control Navigation: Bird s Eye view, zoom, and pan Information requests: Mouse over or click on a node to retrieve more details or collapse/expand a subnetwork See NIH Awards Viewer at 7 VxInsight Tool VxInsight is a general purpose knowledge visualization software package developed at Sandia National Laboratories. It enables researchers, analysts, and decision-makers to accelerate their understanding of large databases. Show Insight_demo.exe Davidson, G.S., Hendrickson, B., Johnson, D.K., Meyers, C.E., Wylie, B.N., November/December "Knowledge Mining with VxInsight: Discovery through Interaction," Volume 11, Number 3, Journal of Intelligent Information Systems, Special Issue on Integrating Artificial Intelligence and Database Technologies. pp ) 8
5 Other Tools See for references. 9 Other Tools cont. See for references. 10
6 [#09] Large Network Analysis and Visualization General Overview Designing Effective Network Visualizations Sci2-Reading and Modeling Networks Sci2-Analysing Large Networks Sci2-Visualizing Large Networks and Distributions Outlook Exercise: Identify Promising Large Network Analyses of NIH Data 11 QVR Dataset Provided by Robert F. Moore, Deepshikha Roychowdhury, Emilee Pressman, and Matthew Eblen All NIH projects that received funding in (Oct 1, 1997-Sept 30, 2009) and their associated publications (max 100 per project so that SAS can handle the data. Note that some projects had publications! We do miss much data here.) 168,764 grant records collapsed by base project. 119,230 grants have a linked publications (pubid). There are 157,376 unique publications. Three (planned) analyses: 1. Large network visualization of 119k grants to 157k pubs network to show the scalability. 2. Horizontal Bar Graph visualization of all NIH grants. (need $ amounts) 3. UCSD science map of publications for different institutes. (need journal name) 12
7 QVR Dataset Large network visualization of 119k grants to 157k pubs network to show the scalability of the tool. 1. In original data file, delete all grants that have no associated publications. 2. Load resulting using File > Load > QVR-Bob Grants.csv SAS-grants-pubs-simplified.csv as csv file format. 3. Extract author bipartite grant to publications network using Data Preparation > Text Files > Extract Directed Network using parameters: 13 SAS Dataset Provided by Lindsey Pool 62,864 records, one per publication. Replace missing values by NULL to load into Sci2 Tool Load using File > Load > SAS-grants-pubs-simplified.csv as csv file format. If you run out of Java heap space: Load using File > Load > SAS-grants-pubs-4columns.csv 14
8 SAS Dataset Extract Co-Author Network Extract author co-occurrence network using Data Preparation > Text Files > Extract Co-Occurrence Network With parameters: (ignore the Aggregate Function File but note the space after ;) 15 SAS Dataset Extract Co-Author Network cont. Nodes: 127,879 authors Edges: 640,861 co-author relationships 16
9 [#09] Large Network Analysis and Visualization General Overview Designing Effective Network Visualizations Sci2-Reading and Modeling Networks Sci2-Analysing Large Networks Sci2-Visualizing Large Networks and Distributions Outlook Exercise: Identify Promising Large Network Analyses of NIH Data 17 Modeling the Co-Evolving Author-Paper Networks Börner, Katy, Maru, Jeegar & Goldstone, Robert. (2004). The Simultaneous Evolution of Author and Paper Networks. PNAS. Vol. 101(Suppl. 1), The TARL Model (Topics, Aging, and Recursive Linking) incorporates A partitioning of authors and papers into topics, Aging, i.e., a bias for authors to cite recent papers, and A tendency for authors to cite papers cited by papers that they have read resulting in a rich get richer effect. The model attempts to capture the roles of authors and papers in the production, storage, and dissemination of knowledge. Model Assumptions Co-author and paper-citation networks co-evolve. Authors come and go. Papers are forever. Only authors that are 'alive' are able to co-author. All existing (but no future) papers can be cited. Information diffusion occurs directly via co-authorships and indirectly via the consumption of other authors papers. Preferential attachment is modeled as an emergent property of the elementary, local networking activity of authors reading and citing papers, but also the references listed in papers. 18
10 Aging function Model Validation The properties of the networks generated by this model are validated against a 20-year data set ( ) of documents of type article published in the Proceedings of the National Academy of Science (PNAS) about 106,000 unique authors, 472,000 coauthor links, 45,120 papers cited within the set, and 114,000 citation references within the set. 19 The TARL Model: Pseudo Code 20
11 The TARL Model: The Effect of Parameters (0000) (1000) Topics (0100) Co-Authors (0010) References Topics lead to disconnected networks. Co-authoring leads to fewer papers. 21 Counts for Papers and Authors Aging function Counts for Citations 22
12 Co-Author and Paper-Citation Network Properties Aging function Power Law Distributions 23 Topics: The number of topics is linearly correlated with the clustering coefficient of the resulting network: C= * #topics. Increasing the number of topics increases the power law exponent as authors are now restricted to cite papers in their own topics area. Aging function Aging: With increasing b, and hence increasing the number of older papers cited as references, the clustering coefficient decreases. Papers are not only clustered by topic, but also in time, and as a community becomes increasingly nearsighted in terms of their citation practices, the degree of temporal clustering increases. References/Recursive Linking: The length of the chain of paper citation links that is followed to select references for a new paper also influences the clustering coefficient. Temporal clustering is ameliorated by the practice of citing (and hopefully reading!) the papers that were the earlier inspirations for read papers. 24
13 [#09] Large Network Analysis and Visualization General Overview Designing Effective Network Visualizations Sci2-Reading and Modeling Networks Sci2-Analyzing Large Networks Sci2-Visualizing Large Networks and Distributions Outlook Exercise: Identify Promising Large Network Analyses of NIH Data 25 Network Analysis and Visualization General Workflow Original Data Calculate Node Attributes Extract Network Extract Bipartite Network was selected. Input Parameters: First column: Source Node Text Delimiter: ; Second column: Target Nodes Visualization/Layout 26
14 Large Network Analysis & Visualization General Workflow Original Data Millions of records, in 100s of columns. SAS and Excel might not be able to handle these files. Files are shared between DB and tools as delimited text files (.csv). Derived Statistics Degree distributions Number of components and their sizes Extract giant component, subnetworks for further analysis Extract Network It might take several hours to extract a network on a laptop or even on a parallel cluster. Visualizations It is typically not possible to layout the network. DrL scales to 10 million nodes. 27 [#09] Large Network Analysis and Visualization General Overview Designing Effective Network Visualizations Sci2-Reading and Modeling Networks Sci2-Analysing Large Networks Sci2-Visualizing Large Networks and Distributions Outlook Exercise: Identify Promising Large Network Analyses of NIH Data 28
15 DrL Large Network Layout See Section in Sci2 Tutorial, DrL is a force directed graph layout toolbox for real world large scale graphs up to 2 million nodes. It includes: Standard force directed layout of graphs using algorithm based on the popular VxOrd routine (used in the VxInsight program). Parallel version of force directed layout algorithm. Recursive multilevel version for obtaining better layouts of very large graphs. Ability to add new vertices to a previously drawn graph. The version of DrL included in Sci2 only does the standard force directed layout (no recursive or parallel computation). Davidson, G. S., B. N. Wylie and K. W. Boyack (2001). "Cluster stability and the use of noise in interpretation of clustering." Proc. IEEE Information Visualization 2001: DrL Large Network Layout See Section in Sci2 Tutorial, How to use: DrL expects the edges to be weighted and undirected where the non zero weight denotes how similar the two nodes are (higher is more similar). Parameters are as follows: The edge cutting parameter expresses how much automatic edge cutting should be done. 0 means as little as possible, 1 as much as possible. Around.8 is a good value to use. The weight attribute parameter lets you choose which edge attribute in the network corresponds to the similarity weight. The X and Y parameters let you choose the attribute names to be used in the returned network which corresponds to the X and Y coordinates computed by the layout algorithm for the nodes. DrL is commonly used to layout large networks, e.g., those derived in co citation and co word analyses. In the Sci2 Tool, the results can be viewed in either GUESS or Visualization > Specified (prefuse alpha). See also 30
16 Use Ctrl+Alt+Delete to see CPU and Memory Usage SAS Dataset Extract Co-Author Network Extract author co-occurrence network using Data Preparation > Text Files > Extract Co-Occurrence Network With parameters: (ignore the Aggregate Function File but note the space after ;) 32
17 DrL Run & Output DrL (VxOrd) was selected. Author(s): S. Martin, W. M. Brown, K. Boyack Implementer(s): S. Martin, W. M. Brown, K. Boyack Integrator(s): Bruce Herr Reference: S. Martin, W. M. Brown, K. Boyack, "Dr. L: Distributed Recursive (Graph) Layout," in preparation for Journal of Graph Algorithms and Applications. ( Documentation: Input Parameters: Edge Cutting Strength: 0.8 New X-Position Attribute Name: xpos Edge Weight Attribute: weight Do not cut edges: false New Y-Position Attribute Name: ypos Entering liquid stage... Liquid stage completed in 317 seconds, total energy = e+013. Entering expansion stage... Finished expansion stage in 324 seconds, total energy = e+009. Entering cool-down stage... Completed cool-down stage in 321 seconds, total energy = e+009. Entering crunch stage... Finished crunch stage in 79 seconds, total energy = e+009. Entering simmer stage... Finished simmer stage in 98 seconds, total energy = Layout calculation completed in 1139 seconds (not including I/O). Writing out solution to infile.icoord... Total Energy: Program terminated successfully. 33 DrL Output Visualization I saved file as SAS-Co-Author-DrL-Layout.nwb. Visualize network using GUESS by selecting the network file, running Network > Visualizing > GUESS, then run the following commands in the GUESS Interpreter: > for node in g.nodes:... node.x = node.xpos... node.y = node.ypos to position the nodes at the x and y position calculated by DrL. 34
18 DrL Output Visualization Visualize SAS-Co-Author-DrL-Layout.nwb using Visualization > Specified (prefuse alpha) 35 DrL Output Plot Node Degree Distribution Calculate degree distribution using and plot using Visualization > General > Gnuplot Or Excel (right click file and View ). 36
19 DrL Output Plot Node Degree Distribution Calculate degree distribution using and plot using Visualization > General > Gnuplot Or Excel (right click file and View ). 37 [#09] Large Network Analysis and Visualization General Overview Designing Effective Network Visualizations Sci2-Reading and Modeling Networks Sci2-Analysing Large Networks Sci2-Visualizing Large Networks and Distributions Outlook Exercise: Identify Promising Large Network Analyses of NIH Data 38
20 Planned Work Add (scalable) clustering algorithms to Sci2 Tool. Advanced network reduction algorithms. Visual language that helps communicate patterns, trends, activity bursts, etc. More interactivity, e.g., by opening networks in Cytoscape 39
21
22 [#09] Large Network Analysis and Visualization General Overview Designing Effective Network Visualizations Sci2-Reading and Modeling Networks Sci2-Analysing Large Networks Sci2-Visualizing Large Networks and Distributions Outlook Exercise: Identify Promising Large Network Analyses of NIH Data 43 Exercise Please identify a promising large network analysis of NIH data. Document it by listing Project title User, i.e., who would be most interested in the result? Insight need addressed, i.e., what would you/user like to understand? Data used, be as specific as possible. Analysis algorithms used. Visualization generated. Please make a sketch with legend. 44
23 All papers, maps, cyberinfrastructures, talks, press are linked from 45
Visualization of Phylogenetic Trees and Metadata
Visualization of Phylogenetic Trees and Metadata November 27, 2015 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com [email protected]
NakeDB: Database Schema Visualization
NAKEDB: DATABASE SCHEMA VISUALIZATION, APRIL 2008 1 NakeDB: Database Schema Visualization Luis Miguel Cortés-Peña, Yi Han, Neil Pradhan, Romain Rigaux Abstract Current database schema visualization tools
Gephi Tutorial Quick Start
Gephi Tutorial Welcome to this introduction tutorial. It will guide you to the basic steps of network visualization and manipulation in Gephi. Gephi version 0.7alpha2 was used to do this tutorial. Get
Social Network Analysis Workshop CIShell Powered Tools: Network Workbench (NWB) & Science of Science (Sci2) Tool
Social Network Analysis Workshop CIShell Powered Tools: Network Workbench (NWB) & Science of Science (Sci2) Tool Dr. Katy Börner and Chin Hua Kong Cyberinfrastructure for Network Science Center Information
A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING
A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING Ahmet Selman BOZKIR Hacettepe University Computer Engineering Department, Ankara, Turkey [email protected] Ebru Akcapinar
<no narration for this slide>
1 2 The standard narration text is : After completing this lesson, you will be able to: < > SAP Visual Intelligence is our latest innovation
Subgraph Patterns: Network Motifs and Graphlets. Pedro Ribeiro
Subgraph Patterns: Network Motifs and Graphlets Pedro Ribeiro Analyzing Complex Networks We have been talking about extracting information from networks Some possible tasks: General Patterns Ex: scale-free,
Visualizing the Top 400 Universities
Int'l Conf. e-learning, e-bus., EIS, and e-gov. EEE'15 81 Visualizing the Top 400 Universities Salwa Aljehane 1, Reem Alshahrani 1, and Maha Thafar 1 [email protected], [email protected], [email protected]
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
Visualization methods for patent data
Visualization methods for patent data Treparel 2013 Dr. Anton Heijs (CTO & Founder) Delft, The Netherlands Introduction Treparel can provide advanced visualizations for patent data. This document describes
Oracle Data Miner (Extension of SQL Developer 4.0)
An Oracle White Paper October 2013 Oracle Data Miner (Extension of SQL Developer 4.0) Generate a PL/SQL script for workflow deployment Denny Wong Oracle Data Mining Technologies 10 Van de Graff Drive Burlington,
Clustering & Visualization
Chapter 5 Clustering & Visualization Clustering in high-dimensional databases is an important problem and there are a number of different clustering paradigms which are applicable to high-dimensional data.
Visual Analysis Tool for Bipartite Networks
Visual Analysis Tool for Bipartite Networks Kazuo Misue Department of Computer Science, University of Tsukuba, 1-1-1 Tennoudai, Tsukuba, 305-8573 Japan [email protected] Abstract. To find hidden features
KnowledgeSEEKER Marketing Edition
KnowledgeSEEKER Marketing Edition Predictive Analytics for Marketing The Easiest to Use Marketing Analytics Tool KnowledgeSEEKER Marketing Edition is a predictive analytics tool designed for marketers
Visualizing bibliometric networks
This is a preprint of the following book chapter: Van Eck, N.J., & Waltman, L. (2014). Visualizing bibliometric networks. In Y. Ding, R. Rousseau, & D. Wolfram (Eds.), Measuring scholarly impact: Methods
KaleidaGraph Quick Start Guide
KaleidaGraph Quick Start Guide This document is a hands-on guide that walks you through the use of KaleidaGraph. You will probably want to print this guide and then start your exploration of the product.
Distance Degree Sequences for Network Analysis
Universität Konstanz Computer & Information Science Algorithmics Group 15 Mar 2005 based on Palmer, Gibbons, and Faloutsos: ANF A Fast and Scalable Tool for Data Mining in Massive Graphs, SIGKDD 02. Motivation
Network VisualizationS
Network VisualizationS When do they make sense? Where to start? Clement Levallois, Assist. Prof. EMLYON Business School v. 1.1, January 2014 Bio notes Education in economics, management, history of science
Client Overview. Engagement Situation. Key Requirements
Client Overview Our client is one of the leading providers of business intelligence systems for customers especially in BFSI space that needs intensive data analysis of huge amounts of data for their decision
TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:
Table of contents: Access Data for Analysis Data file types Format assumptions Data from Excel Information links Add multiple data tables Create & Interpret Visualizations Table Pie Chart Cross Table Treemap
The Big Data methodology in computer vision systems
The Big Data methodology in computer vision systems Popov S.B. Samara State Aerospace University, Image Processing Systems Institute, Russian Academy of Sciences Abstract. I consider the advantages of
KNIME TUTORIAL. Anna Monreale KDD-Lab, University of Pisa Email: [email protected]
KNIME TUTORIAL Anna Monreale KDD-Lab, University of Pisa Email: [email protected] Outline Introduction on KNIME KNIME components Exercise: Market Basket Analysis Exercise: Customer Segmentation Exercise:
Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results
, pp.33-40 http://dx.doi.org/10.14257/ijgdc.2014.7.4.04 Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results Muzammil Khan, Fida Hussain and Imran Khan Department
A Brief Introduction to Apache Tez
A Brief Introduction to Apache Tez Introduction It is a fact that data is basically the new currency of the modern business world. Companies that effectively maximize the value of their data (extract value
DHL Data Mining Project. Customer Segmentation with Clustering
DHL Data Mining Project Customer Segmentation with Clustering Timothy TAN Chee Yong Aditya Hridaya MISRA Jeffery JI Jun Yao 3/30/2010 DHL Data Mining Project Table of Contents Introduction to DHL and the
BIG DATA VISUALIZATION. Team Impossible Peter Vilim, Sruthi Mayuram Krithivasan, Matt Burrough, and Ismini Lourentzou
BIG DATA VISUALIZATION Team Impossible Peter Vilim, Sruthi Mayuram Krithivasan, Matt Burrough, and Ismini Lourentzou Let s begin with a story Let s explore Yahoo s data! Dora the Data Explorer has a new
Extend Table Lens for High-Dimensional Data Visualization and Classification Mining
Extend Table Lens for High-Dimensional Data Visualization and Classification Mining CPSC 533c, Information Visualization Course Project, Term 2 2003 Fengdong Du [email protected] University of British Columbia
Hierarchical Clustering Analysis
Hierarchical Clustering Analysis What is Hierarchical Clustering? Hierarchical clustering is used to group similar objects into clusters. In the beginning, each row and/or column is considered a cluster.
Back Propagation Neural Networks User Manual
Back Propagation Neural Networks User Manual Author: Lukáš Civín Library: BP_network.dll Runnable class: NeuralNetStart Document: Back Propagation Neural Networks Page 1/28 Content: 1 INTRODUCTION TO BACK-PROPAGATION
Visualizing Data from Government Census and Surveys: Plans for the Future
Censuses and Surveys of Governments: A Workshop on the Research and Methodology behind the Estimates Visualizing Data from Government Census and Surveys: Plans for the Future Kerstin Edwards March 15,
Polynomial Neural Network Discovery Client User Guide
Polynomial Neural Network Discovery Client User Guide Version 1.3 Table of contents Table of contents...2 1. Introduction...3 1.1 Overview...3 1.2 PNN algorithm principles...3 1.3 Additional criteria...3
MINING THE DATA FROM DISTRIBUTED DATABASE USING AN IMPROVED MINING ALGORITHM
MINING THE DATA FROM DISTRIBUTED DATABASE USING AN IMPROVED MINING ALGORITHM J. Arokia Renjit Asst. Professor/ CSE Department, Jeppiaar Engineering College, Chennai, TamilNadu,India 600119. Dr.K.L.Shunmuganathan
MicroStrategy Analytics Express User Guide
MicroStrategy Analytics Express User Guide Analyzing Data with MicroStrategy Analytics Express Version: 4.0 Document Number: 09770040 CONTENTS 1. Getting Started with MicroStrategy Analytics Express Introduction...
Network Metrics, Planar Graphs, and Software Tools. Based on materials by Lala Adamic, UMichigan
Network Metrics, Planar Graphs, and Software Tools Based on materials by Lala Adamic, UMichigan Network Metrics: Bowtie Model of the Web n The Web is a directed graph: n webpages link to other webpages
HierarchyMap: A Novel Approach to Treemap Visualization of Hierarchical Data
P a g e 77 Vol. 9 Issue 5 (Ver 2.0), January 2010 Global Journal of Computer Science and Technology HierarchyMap: A Novel Approach to Treemap Visualization of Hierarchical Data Abstract- The HierarchyMap
TECHNOLOGY ANALYSIS FOR INTERNET OF THINGS USING BIG DATA LEARNING
TECHNOLOGY ANALYSIS FOR INTERNET OF THINGS USING BIG DATA LEARNING Sunghae Jun 1 1 Professor, Department of Statistics, Cheongju University, Chungbuk, Korea Abstract The internet of things (IoT) is an
A Visualization Technique for Monitoring of Network Flow Data
A Visualization Technique for Monitoring of Network Flow Data Manami KIKUCHI Ochanomizu University Graduate School of Humanitics and Sciences Otsuka 2-1-1, Bunkyo-ku, Tokyo, JAPAPN [email protected]
Utilizing spatial information systems for non-spatial-data analysis
Jointly published by Akadémiai Kiadó, Budapest Scientometrics, and Kluwer Academic Publishers, Dordrecht Vol. 51, No. 3 (2001) 563 571 Utilizing spatial information systems for non-spatial-data analysis
KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES
HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES Translating data into business value requires the right data mining and modeling techniques which uncover important patterns within
A Tutorial on dynamic networks. By Clement Levallois, Erasmus University Rotterdam
A Tutorial on dynamic networks By, Erasmus University Rotterdam V 1.0-2013 Bio notes Education in economics, management, history of science (Ph.D.) Since 2008, turned to digital methods for research. data
ORGANIZATIONAL KNOWLEDGE MAPPING BASED ON LIBRARY INFORMATION SYSTEM
ORGANIZATIONAL KNOWLEDGE MAPPING BASED ON LIBRARY INFORMATION SYSTEM IRANDOC CASE STUDY Ammar Jalalimanesh a,*, Elaheh Homayounvala a a Information engineering department, Iranian Research Institute for
Workshop: Open Source Tools for Data Analysis and Visualization
Workshop: Open Source Tools for Data Analysis and Visualization Katy Börner Victor H. Yngve Professor of Information Science Director, Cyberinfrastructure for Network Science Center School of Informatics
A comparative study of social network analysis tools
Membre de Membre de A comparative study of social network analysis tools David Combe, Christine Largeron, Előd Egyed-Zsigmond and Mathias Géry International Workshop on Web Intelligence and Virtual Enterprises
MicroStrategy Desktop
MicroStrategy Desktop Quick Start Guide MicroStrategy Desktop is designed to enable business professionals like you to explore data, simply and without needing direct support from IT. 1 Import data from
Interactive Information Visualization of Trend Information
Interactive Information Visualization of Trend Information Yasufumi Takama Takashi Yamada Tokyo Metropolitan University 6-6 Asahigaoka, Hino, Tokyo 191-0065, Japan [email protected] Abstract This paper
Visualizing e-government Portal and Its Performance in WEBVS
Visualizing e-government Portal and Its Performance in WEBVS Ho Si Meng, Simon Fong Department of Computer and Information Science University of Macau, Macau SAR [email protected] Abstract An e-government
JustClust User Manual
JustClust User Manual Contents 1. Installing JustClust 2. Running JustClust 3. Basic Usage of JustClust 3.1. Creating a Network 3.2. Clustering a Network 3.3. Applying a Layout 3.4. Saving and Loading
SAS BI Dashboard 4.3. User's Guide. SAS Documentation
SAS BI Dashboard 4.3 User's Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2010. SAS BI Dashboard 4.3: User s Guide. Cary, NC: SAS Institute
Ovation Operator Workstation for Microsoft Windows Operating System Data Sheet
Ovation Operator Workstation for Microsoft Windows Operating System Features Delivers full multi-tasking operation Accesses up to 200,000 dynamic points Secure standard operating desktop environment Intuitive
OECD.Stat Web Browser User Guide
OECD.Stat Web Browser User Guide May 2013 May 2013 1 p.10 Search by keyword across themes and datasets p.31 View and save combined queries p.11 Customise dimensions: select variables, change table layout;
Formulas, Functions and Charts
Formulas, Functions and Charts :: 167 8 Formulas, Functions and Charts 8.1 INTRODUCTION In this leson you can enter formula and functions and perform mathematical calcualtions. You will also be able to
KnowledgeSEEKER POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE
POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE Most Effective Modeling Application Designed to Address Business Challenges Applying a predictive strategy to reach a desired business
InfoVis Cyberinfrastructure
InfoVis Cyberinfrastructure Katy Börner School of Library and Information Science [email protected] SLIS Colloquium, November 19 th, 2004 http://iv.slis.indiana.edu/sw http://iv.slis.indiana.edu/db http://iv.slis.indiana.edu/cr
What is Visualization? Information Visualization An Overview. Information Visualization. Definitions
What is Visualization? Information Visualization An Overview Jonathan I. Maletic, Ph.D. Computer Science Kent State University Visualize/Visualization: To form a mental image or vision of [some
Decision Trees from large Databases: SLIQ
Decision Trees from large Databases: SLIQ C4.5 often iterates over the training set How often? If the training set does not fit into main memory, swapping makes C4.5 unpractical! SLIQ: Sort the values
Graph/Network Visualization
Graph/Network Visualization Data model: graph structures (relations, knowledge) and networks. Applications: Telecommunication systems, Internet and WWW, Retailers distribution networks knowledge representation
TORNADO Solution for Telecom Vertical
BIG DATA ANALYTICS & REPORTING TORNADO Solution for Telecom Vertical Overview Last decade has see a rapid growth in wireless and mobile devices such as smart- phones, tablets and netbook is becoming very
PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.
PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS Project Project Title Area of Abstract No Specialization 1. Software
Didacticiel Études de cas. Association Rules mining with Tanagra, R (arules package), Orange, RapidMiner, Knime and Weka.
1 Subject Association Rules mining with Tanagra, R (arules package), Orange, RapidMiner, Knime and Weka. This document extends a previous tutorial dedicated to the comparison of various implementations
DB2 Web Query: Creating Dashboards with HTML Composer and InfoAssist. Jacqueline Jansen [email protected] Information Builders
DB2 Web Query: Creating Dashboards with HTML Composer and InfoAssist Jacqueline Jansen [email protected] Information Builders Dashboard Creation Agenda HTML Composer Add push buttons, hyperlinks, images
Do you know how your TSM environment is evolving?
Trend reporting for Tivoli Storage Manager Holger Speh Consulting IT Specialist Do you know how your TSM environment is evolving? Healthy? Well integrated? Data Growth? Accounting? 2 2 Historical Reporting
What s New in JReport 13.1
Highlights JReport 13.1 focuses on new geographical tools for data visualization, enhanced data analysis and presentation in dashboards and reports, as well as greater performance and scalability when
Visual Mining of E-Customer Behavior Using Pixel Bar Charts
Visual Mining of E-Customer Behavior Using Pixel Bar Charts Ming C. Hao, Julian Ladisch*, Umeshwar Dayal, Meichun Hsu, Adrian Krug Hewlett Packard Research Laboratories, Palo Alto, CA. (ming_hao, dayal)@hpl.hp.com;
Creating a Network Graph with Gephi
Creating a Network Graph with Gephi Gephi is a powerful tool for network analysis, but it can be intimidating. It has a lot of tools for statistical analysis of network data most of which you won't be
APPLICATION OF DATA MINING TECHNIQUES FOR BUILDING SIMULATION PERFORMANCE PREDICTION ANALYSIS. email [email protected]
Eighth International IBPSA Conference Eindhoven, Netherlands August -4, 2003 APPLICATION OF DATA MINING TECHNIQUES FOR BUILDING SIMULATION PERFORMANCE PREDICTION Christoph Morbitzer, Paul Strachan 2 and
M. Sugumaran / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 2 (3), 2011, 1001-1006
A Design of Centralized Meeting Scheduler with Distance Metrics M. Sugumaran Department of Computer Science and Engineering,Pondicherry Engineering College, Puducherry, India. Abstract Meeting scheduling
TheFinancialEdge. Dashboard Guide
TheFinancialEdge Dashboard Guide 101911 2011 Blackbaud, Inc. This publication, or any part thereof, may not be reproduced or transmitted in any form or by any means, electronic, or mechanical, including
Scalable Developments for Big Data Analytics in Remote Sensing
Scalable Developments for Big Data Analytics in Remote Sensing Federated Systems and Data Division Research Group High Productivity Data Processing Dr.-Ing. Morris Riedel et al. Research Group Leader,
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
TIBCO Spotfire Network Analytics 1.1. User s Manual
TIBCO Spotfire Network Analytics 1.1 User s Manual Revision date: 26 January 2009 Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED TIBCO
an introduction to VISUALIZING DATA by joel laumans
an introduction to VISUALIZING DATA by joel laumans an introduction to VISUALIZING DATA iii AN INTRODUCTION TO VISUALIZING DATA by Joel Laumans Table of Contents 1 Introduction 1 Definition Purpose 2 Data
EcOS (Economic Outlook Suite)
EcOS (Economic Outlook Suite) Customer profile The International Monetary Fund (IMF) is an international organization working to promote international monetary cooperation and exchange stability; to foster
OpenText Information Hub (ihub) 3.1 and 3.1.1
OpenText Information Hub (ihub) 3.1 and 3.1.1 OpenText Information Hub (ihub) 3.1.1 meets the growing demand for analytics-powered applications that deliver data and empower employees and customers to
Dataset Preparation and Indexing for Data Mining Analysis Using Horizontal Aggregations
Dataset Preparation and Indexing for Data Mining Analysis Using Horizontal Aggregations Binomol George, Ambily Balaram Abstract To analyze data efficiently, data mining systems are widely using datasets
Data Mining III: Numeric Estimation
Data Mining III: Numeric Estimation Computer Science 105 Boston University David G. Sullivan, Ph.D. Review: Numeric Estimation Numeric estimation is like classification learning. it involves learning a
Criteria for Evaluating Visual EDA Tools
Criteria for Evaluating Visual EDA Tools Stephen Few, Perceptual Edge Visual Business Intelligence Newsletter April/May/June 2012 We visualize data for various purposes. Specific purposes direct us to
Assignment 5: Visualization
Assignment 5: Visualization Arash Vahdat March 17, 2015 Readings Depending on how familiar you are with web programming, you are recommended to study concepts related to CSS, HTML, and JavaScript. The
Visualization with Excel Tools and Microsoft Azure
Visualization with Excel Tools and Microsoft Azure Introduction Power Query and Power Map are add-ins that are available as free downloads from Microsoft to enhance the data access and data visualization
Visualization with ParaView. Greg Johnson
Visualization with Greg Johnson Before we begin Make sure you have 3.8.0 installed so you can follow along in the lab section http://paraview.org/paraview/resources/software.html http://www.paraview.org/
Technical paper review. Program visualization and explanation for novice C programmers by Matthew Heinsen Egan and Chris McDonald.
Technical paper review Program visualization and explanation for novice C programmers by Matthew Heinsen Egan and Chris McDonald Garvit Pahal Indian Institute of Technology, Kanpur October 28, 2014 Garvit
Visualization with ParaView
Visualization with ParaView Before we begin Make sure you have ParaView 4.1.0 installed so you can follow along in the lab section http://paraview.org/paraview/resources/software.php Background http://www.paraview.org/
IEEE JAVA Project 2012
IEEE JAVA Project 2012 Powered by Cloud Computing Cloud Computing Security from Single to Multi-Clouds. Reliable Re-encryption in Unreliable Clouds. Cloud Data Production for Masses. Costing of Cloud Computing
An Interactive Visualization Tool for the Analysis of Multi-Objective Embedded Systems Design Space Exploration
An Interactive Visualization Tool for the Analysis of Multi-Objective Embedded Systems Design Space Exploration Toktam Taghavi, Andy D. Pimentel Computer Systems Architecture Group, Informatics Institute
KNOWLEDGE NETWORK SYSTEM APPROACH TO THE KNOWLEDGE MANAGEMENT
KNOWLEDGE NETWORK SYSTEM APPROACH TO THE KNOWLEDGE MANAGEMENT ZHONGTUO WANG RESEARCH CENTER OF KNOWLEDGE SCIENCE AND TECHNOLOGY DALIAN UNIVERSITY OF TECHNOLOGY DALIAN CHINA CONTENTS 1. KNOWLEDGE SYSTEMS
ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat
ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web
Data exploration with Microsoft Excel: analysing more than one variable
Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical
