InfoVis Cyberinfrastructure



Similar documents
Getting Started with Oracle Data Miner 11g R2. Brendan Tierney

DATA MINING - SELECTED TOPICS

Curriculum Vitae Ruben Sipos

Science of Science Research and Tools Tutorial #09 of 12

Sci/Tech & Eng 201: Data Visualization

Tim Hsu. Updated Fall 2012

Stephanie A. Blanda 020 McAllister Building University Park, PA Webpage:

PSG College of Technology, Coimbatore Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.

Hadoop Technology for Flow Analysis of the Internet Traffic

Curriculum Vitae Peter Andrews

Using EMC Documentum with Adobe LiveCycle ES

An Empirical Study of Application of Data Mining Techniques in Library System

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

Visualizing Software Projects in JavaScript

Tableau's data visualization software is provided through the Tableau for Teaching program.

Pre-Masters. Science and Engineering

Information Management

CS Data Science and Visualization Spring 2016

A New MSc Curriculum in Computer Science and Mathematics at the University of Zagreb

COMPUTER SCIENCE: MISCONCEPTIONS, CAREER PATHS AND RESEARCH CHALLENGES

Lili SLIS, SJSU

Introduction to Databases and Data Mining

Data Mining in Web Search Engine Optimization and User Assisted Rank Results

The National Consortium for Data Science (NCDS)

How To Become A Data Scientist

A FRAMEWORK OF WEB-BASED SERVICE SYSTEM FOR SATELLITE IMAGES AUTOMATIC ORTHORECTIFICATION

Text Localization & Segmentation in Images, Web Pages and Videos Media Mining I

Anna Jacob Egalite B.Ed. in Elementary Education, St. Patrick's College, Dublin, Ireland

Big Data Analytics: Where is it Going and How Can it Be Taught at the Undergraduate Level?

Programme Specification (Undergraduate) Date amended: 27 February 2012

Nathaniel L. Foster Department of Psychology St. Mary s College of Maryland St. Mary s City, MD (240) nlfoster@smcm.

Master Specialization in Knowledge Engineering

Scholarly Use of Web Archives

254 South 300 East #101 Salt Lake City, Utah

VITAE KYUNGWON KOH, PHD

New Matrix Approach to Improve Apriori Algorithm

THE CCLRC DATA PORTAL

RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE. Luigi Grimaudo Database And Data Mining Research Group

A Collaborative Approach to Building Personal Knowledge Networks or How to Build a Knowledge Advantage Machine?

Masters in Advanced Computer Science

Course Development of Programming for General-Purpose Multicore Processors

An interdisciplinary model for analytics education

Chung-Bang Ben Weng, Ph.D, MSCS, MA

ResearchGate. Scientific Profile. Professional network for scientists. ResearchGate is. Manage your online presence

IT services for analyses of various data samples

Masters in Computing and Information Technology

JAVA-BASED FRAMEWORK FOR REMOTE ACCESS TO LABORATORY EXPERIMENTS. Department of Electrical Engineering University of Hagen D Hagen, Germany

Programme Specification (Undergraduate) Date amended: 28 August 2015

MEIRA LEVINSON. 54 Arborway, Jamaica Plain, MA (617)

Module 8 Digital Libraries and Open Access

Adina Crainiceanu. Ph.D. in Computer Science, Cornell University, Ithaca, NY May 2006 Thesis Title: Answering Complex Queries in Peer-to-Peer Systems

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL

Lisa D. Friedland School of Computer Science 140 Governors Drive Amherst, MA (413)

Discover Viterbi: New Programs in Computer Science

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM

CURRICULUM VITAE. University of Michigan, Ann Arbor, MI August 2005 Master of Arts Major: Educational Studies; Language, Literacy, and Culture

KARIM CHALAK PERSONAL. Born: March 1982 Webpage: Phone:

DATA SCIENCE ADVISING NOTES David Wild - updated May 2015

ERIC WILLIAM GILL, Ph.D., P.Eng. Faculty of Engineering and Applied Science Memorial University of Newfoundland Ph: ;

BONG-SOO SOHN RESEARCH INTERESTS

Okemos Road, Apt G214, Okemos, MI Tel: Teaching Portfolio Website:

B.Sc. in Computer Information Systems Study Plan

The Implementation of Wiki-based Knowledge Management Systems for Small Research Groups

M.S., Computer and Information Science, University of Massachusetts, Amherst, May Thesis: On Ramsey Tautologies.

Building Library Website using Drupal

Test Plan Security Assertion Markup Language Protocol Interface BC-AUTH-SAML 1.0

Mathematics Discrete Mathematics (F06-S07), Finite Mathematics (F00-S01, S07), Intermediate Algebra

Enterprise Software Engineering MSc

SCHOOL OF COMPUTING & MATHEMATICAL SCIENCES. Computer Science MSc. Greenwich Campus.

Computer Science. 232 Computer Science. Degrees and Certificates Awarded. A.S. Degree Requirements. Program Student Outcomes. Department Offices

Jennifer L. Davidson

Masters in Information Technology

Integrated System Modeling for Handling Big Data in Electric Utility Systems

Reverse Engineering in Data Integration Software

Terry Ann Morris, Ed.D.

Georgetown Center for the Constitution GEORGETOWN LAW

MSc in Network Centred Computing. For students entering in October contributions from other EU universities Faculty of Science

Transcription:

InfoVis Cyberinfrastructure Katy Börner School of Library and Information Science katy@indiana.edu SLIS Colloquium, November 19 th, 2004 http://iv.slis.indiana.edu/sw http://iv.slis.indiana.edu/db http://iv.slis.indiana.edu/cr http://iv.slis.indiana.edu/lm 1

Motivation IVC Database Provide access to major scholarly databases. IVC Software Framework Support developers and programmers in the comparison and distribution of new algorithms. Interconnect algorithm developers and users. What algorithms do users need/want? IVC Learning Modules Support (non-programmer) users in the utilization of advanced InfoVis algorithms. Provide a unique resource for InfoVis education. Support InfoVis & Knowledge Domain Visualization research. Publications about the Infrastructure Börner, Katy and Zhou, Yuezheng. (2001) A Software Repository for Education and Research in Information Visualization. Information Visualisation Conference, London, England, July 25-27, pp. 257-262. Baumgartner, Jason and Börner, Katy (2002). Towards an XML Toolkit for a Software Repository Supporting Information Visualization Education. IEEE Information Visualization Conference, Boston, MA, 2002. Interactive Poster. Baumgartner, Jason, Börner, Katy, Deckard, Nathan J., Sheth, Nihar. (2003). An XML Toolkit for an Information Visualization Software Repository. Poster Compendium, IEEE Information Visualization Conference, pp. 72-73. Penumarthy, Shashikant, Börner, Katy and Herr, Bruce. Information Visualization Cyberinfrastructure Software Framework. Submitted to Information Visualization. Moral: Do not do infrastructure development if you need/want scholarly publications. 2

Grants Center of Excellence for Computational Diagnostics. 21st Century Grant (Susanne Ragg, David Clemmer, Sven Rahmann, and Ilka Ott, Terry Vik, R Clement McDonald, Nunroe Pecock, Zina Ben Miled & Katy Börner, $1,994,951) Sept. 04 - Aug. 06. Sun Center of Excellence in Knowledge Management and Discovery, SUN Microsystems (Stephanie Burks, Katy Börner, Zina Ben-Miled), March 2004. Outstanding Junior Faculty Award. (Principal Investigator, $14,000), 2004. Data-Code-Computing Infrastructure for Data Mining, Modeling, and Visualization Research and Education. Pervasive Technology Labs Fellowship (Principal Investigator, $48,750) Sept. 2003 - Aug. 2004. CAREER: Visualizing Knowledge Domains. NSF IIS-0238261 award (Principal Investigator, $400,000) Sept. 2003-Aug. 2008. Information Visualization Learning Modules. SBC (formerly Ameritech) Fellow Grant (Principal Investigator, $15,000) May 2003-June 2004. Development of a Spatial-Experimental Laboratory for Research and Policy Analysis Related to Complex Systems. NSF/BCS-0215738 Major Research Instrumentation Grant. This project benefits multiple departments at IU. (PIs are Elinor Ostrom, Jerome Busemeyer, Tom Evans, Robert Huckfeldt & James Walker $847,874) Aug. 2002-July 2007. Moral: A good infrastructure (development) attracts grant funding. http://iv.slis.indiana.edu/sw http://iv.slis.indiana.edu/db http://iv.slis.indiana.edu/cr http://iv.slis.indiana.edu/lm 3

IVC Database The Team Design and Implementation Jay Askren Saiful Bahari Andrew Bangert Christopher Friend Stephanie Gato Todd Holloway (Lead) Ruchi Kapoor Ketan Mane Lalitha Visvanath Qian Wang Jose Montalvo Elijah Wright Graphic Design Caroline Courtney Project Start September 2003 IVC Database - System Overview Oracle/Apache/Tomcat/Java Well understood and reliable tools Oracle DB Several terabytes of data Relational design Allow for more collections to be added Search Engine Search on abstract, author, title, journal, date published, and more User login for both IU and non-iu users User histories Administration of data and user accounts Compressed downloading of results Term-by-document and co-author matrices of results 4

IVC Database - Data Sets (http://iv.slis.indiana.edu/db) 5

http://iv.slis.indiana.edu/sw http://iv.slis.indiana.edu/db http://iv.slis.indiana.edu/cr http://iv.slis.indiana.edu/lm http://iv.slis.indiana.edu/sw http://iv.slis.indiana.edu/db http://iv.slis.indiana.edu/cr http://iv.slis.indiana.edu/lm 6

IVC Software Framework The Team Master Minds/Programmers Jason Baumgartner, SLIS Nathan James Deckard, CS Nihar Sheth, Informatics Bruce William Herr, CS Shashikant Penumarthy, SLIS Graphic Design Caroline Courtney, Fine Art Project Start 2001 Algorithm Development and Integration Vivek Agrawal, Summer Intern Renee LeBeau, SLIS Josh Bonner, CS Todd Holloway, CS Jeegar Maru, CS Laura Northrup, CS Sriram Raghuraman, Informatics Nihar Sanghvi, Informatics Hardik Sheth, Informatics Sidharth Thakur, CS Ning Yu, SLIS Yuezheng Zhou, CS Students taking K. R. Subramanian s (UNC Charlotte) InfoVis class integrated diverse algorithms into the IVC. IVC Software Framework Algorithms (http://iv.slis.indiana.edu/sw) 7

8

9

10

11

IVC Software Framework Core (http://iv.slis.indiana.edu/sw) Demo IVC Software Framework 12

Downloads via Sourceforge since June 21, 2004 (http://sourceforge.net/projects/ivc) http://vw.indiana.edu/ivsi2004/ 13

http://iv.slis.indiana.edu/sw http://iv.slis.indiana.edu/db http://iv.slis.indiana.edu/cr http://iv.slis.indiana.edu/lm IVC Learning Modules (http://iv.slis.indiana.edu/iv) 14

Visualizing Tree Data http://iv.slis.indiana. edu/lm/lmtrees.html Student s Project Results User & Task Analysis for Visualizing Tree Data Visualizing the structure of IU s Decision Support System Visualizing the co-occurences of keywords in DLib Magazine articles. Visualization of the Java API Visualizing the the Library of Congress Classification System to retrieve legal materials in a library. See Handin pages at http://ella.slis.indiana.edu/~katy/ handin/l579-s04/cgi/handinlogin.cgi Image by Peter Hook and Rongke Gao 15

Time Series Analysis & Visualization http://iv.slis.indiana.edu /lm/lm-time-series.html Student s Project Results Time Series Analysis & Visualization Using Timesearcher and the Burst Detection Algorithm to Analyze the Stock Market from 1925 to 1945 Applying Burst and TimeSearcher to Chat Data Lab Access Trends Quest Atlantis Chat Log Data See Handin pages at http://ella.slis.indiana.edu/~katy/handin/l579-s04/cgi/handinlogin.cgi 16

Visualizing the Work of the United States Supreme Court Based on Time Data and Top Level West Topics by Peter A. Hook & Rongke Gao Top fifteen most occurring topics from 1944 to 2004 in Timesearcher All topics grouped by West Category and Sub-Category grouped over the entire lengths of All topics by West Category and Sub-Category grouped the data set corresponding to the five chief justices Visualizing Niches of the Blog Universe BY Mike Tyworth and Elijah Wright Visualizing niches of the blog universe. 17

L579 Information Visualization Spring 2005 This course covers Perceptual basis of information visualization. Data mining algorithms that enable extraction of relationships in data. Visualization and interaction techniques. Discussions of systems that drive research and development, and Future trends and remaining fundamental problems in the field. Students do weekly readings, provide a presentation on specific readings, do projects, and participate in class & online discussion. Class Webpage: http://ella.slis.indiana.edu/~katy/l579 L597 Structural Data Mining and Modeling Fall 2005 This course Introduces students to major methods, theories, and applications of structural data mining and modeling. Covers elementary graph theory and matrix algebra, data collection, structural data mining, data modeling, and applications. Upon taking this course students will be able to analyze and describe real networks (power grids, WWW, social networks, etc.) as well as relevant phenomena such as disease propagation, search, organizational performance, social power, and the diffusion of innovations. Format: Lectures and 4-5 labs. Class Webpage: http://ella.slis.indiana.edu/~katy/l597 18

Future Work IVC Database Create tables/upload Citeseer, 110 year Physical Review journals dataset, etc. Optimize online interface and make it available to other researchers. Create connections to R and other packages for large scale (network) data analysis. Document, document, document. IVC Software Framework Release IVC core as alpha. Integrate a lot more algorithms. IVC Learning Modules Write new learning modules as new algorithms become available. User test learning modules. Outreach There will be a Data Analysis, Modeling and Visualization Tutorial @ Electronic Imaging, San Jose, CA, Jan 16th, 2005 which uses the IVC infrastructure. Do RESEARCH using this infrastructure! Acknowledgements Craig A. Stewart, Mary Papakhian, Anurag Shankar all UITS generously made the Research Database Complex available for this project and provided very insightful comments. Stephanie Burks, Principal Unix Systems Administrator, Research and Technical Services, UITS has been instrumental in setting up the computing infrastructure and administration of the Oracle database. 19

20