VIVO Dashboard: A Drupal-based tool for harvesting and executing sophisticated queries against data from a VIVO instance




Paul Albert, Miles Worthington and Don Carpenter

Chapter I: The Problem

Administrators are avid consumers of institutional data.

In 2011, the Dean's Office requested the following reports:
- List of publications in which a Weill Cornell author was first or last author
- List of publications in which a Weill Cornell author was first or last author, appearing in journals with an impact factor > 15

This is a fairly reasonable request compared to others

Chapter II: Legacy Approach

Why this approach is poor: asynchronous, can't sort or facet, and requires a lot of sweat

Sample SPARQL query

    SELECT DISTINCT ?article1_pmid ?person1_cwid ?authorship1_authorRank
    WHERE {
      ?article1 rdf:type bibo:Document .
      ?article1 vivo:informationResourceInAuthorship ?authorship1 .
      ?article1 bibo:pmid ?article1_pmid .
      ?authorship1 rdf:type vivo:Authorship .
      ?authorship1 vivo:authorRank ?authorship1_authorRank .
      ?authorship1 vivo:linkedAuthor ?person1 .
      ?person1 rdf:type foaf:Person .
      ?person1 wcmc:cwid ?person1_cwid .
    }
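The query above returns one row per authorship (PMID, CWID, author rank), so the "first or last author" report still has to be assembled afterwards. The following is a minimal sketch of that post-processing step, not the authors' actual script. The row layout and the `n_authors` field (total authors on the article) are assumptions made for illustration; the query as shown does not return an author count.

```python
from typing import Dict, List


def first_or_last_author_pmids(rows: List[Dict[str, object]]) -> List[str]:
    """Return the PMIDs where an institutional author is ranked first or last."""
    hits = set()
    for row in rows:
        rank = int(row["rank"])
        # n_authors is a hypothetical extra field; the query above does not return it.
        if rank == 1 or rank == int(row["n_authors"]):
            hits.add(str(row["pmid"]))
    return sorted(hits)


# Made-up rows, one per authorship, purely for illustration.
rows = [
    {"pmid": "111", "cwid": "abc2001", "rank": 1, "n_authors": 4},
    {"pmid": "222", "cwid": "abc2001", "rank": 2, "n_authors": 5},
    {"pmid": "333", "cwid": "xyz2002", "rank": 6, "n_authors": 6},
]

print(first_or_last_author_pmids(rows))  # ['111', '333']
```

Every new report of this kind needs another hand-written filter like this one, which is part of the "sweat" in the legacy approach.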

Chapter III: Let's create a prototype

Goal of VIVO Dashboard: empower untrained users to run sophisticated semantic queries on Weill Cornell faculty publications*
* Secondary directive: kill Sarah Connor

[Screenshot: VIVO Dashboard Publications view with Graph, List, and Export tabs. Caption: "The following publications are for all publications by active Weill Cornell Medical College faculty as represented in VIVO." Facets shown: Date (2009 - Present); Publication Type (Research Article (657), In Process (55), Review (45), Clinical Guideline (32), more...); Journal ranking (15.4-68.3); Journal Name; Author Name.]

"Invention is 1% inspiration and 99% perspiration." Thomas A. Edison (Source: Yahoo Answers)

Chapter IV: Technology Stack of next version

VIVO Dashboard is an installation profile based on Drupal

Why Drupal?
Familiar platform
- Used at many institutions
- Developers are familiar with the technology
- Easy to host
Existing solutions
- Many modules available to solve common use cases
- Same application could be built with any other web platform, but Drupal saves a huge amount of development effort

VIVO Dashboard leverages existing Drupal modules*
Faceted search
- Search API module
- Facet API module
Data import
- Feeds module
- Linked data import module
* All of which are actively maintained and supported
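To make "faceted search" concrete: a facet is a count of matching records per field value, and selecting a value narrows the result set. The sketch below shows that idea in plain Python over in-memory records. It is an illustration only, not how Search API, Facet API, or Solr implement it, and the field names and sample records are made up.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple


def facet_counts(records: Iterable[Dict[str, str]], field: str) -> List[Tuple[str, int]]:
    """Count records per value of `field`, most frequent first."""
    counts = Counter(r[field] for r in records if field in r)
    return counts.most_common()


def apply_facet(records: Iterable[Dict[str, str]], field: str, value: str) -> List[Dict[str, str]]:
    """Keep only the records whose `field` equals the selected facet value."""
    return [r for r in records if r.get(field) == value]


# Hypothetical publication records.
publications = [
    {"title": "Paper A", "type": "Research Article", "journal": "Journal X"},
    {"title": "Paper B", "type": "Research Article", "journal": "Journal Y"},
    {"title": "Paper C", "type": "Review", "journal": "Journal X"},
]

print(facet_counts(publications, "type"))
print([p["title"] for p in apply_facet(publications, "type", "Review")])
```

Counts such as "Research Article (657)" in the dashboard's sidebar are the same kind of per-value tally, computed by the search backend rather than in application code.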

Key Module: Linked Data Import
- Feeds plugin for linked data as a data source
- Uses an open source library called ARC2 for requesting and parsing RDF
- Authored by Miles Worthington
- Originally created for Cornell's CALS Research & Impact site

Key Module: Feeds
- Offers a generic import system for Drupal
- Handles scheduling
- Maps various data sources (RSS, CSV, SQL) into Drupal content (nodes, taxonomy terms)
- Manages custom data sources via plugins

Other technology
Stores content using the robust indexing application, Apache Solr
AJAX
Key modules
- Apache Solr
- Elysia cron
- D3.js (visualization library)
- Charts and graphs
- VIVO Dashboard Core, Publications, and Import (custom modules)

Demonstration

Chapter V: Import process

Get a list of all publication URIs from VIVO

Go through list of URIs, request RDF for each

Request necessary related data for each article

Map VIVO RDF to Drupal structures

Repeat for all publication URIs
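The five steps above can be sketched as a single loop. This is a rough outline under stated assumptions, not the code of the Feeds or Linked Data Import modules (which are PHP). All four callables and the triple representation are hypothetical stand-ins, injected so the control flow can be read and run without a VIVO instance.

```python
from typing import Callable, Dict, Iterable, List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object); a simplification of RDF


def import_publications(
    list_publication_uris: Callable[[], Iterable[str]],
    fetch_rdf: Callable[[str], Iterable[Triple]],
    related_uris: Callable[[List[Triple]], Iterable[str]],
    map_to_record: Callable[[str, List[Triple]], Dict[str, object]],
) -> List[Dict[str, object]]:
    """Harvest every publication and map it to a local record."""
    records = []
    # Step 1: get the list of publication URIs. Step 5: repeat for each of them.
    for uri in list_publication_uris():
        # Step 2: request the RDF for this publication (copied so we can extend it).
        triples = list(fetch_rdf(uri))
        # Step 3: request related data (e.g. journal, authors) for the article.
        for related in list(related_uris(triples)):
            triples.extend(fetch_rdf(related))
        # Step 4: map the collected RDF to the local structure.
        records.append(map_to_record(uri, triples))
    return records
```

Because each publication costs at least one request plus one per related resource, the number of requests grows with the size of the VIVO instance.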

Chapter VI: How to install VIVO Dashboard

"I am not a smart programmer, but even I can set up a VIVO Dashboard." - Me

Code and instructions: github.com/paulalbert1/vivodashboard

Install Drush locally and create a repository
Install Drush on your local machine by following the instructions at: https://github.com/drush-ops/drush
Run the following command:
    drush make https://raw.github.com/paulalbert1/vivodashboard/master/distro.make vivo-dashboard

Create a new site in Pantheon (also easily done in Acquia)

Put the Pantheon site in SFTP mode

Using SFTP connection info on previous screen, copy local code to Pantheon.

Wipe database

Visit development site

Install profile (resulting screen)

You are now taken to VIVO Dashboard's home page.

Select a fetcher (the VIVO Class Fetcher requires VIVO 1.7 if you have > 30,000 objects)

Set up the publications import

Enter site URL and top-level article class
- Site URL: http://vivo.med.cornell.edu
- Top-level article class: http://purl.org/ontology/bibo/Document

Define URIs of custom classes

Hide certain types

Journal ranking data can be imported from SCImago Journal Rank (the Pearson correlation between Impact Factor and SCImago Journal Rank, c. 2006, is 0.915)
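For readers unfamiliar with the statistic quoted above, here is a small sketch of how a Pearson correlation between two journal metrics is computed. The numbers in the usage example are made-up toy values, not real Impact Factor or SCImago data, and they do not reproduce the 0.915 figure.

```python
import math
from typing import Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length samples."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations (covariance up to a constant factor).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Toy values for five hypothetical journals.
impact_factor = [1.2, 3.4, 5.1, 9.8, 30.0]
sjr_score = [0.4, 1.1, 2.0, 3.9, 11.5]
print(round(pearson(impact_factor, sjr_score), 3))
```

A value close to 1 means the two rankings order journals in nearly the same way, which is the argument for using the freely available metric as a stand-in for Impact Factor.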

Chapter VII: Final Thoughts

Advantages of this approach
- Leverages standard VIVO features
- No SPARQL endpoint required
- No authentication necessary
- Configured through an admin UI
- Only major ontology changes require code changes

You can use VIVO Dashboard's data harvesting approach to create new apps to do other kinds of data visualization and analysis

Disadvantages of this approach
- Takes a long time
- Drupal/PHP is not designed for long-running jobs

Future work: make VIVO Dashboard compatible with VIVO-ISF 1.6

Data dashboards tend to elicit a Highlander-type response among administrators