Streamlining analytics and visualization infrastructure at the University of Calgary

Size: px
Start display at page:

Download "Streamlining analytics and visualization infrastructure at the University of Calgary"

Transcription

1 Streamlining analytics and visualization infrastructure at the University of Calgary Submitted to the Vice- President (Research) by Advisory Committee on Analytics & Visualization (ACAV) Big data = big opportunity When he was a kid in 1969, James Gosling saw his first computer in a lab at the University of Calgary. He thought it was so cool that he started coming back to the computer science lab breaking in by figuring out the simple combination lock on the door and using one of the smallest computers, the size of a refrigerator, to teach himself how to write code. Gosling graduated from the University of Calgary with his BSc in 1977 and went on to create Java, the universal programming language that helped the Internet develop. In the four decades since, the Internet and technological infrastructure has quite simply revolutionized our society, our lives and our work. And the revolution continues. Information technology is advancing at a staggeringly fast rate. Along with the hardware and software, the exponential proliferation of data is continuing it s estimated that 90 per cent of the data in the world has been created in the last few years. And it all has to be stored, analyzed and understood so that better decisions can be made for the future. The University of Calgary is striving to become one of the top research universities in the country. Our scholars are ever active in collecting and creating more data. Big data is creating a big opportunity for the university to organize and streamline how we manage our essential cyberinfrastructure, how we make sense of vast amounts of research data, and how we increase the impact of research results on society. But we have not only reached, but surpassed, our capacity in high powered computing. Our inability to keep pace with technology is slowing us down. And if we don t speed it up, we will no longer be competitive. For example, advancements in cryo electron microscopy which allows researchers to see individual proteins in atomistic detail is revolutionizing structural biology and accelerating the amount of structural information available. But without sufficient HPC, our very strong concentration of experts in simulations of proteins won t be able to compete. Our researchers are losing ground to groups in other universities and we are at risk of being unable to attract funding, excellent graduate students, postdocs and collaborators. Without increasing our HPC capacity to be competitive, some of our scholar s research programs are not viable. Consider: The Reservoir Simulation Group at the Schulich School of Engineering needs to run data with 2 billion grid cells to accurately predict the performance of a petroleum reservoir. But that s impossible given the current compute power on campus. The group has an IBM cluster with limited memory size and speed and so it also uses the Parallel cluster to run large- scale reservoir

2 simulations. Parallel s memory speed isn t fast enough and it s often bottlenecked. Because many labs use Parallel, the queue to run large simulations can be weeks or more. Libin Institute researcher Wayne Chen has discovered a protein, ryanodine receptor, that s responsible for the initiation of calcium waves and calcium- triggered arrhythmias. This will lead to a better understanding of the molecular basis of anti- arrhytmic treatment. But we do not have sufficient HPC to analyze the protein and capitalize on our expertise. The research team was able to make temporary collaborations with institutions that do have the computing capacity. Computational Biophysics researcher Gurpreet Singh needed to run a scaling test for a molecular simulation that required using 50 Nvidia graphics processing units (GPUs) for 20 minutes. The WestGrid/Compute Canada Parallel cluster is the only machine in Western Canada capable of running the test and it is booked solid. Luckily, the cluster was booked for an annual operating system upgrade and Singh s job was allowed to run. But without an outage of some kind, our researchers had no chance to access the machine in the near future. Mark Lowerison of the Clinical Research Unit (CRU) was also able to take advantage of WestGrid down times. He needed to run 300,000 simulations utilizing the R statistical package within a three week time period. Each run needed five to 90 minutes. Normally, this would be impossible. In this case, the annual upgrade and subsequent down time created spare cycles on the three WestGrid clusters. Plugging in to high performance computing (HPC) Having significant compute and storage capacity for all researchers at the university high performance computing is every bit as essential as having sufficient electricity to power our operations. We have a state of the art cogeneration plant to help provide our main campus with power. It s time we had a strategy to ensure we have access to leading edge compute technology. Our current cyberinfrastructure is at capacity. As individual researchers and projects apply for funding and procure technology, we re not always acquiring the technology that would best serve the entire university community nor are we creating a sustainable support infrastructure. The three WestGrid clusters on campus are already over- committed to researchers from the Compute Canada catchment area. We have little room to maneuver except when an outage is scheduled (as the examples above illustrate). The WestGrid machines are due for retirement starting in 2016, but replacement clusters which still would need to be funded will only replace the current cycles, they will not add much needed supplemental capacity. And, the new Compute Canada clusters will not be placed on- campus but at other institutions reducing our ability to access downtime cycles. The global trend in HPC is toward sharing services and we see Campus Alberta capacity as a pre- requisite for many projects. With very few exceptions, having access and control is more important than the physical location of any equipment. It s imperative that we are competitive. We have to ensure that our research community has the capacity it requires. We have to be more strategic in how and what cyber infrastructure we acquire as well as how me make it accessible to researchers on campus and beyond. The Advisory Committee on Analytics & Visualization (ACAV) was struck in 2013 by the Vice President (Research) to examine the university s growing requirements for technology, our current methods of procuring technology through research grant proposals and other means and identify and explore opportunities for improvement.

3 Strategic recommendations for the path forward Take inventory of all major analytics and visualization equipment across campus and define clear paths to access and support it. Surveying faculties and departments to provide a clearer picture of cyberinfrastructure demands, requirements and bottlenecks. The survey could also help make an asset map that incorporates research interests. As well as the hardware and infrastructure, we need sufficient support and the correct expertise to manage it. These expenses should be considered an annual operational expenditure not a capital one. The level of cyberinfrastructure service provided needs to be evaluated regularly to ensure sufficient capacity. Cleary define points of contact to manage and access cyberinfrastructure on and off campus, including senior level research personnel to coordinate initiatives across faculties, an UCIT contact person for system architecture and data scientists. Access and control to HPC resources is more important than physical location. Deployment of new cyber- infrastructure in non- UofC data centres could be used if it is cost effective. Ensure major cyber- infrastructure grants are coordinated centrally. The university needs to provide a high- level of service and cover partial costs to encourage researchers to contribute to a shared infrastructure. Researchers successful in major infrastructure grant proposals will have priority access to the equipment while surplus capacity if made available to other researchers on Campus. Develop an institutional strategy to catalogue and curate a research data library and make it available for future research projects. The research data archive can also facilitate sharing data which would enable collaborations with people inside and outside the university. Different researchers and research groups would have easy access to data in a controlled and secure manner. Create a Digital Data Commons (DDC), a physical space, as a nexus for collaboration o Sharing space will facilitate face- to- face collaboration between analytics and visualization research groups and application researchers. Core analytics/vis researchers can be hosted in this space and be joined by other researchers with needs that could be addressed by big data analytics. o o The digital data commons should have oversight on shared digital research infrastructure (I.e. HPC, data analytics hardware, curated archives of research data) Support personal should be part of the DDC + research leadership needs to be provided (I.e. AKA an academic director) Develop benchmarks in 2015/16 that would demonstrate progress on access to HPC/data analytics capabilities. Strategic recommendations for infrastructure investment Invest in general research data storage and archiving. This would benefit many groups across campus and would also help meet requirements of tri- council and other funding bodies to keep research data for five years or longer. Invest in an analytics cloud system, such as one based on Hadoop. To overcome our currently limited capacity, we need to engage with partner organizations to increase abilities. Support personal and data scientists need to be made available to all researchers.

4 Invest in super computers that combine GPUs or other accelerators with more standard CPUs. Canada lags significantly in this area with no large GPU- based HPC facilities. Storing results in separate facilities and moving them to local equipment for analysis is no longer adequate where data analysis requires the power of the HPC facility combined with substantial storage for intermediate and final results (e.g. genomics, large- scale biomolecular simulation, whole- cell simulations, other computational biology, materials research, large- scale geospatial modeling). Invest in next generation sequencing. Its related techniques already provide severe challenges for storing data in genomics and bioinformatics. Certain datasets have specific security and privacy concerns, particularly in the growing field of patient- related data. Sequencing data could soon be linked with patient data to enable personalized medicine, creating specific requirements for secure storage. Invest in large- scale visualization equipment for making sense of vast amounts of data and inviting opportunities for collaborative work. Upgrade two existing major visualization facilities (CCIT and TFDL), including more powerful graphics hardware driving the CCIT projectors Invest in software such as TechViz XL to allow seamless integration of key software applications (such as Petrel, Matlab and Paraview) with our virtual reality environments. Further demand may exist for: High- throughput streaming analytics capacity will be of use for oil/gas/health. An important application area has been smart cities and analytics problems related to managing the cities of the future. Future of ACAV: Strategic recommendations Identify co- chairs that can act as first point of contact for researchers and senior administration. ACAV becomes a strategic oversight committee that will among other things, review strategic recommendations every few years. Sponsor domain- specific networking events that connect researchers across campus with complimentary research interests. Examples could be Informatics for the life sciences or Energy analytics.

5 ACAV committee members Name Frank Maurer (co-chair) Sam Wiebe (co-chair) Carey Williamson Christopher Hugenholtz Deborah Marshall Jason de Koning Karen Bourrier Kim Koh Laleh Behjat Loren Falkenberg Michael Ranelli Michael Ullyot Parsa Samavati Paul Galpern Peter Tieleman Robin Winsor Sergei Noskov Sheelagh Carpendale Stafford Dean Steve Liang Thomas Hickerson Faculty/Department Medicine/CRU Geography Community Health Sciences Genomics/ACRI Arts Education ENEL Haskayne UCIT Arts Undergrads EVDS Bio Cybera Biological Sciences AHS, DIMR Geomatics Library

Using GPUs in the Cloud for Scalable HPC in Engineering and Manufacturing March 26, 2014

Using GPUs in the Cloud for Scalable HPC in Engineering and Manufacturing March 26, 2014 Using GPUs in the Cloud for Scalable HPC in Engineering and Manufacturing March 26, 2014 David Pellerin, Business Development Principal Amazon Web Services David Hinz, Director Cloud and HPC Solutions

More information

White Paper. Version 1.2 May 2015 RAID Incorporated

White Paper. Version 1.2 May 2015 RAID Incorporated White Paper Version 1.2 May 2015 RAID Incorporated Introduction The abundance of Big Data, structured, partially-structured and unstructured massive datasets, which are too large to be processed effectively

More information

Accelerate > Converged Storage Infrastructure. DDN Case Study. ddn.com. 2013 DataDirect Networks. All Rights Reserved

Accelerate > Converged Storage Infrastructure. DDN Case Study. ddn.com. 2013 DataDirect Networks. All Rights Reserved DDN Case Study Accelerate > Converged Storage Infrastructure 2013 DataDirect Networks. All Rights Reserved The University of Florida s (ICBR) offers access to cutting-edge technologies designed to enable

More information

The Fusion of Supercomputing and Big Data. Peter Ungaro President & CEO

The Fusion of Supercomputing and Big Data. Peter Ungaro President & CEO The Fusion of Supercomputing and Big Data Peter Ungaro President & CEO The Supercomputing Company Supercomputing Big Data Because some great things never change One other thing that hasn t changed. Cray

More information

Powering Cutting Edge Research in Life Sciences with High Performance Computing

Powering Cutting Edge Research in Life Sciences with High Performance Computing A Point of View Powering Cutting Edge Research in Life Sciences with High Performance Computing High performance computing (HPC) is the foundation of pioneering research in life sciences. HPC plays a vital

More information

The University of Alabama at Birmingham. Information Technology. Strategic Plan 2011 2013

The University of Alabama at Birmingham. Information Technology. Strategic Plan 2011 2013 The University of Alabama at Birmingham Information Technology Strategic Plan 2011 2013 Table of Contents Message from the Vice President... 3 About UAB... 4 About UAB Information Technology Meeting needs

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) Goal Develop and deploy comprehensive, integrated, sustainable, and secure cyberinfrastructure (CI) to accelerate research

More information

The GPU Accelerated Data Center. Marc Hamilton, August 27, 2015

The GPU Accelerated Data Center. Marc Hamilton, August 27, 2015 The GPU Accelerated Data Center Marc Hamilton, August 27, 2015 THE GPU-ACCELERATED DATA CENTER HPC DEEP LEARNING PC VIRTUALIZATION CLOUD GAMING RENDERING 2 Product design FROM ADVANCED RENDERING TO VIRTUAL

More information

Jean-Pierre Panziera Teratec 2011

Jean-Pierre Panziera Teratec 2011 Technologies for the future HPC systems Jean-Pierre Panziera Teratec 2011 3 petaflop systems : TERA 100, CURIE & IFERC Tera100 Curie IFERC 1.25 PetaFlops 256 TB ory 30 PB disk storage 140 000+ Xeon cores

More information

Make the Most of Big Data to Drive Innovation Through Reseach

Make the Most of Big Data to Drive Innovation Through Reseach White Paper Make the Most of Big Data to Drive Innovation Through Reseach Bob Burwell, NetApp November 2012 WP-7172 Abstract Monumental data growth is a fact of life in research universities. The ability

More information

Teaching Computational Thinking using Cloud Computing: By A/P Tan Tin Wee

Teaching Computational Thinking using Cloud Computing: By A/P Tan Tin Wee Teaching Computational Thinking using Cloud Computing: By A/P Tan Tin Wee Technology in Pedagogy, No. 8, April 2012 Written by Kiruthika Ragupathi (kiruthika@nus.edu.sg) Computational thinking is an emerging

More information

Stream Processing on GPUs Using Distributed Multimedia Middleware

Stream Processing on GPUs Using Distributed Multimedia Middleware Stream Processing on GPUs Using Distributed Multimedia Middleware Michael Repplinger 1,2, and Philipp Slusallek 1,2 1 Computer Graphics Lab, Saarland University, Saarbrücken, Germany 2 German Research

More information

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems Volker Markl volker.markl@tu-berlin.de dima.tu-berlin.de dfki.de/web/research/iam/ bbdc.berlin Based on my 2014 Vision Paper On

More information

Future Directions in Canadian Research Computing: Complexities of Big Data TORONTO RESEARCH MANAGEMENT SYMPOSIUM, DECEMBER 4, 2014

Future Directions in Canadian Research Computing: Complexities of Big Data TORONTO RESEARCH MANAGEMENT SYMPOSIUM, DECEMBER 4, 2014 Future Directions in Canadian Research Computing: Complexities of Big Data TORONTO RESEARCH MANAGEMENT SYMPOSIUM, DECEMBER 4, 2014 1 Role of ARC* Today * Advanced Research Computing New Paradigms Simulation:

More information

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud February 25, 2014 1 Agenda v Mapping clients needs to cloud technologies v Addressing your pain

More information

OpenCB a next generation big data analytics and visualisation platform for the Omics revolution

OpenCB a next generation big data analytics and visualisation platform for the Omics revolution OpenCB a next generation big data analytics and visualisation platform for the Omics revolution Development at the University of Cambridge - Closing the Omics / Moore s law gap with Dell & Intel Ignacio

More information

Jeff Wolf Deputy Director HPC Innovation Center

Jeff Wolf Deputy Director HPC Innovation Center Public Presentation for Blue Gene Consortium Nov. 19, 2013 www.hpcinnovationcenter.com Jeff Wolf Deputy Director HPC Innovation Center This work was performed under the auspices of the U.S. Department

More information

New Jersey Big Data Alliance

New Jersey Big Data Alliance Rutgers Discovery Informatics Institute (RDI 2 ) New Jersey s Center for Advanced Computation New Jersey Big Data Alliance Manish Parashar Director, Rutgers Discovery Informatics Institute (RDI 2 ) Professor,

More information

IU Cyberinfrastructure Overview

IU Cyberinfrastructure Overview IU Cyberinfrastructure Overview, Cyberinfrastructure and Service Center Indiana University Pervasive Technology Institute Science Storage Computation Analysis/ Bio/Health Visualization Campus Education/

More information

SGI HPC Systems Help Fuel Manufacturing Rebirth

SGI HPC Systems Help Fuel Manufacturing Rebirth SGI HPC Systems Help Fuel Manufacturing Rebirth Created by T A B L E O F C O N T E N T S 1.0 Introduction 1 2.0 Ongoing Challenges 1 3.0 Meeting the Challenge 2 4.0 SGI Solution Environment and CAE Applications

More information

NVIDIA GPUs in the Cloud

NVIDIA GPUs in the Cloud NVIDIA GPUs in the Cloud 4 EVOLVING CLOUD REQUIREMENTS On premises Off premises Hybrid Cloud Connecting clouds New workloads Components to disrupt 5 GLOBAL CLOUD PLATFORM Unified architecture enabled by

More information

The digital future and dealing with disruption

The digital future and dealing with disruption The digital future and dealing with disruption Dr Giles Nelson, Senior Vice President of Product Marketing and Strategy May 2015 1 BIG CHANGE due to digitization 2 2 billion internet users worldwide 40%

More information

Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing

Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing July 7, 2014 David Pellerin, Business Development Principal Amazon Web Services What Do We Hear From Customers?

More information

Bringing Big Data Modelling into the Hands of Domain Experts

Bringing Big Data Modelling into the Hands of Domain Experts Bringing Big Data Modelling into the Hands of Domain Experts David Willingham Senior Application Engineer MathWorks david.willingham@mathworks.com.au 2015 The MathWorks, Inc. 1 Data is the sword of the

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.3 Selected Standards

More information

THE EFFECTS OF BIG DATA ON INFRASTRUCTURE. Sakkie Janse van Rensburg Dr Dale Peters UCT 1

THE EFFECTS OF BIG DATA ON INFRASTRUCTURE. Sakkie Janse van Rensburg Dr Dale Peters UCT 1 THE EFFECTS OF BIG DATA ON INFRASTRUCTURE Sakkie Janse van Rensburg Dr Dale Peters UCT 1 BIG DATA DEFINED Gartner in 2001 first coined the phrase Big Data Volume Velocity Big data is a popular term used

More information

BIG DATA & DATA SCIENCE

BIG DATA & DATA SCIENCE BIG DATA & DATA SCIENCE ACADEMY PROGRAMS IN-COMPANY TRAINING PORTFOLIO 2 TRAINING PORTFOLIO 2016 Synergic Academy Solutions BIG DATA FOR LEADING BUSINESS Big data promises a significant shift in the way

More information

Copyright www.agileload.com 1

Copyright www.agileload.com 1 Copyright www.agileload.com 1 INTRODUCTION Performance testing is a complex activity where dozens of factors contribute to its success and effective usage of all those factors is necessary to get the accurate

More information

Big Data: Overview and Roadmap. 2015 eglobaltech. All rights reserved.

Big Data: Overview and Roadmap. 2015 eglobaltech. All rights reserved. Big Data: Overview and Roadmap 2015 eglobaltech. All rights reserved. What is Big Data? Large volumes of complex and variable data that require advanced techniques and technologies to enable capture, storage,

More information

I. Justification and Program Goals

I. Justification and Program Goals MS in Data Science proposed by Department of Computer Science, B. Thomas Golisano College of Computing and Information Sciences Department of Information Sciences and Technologies, B. Thomas Golisano College

More information

HPC Growing Pains. Lessons learned from building a Top500 supercomputer

HPC Growing Pains. Lessons learned from building a Top500 supercomputer HPC Growing Pains Lessons learned from building a Top500 supercomputer John L. Wofford Center for Computational Biology & Bioinformatics Columbia University I. What is C2B2? Outline Lessons learned from

More information

High Performance Computing

High Performance Computing High Parallel Computing Hybrid Program Coding Heterogeneous Program Coding Heterogeneous Parallel Coding Hybrid Parallel Coding High Performance Computing Highly Proficient Coding Highly Parallelized Code

More information

The IBM Solution Architecture for Energy and Utilities Framework

The IBM Solution Architecture for Energy and Utilities Framework IBM Solution Architecture for Energy and Utilities Framework Accelerating Solutions for Smarter Utilities The IBM Solution Architecture for Energy and Utilities Framework Providing a foundation for solutions

More information

Cornell University Center for Advanced Computing A Sustainable Business Model for Advanced Research Computing

Cornell University Center for Advanced Computing A Sustainable Business Model for Advanced Research Computing Cornell University Center for Advanced Computing A Sustainable Business Model for Advanced Research Computing David A. Lifka lifka@cac.cornell.edu 4/20/13 www.cac.cornell.edu 1 My Background 2007 Cornell

More information

Computational infrastructure for NGS data analysis. José Carbonell Caballero Pablo Escobar

Computational infrastructure for NGS data analysis. José Carbonell Caballero Pablo Escobar Computational infrastructure for NGS data analysis José Carbonell Caballero Pablo Escobar Computational infrastructure for NGS Cluster definition: A computer cluster is a group of linked computers, working

More information

Parallel Computing: Strategies and Implications. Dori Exterman CTO IncrediBuild.

Parallel Computing: Strategies and Implications. Dori Exterman CTO IncrediBuild. Parallel Computing: Strategies and Implications Dori Exterman CTO IncrediBuild. In this session we will discuss Multi-threaded vs. Multi-Process Choosing between Multi-Core or Multi- Threaded development

More information

Research Computing Building Blocks INFRASTRUCTURE FOR DATA AT PURDUE PRESTON SMITH, DIRECTOR OF RESEARCH SERVICES PSMITH@PURDUE.

Research Computing Building Blocks INFRASTRUCTURE FOR DATA AT PURDUE PRESTON SMITH, DIRECTOR OF RESEARCH SERVICES PSMITH@PURDUE. Research Computing Building Blocks INFRASTRUCTURE FOR DATA AT PURDUE PRESTON SMITH, DIRECTOR OF RESEARCH SERVICES PSMITH@PURDUE.EDU Discussion http://www.geartechnology.com/blog/wp- content/uploads/2015/11/opportunity-

More information

New solutions for Big Data Analysis and Visualization

New solutions for Big Data Analysis and Visualization New solutions for Big Data Analysis and Visualization From HPC to cloud-based solutions Barcelona, February 2013 Nacho Medina imedina@cipf.es http://bioinfo.cipf.es/imedina Head of the Computational Biology

More information

Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga

Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga Programming models for heterogeneous computing Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga Talk outline [30 slides] 1. Introduction [5 slides] 2.

More information

Bricata Next Generation Intrusion Prevention System A New, Evolved Breed of Threat Mitigation

Bricata Next Generation Intrusion Prevention System A New, Evolved Breed of Threat Mitigation Bricata Next Generation Intrusion Prevention System A New, Evolved Breed of Threat Mitigation Iain Davison Chief Technology Officer Bricata, LLC WWW.BRICATA.COM The Need for Multi-Threaded, Multi-Core

More information

DATA MANAGEMENT FOR THE INTERNET OF THINGS

DATA MANAGEMENT FOR THE INTERNET OF THINGS DATA MANAGEMENT FOR THE INTERNET OF THINGS February, 2015 Peter Krensky, Research Analyst, Analytics & Business Intelligence Report Highlights p2 p4 p6 p7 Data challenges Managing data at the edge Time

More information

Accelerating Academic Research with Cloud Computing

Accelerating Academic Research with Cloud Computing Accelerating Academic Research with Cloud Computing Published: September 2014 For the latest information, please see www.microsoft.com/education Overview... 1 What Is the Cloud?... 1 Cloud Service Models...

More information

EXPLORATION TECHNOLOGY REQUIRES A RADICAL CHANGE IN DATA ANALYSIS

EXPLORATION TECHNOLOGY REQUIRES A RADICAL CHANGE IN DATA ANALYSIS EXPLORATION TECHNOLOGY REQUIRES A RADICAL CHANGE IN DATA ANALYSIS EMC Isilon solutions for oil and gas EMC PERSPECTIVE TABLE OF CONTENTS INTRODUCTION: THE HUNT FOR MORE RESOURCES... 3 KEEPING PACE WITH

More information

Using Cloud-Based Technologies in Clinical Trials by Niki Kutac, Director, Product Management

Using Cloud-Based Technologies in Clinical Trials by Niki Kutac, Director, Product Management White Paper Using Cloud-Based Technologies in Clinical Trials by Niki Kutac, Director, Product Management Technology has transformed industries, from music to medicine. Advances in data availability and

More information

How To Manage Research Data At Columbia

How To Manage Research Data At Columbia An experience/position paper for the Workshop on Research Data Management Implementations *, March 13-14, 2013, Arlington Rajendra Bose, Ph.D., Manager, CUIT Research Computing Services Amy Nurnberger,

More information

SMART ASSET MANAGEMENT MAXIMISE VALUE AND RELIABILITY

SMART ASSET MANAGEMENT MAXIMISE VALUE AND RELIABILITY SMART ASSET MANAGEMENT MAXIMISE VALUE AND RELIABILITY Electrical equipment is a critical component of your asset portfolio. Beyond its financial value, it plays an even greater role in your business performance.

More information

Commonwealth Advanced Data Analytics Alliance & The President s Precision Medicine Initiative

Commonwealth Advanced Data Analytics Alliance & The President s Precision Medicine Initiative Commonwealth Advanced Data Analytics Alliance & The President s Precision Medicine Initiative Deputy Secretary Anthony Fung Presentation to the Health IT Standards Advisory Committee December 17, 2015

More information

Cloud Computing on a Smarter Planet. Smarter Computing

Cloud Computing on a Smarter Planet. Smarter Computing Cloud Computing on a Smarter Planet Smarter Computing 2 Cloud Computing on a Smarter Planet As our planet gets smarter more instrumented, interconnected and intelligent the underlying infrastructure needs

More information

Charting the Evolution of Campus Cyberinfrastructure: Where Do We Go From Here? 2015 National Science Foundation NSF CC*NIE/IIE/DNI Principal

Charting the Evolution of Campus Cyberinfrastructure: Where Do We Go From Here? 2015 National Science Foundation NSF CC*NIE/IIE/DNI Principal Jim Bottum Charting the Evolution of Campus Cyberinfrastructure: Where Do We Go From Here? 2015 National Science Foundation NSF CC*NIE/IIE/DNI Principal Investigators Meeting The CC* Mission Campuses today

More information

What s New in MATLAB and Simulink

What s New in MATLAB and Simulink What s New in MATLAB and Simulink Kevin Cohan Product Marketing, MATLAB Michael Carone Product Marketing, Simulink 2015 The MathWorks, Inc. 1 What was new for Simulink in R2012b? 2 What Was New for MATLAB

More information

3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India

3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India 3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India Call for Papers Cloud computing has emerged as a de facto computing

More information

UNINETT Sigma2 AS: architecture and functionality of the future national data infrastructure

UNINETT Sigma2 AS: architecture and functionality of the future national data infrastructure UNINETT Sigma2 AS: architecture and functionality of the future national data infrastructure Authors: A O Jaunsen, G S Dahiya, H A Eide, E Midttun Date: Dec 15, 2015 Summary Uninett Sigma2 provides High

More information

Big Data Performance Growth on the Rise

Big Data Performance Growth on the Rise Impact of Big Data growth On Transparent Computing Michael A. Greene Intel Vice President, Software and Services Group, General Manager, System Technologies and Optimization 1 Transparent Computing (TC)

More information

How To Change Medicine

How To Change Medicine P4 Medicine: Personalized, Predictive, Preventive, Participatory A Change of View that Changes Everything Leroy E. Hood Institute for Systems Biology David J. Galas Battelle Memorial Institute Version

More information

Oracle Engineered Systems and Triple Point Technology

Oracle Engineered Systems and Triple Point Technology Oracle Engineered Systems and Triple Point Technology SuccESSful volatility management for commodities 2 commodity Xl for Oil Extreme commodity volatility and increased supply chain complexity are the

More information

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first

More information

Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing

Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing James D. Jackson Philip J. Hatcher Department of Computer Science Kingsbury Hall University of New Hampshire Durham,

More information

Emerging Technology for the Next Decade

Emerging Technology for the Next Decade Emerging Technology for the Next Decade Cloud Computing Keynote Presented by Charles Liang, President & CEO Super Micro Computer, Inc. What is Cloud Computing? Cloud computing is Internet-based computing,

More information

2015 The MathWorks, Inc. 1

2015 The MathWorks, Inc. 1 25 The MathWorks, Inc. 빅 데이터 및 다양한 데이터 처리 위한 MATLAB의 인터페이스 환경 및 새로운 기능 엄준상 대리 Application Engineer MathWorks 25 The MathWorks, Inc. 2 Challenges of Data Any collection of data sets so large and complex

More information

Scaling LS-DYNA on Rescale HPC Cloud Simulation Platform

Scaling LS-DYNA on Rescale HPC Cloud Simulation Platform Scaling LS-DYNA on Rescale HPC Cloud Simulation Platform Joris Poort, President & CEO, Rescale, Inc. Ilea Graedel, Manager, Rescale, Inc. 1 Cloud HPC on the Rise 1.1 Background Engineering and science

More information

From Big Data to Smart Data Thomas Hahn

From Big Data to Smart Data Thomas Hahn Siemens Future Forum @ HANNOVER MESSE 2014 From Big to Smart Hannover Messe 2014 The Evolution of Big Digital data ~ 1960 warehousing ~1986 ~1993 Big data analytics Mining ~2015 Stream processing Digital

More information

Deploying an Operational Data Store Designed for Big Data

Deploying an Operational Data Store Designed for Big Data Deploying an Operational Data Store Designed for Big Data A fast, secure, and scalable data staging environment with no data volume or variety constraints Sponsored by: Version: 102 Table of Contents Introduction

More information

Blue: C= 77 M= 24 Y=19 K=0 Font: Avenir. Clockwork LCM Cloud. Technology Whitepaper

Blue: C= 77 M= 24 Y=19 K=0 Font: Avenir. Clockwork LCM Cloud. Technology Whitepaper Technology Whitepaper Clockwork Solutions, LLC. 1 (800) 994-1336 A Teakwood Capital Company Copyright 2013 TABLE OF CONTENTS Clockwork Solutions Bringing Cloud Technology to the World Clockwork Cloud Computing

More information

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce Elastic Application Platform for Market Data Real-Time Analytics Can you deliver real-time pricing, on high-speed market data, for real-time critical for E-Commerce decisions? Market Data Analytics applications

More information

Acceleration for Personalized Medicine Big Data Applications

Acceleration for Personalized Medicine Big Data Applications Acceleration for Personalized Medicine Big Data Applications Zaid Al-Ars Computer Engineering (CE) Lab Delft Data Science Delft University of Technology 1" Introduction Definition & relevance Personalized

More information

MATLAB in Business Critical Applications Arvind Hosagrahara Principal Technical Consultant Arvind.Hosagrahara@mathworks.

MATLAB in Business Critical Applications Arvind Hosagrahara Principal Technical Consultant Arvind.Hosagrahara@mathworks. MATLAB in Business Critical Applications Arvind Hosagrahara Principal Technical Consultant Arvind.Hosagrahara@mathworks.com 310-819-3970 2014 The MathWorks, Inc. 1 Outline Problem Statement The Big Picture

More information

Data Centric Systems (DCS)

Data Centric Systems (DCS) Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems

More information

COMP/CS 605: Intro to Parallel Computing Lecture 01: Parallel Computing Overview (Part 1)

COMP/CS 605: Intro to Parallel Computing Lecture 01: Parallel Computing Overview (Part 1) COMP/CS 605: Intro to Parallel Computing Lecture 01: Parallel Computing Overview (Part 1) Mary Thomas Department of Computer Science Computational Science Research Center (CSRC) San Diego State University

More information

Panasas High Performance Storage Powers the First Petaflop Supercomputer at Los Alamos National Laboratory

Panasas High Performance Storage Powers the First Petaflop Supercomputer at Los Alamos National Laboratory Customer Success Story Los Alamos National Laboratory Panasas High Performance Storage Powers the First Petaflop Supercomputer at Los Alamos National Laboratory June 2010 Highlights First Petaflop Supercomputer

More information

Large-Scale Reservoir Simulation and Big Data Visualization

Large-Scale Reservoir Simulation and Big Data Visualization Large-Scale Reservoir Simulation and Big Data Visualization Dr. Zhangxing John Chen NSERC/Alberta Innovates Energy Environment Solutions/Foundation CMG Chair Alberta Innovates Technology Future (icore)

More information

Healthcare, transportation,

Healthcare, transportation, Smart IT Argus456 Dreamstime.com From Data to Decisions: A Value Chain for Big Data H. Gilbert Miller and Peter Mork, Noblis Healthcare, transportation, finance, energy and resource conservation, environmental

More information

High-Performance Computing and Big Data Challenge

High-Performance Computing and Big Data Challenge High-Performance Computing and Big Data Challenge Dr Violeta Holmes Matthew Newall The University of Huddersfield Outline High-Performance Computing E-Infrastructure Top500 -Tianhe-II UoH experience: HPC

More information

Compute Canada Technology Briefing

Compute Canada Technology Briefing Compute Canada Technology Briefing November 12, 2015 Introduction Compute Canada, in partnership with regional organizations ACENET, Calcul Québec, Compute Ontario and WestGrid, leads the acceleration

More information

FDA STAFF MANUAL GUIDES, VOLUME I - ORGANIZATIONS AND FUNCTIONS FOOD AND DRUG ADMINISTRATION OFFICE OF OPERATIONS

FDA STAFF MANUAL GUIDES, VOLUME I - ORGANIZATIONS AND FUNCTIONS FOOD AND DRUG ADMINISTRATION OFFICE OF OPERATIONS SMG 1117.22a FDA STAFF MANUAL GUIDES, VOLUME I - ORGANIZATIONS AND FUNCTIONS FOOD AND DRUG ADMINISTRATION OFFICE OF OPERATIONS OFFICE OF INFORMATION MANAGEMENT AND TECHNOLOGY OFFICE OF INFORMATICS AND

More information

Accelerating Hadoop MapReduce Using an In-Memory Data Grid

Accelerating Hadoop MapReduce Using an In-Memory Data Grid Accelerating Hadoop MapReduce Using an In-Memory Data Grid By David L. Brinker and William L. Bain, ScaleOut Software, Inc. 2013 ScaleOut Software, Inc. 12/27/2012 H adoop has been widely embraced for

More information

Java/Scala Engineer Internet of Iot Competitors

Java/Scala Engineer Internet of Iot Competitors JOB 1 Sr. Java/Scala Engineer Internet of Things, IoT, is a true digital revolution. Predictions of 20, 50 or 100 billion connected devices in 2020 are pointing to massive changes for people and industries.

More information

University Uses Business Intelligence Software to Boost Gene Research

University Uses Business Intelligence Software to Boost Gene Research Microsoft SQL Server 2008 R2 Customer Solution Case Study University Uses Business Intelligence Software to Boost Gene Research Overview Country or Region: Scotland Industry: Education Customer Profile

More information

Applications to Computational Financial and GPU Computing. May 16th. Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61

Applications to Computational Financial and GPU Computing. May 16th. Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61 F# Applications to Computational Financial and GPU Computing May 16th Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61 Today! Why care about F#? Just another fashion?! Three success stories! How Alea.cuBase

More information

EMC XtremSF: Delivering Next Generation Performance for Oracle Database

EMC XtremSF: Delivering Next Generation Performance for Oracle Database White Paper EMC XtremSF: Delivering Next Generation Performance for Oracle Database Abstract This white paper addresses the challenges currently facing business executives to store and process the growing

More information

IBM Software Information Management Creating an Integrated, Optimized, and Secure Enterprise Data Platform:

IBM Software Information Management Creating an Integrated, Optimized, and Secure Enterprise Data Platform: Creating an Integrated, Optimized, and Secure Enterprise Data Platform: IBM PureData System for Transactions with SafeNet s ProtectDB and DataSecure Table of contents 1. Data, Data, Everywhere... 3 2.

More information

Big Data Executive Survey

Big Data Executive Survey Big Data Executive Full Questionnaire Big Date Executive Full Questionnaire Appendix B Questionnaire Welcome The survey has been designed to provide a benchmark for enterprises seeking to understand the

More information

HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014

HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014 HPC Cluster Decisions and ANSYS Configuration Best Practices Diana Collier Lead Systems Support Specialist Houston UGM May 2014 1 Agenda Introduction Lead Systems Support Specialist Cluster Decisions Job

More information

Understanding Big Data Analytics for Research

Understanding Big Data Analytics for Research Understanding Big Data Analytics for Research Hye-Chung Kum Texas A&M Health Science Center, Dept. of Health Policy & Management University of North Carolina at Chapel Hill, Dept. of Computer Science (kum@tamhsc.edu)

More information

Steps to Migrating to a Private Cloud

Steps to Migrating to a Private Cloud Deploying and Managing Private Clouds The Essentials Series Steps to Migrating to a Private Cloud sponsored by Introduction to Realtime Publishers by Don Jones, Series Editor For several years now, Realtime

More information

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Is a Data Scientist the New Quant? Stuart Kozola MathWorks Is a Data Scientist the New Quant? Stuart Kozola MathWorks 2015 The MathWorks, Inc. 1 Facts or information used usually to calculate, analyze, or plan something Information that is produced or stored by

More information

Smart Data Innovation Lab (SDIL)

Smart Data Innovation Lab (SDIL) Smart Data Innovation Lab (SDIL) Accelerating Data driven Innovation NESSI Summit May 27, 2014 Prof. Dr.-Ing. Michael Beigl Department of Informatics KIT University of the State of Baden-Wuerttemberg and

More information

Cloud Computing for Research Roger Barga Cloud Computing Futures, Microsoft Research

Cloud Computing for Research Roger Barga Cloud Computing Futures, Microsoft Research Cloud Computing for Research Roger Barga Cloud Computing Futures, Microsoft Research Trends: Data on an Exponential Scale Scientific data doubles every year Combination of inexpensive sensors + exponentially

More information

Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging

Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging Outline High Performance Computing (HPC) Towards exascale computing: a brief history Challenges in the exascale era Big Data meets HPC Some facts about Big Data Technologies HPC and Big Data converging

More information

Deep Learning Meets Heterogeneous Computing. Dr. Ren Wu Distinguished Scientist, IDL, Baidu wuren@baidu.com

Deep Learning Meets Heterogeneous Computing. Dr. Ren Wu Distinguished Scientist, IDL, Baidu wuren@baidu.com Deep Learning Meets Heterogeneous Computing Dr. Ren Wu Distinguished Scientist, IDL, Baidu wuren@baidu.com Baidu Everyday 5b+ queries 500m+ users 100m+ mobile users 100m+ photos Big Data Storage Processing

More information

Realizing the Benefits of Data Modernization

Realizing the Benefits of Data Modernization February 2015 Perspective Realizing the Benefits of How to overcome legacy data challenges with innovative technologies and a seamless data modernization roadmap. Companies born into the digital world

More information

University of Utah backbone is fully redundant with one or more 10Gb/s connecting each distribution node to a redundant core which connects to a

University of Utah backbone is fully redundant with one or more 10Gb/s connecting each distribution node to a redundant core which connects to a 1 * Dave Pershing 2 University of Utah backbone is fully redundant with one or more 10Gb/s connecting each distribution node to a redundant core which connects to a redundant WAN which connects to redundant

More information

How To Understand The Benefits Of Big Data

How To Understand The Benefits Of Big Data Findings from the research collaboration of IBM Institute for Business Value and Saïd Business School, University of Oxford Analytics: The real-world use of big data How innovative enterprises extract

More information

Microsoft Research Windows Azure for Research Training

Microsoft Research Windows Azure for Research Training Copyright 2013 Microsoft Corporation. All rights reserved. Except where otherwise noted, these materials are licensed under the terms of the Apache License, Version 2.0. You may use it according to the

More information

ebook Utilizing MapReduce to address Big Data Enterprise Needs Leveraging Big Data to shorten drug development cycles in Pharmaceutical industry.

ebook Utilizing MapReduce to address Big Data Enterprise Needs Leveraging Big Data to shorten drug development cycles in Pharmaceutical industry. Utilizing MapReduce to address Big Data Enterprise Needs Leveraging Big Data to shorten drug development cycles in Pharmaceutical industry. www.persistent.com 3 4 5 5 7 9 10 11 12 13 From the Vantage Point

More information

Data Management Programs

Data Management Programs Data Management Programs GETTING TO CONCRETE RESULTS Paul J. Bracke Associate Dean for Research and Assessment ARL Membership Meeting October 8, 2014 The Purdue Context THE ROAD TO CONCRETE RESULTS Response

More information

Data-intensive HPC: opportunities and challenges. Patrick Valduriez

Data-intensive HPC: opportunities and challenges. Patrick Valduriez Data-intensive HPC: opportunities and challenges Patrick Valduriez Big Data Landscape Multi-$billion market! Big data = Hadoop = MapReduce? No one-size-fits-all solution: SQL, NoSQL, MapReduce, No standard,

More information

Top Ten Questions. to Ask Your Primary Storage Provider About Their Data Efficiency. May 2014. Copyright 2014 Permabit Technology Corporation

Top Ten Questions. to Ask Your Primary Storage Provider About Their Data Efficiency. May 2014. Copyright 2014 Permabit Technology Corporation Top Ten Questions to Ask Your Primary Storage Provider About Their Data Efficiency May 2014 Copyright 2014 Permabit Technology Corporation Introduction The value of data efficiency technologies, namely

More information

HPC technology and future architecture

HPC technology and future architecture HPC technology and future architecture Visual Analysis for Extremely Large-Scale Scientific Computing KGT2 Internal Meeting INRIA France Benoit Lange benoit.lange@inria.fr Toàn Nguyên toan.nguyen@inria.fr

More information

BIG DATA-AS-A-SERVICE

BIG DATA-AS-A-SERVICE White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers

More information