Distributed Big Data and Analytics (DBDA)

Similar documents
New Jersey Big Data Alliance

Available online at ScienceDirect. Procedia CIRP 38 (2015 ) 3 7

Center for Dynamic Data Analytics (CDDA) An NSF Supported Industry / University Cooperative Research Center (I/UCRC) Vision and Mission

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

Collaborative Research and Education in Southeastern US in Emerging Areas of Power Engineering

EO Data by using SAP HANA Spatial Hinnerk Gildhoff, Head of HANA Spatial, SAP Satellite Masters Conference 21 th October 2015 Public

Big Data Executive Survey

Building Energy Management: Using Data as a Tool

Big Data in Manufacturing: Market Analysis, Case Studies, and Forecasts

Industrial Dr. Stefan Bungart

National Nursing Informatics Deep Dive Program

Leveraging Big Data

Indian Institute of Technology Hyderabad. Standing Advertisement No. IITH/2013/Faculty/5

SAP HANA Cloud Platform Overview Customer

SAP Business Suite powered by SAP HANA

M.S. AND PH.D. IN BIOMEDICAL ENGINEERING

Big Data Explained. An introduction to Big Data Science.

The IoT Inc Business Meetup Silicon Valley

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

From Distributed Computing to Distributed Artificial Intelligence

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence

Discover Viterbi: Petroleum Engineering

Cloud Computing. What is it? Presented by Prof. Dr.Prabhas CHONGSTITVATANA Asst. Prof. Dr.Chaiyachet SAIVICHIT. Source : Montana State Library Archive

Charting the Evolution of Campus Cyberinfrastructure: Where Do We Go From Here? 2015 National Science Foundation NSF CC*NIE/IIE/DNI Principal

Hofstra University. Big Data Laboratory Robotics and Advanced Manufacturing Laboratory

Big Data Analytics. for the Exploitation of the CERN Accelerator Complex. Antonio Romero Marín

Increase System Efficiency with Condition Monitoring. Embedded Control and Monitoring Summit National Instruments

Data Centric Computing Revisited

Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED

Building a Scalable Big Data Infrastructure for Dynamic Workflows

Mobile Monetization Scenario Design & Big Data. Arther Wu Senior Director of Monetization and Business Operation

The future of Big Data A United Hitachi View

Jack Baskin School of Engineering The University of California, Santa Cruz. Steve Kang, Dean and Prof. of Electrical Engineering October 15, 2003

Discover Viterbi: Petroleum Engineering

Strategic Decisions Supported by SAP Big Data Solutions. Angélica Bedoya / Strategic Solutions GTM Mar /2014

How To Write An Nccwsc/Csc Data Management Plan

How To Manage Research Data At Columbia

Data Science Institute Recent Activities. Education Faculty Excellence & Recruiting Research Industrial Relationships

G-Cloud Service Definition Canopy Big Data proof of concept Service SCS

Technology Big Data Solutions for Aeronautics : value, issues and solution. Business Models. Usage

Leveraging OpenStack Private Clouds

Assistant Professor: Rs.12, ,300/- plus allowances as admissible to Central Government employees.

Visualization and Analytics Centers

From Big Data to Smart Data Thomas Hahn

Graduate Training in Wireless Intelligent Sensor Networks

The Purview Solution Integration With Splunk

Precision Agriculture Using SAP HANA and F4F Cloud Integration to Improve Agribusiness. Dr. Lauren McCallum May, 2015

Big Data R&D Initiative

Does the program qualify for supplemental Palmetto Fellows and LIFE Scholarship awards? Yes No

Vivir en un mar de Datos 2015: Big Data una mirada Global Fundación Telefónica

Astrophysics with Terabyte Datasets. Alex Szalay, JHU and Jim Gray, Microsoft Research

How To Teach B.S. In Biomedical Engineering At The University Of South Carolina

Big Data a threat or a chance?

Make the Most of Big Data to Drive Innovation Through Reseach

Intelligent Lifecycle Asset Management Cradle to Grave Data Utilization in Transportation

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

Russ College of Engineering and Technology. Revised 9/ Undergraduate GPA of 3.0/4.0 or equivalent.

Data analysis in Par,cle Physics

How do we train Data Scientists and Data Engineers?

Methodology and Framework of a Cloud-Based Prognostics and Health Management System for Manufacturing Industry

AURO Enterprise Cloud

Big Analytics in the Cloud. Matt Winkler PM, Big

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

BUILDING A SCALABLE BIG DATA INFRASTRUCTURE FOR DYNAMIC WORKFLOWS

Cornell University Optimizes Energy Efficiency with GE s Proficy Software

Amplify Serviceability and Productivity by integrating machine /sensor data with Data Science

The Internet of Everything

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Introduction of Information Visualization and Visual Analytics. Chapter 2. Introduction and Motivation

Ganzheitliches Datenmanagement

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Government Technology Trends to Watch in 2014: Big Data

ediscovery Features of SharePoint 2013 and Exchange 2013 Paul Branson Solution Architect Microsoft

Master s in Computational Mathematics. Faculty of Mathematics, University of Waterloo

A New Era Of Analytic

Doing Multidisciplinary Research in Data Science

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

Prairie View A&M University Computer Science Department 招 生 简 介

Are You Ready for Big Data?

Transcription:

Distributed Big Data and Analytics (DBDA) Internet2 CINO Ini,a,ve Working Group Co- Chair Mee,ng 10 August 2015 Chairs Alex Feltus, Clemson Sam Gustman, USC Marc Hoit, NC State 1 1

Meeting Objectives Discussion of eight submitted use cases Discussion of next steps 2

Use case input from the working group Use Case # Use Case Title Name Ins0tu0on 1 Data Analy,cs of Campus- Scale Power SubmiHed by Alex Feltus Clemson System Contact Dan Noneaker 2 Intelligent Management Systems Center SubmiHed by Jane Combs Univ of Cincinna, Contact Prof. Jay Lee 3 Machine Tool Ball Screw Health Monitoring SubmiHed by Jane Combs Univ of Cincinna, Contact Prof. Jay Lee 4 Bioinforma,cs SubmiHed by Jane Combs Univ of Cincinna, 5 Computa,onal Fluid Dynamics Research: SubmiHed by Jane Combs Univ of Cincinna, Aerospace 6 Geography/Climate SubmiHed by Jane Combs Univ of Cincinna, 7 High Energy Physics SubmiHed by Jane Combs Univ of Cincinna, 8 Modeling and Simula,ons SubmiHed by Jane Combs Univ of Cincinna, 3

Data Analytics of Campus-Scale Power System Submitted by Alex Feltus at Clemson Contact Dan Noneaker at Clemson Project/Research Title: Data Analytics of Campus-Scale Power System Industry Sector: Electric Power Utility Science Sub-domain: Electrical Engineering Short Description of Project & Relation to Big Data: The local electric power grid will be heavily instrumented on a campus containing a mix of residential sites, office spaces, industrial-scale electromechanical systems, and distributed energy sources. The instruments will be networked to a server that provides data for use in analytics focused on electric energy consumption, electric-service reliability, power quality, and local grid planning and design. The analytics will support research in local-grid technologies, distributed control of the electric grid, and power electronics, etc. Potential Industrial Partners: Duke Energy (Clemson's electric service provider), other electric utilities, power-industry instrumentation and electronics manufacturers, power-system monitoring and control vendors Other Faculty Involved All power faculty at Clemson, power research staff at Clemson's CURI site in Charleston, SoC faculty working in data analytics Best Contact: Dan Noneaker, ECE Dept. Chair, dnoneak@clemson.edu Big Data Attributes: sensor, near-realtime, distributed, geospatial Aggregate Data Size: 2 TB 4 TB 2017 16 TB 2020 1 PB 4

Intelligent Maintenance Systems (IMS) Center Contact Prof. Jay Lee at University of Cincinnati Project/Research Title: Utilizing Prognostics & Health Management (PHM) Cloud Technology to Improve Band Sawing Process Industry Sector: Manufacturing / Industrial Machinery Science Sub-domain: Data Analytics / Prognostics & Health Management Short Description of Project & Relation to Big Data: The goal of this project is to acquire a large amount of operating data from band saw machines both in the field and from an in-house test bed. This data is then analyzed using the Watchdog Agent toolkit to assess and predict the health condition of the monitored band saws. Once validated, this approach will be used to construct a commercial cloud-based platform and mobile app for the project sponsor. Best Contact: Professor Jay Lee Big Data Attributes: Sensor, Near Real-time Aggregate Data Size: 5 TB 10 TB 2017 2020 5

Machine Tool Ball Screw Health Monitoring Contact Prof. Jay Lee at University of Cincinnati Project/Research Title: Machine Tool Ball Screw Health Monitoring Industry Sector: Manufacturing / Industrial Machinery Science Sub-domain: Data Analysis / Prognostics & Health Management Short Description of Project & Relation to Big Data: The goal of this project is to conduct multiple run-to-failure tests using commercially available machine tool ball screws and motors to collect data and design a data driven model for health monitoring and prediction of such ball screws. Data from these tests is transferred and stored on a central server. A mobile app will be developed for monitoring these tests. Best Contact: Professor Jay Lee Big Data Attributes: Sensor, Near Real-time Aggregate Data Size: 5TB 15TB 2017 30TB 2020 6

Bioinformatics Project/Research Title: NIH BD2K-LINCS Perturbation Data Coordination and Integration Center Industry Sector: Health care / Environmental Health/ Biomedical Research Science Sub-domain: Bioinformatics and Data Science Short Description of Project & Relation to Big Data: The Library of Integrated Network Based Cellular Signatures (LINCS) project is expected to produce masses of data collected from human cells and tissues perturbed with drugs and other molecules. The center s role is to develop new methods to integrate big data, come up with intelligent ways to mine and analyze it, intuitive tools to interact with it and to educate the research community on how to best leverage this trove of information for biomedical research. Best Contact: Jane Combs, combsje@uc.edu Big Data Attributes: In biomedical research, these data sources include the diverse, complex, disorganized, massive, and multimodal data being generated by researchers, hospitals, and mobile devices around the world. Aggregate Data Size: 2017 2020 7

Computational Fluid Dynamics Research: Aerospace Project/Research Title: Study of Active and Passive Flow Control Techniques over Turbine Blades Industry Sector: Aerospace Science Sub-domain: Mechanical Engineering, Comp Fluid Dynamics Short Description of Project & Relation to Big Data: Collaborative immersive visualization of large datasets and simulation trajectories to support the study of active and passive flow control techniques and turbine-blade cooling. The goal of such simulations is to provide predictive performance analysis of physical systems that may contain many integrated components and which are described by multiple, interacting, physical processes. Best Contact: Jane Combs, combsje@uc.edu Big Data Attributes: Simulation datasets and data visualization Aggregate Data Size: 2TB 4TB 2017 6TB 2020 8

Geography/Climate Project/Research Title: Toward a Circumarctic Lakes Observation Network (CALON)--Multiscale Observations of Lacustrine Systems Industry Sector: Geography Science Sub-domain: Climate Short Description of Project & Relation to Big Data: Expand on existing lake monitoring sites in northern Alaska by developing a network of regionally representative lakes along environmental gradients from which we will collect baseline data to assess current physical, chemical, and biological lake characteristics. Download and process hundreds of up to 1TB processed satellite image data sets with four Internet2 universities and NSF National Snow and Ice Data Center. Develop and refine data management, visualization, and archiving activities with ACADIS. Best Contact: Jane Combs, combsje@uc.edu Big Data Attributes: Satellite image data sets Aggregate Data Size: 1TB 3TB 2017 2020 9

High Energy Physics Project/Research Title: Large Hadron Collider (LHCb) experiment at CERN studying heavy flavor physics Industry Sector: Nuclear Physics, Energy Science Sub-domain: Physics Short Description of Project & Relation to Big Data: The physics focus is studying oscillations of matter into anti-matter and studying the differences in decays rates of matter and corresponding anti-matter to mirror-image final states. These address the nature of fundamental interactions between the basic constituents of matter. We move large data files from host laboratories to computers at UC for final analysis. For example, a single file with an LHCb NTUPLE from a small fraction of the data is 3GB. The size of the data set to be transferred for this analysis will be on the order of 1TB. Best Contact: Jane Combs, combsje@uc.edu Big Data Attributes: LHCb NTUPLE data sets Aggregate Data Size: 3TB 6TB 2017 6TB 2020 10

Modeling and Simulation Project/Research Title: Study of Active and Passive Flow Control Techniques over Turbine Blades Industry Sector: Consumer Products Science Sub-domain: Mechanical Engineering, Modeling and Simulation Short Description of Project & Relation to Big Data: The UC Simulation Center, in collaboration with Procter & Gamble, focuses on high-fidelity numerical simulation of complex flow phenomena with a wide range of length and time scales. The UC Simulation center is a partnership where students work on M&S of complex industrial problems associated with porous media, multiphase flows, etc. Predictive performance analysis of the multidisciplinary, multi-scale systems generate terabytes of data. Best Contact: Jane Combs, combsje@uc.edu Big Data Attributes: Simulation datasets and data visualization Aggregate Data Size: 2TB 4TB 2017 6TB 2020 11

Next Steps Schedule deep dive calls between Co-Chairs and Use Case POCs Determine co-chair presenters for August 31 Joint Collaborative Innovation Community call Preparation for in person meetings at TechEx October 4-7 in Cleveland, Ohio DBDA Innovation Working Group meeting 75 minutes Review current status, use cases, gather new ideas Collaborative Innovation Community meeting all 3 working groups together 90 minutes Each working group presents status Invite new participants and new ideas Innovation hackathon over lunch for new ideas in current innovation areas or new ones Monthly team meeting next one September 12

Thank You 13