Data Intensive Science and Computing



Similar documents
Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED

UNCLASSIFIED. UNCLASSIFIED Office of Secretary Of Defense Page 1 of 8 R-1 Line #50

Data Centric Systems (DCS)

Condor Supercomputer

Department of Defense INSTRUCTION. Measurement and Signature Intelligence (MASINT)

UNCLASSIFIED R-1 ITEM NOMENCLATURE FY 2013 OCO

Flexible, Life-Cycle Support for Unique Mission Requirements

Army Research Laboratory Technical Strategy

U.S. Army Research, Development and Engineering Command. Cyber Security CRA Overview

HPC in Oil and Gas Exploration

Applied Research Laboratory: Visualization, Information and Imaging Programs

Sense Making in an IOT World: Sensor Data Analysis with Deep Learning

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

EL Program: Smart Manufacturing Systems Design and Analysis

High Performance Computing

Data Analytics at NERSC. Joaquin Correa NERSC Data and Analytics Services

Data Centric Computing Revisited

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

Concept and Project Objectives

STRENGTHENING DEPARTMENT OF DEFENSE LABORATORIES David Graham, Robert Leheny, and Susan Clark-Sestak

AeroVironment, Inc. Unmanned Aircraft Systems Overview Background

High Performance Computing and Big Data: The coming wave.

UNCLASSIFIED. R-1 Program Element (Number/Name) PE A / High Performance Computing Modernization Program. Prior Years FY 2013 FY 2014 FY 2015

Collaborations between Official Statistics and Academia in the Era of Big Data

Panel on Emerging Cyber Security Technologies. Robert F. Brammer, Ph.D., VP and CTO. Northrop Grumman Information Systems.

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

FACT SHEET. General Information about the Defense Contract Management Agency

DGE /DG Connect

NITRD and Big Data. George O. Strawn NITRD

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)

Simulation and Training Solutions

Government Technology Trends to Watch in 2014: Big Data

Information Fusion (Hard and Soft) For Intelligence, Surveillance & Reconnaisance (ISR) Joint Symposium IST-106 and SET-189

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER

The Challenge. Shipboard crew members need training because cadres are not expected to provide a long-term ConOps for UUV employment.

FACT SHEET. General Information About the Defense Contract Management Agency

Department of Defense INSTRUCTION

Department of Defense DIRECTIVE

Citizen's Charter Department of Defence Research and Development Government of India 2015

From Data to Insight: Big Data and Analytics for Smart Manufacturing Systems

SPAWAR HQ ARCHITECTURE AND HUMAN SYSTEMS GROUP Human-Systems Integration/Systems Engineering Support Performance Work Statement

A Candid Survey of Defense Managers December 2014

NAVAL SEA SYSTEMS COMMAND STRATEGIC BUSINESS PLAN

Sensing, monitoring and actuating on the UNderwater world through a federated Research InfraStructure Extending the Future Internet SUNRISE

3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India

HPCMP New Users Guide Who Are We? Distribution A: Approved for Public release; distribution is unlimited.

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems

Big Data. George O. Strawn NITRD

Mississippi State University High Performance Computing Collaboratory Brief Overview. Trey Breckenridge Director, HPC

Machine Learning for Data-Driven Discovery

How To Improve The Performance Of Anatm

Are You Ready for Big Data?

WHAT IS SOFTWARE PERFORMANCE ENGINEERING? By Michael Foster

Preface Introduction

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

HETEROGENEOUS HPC, ARCHITECTURE OPTIMIZATION, AND NVLINK

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Defining a Secure Mobile Framework Architecture at DHA

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

COMBATSS-21 Scalable combat management system for the world s navies

DEPARTMENT OF THE NAVY OFFICE OF THE CHIEF OF NAVAL OPERATIONS 2000 NAVY PENTAGON WASHINGTON, DC

Engineered Resilient Systems Power of Advanced Modeling and Analytics in Support of Acquisition

Information Visualization WS 2013/14 11 Visual Analytics

From Distributed Computing to Distributed Artificial Intelligence

Department of Defense DIRECTIVE

Information Technology

The State of DoD Biometrics

The PHI solution. Fujitsu Industry Ready Intel XEON-PHI based solution. SC Denver

Intermediate Advanced All

DoD Application Store: Enabling C2 Agility?

Department of Defense INSTRUCTION

Cray: Enabling Real-Time Discovery in Big Data

Understanding the Value of In-Memory in the IT Landscape

Beyond Watson: The Business Implications of Big Data

Department of Defense INSTRUCTION

Department of Defense NetOps Strategic Vision

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / %

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21)

Sourcery Overview & Virtual Machine Installation

HPC & Visualization. Visualization and High-Performance Computing

Transcription:

DEFENSE LABORATORIES ACADEMIA TRANSFORMATIVE SCIENCE Efficient, effective and agile research system INDUSTRY Data Intensive Science and Computing Advanced Computing & Computational Sciences Division University of Tennessee (AC&CSD) Mr. Charles J. Nietubicz Division Chief & Oak Ridge National Laboratory October 29-30, 2015 Dr. Michael Barton Computational Sciences Division U.S. Army Research Laboratory DISTRIBUTION STATEMENT A. Approved for public release; distribution is unlimited.

Our Mandate Vision The Nation s Premier Laboratory for Land Forces. Mission DISCOVER, INNOVATE, and TRANSITION Science and Technology to ensure dominant strategic land power Making today s Army and the next Army obsolete

Who We Are Associate Director Plans & Programs ARL Director Military Deputy Associate Director Laboratory Operations Chief Scientist Deputy Director Basic Science Sergeant Major Director ARO Vehicle Technology Human Research & Engineering Survivability/ Lethality Analysis Computational & Information Sciences Sensors & Electron Devices Weapons & Materials Research

Assessment and Analysis Quantitatively Assess the development and application of analytical tools and methodologies to quantitatively assess the military utility of Army, DoD, and select foreign combat systems. Materials Research Fundamental understanding of structural, electronic, photonic, and energy materials & devices. ARL S&T Campaigns Human Sciences Fundamental understanding of Warfighter performance enhancement, training aids, and man-machine integration.. Computational Sciences Fundamental understanding of computer hardware, high efficiency algorithms, and novel mathematical methods. Information Sciences Fundamental understanding of information generation, collection, assurance, distribution, and exploitation. Sciences for Lethality & Protection Fundamental understanding of emerging technologies that support weapon systems, protection systems, and injury mechanisms affecting the Warfighter Sciences for Maneuver Fundamental understanding of the design, integration, control, and exploitation of highly adaptive platforms in complex environments Extramural Basic Research Steering and oversight of the systematic study to increase fundamental knowledge and understanding in physical, engineering, environmental, and life sciences related to long-term national security needs.

Computational Sciences Taxonomy Advanced Computing Architectures Emergent Architectures Tactical Computing Next Generation Computing Systems High Performance Networking and Memory B I G Computing Sciences Programming Environments Programming Languages Software Integration High Performance Computing Predictive Simulation Sciences Computational Math & Algorithms Scientific Computing Verification, Validation & Uncertainty Quantification Applied Computer Modeling and Analysis D A T A Data Intensive Sciences Sciences of Large Data Computational Math for Data Analytics Real-time Data Access & Analytics

Our Hardware MPI and BDMPI Testbed, Diskless Nodes Cray XC-40 Processing cores: 101,312 GPGPU s: 32 NVIDIA Tesla K40 Memory: 400 Terabytes 122 Terabytes: Solid State Disk or flash storage Peak Capability: 3.7 Petaflops Data Analytics Platforms Neuromorphic system 48 nodes Intel Phi, 16 nodes Nvidia GPUs

Scientific Discovery Theory Theory embodied in computation Hypotheses tested through experiment SCIENTIFC METHODS 21 st Century Research Continuum Computation Computation complements experiment Experiment

Data-Driven Discovery Modern scientific instruments are capable of collecting a petabyte or more of data per day HPC enables researchers to generate terabytes of data per case from physics-based computations for design, analysis, and discovery Developmental and operational tests, and training exercises, can generate a terabyte per day of data for events lasting multiple weeks from geographically separated areas and highly disparate data sources Compounding the issue is the observation Between 2000 and 2010, disk storage capacity grew by a factor of 1,000 while disc access time improved by only a factor of 2

Definitions Big Data Characterized by Volume How much data Velocity The speed at which data arrives and the speed with which decisions based on it must be made Variety Heterogeneity of storage platforms, data types, representation, semantic interpretation, and security classification or other distribution limitations Veracity How trustworthy is the data, what is its uncertainty, and what is the error associated with it Value What is the data worth

Background Big Data = Data Science + Big Data Computing Data Science The systematic study of digital data using scientific techniques Observation, theory development, systematic analysis, hypothesis testing, and rigorous validation Big Data Computing Novel and highly scalable computing paradigms Dynamically provisioned networks Innovative methods for processing and analysis Domain-specific, value-driven applications

Changing Perspective Big Data is Big Business Big Data is Big Potential Big Data is Growing Exponentially Infeasible to store some data collections in their entirety Signal processing and data modeling must be used to extract manageable data samples Full data collection capability not used in all cases because of Volume We can no longer blithely collect or generate data and worry about its use and disposition after the fact We must recognize that like a materiel system, data has a Life Cycle, and we must plan for that Life Cycle

Example Air Force Acquisition F-35 Program Data from OEM, AF Program Office, AFRL, T&E, Training 412 th Test Wing has 455+ TB of data, Combined Test Force has more Can collect 250K parameters at up to 10 6 Hz Navy and Marines have their own variants and data Foreign military sales add other partners Aircraft and program life cycle well defined; data life cycle is not The next airplane program will face the same issues again, only with even greater data generation and collection capability

Example Army Test & Evaluation Instrumented hundreds of vehicles in theater for data collection For the first time have the ability to understand if vehicles are being used as designed, on terrain for which they were designed Capturing braking, roll-over, vibration, crew compartment temperature, and other data in real time Have developmental test data for comparison and for forensics Have algorithms to reduce and display the data Need computational capacity (parallel algorithms and HPC platforms) to perform the analysis time-critically Need data framework to allow rapid access, comparison, analysis, display

Big Data Life Cycle Systems Engineering Approach to Big Data Requirements Based Grounded in how data will be used and who will use it Complements materiel system development life cycle Life cycle stages (feedback can exist at every stage) Cross-cutting challenges of Big Data

Big Data Requirements Establish with Use Cases Involve all stakeholders Acquisition S&T T&E Training OEMs Etc. Identify common elements in requirements and embody those in standard or common tools and processes Domain specific requirements met with domain unique tools Create flexible Big Data Framework to capture common and domain specific requirements

Extensible to other Stakeholders Big Data Framework Standardize common tools and processes in Core Toolset Provide interface for domain specific tools Local and distributed computing, dynamically provision networks, control access to platforms, networks, tools, and data Use feedback to improve processes, tools, and implementation

Summary Big Data isn t new We ve always dealt with it Now we must do so systematically and repeatably it s growing exponentially The Army must remain a leader in timely processing of Big Data the lives of our Warfighters depend on it We seek the best ideas from all sources, and invite collaboration for advancing data intensive science and computing There are more things in heaven and earth, Horatio, than are dreamt of in your philosophy. Hamlet Act 1, scene 5

Research Opportunities Neuromorphic Computing Deep Learning Tactical Cloudlets Software Defined Networks Highly Heterogeneous, Reconfigurable Systems CPU, GPU, FPGA, Neuromorphic, D-Wave, etc. Advanced Graph Analysis Domain Specific Languages Stream Computing Highly Scalable Software and Systems

Computational Science Ecosystem Materials Research Information Science Extramural programs ARL Campaigns & Unique facilities Sciences for Lethality and Protection Sciences for Maneuver Advanced Computing and Networking Research Testbed Open Campus Intern programs - Summer Institutes (high school) - Internships (college)

Army Research Laboratory Open Campus Initiative Piloting a New Laboratory Business Model Transformation Principles Flow, Agility, Quality, Efficiency & Effectiveness ACADEMIA Facilities DEFENSE LABORATORIES People TRANSFORMATIVE SCIENCE Resources Efficient, effective and agile research system ATTRACT AND RETAIN BEST & BRIGHTEST OPEN CAMPUSES SHARED MODERN FACILITIES INNOVATION PRACTICES INDUSTRY We will need new technology over the next 10 years to make a leaner and more capable Army. GEN Raymond T. Odierno 38th Chief of Staff, Army Responding to the National Security Challenges of the 21st Century Open Campus Website: http://www.arl.army.mil/opencampus/