Big Data R&D Initiative

Howard Wactlar, CISE Directorate, National Science Foundation
NIST Big Data Meeting, June 2012
Image credit: Exploratorium

The Landscape: Smart Sensing, Reasoning and Decision
[Figure: an agent loop in which the Environment supplies Percepts (sensors) to an Agent (Reasoning), which returns Actions (controllers). Stage labels: Sense, Identify, Assess, Intervene, Evaluate. Sensing scopes: Personal Sensing, Public Sensing, Social Sensing. Application areas: Emergency Response, People-Centric Sensing, Pervasive Social Computing, Informatics, Smart Health Care.]
Situation awareness: humans as sensors feed multimodal data streams.
Credit: Photo by US Geological Survey. Source: Sajal Das, Keith Marzullo

Paradigm Shift: from Hypothesis-driven to Data-driven Discovery
- Data-driven discovery is revolutionizing scientific exploration and engineering innovations
- Automatic extraction of new knowledge about the physical, biological and cyber world continues to accelerate
- Multi-cores, concurrent and parallel algorithms, virtualization and advanced server architectures will enable data mining and machine learning, and discovery and visualization of Big Data

Examples of Research Challenges
- More data is being collected than we can store: analyze the data as it becomes available; decide what to archive and what to discard
- Many data sets are too large to download: analyze the data wherever it resides
- Many data sets are too poorly organized to be usable: better organize and retrieve data
- Many data sets are heterogeneous in type, structure, semantics, organization, granularity, accessibility: integrate and customize access to federate data
- Utility of data is limited by our ability to interpret and use it: extract and visualize actionable knowledge; evaluate results
- Large and linked datasets may be exploited to identify individuals: design management and analysis with built-in privacy-preserving characteristics

Administration's Big Data Research and Development Initiative
- Big Data Senior Steering Group chartered in spring 2011 under the Networking and Information Technology R&D (NITRD) Program
- Members from NIH, NSF, DARPA, DOD OSD, DHS, DOE-Science, HHS, NARA, NASA, NIST, NOAA, NSA, and USGS
- Co-chaired by NIH and NSF
Image credit: Fuqing Zhang and Yonghui Weng, Pennsylvania State University; Frank Marks, NOAA; Gregory P. Johnson, Romy Schneider, John Cazes, Karl Schulz, Bill Barth, The University of Texas at Austin

NSF Strategy to Address Big Data
- Foundational research to develop new techniques and technologies to derive knowledge from data
- New cyberinfrastructure to manage, curate, and serve data to research communities
- Policy
- New approaches for education and workforce development
- New types of inter-disciplinary collaborations, grand challenges, and competitions

A Complex Policy Setting
- Researchers want data.
- Public policy requires access to data.
- Public policy also requires protection of privacy and intellectual property and other sensitive information.
- Much more to be done: policy on data management and data access

Core Techniques and Technologies for Advancing Big Data Science & Engineering (BIG DATA)
Foundational research to extract knowledge from data
- Foundational research to advance the core techniques and technologies for managing, analyzing, visualizing, and extracting useful information from large, diverse, distributed and heterogeneous data sets
- Cross-Directorate Program: NSF-wide
- Multi-agency Commitment: NSF and NIH
Image credit: Jurgen Schulze, Calit2, UC-San Diego

BIG DATA Research Thrusts

Collection, Storage, and Management of Big Data
- Data representation, storage, and retrieval
- New parallel data architectures, including clouds
- Data management policies, including privacy and access
- Communication and storage devices with extreme capacities
- Sustainable economic models for access and preservation

Data Analytics
- Computational, mathematical, statistical, and algorithmic techniques for modeling high dimensional data
- Learning, inference, prediction, and knowledge discovery for large volumes of dynamic data sets
- Data mining to enable automated hypothesis generation, event correlation, and anomaly detection
- Information infusion of multiple data sources

Research in Data Sharing and Collaboration
- Tools for distant data sharing, real time visualization, and software reuse of complex data sets
- Cross disciplinary model, information and knowledge sharing
- Remote operation and real time access to distant data sources and instruments

Credit: Fermilab Photo

Big Data to Address National Priorities
- Health & Wellbeing
- Environment & Sustainability
- Emergency Response & Disaster Resiliency
- Manufacturing, Robotics, & Smart Systems
- Secure Cyberspace
- Transportation & Energy
- Education and Workforce Development

Smart Health & Wellbeing
- Reasoning under uncertainty
- The ability to acquire, aggregate and mine clinical, scientific, and behavioral data will create an unprecedented amount of high-quality data from individuals and populations
- Enabling evidence-based medicine, early diagnoses, personalized assessments and care

Era of Big Data in Healthcare
- Large volumes of data currently collected
  - EHRs and PHRs
- Multi-scale and multi-source
  - During hospitalizations: for safety and diagnosis
  - On an out-patient basis: typically event monitors
  - Via ubiquitous mobile sensors: behavior, physiology, environment
  - As part of clinical studies: to evaluate safety and efficacy
  - From growing body of scientific knowledge: in biomedical research literature
- Gigabits/patient/day
  - High sampling rates
  - Multiple signals
- Accumulating data is getting easier, but using data is hard

Secure & Trustworthy Cyberspace
Securing our Nation's Data & Com
- New interdisciplinary program that aims to support fundamental scientific advances and technologies to protect cyber-systems from malicious behavior, while preserving privacy and promoting usability.

Cyberlearning: Transforming Education
Integrate technology with knowledge about how people learn
Goals:
- Understand how people learn in technology-rich environments
- Design and study ways in which innovative technologies and tools can promote learning and support assessment
- Prototype new technologies and integrate them into learning environments

Big Opportunities for the Future
- Our investments in research and education have returned exceptional dividends to our nation.
- Scientific discovery and technological innovation are at the core of our response to national and societal challenges, from environment, energy, transportation, sustainability and healthcare to cyber security and national defense.
- Many of tomorrow's breakthroughs will occur at the intersections of diverse disciplines.

Thanks! hwactlar@nsf.gov