Challenges and Solutions for Big Data in the Public Sector:

Size: px
Start display at page:

Download "Challenges and Solutions for Big Data in the Public Sector:"

Transcription

1 Challenges and Solutions for Big Data in the Public Sector: Digital Government Institute s Annual Big Data Conference, October 9, Washington, DC Reagan Building Dr. Brand Niemann Director and Senior Data Scientist Semantic Community October 9,

2 Overview Related Presentations: COM.BigData Conference (Keynote and Panel), August 4-6, Washington, DC, and IEEE 2014 Big Data Conference (Paper and NIST Big Data Workshop), October 27-30, Washington, DC. Moderator: Dr. Brand Niemann, Director and Senior Data Scientist, Semantic Community, and Co-organizer, Federal Big Data Working Group Meetup Panelists: Dr. Tom Rindflesch, Information Research Specialist at Cognitive Science Branch, National Institutes for Health (NIH): Semantic Medline (Ontology, Cray Graph Appliance, and Relational Databases) Dr. Kirk Borne, Professor of Astrophysics and Computational Science, George Mason University: NSF Big Data Project of the Decade: LSST 2

3 Fourth Paradigm and Fourth Question The Fourth Paradigm of Science (1): First Paradigm. Observation, descriptions of natural phenomena, and experimentation. Second Paradigm. Theoretical science such as Newton s laws of motion and Maxwell s equations. Third Paradigm. Simulation and modelling, such as in astronomy. Fourth Paradigm. Data-intensive science that exploits the large volumes of data in new ways for scientific exploration, such as the International Virtual Observatory Alliance in astronomy. The Fourth Question of Big Data for Science (2): How was the data collected? Where is the data stored? What are the data results? Does the data story persuade? President Obama Discovers Big Data in 2009 (1) Bell G, Hey, T., & Szalay, A. (2009) Beyond the data deluge, Science 323, 6 March 2009, pp (2) de Waard, Anita, (2014) About Stories, that Persuade With Data, Federal Big Data Working Group Meetup, 20 May,, 41 slides. 3

4 Mission Statement Federal: Supports the Federal Big Data Initiative, but not endorsed by the Federal Government or its Agencies; Big Data: Supports the Federal Digital Government Strategy which is "treating all content as data", so big data = all your content; Working Group: Data Science Teams composed of Federal Government and Non-Federal Government experts producing big data products (How was the data collected, Where is it stored, What are the results, and Does the data story persuade?); and Meetup: The world's largest network of local groups to revitalize local community and help people around the world self-organize like MOOCs (Massive Open On-line Classes) being considered by the White House to reduce the cost of higher education. Co-organizers: Brand Niemann and Katherine Goodier 4

5 What Are We Doing? Leadership of the Semantic Data Science Team that produced Semantic Medline running on the Yarc Data Graph Appliance. Founding and co-organizing of the Federal Big Data Working Group Meetup. A graduate class prepared for GMU entitled Practical Data Science for Data Scientists. Using the Cross Industry Standard Process for Data Mining (CRISP-DM; Shearer, 2000) to build a Data Science Knowledge Base Mining of the Data Science and Digital Earth scientific journals for the CODATA International Workshop on Big Data for International Scientific Programmes, June 8-9, in Beijing. Participation in the Data FAIRport (Findable, Accessible, Interoperable, and Reusable) with Data Publication in Data Browsers. Providing data stories that persuade and presentation materials for public education conferences like the COM.BigData Conference, August 4-6, in Washington, DC. 5

6 NIH Data Commons Dr. Phil Bourne (7/30/2014): Rules, Credit/Not Money, & More Offline My Note: Registries, Repositories, Clearinghouses, Portals, GitHubs, Data Commons, & Data FAIRports to MindTouch and Spotfire 6

7 How Are We Doing It? Federating Uses Cases: Data Science (Brand Niemann); Environmental and Earth Science (Joan Aron); and Astronomy (Kirk Borne) Federating Data Publications: Structured Scientific Content (Papers, journals, books, reports, etc.); Data FAIRports (Findable, Accessible, Interoperable); and Reusable Data Stories That Persuade (Claims and Evidence) Federating Solutions & Technologies: Hand-Crafted by Individuals and Teams (Mary Galvin, STEM); Data Mining Standards and Products (Brand Niemann, Data Publications in Data Browsers); Machine Processing (Fredrik Salvesen, Semantic Data Publications on Yarc Data Graph Appliance); Reading and Reasoning (Katherine Goodier and Chuck Rehberg (Semantic Insights on Elsevier Content Text Mining); and Data Curation at Scale (Alan Wagner, Tamr on 1000s of Spreadsheets) 7

8 Data Science for JHU DIBBs Project: Knowledge Bases Data Science Data Publication: Table of Contents is An Ontology! Data Science Publication Index: Index is Linked Open Data! Data Science for JHU DIBBs Project SDSS.xlsx 8

9 Data Science for JHU DIBBs Project: Analytics & Visualizations Spotfire Content, Network, and Data Analytics and Data Ecosystem: Spotfire is a Microscope and a Telescope! Web Player 9

10 Data Science for JHU DIBBs Project: Conclusions Science is increasingly driven by data (big and small) New instruments: microscopes & telescopes for data A major challenge on the long tail A new, Fourth Paradigm of Science is emerging SDSS has been at the cusp of this transition Now the SciServer is continuing the legacy Gray's Law of Data Engineering: Scientific computing is revolving around data Need scale out solution for analysis Take the analysis to the data! Start with 20 queries Go from working to working Source: 10

How To Teach Data Science

How To Teach Data Science The Past, Present, and Future of Data Science Education Kirk Borne @KirkDBorne http://kirkborne.net George Mason University School of Physics, Astronomy, & Computational Sciences Outline Research and Application

More information

Computational Science and Informatics (Data Science) Programs at GMU

Computational Science and Informatics (Data Science) Programs at GMU Computational Science and Informatics (Data Science) Programs at GMU Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ Outline Graduate Program

More information

The Research Data Revolution. 2015 Harvard/Purdue Data Symposium Sayeed Choudhury

The Research Data Revolution. 2015 Harvard/Purdue Data Symposium Sayeed Choudhury The Research Data Revolution 2015 Harvard/Purdue Data Symposium Sayeed Choudhury Data Conservancy (DC) One of five awards through US National Science Foundation s (NSF) DataNet program $10 million award

More information

Conquering the Astronomical Data Flood through Machine

Conquering the Astronomical Data Flood through Machine Conquering the Astronomical Data Flood through Machine Learning and Citizen Science Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ The Problem:

More information

I N T E L L I G E N T S O L U T I O N S, I N C. DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD

I N T E L L I G E N T S O L U T I O N S, I N C. DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD I N T E L L I G E N T S O L U T I O N S, I N C. OILFIELD DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD 5 5 T A R A P L A C E M O R G A N T O W N, W V 2 6 0 5 0 USA

More information

ICSU and the Challenge of Big Data in Science

ICSU and the Challenge of Big Data in Science ICSU and the Challenges of Big Data in Science Elsevier Conference on Big Data, E-Science and Science Policy 16 17 May 2012 Canberra Professor Ray Harris UCL International Council for Science ICSU 121

More information

a Data Science initiative @ Univ. Piraeus [GR]

a Data Science initiative @ Univ. Piraeus [GR] a Data Science initiative @ Univ. Piraeus [GR] The Data Science Lab members June 2015 What is Data Science source: quora.com! Looking at data! Tools and methods used to analyze large amounts of data! Anything

More information

Data Driven Discovery In the Social, Behavioral, and Economic Sciences

Data Driven Discovery In the Social, Behavioral, and Economic Sciences Data Driven Discovery In the Social, Behavioral, and Economic Sciences Simon Appleford, Marshall Scott Poole, Kevin Franklin, Peter Bajcsy, Alan B. Craig, Institute for Computing in the Humanities, Arts,

More information

curation, analyses and interpretation of massive datasets opportunities are varied across disciplines

curation, analyses and interpretation of massive datasets opportunities are varied across disciplines ! Efficiency in scientific discovery through curation, analyses and interpretation of massive datasets! Uptake level and concentration on Big Data opportunities are varied across disciplines The nature

More information

CASC Spring Meeting 2014 Federal Agency Panel Update on Big Data

CASC Spring Meeting 2014 Federal Agency Panel Update on Big Data CASC Spring Meeting 2014 Federal Agency Panel Update on Big Data Robert Chadduck Program Director, Data & CI CISE Division of Advanced Cyberinfrastructure 23 April 2014 ACI data focused CI - A view towards

More information

Data at NIST: A View from the Office of Data and Informatics

Data at NIST: A View from the Office of Data and Informatics Data at NIST: A View from the Office of Data and Informatics Robert Hanisch Office of Data and Informatics Material Measurement Laboratory National Institute of Standards and Technology Data and NIST 1

More information

Data Literacy For All: Astrophysics and Beyond (Astronomy is evidence-based forensic science, thus it is a data & information science)

Data Literacy For All: Astrophysics and Beyond (Astronomy is evidence-based forensic science, thus it is a data & information science) Data Literacy For All: Astrophysics and Beyond (Astronomy is evidence-based forensic science, thus it is a data & information science) Kirk Borne George Mason University, Fairfax, VA www.kirkborne.net

More information

Data-Intensive Science and Scientific Data Infrastructure

Data-Intensive Science and Scientific Data Infrastructure Data-Intensive Science and Scientific Data Infrastructure Russ Rew, UCAR Unidata ICTP Advanced School on High Performance and Grid Computing 13 April 2011 Overview Data-intensive science Publishing scientific

More information

Astrophysics with Terabyte Datasets. Alex Szalay, JHU and Jim Gray, Microsoft Research

Astrophysics with Terabyte Datasets. Alex Szalay, JHU and Jim Gray, Microsoft Research Astrophysics with Terabyte Datasets Alex Szalay, JHU and Jim Gray, Microsoft Research Living in an Exponential World Astronomers have a few hundred TB now 1 pixel (byte) / sq arc second ~ 4TB Multi-spectral,

More information

可 视 化 与 可 视 计 算 概 论. Introduction to Visualization and Visual Computing 袁 晓 如 北 京 大 学 2015.12.23

可 视 化 与 可 视 计 算 概 论. Introduction to Visualization and Visual Computing 袁 晓 如 北 京 大 学 2015.12.23 可 视 化 与 可 视 计 算 概 论 Introduction to Visualization and Visual Computing 袁 晓 如 北 京 大 学 2015.12.23 2 Visual Analytics Adapted from Jim Thomas s slides 3 Visual Analytics Definition Visual Analytics is the

More information

Considering the Way Forward for Data Science and International Climate Science

Considering the Way Forward for Data Science and International Climate Science Considering the Way Forward for Data Science and International Climate Science Improving Data Mobility and Management for International Climate Science July 14-16, 2014 Boulder, CO Sara J. Graves, Ph.D.

More information

BYODs & FAIR Data Stewardship

BYODs & FAIR Data Stewardship BYODs & FAIR Data Stewardship Luiz Olavo Bonino luiz.bonino@dtls.nl www.elixir-europe.org Summary FAIR Data stewardship Approach in NL BYOD FAIR Data tooling ecosystem Way of working (FAIR) Data Stewardship

More information

Learning from Big Data in

Learning from Big Data in Learning from Big Data in Astronomy an overview Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ From traditional astronomy 2 to Big Data

More information

The Best Way to Get BIG DATA is By Starting Small

The Best Way to Get BIG DATA is By Starting Small The Best Way to Get BIG DATA is By Starting Small Dr. Brand Niemann Director and Senior Data Scientist Semantic Community for Johns Hopkins University School of Medicine and Modus Operandi http://semanticommunity.info/

More information

The Challenge of Handling Large Data Sets within your Measurement System

The Challenge of Handling Large Data Sets within your Measurement System The Challenge of Handling Large Data Sets within your Measurement System The Often Overlooked Big Data Aaron Edgcumbe Marketing Engineer Northern Europe, Automated Test National Instruments Introduction

More information

Manjula Ambur NASA Langley Research Center April 2014

Manjula Ambur NASA Langley Research Center April 2014 Manjula Ambur NASA Langley Research Center April 2014 Outline What is Big Data Vision and Roadmap Key Capabilities Impetus for Watson Technologies Content Analytics Use Potential use cases What is Big

More information

Big Data Hope or Hype?

Big Data Hope or Hype? Big Data Hope or Hype? David J. Hand Imperial College, London and Winton Capital Management Big data science, September 2013 1 Google trends on big data Google search 1 Sept 2013: 1.6 billion hits on big

More information

Big Data to Knowledge (BD2K)

Big Data to Knowledge (BD2K) Big Data to Knowledge () potential funding agency synergies Jennie Larkin, PhD Office of the Associate Director of Data Science National Institutes of Health idash-pscanner meeting UCSD September 16, 2014

More information

Global Scientific Data Infrastructures: The Big Data Challenges. Capri, 12 13 May, 2011

Global Scientific Data Infrastructures: The Big Data Challenges. Capri, 12 13 May, 2011 Global Scientific Data Infrastructures: The Big Data Challenges Capri, 12 13 May, 2011 Data-Intensive Science Science is, currently, facing from a hundred to a thousand-fold increase in volumes of data

More information

Academic Education in Era of Digital Culture

Academic Education in Era of Digital Culture Academic Education in Era of Digital Culture Ilya Levin Tel Aviv University, Tel Aviv, Israel ilia1@post.tau.ac.il Abstract: The paper reports results of a theoretical research studying the academic education

More information

Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration

Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration Revised Proposal from The National Academies Summary An NRC-appointed committee will plan and organize a cross-disciplinary

More information

College of Science George Mason University Fairfax, VA 22030

College of Science George Mason University Fairfax, VA 22030 College of Science George Mason University Fairfax, VA 22030 Dr. Sidney Wolff and the LSST Board of Directors LSST Corporation 933 N. Cherry Avenue Tucson, AZ 85721-0009 June 14, 2010 Dear Dr. Wolff and

More information

Data analysis of L2-L3 products

Data analysis of L2-L3 products Data analysis of L2-L3 products Emmanuel Gangler UBP Clermont-Ferrand (France) Emmanuel Gangler BIDS 14 1/13 Data management is a pillar of the project : L3 Telescope Caméra Data Management Outreach L1

More information

NASA Earth Science Research in Data and Computational Science Technologies Report of the ESTO/AIST Big Data Study Roadmap Team September 2015

NASA Earth Science Research in Data and Computational Science Technologies Report of the ESTO/AIST Big Data Study Roadmap Team September 2015 NASA Earth Science Research in Data and Computational Science Technologies Report of the ESTO/AIST Big Data Study Roadmap Team September 2015 I. Background Over the next decade, the dramatic growth of

More information

Analytics-as-a-Service: From Science to Marketing

Analytics-as-a-Service: From Science to Marketing Analytics-as-a-Service: From Science to Marketing Data Information Knowledge Insights (Discovery & Decisions) Kirk Borne George Mason University, Fairfax, VA www.kirkborne.net @KirkDBorne Big Data: What

More information

EDISON Education for Data Intensive Science to Open New science frontiers

EDISON Education for Data Intensive Science to Open New science frontiers H2020 INFRASUPP-4 CSA Project EDISON Education for Data Intensive Science to Open New science frontiers Yuri Demchenko University of Amsterdam Outline Consortium members EDISON Project Concept and Objectives

More information

How To Understand And Understand The Science Of Astronomy

How To Understand And Understand The Science Of Astronomy Introduction to the VO Christophe.Arviset@esa.int ESAVO ESA/ESAC Madrid, Spain The way Astronomy works Telescopes (ground- and space-based, covering the full electromagnetic spectrum) Observatories Instruments

More information

Core Ideas of Engineering and Technology

Core Ideas of Engineering and Technology Core Ideas of Engineering and Technology Understanding A Framework for K 12 Science Education By Cary Sneider Last month, Rodger Bybee s article, Scientific and Engineering Practices in K 12 Classrooms,

More information

Evolution of Chinese Research Data Policy

Evolution of Chinese Research Data Policy Bilateral US-China CODATA Workshop 2014 Evolution of Chinese Research Data Policy Jianhui Li(lijh@cnic.cn) Computer Network Information Center, CAS CODATA-China 25 Aug 2014 Outline Scientific Data Sharing

More information

BIG DATA for. Government

BIG DATA for. Government The 4th AIE Symposium on BIG DATA for Government FEDERAL R&D, PLANS, AND OPPORTUNITIES Over Top 20 Experts from DoD, NITRD, NIST, RATB, DHS, DHHS, FBI, DIA, DOE, NASA, NRL, IBM, Harris, CSC, Amazon, Splunk,

More information

Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data.

Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data. Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data. 1 Advances in information technologies are transforming the fabric of our society and data represent

More information

Big Data in the context of Preservation and Value Adding

Big Data in the context of Preservation and Value Adding Big Data in the context of Preservation and Value Adding R. Leone, R. Cosac, I. Maggio, D. Iozzino ESRIN 06/11/2013 ESA UNCLASSIFIED Big Data Background ESA/ESRIN organized a 'Big Data from Space' event

More information

Training for Big Data

Training for Big Data Training for Big Data Learnings from the CATS Workshop Raghu Ramakrishnan Technical Fellow, Microsoft Head, Big Data Engineering Head, Cloud Information Services Lab Store any kind of data What is Big

More information

11-12 June 2015, Bari-Italy. Stefano Nativi CNR-IIA

11-12 June 2015, Bari-Italy. Stefano Nativi CNR-IIA 11-12 June 2015, Bari-Italy Stefano Nativi CNR-IIA Coordinating an Observation Network of Networks EnCompassing satellite and IN-situ to fill the Gaps in European Observations GEOSS Information System

More information

UCLA Graduate School of Education and Information Studies UCLA

UCLA Graduate School of Education and Information Studies UCLA UCLA Graduate School of Education and Information Studies UCLA Peer Reviewed Title: Slides for When use cases are not useful: Data practices, astronomy, and digital libraries Author: Wynholds, Laura, University

More information

Big Data. George O. Strawn NITRD

Big Data. George O. Strawn NITRD Big Data George O. Strawn NITRD Caveat auditor The opinions expressed in this talk are those of the speaker, not the U.S. government Outline What is Big Data? NITRD's Big Data Research Initiative Big Data

More information

Data Science at U of U

Data Science at U of U Data Science at U of U Je M. Phillips Assistant Professor, School of Computing Center for Extreme Data Management, Analysis, and Visualization Director, Data Management and Analysis Track University of

More information

Architecture 3.0 Landscape Analytics

Architecture 3.0 Landscape Analytics Architecture 3.0 Landscape Analytics Jürgen Döllner Hasso- Plattner- Institut Landscape Analytics Big Data Big Data Analytics Visual Analytics Predictive Analytics Landscape Analytics Big Data Data is

More information

The Tonnabytes Big Data Challenge: Transforming Science and Education. Kirk Borne George Mason University

The Tonnabytes Big Data Challenge: Transforming Science and Education. Kirk Borne George Mason University The Tonnabytes Big Data Challenge: Transforming Science and Education Kirk Borne George Mason University Ever since we first began to explore our world humans have asked questions and have collected evidence

More information

Big Data Management and Analytics

Big Data Management and Analytics Big Data Management and Analytics Lecture Notes Winter semester 2015 / 2016 Ludwig-Maximilians-University Munich Prof. Dr. Matthias Renz 2015 Based on lectures by Donald Kossmann (ETH Zürich), as well

More information

Panel on Big Data Challenges and Opportunities

Panel on Big Data Challenges and Opportunities Panel on Big Data Challenges and Opportunities Dr. Chaitan Baru Senior Advisor for Data Science, Directorate for Computer & Information Science & Engineering National Science Foundation NSF s Perspective

More information

An Introduction to Advanced Analytics and Data Mining

An Introduction to Advanced Analytics and Data Mining An Introduction to Advanced Analytics and Data Mining Dr Barry Leventhal Henry Stewart Briefing on Marketing Analytics 19 th November 2010 Agenda What are Advanced Analytics and Data Mining? The toolkit

More information

Taming the Internet of Things: The Lord of the Things

Taming the Internet of Things: The Lord of the Things Taming the Internet of Things: The Lord of the Things Kirk Borne @KirkDBorne School of Physics, Astronomy, & Computational Sciences College of Science, George Mason University, Fairfax, VA Taming the Internet

More information

Cloud and Big Data Standardisation

Cloud and Big Data Standardisation Cloud and Big Data Standardisation EuroCloud Symposium ICS Track: Standards for Big Data in the Cloud 15 October 2013, Luxembourg Yuri Demchenko System and Network Engineering Group, University of Amsterdam

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / -24.43%

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / -24.43% CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / -24.43% Overview The Cyberinfrastructure Framework for 21 st Century Science, Engineering,

More information

Data Analytics: The Next Big Thing in Information

Data Analytics: The Next Big Thing in Information Data Analytics: The Next Big Thing in Information June Crowe and Joseph R. Candlish (United States) Abstract Information is now available in an overabundance, so much so, that distinguishing the noise

More information

Integrating pharmacological data

Integrating pharmacological data Integrating pharmacological data For scientists For software and application developers A semantic data integration infrastructure Open PHACTS is a 3-year project of the Innovative Medicines Initiative

More information

Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing.

Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Dr Liz Lyon, UKOLN, University of Bath Introduction and Objectives UKOLN is undertaking

More information

Bringing the Night Sky Closer: Discoveries in the Data Deluge

Bringing the Night Sky Closer: Discoveries in the Data Deluge EARTH AND ENVIRONMENT Bringing the Night Sky Closer: Discoveries in the Data Deluge Alyssa A. Goodman Harvard University Curtis G. Wong Microsoft Research Th r o u g h o u t h i s t o r y, a s t r o n

More information

Data-Driven Discovery through e-science Technologies

Data-Driven Discovery through e-science Technologies Data-Driven Discovery through e-science Technologies Kirk D. Borne George Mason University, QSS Group Inc., and NASA-GSFC Kirk.borne@gsfc.nasa.gov Abstract Future space missions and science programs will

More information

A Capability Maturity Model for Scientific Data Management

A Capability Maturity Model for Scientific Data Management A Capability Maturity Model for Scientific Data Management 1 A Capability Maturity Model for Scientific Data Management Kevin Crowston & Jian Qin School of Information Studies, Syracuse University July

More information

A Strategic Approach to Unlock the Opportunities from Big Data

A Strategic Approach to Unlock the Opportunities from Big Data A Strategic Approach to Unlock the Opportunities from Big Data Yue Pan, Chief Scientist for Information Management and Healthcare IBM Research - China [contacts: panyue@cn.ibm.com ] Big Data or Big Illusion?

More information

A Component of Professional Skills Workshops for Graduate Research Students

A Component of Professional Skills Workshops for Graduate Research Students A Component of Professional Skills Workshops for Graduate Research Students 06/03/2012 Research Data Management Seminar, February 1-2, 2012, Carleton University 1 Seminar presenters Ernie Boyko, Carleton

More information

ANALYTICS CENTER LEARNING PROGRAM

ANALYTICS CENTER LEARNING PROGRAM Overview of Curriculum ANALYTICS CENTER LEARNING PROGRAM The following courses are offered by Analytics Center as part of its learning program: Course Duration Prerequisites 1- Math and Theory 101 - Fundamentals

More information

NITRD and Big Data. George O. Strawn NITRD

NITRD and Big Data. George O. Strawn NITRD NITRD and Big Data George O. Strawn NITRD Caveat auditor The opinions expressed in this talk are those of the speaker, not the U.S. government Outline What is Big Data? Who is NITRD? NITRD's Big Data Research

More information

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) Overview The Cyberinfrastructure Framework for 21 st Century Science, Engineering, and Education (CIF21) investment

More information

Stephen M. Fiore, Ph.D. University of Central Florida Cognitive Sciences, Department of Philosophy and Institute for Simulation & Training

Stephen M. Fiore, Ph.D. University of Central Florida Cognitive Sciences, Department of Philosophy and Institute for Simulation & Training Stephen M. Fiore, Ph.D. University of Central Florida Cognitive Sciences, Department of Philosophy and Institute for Simulation & Training Fiore, S. M. (2015). Collaboration Technologies and the Science

More information

The Legacy Value of Large Public Surveys: the SDSS Archive. Alexander Szalay The Johns Hopkins University

The Legacy Value of Large Public Surveys: the SDSS Archive. Alexander Szalay The Johns Hopkins University The Legacy Value of Large Public Surveys: the SDSS Archive Alexander Szalay The Johns Hopkins University Sloan Digital Sky Survey The Cosmic Genome Project Started in 1992, finished in 2008 Data is public

More information

Medical Data Review and Exploratory Data Analysis using Data Visualization

Medical Data Review and Exploratory Data Analysis using Data Visualization Paper PP10 Medical Data Review and Exploratory Data Analysis using Data Visualization VINOD KERAI, ROCHE, WELWYN, UKINTRODUCTION Drug Development has drastically changed in the last few decades. There

More information

Computer and Information Scientists $105,370.00. Computer Systems Engineer. Aeronautical & Aerospace Engineer Compensation Administrator

Computer and Information Scientists $105,370.00. Computer Systems Engineer. Aeronautical & Aerospace Engineer Compensation Administrator Reinhardt University Name: Francesco Strazzullo Group: Faculty Major Selection Summary Saved Majors Careers That Match Mathematics Saved Occupation Name Mean Salary Bank and Branch Managers $113,730.00

More information

Navigating Big Data business analytics

Navigating Big Data business analytics mwd a d v i s o r s Navigating Big Data business analytics Helena Schwenk A special report prepared for Actuate May 2013 This report is the third in a series and focuses principally on explaining what

More information

ANALYTICS IN BIG DATA ERA

ANALYTICS IN BIG DATA ERA ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY, DISCOVER RELATIONSHIPS AND CLASSIFY HUGE AMOUNT OF DATA MAURIZIO SALUSTI SAS Copyr i g ht 2012, SAS Ins titut

More information

NIH As A Digital Enterprise Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health

NIH As A Digital Enterprise Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health NIH As A Digital Enterprise Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health Data Science Timeline 6/12 Findings: Sharing data & software through catalogs Support

More information

Organizational Implications of Data Science Environments in Education, Research, and Research Management in Libraries

Organizational Implications of Data Science Environments in Education, Research, and Research Management in Libraries Organizational Implications of Data Science Environments in Education, Research, and Research Management in Libraries Erik Mitchell Associate University Librarian & Associate CIO U of California, Berkeley

More information

Big Data and Science: Myths and Reality

Big Data and Science: Myths and Reality Big Data and Science: Myths and Reality H.V. Jagadish http://www.eecs.umich.edu/~jag Six Myths about Big Data It s all hype It s all about size It s all analysis magic Reuse is easy It s the same as Data

More information

Human Brain Project -

Human Brain Project - Human Brain Project - Scientific goals, Organization, Our role Wissenswerte, Bremen 26. Nov 2013 Prof. Sonja Grün Insitute of Neuroscience and Medicine (INM-6) & Institute for Advanced Simulations (IAS-6)

More information

ASCR Program Response to the Report of the ASCAC Committee of Visitors Review of the Computer Science Program

ASCR Program Response to the Report of the ASCAC Committee of Visitors Review of the Computer Science Program 1(a): Efficacy and quality of the processes used to solicit, review, recommend and document application and proposal actions: Continue to improve the online information management capabilities of the program

More information

High Performance Computing

High Performance Computing High Parallel Computing Hybrid Program Coding Heterogeneous Program Coding Heterogeneous Parallel Coding Hybrid Parallel Coding High Performance Computing Highly Proficient Coding Highly Parallelized Code

More information

The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory

The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory Astronomy in the XXI century The Internet revolution (the dot com boom ) has transformed

More information

BIG Data Analytics Move to Competitive Advantage

BIG Data Analytics Move to Competitive Advantage BIG Data Analytics Move to Competitive Advantage where is technology heading today Standardization Open Source Automation Scalability Cloud Computing Mobility Smartphones/ tablets Internet of Things Wireless

More information

Data tells stories, and business analytics depends on data. Hearing the stories in the data, however, only happens when you have people with the

Data tells stories, and business analytics depends on data. Hearing the stories in the data, however, only happens when you have people with the Data tells stories, and business analytics depends on data. Hearing the stories in the data, however, only happens when you have people with the right listening skills. Skilled people are the heart of

More information

Databases & Data Infrastructure. Kerstin Lehnert

Databases & Data Infrastructure. Kerstin Lehnert + Databases & Data Infrastructure Kerstin Lehnert + Access to Data is Needed 2 to allow verification of research results to allow re-use of data + The road to reuse is perilous (1) 3 Accessibility Discovery,

More information

Research Data Alliance: Current Activities and Expected Impact. SGBD Workshop, May 2014 Herman Stehouwer

Research Data Alliance: Current Activities and Expected Impact. SGBD Workshop, May 2014 Herman Stehouwer Research Data Alliance: Current Activities and Expected Impact SGBD Workshop, May 2014 Herman Stehouwer The Vision 2 Researchers and innovators openly share data across technologies, disciplines, and countries

More information

MOOCdb: Developing Data Standards for MOOC Data Science

MOOCdb: Developing Data Standards for MOOC Data Science MOOCdb: Developing Data Standards for MOOC Data Science Kalyan Veeramachaneni, Franck Dernoncourt, Colin Taylor, Zachary Pardos, and Una-May O Reilly Massachusetts Institute of Technology, USA. {kalyan,francky,colin

More information

NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons

NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons The NIH Commons Summary The Commons is a shared virtual space where scientists can work with the digital objects of biomedical research, i.e. it is a system that will allow investigators to find, manage,

More information

The Fourth Paradigm: Data-Intensive Scientific Discovery, Open Science and the Cloud

The Fourth Paradigm: Data-Intensive Scientific Discovery, Open Science and the Cloud The Fourth Paradigm: Data-Intensive Scientific Discovery, Open Science and the Cloud Tony Hey Senior Data Science Fellow escience Institute University of Washington tony.hey@live.com The Fourth Paradigm:

More information

SCIENCE DATA ANALYSIS ON THE CLOUD

SCIENCE DATA ANALYSIS ON THE CLOUD SCIENCE DATA ANALYSIS ON THE CLOUD ESIP Cloud Computing Cluster Thomas Huang and Phil Yang Agenda Invited speakers Petr Votava, NASA Earth Exchange (NEX): Early Observations on Community Engagement in

More information

& ENTERPRISE DATA COST AND SCALE WAREHOUSE AUGMENTATION BIG DATA COST, SCALABILITY

& ENTERPRISE DATA COST AND SCALE WAREHOUSE AUGMENTATION BIG DATA COST, SCALABILITY COST AND SCALE BIG DATA COST, SCALABILITY & ENTERPRISE DATA 1 WAREHOUSE AUGMENTATION To derive the most value from Big Data technologies, enterprises must solve the cost and scalability problems inherent

More information

Increase Revenue THE JOURNEY TO BIG DATA. Gary Evans. CTO EMC Ireland. Twitter.com/Gary3vans. Copyright 2013 EMC Corporation. All rights reserved.

Increase Revenue THE JOURNEY TO BIG DATA. Gary Evans. CTO EMC Ireland. Twitter.com/Gary3vans. Copyright 2013 EMC Corporation. All rights reserved. THE JOURNEY TO BIG DATA Increase Revenue Gary Evans CTO EMC Ireland Twitter.com/Gary3vans 1 THE VALUE OF BIG DATA VARIETY VELOCITY BIG DATA VOLUME COMPLEXITY organizations can earn an incremental ROI of

More information

Symposium on the Interagency Strategic Plan for Big Data: Focus on R&D

Symposium on the Interagency Strategic Plan for Big Data: Focus on R&D Symposium on the Interagency Strategic Plan for Big Data: Focus on R&D NAS Board on Research Data and Information October 23, 2014 Big Data Senior Steering Group (BDSSG) Allen Dearry, NIH, Co-Chair Suzi

More information

The 4 Pillars of Technosoft s Big Data Practice

The 4 Pillars of Technosoft s Big Data Practice beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed

More information

urika! Unlocking the Power of Big Data at PSC

urika! Unlocking the Power of Big Data at PSC urika! Unlocking the Power of Big Data at PSC Nick Nystrom Director, Strategic Applications Pittsburgh Supercomputing Center February 1, 2013 nystrom@psc.edu 2013 Pittsburgh Supercomputing Center Big Data

More information

Data Mining Challenges and Opportunities in Astronomy

Data Mining Challenges and Opportunities in Astronomy Data Mining Challenges and Opportunities in Astronomy S. G. Djorgovski (Caltech) With special thanks to R. Brunner, A. Szalay, A. Mahabal, et al. The Punchline: Astronomy has become an immensely datarich

More information

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD 72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD Paulo Gottgtroy Auckland University of Technology Paulo.gottgtroy@aut.ac.nz Abstract This paper is

More information

Exploitation of ISS scientific data

Exploitation of ISS scientific data Cooperative ISS Research data Conservation and Exploitation Exploitation of ISS scientific data Luigi Carotenuto Telespazio s.p.a. Copernicus Big Data Workshop March 13-14 2014 European Commission Brussels

More information

Panel on Emerging Cyber Security Technologies. Robert F. Brammer, Ph.D., VP and CTO. Northrop Grumman Information Systems.

Panel on Emerging Cyber Security Technologies. Robert F. Brammer, Ph.D., VP and CTO. Northrop Grumman Information Systems. Panel on Emerging Cyber Security Technologies Robert F. Brammer, Ph.D., VP and CTO Northrop Grumman Information Systems Panel Moderator 27 May 2010 Panel on Emerging Cyber Security Technologies Robert

More information

The New Computational and Data Sciences Undergraduate Program at George Mason University

The New Computational and Data Sciences Undergraduate Program at George Mason University The New Computational and Data Sciences Undergraduate Program at George Mason University Kirk Borne, John Wallin, and Robert Weigel Computational and Data Sciences, George Mason University, Fairfax, VA

More information

The data forest. Application. Application Application DATA. Office of Research

The data forest. Application. Application Application DATA. Office of Research The data forest DATA Unfortunately Data to the rescue The Rensselaer IDEA HPC: Computational Science and Engineering + Data Science and Predictive Analytics + Cognitive Computing + Perceptualization DATA

More information

BIG DATA PUBLIC PRIVATE FORUM

BIG DATA PUBLIC PRIVATE FORUM BIG DATA PUBLIC PRIVATE FORUM Agenda 09:00-10:30 9:00-9:20 9:20-9:55 9:55-10:30 The Big Project Results (Session 1) - The Big Project - Welcome and Introduction Nuria De Lama (ATOS Spain) - Key Technology

More information

Standard Big Data Architecture and Infrastructure

Standard Big Data Architecture and Infrastructure Standard Big Data Architecture and Infrastructure Wo Chang Digital Data Advisor Information Technology Laboratory (ITL) National Institute of Standards and Technology (NIST) wchang@nist.gov May 20, 2016

More information

Statistics, Data Mining and Machine Learning in Astronomy: A Practical Python Guide for the Analysis of Survey Data. and Alex Gray

Statistics, Data Mining and Machine Learning in Astronomy: A Practical Python Guide for the Analysis of Survey Data. and Alex Gray Statistics, Data Mining and Machine Learning in Astronomy: A Practical Python Guide for the Analysis of Survey Data Željko Ivezić, Andrew J. Connolly, Jacob T. VanderPlas University of Washington and Alex

More information

COGNITIVE SCIENCE AND NEUROSCIENCE

COGNITIVE SCIENCE AND NEUROSCIENCE COGNITIVE SCIENCE AND NEUROSCIENCE Overview Cognitive Science and Neuroscience is a multi-year effort that includes NSF s participation in the Administration s Brain Research through Advancing Innovative

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) Goal Develop and deploy comprehensive, integrated, sustainable, and secure cyberinfrastructure (CI) to accelerate research

More information

Standards for Big Data in the Cloud

Standards for Big Data in the Cloud Standards for Big Data in the Cloud International Cloud Symposium 15/10/2013 Carola Carstens (Project Officer) DG CONNECT, Unit G3 Data Value Chain European Commission Outline 1) Data Value Chain Unit

More information