The Digicene: the Age of Big Data in the Geosciences
1 The Digicene: the Age of Big Data in the Geosciences. Lee Allison, Arizona Geological Survey and USGIN Foundation, Inc. Earth Data Science in the Era of Big Data and Compute, April 30, 2015.
2 A system that works for one geologist, in the field and at their desk, and for users of streaming, High Performance Computing.
3 Long tail of science. [Figure: data volume (PB, TB, GB, MB) versus mainstream scientists.] About 20% of earth scientists use or need HPC capabilities; 80% rely predominantly on desktop software and data sets they collect themselves.
4 Big Data: Data Integration ("interoperability"). Capabilities for moving data between data warehousing, business analytics, master data management, enterprise applications, and custom applications. Integrating structured data in relational databases with social media data, weblogs, and various unstructured data. (Dain Hansen, Oracle Corp., 2012)
5 Big Data is not just large data sets: digitizing, organizing, analyzing, modeling, and visualizing [small] data from disparate sources.
6 National Geothermal Data System. Free online access to maps, data, & documents from 65+ providers nationwide. Distributed network: >10 million data records; >3 million oil & gas wells; >47,000 maps & reports. Nodes shown: Oregon Institute of Technology Geo Heat Center; Boise State; Stanford Reservoir Engineering; AZGS: 50 State Geological Surveys; U of Nevada Reno; University of Utah Energy & Geosciences Institute; U.S. DOE Geothermal Data Repository hosted at NREL; U.S. Geological Survey; Southern Methodist University. Powered by USGIN.
7 For mainstream science: Not centralized big iron, but decentralized data wrangling. Not "one ring to rule them all" but "small pieces loosely joined". Mass democratisation of the means of access, storage & processing of data. A distributed ecosystem of information, an ecosystem of small data. Distributed models, not centralized ones. Collaboration not control, and small data not big data. (Rufus Pollock, Open Knowledge Foundation, April 2013)
8 Small data culture, big data environment. Barriers to discovery, access, and integration of data have shaped scientific practice since its beginnings. Most of us work in small data because there is no viable alternative. Tackling interdisciplinary/multidisciplinary problems was not realistic. Many large geoscience databases are not really big data. Open data access and interoperability are changing the paradigm. Digitization, integration, and modeling are creating unprecedented opportunities and challenges. The new generation of geoscientists increasingly will need data science skills.
9 Turning legacy data...
10 ...into this (NERC 2008)
11 Creating common standards and protocols; engaging the vast number of distributed data resources; establishing practices for recognition of and respect for intellectual property; developing simple data and resource discovery and access systems; building mechanisms to encourage development of web service tools and workflows for data analysis; brokering the diverse disciplinary service buses; creating sustainable business models for maintenance and evolution of information resources; integrating the data management life-cycle into the practice of science.
12 Big Data and the Geoscience Profession: There will be a shortage of talent necessary for organizations to take advantage of big data. By 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big data to make effective decisions. (McKinsey Global Institute, 2011)
13 Global Cyberinfrastructure and Data Management Initiatives. Focus: Environment; Earth Observation; Geoscience; Marine; Petroleum/Energy; All. National/Continental: EarthCube (US only); Earth Science Information Partners (ESIP) (US); GeoSeas (EU); Renewable Energy Agency US; Inspire (EU only). Cross Continental: IGSN (EU, US, AUS); ODIP Oceans Data Interoperability Platform (US, EU, AUS); CoopEUS (EU, US). Global: Belmont e-Infrastructure; GEO/GEOSS; Geoscience Information Council; OneGeology; Energistics National Data Repositories; CODATA; ICSU World Data System; Research Data Alliance. Legend: USGIN involvement; ones GA is actively involved in; ones GA follows; ones mentioned in OneGeology agenda; ones not on OneGeology agenda. Geoscience Australia.
14 Geoscience Cyberinfrastructure. Sectors: Geothermal; Petroleum; Mining; Geological Surveys; Academia; Professional Organizations. Initiatives: National Geothermal Data System (NGDS); IRENA Global Atlas; African Rift Geothermal (ARGeo); Standards Leadership Council; National Data Repositories (NDR); African Minerals Geoscience Initiative; OneGeology; USGIN; USGS Community for Data Integration; EarthCube; Belmont Forum; Intl Geo Sample Numbering (IGSN); IUGS CGI GeoSciML; Coalition for Publishing Data in the Earth and Space Sciences.
15 Everything digital, online, & interoperable: productivity increase; enabling inter- and multidisciplinary research; enabling unheard-of analytical, computational, visualization, and modeling capabilities.
16 Digitization; open access; integration/interoperability; modeling, visualization, and analytical capacity; capacity building; social construct.
17 GLOBAL CONVERGENCE IN GEOSCIENCE CYBERINFRASTRUCTURE: DAWN OF THE DIGICENE
