Digital Earth: Big Data, Heritage and Social Science



Similar documents
Doing Multidisciplinary Research in Data Science

Applications for Business Intelligence, Predictive Analytics and Big Data

Gi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis. Opportunities in Power Distribution Network System Optimization (from EDA Perspective)

Introduction to the Mathematics of Big Data. Philippe B. Laval

Introduction to Predictive Analytics. Dr. Ronen Meiri

BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013

Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

How To Use Hadoop For Gis

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

Advanced Image Management using the Mosaic Dataset

Age of Big data. Presented by: Mohammad Iqbal BCM -2014

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof.

Big Data; Old News or New Hype? Marcel den Hartog, June 2012

Computer Logic (2.2.3)

Definition of Computers. INTRODUCTION to COMPUTERS. Historical Development ENIAC

The premier software for extracting information from geospatial imagery.

We are Big Data A Sonian Whitepaper

The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP

What happens when Big Data and Master Data come together?

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

What Is Big Data? Craig C. Douglas University of Wyoming

the Cloud Dean McCormick Regional Sales Manager Geospatial MEA Region

ENVI THE PREMIER SOFTWARE FOR EXTRACTING INFORMATION FROM GEOSPATIAL IMAGERY.

DATA ANALYTICS: GO BIG OR GO HOME

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

IT Platforms for Utilization of Big Data

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

T a c k l i ng Big Data w i th High-Performance

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI

Cloud storage Megas, Gigas and Teras

A New Era Of Analytic

IDC Digital Universe Study: Big Data, Bigger Digital Shadows and Biggest Growth in the Far East

Extraction of Satellite Image using Particle Swarm Optimization

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

How To Use Data From Copernicus And Big Data To Help The Environment

JAPAN UNIVERSE. RICH DATA & the Increasing Value of the INTERNET OF THINGS. The DIGITAL UNIVERSE of OPPORTUNITIES GET STARTED COUNTRY BRIEF

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Architectures for massive data management

Keystone Image Management System

SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS

Big Data: Study in Structured and Unstructured Data

A View on the Future of Tape

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Data Storage At the Heart of any Information System. Ken Claffey, VP/GM - June 2015

<Insert Picture Here> Data Management Innovations for Massive Point Cloud, DEM, and 3D Vector Databases

Pervasive Location Analytics and A Billion Dollar Opportunity. Jitender Aswani, Portfolio Strategist, Business Analytics, SAP

Foundations of Business Intelligence: Databases and Information Management

A Web-Based Library and Algorithm System for Satellite and Airborne Image Products

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016

BIG DATA CHALLENGES AND PERSPECTIVES

Mr. Jay Arnold, Magnolia River Mr. Richard Vincent, McKim & Creed

DIGITAL UNIVERSE UNIVERSE

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

How To Use Big Data In Healthcare

Analytics Data Discovery QlikView

Big data, big deal? Presented by: David Stanley, CTO PCI Geomatics

Big Data Analytics. Genoveva Vargas-Solar French Council of Scientific Research, LIG & LAFMIA Labs

Big Data: Image & Video Analytics

Foundations of Business Intelligence: Databases and Information Management

Big Data and Analytics: A Conceptual Overview. Mike Park Erik Hoel

Peter Rakers De Verstoring van Big Data

lesson 1 An Overview of the Computer System

Determining Your Computer Resources

Algorithms and Methods for Distributed Storage Networks 7 File Systems Christian Schindelhauer

Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data

The HP IT Transformation Story

Feasibility Study of Searchable Image Encryption System of Streaming Service based on Cloud Computing Environment

Game On: How Information is Changing the Rules of Insurance

Outline. What is Big data and where they come from? How we deal with Big data?

An Open Dynamic Big Data Driven Applica3on System Toolkit

Big Data and Cloud Computing for GHRSST

Laurence Liew General Manager, APAC. Economics Is Driving Big Data Analytics to the Cloud

Big Data: Public Sector Opportunities, Challenges, and Implications

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

Real Time Big Data Processing

GEOSPATIAL DIGITAL ASSET MANAGEMENT A SOLUTION INTEGRATING IMAGERY AND GIS WHERE WILL ALL THE PIXELS GO?(AND HOW WILL WE EVER FIND THEM?

Analytical Tools: What Auditors Need to Know About Big Data

BIG DATA, BIOBANKS AND PREDICTIVE ANALYTICS FOR A BETTER CLINICAL OUTCOME

Discovering Computers Chapter 7 Storage

Mining Big Data. Pang-Ning Tan. Associate Professor Dept of Computer Science & Engineering Michigan State University

Transcription:

Digital Earth: Big Data, Heritage and Social Science The impact on geographic information and GIS

Geographic Information Systems Analysis for Decision Support Impact of Big Data Digital Earth Citizen Engagement Analysis for Decision Support

Traditional GIS (self contained) imagery maps town plans cadastral data networks retrieval and processing software all data stored locally decision support product at agency level It is a decision support tool, in that it manipulates and presents data in a way that allows the user to form inferences or make decisions that otherwise would not be possible or as easy. Humans(resource managers) are still involved.

There has been a move towards web-serving data supplier 1 data supplier 2 data supplier 3 original GIS -all data stored locally problems with data duplication and maintenance data user 1 data user 2 value added products data maintenance handled by principal suppliers data access almost unlimited

Cloud served GIS has now been proposed (ESRI) in which both the data and processing software is on remote servers data supplier 1 data supplier 2 data supplier 3 GIS software data user 2 data user 1 value added products

The Geographic Information System The GIS model is variable different people have different needs and concepts What has been constant is the underlying spatial data reference such as a map grid. Will the planar grid in the GIS be replaced by the virtual globe? Explore the digital earth concept.

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science Digital Earth The concept is well established; it originated in 1998 and 1999 attributes addressed as on a globe Mercator globe from space Google Earth like A. Gore, The Digital Earth: Understanding our Planet in the 21 st Century, PERS, vol. 65, no 5., 1999, p528.

Digital Earth Because it has a virtual globe basis it is immediately accessible by anyone with a reasonable home computer. It is GIS-like because of the data layers added by other users and researchers. heritage, cultural, and social layers physical layers complex analysis for decision support The citizen scientist, although possibly untrained in the formal sense, may have much to add even simple facts, in aggregate over many people, can add important information to the data base.

What can we learn from the Internet about the role of the citizen, often an untrained amateur? My first suspicion of something born so instantaneously perfect is an alien origin. There was probably a meeting on a distant planet where they debated whether or not it was time for the earth to have the web. They ll only botch it up, the opponents probably argued. They ll put all kinds of garbage on it and ruin our good idea. R.W. Lucky, Bellcore, 1995

But the alien proponents and supporters of the Web carried the day. The earth has all sorts of talented people eager to get their material out to the public they undoubtedly said. Wonderful things will happen, just wait and see. R.W. Lucky, Bellcore, 1995

The Web is proving the efficacy of the long believed and hoped for Field of Dreams approach if we build it they will come. The people of the earth have been empowered by the Web and there has been an incredible outpouring of creativity When you empower a few, its not much competition. But when you empower tens of millions, surprising things will happen, some of them breathtakingly good R.W. Lucky, Bellcore, 1995

Citizen user interaction The internet and social networks give the citizen user ready and immediate access to data and results, give the citizen user a simple means for communicating, and allow the citizen user to communicatetheir own views to other citizens and to professionals alike.

We have seen an evolution from remote sensing, through GIS, to digital earth, and citizen science. remote sensing remote sensing satellites remote sensing aircraft UAVs navigation satellites GIS existing maps other spatial data sensor networks ground based sensors digital earth citizens as sensors social, cultural data

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science So, what sort of data future are we facing? Physical data (optical spectral, radar, thermal, lidar, ) from large satellites aircraft and UAVs satellite clusters ground based sensors Cultural, heritage and other forms of social data Citizen data over social networks that may be of questionable accuracy and precision ephemeral convergent over many citizens (crowd sourcing)

Thus, the emerging data environment is now like social media input heritage, cultural, and social layers physical layers analysis earth observation satellite and ground sensor inputs (the sensor web) clever servers, broadband internet social media data the challenge is to analyse this mixed data to provide meaningful decision support to the resource manager

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science Decision support in the digital earth framework TWO OBSERVATIONS 1. Physical sensors generate datathat is analysedto provide information. That information allows the human analyst to acquire knowledge about the problem domain. 2. Human (citizen) input to the digital earth model is generally in the form of knowledge(or facts) already.

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science Thus, knowledge propagation in the digital earth framework might look like: the GIS analysis problem sensor data analysis information human assimilation usable knowledge citizen input integrity checking, etc. usable knowledge knowledge fusion social network analysis

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science How are facts processed into knowledge? data type 1 (e.g. spectral) analytical process 1 labels data type 2 (e.g. radar) analytical process 2 labels knowledge processor (several candidates are available) product This is what we can do in GIS

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science How are facts processed into knowledge? data type 1 (e.g. spectral) analytical process 1 facts data type 2 (e.g. radar) cultural information analytical process 2 facts facts knowledge processor product citizen information facts can add to the digital earth archive

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science When does social network knowledge become useful? citizens C 1 C 2 C n social convergence and error correction useful knowledge (set of facts) citizen observations process could be deterministic: e.g. through issuing a call for help opportunistic: e.g. by using tourist photographs spontaneous: e.g. unsolicited input there is a whole body of research on social media, when messages stick and go viral, and what the major influences are this is still an open research topic, at the intersection of computer and social science

Now explore the spatial data domain a bit further, because the changing nature of data supply may impact on the way that GIS evolves. How much data are we now talking about? or will be soon!

kilobyte kb 10 3 1000 1024 megabyte MB 10 6 1000 2 1024 2 gigabyte GB 10 9 1000 3 1024 3 terabyte TB 10 12 1000 4 1024 4 petabyte PB 10 15 1000 5 1024 5 exabyte EB 10 18 1000 6 1024 6 zettabyte ZB 10 21 1000 7 1024 7 estimated storage capacity of the human brain ~ 2.5PB October 2012 Internet Archive ~ 10PB (190 TB per month)

(courtesy of GuoHuadong, Director, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences)

(courtesy of GuoHuadong, Director, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences)

The digital universe is a measure of all the data created, replicated and consumed in a single year 1,000,000,000GB Source: IDC's Digital Universe Study Executive Summary, December 2012 (courtesy of GuoHuadong, Director, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences)

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science In summary, what is the overall impact on GIS? apartfrom the virtual globe as a map base, the impact is largely through the data available, its increasing volume, and the fact that some data is ephemeral, and sometimes of questionable accuracy, particularly that delivered over social networks imagery maps town plans cadastral data networks citizen generated data retrieval and processing software decision support product how can we process this to provide more meaningful decision support, particularly in the face of a massive increase in data volume?

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science Look at technological considerations Big data implies, among other likely considerations, Large storage High processing power Effective knowledge extraction

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science STORAGE OF BIG DATA 4.00 Digital Universe absolute scales are relative; it is the trends that are important log scale 3.00 2.00 Hard drive capacity growth in memory capacity is greater than the growth of the digital universe storage capacity will not be a problem 1.00 2000 2005 2010 2015 2020 but access and smart archiving may be Based on: en.wikipedia.org/wiki/file:hard_drive_capacity_over_time.svgand J. Gantzand D. Reinsel, The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East, IDC IVIEW, December 2012

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science price USD/MB 1000 100 10 1 10 COST OF STORAGE the biggest technology impact on society will be the falling cost of computer memory price cents/mb 1 0.1 0.01 0.001 $10/TB L. Free, Sydney, late 1970s 1980 1985 1990 1995 2000 2005 2010 2015 Based on: hblok.net/blog/posts/2013/02/13/historical-cost-of-computer-memory-and-storage/

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science EVOLUTION OF COMPUTING POWER after Hans Moravec, Carnegie Mellon See www.mocom2020.com/2009/05/evolution-of-computer-capacity-and-costs/

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science kilobyte megabyte kb 10 3 1000 Big Earth Data Challenges MB 10 6 1000 2 1024 1024 2 Generally, storage and processing power should not be GB 10 9 1000 limitations. 3 gigabyte petabyte PB 10 15 1024 3 terabyte TB 10 12 1000 4 1024 4 Knowledge extraction may be; what approaches are available to us? 1000 5 1024 5 exabyte EB 10 18 1000 6 1024 6 zettabyte ZB 10 21 1000 7 1024 7 See www.mocom2020.com/2009/05/evolution-of-computer-capacity-and-costs/

Consider human eye information processing 0.0128deg ~500Mpixel over 90deg x 90deg field of view ~1 million colours ~3 bytes per pixel, one for each colourprimary Can easily cope with a frame rate of about 20 per second, although not all pixels are viewed on each frame, but just a subset in the region of the field of view currently being looked at. If all pixels in a scene were assimilated then the processing rate, at a conservative maximum, could be 30GB per second. Let s use a much lower rate of 0.3GB per second. Then every hour about 1TB is processed, giving 10TB in a 10 hour day per person. This is big data! How is it processed? Not all is stored!

Take a citizen science example observes an event, such as a broken levee bank May take about a second to assimilate and cross check ~ 0.3GB Message passed to authorities, say 200 words (1200 characters at 8B per character) ~ 10kB This represents a compression of about 30,000! This is the result of the human reasoning system s ability to undertake complex analysis and to reduce large data volumes to important facts

Human Reasoning Can make complex, high level decisions; spatial reasoning is easy. Whiledata rates might be high, decision making is slow. Isprone to inconsistency and errors, especially when tired. Machine Processing High level reasoning ability is poor by comparison at this time; spatial reasoning is challenging. Very high processing speeds. Error free processing,in principle, with no fatigue. Best of both worlds would be achieved if we can emulate human reasoning by machine (artificial intelligence). General AI might be challenging, but applications-specific AI is not. This might be a pointer to the challenge of handling big data.

A simple, long-standing AIapproach, that works well in remote sensing and GIS, is the expert system based on production rules It can be used and can also be used to combine data types in a GIS

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science a classical data reduction exercise big data stores numerical process OR facts cultural information AI process facts facts knowledge processor product citizen information facts expert systems is this such a challenge?

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science So what might the GIS of the big data future look like? big data stores data data suppliers data AI interface data users data facts facts software suppliers software knowledge broker client citizen input facts web (cloud) knowledge infrastructure with a virtual globe basis facts data data user value added products an agent, skilled in turning client requirements into a set of specifications for data analysts, and skilled in combining knowledge

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science Decision support in the digital earth framework The advantage of the digital earth model is the diversity of data types, and the flexibility of data representation. Data and knowledge come from physical sensors, archives and citizens. They can be physical, social, cultural, heritage, visual, non-visual, with differing degrees of reliability, and available at different times and places. It is even envisaged that sound (bird calls?), smell and other attributes, with time, might be added.can anything be more complex? Further complexity is added by the rapidly increasing and massive amounts of data, particularly earth imaging data. Emulating human reasoning may be the way to extract relevant knowledge.

Will the Digital Earth Paradigm Change Digital GIS? Earth: Big Data, Heritage and Social Science