Big Data for Development: What May Determine Success or failure?



Similar documents
Is big data the new oil fuelling development?

BIG DATA FOR DEVELOPMENT: A PRIMER

UN Global Pulse: Harnessing Big Data for a Revolution in Sustainable Development and Humanitarian Action Robert Kirkpatrick

BIG DATA FUNDAMENTALS

BIG DATA AND THE 2030 AGENDA FOR SUSTAINABLE DEVELOPMENT

Data analytics Delivering intelligence in the moment

Data Scientist for 1h

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Getting the Most Out of SIEM. Presentation Title. Data in Big Data. Presented By: Dr. Char Sample, CERT

Big Data. Fast Forward. Putting data to productive use

Big Data, Official Statistics and Social Science Research: Emerging Data Challenges

PDF PREVIEW EMERGING TECHNOLOGIES. Applying Technologies for Social Media Data Analysis

How Big Data is Different

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

The Intersection of Big Data, Data Science, and The Internet of Things

Big Data and New Paradigms in Information Management. Vladimir Videnovic Institute for Information Management

CSC590: Selected Topics BIG DATA & DATA MINING. Lecture 2 Feb 12, 2014 Dr. Esam A. Alwagait

SOCIAL MEDIA MONITORING AND SENTIMENT ANALYSIS SYSTEM

How To Create A Data Science System

Collaborations between Official Statistics and Academia in the Era of Big Data

Statistical Challenges with Big Data in Management Science

Protecting Resilience through Social Media Data - A White Paper

Accelerate your Big Data Strategy. Execute faster with Capgemini and Cloudera s Enterprise Data Hub Accelerator

SOCIAL MEDIA LISTENING AND ANALYSIS Spring 2014

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

Exploiting Data at Rest and Data in Motion with a Big Data Platform

S o c i a l B u s i n e s s F r a m e w o r k : U s i n g P e o p l e a s a P l a t f o r m t o E n a b l e T r a n s f o r m a t i o n

Text and data analytics for social network mining

Machine Data Analytics with Sumo Logic

Marathon Information Management Program

Expressing a complex world in terms you can understand

Big Data for Social Good. Nuria Oliver, PhD Scientific Director User, Data and Media Intelligence Telefonica Research

The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg

MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS November 7, Machine Learning Group

Some Economics of Cultural PSI: the Micro Perspective

Unlocking the Intelligence in. Big Data. Ron Kasabian General Manager Big Data Solutions Intel Corporation

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

BIG DATA. - How big data transforms our world. Kim Escherich Executive Innovation Architect, IBM Global Business Services

2013 BIG DATA OPPORTUNITIES SURVEY

Introduction to Engineering Using Robotics Experiments Lecture 17 Big Data

SOCIAL MEDIA LISTENING AND ANALYSIS Spring 2014

Management Information and big data in Insurance

The 4 Pillars of Technosoft s Big Data Practice

BIG DATA : Big Opportunity or Big Threat for Official Statistics?* Jose Ramon G. Albert, Ph.D. Secretary General, NSCB jrg.albert@nscb.gov.

The Challenges of Geospatial Analytics in the Era of Big Data

Is Big Data a Big Deal? What Big Data Does to Science

Big Data Analytics and its Impact on Supply Chain Management

Voice. listen, understand and respond. enherent. wish, choice, or opinion. openly or formally expressed. May Merriam Webster.

W3C USING OPEN DATA WORKSHOP VISUAL ANALYTICS FOR POLICY-MAKING OPPORTUNITIES AND RESEARCH CHALLENGES Francesco Mureddu, Tech4i2.

Annex: Concept Note. Big Data for Policy, Development and Official Statistics New York, 22 February 2013

BIG DATA AND ANALYTICS

BIG DATA: BIG BOOST TO BIG TECH

Demystifying Big Data Government Agencies & The Big Data Phenomenon

Top 10 Predictive Use Cases and Customer Case Studies

12/7/2015. Data Science Master s programs

CRM as a Service. For Customers in the Cloud

Economic and Social Council

T r a n s f o r m i ng Manufacturing w ith the I n t e r n e t o f Things

Five Steps for Succeeding with Social Media and Delivering an Enhanced Customer Experience

ORACLE SOCIAL ENGAGEMENT AND MONITORING CLOUD SERVICE

EO Data by using SAP HANA Spatial Hinnerk Gildhoff, Head of HANA Spatial, SAP Satellite Masters Conference 21 th October 2015 Public

Using Big Data Analytics to

ENABLING REAL-TIME BUSINESS WITH SAP HANA IN THE CLOUD

HOW TO DO A SMART DATA PROJECT

Tweets as big data. Rob Procter and Alex Voss. andrews.ac.

Data Mining with Qualitative and Quantitative Data

Big Data (and official statistics) *

Human mobility and displacement tracking

Software: Driving Innovation for Engineered Products. Page

Graphing Made Easy for Project Management

The New Age of Smart Decision Making

Developing a social media strategy. The Road Ahead

Social Media Implementations

Sources: Summary Data is exploding in volume, variety and velocity timely

Delivering new insights and value to consumer products companies through big data

Beyond Watson: The Business Implications of Big Data

The Promise of Industrial Big Data

NoSQL for SQL Professionals William McKnight

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence

An interdisciplinary model for analytics education

The Six Critical Considerations of Social Media Threat Intelligence

CORRALLING THE WILD, WILD WEST OF SOCIAL MEDIA INTELLIGENCE

Social Media Issues - How to Manage and Prevent Brand Censorship

Information-Driven Transformation in Retail with the Enterprise Data Hub Accelerator

Whitepaper. Position Your Business for Big Data Success: 5 Steps to Create Sustained Value to the Bottom Line NVP. NewVantage Partners

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

Big Data for Development

MES and Industrial Internet

Big Data a threat or a chance?

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

The Internet of Things

The Scientific Data Mining Process

IBM Software Hadoop in the cloud

WHAT DOES BIG DATA MEAN FOR OFFICIAL STATISTICS?

Digital Marketing to the Digital Self: Winning with Social & Predictive Marketing

Roadmapping Discussion Summary. Social Media and Linked Data for Emergency Response

Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014

Transcription:

Big Data for Development: What May Determine Success or failure? Emmanuel Letouzé letouze@unglobalpulse.org OECD Technology Foresight 2012 Paris, October 22

Swimming in Ocean of data Data deluge Algorithms Drowning in Digital smoke signals Pattern recognition Petabytes Digital footprints Correlations Data exhaust Digital signatures Proxy indicators Unstructured data Sentiment analysis Digital crumbs Noise Physical sensors

Background and Question(s) 1. Complex, hyperconnected, volatile world. Shocks, risks, vulnerability, resilience, agility, have become mainstream. Policies need to be more agile. 2. Big Data. Big excitement. Big skepticism. New Industrial Revolution? New oil that needs to be refined? Big enough data speak for themselves? Big Data, Big Deal? Worse: a dangerous path? (think conflict settings)? Can it help / change development how? What is the opportunity, what are the challenges? A better question is: What will determine whether Big Data for Development succeeds or fails?

Big Data for Development=Success Enables Proximate determinants Requires 1. Right Intent 2. Right Capacities Enables Underlying drivers Recognizing/communicating around: 1. Potential and applications 2. Challenges and Risks 3. Necessary requirements Requires

From Big data Big Data for development + purpose (intent) + computing tools, systems (capacity) It s relative It s contextual

Big data for development: Global Pulse taxonomy Open Web Data Web content such as news media and social media interactions (e.g. blogs, Twitter), news articles obituaries, e- commerce, job postings; sensor of human intent, sentiments, perceptions, and want. Data Exhaust passively collected transactional data from people s use of digital services like mobile phones, purchases, web searches, etc., these digital services create networked sensors of human behavior; Citizen reporting information actively produced or submitted by citizens through mobile phone-based surveys, hotlines, user-generated maps, etc; Physical Sensors satellite or infrared imagery of changing landscapes, traffic patterns, light emissions, urban development and topographic changes, etc; remote sensing of changes in human activity

Potential applications according to Global Pulse Early warning: early detection of anomalies in how populations use digital devices and services can enable faster response in times of crisis ; Digital awareness: Big Data can paint a fine-grained and current representation of reality which can inform the design and targeting of programs and policies; Real-time feedback: monitor a population in real time makes it possible to understand where policies and programs are failing and make adjustments.

Intent Capacity Early warning digital smoke signals -syndromic surveillance; anomaly detection, extreme values.. Enables Requires Digital awareness Digital signatures, patternsunderstanding human ecosystems

Ex: Conflict prevention (highly simplified) Operational prevention, i.e. conflict early warning and response systems. Detect and respond to anomalies Structural prevention; addressing structural drivers of conflict (e.g., poverty, inequality, elite capture). understand the ecosystem through digital patterns/signatures

Examples Early warning, syndromic surveillance and proxies= pick up / model anomalies Twitter-based vs. Official Influenza Rate in the U.S. «You Are What You Tweet: Analyzing Twitter for Public Health. M. J. Paul and M. Dredze, 2011. http://www.cs.jhu.edu/%7empaul/files/2011.icwsm.twitter_health.pdf

Modeling & predicting food riots http://necsi.edu/research/social/food_crises.pdf

Finding proxy indicators (Global Pulse and SAS project) Tweets about the price of rice vs. actual price of rice in Indonesia

Tweets per day about meals during Ramadan Start of Ramadan End of Ramadan

Digital awareness = understand ecosystem Tea Party on Twitter Occupy Wall Street on Twitter The Tea Party appears as a far more tight-knit group (..) whereas Occupy is made up of looser clusters with a few high-profile accounts. In the lower right-hand corner = matrix of "isolates:" People who are talking about the ideas of Occupy or the Tea Party, but ( ) don't have connections to others on the graph. For Occupy, the number of isolates is greater, which (..) could indicate a larger potential for growth and stronger brand cachet. Source: M. Smith cited in FP What if we could do the same thing for militias, rebel groups not today, but in N years, where N=5, 10, 15..?

Mood analysis = understand ecosystem + detect anomalies

Summing up potential / applications i. digital signatures that help better understand human ecosystems ii. digital smoke signals for anomaly detection may be especially relevant in risky/poor places high rates of growth of tech penetration with typically poor / little more traditional data But also gives a sense of some of the risks and challenges posed by Big Data for Development

Recognizing the difficulties and requirements

1. Data challenges Data: How do we access data and protect their producers? Challenge #1: Availability: Depends on technological penetration and use. But mobile phone use is increasing fast and supports/will support Internet access. Challenge #2: Reliability. Attempts at playing the system(s), plus suppressing signals? Especially prevalent in conflict settings.

1. Data challenges Challenges #3: Access. Not all data produced in easily accessible, storable (e.g. Twitter). This is compounded in poor countries. How to access data from a mobile phone carrier? Challenge #4: Privacy. Because it s accessible does not make it ethical or safe. True in all settings. In conflict/crisis settings, the privacy challenge may soon become a security risk.

2. Analytical challenges Challenge #1: Arrogance/naivety. Believing that the truth is in the (big) data, that the (big enough) data speak for themselves, that all it takes is to beep digging and mining to unveil the truth, that more is always better etc. Apophenia seeing patterns where none exists. If you torture the data long enough, it will eventually start talking. S&P index and butter production in Bangladesh? (Leinweber, 2007)

2. Analytical challenges Challenge #2: Making sense of the data.. Not having the right computing capacities. Unstructured data. Sentiment analysis. Translation. Sample bias. No understanding of context.

3. Operational/systemic challenges Challenge #1: Legal. Data privacy? Intellectual property? Challenge #2: changing systems, policies, frameworks, decision making processes

2. Risks Risk #1: Security and privacy of individual and communities may be jeopardize dark side of Big Data. Conflict zones! Risk #2: Creating new digital divides, between the Small and the Big world, as well as within them, between communities. Who has access, who has the knowledge? Those who can answer the questions are often those who ask the questions. Risk #3: Respond foolishly. i.e. in a way that may exacerbate tensions, poverty..

What to avoid Vertical down top down a-theoritical no questions asked a-contextual automated, autarchic, unresponsive to the recommendations of practitioners and social scientists

Desirable principles and policies

Right intent, sound principles: Big Data is a tool that can help answer QUESTIONS start from questions Contextualization is key especially when lives are in the line. Rely on local insights to make sense of the data. Responsibility: do not harm. Privacy and security: privacy preserving data analytics Build on existing systems and knowledge, e.g. thinking around strategic peacebuilding Think incremental, complementary, iterative and long term

Right Capacities Policies and systems: Bring together data and social scientists, private sector and public sector, traditional statisticians and Big Data people Set up Small World-Big World interdisciplinary facilities and agreements to ask questions, test and exchange, draw lessons Create work flows as feedback loops Engage in debates on privacy, security, responsibility, design legal frameworks

Research on big data sample bias correction exists You are where you E-mail: Using E-mail Data to Estimate International Migration Rates llustrative relationship between the correction factor for selection bias and Internet penetration rates for different choices of the parameter k.

Where may this take us? Lessons from the Technology Sector

Agile Global Development?

2000 Traditional planning-monitoring policymaking cycle Policymaking/De velopment cycle

2020 Responsive policymaking cycle?? Big Data Policymaking/De velopment cycle!

2020 Agile policymaking cycle?? All Data? All Data? All Data!!!

10 years ahead? Data streams will continue to have grown in volume, velocity and variety. It is very unlikely that Big Data won t be playing a significant role in Development. Question is: what do we want to achieve and how? Big Data for Development should start small, and grow.

Thanks