Using mobile phone data to map human population distribution




Pierre Deville, Vincent D. Blondel (Université catholique de Louvain, Belgium)
Andrew J. Tatem (University of Southampton, UK)
Marius Gilbert (Université Libre de Bruxelles, Belgium)
Samuel Martin (Université de Lorraine, CNRS, France)
Catherine Linard (Biological Control & Spatial Ecology, Université Libre de Bruxelles)
Andrea Gaughan, Forrest Stevens (University of Louisville, USA)

GIScience 2014 - Eighth International Conference on Geographic Information Science
Vienna, September 23-26, 2014

Gridded population datasets: the options

Dataset  | Coverage                    | Spatial resolution     | Year(s) represented
GPW v3   | Global                      | 2.5 arcminutes (~5 km) | 1990, 1995, 2000, 2005, 2010, 2015
GRUMP    | Global                      | 30 arcseconds (~1 km)  | 1990, 1995, 2000
UNEP     | Africa, Asia, South America | 2.5 arcminutes (~5 km) | 2000
LandScan | Global                      | 30 arcseconds (~1 km)  | 1998-2012
WorldPop | Africa, Asia, South America | 3 arcseconds (~100 m)  | 2010, 2015, 2020

Intro to gridded population data
- Census data linked to GIS administrative boundaries
- Ancillary data, e.g. settlements, roads
- Spatial modelling rules to disaggregate census counts
- Estimates of number of people in each grid cell
[Figure panels: Population Counts; Population Density (pph)]
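The disaggregation step listed above can be illustrated with a minimal sketch. It assumes the simplest possible modelling rule, a proportional split of one unit's census count by per-cell weights. The function name `disaggregate` and the weight values are hypothetical; the slides do not specify the actual rules.

```python
def disaggregate(census_count, weights):
    """Split one administrative unit's census count across its grid cells
    in proportion to non-negative ancillary weights (hypothetical inputs)."""
    if not weights:
        raise ValueError("the unit must contain at least one grid cell")
    total = sum(weights)
    if total <= 0:
        # No ancillary signal at all: fall back to an even split.
        return [census_count / len(weights)] * len(weights)
    return [census_count * w / total for w in weights]


# One unit with 1,200 people and four grid cells. The weights might be
# derived from settlement or road layers (illustrative values only).
cells = disaggregate(1200, [0.0, 1.0, 3.0, 2.0])
print(cells)
```

Because the weights are normalised within the unit, the cell estimates always sum back to the census count, which is the property that makes the result consistent with the input census data.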

Population distribution in Africa in 2010
[Figure: panels A, B and C. Source: Linard et al. (2012) PLoS ONE]
How can we better inform on temporal changes?

Mobile phone usage data
- User makes a call from location X
- User travels to Y and makes a call
- Call routed through nearest tower
- Network operator records time and tower of call for billing

Penetration rates (2013)
- Global: 96%
- Developed countries: 128%
- Developing countries: 89%

Objectives
- Use mobile phone (MP) data to map the spatio-temporal distribution of human population over large spatial scales
- Develop a method that:
  - Is easy to implement
  - Minimizes the impact of phone usage heterogeneities
  - Preserves users' privacy

Mobile phone towers in France
- ~17,000 towers
- May-October 2007
- > 1 billion calls
- 17 million users

MP call density by admin. unit
- Number of calls aggregated by tower
- Coverage area approximated using Voronoi polygons
- Density of calls estimated for each administrative unit
[Figure: census data 2007, population density (people/km²); legend classes: < 10, 10-50, 51-100, 101-500, 501-1,000, 1,001-5,000, 5,001-10,000, > 10,000]
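One way to go from tower-level call counts to a call density per administrative unit is area-weighted allocation. This is a sketch under the assumption that calls are spread uniformly over each tower's Voronoi cell. The slides do not say how cells that straddle unit boundaries were handled. The overlap areas are assumed to be precomputed with a GIS tool, and all names and numbers are hypothetical.

```python
from collections import defaultdict


def call_density_by_unit(tower_calls, tower_area, overlaps, unit_area):
    """Return calls per km² for each administrative unit.

    tower_calls: {tower: number of calls}
    tower_area:  {tower: area of the tower's Voronoi cell, km²}
    overlaps:    {(tower, unit): area of the cell lying inside the unit, km²}
    unit_area:   {unit: area of the administrative unit, km²}
    """
    calls_in_unit = defaultdict(float)
    for (tower, unit), area in overlaps.items():
        # Assumption: calls are uniformly distributed within the Voronoi cell,
        # so a unit receives calls in proportion to the overlapping area.
        calls_in_unit[unit] += tower_calls[tower] * area / tower_area[tower]
    return {unit: calls_in_unit[unit] / unit_area[unit] for unit in unit_area}


# Two towers and two units (illustrative values only).
tower_calls = {"A": 1000, "B": 600}
tower_area = {"A": 10.0, "B": 30.0}
overlaps = {("A", "U1"): 10.0, ("B", "U1"): 10.0, ("B", "U2"): 20.0}
unit_area = {"U1": 20.0, "U2": 20.0}

density = call_density_by_unit(tower_calls, tower_area, overlaps, unit_area)
print(density)
```

Tower B's cell lies one third in U1 and two thirds in U2, so its 600 calls are split 200 and 400 before dividing by each unit's area.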

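Calibration
$\rho_c = \alpha \, \sigma_c^{\beta}$
- $\rho_c$: population density in admin. unit c
- $\sigma_c$: phone call density in admin. unit c
- α and β fitted by a linear regression on the log-transformed relation $\log \rho_c = \log \alpha + \beta \log \sigma_c$
- α = scale ratio
- β = superlinear effect of population density
- Adjustment of population estimates using national population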
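The calibration can be sketched as an ordinary least squares fit in log-log space followed by a rescaling to the national total. This assumes the power-law form ρ_c = α σ_c^β, with ρ_c the population density and σ_c the call density of unit c. The function names and the synthetic data are hypothetical, and units with zero call density would have to be excluded before taking logarithms.

```python
import math


def fit_power_law(call_density, pop_density):
    """Fit log(rho) = log(alpha) + beta * log(sigma) by least squares."""
    xs = [math.log(s) for s in call_density]
    ys = [math.log(r) for r in pop_density]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    beta = sxy / sxx
    alpha = math.exp(my - beta * mx)
    return alpha, beta


def estimate_population(call_density, areas, alpha, beta, national_total):
    """Predict unit populations, then rescale so they sum to the national total."""
    raw = [alpha * s ** beta * a for s, a in zip(call_density, areas)]
    scale = national_total / sum(raw)
    return [p * scale for p in raw]


# Synthetic training data generated with alpha = 2 and beta = 0.8.
sigma = [1.0, 10.0, 100.0, 1000.0]
rho = [2.0 * s ** 0.8 for s in sigma]
alpha, beta = fit_power_law(sigma, rho)

areas = [5.0, 5.0, 5.0, 5.0]  # km², illustrative
population = estimate_population(sigma, areas, alpha, beta, 1_000_000.0)
print(alpha, beta, population)
```

The fit recovers the generating parameters here only because the synthetic data lie exactly on the curve; real data would scatter around it.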

Training data
- ~1,000 communes (ADM-5)
- 2 sampling procedures: random and spatial
- 1,000 bootstraps
[Figure: spatially stratified random sampling]
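The two ingredients named above, bootstrapping and stratified sampling, can be sketched with the standard library. This is an illustration under assumptions, not the authors' procedure: the stratum key stands in for whatever spatial strata were used, the percentile interval is one common choice, and the data are synthetic.

```python
import math
import random


def loglog_slope(pairs):
    """OLS slope of log(pop density) on log(call density); None if degenerate."""
    xs = [math.log(s) for s, _ in pairs]
    ys = [math.log(r) for _, r in pairs]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx < 1e-12:
        return None  # all resampled call densities are (almost) identical
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx


def bootstrap_beta(pairs, n_boot=1000, seed=0):
    """Mean and 95% percentile interval of beta over bootstrap resamples."""
    rng = random.Random(seed)
    betas = []
    while len(betas) < n_boot:
        resample = [rng.choice(pairs) for _ in pairs]  # sample with replacement
        beta = loglog_slope(resample)
        if beta is not None:
            betas.append(beta)
    betas.sort()
    lo = betas[n_boot * 25 // 1000]
    hi = betas[n_boot * 975 // 1000 - 1]
    return sum(betas) / n_boot, lo, hi


def stratified_sample(units, stratum_of, per_stratum, seed=0):
    """Draw up to per_stratum units at random from every stratum."""
    rng = random.Random(seed)
    by_stratum = {}
    for unit in units:
        by_stratum.setdefault(stratum_of(unit), []).append(unit)
    sample = []
    for members in by_stratum.values():
        sample.extend(rng.sample(members, min(per_stratum, len(members))))
    return sample


# Synthetic (call density, population density) pairs around beta = 0.8,
# with a deterministic +/-10% multiplicative disturbance.
pairs = [(float(i), 2.0 * i ** 0.8 * (1.1 if i % 2 == 0 else 1 / 1.1))
         for i in range(1, 41)]
mean_beta, lo, hi = bootstrap_beta(pairs)

# Hypothetical strata: unit id modulo 4 stands in for a spatial zone.
chosen = stratified_sample(list(range(20)), lambda u: u % 4, per_stratum=2)
print(mean_beta, lo, hi, chosen)
```

Stratifying by space before sampling keeps sparsely populated zones in the training set, which a purely random draw weighted towards numerous small communes might under-represent.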

Coefficient estimates

[Figure panels: Census-derived population density; MP method; WorldPop method]

Relative error

Method   | RMSE | COR
MP       | 517  | 0.85
WorldPop | 539  | 0.9
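For reference, the two summary statistics reported above can be computed as follows. This sketch uses the textbook definitions of root mean square error and Pearson correlation. The relative error formula, (estimate - observed) / observed, is an assumption, since the slide does not define it, and the input values are made up.

```python
import math


def rmse(observed, estimated):
    """Root mean square error between observed and estimated values."""
    n = len(observed)
    return math.sqrt(sum((e - o) ** 2 for o, e in zip(observed, estimated)) / n)


def pearson(xs, ys):
    """Pearson correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def relative_error(observed, estimated):
    """Signed relative error per unit (assumed definition)."""
    return [(e - o) / o for o, e in zip(observed, estimated)]


# Illustrative densities for four units (not from the study).
census = [1.0, 2.0, 3.0, 4.0]
model = [1.0, 2.0, 3.0, 6.0]
print(rmse(census, model), pearson(census, model), relative_error(census, model))
```

RMSE is dominated by the densest units, whereas correlation measures how well the ranking and linear trend are reproduced, so the two measures need not favour the same method.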

Seasonal movements
Relative difference in pop. density between holidays and working periods
[Figure: map with labelled locations: Asterix Park, CDG Airport, Disneyland, Versailles, Brest, Rennes, Nantes]

Weekly movements
Relative difference in pop. density between weekends and weekdays
[Figure: map with labelled locations: Asterix Park, CDG Airport, Disneyland, Versailles, Brest, Rennes, Nantes]
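The seasonal and weekly maps both show a relative difference between two periods. A minimal sketch follows, assuming the difference is taken relative to the second (reference) period, which the slides do not state. The unit names and densities are hypothetical.

```python
def relative_difference(density_a, density_b):
    """Return (a - b) / b per unit, using period b as the reference.

    Units missing from either period, or with zero reference density,
    are skipped because the ratio is undefined for them.
    """
    return {
        unit: (density_a[unit] - ref) / ref
        for unit, ref in density_b.items()
        if ref > 0 and unit in density_a
    }


# Estimated densities (people/km²) in two periods, illustrative values only.
holidays = {"coastal_unit": 300.0, "business_district": 800.0, "new_unit": 10.0}
working = {"coastal_unit": 200.0, "business_district": 1000.0}

change = relative_difference(holidays, working)
print(change)
```

A positive value marks a unit that gains population during holidays, a negative value one that loses it; the same function applies unchanged to weekends versus weekdays.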

Extrapolation ability
- Stability of β within and between countries
[Figure panels: Random sampling; Spatial sampling]

Extrapolation ability
- Stability of β within and between countries
- Sensitivity analysis of pop. estimates to α and β
[Figure label: +15%]
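A sensitivity analysis of this kind can be sketched by perturbing one parameter at a time and comparing the adjusted estimates. The sketch assumes the power-law form ρ_c = α σ_c^β followed by rescaling to the national total. The 15% perturbation echoes the "+15%" label on the slide, whose exact meaning there is an assumption, and all data are synthetic.

```python
def adjusted_estimates(call_density, areas, alpha, beta, national_total):
    """Power-law prediction per unit, rescaled to match the national total."""
    raw = [alpha * s ** beta * a for s, a in zip(call_density, areas)]
    scale = national_total / sum(raw)
    return [p * scale for p in raw]


def max_relative_change(base, perturbed):
    """Largest absolute relative change across units."""
    return max(abs(p - b) / b for b, p in zip(base, perturbed))


# Synthetic call densities (calls/km²) and unit areas (km²).
sigma = [1.0, 10.0, 100.0, 1000.0]
areas = [50.0, 20.0, 10.0, 2.0]
alpha, beta, total = 2.0, 0.8, 1_000_000.0

base = adjusted_estimates(sigma, areas, alpha, beta, total)
alpha_up = adjusted_estimates(sigma, areas, alpha * 1.15, beta, total)
beta_up = adjusted_estimates(sigma, areas, alpha, beta * 1.15, total)

change_alpha = max_relative_change(base, alpha_up)
change_beta = max_relative_change(base, beta_up)
print(change_alpha, change_beta)
```

In this sketch α multiplies every unit equally, so it cancels once the estimates are rescaled to the national population and only β changes the spatial distribution. Whether the same holds for the authors' analysis depends on how their adjustment was applied.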

Conclusions

Mobile phone method            | WorldPop method
Dynamic                        | Static
Higher accuracy in urban areas | Higher accuracy in rural areas
Very simple aggregated data    | Many input data required
Easy to implement              | More complex implementation

Combine both methods?

Conclusions
MP call activities can be used to produce spatially and temporally explicit estimates of population densities across countries and their changes over multiple timescales, while preserving the anonymity of individual users.

Limitations:
- Density of calls vs. density of users
- Daily-aggregated data
- Variations in phone usage behaviours not taken into account

Partnerships between governments and phone companies could enable fast and cheap production of accurate maps of population distribution for every country in the world for every month.

Thank you!
E-mail: linard.catherine@gmail.com

Reference
Deville P., Linard C., et al. (2014) Dynamic population mapping using mobile phone data. Proc Natl Acad Sci:201408439.
www.worldpop.org.uk