Novel Analysis of Air Pollution Sources and Trends using openair Tools



Similar documents
measured (empirical) data from CCC. The modelled values are of value for all (ecosystem specific modelling work deposition)

Openair on the SAQD. Dr Scott Hamilton - AEA

UK Air Quality Monitoring Networks

Data Analysis and Validation Support for PM2.5 Chemical Speciation Networks- #82

PART 1. Representations of atmospheric phenomena

Preliminary Estimate Distribution of Exide s Lead Emissions in Soil

CERTIFIED MULESOFT DEVELOPER EXAM. Preparation Guide

The Webcast will begin at 1:00pm EST.

Microsoft SPLA Reporting for Exchange Best Practices. Hosting Controller Whitepaper

A WEB-BASED APPLICATION FOR REAL-TIME GIS

4 Decades of Belgian Marine Monitoring. presented by Karien De Cauwer, RBINS, Belgian Marine Data Centre

Distribution of Chemical Elements In Urban Sediments in Slovenia (Extended Abstract)

Environmental Data Services for Delaware:

Monsoon Variability and Extreme Weather Events

Data Migration from Magento 1 to Magento 2 Including ParadoxLabs Authorize.Net CIM Plugin Last Updated Jan 4, 2016

The laboratory fulfils the requirements for periodic emission measurement according to ČSN P CEN/TS 15675:2009

Multi-elemental determination of gasoline using Agilent 5100 ICP-OES with oxygen injection and a temperature controlled spray chamber

Data processing goes big

Data Mining Algorithms Part 1. Dejan Sarka

Jozef Matula. Visualisation Team Leader IBL Software Engineering. 13 th ECMWF MetOps Workshop, 31 th Oct - 4 th Nov 2011, Reading, United Kingdom

Integrating Big Data into the Computing Curricula

B I N G O B I N G O. Hf Cd Na Nb Lr. I Fl Fr Mo Si. Ho Bi Ce Eu Ac. Md Co P Pa Tc. Uut Rh K N. Sb At Md H. Bh Cm H Bi Es. Mo Uus Lu P F.

Analysing Questionnaires using Minitab (for SPSS queries contact -)

SPAMMING BOTNETS: SIGNATURES AND CHARACTERISTICS

How To Use Vista Data Vision

Axibase Time Series Database

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7

WMO Climate Database Management System Evaluation Criteria

Sampling and preparation of heterogeneous waste fuels?

MI oceanographic data

REVIEW QUESTIONS Chapter 8

ManageEngine Exchange Reporter Plus :: Help Documentation WELCOME TO EXCHANGE REPORTER PLUS... 4 GETTING STARTED... 7 DASHBOARD VIEW...

Christmas. National Meteorological Library and Archive Fact sheet 5 White Christmas. (version 01)

VX Search File Search Solution. VX Search FILE SEARCH SOLUTION. User Manual. Version 8.2. Jan Flexense Ltd.

Gravir Outer, Isle of Lewis Site and Hydrographic survey report

WINDOWS AZURE DATA MANAGEMENT

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Rapid Thermophilic Digestion Technology

The distribution of marine OpenData via distributed data networks and Web APIs. The example of ERDDAP, the message broker and data mediator from NOAA

STATEMENT OF WORK. NETL Cooperative Agreement DE-FC26-02NT41476

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

ECN Spec. Sheet. Knowledge Marketing s Integrated Marketing Platform Delivers: Customer Lifecycle Marketing. Centralized Data Technology

Development of an Integrated Data Product for Hawaii Climate

Scholar: Elaina R. Barta. NOAA Mission Goal: Climate Adaptation and Mitigation

Comparison of PM10 and SO 2 Concentrations in the Cities Located at the Mediterranean Coast of Turkey

CALL FOR OFFER. Győr Distillery Co. Ltd. (H-9027 Győr, Budai u. 7.) hereinafter referred to as Investor calls for offers in subject matter below

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

Technical. Overview. ~ a ~ irods version 4.x

A TRAFFIC INFORMATION SYSTEM BY MEANS OF REAL-TIME FLOATING-CAR DATA. Ralf-Peter Schäfer, Kai-Uwe Thiessenhusen, Peter Wagner

An Introduction to Cloud Computing Concepts

ADAGUC & PyTROLL. Maarten Plieger Ernst de Vreede. Application of polar orbiter products in weather forecasting Using open source tools and standards

R Tools Evaluation. A review by Global BI / Local & Regional Capabilities. Telefónica CCDO May 2015

Utilizing KolibriMFG Software System to Schedule and Control Shop Floor

For more detailed information, see your Vantage Vue Console manual.

The IPCC Special Report on Managing the Risks of Extreme Events and Disasters to Advance Climate Change Adaptation

Summary Report on National and Regional Projects set-up in Russian Federation to integrate different Ground-based Observing Systems

How to read your Oil Analysis Report

Estimating Firn Emissivity, from 1994 to1998, at the Ski Hi Automatic Weather Station on the West Antarctic Ice Sheet Using Passive Microwave Data

A Performance Evaluation of Open Source Graph Databases. Robert McColl David Ediger Jason Poovey Dan Campbell David A. Bader

Software improves alarm management and KPI reporting at UK-based power station

Sisense. Product Highlights.

Maximising the value of pxrf data

Aspose.Cells Product Family

Short-Term Forecasting in Retail Energy Markets

Glossary of terms used in the survey

Steven M. Ho!and. Department of Geology, University of Georgia, Athens, GA

*This view is a sample image of electricity consumption.

ELECTRON CONFIGURATION (SHORT FORM) # of electrons in the subshell. valence electrons Valence electrons have the largest value for "n"!

D où vient le plomb? :

There are a number of different methods that can be used to carry out a cluster analysis; these methods can be classified as follows:

Hong Kong Observatory Summer Placement Programme 2015

OpenText Actuate Big Data Analytics 5.2

Cacti complete network graphing solution. Oz Melamed E&M Computing Jun 2009

Transcription:

Novel Analysis of Air Pollution Sources and Trends using openair Tools David Carslaw 8 th October 2015

2 Briefly What is openair and why was it developed? What can openair do? Some examples of recent developments Trajectory modelling Future developments

3 The importance of measurements High quality measurements are the cornerstone of effective air quality management There are 100s of continuous monitoring sites across the UK which represent a significant capital and on-going investment ( 10Ms) But If we only compare annual means for compliance reasons we are massively under-exploiting the value of these measurements And No dedicated analysis tools to do more with data

4 The grand challenge How to extract meaning from this?

5 The openair project Started in 2008 with 3-year NERC funding with additional support from Defra Aim: to make innovative open-source data analysis tools freely available to the air quality community Sub aim: As much as possible, no programming knowledge required by users Use software called R Often thought of as statistical software But it is really a programming language specifically designed for data analysis Usage and capabilities continue to grow at a rapid rate (> 7,000 packages ) openair is one of these R packages dedicated to the analysis of air quality data

6 R software that can do many things Excellent for access to many data formats csv, txt, Excel, binary files, databases (e.g. SQL Server, MySQL, Postgres ), XML, JSON, web scraping, NetCDF, Did I mention > 7000 packages? Almost endless possibilities Excellent code sharing on Github (think Facebook for computer code) Growing capability for reproducible reporting / research and interactive web-based rich content document

Overview of openair Many capabilities Source detection and characterisation Robust trend analysis Local and regional cluster analysis Back trajectory analysis Model evaluation Training possibilities Widely used* *Downloaded >28,000 times via RStudio, 1,500 to 2,000 times a month, top 7% of all R packages Ricardo Energy & Environment in Confidence 7

8 Conditioning plots central theme Almost all openair functions have a type option that allows data to be partitioned in flexible ways Built-in types include "year" splits data by year "month" splits variables by month of the year "season" splits variables by season. "weekday" splits variables by day of the week "weekend" splits variables by Saturday, Sunday, weekday "daylight" splits variables by nighttime/daytime. "wd" if wind direction (wd) will split the data up into 8 sectors: N, NE, E, SE, S, SW, W, NW.

9 Multispecies correlations + clustering Can be hard to get a feel for what s going on with multiple measurements Can look at correlations Apply hierarchical clustering to group things that are most similar to one another corplot(metals, dendrogram = TRUE) Cu As Ni Mn Fe V Cr Hg Zn Cd Pb Days.elapsed -3 35 22 20 9 24 33 8 17 13 36 100 5 27 19 15 7 14 28 13 20 7 100 36-2 16 14 22 7 43 20 21 21 100 7 13-11 42 48 65 39 58 67 93 100 21 20 17-10 35 38 55 38 51 57 100 93 21 13 8-2 49 42 49 43 50 100 57 67 20 28 33-10 20 20 35 31 100 50 51 58 43 14 24-7 34 21 33 100 31 43 38 39 7 7 9-4 50 78 100 33 35 49 55 65 22 15 20 0 56 100 78 21 20 42 38 48 14 19 22 4 100 56 50 34 20 49 35 42 16 27 35 100 4 0-4 -7-10 -2-10 -11-2 5-3 Days.elapsed Pb Cd Zn Hg Cr V Fe Mn Ni As Cu

10 Source characterisation with bivariate polar plots Scunthorpe is an interesting example Very complex local sources of multiple pollutants from integrated steelworks Major but much more distant sources e.g. Drax power station (#2 on the map, ~35 km from measurement site) How to un-pick the contributions made by these very different sources? Examples of what can be done with 3 simple variables: concentration, wind speed and wind direction

11 Source characterisation with bivariate polar plots Example at Scunthorpe Town Steelworks to the east Just considering concentration by wind direction only tells us about the direction of major sources Considering the joint wind speeddirection variation says much more about the nature of the emission sources

12 New developments with bivariate polar plots Consider intervals of concentration and look at the probability of finding such concentrations for specific wind speed-directions Can scan full range of concentration intervals Sources tend to occupy distinct ranges in concentration Can reveal many otherwise hidden sources Uria-Tellaetxe, I. and D. C. Carslaw (2014). Source identification using a conditional bivariate probability function. Environmental Modelling & Software, Vol. 59, 1-9.

13 Polar plots the movie! When scanning concentration intervals can convert to an animation Makes it easier to see source signatures emerge and then disappear

14 Regional impacts and data analysis Many techniques such as polar plots are most informative at the local scale At the regional scale back trajectories can be very useful NOAA make available a global model called Hysplit that can generate back trajectories openair makes available pre-calculated 96-hour back trajectories at specified locations, run every 3-hours of each year Can be used in many different ways to inform air quality data analysis

15 Regional impacts and data analysis Easy to import data And show trajectory frequencies Many different map projections available and can be used anywhere in the World All EMEP sites in Europe available soon (contributed by the University of Edinburgh, Chris Malley)

16 Regional impacts and data analysis Straightforward to link back trajectories to pollutant concentrations Useful for analysing specific pollution episodes in detail Having access to more data e.g. sulphate/nitrate or PM composition data in general can be very useful

17 Regional impacts and data analysis Can estimate source emission locations Plot is the estimated important source regions for PM 2.5 at the North Kensington site

18 Origin of highest PM 10 concentrations at Narberth (2014) Associate back trajectories with concentration Lots of trajectories over many different paths can give indication of most important source origins

19 Where does the air come from? Models are available such as Hysplit (from NOAA) that allow back trajectories to be calculated Openair has for a long time allowed the analysis of back trajectories Developments in progress provide forecast back trajectories Run daily to give past few and next few days Example based on Port Talbot

20 Future directions Make much more data available Better integration with European measurements + interest from the US EPA Better web integration Plotting on maps more effectively Removing the influence of meteorology Understanding air pollution would be much easier if we had the same weather every day! Pervasive sensors will become increasingly important What new analysis opportunities will these bring? Measurements on the move e.g. personal exposure Going beyond plotting colours on a map Training courses

21 Thank you for your attention Word cloud of the openair manual, generated in R of course!

David Carslaw david.carslaw@ricardo.com