Hybrid Data Assimilation in the GSI



Similar documents
Hybrid-DA in NWP. Experience at the Met Office and elsewhere. GODAE OceanView DA Task Team. Andrew Lorenc, Met Office, Exeter.

Testing and Evaluation of GSI-Hybrid Data Assimilation and Its Applications for HWRF at the Developmental Testbed Center

Advances in data assimilation techniques

A Hybrid ETKF 3DVAR Data Assimilation Scheme for the WRF Model. Part II: Real Observation Experiments

An Integrated Ensemble/Variational Hybrid Data Assimilation System

A Comparison of the Hybrid and EnSRF Analysis Schemes in the Presence of Model Errors due to Unresolved Scales

Assimilation of cloudy infrared satellite observations: The Met Office perspective

A Description of the 2nd Generation NOAA Global Ensemble Reforecast Data Set

Estimation of satellite observations bias correction for limited area model

IMPACT OF SAINT LOUIS UNIVERSITY-AMERENUE QUANTUM WEATHER PROJECT MESONET DATA ON WRF-ARW FORECASTS

Breeding and predictability in coupled Lorenz models. E. Kalnay, M. Peña, S.-C. Yang and M. Cai

The Wind Integration National Dataset (WIND) toolkit

Real-time Ocean Forecasting Needs at NCEP National Weather Service

Next generation models at MeteoSwiss: communication challenges

A Real Case Study of Using Cloud Analysis in Grid-point Statistical Interpolation Analysis and Advanced Research WRF Forecast System

How To Model An Ac Cloud

Status of ALADIN-LAEF, Impact of Clustering on Initial Perturbations of ALADIN-LAEF

ECMWF Aerosol and Cloud Detection Software. User Guide. version /01/2015. Reima Eresmaa ECMWF

Analysis of Bayesian Dynamic Linear Models

The STAR Algorithm Integration Team (AIT) Research to Operations Process

NOAA Big Data Project. David Michaud Acting Director, Office of Central Processing Office Monday, August 3, 2015

System Identification for Acoustic Comms.:

NOMADS. Jordan Alpert, Jun Wang NCEP/NWS. Jordan C. Alpert where the nation s climate and weather services begin

Applications to Data Smoothing and Image Processing I

Simple Linear Regression Inference

Data Assimilation and Operational Oceanography: The Mercator experience

All-sky assimilation of microwave imager observations sensitive to water vapour, cloud and rain

Part 2: Analysis of Relationship Between Two Variables

Big Data Techniques Applied to Very Short-term Wind Power Forecasting

Annealing Techniques for Data Integration

SOLAR IRRADIANCE FORECASTING, BENCHMARKING of DIFFERENT TECHNIQUES and APPLICATIONS of ENERGY METEOROLOGY

Cloud/Hydrometeor Initialization in the 20-km RUC Using GOES Data

Partnership to Improve Solar Power Forecasting

P164 Tomographic Velocity Model Building Using Iterative Eigendecomposition

6.9 A NEW APPROACH TO FIRE WEATHER FORECASTING AT THE TULSA WFO. Sarah J. Taylor* and Eric D. Howieson NOAA/National Weather Service Tulsa, Oklahoma

NOWCASTING OF PRECIPITATION Isztar Zawadzki* McGill University, Montreal, Canada

Very High Resolution Arctic System Reanalysis for

National Dam Safety Program Technical Seminar #22. When is Flood Inundation Mapping Not Applicable for Forecasting

Linköping University Electronic Press

[ Climate Data Collection and Forecasting Element ] An Advanced Monitoring Network In Support of the FloodER Program

MICROPHYSICS COMPLEXITY EFFECTS ON STORM EVOLUTION AND ELECTRIFICATION

Testing for Granger causality between stock prices and economic growth

The Influence of Airborne Doppler Radar Data Quality on Numerical Simulations of a Tropical Cyclone

Java Modules for Time Series Analysis

Limitations of Equilibrium Or: What if τ LS τ adj?

Lecture 3. Turbulent fluxes and TKE budgets (Garratt, Ch 2)

Part 4 fitting with energy loss and multiple scattering non gaussian uncertainties outliers

PTE505: Inverse Modeling for Subsurface Flow Data Integration (3 Units)

A system of direct radiation forecasting based on numerical weather predictions, satellite image and machine learning.

PART 1. Representations of atmospheric phenomena

Performance Analysis and Application of Ensemble Air Quality Forecast System in Shanghai

Atmospheric kinetic energy spectra from high-resolution GEM models

Develop a Hybrid Coordinate Ocean Model with Data Assimilation Capabilities

A new approach for dynamic optimization of water flooding problems

Turbulent mixing in clouds latent heat and cloud microphysics effects

Status of HPC infrastructure and NWP operation in JMA

Benefits accruing from GRUAN

Altuğ Aksoy Current Affiliation: Academic Background. Current Research Interests and Activities

A Direct Numerical Method for Observability Analysis

Parameterization of Cumulus Convective Cloud Systems in Mesoscale Forecast Models

Linear Threshold Units

Preliminary summary of the 2015 NEWS- e realtime forecast experiment

Description of zero-buoyancy entraining plume model

Understanding and Applying Kalman Filtering

Comparing TIGGE multi-model forecasts with. reforecast-calibrated ECMWF ensemble forecasts

QUALITY ENGINEERING PROGRAM

Comparative Evaluation of High Resolution Numerical Weather Prediction Models COSMO-WRF

Developing Continuous SCM/CRM Forcing Using NWP Products Constrained by ARM Observations

DESWAT project (Destructive Water Abatement and Control of Water Disasters)

11. Time series and dynamic linear models

Univariate and Multivariate Methods PEARSON. Addison Wesley

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

Application of Numerical Weather Prediction Models for Drought Monitoring. Gregor Gregorič Jožef Roškar Environmental Agency of Slovenia

Distributed Computing. Mark Govett Global Systems Division

Development of new hybrid geoid model for Japan, GSIGEO2011. Basara MIYAHARA, Tokuro KODAMA, Yuki KUROISHI

CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.

Improved diagnosis of low-level cloud from MSG SEVIRI data for assimilation into Met Office limited area models

Transcription:

Hybrid Data Assimilation in the GSI Rahul Mahajan NOAA / NWS / NCEP / EMC IMSG GSI Hybrid DA Team: Daryl Kleist (UMD), Jeff Whitaker (NOAA/ESRL), John Derber (EMC), Dave Parrish (EMC), Xuguang Wang (OU) 14 August 2015 DTC GSI Workshop, Boulder

Incremental Variational DA J( δx)= 1 2 δx T 1 B s δx + 1 2 ( H k M k δx d k ) T R 1 k H k M k δx d k J : Penalty = Fit to background + Fit to observations δx : Analysis increment (x a x b ) ; where x b is a background B s : (Static) Background error covariance, estimated a priori / offline H : Linearized observation (forward) operator M : Tangent-linear of the forecast model R : Observation error covariance (Instrument + representativeness) K k=1 ( ) d : Innovation y h(x b ), where y are the observations Cost function (J) is minimized to find solution, δx ; x a = x b + δx

Why is B e necessary? Introduces flow-dependence/errors of the day to analysis increments Provides multivariate correlations from dynamic model - quite difficult to incorporate this into fixed error covariance models Sparse observations near coherent dynamical features are utilized more effectively. Evolves with system, can capture changes in the observing network through evolving background error covariances More information extracted from the observations => better analysis => better forecasts

How the EnKF may achieve its improvement relative to 3D-Var: better background-error covariances B s B e Courtesy: Jeff Whitaker

What does B e gain us? Surface pressure observation near an atmospheric river Background surface pressure (white contours) and precipitable water increment (red contours) after assimilating a single surface pressure observation (yellow dot) using B e 3DVar increment would be zero! (cross-covariances are hard to model with a static B s ) Courtesy: Jeff Whitaker

More examples of flow-dependent background-error covariances Courtesy: Jeff Whitaker

Hybrid Data Assimilation J( δx)= 1 2 δx T ( β s B s + β e B e ) 1 δx + J o Simply put, linear combination of static and ensemble based B: B s : Static background error covariance B e : Ensemble estimated background error covariance β s : Weighting factor for static contribution β e : Weighting factor for ensemble contribution (typically 1- β s ) This does not address the need or lack of the TL/Adj. in 4D

Hybrid Methods Nomenclature Hybrid: Variational methods that combine static and ensemble covariances. EnVar: Variational methods using ensemble covariances Hybrid 4DVar: Variational method using a combination of static and ensemble covariances at the beginning of the window, but using a tangent-linear and adjoint model Hybrid 4DEnVar: Variational method using a combination of static and ensemble covariances at the all times in the window, without the need of a tangent-linear or adjoint

GSI Hybrid 3DEnVar (ignoring preconditioning for simplicity) Incorporate ensemble perturbations directly into variational cost function through extended control variable method Lorenc (2003), Buehner (2005), Wang et. al. (2007), etc. Other methods exist, e.g. Buehner et. al. (2011), Liu et. al (2008) 1 J( δx s,α)=β s 2 δx T 1 1 s B s δx s + β e 2 M m=1 α m T L 1 α m + 1 ( 2 Hδx d t ) T R 1 ( Hδx t d) M e δx t =δx s +Τ α m! x m = δx s +Τ α m! X m X m=1 δx t : (total increment) sum of increment from static δx s and ensemble parts x m e ( ) α m : extended control variable; : ensemble perturbations L: correlation matrix [effectively the localization of ensemble perturbations] T: operator mapping from ensemble grid to analysis grid (for dual-resolution) M m=1

Preconditioning Sidebar ν m = β e L 1 α m z = β s B s 1 δx s J( z,ν )= 1 2 δx T s z + 1 2 M m=1 α mt ν m +J o δx s = β s 1 Bz α = β e 1 Lν In preconditioned conjugate gradient minimization, inverses of B and L not needed and the solution is pre-conditioned by full B. This formulation differs from the UKMO and CMC, who use a square root formulation and apply the weights directly to the increment: δx t =δx s +Τ M m=1 α m! x m e

Single Temperature Observation 3DVAR β s =0.0 11 β s =0.5

So what s the catch? Need an (fairly large) ensemble to accurately represent the uncertainty in the background forecast Fairly large implies O(50-100) for NWP type applications smaller ensemble sizes lead to larger sampling errors more weight will be given to the static larger ensembles have increased computational burden An update algorithm is needed for the ensemble perturbations. NCEP operations applies a Ensemble Kalman Filter (EnKF) EnKF in itself is a standalone DA system that updates the mean and perturbations with new observations. Google Ensemble Data Assimilation for an excellent review article by Tom Hamill

Dual- Res Coupled Hybrid Var/EnKF Cycling T254 L64 member 1 forecast member 2 forecast member 3 forecast Generate new ensemble perturba6ons given the latest set of observa6ons and first- guess ensemble EnKF member update recenter analysis ensemble member 1 analysis member 2 analysis member 3 analysis Ensemble contribu6on to background error covariance Replace the EnKF ensemble mean analysis and inflate T574 L64 high res forecast GSI Hybrid Ens/Var high res analysis Previous Cycle Current Update Cycle

What if I am not running an EnKF? In principle any ensemble can be used; so long as the ensemble represents the forecast errors well. GSI can ingest GFS global ensemble to update regional models WRF ARW/NMM NAM RR HWRF 80 member GFS/EnKF 6h ensemble forecasts are archived at NCEP since May 2012. Real time ensemble is also publicly available.

Ensemble s of EnsVar no EnKF! Much too expensive right now!

GSI Hybrid Configuration Hybrid related parameters controlled via GSI namelist &hybrid_ensemble Logical to turn on/off hybrid ensemble option (l_hyb_ens) Ensemble size (n_ens), resolution (jcap_ens, nlat_ens, nlon_ens) Source of ensemble: GFS spectral, native model, etc. (regional_ensemble_option) Weighting factor for static contribution to increment (beta1_inv) Horizontal and vertical distances for localization, via L on augmented control variable (s_ens_h, s_ens_v) Localization distances are the same for all variables since operating on α. i.e. no variable localization Option to specify different localization distances as a function of vertical level (readin_localization) Instead of single parameters, read in ascii text file containing a value for each layer Example for global in fix directory (global_hybens_locinfo.l64.txt) Other regional options related to resolution, pseudo ensemble, etc. Other 4D options related to binning, thinning of observations in the assimilation window.

Localization Simple example Courtesy: Jeff Whitaker

Localization Real world example

Hybrid 4DEnVar [No Adjoint] The cost function can be easily expanded to include a static contribution 1 J( δx s,α)=β s 2 δx T 1 1 s B s δx s + β e 2 M 1 2 K k=1 α T m L 1 α m + Where the 4D increment is prescribed exclusively through linear combinations of the 4D ensemble perturbations and a static contribution Here, the static contribution is considered time-invariant (i.e. from 3DVAR-FGAT). Hybrid weighting parameters (β) exist just as in the other hybrid variants. Also, the ensemble perturbation weights (α) are assumed to be the same. M m=1 ( H k δx t,k d k ) T R 1 k H k δx t,k d k ( ) ( ) δx t,k =δx s +Τ α m! x e m,k = δx s +Τ α m! X m,k X k m=1 M m=1 ( )

Single Observation (-3h) Example for 4D Variants 4DVar 4DEnVar H-4DVar β s =0.25 H-4DEnVar β s =0.25

Time Evolution of Increment t = -3h Solution at beginning of window same to within round-off (because observation is taken at that time, and same weighting parameters used) t = 0h Evolution of increment qualitatively similar between dynamic and ensemble specification t = +3h ** Current linear and adjoint models in GSI are computationally impractical for use in 4DVAR other than simple single observation testing at low resolution H-4DVar H-4DEnVar

3DVar / H3DEnVar / H4DEnVar H4DEnsVar 3DVar Move from 3D Hybrid (current operations) to Hybrid 4D-EnVar yields improvement that is about 75% in amplitude in comparison from going to 3D Hybrid from 3DVAR.

Summary The hybrid ensemble GSI system uses an ensemble of first-guess forecasts to better estimate the background-error covariance term in the cost function. More information can be extracted from observations Added expense mostly IO Added complexity running (and updating) an ensemble. maintaining an additional DA component (EnKF), alternative EVIL More tuning is necessary that depend on model, resolution, observing network localization length scales ensemble and static weights Ensemble (co)variances must be representative of control forecast error. Current operations is H3DEnVar, with an upgrade to H4DEnVar coming in Q2FY16.

Some Relevant References Buehner, M., P. L. Houtekamer, C. Charette, H. L. Mitchell, B. He, 2010a: Intercomparison of variational data assimilation and the ensemble Kalman filter for global deterministic NWP. Part I: Description and single-observation experiments. Mon. Wea. Rev., 138, 1550-1566. Buehner, M., P. L. Houtekamer, C. Charette, H. L. Mitchell, B. He, 2010b: Intercomparison of variational data assimilation and the ensemble Kalman filter for global deterministic NWP. Part II: One-month experiments with real observations. Mon. Wea. Rev., 138, 1567-1586. Kleist, D. T., and K. Ide, 2013: An OSSE-based Evaluation of Hybrid Variational-Ensemble Data Assimilation for the NCEP GFS, Part I: System description and 3D hybrid results. Mon. Wea. Rev. Kleist, D. T., and K. Ide, 2013: An OSSE-based Evaluation of Hybrid Variational-Ensemble Data Assimilation for the NCEP GFS, Part II: 4D EnVar and Hybrid Variants. Mon. Wea. Rev. Liu C., Q. Xiao, and B. Wang, 2008: An ensemble-based four dimensional variational data assimilation scheme. Part I: technique formulation and preliminary test. Mon. Wea. Rev., 136, 3363-3373 Wang, X., C. Snyder, and T. M. Hamill, 2007a: On the theoretical equivalence of differently proposed ensemble/3d-var hybrid analysis schemes. Mon. Wea. Rev., 135, 222-227. Wang, X., D. Parrish, D. Kleist, J. Whitaker, 2013: GSI 3DVar-Based Ensemble Variational Hybrid Data Assimilation for NCEP Global Forecast System: Single-Resolution Experiments. Mon. Wea. Rev., 141, 4098 4117.