Quality Assurance for Hydrometric Network Data as a Basis for Integrated River Basin Management



Similar documents
Quality assurance for hydrometric network data as a basis for integrated river basin management

WISKI. Quality Assurance for Hydrometric Network Data as a Basis for Integrated River Basin Management. 11 th July 2005

Robert Gál. KISTERS AG, Charlottenburger Allee 5, Aachen, Germany;

City of Calgary KISTERS KISTERS Overview Overview. KISTERS Software Solutions Water, Energy and Environmental Data May 19, 2015

DANGER indicates that death or severe personal injury will result if proper precautions are not taken.

Software Development

CLIDATA In Ostrava 18/06/2013

Ovation Operator Workstation for Microsoft Windows Operating System Data Sheet

The software solution to complete your flow monitoring program

DATA STORAGE, ACCESS AND DISSEMINATION [HOMS G06]

Radiological Assessment Display and Control System

Whisky & Soda vs. WISKI & SODA. Eastern CA Collaborative Sharing WISKI with your Neighbours 1

Data Management Implementation Plan

Dream Report vs MS SQL Reporting. 10 Key Advantages for Dream Report

Data quality assurance for Basin-scale real-time water quality management

WebSphere Business Monitor

Final Report - HydrometDB Belize s Climatic Database Management System. Executive Summary

Lavastorm Resolution Center 2.2 Release Frequently Asked Questions

tools that make every developer a quality expert

Designing a Windows Server 2008 Applications Infrastructure

Electronic Document Workflow Platform for KBA Customers

Sensor data management software, requirements and considerations. Don Henshaw H.J. Andrews Experimental Forest

TSM Studio Server User Guide

Product Data Sheet.

STATEMENT OF WORK. NETL Cooperative Agreement DE-FC26-02NT41476

SeeSOR.NET. Quality Management Approach and Technology of SeeSOR.NET.

Excel Companion. (Profit Embedded PHD) User's Guide

Comparison Document. SupportCenter Plus Comparison Documents 1

DiskBoss. File & Disk Manager. Version 2.0. Dec Flexense Ltd. info@flexense.com. File Integrity Monitor

B.Sc (Computer Science) Database Management Systems UNIT-V

MDS. Measured Data Server Online Measurement Network. Properties and Benefits »»» »»»» ProduCt information

The Requirements Compliance Matrix columns are defined as follows:

ArcSight Express Administration and Operations Course

EnterpriseLink Benefits

SPPA-D3000 Plant Monitor Technical Description

ManageEngine SupportCenter Plus 7.7 Edition Comparison

WMO Climate Database Management System Evaluation Criteria

Implementing Project Server 2010

PM Fine Database and Associated Visualization Tool

DiskPulse DISK CHANGE MONITOR

Declaration of Conformity 21 CFR Part 11 SIMATIC WinCC flexible 2007

Data Centre Efficiency Management Concurrent Thinking Appliances

MS Access Queries for Database Quality Control for Time Series

INTELLECT TM Software Package

Subject: Request for Information (RFI) Franchise Tax Board (FTB) Security Information and Event Management (SIEM) Project.

How to carry out primary validation for rainfall data

EcOS (Economic Outlook Suite)

Real Time Flood Alert System (RTFAS) for Puerto Rico

Frequently Asked Questions. Secure Log Manager. Last Update: 6/25/ Barfield Road Atlanta, GA Tel: Fax:

GEOSPATIAL DIGITAL ASSET MANAGEMENT A SOLUTION INTEGRATING IMAGERY AND GIS WHERE WILL ALL THE PIXELS GO?(AND HOW WILL WE EVER FIND THEM?

U.S. FDA Title 21 CFR Part 11 Compliance Assessment of SAP Records Management

Power Log PC Application Software

IFS-8000 V2.0 INFORMATION FUSION SYSTEM

PROMIS Tutorial MODBUS Data Collection System

Analyst 1.6 Software. Laboratory Director s Guide

Ensure Continuous Availability and Real-Time Monitoring of your HP NonStop Systems with ESQ Automated Operator

How To Use Vista Data Vision

Enterprise Historian Information Security and Accessibility

The development of an information modelling system for regional water resource assessments

Presents. WITSML Solutions For Your Business

Semaphore T BOX Applications in Data Center Facilities

ProView. Remote monitoring and administration for self-service networks. Copyright Wincor Nixdorf International GmbH

PROFESSIONAL SERVICES

Figure 1.1 The Sandveld area and the Verlorenvlei Catchment - 2 -

uturenet & its Document Imaging Services

Technical Specification. Solutions created by knowledge and needs

City of Toronto Wet Weather Flow Monitoring Network: Baseline Conditions

EUROPASS DIPLOMA SUPPLEMENT

Entry of Historical SW Data Proposed Approach Contents

TOPdesk Professional. Service Management Simplified

Managing and Maintaining Windows Server 2008 Servers

Product Overview. DSL Xpert Advantages. Flexible Configuration Options. User-Friendly PC-Controlled GUI. Testing of ADSL, ADSL2 and ADSL2+

ADVANCED EXCEL FOR ENERGY EFFICIENCY PROFESSIONALS

GelAnalyzer 2010 User s manual. Contents


Eastern Caribbean Open-Source Geospatial Data Sharing and Management Workshop

S m a r t M a s t e B T E C O R P O R A T I O N USER MANUAL

Network Event Viewer now supports real-time monitoring enabling system administrators to be notified immediately when critical events are logged.

Antelope Enterprise. Electronic Documents Management System and Workflow Engine

FaciliWorks 8i. Product Features. Features in blue are specific to FaciliWorks 8i Enterprise. Flexible Installation Options

Data Management Model. Potential Component of the Thames River Water Management Plan

Industrial HiVision Software

Papermule Workflow. Workflow and Asset Management Software. Papermule Ltd

Real Time Monitor. A Real-Time Windows Operator Interface. DDE Compliant. (for remote data display)

automates system administration for homogeneous and heterogeneous networks

Christoph Schlenzig 1

Safer Solutions for Safer Hospitals. Electronic Medical Records One System with Multiple Solutions! Anesthesia EMR. Perfusion ECMO EMR EMR

WinCC. Configuration Manual. Manual Volume 3

Capacity Management PinkVERIFY

This release also incorporates new features which improve manageability for system administrators and usability for contributors.

Introducing IBM Tivoli Configuration Manager

Synergy Controller Cloud Storage Features and Benefits

An Internet-based Groundwater Information System

Data Quality Assurance and Control Methods for Weather Observing Networks. Cindy Luttrell University of Oklahoma Oklahoma Mesonet

Building & Developing the Environmental

NNMi120 Network Node Manager i Software 9.x Essentials

Guidelines on Quality Control Procedures for Data from Automatic Weather Stations

Industrial IT Ó Melody Composer

29200 Northwestern Hwy Suite 350 Southfield, MI WINSPC winspc.com

Seeing the Value of Real Time Monitoring

Transcription:

Quality Assurance for Hydrometric Network Data as a Basis for Integrated River Basin Management FRANK SCHLAEGER 1, MICHAEL NATSCHKE 1 & DANIEL WITHAM 2 1 Kisters AG, Charlottenburger Allee 5, 52068 Aachen, Germany frank.schlaeger@kisters.de 2 Kisters AG, 28 Denton Road, Ilkley, West Yorkshire, LS29 0AA, United Kingdom- Abstract Hydrological network data is increasingly demanded by hydrologists, engineers, decision makers and the general public. This means for hydrologists, that they have to process the raw data into quality controlled information in an efficient way. The hydrological information system WISKI offers a broad range of standardized quality assurance procedures to fulfil these requirements. Key words Data quality assurance, hydrometric network, information system INTRODUCTION The demand for information about different measured hydrometric parameters such as precipitation, water level or flow is increasing in quantity and quality significantly. This is regarding the use in hydrological modelling, water management techniques and public information. Today the level of information technology available represents an ideal base for hydrological information systems optimized for acquisition, data evaluation and dissemination of verified hydrological and ecological data sets. Consequently, the demand for tools for quality assurance during the whole process is on the increase. For the successful quality assurance of hydrometric network data, work at different levels such as hydrometric standards, workflows, software and data security is needed. In addition to the data value itself, the user must be supplied with information about the data quality and the processing history of data values. Most of these requirements can be complied by a hydrometric information system set up in a relational database environment. Typical elements such as time series plots, tables, reports, GIS functionality should be available as a user interface. For high acceptance Windows- 1

compatibility and office integration is desirable. All data quality information should always be available. The WISKI software package is a modern tool for hydrological data management. WISKI is a Windows based client/server system based on relational databases, designed in close cooperation with water agencies, authorities, engineers and hydrologists (fig.1). It combines modern standards of data management with advanced tools to collect, edit, store and present time series data to intranet, internet and GIS users. The system allows the automatic and effortless flow of time series data from the measuring site into the database. This data is then reviewed using a powerful graphical or tabular graphical user interface. Furthermore, a detailed quality flag sub-system has been integrated to present the data together with its quality. This quality flag system deals with the data quality on two levels. A generic primary quality flag summaries the data quality of data sets with standard flags, like good, suspect or missing. A more detailed secondary flag gives additional specialized information to the user. These secondary flags inform the user about the editing process, abnormalities during calculation or modelling. It is configurable for the different processing methods. During the aggregation of instantaneous data to daily, monthly or annual data, not only are the values aggregated, but the quality flags are also aggregated. Consequently, for every aggregated value, information is available about the quality of the input data. 2

Fig. 1 Schema of the WISKI System THE SOFTWARE PACKAGE Data acquisition and integration In order to support a more integrated approach to water management, a modern hydrological information system has to acquire and store all types of data from a wide range of parameters. Not only the commonly used parameters such as water stage and discharge, but also, for example climate parameters to allow analysis within the same system. Typical data input sources are remote data acquisition from field data loggers, import of third party data via input files in different formats, read out of field devices, digitisation from graphical charts or manual input using the interactive time series editor. The import process should be automated as much as possible to free staff resources from manually entering data. The WISKI data acquisition server allows the automatic collection of data from field data loggers. The Import Server can scan directories and import data files, which are stored directly into the database. When the user digitises graphical charts, the digitised data is imported automatically afterwards, without user intervention. 3

Data storage Due to the complexity of data modelling, WISKI stores measured data in relational database systems (RDBMS). As historical data plays a fundamental role in hydrology, one of the greatest challenges is to achieve a good performance when dealing with large amounts of measured data; this is a key objective of the developers of WISKI. Another fundamental aspect is to allow multiple users to work on the same system simultaneously. They can benefit from working on the same data but at the same preventing users from editing the same data at the same time. The data model of WISKI was developed to deal with large amounts of measured data. Therefore it has all the necessary functionality to manage time series on such a level. Another important topic is time series classification. One of the major differences found, is the distinction between equidistant data (e.g. 15minute values) and non-equidistant data (e.g. rainfall event data). Data processing The data stored in the WISKI database can be accessed through the Graphical User Interface (GUI) of the WISKI application. The WISKI application is based on 32-bit technology and runs on Windows platforms. Therefore, WISKI s toolbox has a modern interactive graphical time series editor, to allow graphical editing of time series. Following the data flow through the system, WISKI delivers the following processes and functionalities: DATA VALIDATION Identifying abnormalities in the data is normally the first step of the validation process. Due to malfunctioning of field sensors and other devices, or due to the maintenance of those devices, measured data is likely to have a range of different quality that has to be corrected. To help the user locate data of poor quality, WISKI has auto validation routines. These routines 4

validate the data using predefined criteria and place remarks for user notification each time one of the criteria is violated. Examples of these quality controls are identification of gaps, maximum/minimum exceeding of thresholds, differences to neighbored stations or other user defined formulas and rules. Data manipulation For editing purposes, WISKI offers efficient data editing functionality using an interactive graph or table to allow graphical editing using the computer mouse and keyboard. Examples of functions and methods to correct errors in time series are the filling of gaps with interactive linear or spline interpolation, adjusted data from neighbouring stations or regressions or e.g. vertical or horizontal stretching of the trace. Thereby WISKI allows data quality flags to be assigned to each data value. This will identify the primary quality of Good, Suspect, Estimated, Unchecked or Missing; a secondary flag will provide additional information about the quality, for example edited. Standard comments from a pick list and free text comments added by users can be used provide additional information about the data (fig. 2). WISKI keeps a record of who changed the data, for which period and writes the name of the applied method automatically for the predefined editing functions listed above. The original data is not edited; instead a production time series is defined which stores a copy of the original values, which can be validated. Working within a multi-user environment also implies being able to regulate access to the data by defining roles with corresponding access rights. This allows the hierarchy of the organisation to be represented by the information system. 5

Fig. 2 Interactive Data Editing in the WISKI Graph Rating curves WISKI has a fully integrated rating curve editor. Based on discharge measurements, rating curves can be managed without having to leave the main application. After validation of the stage data all necessary functionality is available to perform the complex task of defining the stage/discharge relationship of a river. The discharge is calculated automatically, or can be triggered by opening the corresponding time series in a graph or table. Fig. 3 SKED, the rating curve editor 6

Derived and aggregated values While validation takes place on high resolution time series, WISKI calculates the derived aggregated time series for long-term analysis of the state of the environment. The daily, monthly and yearly values are derived from the high resolution time series and are calculated automatically. In combination with built-in reports, the user can rapidly generate these reports without exporting data to external publishing software. An internal mechanism makes sure that each time the underlying high resolution time series are modified, the derived aggregated time series are updated automatically. This mechanism, prevents a user from publishing or exporting out of date data, and prevents them from having to keep a record of when to update derived time series. Data analysis For advanced analysis of measured data, WISKI offers statistical analysis tools such as linear and non-linear regression analysis, statistical analysis of durations, floods and low floods, double mass plots, rain storm frequency analysis (fig.4). The WISKI statistical analysis tools were developed in close collaboration with hydrologists, and are based on national and international standards, such as the United States Geological Survey, the World Meteorological Organization guidelines and ISO standards. 7

Fig. 4 WISKI Duration Curve Analysis Data dissemination WISKI has an open data model. This means that it is possible to access the data through special views so that users can execute queries using other commercial software such as Crystal Reports and Business Objects, or even their own software to generate reports. WISKI provides standard reports that can be adapted to an organisation s requirements. Today, the Internet has become one of the key gateways for exchanging information. For this purpose a web module has been developed for WISKI for disseminating data on the Internet/Intranet using a web browser as a Graphical User Interface. Moreover, WISKI supports fully automated services such as the sending of data files via FTP (file transfer protocol) or email. QUALITY ASSURANCE Today's demand on hydrological network data by hydrologists, engineers, decision makers and the public is increasing in quantity, quality and time to delivery. For us, this means, that we have to process the raw data to value added products and information in an efficient way. Standardized quality assurance procedures can help to fulfill this requirement. For best results such tools are needed in the whole chain: 8

measurement data acquisition validation and correction dissemination use The hydrological workbench WISKI supports this process with different tools and functions. The quality assurance on the following levels is available: - Data level - Workflow level - System level - IT level The quality assurance (QA) on these levels is described in the following chapters. QA on data level The quality assurance on data level can be ensured by: - Securing the raw data before processing - Standardized procedures for data validation, correction and evaluation - Use of national and international standards or agreed organization wide standards - Continuous quality marking of the data by users or automatically by the system - Use of audit trail tools for logging and evaluating data changes The raw data is secured in WISKI by saving it in a so-called original time series. These time series are write- protected. They can only be changed by importing new data. To correct any errors in the raw data the original time series is copied into a so-called production time series. The production time series is the source for any other derived time series (e.g. daily mean data). To validate, correct and evaluate data a broad range of different tools are provided. To each time series a set of so-called plausibility checks can be assigned to. These plausibility 9

checks can be defined for a particular time range or season. The following list shows a selection of supported plausibility checks: - boundary check - completeness check - gradient check - distance check - etc. Furthermore, specific checks for tidal or rainfall data are available. For defining very specific checks, which depend on particular boundary conditions, the user can define his own plausibility checks by applying the WISKI macro scripting language KiBasic. The plausibility check results are shown in the WISKI graph and in the WISKI table and will be summarized in a plausibility check table. The calculation methods, which are applied for time series derivation and statistical analysis, are based on national and international standards. For example the German Pegelvorschrift, the English Flood Estimation Handbook (FEH) or EC ISO standards are supported. An important step to assure the data quality is the WISKI flagging and remark concept. Each time series value gets a quality flag assigned to. Three levels of quality flags are supported, primary, secondary and tertiary quality flags (Table 1). Additionally remarks and tasks can be assigned to time series data. The quality flags, which are listed in the table below, are available. 10

Table 1 Summary of the quality flags Quality flag level Quality flag Abbreviation Description Primary Secondary Good Missing Unchecked Estimated Suspect Etc. Complete Incomplete Missing G M U E S C I M These flags are assigned automatically or manually to high resolution data. Further primary flags are supported These flags are assigned automatically to derived data on daily and upper time level. Edited Ed This flag is automatically assigned to data which has been edited manually or automatically. Within rating Beyond upper limit Beyond lower limit No rating etc. Tertiary Snow d Trace WR BL> BL< NR t These flags are assigned to derived high resolution data, e.g. flow data, which is derived by applying rating curves. Further secondary flags are supported This flag is assigned to rainfall data as additional information. Apportioned a This flag is automatically assigned Etc. Further tertiary flags are supported. Derived data on daily and upper time level generally gets the primary CIM flag. Additionally this data gets secondary flags which are derived directly from the primary flags of the source values (including their percentages). A typical flag of a daily mean value would be I [30% G, 55% U, 10% S, 5% M] 11

By analyzing the quality flags data can be included or excluded automatically from calculations and statistical analysis. The quality flags can also be exported into files and reports and can be viewed or edited in the WISKI graph and WISKI table (see fig. 2 and 3). In order to trace all modifications of basic data and time series data the corresponding transactions are logged. By logging the transaction, the user name, the time of modification as well as the server name an extended audit trail is available. In the figure below the audit trail log dialog as well as the audit trail filter are depicted. Audit trail log Audit trail filter Audit trail query builder Fig. 5 Audit trail log and filter QA on workflow level The quality assurance on the workflow level must be ensured by: - Clear definitions and policies - Adoption to the requirements of the individual organization - Regular training of the users. 12

Clear definitions and policies naturally must be communicated by the institution itself, but it is helpful if they are supported by the software, for example, by online guidelines or use cases. For the WISKI package many hundred of use cases are available which describe workflows in a user friendly manner. Furthermore, the workflow must be supported by adoption to the requirements of individual organizations. This is supported by different wizards for exporting data, creating reports and so on. These wizards lead through the steps of fulfilling the required tasks. QA on system level The quality assurance on the system level, which is related to the hydrological network, must be ensured by: - Hydrological cross-checking - Spatial analysis in GIS environment - Optimization of number and location for the gauges. A hydrological cross-checking could for example be the validation of rating curves in a catchment area by calculation of the annual water balance. This calculation can be carried out by applying the formula editor and the macro scripting language KiBasic. By using these tools almost every calculation is possible. This tool can also be applied for creating new virtual time series. The quality assurance of spatial data can be carried out by applying the WISKI Arc View extension, which is linked directly to the WISKI database. This tool provides a cartographic overview of the station network and allows by its bi-directional communication with WISKI the correction and analysis of spatial data (see fig.6). 13

Fig. 6 Spatial analysis in the WISKI Arc View extension The quality assurance on the system level, which is related to the data archive, must be ensured by: - Overall analysis about trends in number and length of gaps in the records - Overall analysis about trends in data quality - Overall analysis about trends in time to publish the data - Comparisons between these indicators between different regions, directorates or departments. In order to fulfil these important tasks several areas of WISKI functionalities (statistical tools, reporting etc.) are available. This is a part of data quality assurance, which has still to be extended. This extension is planned for a next release within the KISTERS R&D budget. 14

QA on IT level Last but not least the quality assurance on the IT level should not be omitted. Generally these tasks have to be carried out by the authority itself. The quality assurance on the IT level must be ensured by: - Standardized methods and policies for data base back-up and maintenance - Back-up strategy for data base and software servers, multiple network lines - Regular data base maintenance (Statistics etc.) - Regular checks of hardware and backup systems - etc. CONCLUSION The WISKI system should provide a reliable and flexible archive enabling a central, consolidated source of information. It can provide a modern software platform with sufficient flexibility to encompass future business change and data demands. Its implementation in different authorities reduces costs for support and maintenance, frees up staff resources to improve data quality, undertake more comprehensive analysis and to convert data into information for managing the environment. On all levels of data quality assurance WISKI provides ideas, strategies or tools to increase the quality of the data and information. However, due to the never ending process of increasing quality continuous discussion on this topic is needed. 15