Pooling and Sharing Statistical Series (P3S)



Similar documents
Traineeships in DG-Statistics

European Reporting Framework (ERF) - a possible solution to reporting challenges for banks

Strategy and Developement In Austrian Bank-reporting

Business Operations. Module Db. Capita s Combined Offer for Business & Enforcement Operations delivers many overarching benefits for TfL:

A. Document repository services for EU policy support

An Enterprise Architecture and Data quality framework

Conference on Data Quality for International Organizations

CASE STUDY: XBRL IMPLEMENTATION FOR INDONESIA S ISLAMIC BANKING REGULATORY REPORTING SYSTEM

Enhancing the synergies between the SSM and statistical reporting 1

VISION OF THE FUTURE NATIONAL PAYMENT SYSTEMS

AnaCredit Gives Banks an Opportunity to Improve Data Management, but Challenges Remain

COST OF BORROWING INDICATORS: METHODOLOGICAL NOTE 1

Documenting the research life cycle: one data model, many products

EXPLORING THE CAVERN OF DATA GOVERNANCE

Guidelines on operational functioning of colleges

Module 2 IS Assurance Services

Understanding Storage Virtualization of Infortrend ESVA

EIOPACP 13/011. Guidelines on PreApplication of Internal Models

F.A.Q. Differences between statistical and supervisory reporting

IAAS CLOUD EXCHANGE WHITEPAPER

Memorandum of Understanding on financial crisis management

Enterprise Information Management and Business Intelligence Initiatives at the Federal Reserve. XXXIV Meeting on Central Bank Systematization

Guidelines on preparation for and management of a financial crisis

Central Credit Registers (CCRs) as a Multi Purpose Tool to close Data Gaps

A Dell Technical White Paper Dell Compellent

2.0 OBJECTIVES OF THE CONSULTANCY SERVICE

Data Warehouse: Introduction

G-Cloud Service Definition Canopy Big Data proof of concept Service SCS

Delegations will find attached the draft Council conclusions on a Capital Markets Union, as prepared by the Economic and Financial Committee.

OPEN DATA: ADOPTING A SECURITY-MINDED APPROACH

PUB (MPI) 1-62 Reference: Gartner Scorecard

Evaluation of the Expansion of the GEF Partnership Concept Note

SimCorp Solution Guide

Resolution No. 391/2008 of the Polish Financial Supervision Authority. of 17 December 2008

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

EFFECTIVE STORAGE OF XBRL DOCUMENTS

6 Cloud computing overview

THE ROLE OF TECHNOLOGY IN PRIMARY REPORTING 1 December 2004

Centralized Operations: Strategies for Today and Tomorrow

Frequently Asked Questions

Regional workshop BCCL-METAC Operational functioning of supervisory Colleges BEYROUTH, April 25 th, 2012

Business Plan Summary

ORACLE PRODUCT DATA HUB

Irving Fisher Committee on Central Bank Statistics. Data-sharing: issues and good practices

Challenges of Analytics

ECB-PUBLIC. 2. General observations

MAHAGOV CLOUD. Maharashtra State Data Center. December Directorate of Information Technology, Government of Maharashtra.

LIQUIDITY RISK MANAGEMENT GUIDELINE

The ECB Statistical Data Warehouse: improving data accessibility for all users

Your Partner for European Payment Processing

REINSURANCE RISK MANAGEMENT GUIDELINE

Global Research Alliance Charter

Implementation of Solvency II: The dos and the don ts

Developing Commercial Property Price Indicators

MISSION VALUES. The guide has been printed by:

Job description - Fundraising Database Reporting and Solutions Analyst

The Information Commissioner s Office response to HM Treasury s Call for Evidence on Data Sharing and Open Data in Banking

Research Data Archival Guidelines

dxhub Denologix MDM Solution Page 1

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

Corruption Risk Assessment Topic Guide

EIB Group Risk Management Charter

Works closely with all members of the Training and Consultancy team, and the wider Operations, Fundraising and Marketing directorate.

Liquidity Stress Testing

Next-Generation IT Asset Management: Transform IT with Data-Driven ITAM

Oracle Database 12c Plug In. Switch On. Get SMART.

EUROPEAN COMMISSION Enterprise and Industry DG

Emerging Opportunities and Challenges with Central Bank Data For the Sveriges Riksbank Conference September 2015

G20/OECD HIGH-LEVEL PRINCIPLES OF LONG-TERM INVESTMENT FINANCING BY INSTITUTIONAL INVESTORS

Improving the visualisation of statistics: The use of SDMX as input for dynamic charts on the ECB website

Guidance Note 3/99. Guidance Note 2/99. Money Market Funds: European Central Bank Reporting Requirements. December 2011

Steering Insurance Companies 3.0 Economic Capital and Economic Value Management integrated

* * * BIS central bankers speeches 1

List of legislative acts

Mapping of outsourcing requirements

ECM Migration Without Disrupting Your Business: Seven Steps to Effectively Move Your Documents

University of Edinburgh Risk Policy and Risk Appetite

EACB messages for the Trialogue negotiations on Bank Recovery and Resolution Directive

14 December 2006 GUIDELINES ON OUTSOURCING

The concrete impacts of BCBS principles on data value chains

Integrated Stress Testing

DLT Solutions and Amazon Web Services

Transcription:

Directorate General Statistics Pooling and Sharing Statistical Series () An on-going data management initiative at the Banque de Renaud Lacroix Director, Statistical and IT engineering division CEMLA-FIF, 8 June RESTRICTED Banque de Projet XXX de pilotage du JJ/MM/AAAA

Outline Goals of the new set-up and key principle Data and confidentiality issues Governance Technical solution Project plan Access to individual data for external users Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 2 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Goals of the new set-up Pooling data. To gather data on financial institutions and non-financial corporations Collected by the CB and by the Banking and Insurance Supervisory Authority While respecting confidentiality rules Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 3 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Goals of the new set-up to allow enhanced analysis for all involved departments and for the supervisory authority Offering access to internal users on a need to know basis Fostering synergies and economy of scale Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 4 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Goals of the new set-up Create a value chain between all supervisory and CB functions Collecting DQM, storing and processing Sharing and using Contribute to the Banque de Digital Plan implementation Add a new dimension to the Banque de information system Enable a data sharing that spans silos and is crucial to analyse and better mitigate risks in the future - the ultimate aim being strategic and not technical. Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 5 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Key principle Users use data in their own Information System. data (and their metadata) are available in alternative formats (SAS, CSV) Statistics Markets, Payment systems Economics and Research Financial stability Banking and Insurance Supervision Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 6 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Data and confidentiality issues A collaborative work involving 5 DGs Work-streams on confidentiality issues with representatives of all stakeholders and of the Legal department Data offering and demand expressed by each DG data typology compliant to legal constraints Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 7 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Data and confidentiality issues 15 main datasets Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 8 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Data and confidentiality issues Most (individual) data can be shared by personal accreditations updated every 6 months (Generally shareable data) Monetary statistics, credit register, Financial reports from banks, For a subset of more sensitive data, access will be precisely limited and granted on a need-to-know basis (Non generally shareable data) Components of the solvency & liquidity ratios, information on business managers,. Access rights are defined through an Accreditation Matrix which crosses sub-family and users Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 9 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Data and confidentiality issues Accreditations will be defined for subsets of data 202 subsets of data 35 subsets «generally non shareable» 167 subsets «generally shareable» Functional matrix (supply/demand in datasets terms) Generally shareable data pooling rate 3 Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 10 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Governance : Validation and Monitoring Committee (PVMC) Committee at DG level, co-chaired by DGS and IT, including all stakeholders and the Legal department Validating the lists of accredited agents Monitoring functioning Addressing unforeseen issues Seeking for consensus Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 11 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Governance : main tasks undertaken by the PVMC Definition of access policy Updates of the white list (cartography of individual access to each dataset) PVMC High level monitoring Banque de direction Renaud de Lacroix l organisation Banque er du de développement Annual report to Governor and Deputy Governors with a presentation to BdF Executive Board RESTRICTED Page 12 PSG Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Governance : operational process for user accreditation to for generally shareable data User request Management approval (Head of the unit) Service Desk Status of the request Person in the white list for the dataset required Yes : access granted No : access is temporarily denied and the request is submitted to the PVMC (update of the white list) Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 13 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Governance : operational process for user accreditation to for restricted data User request Management approval (Head of the unit) Service Desk If no consensus reached (power of veto from data producers) Decision communicated by the PVMC : access granted or denied PVMC Yes / No Governor, Deputy Governors Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 14 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Technical solution : design Ability to manage large volumes of data Secured access rights Flexibility to integrate and handle heterogeneous data: Codification of series is specific to each dataset Technical formats : XBRL, SDMX, SAS, Powerful research engine Scalability and interoperability (integration of new data sets, connection to additional analysis tools) A Proof of Concept has been performed and approved by key users Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 15 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Technical solution : architectural frame A solution based on open source «Big Data» technology A dedicated BigData platform in the Banque de Datacenter An up-to-date search engine (ElasticSearch) A Not Only SQL (NoSQL) DataBase well suited for the storage of heterogeneous data Main differentiator : all formats are accepted (SDMX-ML, XBRL,..) Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 16 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Technical solution : architectural frame Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 17 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Technical solution : metadata Metadata are used to describe the statistical series, support the search functionalities and link heterogeneous data Metadata selection is performed by data scientists who identify the most suitable information to be used as Metadata. This is an ongoing process, the Metadata list being enriched continuously. Furthermore Metadata can be queryable or not, depending on their nature Non queryable metadata also called descriptive metadata provide information on sub-families and can be visualised in the data catalog Data sensibility, applied methodology,. Queryable metadata can be used as search criteria and address either common or specific attributes Family, sub-families, source, population, (common attributes) Accounting sector, (specific attributes) Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 18 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Technical solution : search engine ElasticSearch is the selected search engine for the This search engine is well suited for BigData contexts with large volumes as well as unstructured and heterogeneous information stored in a NoSQL database Search can be performed not only through pre-defined criteria such as the (queryable ) metadata but also using natural language. Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 19 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Technical solution : storage 2000 GB of data integrated into MUSES, generating a storage need of 10000 GB after having taken into account the auxiliary tools required for the organization, research and data identification Target : 400-500 million series As a comparison, the BDF macro-economic database records 6 million series - i.e. 28 GB of data Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 20 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Project plan Project duration is at least 2 years Four steps are considered Step 0 (end 2014 early ): Initialization of the technical base Step 1 (june ) : Most functional and technical requirements Integration of a first significant group of subset of data, for all DGs Related control and restitution functionalities Step 2 (end ) : All requirements Further integration of the subsets of data Step 3 (2016) : Completion of the integration of the subsets of data Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 21 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Project plan The realization phase is on track and on schedule First go-live is scheduled for July : Integration of an highly significant group of data covering all business areas Access granted to a first batch of 60 users from all business areas (BDF and Supervisory authority) Starting in January 2016 : Progressive accreditations increase (300 users as a target) Integration of additional datasets in Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 22 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Main datasets to be loaded in by end 2016 BSI and MIR Data from credit institutions FINREP reporting Other prudential data (solo & consolidated) Data from securitization bodies and investment firms International banking data Insurance data Investment funds data Business surveys Securities holding and issuance Credit Register Data from the balance sheet of corporates Granular data on 'Household over-indebtedness' Data on negotiable debt securities Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 23 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Access to individual data for external users Since 2012, the Banque de has reset the instruction procedures for individual data access; The Banque de has chosen a pragmatic approach: Prevent legal risks Contain operational costs; The procedure aims at guaranteeing : A level playing field among researchers A better-governed process A full transparency of data requests. Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 24 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

A new process implemented by the Directorate General Statistics (1/3) The applicant(s) fills in a detailed application form describing the research project and the team organisation A confidentiality agreement is signed by each member of the research team Applications are collectively reviewed by a decision body ( Secretariat for the instruction of data requests ) chaired by the Director General Statistics. Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 25 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

A new process implemented by the Directorate General Statistics (2/3) The committee is composed of representatives from all business areas of the Banque de and the legal department ; The Secretariat decides on the approval of the request based on : the relevance of the project the legal framework under which the data have been collected Strict adherence to the European regulation for data collected according to an ECB regulation the data protection measures indicated by the applicant. Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 26 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

A new process implemented by the Directorate General Statistics (3/3) In case of approval of the request by the Secretariat, the DGS is responsible for providing the data to the applicant (in close cooperation with the unit in charge of the production of the relevant data) ; The entire technical process may be lengthly and cumbersome : 1. Data extraction and preparation : specific to each dataset 2. Anonymisation : challenging and difficult to automate when indirect identification must be taken care of. 3. USB drive hand-delivered to the applicant Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 27 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Towards a more efficient process Though an internal platform designed exclusively for BDF users (including supervisory departments), will allow to speed up the first phase and enrich the data with metadata Extract data & metadata Anonymisation Raw Database Final Database Secured extranet Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 28 Applicant Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA

Thank you for your attention Banque de direction Renaud de Lacroix l organisation Banque er du de développement RESTRICTED Page 29 Projet : XXX de pilotage de pilotage N X du JJ/MM/AAAA