PhUSE Metadata Management Project



Similar documents
PhUSE Annual Meeting, London 2014

Rationale and vision for E2E data standards: the need for a MDR

A Brief Introduc/on to CDISC SDTM and Data Mapping

CDISC Roadmap Outline: Further development and convergence of SDTM, ODM & Co

Lessons on the Metadata Approach. Dave Iberson- Hurst 9 th April 2014 CDISC Euro Interchange 2014

Managing and Integrating Clinical Trial Data: A Challenge for Pharma and their CRO Partners

UTILIZING CDISC STANDARDS TO DRIVE EFFICIENCIES WITH OPENCLINICA Mark Wheeldon CEO, Formedix Boston June 21, 2013

Electronic Submission of Regulatory Information, and Creating an Electronic Platform for Enhanced Information Management

Business & Decision Life Sciences What s new in ADaM

CDISC and Clinical Research Standards in the LHS

BRIDGing CDASH to SAS: How Harmonizing Clinical Trial and Healthcare Standards May Impact SAS Users Clinton W. Brownley, Cupertino, CA

Business & Decision Life Sciences

Udo Siegmann member of e3c, CDISC Sen. Dir. Acc. Management PAREXEL

Implementing CDASH Standards Into Data Collection and Database Design. Robert Stemplinger ICON Clinical Research

USE CDISC SDTM AS A DATA MIDDLE-TIER TO STREAMLINE YOUR SAS INFRASTRUCTURE

The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar

Data Standards and the National Cardiovascular Research Infrastructure (NCRI)

Streamlining the Flow of Clinical Trial Data: EHR to EDC to Sponsor

PharmaSUG 2016 Paper IB10

Introduction to the CDISC Standards

PhUSE Paper CD13

Automate Data Integration Processes for Pharmaceutical Data Warehouse

Overview of CDISC Implementation at PMDA. Yuki Ando Senior Scientist for Biostatistics Pharmaceuticals and Medical Devices Agency (PMDA)

Modernizing Your Data Strategy

Gregory S. Nelson ThotWave Technologies, Cary, North Carolina

Practical application of SAS Clinical Data Integration Server for conversion to SDTM data

Implementing the CDISC standards into an existing CDMS

Understanding CDISC Basics

Implementation of SDTM in a pharma company with complete outsourcing strategy. Annamaria Muraro Helsinn Healthcare Lugano, Switzerland

SDTM AND ADaM: HANDS-ON SOLUTIONS

PharmaSUG 2015 Paper SS10-SAS

SDTM-ETL TM. The user-friendly ODM SDTM Mapping software package. Transforming operational clinical data into SDTM datasets is not an easy process.

XClinical offers an integrated range of software products for CROs, pharmaceutical, medical device and biopharmaceutical companies.

Bringing Order to Your Clinical Data Making it Manageable and Meaningful

Transforming CliniCal Trials: The ability to aggregate and Visualize Data Efficiently to make impactful Decisions

Enterprise Data Center Networks

Using SAS Data Integration Studio to Convert Clinical Trials Data to the CDISC SDTM Standard Barry R. Cohen, Octagon Research Solutions, Wayne, PA

We are pleased to share the recent topics on CDISC standards with you at the Japan Interchange.

Guidance for Industry

The Future of Clinical Data in Clinical Research

CDER/CBER s Top 7 CDISC Standards Issues

Did you know? Accenture can deliver business outcome-focused results for your life sciences research & development organization like these:

Using the SAS XML Mapper and ODS PDF to create a PDF representation of the define.xml (that can be printed)

PK IN DRUG DEVELOPMENT. CDISC management of PK data. Matteo Rossini Milan, 9 February 2010

DTCC Data Quality Survey Industry Report

Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers

Data Conversion to SDTM: What Sponsors Can Do to Facilitate the Process

Using SAS in Clinical Research. Greg Nelson, ThotWave Technologies, LLC.

Use of Metadata to Automate Data Flow and Reporting. Gregory Steffens Novartis PhUSE 13 June 2012

MDM Approach for EVMPD & IDMP Compliance

Providing Regulatory Submissions In Electronic Format Standardized Study Data

ADaM or SDTM? A Comparison of Pooling Strategies for Integrated Analyses in the Age of CDISC

IBM Analytics Make sense of your data

SDTM, ADaM and define.xml with OpenCDISC Matt Becker, PharmaNet/i3, Cary, NC

Qualification Process for Standard Scripts in the Open Source Repository with Cloud Services

ADaM Implications from the CDER Data Standards Common Issues and SDTM Amendment 1 Documents Sandra Minjoe, Octagon Research Solutions, Wayne, PA

Extracting the value of Standards: The Role of CDISC in a Pharmaceutical Research Strategy. Frank W. Rockhold, PhD* and Simon Bishop**

Accenture Accelerated R&D Services: CDISC Conversion Service Overview

Development of an open metadata schema for Prospective Clinical Research (openpcr)

Use of Electronic Health Records in Clinical Research: Core Research Data Element Exchange Detailed Use Case April 23 rd, 2009

Programme Guide PGDCDM

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

Business & Decision Life Sciences CDISC Workshop: From SDTM to ADaM: Mapping Methodologies

Clinical Data Management BPaaS Approach HCL Technologies

SAS CLINICAL TRAINING

ABSTRACT INTRODUCTION THE MAPPING FILE GENERAL INFORMATION

Introducing webmethods OneData for Master Data Management (MDM) Software AG

The ADaM Solutions to Non-endpoints Analyses

Bridging Statistical Analysis Plan and ADaM Datasets and Metadata for Submission

PhUSE Paper CD07. Data Governance Keeping Control through a Well-Defined Change Request Process

Clinical Trial Data Integration: The Strategy, Benefits, and Logistics of Integrating Across a Compound

Accelerating Clinical Trials Through Shared Access to Patient Records

4. Executive Summary of Part 1 FDA Overview of Current Environment

Development of CDISC Tuberculosis Data Standards

DATA GOVERNANCE AT UPMC. A Summary of UPMC s Data Governance Program Foundation, Roles, and Services

Therapeutic Area Standards (TAS) Initiative Project Plan

How to easily convert clinical data to CDISC SDTM

Statistical Operations: The Other Half of Good Statistical Practice

SDTM Validation Rules in XQuery

What is the Certified Health Record Analyst (CHDA)?

A white paper presented by: Barry Cohen Director, Clinical Data Strategies Octagon Research Solutions, Inc. Wayne, PA

Training/Internship Brochure Advanced Clinical SAS Programming Full Time 6 months Program

DIaaS (Data Integration as A Service) CDISC Conversion Platform

EHR Standards Landscape

Master Data Management The Nationwide Experience. Lance Dacre Director, Data Governance

OpenCDISC.org an open source initiative delivering tools for validation of CDISC data

Health Data Analysis Specialty Track Curriculum Competencies

PharmaSUG Paper CD13

Essential Elements of a Master Data Management Architecture

How to Create Variables Related to Age Joyce Gui and Shaoan Yu Merck & Company, Rahway, NJ

ABSTRACT On October 1st, 2008, CDASH released the first 16 common CRF streams (or domains) for use by the Pharmaceutical Industry.

Transcription:

PhUSE SDE Mee<ng, NY 2015 PhUSE Management Project, Study data standards, Master data, terminology and interoperability defini<ons Mitra Rocca, FDA Marcelina Hungria, DIcore Group

Table of Content PhUSE CSS Emerging Technology (ET) Working group Management Project DefiniHons ImplementaHon DefiniHons and study data standards Master data Controlled terminology Interoperability Pooling, aggregahon, integrahon Lessons learned

PhUSE CSS Emerging Technology FDA/PhUSE ComputaHonal Science Symposium (CSS) is a collaborahve effort between industry and the FDA to work on implementahon of data standards In 2013 a new working group was established focusing on the following emerging technologies: semanhc technology (now in a dedicated working group) management Cloud compuhng Big data The ET WG has re- organized in 2014

Management Project Goals Changing landscape: need for concept based Repository (MDR) from protocol to data submission

Project Team Deliverables Defini<ons Document hzp://www.phusewiki.org/wiki/index.php?htle=_management Comments to FDA Guidances SubmiKed to the FDA docket (by the May- 2014 deadline)

Defini<ons Soup. 6

Defini<ons 1 METADATA MANAGEMENT 1.1 1.2 Structural metadata 1.3 Descrip5ve metadata 1.4 Study Instance 1.5 repository 1.6 registry 1.7 Data element 1.8 ABribute 1.9 Class 1.10 Data type 1.11 Value level metadata 2 CONTROLLED TERMINOLOGY, CODE SYSTEMS & VALUE SETS 2.1 Controlled Terminology/controlled vocabulary 2.2 Code system 2.3 Dic5onary 2.4 Concept 2.5 Code 2.6 Code list 2.7 Value set 3 MASTER DATA MANAGEMENT 3.1 Master Data 3.2 (Master) Reference Data 3.3 Master Data Management 4 INTEROPERABILITY Categoriza5on of Interoperability (by HL7) 4.1 Technical interoperability ( machine interoperability ) 4.2 Seman5c interoperability 4.3 Process Interoperability 5 DATA AGGREGATION, INTEGRATION, POOLING 5.1 Data pooling 5.2 Data integra5on 5.3 Data aggrega5on

PhUSE SDE Mee<ng, NY 2015 Approach Defini<ons Lessons Learned

Defini<ons 1 METADATA MANAGEMENT 1.1 1.2 Structural metadata 1.3 Descrip5ve metadata 1.4 Study Instance 1.5 repository 1.6 registry 1.7 Data element 1.8 ABribute 1.9 Class 1.10 Data type 1.11 Value level metadata 2 CONTROLLED TERMINOLOGY, CODE SYSTEMS & VALUE SETS 2.1 Controlled Terminology/controlled vocabulary 2.2 Code system 2.3 Dic5onary 2.4 Concept 2.5 Code 2.6 Code list 2.7 Value set 3 MASTER DATA MANAGEMENT 3.1 Master Data 3.2 (Master) Reference Data 3.3 Master Data Management 4 INTEROPERABILITY Categoriza5on of Interoperability (by HL7) 4.1 Technical interoperability ( machine interoperability ) 4.2 Seman5c interoperability 4.3 Process Interoperability 5 DATA AGGREGATION, INTEGRATION, POOLING 5.1 Data pooling 5.2 Data integra5on 5.3 Data aggrega5on

Approach Master Data Management Synonym DefiniHon & source DescripHon Example Recommended definihon Reference Data Management; MDM [Gartner Magic Quadrant for Master Data Management of Customer Data SoluHon] hzp://www.gartner.com/technology/reprints.do?id=1-1ck9udo&ct=121019&st=sb MDM is a technology- enabled discipline in which business and IT work together to ensure the uniformity, accuracy, stewardship, semanhc consistency and accountability of the enterprise's official, shared master data assets. [Source: Master Data Management] Master Data Management (MDM) is the collechve applicahon of governance, business processes, policies, standards and tools facilitate consistency in data definihon. The idea of Master Data focuses on providing unobstructed access to a consistent representa5on of shared informa5on [Source: SAS White Paper on SupporHng Your InformaHon Strategy with a Phased Approach to Master Data Management Master Data Management (MDM) comprises of a set of processes and tools that consistently define and manage the master data and master reference data of an enterprise, which are fundamental to the company s business operahons. MDM has the objechve of providing processes & tools for collechng, aggregahng, matching, consolidahng, quality- assuring, persishng and distribuhng such data throughout an organizahon to ensure consistency and control in the ongoing maintenance and applicahon use of this informahon. There are different models for master data management the 2 main extremes are Centralized model where all data are managed within a central data store and pushed to the different applicahons within an organizahon. Decentralized model (registry) where the master data are managed within each applicahons but then reconciled through a registry systems to federate. Specific products from vendors such as INFORMATICA, IBM, Soqware AG, Set of processes and tools needed for the deployment of master data and master reference data within an organizahon.

(Organization/ Enterprise Level) (Drug Level) Drug Structural Descriptive Drug Structural Drug Descriptive Semantic Descriptive Process Descriptive Semantic Descriptive Process Descriptive Subset of IDMP standard + CDISC (CDASH, SDTM for a compound) (Study Level) Study Subset of CDISC CDASH, SDTM standard (based on company best practice) Study Structural Semantic Descriptive Study Descriptive Process Descriptive

Master Data

How Controlled Vocabularies are described and used Codes C16576 for F Concept Identifiers Designations Female F (Primary) female Concepts C16576 + F Concept Representation ISO 21090 Datatypes the CD Concept Descriptor Controlled Terminology In define.xml (machine processable): Code System (CodeList Context): nciextcodeid (not directly processable URI instead) Value Set (CodeList) CUI for SEX: C66731 Code CUI for Designa<on F (Female): C16576 Code System Versioning Code Systems Codelist Value Set & Code with CDISC example Value Set Definition Value Set Versioning Value Sets C66731 for SEX inspired from Julie James, BlueWave Informatics

Interoperability

Data Pooling, Integra<on, Aggrega<on Dataset 1 Dataset 2 Dataset 3 AGGREGATION AddiHonal grouping or derivahon of data POOLING Storing data together without changing the datasets INTEGRATION: TransformaHon, mapping or harmonizahon of data (ETL process)

Lessons learned (Compiled from different team members) Efficient Data Integra<on and compliance to regulatory standards does not start ader pooling (retroac<ve approach); it starts with the protocol (proac<ve approach) A proac<ve approach is based on two components: o DefiniHon of Master Data (Drug Products, Studies, Sites, InvesHgators,..) and associated descriphve metadata o DefiniHon of study structural metadata aka study specific data standards as a subset of the enterprise wide variables and value sets contained in a repository (MDR) To be manageable, variables in an MDR need to be grouped in seman<cally meaningful "clinical research concepts" (CRC)

CHANGING LANDSCAPE : Enforcing data standards from protocol onwards Retro-active approach from paper protocol, Pro-active approach with structural metadata Different interpretations of same protocol Limited standards Time to build integrated SDTM data sets 17 Courtesy of Isabell de Zegher 2014 PAREXEL INTERNATIONAL CORP. / 17 CONFIDENTIAL One single interpretation of protocol Increased efficiency, consistency & quality through standards Reduced time for integration and secondary data use

CONCEPT BASED MDR : Protocol is not about variables but about concepts Annotated ecrf for Patient Demography? Courtesy of Isabell de Zegher 2014 PAREXEL INTERNATIONAL CORP. / 18 CONFIDENTIAL? SDTM data set (SAS) (different t variables names and different structures than ecrf)

CONCEPT BASED MDR: CDASH/SDTM can be organized by CRC Courtesy of Isabell de Zegher Concept CDASH Question CDASH Variable Subject What is the sex of the subject? What is the subject s date of birth? What is the ethnicity of the subject? What is the race of the subject? What is the subject s age? What are the age units used? ecrf content description 2014 PAREXEL INTERNATIONAL CORP. / 19 CONFIDENTIAL SEX BRTHDAT or BRTHYR BRTHMO BRTHDY ETHNIC RACE AGE AGEU SDTM Variable SEX BRTHDTC EHTNIC RACE AGE SDTM mapping AGEU

Conclusions Let us speak the same language We need to change the way we consider compliance to data standards and data integrahon: o From a retroachve way (building define.xml at submission) o To a proachve approach (study data standards defined at study setup) We need new tools to manage metadata: Concept based MDR o Grouping variables into semanhcally meaningful concepts (following industry wide pazerns) o Linking data sources (e.g, CDASH based collechon) to data submission (SDTM) variables o Linking with controlled terminology o With capabilihes to handle standards versioning

Defini<on Project Isabelle de Zegher (co- chair) Par<cipants Parexel Mitra Rocca (co- chair) FDA Marcelina Hungria (co- chair) DiCore Group Julie James BlueWave InformaHcs Tim Church Torch Yun Oldshue Takeda Praveen Garg ICON Kenneth Stoltzfus Accenture Gregory Steffens NovarHs John Leveille d- Wise Aimee Basile Celgene Sam Hume CDISC

PhUSE SDE Mee<ng, NY 2015 Mitra Rocca Senior Medical InformaHcian Office of TranslaHonal Sciences CDER, FDA Mitra.rocca@fda.hhs.gov Marcelina Hungria Clinical Data Standards & IntegraHon Consultant / Owner DIcore Group, LLC mhungria@dicoregroup.com