Practical application of SAS Clinical Data Integration Server for conversion to SDTM data



Similar documents
Business & Decision Life Sciences

Meta-programming in SAS Clinical Data Integration

PhUSE Paper CD13

Automate Data Integration Processes for Pharmaceutical Data Warehouse

USE CDISC SDTM AS A DATA MIDDLE-TIER TO STREAMLINE YOUR SAS INFRASTRUCTURE

Using SAS Data Integration Studio to Convert Clinical Trials Data to the CDISC SDTM Standard Barry R. Cohen, Octagon Research Solutions, Wayne, PA

How to easily convert clinical data to CDISC SDTM

CDISC SDTM & Standard Reporting. One System

Sanofi-Aventis Experience Submitting SDTM & Janus Compliant Datasets* SDTM Validation Tools - Needs and Requirements

Rationale and vision for E2E data standards: the need for a MDR

Statistical Operations: The Other Half of Good Statistical Practice

SDTM, ADaM and define.xml with OpenCDISC Matt Becker, PharmaNet/i3, Cary, NC

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

UTILIZING CDISC STANDARDS TO DRIVE EFFICIENCIES WITH OPENCLINICA Mark Wheeldon CEO, Formedix Boston June 21, 2013

WHITE PAPER. CONVERTING SDTM DATA TO ADaM DATA AND CREATING SUBMISSION READY SAFETY TABLES AND LISTINGS. SUCCESSFUL TRIALS THROUGH PROVEN SOLUTIONS

PharmaSUG 2015 Paper SS10-SAS

Using the SAS XML Mapper and ODS PDF to create a PDF representation of the define.xml (that can be printed)

Implementing the CDISC standards into an existing CDMS

Lessons on the Metadata Approach. Dave Iberson- Hurst 9 th April 2014 CDISC Euro Interchange 2014

PharmaSUG Paper CD13

A white paper presented by: Barry Cohen Director, Clinical Data Strategies Octagon Research Solutions, Inc. Wayne, PA

Accenture Accelerated R&D Services: CDISC Conversion Service Overview

ClinPlus. Report. Technology Consulting Outsourcing. Create high-quality statistical tables and listings. An industry-proven authoring tool

ABSTRACT INTRODUCTION THE MAPPING FILE GENERAL INFORMATION

Managing Clinical Trials Data using SAS Software

Managing Custom Data Standards in SAS Clinical Data Integration

Utilizing the SAS Business Intelligence Platform in a Clinical Trial Environment

Using SAS in Clinical Research. Greg Nelson, ThotWave Technologies, LLC.

SDTM-ETL TM. The user-friendly ODM SDTM Mapping software package. Transforming operational clinical data into SDTM datasets is not an easy process.

Gregory S. Nelson ThotWave Technologies, Cary, North Carolina

Understanding CDISC Basics

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

ORACLE CLINICAL. Globalization. Flexibility. Efficiency. Competition ORACLE DATA SHEET OVERVIEW ROBUST CLINICAL DATA MANAGEMENT SOLUTION

End-to-End E-Clinical Coverage with Oracle Health Sciences InForm GTM

Use of Metadata to Automate Data Flow and Reporting. Gregory Steffens Novartis PhUSE 13 June 2012

CDISC Roadmap Outline: Further development and convergence of SDTM, ODM & Co

The Recipe for Sarbanes-Oxley Compliance using Microsoft s SharePoint 2010 platform

Managing and Integrating Clinical Trial Data: A Challenge for Pharma and their CRO Partners

Overview of CDISC Implementation at PMDA. Yuki Ando Senior Scientist for Biostatistics Pharmaceuticals and Medical Devices Agency (PMDA)

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

Training/Internship Brochure Advanced Clinical SAS Programming Full Time 6 months Program

SDTM AND ADaM: HANDS-ON SOLUTIONS

Developing Microsoft SharePoint Server 2013 Advanced Solutions MOC 20489

What's New in SAS Data Management

Electronic Submission of Regulatory Information, and Creating an Electronic Platform for Enhanced Information Management

Did you know? Accenture can deliver business outcome-focused results for your life sciences research & development organization like these:

ABSTRACT THE ISSUE AT HAND THE RECIPE FOR BUILDING THE SYSTEM THE TEAM REQUIREMENTS. Paper DM

Implementation of SDTM in a pharma company with complete outsourcing strategy. Annamaria Muraro Helsinn Healthcare Lugano, Switzerland

A Macro to Create Data Definition Documents

Package R4CDISC. September 5, 2015

SAS Drug Development User Connections Conference 23-24Jan08

Extracting the value of Standards: The Role of CDISC in a Pharmaceutical Research Strategy. Frank W. Rockhold, PhD* and Simon Bishop**

SAS Business Intelligence Online Training

PhUSE Paper CD07. Data Governance Keeping Control through a Well-Defined Change Request Process

SAS IT Resource Management 3.2

Data Conversion to SDTM: What Sponsors Can Do to Facilitate the Process

CDISC and Clinical Research Standards in the LHS

DIaaS (Data Integration as A Service) CDISC Conversion Platform

Adam Rauch Partner, LabKey Software Extending LabKey Server Part 1: Retrieving and Presenting Data

ProjectTrackIt: Automate your project using SAS

Clinical Trial Data Integration: The Strategy, Benefits, and Logistics of Integrating Across a Compound

ADaM or SDTM? A Comparison of Pooling Strategies for Integrated Analyses in the Age of CDISC

Organization Profile. IT Services

Semarchy Convergence for Data Integration The Data Integration Platform for Evolutionary MDM

SAS BI Course Content; Introduction to DWH / BI Concepts

SDTM Validation: Methodologies and Tools

Developing Microsoft SharePoint Server 2013 Advanced Solutions

Contents. Introduction... 1

ClinPlus. Clinical Trial Management System (CTMS) Technology Consulting Outsourcing. Expedite trial design and study set-up

SAS CLINICAL STANDARDS TOOKIT

Metadata Submission Guidelines Appendix to the Study Data Tabulation Model Implementation Guide

Business Process Analysis & Management. Corporate Synergy

Introduction. Architecture Re-engineering. Systems Consolidation. Data Acquisition. Data Integration. Database Technology

IBM WebSphere ILOG Rules for.net

Clinical Data Management (Process and practical guide) Nguyen Thi My Huong, MD. PhD WHO/RHR/SIS

SOFTWARE TESTING TRAINING COURSES CONTENTS

Course: SAS BI(business intelligence) and DI(Data integration)training - Training Duration: 30 + Days. Take Away:

PharmaSUG Paper AD08

ADaM Implications from the CDER Data Standards Common Issues and SDTM Amendment 1 Documents Sandra Minjoe, Octagon Research Solutions, Wayne, PA

Metastorm BPM Interwoven Integration. Process Mapping solutions. Metastorm BPM Interwoven Integration. Introduction. The solution

METADATA-DRIVEN QLIKVIEW APPLICATIONS AND POWERFUL DATA INTEGRATION WITH QLIKVIEW EXPRESSOR

MOVING THE CLINICAL ANALYTICAL ENVIRONMENT INTO THE CLOUD

Work Process Management

How to Use SDTM Definition and ADaM Specifications Documents. to Facilitate SAS Programming

Developing ASP.NET MVC 4 Web Applications

Data-centric System Development Life Cycle for Automated Clinical Data Development System Kevin Lee, MarkLogic, Washington D.C.

Developing ASP.NET MVC 4 Web Applications MOC 20486

Dynamic Decision-Making Web Services Using SAS Stored Processes and SAS Business Rules Manager

The Intelligent Content Framework

SAS Add in to MS Office A Tutorial Angela Hall, Zencos Consulting, Cary, NC

Filestor Digital Asset Management. The way it works

TU04. Best practices for implementing a BI strategy with SAS Mike Vanderlinden, COMSYS IT Partners, Portage, MI

Essential Project Management Reports in Clinical Development Nalin Tikoo, BioMarin Pharmaceutical Inc., Novato, CA

MODULE FRAMEWORK : Dip: Information Technology Network Integration Specialist (ITNIS) (Articulate to Edexcel: Adv. Dip Network Information Specialist)

Aspen InfoPlus.21. Family

Scheduling in SAS 9.4 Second Edition

SOA REFERENCE ARCHITECTURE: WEB TIER

Transcription:

Paper DM03 Practical application of SAS Clinical Data Integration Server for conversion to SDTM data Peter Van Reusel, Business & Decision Life Sciences, Brussels, Belgium Mark Lambrecht, SAS, Tervuren, Belgium ABSTRACT Conversion of legacy clinical data into a standardized data model consists of different steps from annotating the CRF, preparing the mapping into the CDISC SDTM model to producing metadata reports and CRT-DDS (define.xml). In order to accelerate the conversion process, we have used the SAS Clinical Data Integration Server framework, which has been designed to help life sciences organizations visually design conversion processes driven by standardized metadata. We will discuss in this paper our implementation approach and how SAS Clinical Data Integration Server significantly shortened our throughput time needed to produce SDTM-compliant submission-ready packages. In addition, extensive compliancy and quality control checks ensure that the data and metadata is consistent and according to the defined standards. INTRODUCTION Most pharmaceutical organizations have only recently started adapting their internal data models to accommodate the CDISC data models, such as the Study Data Tabulation Model (SDTM) (http://www.cdisc.org). Therefore a core requirement in implementing CDISC standards in clinical development operations is the ability to convert existing or legacy data into CDISC-compliant formats. Business & Decision Life Sciences has been converting legacy clinical data into SDTM-structured data using SAS Clinical Data Integration Server. CLINICAL DATA INTEGRATION SERVER Clinical Data Integration Server is a processing framework provided by SAS to facilitate production of CDISCcompliant data sets and of the associated metadata reports, such as define.xml. A core component of all SAS solutions is the SAS Open Metadata Architecture (OMA) which stores the CDISC SDTM data model and makes it centrally available to all users and the different clinical projects. The SAS Clinical Data Integration Server framework contains the target SDTM domains (including the special-purpose relationship datasets), generic job templates, the CDISC controlled terminology and the variables needed to create a custom domain according to the instructions in the SDTM 3.1.1 IG. All related metadata in the SAS OMA is grouped together in repositories. The Foundation repository groups technical and environmental metadata that is needed in other repositories, e.g. about users, groups, security permissions, computing servers, scheduled jobs Other repositories are dependant on the Foundation repository. The Global Standards repository (Figure 1) groups the CDISC standard (meta)data and in this configuration acts as the parent repository of the different clinical project repositories. A user opens a metadata profile to a specific clinical project repository (Figure 1) and consequently has access to all parent repositories, given the right permissions. For example, if a user connects to the 712 clinical project repository, the Global Standards repository containing all CDISC standards metadata and the Foundation repository are accessible. This clear separation and dependency of repositories allows for better management of security and access to (meta)data based on the role of the user. Changes to objects requires going through a check in / check out cycle in user repositories whereby changes are captured in an audit trails. The object-locking facility has proven to be very useful when working with a large team of programmers (or data integration programmers) on the same conversion jobs. 1

Figure 1 : Structure of Clinical Data Integration Server repositories in the case of 3 clinical projects. For our use of SAS Clinical Data Integration Server, additional data and metadata was needed. For example, we added datasets with Controlled Terminology (Figure 2) tables, both at a global level (in the Global Standards Repository) and at the clinical project level. Value-level metadata is added in a folder both at the global, project and study levels. When value-level metadata has to be combined with the source data, a value-level metadata transpose custom transformation has been created. Figure 2 : Folder structure in Clinical Data Integration Server supporting the SDTM conversion and production. (a) Global level (b) Project level - (c) Study level. 2

PROCESS Figure 3 describes the different deliverables in a CDISC conversion process: - Generation of the SDTM Implementation Guide (IG) 3.1.1 annotated CRF s - Creation of the SDTM IG 3.1.1 datasets - Creation of the define.xml datasets Figure 3 : Conversion of legacy clinical data into the SDTM format, including define.xml, for submission purposes. Metadata is indicated in orange, data and data mapping in blue and processes in green. The process (Figure 3) is carried out by the conversion team consisting of data mappers who define the mapping and of programmers who create the conversion programs. The different steps in the whole process are : 1. Annotating the CRF s 2. Set-up of a new clinical project in Clinical DI 3. Creation of conversion processes in Data Integration Studio 4. Management of metadata and production of define.xml 5. Compliancy checks STEP 1: ANNOTATING THE CRF First the SDTM mappers annotate the CRF s, either paper or electronic, and add the SDTM variables onto the forms. (Figure 4). The described annotations allow to quickly understand how source data in the original format are mapped to SDTM target data. The define.xml will contain links to the respective annotated CRF s. 3

Figure 4 : CDISC annotated CRF s. STEP 2: SET-UP OF A NEW CLINICAL PROJECT When a new clinical project is set up, an external Microsoft Excel file is constructed that contains the source datasets, and the information needed to map source variables onto target variables. This separate source of metadata makes the setup at the start of the conversion process a flexible process. All the mapping information and the preparation of all needed metadata are done by the mappers in this Excel sheet. A new Clinical Data Integration Server study folder or clinical project repository is then created with the appropriate source, target and job folders. The target SDTM dataset structures are copied from the Global Standards repository. Project- or study-specific metadata about the datasets or variables is then programmatically added to the Clinical Data Integration Server metadata in the project or study level repositories (Figure 5) from the central mapping file. This metadata is later re-used for compliancy checks, for the production of value-level metadata, and for the creation of the define.xml. The time needed to set up a new study is kept to a minimum because of the automated procedure. 4

Figure 5 : Table and variable metadata as extended attributes and notes. The mapping sheet also contains advanced metadata such as a code list conversion table and an automatic transpose table. This metadata will be used later in the process to create both the define.xml, and to automate big parts of the conversion processes. STEP 3: CREATION OF CONVERSION PROCESSES IN SAS DATA INTEGRATION STUDIO The programmers then start working using the mapping logic produced by the data mappers. SAS Data Integration Studio provides a drag-and-drop interface which allows the programmers to visually create Data Integration Studio jobs (Figure 6) that auto-generate robust SAS code. Because one job per SDTM domain is created in each study, it is possible to reuse a large part of the transformation logic over different studies if the source dataset structure is stable. A large number of transformations are available in the Process Library of Data Integration Studio. The ability to generate custom transformations using the transformation generator allowed us to add specific clinical transformations such as ISO8601 date-time conversions, a look-up and transpose transformation These custom transformations can be parameterized and therefore easily deployed in different clinical projects and studies. Figure 6 : Part of a job flow in Data Integration Studio resulting in the creation of the AE SDTM domain 5

STEP 4 : METADATA MANAGEMENT Once the Data Integration Studio jobs are created and the SDTM datasets are populated with values, define.xml can be produced. Metadata stored in the SAS Metadata Server can be queried from Base SAS, Java, or.net programs. For our purposes, the Base SAS syntax was meeting our requirements. The information needed in the define.xml is easily extracted from the SAS metadata repositories (see Figure 7). The define.xml is produced in SAS Clinical Data Integration Server using a user-written custom macro that makes use of a SAS template. The define.xml includes the code lists, the value-level metadata and the pseudo-code used for the computed variables (Figure 8). By using specific style sheets, it is possible to surface the built-in links to annotated CRF s, to the code lists and computed variables. Figure 7 : Production of define.xml STEP 5: SDTM COMPLIANCE CHECKS The metadata about source, SDTM target datasets and job conversion metadata, is automatically collected in the SAS metadata server and enables further exploitation. Extensive compliance checks allow for cross-checking the produced SDTM datasets with the CDISC standards contained in the Global Standards repository, and with the produced define.xml for consistency. Quality issues that are identified are added to a large exception table, and in a second step a check report generator generates from this table a set of quality check reports. 6

Figure 8 : A detail of the define.xml file CONCLUSIONS Data from multiple phase Is to Phase IIIs and mega-trials with tens of thousands of subjects have been successfully converted into an SDTM-compliant submission package using the above described process. A similar process can be applied for ADaM dataset production. While the preparation of the mapping process is done in a central mapping file, the actual mapping processes and metadata exploitation is carried forward in SAS Clinical Data Integration Server. The main advantage of Clinical Data Integration Server is that existing business processes do not have to be redrafted, making them more efficient and transparent. The SAS Open Metadata Architecture has a flexible metadata model that drives standardization and enforces documentation. The user-specific project repositories impose collaboration and an audit trail of the development process. Clinical Data Integration Server is executed on scalable server-client architecture and a process that used to run for hours on an interactive SAS Windows session is now executed in minutes. This contributes to a fast and efficient optimization of the data mapping process, an added advantage that was not clear to us at the start of the project. 7

Figure 9 : Data Integration Studio automatically captures variable mapping inconsistencies The ability to rapidly translate the mapping requirements into robust Data Integration Studio-generated SAS code significantly reduces the time needed in a conversion project. Moreover, because the jobs are metadata-driven, Data Integration Studio captures the link between the mapped datasets and columns (impact analysis) and automatically checks for variable mapping inconsistencies (Figure 9). The flexibility of SAS Clinical Data Integration Server, the integrated metadata that is centrally accessible and the increased collaboration between developers have highly improved the efficiency of our team in executing CDISC conversion projects. The proven process methodologies, the CDISC expertise in our team, and the SAS Clinical Data Integration Server technology have all contributed to this success. REFERENCES Kilhullen, Michael. Implementing CDISC Data Models in the SAS Metadata Server Proceedings for the Pharmaceutical Industry SAS Users Group Conference, 2006 & 2007 ACKNOWLEDGMENTS The authors would like to thank everyone involved in the work described, both from B&D and SAS. BIOGRAPHY Peter Van Reusel, manager CRO services Peter Van Reusel is Business & Decision Life Sciences CDISC expert and representative in the CDISC bodies and teams as well as one of the few CDISC SDTM trainers providing the CDISC SDTM courses in Europe. Mark Lambrecht, SAS Institute consultant Mark Lambrecht, PhD, is Principal Consultant at SAS and promotes and implements SAS solutions for clinical data integration, genomics, statistics and reporting in the life sciences industry. 8

CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the authors at: Peter Van Reusel Business & Decision, Life Sciences Rue de la Révolution 8 B-1000 Brussels Belgium Office phone: +32 2 510 05 40 http://www.businessdecision-lifesciences.com Mark Lambrecht SAS Institute Hertenbergstraat 6 B-3080 Tervuren Belgium Office phone: +32 2 766 07 00 http://www.sas.com/industry/pharma/ Brand and product names are trademarks of their respective companies. 9