Using SAS ETL Studio to Create a CDISC-Compliant Clinical Data Warehouse Barry Cohen

Similar documents
Using SAS Data Integration Studio to Convert Clinical Trials Data to the CDISC SDTM Standard Barry R. Cohen, Octagon Research Solutions, Wayne, PA

Practical application of SAS Clinical Data Integration Server for conversion to SDTM data

Automate Data Integration Processes for Pharmaceutical Data Warehouse

USE CDISC SDTM AS A DATA MIDDLE-TIER TO STREAMLINE YOUR SAS INFRASTRUCTURE

Lessons on the Metadata Approach. Dave Iberson- Hurst 9 th April 2014 CDISC Euro Interchange 2014

How to easily convert clinical data to CDISC SDTM

Meta-programming in SAS Clinical Data Integration

Managing and Integrating Clinical Trial Data: A Challenge for Pharma and their CRO Partners

A white paper presented by: Barry Cohen Director, Clinical Data Strategies Octagon Research Solutions, Inc. Wayne, PA

Using the SAS XML Mapper and ODS PDF to create a PDF representation of the define.xml (that can be printed)

Overview of CDISC Implementation at PMDA. Yuki Ando Senior Scientist for Biostatistics Pharmaceuticals and Medical Devices Agency (PMDA)

Did you know? Accenture can deliver business outcome-focused results for your life sciences research & development organization like these:

4. Executive Summary of Part 1 FDA Overview of Current Environment

Implementation of SDTM in a pharma company with complete outsourcing strategy. Annamaria Muraro Helsinn Healthcare Lugano, Switzerland

Bridging Statistical Analysis Plan and ADaM Datasets and Metadata for Submission

UTILIZING CDISC STANDARDS TO DRIVE EFFICIENCIES WITH OPENCLINICA Mark Wheeldon CEO, Formedix Boston June 21, 2013

CDISC SDTM & Standard Reporting. One System

SAS CLINICAL TRAINING

Statistical Operations: The Other Half of Good Statistical Practice

Business & Decision Life Sciences

Training/Internship Brochure Advanced Clinical SAS Programming Full Time 6 months Program

Challenges of What, Why, and How of Clinical Metadata Beginner s Guide to Metadata

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

Understanding CDISC Basics

Using SAS in Clinical Research. Greg Nelson, ThotWave Technologies, LLC.

Clinical Research Innovation through Shared Clinical Data Warehousing

Gregory S. Nelson ThotWave Technologies, Cary, North Carolina

Accenture Accelerated R&D Services: CDISC Conversion Service Overview

Extracting the value of Standards: The Role of CDISC in a Pharmaceutical Research Strategy. Frank W. Rockhold, PhD* and Simon Bishop**

Alcohol Use Disorder Identification Test Self-Report Version (AUDIT-SR)

Clinical Data Management BPaaS Approach HCL Technologies

End-to-End E-Clinical Coverage with Oracle Health Sciences InForm GTM

ADaM or SDTM? A Comparison of Pooling Strategies for Integrated Analyses in the Age of CDISC

<Insert Picture Here> The Evolution Of Clinical Data Warehousing

ABSTRACT INTRODUCTION THE MAPPING FILE GENERAL INFORMATION

Managing Custom Data Standards in SAS Clinical Data Integration

Integrated Clinical Data with Oracle Life Sciences Applications. An Oracle White Paper October 2006

Trials and Tribulations of SDTM Trial Design

PharmaSUG Paper CD13

Data Conversion to SDTM: What Sponsors Can Do to Facilitate the Process

Current Status and Future Perspectives for Systemization of Clinical Study related the issues of CDISC in USA and other

PhUSE Paper CD13

Electronic Submission of Regulatory Information, and Creating an Electronic Platform for Enhanced Information Management

Implementing the CDISC standards into an existing CDMS

ADaM Implications from the CDER Data Standards Common Issues and SDTM Amendment 1 Documents Sandra Minjoe, Octagon Research Solutions, Wayne, PA

A Macro to Create Data Definition Documents

SDTM-ETL TM. The user-friendly ODM SDTM Mapping software package. Transforming operational clinical data into SDTM datasets is not an easy process.

Building and Customizing a CDISC Compliance and Data Quality Application Wayne Zhong, Accretion Softworks, Chester Springs, PA

CDER/CBER s Top 7 CDISC Standards Issues

eclinical Services Predictable Pricing Full Service EDC Phase I-IV Sophisticated Edit Checks Drug Supply Chain Forms Library Data Collection Services

WHITE PAPER. CONVERTING SDTM DATA TO ADaM DATA AND CREATING SUBMISSION READY SAFETY TABLES AND LISTINGS. SUCCESSFUL TRIALS THROUGH PROVEN SOLUTIONS

Organization Profile. IT Services

SDTM, ADaM and define.xml with OpenCDISC Matt Becker, PharmaNet/i3, Cary, NC

Introduction to the CDISC Standards

How To Write A Clinical Trial In Sas

Course: SAS BI(business intelligence) and DI(Data integration)training - Training Duration: 30 + Days. Take Away:

PK IN DRUG DEVELOPMENT. CDISC management of PK data. Matteo Rossini Milan, 9 February 2010

Udo Siegmann member of e3c, CDISC Sen. Dir. Acc. Management PAREXEL

Rationale and vision for E2E data standards: the need for a MDR

Practical Image Management for

MOVING THE CLINICAL ANALYTICAL ENVIRONMENT INTO THE CLOUD

«How we did it» Implementing CDISC LAB, ODM and SDTM in a Clinical Data Capture and Management System:

Use of Electronic Health Records in Clinical Research: Core Research Data Element Exchange Detailed Use Case April 23 rd, 2009

ectd Digital Handbook Table of Contents

Clinical Trial Data Integration: The Strategy, Benefits, and Logistics of Integrating Across a Compound

Best Practice in SAS programs validation. A Case Study

Guidance for Industry

SAS Drug Development Integration & PheedIt

Programme Guide PGDCDM

CDISC standards and data management The essential elements for Advanced Review with Electronic Data

Challenges and Opportunities in Clinical Trial Data Processing

11. Extension Potential Financial Benefits

Transforming CliniCal Trials: The ability to aggregate and Visualize Data Efficiently to make impactful Decisions

First in Life Sciences. First in Content Management. First in Cust omer Satisfaction

Synergizing global best practices in the CRO industry

CA Repository for z/os r7.2

Use of standards: can we really be analysis ready?

How to Use SDTM Definition and ADaM Specifications Documents. to Facilitate SAS Programming

GETTING BACK ON TRACK IN RECORD TIME: OPTIMIZING A VISUAL ANALYTICS PROGRAM AND PROCESS

MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy. Satish Krishnaswamy VP MDM Solutions - Teradata

An Introduction to Master Data Management (MDM)

AnalytiX MappingManager Big Data Edition

BRIDGing CDASH to SAS: How Harmonizing Clinical Trial and Healthcare Standards May Impact SAS Users Clinton W. Brownley, Cupertino, CA

SDTM AND ADaM: HANDS-ON SOLUTIONS

Enterprise Application Development in SharePoint 2010

Test Data Management Concepts

Sanofi-Aventis Experience Submitting SDTM & Janus Compliant Datasets* SDTM Validation Tools - Needs and Requirements

Technology-Driven Demand and e- Customer Relationship Management e-crm

PharmaSUG 2015 Paper SS10-SAS

ClinPlus. Report. Technology Consulting Outsourcing. Create high-quality statistical tables and listings. An industry-proven authoring tool

Optimizing Safety Surveillance During Clinical Trials Using Data Visualization Tools

Streamlining the drug development lifecycle with Adobe LiveCycle enterprise solutions

SAP Data Services 4.X. An Enterprise Information management Solution

Using Tableau Software with Hortonworks Data Platform

ABSTRACT INTRODUCTION. Paper RS08

SDTM-ETL 3.1 New Features

PhilaSUG Fall 2011 Meeting Thursday, October 27 th

Karnofsky Performance Status Scale (KPS Scale)

Integrating Data and Business Rules with a Control Data Set in SAS

Use of Metadata to Automate Data Flow and Reporting. Gregory Steffens Novartis PhUSE 13 June 2012

Transcription:

Octagon - 1 Using SAS ETL Studio to Create a CDISC-Compliant Clinical Data Warehouse Barry Cohen PhilaSUG Spring Meeting June 15, 2005

Octagon - 2 Background Octagon Research Solutions (1) E-publication of NDA documents (2) CDISC/SDTM expertise! Clinical Data Strategies dept: Migrate data to SDTM Clinical Data Warehouse & strategic uses Earlier w/o ETL Studio, now with it Building and refining processes Topical, hence present some thoughts

Octagon - 3 Overview CDISC/SDTM background Migration scenarios Migration process Two approaches to migration Why use ETL Studio?

Octagon - 4 CDISC / SDTM

Octagon - 5 CDISC SDTM (1) CDISC Clinical Data Interchange Standards Consortium SDTM Study Data Tabulation Model (the std) FDA issued guidance for SDTM use FDA desires SDTM data. Future: require SDTM FDA developing tools to process SDTM data, and build warehouse on this standard

Octagon - 6 CDISC SDTM (2) No tutorial about SDTM today Today: Process of migrating data to SDTM, and use of ETL Studio in the process SDTM info and training resources available through CDISC Published info on web site CDISC-sponsored training by certified orgs Octagon Research is certified

Octagon - 7 Migration Scenarios

Octagon - 8 Migration Scenarios (1) From what and where, to what (SDTM) and when Many scenarios, based on: Data mgmt history Data to be used for reporting & analysis Multiple in-house structures No in-house standard all dif. strucs In-house standard new mix std and dif. strucs Work done by CROs using own standards Many-to-one migration

Octagon - 9 Migration Scenarios (2) Multiple in-house structures Migrate to SDTM early: SDTM for reporting and analysis, per study SDTM to integrate data for ISS/ISE SDTM for submission Migrate to SDTM late: Various struc for reporting & analysis, per study Ad hoc integrate various struc for ISS/ISE (or SDTM for ISS/ISE) SDTM for submission

Octagon - 10 Migration Scenarios (3) Single in-house structure Stable standard used in-house CROs deliver in standard One-to-one migration May have some multiple-to-one cases, too

Octagon - 11 Migration Scenarios (4) Migrate to SDTM early: OperDB for collect/clean/manage SDTM for rpting and analysis, per CSR SDTM to integrate data for ISS/ISE rpting and analysis SDTM for submission Migrate to SDTM late: OperDB for collect/clean/manage OperDB for rpting and analysis, per CSR OperDB to integrate data for ISS/ISE rpting and analysis SDTM for submission

Octagon - 12 Migration Scenarios (5) Three-Levels Scenario Legacy data in various structures Building in-house std now (prospective use) Migrate legacy data to in-house std For current subm. (CSR, ISS) For future Clinical Data Warehouse Analysis and reporting from in-house std Migrate from in-house std to SDTM for subm. Many-to-one-to-one migration

Octagon - 13 Migration Process

Octagon - 14 Migration Process (1) Process: From source to target Source Clinical data in DB (Oracle, SAS) Multiple tables/datasets one per domain Data sets today Target Same Individual files, or integrated in warehouse

Octagon - 15 Migration Process (2) Top-level steps Build and maintain metadata for the target (std) Build metadata for each source Map source to target - specs Develop programs as per specs Execute programs Integrate std data in warehouse

Octagon - 16 Migration Process (3) Build and maintain metadata for the target (std) Tables and structure Items and attributes (name, label, type, length) Item controlled content (terminology) Items display format Study CRFs annotated to SDTM

Octagon - 17 Migration Process (4) Build metadata for each source Same pieces as for target (except controlled terms) Study CRFs annotated to source

Octagon - 18 Migration Process (5) Map source to target - specs what goes to what and how (rules) not pgms Transformations: One-to-one, no other transform To single table, single item, w transform To single table, multiple items To multiple tables, multiple items

Octagon - 19 Migration Process (6) Develop programs as per specs Programming task Programs may be written, else auto generated by ETL tool Execute Programs/Job Test and Production Error check / resolve Manage production jobs (logs, audits)

Octagon - 20 Migration Process (7) Integrate std data in warehouse Another (final) step in the migration process. The Load step. Immediate: For ISS/ISE reporting, if reporting from SDTM data ( Submission-level ) Long-term: Multiple studies, subm s, and compounds stored together for analysis and reporting ( Company-level )

Octagon - 21 Two Approaches Build process/programs w/o ETL tool Build process/programs w ETL tool (ETL-Studio)

Octagon - 22 Without ETL Tool

Octagon - 23 Without ETL Tool (1) Build and maintain metadata for SDTM (target) In Excel SS s or SAS data sets or other CDISC provides SDTM metadata in Excel SS Annotate study CRFs to SDTM Build metadata for each source In Excel SS s or SAS data sets or other Annotate study CRFs to source

Octagon - 24 Without ETL Tool (2) Map source to target - specs Source-centric SS s - one row per source item SrcDataSet.Varname to SdtmDataSet.Varname, plus how rules Done by clinical/sdtm-aware person Resources: Annotated CRFs (to source, to target) Metadata SS s SDTM Implementation Guide

Octagon - 25 Without ETL Tool (3) Develop programs as per specs SAS programmer, using map specs SS Levels of sophistication Fully ad hoc Table-driven, auto-generated program, for oneto-one process, using: Map specs SS; Source & target metadata (Excel, SAS) Macros for common transformations referenced in map specs SS

Octagon - 26 Without ETL Tool (4) Levels of sophistication Table-driven full job, not just for indv ( one-toone ) processes: e.g., transpose, sort, validate, load Notes: Plus ad hoc code for remainder of program(s) May be building robust application

Octagon - 27 Without ETL Tool (5) Execute programs/job Controlled by programmer, unless robust application developed Integrate std data in warehouse Last program in the process Submission-level, company-level

Octagon - 28 With ETL Studio

Octagon - 29 With ETL Studio (1) Build and maintain metadata for SDTM (target) Stored in the target metadata library CDISC provides SDTM metadata in Excel SS SAS provides a load process Annotate study CRFs to SDTM Build metadata for each source Stored in the source metadata library Build by loading the source SAS dataset Annotate study CRFs to source

Octagon - 30 With ETL Studio (2) Map source to target - specs Same process as w/o ETL Studio Map specs Excel SS s Done by clinical/sdtm-aware person Resources: Annotated CRFs (to source, to target) Metadata SS s SDTM Implementation Guide

Octagon - 31 With ETL Studio (3) Develop programs as per specs Work shared by ETL developer and SAS pgmr ETL Developer Job comprised of processes Process Design wizard for full job Process Properties wndw to define ea. process Process ~ SAS data step(s) Code for job is auto-generated Uses map specs SS (CRFs as needed)

Octagon - 32 With ETL Studio (4) Process might have: Map, Transform, Sort, Transpose, Join, Validate Auto-generated code, or user-written code SAS programmer: Develop code and load to ETL-S, to be engaged by ETL developer Ad hoc: one-time transformations Macros: repeating transformations

Octagon - 33 ETL Studio Screen Shots

Octagon - 34 With ETL Studio (6) Execute programs/process Run by ETL developer Using ETL Studio job scheduler Define method of error reporting (rpt, table, msg) Integrate std data in warehouse Last ad hoc program in the process Submission-level, company-level

Octagon - 35 Why Use ETL Studio?

Octagon - 36 Why use ETL-Studio? (1) Central metadata uses SAS Metadata Server SDTM metadata Other metadata (eg, libraries, servers) Available for all processes in larger workflow Available for all SAS products in larger workflow Enterprise Guide Web Report Studio Valuable for Clinical Data Warehouse analysis and reporting

Octagon - 37 Why use ETL-Studio? (2) Programmer Perspective Code generation, minimize code written Metadata management Documentation of indv processes and full job Flexible Staffing (Clinical/SDTM-aware staff does map specs SS) ETL Developer builds and executes jobs SAS programmer generates utility programs Other code by ETL-S; Not managing jobs

Octagon - 38 Why use ETL-Studio? (3) Change Management For collaborative work Valuable: Large projects! Multiple developers! Update the same resources Project Management Organized, structured, managed env. for data, programs, and processes

Octagon - 39 Why use ETL-Studio? (4) Impact Analysis Easy to assess change Valuable: Migrations can be large projects Deploy Jobs to SAS Drug Development Ability to do this now Improvement planned

Octagon - 40 Why use ETL-Studio? (5) SDTM Metadata loader Upon request Easy Re-use of Work Process and job metadata Leverage workflow for: New sources New SDTM domains Startpoint for in-house standard

Octagon - 41 Questions

Octagon - 42 Contact Information Barry Cohen (bcohen@octagonresearch.com) 610.535.6500 x635 585 East Swedesford Road, Suite 200 Wayne, PA 19087