Data Conversion to SDTM: What Sponsors Can Do to Facilitate the Process

Similar documents
A white paper presented by: Barry Cohen Director, Clinical Data Strategies Octagon Research Solutions, Inc. Wayne, PA

ADaM Implications from the CDER Data Standards Common Issues and SDTM Amendment 1 Documents Sandra Minjoe, Octagon Research Solutions, Wayne, PA

PK IN DRUG DEVELOPMENT. CDISC management of PK data. Matteo Rossini Milan, 9 February 2010

Metadata Submission Guidelines Appendix to the Study Data Tabulation Model Implementation Guide

Lessons on the Metadata Approach. Dave Iberson- Hurst 9 th April 2014 CDISC Euro Interchange 2014

Using the SAS XML Mapper and ODS PDF to create a PDF representation of the define.xml (that can be printed)

SDTM AND ADaM: HANDS-ON SOLUTIONS

A Brief Introduc/on to CDISC SDTM and Data Mapping

Using SAS Data Integration Studio to Convert Clinical Trials Data to the CDISC SDTM Standard Barry R. Cohen, Octagon Research Solutions, Wayne, PA

Copyright 2012, SAS Institute Inc. All rights reserved. VISUALIZATION OF STANDARD TLFS FOR CLINICAL TRIAL DATA ANALYSIS

Training/Internship Brochure Advanced Clinical SAS Programming Full Time 6 months Program

Bridging Statistical Analysis Plan and ADaM Datasets and Metadata for Submission

How to build ADaM from SDTM: A real case study

STUDY DATA TECHNICAL CONFORMANCE GUIDE

Automate Data Integration Processes for Pharmaceutical Data Warehouse

CDISC Roadmap Outline: Further development and convergence of SDTM, ODM & Co

USE CDISC SDTM AS A DATA MIDDLE-TIER TO STREAMLINE YOUR SAS INFRASTRUCTURE

Current Status and Future Perspectives for Systemization of Clinical Study related the issues of CDISC in USA and other

PharmaSUG2010 HW06. Insights into ADaM. Matthew Becker, PharmaNet, Cary, NC, United States

PharmaSUG Paper DS15

PhUSE Paper CD13

Implementation of SDTM in a pharma company with complete outsourcing strategy. Annamaria Muraro Helsinn Healthcare Lugano, Switzerland

Understanding CDISC Basics

Business & Decision Life Sciences CDISC Workshop: From SDTM to ADaM: Mapping Methodologies

STUDY DATA TECHNICAL CONFORMANCE GUIDE

STUDY DATA TECHNICAL CONFORMANCE GUIDE

Trials and Tribulations of SDTM Trial Design

SDTM-ETL TM. The user-friendly ODM SDTM Mapping software package. Transforming operational clinical data into SDTM datasets is not an easy process.

SAS CLINICAL TRAINING

The CDISC Study Data Tabulation Model (SDTM): History, Perspective, and Basics Fred Wood Principal Consultant, Octagon Research Solutions

Statistical Operations: The Other Half of Good Statistical Practice

Section 1 Project Management, Project Communication/Process Design, Mgmt, Documentation, Definition & Scope /CRO-Sponsor Partnership

Analysis Data Model (ADaM)

Introduction to the CDISC Standards

Pharmaceutical Applications

Practical application of SAS Clinical Data Integration Server for conversion to SDTM data

Clinical Data Management Overview

Einführung in die CDISC Standards CDISC Standards around the World. Bron Kisler (CDISC) & Andrea Rauch DVMD Tagung

Use of standards: can we really be analysis ready?

Business & Decision Life Sciences

4. Executive Summary of Part 1 FDA Overview of Current Environment

Data Standards in Clinical Trials, A Regulatory Perspec9ve

PharmaSUG Paper DS02

Overview of CDISC Implementation at PMDA. Yuki Ando Senior Scientist for Biostatistics Pharmaceuticals and Medical Devices Agency (PMDA)

New features in SDTM-ETL v SDTM-ETL TM. New Features in version 1.2

SDTM Validation: Methodologies and Tools

ADaM or SDTM? A Comparison of Pooling Strategies for Integrated Analyses in the Age of CDISC

Clinical Data Management (Process and practical guide) Dr Nguyen Thi My Huong WHO/RHR/RCP/SIS

Business & Decision Life Sciences What s new in ADaM

Implementing the CDISC standards into an existing CDMS

CDISC SDTM/ADaM Pilot Project 1 Project Report

Clinical Data Management (Process and practical guide) Nguyen Thi My Huong, MD. PhD WHO/RHR/SIS

ABSTRACT INTRODUCTION PATIENT PROFILES SESUG Paper PH-07

Providing Regulatory Submissions In Electronic Format Standardized Study Data

CDISC Journal. Using CDISC ODM to Migrate Data. By Alan Yeomans. Abstract. Setting the Scene

PharmaSUG Paper HS01. CDASH Standards for Medical Device Trials: CRF Analysis. Parag Shiralkar eclinical Solutions, a Division of Eliassen Group

A Macro to Create Data Definition Documents

WHITE PAPER. CONVERTING SDTM DATA TO ADaM DATA AND CREATING SUBMISSION READY SAFETY TABLES AND LISTINGS. SUCCESSFUL TRIALS THROUGH PROVEN SOLUTIONS

Analysis Data Model (ADaM) Implementation Guide

CDER/CBER s Top 7 CDISC Standards Issues

How to easily convert clinical data to CDISC SDTM

CDISC SDTM & Standard Reporting. One System

Guidance for Industry

QUALITY CONTROL AND QUALITY ASSURANCE IN CLINICAL RESEARCH

Clinical Trial Data Integration: The Strategy, Benefits, and Logistics of Integrating Across a Compound

The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Use of Metadata to Automate Data Flow and Reporting. Gregory Steffens Novartis PhUSE 13 June 2012

From Validating Clinical Trial Data Reporting with SAS. Full book available for purchase here.

Electronic Submission of Regulatory Information, and Creating an Electronic Platform for Enhanced Information Management

Needs, Providing Solutions

Organization Profile. IT Services

Guidance for Industry

Study Data Tabulation Model Version 1.4

Alcohol Use Disorder Identification Test Self-Report Version (AUDIT-SR)

Accenture Accelerated R&D Services: CDISC Conversion Service Overview

Udo Siegmann member of e3c, CDISC Sen. Dir. Acc. Management PAREXEL

Did you know? Accenture can deliver business outcome-focused results for your life sciences research & development organization like these:

Building and Customizing a CDISC Compliance and Data Quality Application Wayne Zhong, Accretion Softworks, Chester Springs, PA

PharmaSUG Paper DS07

Challenges and Opportunities in Clinical Trial Data Processing

ABSTRACT INTRODUCTION THE MAPPING FILE GENERAL INFORMATION

Subject: No. Page PROTOCOL AND CASE REPORT FORM DEVELOPMENT AND REVIEW Standard Operating Procedure

Managing Clinical Trials Data using SAS Software

Clinical Data Management BPaaS Approach HCL Technologies

ABSTRACT INTRODUCTION WINDOWS SERVER VS WINDOWS WORKSTATION. Paper FC02

REx: An Automated System for Extracting Clinical Trial Data from Oracle to SAS

PharmaSUG Paper AD08

SDTM-ETL 3.1 New Features

Analysis Data Model: Version 2.0

CDISC standards and data management The essential elements for Advanced Review with Electronic Data

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

PharmaSUG2010 Paper CD04 CD04

Date/Time Stamped Files and Audit Trails: What Part 11 Compliant SAS Systems are Made of. Carolyn Dougherty, ViroPharma Incorporated, Exton, PA

XClinical offers an integrated range of software products for CROs, pharmaceutical, medical device and biopharmaceutical companies.

MAKE THE SWITCH TO ELECTRONIC DATA CAPTURE

SAS Drug Development User Connections Conference 23-24Jan08

.CRF. Electronic Data Capture and Workflow System for Clinical Trials

Transcription:

Data Conversion to SDTM: What Sponsors Can Do to Facilitate the Process Fred Wood VP, Data Standards Consulting Octagon Research Solutions CDISC U.S. Interchange Baltimore, MD November 2009 1

Outline Background The Data Supporting Documentation and Metadata Other Considerations Questions to Ask Potential Partners 2

Background: Interest in SDTM SDTM added as a Study Data Specification in the ectd Guidance (July 2004) Withdrawal of Guidances for electronic submissions (September 29, 2006) ectd as the preferred format for electronic submissions As of January 1, 2008, any electronic submission going to CDER must be ectd. Announcements of a forthcoming Notice of Proposed Rulemaking (NPRM) (Dec 2006, Apr 2007, Dec 2007, May 2008, Nov 2008) SDTM would be required Mention of the SDTM in the PDUFA IV IT Plan (May 2008) SDTM is the foundation for standardized clinical content. 3

Background: Need for Data Conversion Some reasons data may have been collected, stored, and extracted in a non-sdtm-compliant format: To facilitate collection, analysis, and/or reporting Studies began before the SDTM specification Submission of electronic data was not anticipated End-stage conversion is part of the clinical data-flow process 4

Background: The Challenges (1) Converting to SDTM-compliant datasets will present challenges, even for company with well-established and followed processes for standards governance. These challenges become greater, or course, for companies who did not maintain or enforce data standards. 5

Background: The Challenges (2) Challenges include the following: Using standard variable names Adopting industry-wide controlled terminology Adding SDTM-required Sequence Numbers (--SEQ variables) Representing SDTM Findings data in a vertical, normalized format with added baseline flags Representing non-standard variables in separate Supplemental Qualifiers datasets Representing foreign keys in a separate relationship (RELREC) dataset Creating the Trial Design datasets Creating the special-purpose Subject Elements and Subject Visits datasets. 6

The Data Tabulation Data in Electronic Format Complete and Clean Minimize Number of Splits, Merges, Transformations Define Merge Keys for Datasets Needing to Be Merged Determination of Study Anchors Have Standard Data-Transfer Specifications Identify Foreign-Key Relationships Avoid Splitting Data Across Variables 7

The Data: Tabulation Data, and in Electronic Format Tabulation Data A component of the Case Report Tabulations (CRT) separate from Patient Profiles, Data Listings, and Analysis Datasets Data Collected on CRFs Electronic Format SAS datasets are preferred Excel, CSV (comma-separated values), or tab-delimited files also work The creation of SDTM-compliant datasets from paper or PDF files will affect timing and conversion costs 8

The Data: Complete and Clean Data Cleaning Sponsor s responsibility Data integrity and consistency Passing SDTM compliance checks at FDA Should occur prior to database lock May not be possible during conversion Start date after the end date If possible during conversion, will be time consuming Standard unit used in consistently for each test or measurement Possible need to unlock database so that changes can be reflected in audit trail 9

The Data: Minimize Number of Splits, Merges, Transformations (1) Issues more often seen with vendor-supplied data than with sponsor-generated data. Real-life Case 1: Main source PK dataset included twenty-five nominal-time concentrations and twelve PK parameters per record Actual date/times for sampling were in a separate dataset. Actions needed Split main dataset into PC and PK Transform the two resulting datasets Merge actual times onto PC dataset 10

The Data: Minimize Number of Splits, Merges, Transformations (2) Issues more often seen with vendor-supplied data than with sponsor-generated data. Real-life Case 2: PK data supplied as one Excel spreadsheet per timeconcentration curve per subject Actions needed Combine all data in individual-subject spreadsheets Convert to SAS Transform dataset 11

The Data: Define Merge Keys for Datasets Needing to Be Merged The SDTM defines a number of datasets that that might have existed as separate legacy datasets. A specimen dataset and a lab results dataset. The appropriate merge keys need to be present and defined. Without these, merging lab results with the correct specimen may: Require considerable detective work Consume costly resources Be impossible Real-life case: Specimen dataset has a date with no visit. Lab dataset has visit but no date Dates were not collected for all visits 12

The Data: Determination of Study Anchors RFSTDTC and RFENDTC Basis for Study Days (--DY) across all domains Do not expect study-day derivations to be different in different domains --TPTREF Reference point for PK and PD data Each concentration or effect vs. time curve should have one There should be an associated and matching --RFTDTC If this is a dose, there should be an Exposure record --STTPT and --ENTPT Sponsor defined, but actually determined by protocol and CRF 13

The Data: Have Standard Data-Transfer Specifications There should be a single data-transfer specification for the same data for the entire study Across all study sites, subjects, visits, or labs, and vendors. It should include both structure and content Lab test names and codes Specimen naming A unit for each test Normal ranges Real-life case Units for glucose coming in as mg/dl, mg/l, and mmol/l Normal ranges in a separate file, needing to merged by age and sex 14

The Data: Identify Foreign- Key Relationships Often seen with the entry of an adverse-event line number on a concomitant medications record. Need to be clearly identified by sponsor. The SDTM and SDTMIG specify the use of the RELREC dataset for representing these record-to-record relationships in a consistent manner. Some amount of programming will be likely required to create RELREC from the collected information. 15

The Data: Avoid Splitting Data Across Variables The problem often occurs as a result of poor CRF design, poor instructions, and/or lack of adequate data cleaning. Frequently seen with concomitant medication data. Numeric dose information may be present in different fields for different subjects or medications. In a numeric variable equivalent to the SDTM-based CMDOSE Concatenated with the units in a variable equivalent to the SDTMbased CMDOSTXT (character) In a variable equivalent to the SDTM-based CMDOSTOT (numeric total daily dose) A record-level mapping of data may be costly in terms of time and money. 16

Supporting Documentation and Metadata: Data Definitions Data Definitions Annotated CRF Protocol Format Catalogs 17

Supporting Documentation and Metadata: Providing it Early Not all datasets or studies are equal in terms of conversion effort. Early communication results in more accurate assessment of timing and cost. Examples of useful information: Data definition files Zero-observation datasets CRF books 18

Supporting Documentation and Metadata: Data Definition File Complete and proper source metadata are critical in communicating the nature of the data to be converted. Ideally, the metadata will be provided in a computer-readable format. When provided, the dataset-level and variable-level metadata should be consistent with the data in the datasets. Include variable names, variable labels, data types, data formats, and data origin. 19

Supporting Documentation and Metadata: Annotated CRF Having a high-quality acrf facilitates the conversion process by representing how the data were collected. Major attributes of an acrf that will affect the efficiency of data conversion: Completeness Searchability Annotated Electronically Annotated by a Knowledgeable Person at the Sponsor Company 20

Supporting Documentation and Metadata: Protocol It serves as the background for understanding the data that were collected on the CRFs. Ideally, it should be: Complete, including all amendments In a searchable electronic format Octagon has seen numerous instances where the CRF and the protocol are not in agreement, and the CRF cannot be relied upon to represent the protocol. 21

Supporting Documentation and Metadata: Format Catalogs Data collected using codes must be converted to data containing codelists decodes to meet SDTMIG requirements. The sponsor should provide the conversion partner with the format catalog for all codelists. Codes may be helpful in performing analysis of the data, and may exist in an operational dataset, but would not be part of the SDTM submission. 22

Other Considerations Dictionary Coding Submission of Screen Failures Trial Design Sponsor Points of Contact Minimize Creativity 23

Other Considerations: Dictionary Coding All data within a study should be coded to the same version, and not a mix of versions. Sponsors may need to decide whether to: Change to a different dictionary Upgrade to a newer version of the current dictionary The SDTMIG provides no guidance for handling integrated databases, so sponsors should consult with the FDA review division to determine the best practice. 24

Other Considerations: Submission of Screen Failures Discussions with the review division should determine whether any data collected for screen failures should be submitted (regardless of the submission format). If screen failures are to be submitted, then these subjects should be clearly identified in the source data. Ideally, these subjects will : Have at least one documented inclusion or exclusion exception Not have a record(s) indicating that the subject was randomized or treated. Octagon has seen cases where screen failures were randomized and or treated. This is challenging. 25

Other Considerations: Trial Design Submission of TD tables should be discussed with FDA ahead of time. Retrospective creation can take considerably longer than the creation of safety-domain datasets. The rules for creating the start and end of treatment periods and visits are critical: Ability to know what treatment a subject was on at any point in the trial Ability to know at what visit any data was collected Rules are often based upon other study anchors. If these anchors do not have collected times, there will be problems. Real-life case: The protocol states that a visit begins with admission to the study site, but the CRF does not collect date and time of admission to the site. 26

Other Considerations: Sponsor Points of Contact The sponsor should have a representative who is familiar with the protocol and the data. The sponsor should also have an individual who is: capable of making decisions empowered to gather together internal experts to make a decision knows when to do each of these. Ideally, the roles of expert and decision maker would belong to individuals who are well coordinated or to a single point of contact. 27

Other Considerations: Minimize Creativity Faulty beliefs: Data standards stifle creativity The CRF is a vehicle for expressing creativity Conversion partner may be relegated to creating mapping specifications: For each study, even when the primary and secondary endpoints are the same To a level as low as each visit when the same information is collected differently at each visit within a study 28

Questions to Ask Potential Partners (1) How much experience (i.e., number of studies or submissions) do they have working with data: Not in electronic format and/or not tabulation data Requiring splits, transformations, and merges, sometimes without the merge keys being clear Requiring conditional mapping (record dependent) Having no or limited metadata, acrfs, or protocol How much experience do they have creating: Supplemental Qualifiers RELREC from foreign-key variables Trial Design datasets at both the trial and subject level Custom domains not modeled in the SDTMIG 29

Questions to Ask Potential Partners (2) How much experience do they have: Working with sponsors who might have limited knowledge of their data, as well its origins Running and interpreting the output from automated SDTMcompliance checks What are the qualifications of the personnel: Number of years of SDTM conversion experience Level of involvement on the CDISC SDS Team, which developed the SDTM Number of studies and datasets converted Number of complex datasets converted Therapeutic-area breadth and depth Functional breadth and depth (e.g., data management, PK, programming, statistics) 30

Conclusions A successful data conversion is dependent upon several key factors: the quality of the data the quality and extent of the supporting documentation an understanding of the study by someone at the sponsor company The experience of the conversion partner 31