Balancing Big Data for Security, Collaboration and Performance
|
|
|
- Stewart Williamson
- 10 years ago
- Views:
Transcription
1 Balancing Big Data for Security, Collaboration and Performance Sai Balu Lineberger Cancer Center UNC Chapel Hill Oct 14, 2014
2 About UNC Oldest Public University Top 5 Public University. 46th World Wide Clinical Translational Science Award NCTraCS Institute Carolina Data Warehouse - Hospital/Research School of Medicine - 6th in NIH funding
3 About Lineberger NCI Designated Comprehensive Cancer Center Largest Research Entity at UNC - $190 million/year in external grants 300 Scientists, 1200 Staff across UNC Campus 250 Clinical Trials offered NC Cancer Hospital : Clinical Home University Cancer Research Fund - $25 million in 2007 and $42 million/year in 2014
4 About UNC Hospitals Not-for-Profit Integrated Health Care Teaching Mission State of the Art Patient Care EMR and Cancer Registry WebCIS Epic
5 About RENCI A Leader in Cyber Technologies Scientific Discoveries & Business Innovations Medicine & Genomics Environmental Sciences Data Management Technologies: irods
6 Bioinformatics Core at Lineberger Infrastructure for Data Management and Data Analysis Integrated Data Analysis - Genomic & Clinical & public annotations Supporting Instruments
7
8 Big Data Velocity The rate of data generation, rate of change Volume The size of data Variety Under represented of the Vs but not Today!
9 TCGA The Cancer Genome Atlas Project Study Molecular Basis for Cancer 20+ tumor types studied Expression, Copy Number, DNA/RNA, mirna UNC is Gene Expression Center Dr. Chuck Perou 10K samples processed
10 TCGA Analysis Tumor Working Groups & Data Freezes Exposure to Variety Types of data, Security, Sources, Performance, Sharing, Analysis.
11 UNC Cancer Survivorship Goal: Enroll 10K Patients! Collect Biospecimens, medical records and follow-up with questionnaires
12
13 UNCseq Genetic Profiling Cancer Patient Specimens Support Treatment Decisions Target ~200 genes of potential clinical utility All known druggable targets Genes of interest confirmed by experts
14
15 Big Data - Variety! 1. Clinic Schedules 7. Public data - Clinical Trials, Oncotator, Death Indexes 2. ICD codes 8. Ancillary Studies 3. Consent Status 9. Workflows 4. Tissue Banking and Annotations 10. Metadata 5. Questionnaires - 2 different languages 11. Analysis - exome, survival, spatial. 6. EMR - Pathology as an example 12. Instruments - robots, sequencing - sequonom, snp arrays
16 Big Data - Variety! Variety of Sources Epic, SAS-Health Outcome Analytics, Death Indexes Variety of Security Public Data to CLIA to FISMA compliance Variety of Standards +1 standards
17 SAS - HOA Private partnership to create Cancer Data Mart Patient Counts - 155,078 Pathology Report Types - 33 Pathology Report Datapoints - 21,347,023 Lab Tests - 387,495 Lab Test Observations - 34,168,986
18 Security, Collaboration and Performance Balancing is an art Institutional Policies Develop Trust Develop standard verification processes Develop Training materials
19 Security HIPAA Sensitive Data FISMA Moderate Claims Data Secure Medical Workspaces Secure Cluster Computing
20
21 Performance Sustained 15Gbs/sec over the network for many hours - largest network traffic seen within UNC campus Transferring to Data Coordination Centers - Bit Torrent Style software
22 Collaboration Through Data Sharing Without Duplication with different ACLs Bring Compute to Data irods - A Possible Solution
23 Data Governance Identify Stewards Identify Custodians Identify Users Develop Policies Create Workgroups
24 Acknowledgement UNCseq Team Health Registry Team TCGA at UNC Team DDN Thank you!
Cancer Genomics: What Does It Mean for You?
Cancer Genomics: What Does It Mean for You? The Connection Between Cancer and DNA One person dies from cancer each minute in the United States. That s 1,500 deaths each day. As the population ages, this
The National Consortium for Data Science (NCDS)
The National Consortium for Data Science (NCDS) A Public-Private Partnership to Advance Data Science Ashok Krishnamurthy PhD Deputy Director, RENCI University of North Carolina, Chapel Hill What is NCDS?
Comprehensive Data Resource Introduction
National Cancer Institute U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health Comprehensive Data Resource Introduction Helen Moore and Ping Guan Biorepositories and Biospecimen Research
How Can Institutions Foster OMICS Research While Protecting Patients?
IOM Workshop on the Review of Omics-Based Tests for Predicting Patient Outcomes in Clinical Trials How Can Institutions Foster OMICS Research While Protecting Patients? E. Albert Reece, MD, PhD, MBA Vice
Managing Next Generation Sequencing Data with irods
Managing Next Generation Sequencing Data with irods Presented by Dan Bedard // [email protected] at the 9 th International Conference on Genomics Shenzhen, China September 12, 2014 Managing NGS Data with
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives Chalapathy Neti, Ph.D. Associate Director, Healthcare Transformation, Shahram Ebadollahi, Ph.D. Research Staff Memeber IBM Research,
Integrating a Research Management System & EMR: Motivations and Benefits Host
Integrating a Research Management System & EMR: Motivations and Benefits Host Lisa Haddican, Content Marketing Specialist Specialized Solutions by Forte Developed with the expertise of 20,000 trials Clinical
High Performance Computing Initiatives
High Performance Computing Initiatives Eric Stahlberg September 1, 2015 DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Cancer Institute Frederick National Laboratory is
A leader in the development and application of information technology to prevent and treat disease.
A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today
Personalized Medicine: Humanity s Ultimate Big Data Challenge. Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences
Personalized Medicine: Humanity s Ultimate Big Data Challenge Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences 2012 Oracle Corporation Proprietary and Confidential 2 3 Humanity
irods for Big Data Management in Research Driven Organizations Charles Schmitt CTO & Director of Informatics RENCI
irods for Big Data Management in Research Driven Organizations Charles Schmitt CTO & Director of Informatics RENCI Acknowledgements Presented work funded in part by grants from NIH, NSF, NARA, DHS, as
Opportunities and Limitations of Big Data to Address Diversity. Shawn Murphy MD, Ph.D.
Opportunities and Limitations of Big Data to Address Diversity Shawn Murphy MD, Ph.D. The Research Data Warehouse at Partners Healthcare Partners Enterprise Research Patient Data Registry Multiple Systems
School of Nursing. Presented by Yvette Conley, PhD
Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression
Nazneen Aziz, PhD. Director, Molecular Medicine Transformation Program Office
2013 Laboratory Accreditation Program Audioconferences and Webinars Implementing Next Generation Sequencing (NGS) as a Clinical Tool in the Laboratory Nazneen Aziz, PhD Director, Molecular Medicine Transformation
INTRODUCTION TO THE DATAVERSE NETWORK
INTRODUCTION TO THE DATAVERSE NETWORK JANUARY 7, 2015 Jonathan Crabtree Assistant Director of Computing and Archival Research THE ODUM INSTITUTE FOR RESEARCH IN SOCIAL SCIENCE 228 DAVIS LIBRARY, CB# 3355
Transla6ng from Clinical Care to Research: Integra6ng i2b2 and OpenClinica
Transla6ng from Clinical Care to : Integra6ng i2b2 and OpenClinica Aaron Abend Managing Director, Recombinant Data Corp May 13, 2011 Copyright 2011 Recombinant Data Corp. All rights reserved. 1 About Recombinant
Opportunities and Challenges in Translating Novel Discoveries into Useful Clinical Tests
Opportunities and Challenges in Translating Novel Discoveries into Useful Clinical Tests James H. Doroshow, M.D. NCI Deputy Director for Clinical and Translational Research NCI Workshop: Evidence Needed
BioGrid s use of Business Analytics for Collaborative Medical Research. Maureen Turner, CEO, BioGrid Australia
BioGrid s use of Business Analytics for Collaborative Medical Research Maureen Turner, CEO, BioGrid Australia Overview Data sharing considerations BioGrid a collaborative model Data ethics, privacy, security
Big Data Visualization for Genomics. Luca Vezzadini Kairos3D
Big Data Visualization for Genomics Luca Vezzadini Kairos3D Why GenomeCruzer? The amount of data for DNA sequencing is growing Modern hardware produces billions of values per sample Scientists need to
IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS
IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS 29 OCTOBER 2015 DR. DIRK J. EVERS BACKGROUND TreatmentMAP
TRACKS GENETIC EPIDEMIOLOGY
Dr. Priya Duggal, Director In the post-genomic era where larger amounts of genetic data are now readily available, it has become increasingly important to design studies and use analytical techniques that
68 th Meeting of the National Cancer Institute (NCI) NCI Council of Research Advocates (NCRA) National Institutes of Health (NIH)
68 th Meeting of the National Cancer Institute (NCI) NCI Council of Research Advocates (NCRA) National Institutes of Health (NIH) Updates on NCI Programs Building 31, C Wing, Conference Room 6 NIH Campus
Fast. Integrated Genome Browser & DAS. Easy. Flexible. Free. bioviz.org/igb
bioviz.org/igb Integrated Genome Browser & DAS Free tools for visualizing, sharing, and publishing genomes and genome-scale data. Easy Flexible Fast Free Funding: National Science Foundation Arabidopsis
Big data in cancer research : DNA sequencing and personalised medicine
Big in cancer research : DNA sequencing and personalised medicine Philippe Hupé Conférence BIGDATA 04/04/2013 1 - Titre de la présentation - nom du département émetteur et/ ou rédacteur - 00/00/2005 Deciphering
m 4 Biobank Alliance & m 4 Trial Service Center
m 4 Biobank Alliance & m 4 Trial Service Center Services & Consulting in (non-)clinical development www.m4.de m 4 Biobank Alliance Central Access to High Quality Human Biospecimens for R&D The m 4 Biobank
ALCHEMIST (Adjuvant Lung Cancer Enrichment Marker Identification and Sequencing Trials)
ALCHEMIST (Adjuvant Lung Cancer Enrichment Marker Identification and Sequencing Trials) 3 Integrated Trials Testing Targeted Therapy in Early Stage Lung Cancer Part of NCI s Precision Medicine Effort in
Next Generation Sequencing: Technology, Mapping, and Analysis
Next Generation Sequencing: Technology, Mapping, and Analysis Gary Benson Computer Science, Biology, Bioinformatics Boston University [email protected] http://tandem.bu.edu/ The Human Genome Project took
Clinical Trials: Questions and Answers
Clinical Trials: Questions and Answers Key Points Clinical trials are research studies that test how well new medical approaches work in people (see Question 1). Every clinical trial has a protocol, which
Worldwide Collaborations in Molecular Profiling
Worldwide Collaborations in Molecular Profiling Lillian L. Siu, MD Director, Phase I Program and Cancer Genomics Program Princess Margaret Cancer Centre Lillian Siu, MD Contracted Research: Novartis, Pfizer,
Automated and Scalable Data Management System for Genome Sequencing Data
Automated and Scalable Data Management System for Genome Sequencing Data Michael Mueller NIHR Imperial BRC Informatics Facility Faculty of Medicine Hammersmith Hospital Campus Continuously falling costs
Testimony of. Paul Misener Vice President for Global Public Policy, Amazon.com. Before the
Testimony of Paul Misener Vice President for Global Public Policy, Before the United States House of Representatives Committee on Energy and Commerce Subcommittee on Communications and Technology Subcommittee
Big Data and the Data Lake. February 2015
Big Data and the Data Lake February 2015 My Vision: Our Mission Data Intelligence is a broad term that describes the real, meaningful insights that can be extracted from your data truths that you can act
Genomic Medicine The Future of Cancer Care. Shayma Master Kazmi, M.D. Medical Oncology/Hematology Cancer Treatment Centers of America
Genomic Medicine The Future of Cancer Care Shayma Master Kazmi, M.D. Medical Oncology/Hematology Cancer Treatment Centers of America Personalized Medicine Personalized health care is a broad term for interventions
Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
NIH s Genomic Data Sharing Policy
NIH s Genomic Data Sharing Policy 2 Benefits of Data Sharing Enables data generated from one study to be used to explore a wide range of additional research questions Increases statistical power and scientific
Digital Health: Catapulting Personalised Medicine Forward STRATIFIED MEDICINE
Digital Health: Catapulting Personalised Medicine Forward STRATIFIED MEDICINE CRUK Stratified Medicine Initiative Somatic mutation testing for prediction of treatment response in patients with solid tumours:
SAP Healthcare Analytics Solutions Provide physicians and researchers access to patient data from various systems in realtime
SAP Healthcare Analytics Solutions Provide physicians and researchers access to patient data from various systems in realtime Stephan Schindewolf, SAP SE, July 13, 2015 Facts per Decision Need Decision
Big Data for Population Health
Big Data for Population Health Prof Martin Landray Nuffield Department of Population Health Deputy Director, Big Data Institute, Li Ka Shing Centre for Health Information and Discovery University of Oxford
ARIA ONCOLOGY INFORMATION SYSTEM RADIATION ONCOLOGY
ARIA ONCOLOGY INFORMATION SYSTEM RADIATION ONCOLOGY The ARIA oncology information system is a powerful information and image management solution designed to support the specific clinical needs of oncology
IBCSG Tissue Bank Policy
THE INTERNATIONAL BREAST CANCER STUDY GROUP IBCSG Tissue Bank Policy Accepted by the IBCSG Executive Committee on March 24, 2006 Accepted by the IBCSG Ethics Committee on March 24, 2006 Accepted by the
Carolina s Journey: Turning Big Data Into Better Care. Michael Dulin, MD, PhD
Carolina s Journey: Turning Big Data Into Better Care Michael Dulin, MD, PhD Current State: Massive investments in EMR systems Rapidly Increase Amount of Data (Velocity, Volume, Veracity) The Data has
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and
If you are signing for a minor child, you refers to your child throughout the consent document.
CONSENT TO PARTICIPATE IN A CLINICAL RESEARCH STUDY Adult Patient or Parent, for Minor Patient INSTITUTE: National Cancer Institute PRINCIPAL INVESTIGATOR: Raffit Hassan, M.D. STUDY TITLE: Tissue Procurement
THE SIDNEY KIMMEL COMPREHENSIVE CANCER CENTER AT JOHNS HOPKINS
Ushering in a new era of cancer medicine Center is ushering in a new era of cancer medicine. Progress that could not even be imagined a decade ago is now being realized in our laboratories and our clinics.
The Future of Personalized Medicine: Powered by Real World Data. Kris Joshi, PhD Global Vice President
The Future of Personalized Medicine: Powered by Real World Data Kris Joshi, PhD Global Vice President Safe Harbor Statement The following is intended to outline our general product direction. It is intended
G E N OM I C S S E RV I C ES
GENOMICS SERVICES THE NEW YORK GENOME CENTER NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. capabilities. N E X T- G E
Computational Pathology and the Role of Pathology Informatics
Pathology Informatics Summit 2015 Computational Pathology and the Role of Pathology Informatics Michael J. Becich, MD PhD - [email protected] Chairman, Department of Biomedical Informatics http://www.dbmi.pitt.edu
NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons
The NIH Commons Summary The Commons is a shared virtual space where scientists can work with the digital objects of biomedical research, i.e. it is a system that will allow investigators to find, manage,
HIPAA: Open Research Issues Michael L. Blau, Esq. McDermott, Will & Emery
HIPAA: Open Research Issues Michael L. Blau, Esq. McDermott, Will & Emery Research A. General Rules. There are four pathways for covered entities ( CEs ) to obtain permission under the Health Insurance
Masters of Science in Clinical Research (MSCR) Curriculum. Goal/Objective of the MSCR
Masters of Science in Clinical (MSCR) Curriculum Goal/Objective of the MSCR The MSCR program is an interdisciplinary research degree program housed within the Department of Epidemiology in the School of
Center for Health Informatics & Bioinformatics. A New Catalyst For Cutting Edge research, Funding Opportunities, and Education at NYULMC
Center for Health Informatics & Bioinformatics A New Catalyst For Cutting Edge research, Funding Opportunities, and Education at NYULMC 1 Current Challenges Biological Research Complex assays/instruments:
8/27/2014. Office of Research Informatics(ORI) CORI. Introduction- The Office of Research Informatics (ORI)?
Office of Research Informatics(ORI) Project Updates and New Initiatives Research Wednesday August 27th, 2014 Cory Ennis Julie Eckstrand Eric Hall Tony Leiro Introduction- The (ORI)? Product Updates- CTMS
Secondary Uses of Data for Comparative Effectiveness Research
Secondary Uses of Data for Comparative Effectiveness Research Paul Wallace MD Director, Center for Comparative Effectiveness Research The Lewin Group [email protected] Disclosure/Perspectives Training:
An Introduction to Genomics and SAS Scientific Discovery Solutions
An Introduction to Genomics and SAS Scientific Discovery Solutions Dr Karen M Miller Product Manager Bioinformatics SAS EMEA 16.06.03 Copyright 2003, SAS Institute Inc. All rights reserved. 1 Overview!
European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute
European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute Justin Paschall Team Leader Genetic Variation / EGA ! European Genome-phenome
From Fishing to Attracting Chicks
The Greater Plains Collaborative: a PCORNet Clinical Data Research Network s Strategies for Creating an Interoperable Architecture From Fishing to Attracting Chicks Russ Waitman, PhD Associate Professor,
Data-driven Medicine in the Age of Genomics Overcoming the Challenge With Advanced Molecular Analytics
Data-driven Medicine in the Age of Genomics Overcoming the Challenge With Advanced Molecular Analytics David A Dworaczyk, PhD Life and Health Sciences Strategic Development 11 December, 2014 Safe Harbor
Moffitt Cancer Center, M2Gen and ConvergeHEALTH Collaboration
Moffitt Cancer Center, M2Gen and ConvergeHEALTH Collaboration Eric Padron, M.D., Section Head, Personalized Medicine and Genomics, Malignant Hematology, H. Lee Moffitt Cancer Center and Research Institute
Understanding Big Data Analytics for Research
Understanding Big Data Analytics for Research Hye-Chung Kum Texas A&M Health Science Center, Dept. of Health Policy & Management University of North Carolina at Chapel Hill, Dept. of Computer Science ([email protected])
Using genetic biomarkers to pre-identify oncology patients for clinical trials
White paper Quintiles Vantage Point Quintiles helped develop or commercialize all of the Top 30 bestselling oncology products of 2014 Oncology pre-profiling: Using genetic biomarkers to pre-identify oncology
How does genetic testing work?
How does genetic testing work? What is a genetic test? A genetic test looks at to find changes (variants) that cause disease or put you at greater risk to develop disease. DNA is the code our bodies use
How Real-time Analysis turns Big Medical Data into Precision Medicine?
Medical Data into Dr. Matthieu-P. Schapranow GLOBAL HEALTH, Rome, Italy August 27, 2014 Important things first: Where to find additional information? Online: Visit http://we.analyzegenomes.com for latest
The MSCR Curriculum and Its Advantages
Masters of Science in Clinical Research (MSCR) Curriculum Goal/Objective of the MSCR The MSCR program is an interdisciplinary research degree program housed within the Department of Epidemiology in the
What Cancer Patients Need To Know
Taking Part in Clinical Trials What Cancer Patients Need To Know NATIONAL INSTITUTES OF HEALTH National Cancer Institute Generous support for this publication was provided by Novartis Oncology. Taking
Technology funding opportunities at the National Cancer Institute
Technology funding opportunities at the National Cancer Institute Through the Cancer Diagnosis Program http://cancerdiagnosis.nci.nih.gov/index.html Avraham Rasooly Ph.D. National Cancer Institute, Cancer
Course Requirements for the Ph.D., M.S. and Certificate Programs
Health Informatics Course Requirements for the Ph.D., M.S. and Certificate Programs Health Informatics Core (6 s.h.) All students must take the following two courses. 173:120 Principles of Public Health
University of Medicine and Dentistry of New Jersey (UMDNJ)
University of Medicine and Dentistry of New Jersey (UMDNJ) Dual-Degree Program between the UMDNJ Graduate School of Biomedical Sciences (GSBS) And the UMDNJ School of Public Health (SPH) Leading to the:
Using the Bionimbus Protected Data Cloud (PDC): Obtaining Access Credentials FAQ
Using the Bionimbus Protected Data Cloud (PDC): Obtaining Access Credentials FAQ It s very important that a PDC user is the only one who logs in with an account. If you have members of your lab that would
PATHOLOGY DEPARTMENTS AT GRADUATE MEDICAL EDUCATION TEACHING INSTITUTIONS
Chapter Six PATHOLOGY SPECIMENS A large number of tissues are collected for diagnostic or therapeutic reasons. These tissues are usually sent to a clinical, diagnostic or pathology laboratory for examination.
NCI Community Cancer Centers Program Program Overview Ascension Health St. Vincent Indianapolis Hospital
A. Name and location of hospital:, Indianapolis, IN B. Name of cancer center: St. Vincent Oncology Center C. Identify PI and key personnel with contact information for each pilot focus areas: a. Disparities
Visual Analytics to Enhance Personalized Healthcare Delivery
Visual Analytics to Enhance Personalized Healthcare Delivery A RENCI WHITE PAPER A data-driven approach to augment clinical decision making CONTACT INFORMATION Ketan Mane, PhD kmane@renci,org 919.445.9703
The University is comprised of seven colleges and offers 19. including more than 5000 graduate students.
UNC CHARLOTTE A doctoral, research-intensive university, UNC Charlotte is the largest institution of higher education in the Charlotte region. The University is comprised of seven colleges and offers 19
Big Data for Population Health and Personalised Medicine through EMR Linkages
Big Data for Population Health and Personalised Medicine through EMR Linkages Zheng-Ming CHEN Professor of Epidemiology Nuffield Dept. of Population Health, University of Oxford Big Data for Health Policy
Ask Us About Clinical Trials
Ask Us About Clinical Trials Clinical Trials and You. Our specialists and researchers are at the forefront of their fields and are leading the way in developing new therapies and procedures for diagnosing
