Big Data for Population Health and Personalised Medicine through EMR Linkages



Similar documents
Big Data for Population Health

DISCLOSURES RISK ASSESSMENT. Stroke and Heart Disease -Is there a Link Beyond Risk Factors? Daniel Lackland, MD

evaluation of cancer hazards Robert A Baan PhD The IARC MONOGRAPHS International Agency for Research on Cancer Lyon, France

Electronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider

GENETICS AND INSURANCE: QUANTIFYING THE IMPACT OF GENETIC INFORMATION

A Multi-locus Genetic Risk Score for Abdominal Aortic Aneurysm

What are observational studies and how do they differ from clinical trials?

SAP HANA Enabling Genome Analysis

Informatics: Opportunities & Applications. Professor Colin McCowan Robertson Centre for Biostatistics and Glasgow Clinical Trials Unit

The M.U.R.D.O.C.K. Study

Information for patients and the public and patient information about DNA / Biobanking across Europe

Genomics and Family History Survey Questions Updated March 2007 Compiled by the University of Washington Center for Genomics & Public Health

Epidemiology of Hypertension 陈 奕 希 李 禾 园 王 卓

Genomic Medicine The Future of Cancer Care. Shayma Master Kazmi, M.D. Medical Oncology/Hematology Cancer Treatment Centers of America

Big data in health research Professor Tony Blakely

The National Institute of Genomic Medicine (INMEGEN) was

EXPANDING THE EVIDENCE BASE IN OUTCOMES RESEARCH: USING LINKED ELECTRONIC MEDICAL RECORDS (EMR) AND CLAIMS DATA

A leader in the development and application of information technology to prevent and treat disease.

Effect measure modification & Interaction. Madhukar Pai, MD, PhD McGill University madhukar.pai@mcgill.ca

Innovation Platform: Sudden Cardiac Death

Beware that Low Urine Creatinine! by Vera F. Dolan MSPH FALU, Michael Fulks MD, Robert L. Stout PhD

Survey on the Mortality from Malignant Tumors of Female Asbestos Spinning Workers

Coronary Heart Disease (CHD) Brief

Gene Therapy. The use of DNA as a drug. Edited by Gavin Brooks. BPharm, PhD, MRPharmS (PP) Pharmaceutical Press

Number. Source: Vital Records, M CDPH

Rheumatoid arthritis in England 1996 to Mortality trends

Outline. Personal profile & research interests. Rheumatology research in Ireland. Current standing. Future plans

Risk Factors for Alcoholism among Taiwanese Aborigines

The American Cancer Society Cancer Prevention Study I: 12-Year Followup

Complex Genetic Risk: The Implications for Insurance

Understanding Diseases and Treatments with Canadian Real-world Evidence

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE E15

Causation in Systems Medicine: Epistemological and Metaphysical Challenges

Saving healthcare costs by implementing new genetic risk tests for early detection of cancer and prevention of cardiovascular diseases

Decision to Continue the Development of Tecemotide (L-BLP25) in Non-Small Cell Lung Cancer to be Announced

Research Skills for Non-Researchers: Using Electronic Health Data and Other Existing Data Resources

How Can Institutions Foster OMICS Research While Protecting Patients?

Quantifying Life expectancy in people with Type 2 diabetes

Ass Professor Frances Kay-Lambkin. NHMRC Research Fellow, National Drug and Alcohol Research Centre UNSW

Core therapeutic areas

Vision for the Cohort and the Precision Medicine Initiative Francis S. Collins, M.D., Ph.D. Director, National Institutes of Health Precision

Globally 12% of all deaths among adults aged 30 years and over were attributed to tobacco.

JESSE HUANG ( 黄 建 始 ),MD,MHPE,MPH,MBA Professor of Epidemiology Assistant President

COMMITTEE FOR MEDICINAL PRODUCTS FOR HUMAN USE (CHMP)

Η δίαιτα στην πρόληψη του αγγειακού εγκεφαλικού επεισοδίου

Randomized trials versus observational studies

Validation and Replication

Voluntary Genomic Data Submissions at the U.S. FDA

Hepatitis C Glossary of Terms

NGS and complex genetics

Summary of Investigation into the Occurrence of Cancer Census Tract 2104 Zip Code 77009, Houston Harris County, Texas May 11, 2010

Optimal levels of alcohol consumption for men and women at different ages, and the all-cause mortality attributable to drinking

Lung cancer and asbestos

Table 1. Underlying causes of death related to alcohol consumption, International Classification of Diseases, Ninth Revision

HeartScore Web - based version users guide TABLE OF CONTENTS. 1. Preamble Benefits of using HeartScore Accessing HeartScore...

Pharmacology skills for drug discovery. Why is pharmacology important?

Bayer Extends Clinical Investigation of Xarelto for the Prevention and Treatment of Life-Threatening Blood Clots in Patients with Cancer

Nuevas tecnologías basadas en biomarcadores para oncología

Alcohol Units. A brief guide

ALCHEMIST (Adjuvant Lung Cancer Enrichment Marker Identification and Sequencing Trials)

Chapter 15 Multiple myeloma

The VITamin D and OmegA-3 TriaL (VITAL)

Targeting Specific Cell Signaling Pathways for the Treatment of Malignant Peritoneal Mesothelioma

Bayer Pharma AG Berlin Germany Tel News Release. Not intended for U.S. and UK Media

Ask Us About Clinical Trials

Public Health Education

2.5 Cancer of the liver

GENETIC DATA ANALYSIS

Know your Numbers The D5 Goals for Diabetes Care. Shelly Hanson RN, CNS, CDE Cuyuna Regional Medical Center November 6, 2014

Corporate Medical Policy Genetic Testing for Alpha-1 Antitrypsin Deficiency

Epigenetic variation and complex disease risk

Clinical Research Infrastructure

The PSA Controversy: Defining It, Discussing It, and Coping With It

Big data size isn t enough! Irene Petersen, PhD Primary Care & Population Health

Chickenpox and Shingles Vaccines

Public health benefits of strategies to reduce greenhouse gas emissions. Professor Sir Andy Haines

Regulatory Issues in Genetic Testing and Targeted Drug Development

Chapter 7: Effect Modification

Aggregate data available; release of county or case-based data requires approval by the DHMH Institutional Review Board

Social inequalities in all cause and cause specific mortality in a country of the African region

The impact of genomic selection on North American dairy cattle breeding organizations

A Primer of Genome Science THIRD

Underwriting Critical Illness Insurance: A model for coronary heart disease and stroke

Transcription:

Big Data for Population Health and Personalised Medicine through EMR Linkages Zheng-Ming CHEN Professor of Epidemiology Nuffield Dept. of Population Health, University of Oxford Big Data for Health Policy Workshop, Toronto, Canada 5 November 2014

Declines in stroke mortality: not fully explained but nothing to do with genetic factors 2 Male 2 Female Japan UK Annual deaths at age 35-69 yrs per 1000 1 1 0 0 1950 1970 1990 2010 1950 1970 1990 2010 Year

China: large, unexplained mortality variations Oesophagus cancer Nasopharynx cancer Females only, hence little effect of tobacco or alcohol (Red = high mortality is >10x yellow = low mortality)

Age-specific trends in adult liver cancer and cirrhosis mortality in Qidong, China, 1980-2009

Trend of annual cigarette production in China (5% annual increase since 1998) 2000 billion 2000 billion 1000 billion 1000 billion 1949 1954 1959 1964 1969 1974 1979 1984 1989 1994 1999 2004

Cigarette consumption & lung cancer in US

CKB: Smoking patterns by year of birth among men Two thirds of men smoked, slightly higher in rural than in urban

CKB: Adjusted RR for total mortality by age started (Tobacco-attributed death: 25% urban, 15% rural) To be submitted to the Lancet, 2014

China Kadoorie Biobank: design (genetic & other causes of common disease) 500K recruited from 10 localities in 2004-08 Participants interviewed, measured, and gave 10 ml blood for long-term storage Periodic resurvey of 5% (for regression dilution) All followed up indefinitely via electronic record linkage to deaths and ALL hospital episodes General consent for access to health record for unspecified medical research

CKB: Location of the 10 survey sites in China (with different risk exposure and disease patterns) Harbin Qingdao Gansu Sichuan Henan Suzhou Zhejiang Urban Rural Hunan Liuzhou Haikou Chen Z, et al. Int J Epidemiol 2005, 2011

A human face on Mars? 1976: Viking Orbiter 2001: Mars Orbiter More observations allow a clearer, more precise, and more detailed picture of reality also makes it less likely that we see patterns when none exist

SIZE matters: SBP vs IHD mortality, by age 5K, 50K & 500K randomly chosen from PSC* IHD mortality & 95% CI 256 128 64 32 16 8 4 2 5,000 adults Age 80-89 70-79 60-69 50-59 40-49 256 128 64 32 16 8 4 2 50,000 adults Age 80-89 70-79 60-69 50-59 40-49 256 128 64 32 16 8 4 2 500,000 adults Age 80-89 70-79 60-69 50-59 40-49 1 1 1 120 140 160 180 Usual SBP (mmhg) 120 140 160 180 Usual SBP (mmhg) 120 140 160 180 Usual SBP (mmhg) * Prospective Studies Collaboration, Lancet 2002

CKB: Main data sources for linkage Death registries Active followup Outcome Follow up in CKB Disease registries Health insurance (national)

National Health Insurance system in China Health Care Information Management System Ischaemic stroke Confirmed by MRI By 1.1.2014, >98% of participants had been linked to the HI databases through unique national ID number

National health insurance system in China Introduced during 2004-6 with almost universal coverage by 2010 Diagnosis ICD-10 coded, plus disease descriptions and >2,000 procedure codes Managed electronically at city or county levels, mainly for financial purposes (& itemised cost) In CKB ~1.6M episodes, ~20M procedures/tests, ~3500 diseases had been recorded during 2006-14

Strong political support within China

CKB: examples of new research using EMR Infective causes of cancer (WHO IARC, France) Genetics to aid drug development (GSK, Merck) Multi-omics biomarker discovery (Oulu,SomaLogic) Effects of air pollution (Fudan University, China) Healthcare delivery in China (Oxford & Fudan) Plus conventional epidemiological research

Drug Development Across the Industry: From Discovery to Approval For 5-10,000 compounds discovered, only 1 becomes a FDA-approved drug It takes 10-15 years to develop a new drug, costing ~US$1.3 billion Despite soaring cost, the annual No. of approved drugs halved since 1996

Lp-PLA 2 A phospholipase enzyme carried on LDL and macrophages in atherosclerotic plaques Elevated activity predicts CVD risk, but causal effect uncertain Null variants in PLA2G7F (found only in East Asians), gene encoding Lp-PLA 2, reduces enzyme activity In animal models inhibitors of Lp-PLA 2 (darapladib) reduced coronary atherosclerosis Two trials assessed the effects of darapladib in 30,000 patients CKB: using PheWAS approach to assess the efficacies and safeties of the inhibition of Lp-PLA 2 in 100K participants

CKB: Examples of PheWAS of genetic variant or GRS To compare disease risk between extreme thirds of a gene score based on all SNPs 0.5 1.0 1.5 OR per allele (95% CI) Unpublished results

CKB: opportunities for multi-omics research Genome Transcriptome Proteome Metabolome Phenome ~40,000,000 ~20,000 ~100,000 ~5,000 ~5,000 We aim to genotype 510,000 samples using customised array

Genomics in medicine Understanding the structure of genomes Understanding the biology of genomes Understanding the biology of diseases Advancing the science of medicine Improving the effectiveness of healthcare DNA sequence Genes variation Gene regulation Gene function Pathways Mechanisms Diagnosis Treatment Prevention Risk prediction Targeted therapy ED Green et al. Nature 2011; 470:204-213

CKB: Opportunities for BIG DATA using EMR and multi-omics information Great increase in the range of diseases that can be studied Improved power, disease classification & patient stratifications Better understanding of genetic factors on multiple diseases with shared pathways/mechanisms Further exploration of causative genes at loci discovered previously from trans-ethnic studies Identification of novel biomarkers as therapeutic targets Better predication of drug response and prognosis Need novel tools for data handling, analyses and interpretation

Oxford Big Data Institute