A.I. in health informatics lecture 5 standards & ontolologies



Similar documents
3M Health Information Systems

IBM Watson and Medical Records Text Analytics HIMSS Presentation

Electronic Health Record (EHR) Standards Survey

Terminology Services in Support of Healthcare Interoperability

Health Care Information System Standards

Electronic Health Record. Standards, Coding Systems, Frameworks, and Infrastructures

Standardized Terminologies Used in the Learning Health System

What is your level of coding experience?

Problem-Centered Care Delivery

MED 2400 MEDICAL INFORMATICS FUNDAMENTALS

Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary: Standardizing lab data to LOINC for meaningful use

SNOMED CT in the EHR Present and Future. with the patient at the heart

Bench to Bedside Clinical Decision Support:

Medical Terminologies and Classification Systems

IDRT: Integration and Maintenance of Medical Terminologies in i2b2

Analyzing rare diseases terms in biomedical terminologies

Reasoning / Medical Context / Physicians Area of Competence

Improving EHR Semantic Interoperability Future Vision and Challenges

TRANSFoRm: Vision of a learning healthcare system

Introduction to ICD-10-CM. An Introduction to the Transition from ICD-9-CM to ICD-10-CM

Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary: Enabling effective health information exchange

Lecture 3 Structured Data: Coding, Classification (ICD, SNOMED, MeSH, UMLS)

Healthcare Data: Secondary Use through Interoperability

Meaningful Use Stage 2 Update: Deploying SNOMED CT to provide decision support in the EHR

Interface Terminology to Facilitate the Problem List Using SNOMED CT and other Terminology Standards

SNOMED CT. The Language of Electronic Health Records

AHIMA Curriculum Map Health Information Management Baccalaureate Degree Approved by AHIMA Education Strategy Committee February 2011

Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future

ICD-9-CM to MedDRA Mapping How Well Do the. Disclaimer

Theodoros. N. Arvanitis, RT, DPhil, CEng, MIET, MIEEE, AMIA, FRSM

ICD-10 Frequently Asked Questions

ADVANCING MEASUREMENT OF PATIENT- CENTERED OUTCOMES AND QUALITY METRICS WITH ELECTRONIC HEALTH RECORDS

From Fishing to Attracting Chicks

The Yosemite Project A Roadmap for Healthcare Information Interoperability

HIPAA Glossary of Terms

A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches

Standards and Terminology in Health Informatics

Anthem Workers Compensation

The American Academy of Ophthalmology Adopts SNOMED CT as its Official Clinical Terminology

ARCHETYPE ALIGNMENT: A TWO-LEVEL DRIVEN SEMANTIC MATCHING APPROACH TO INTEROPERABILITY IN THE CLINICAL DOMAIN

MEDICAL INFORMATICS. Sorana D. Bolboacă

Re: Comments on 2015 Interoperability Standards Advisory Best Available Standards and Implementation Specifications

Belgian Terminology. Dr. Benny Van Bruwaene, M.D. Semantic Interoperability Committee

A Semantic Foundation for Achieving HIE Interoperability

EHR Systems: an Introduction

Medical Coding and Billing Specialist Course Description

How to extract transform and load observational data?

Medical Billing & Coding Catalog Course Description

Web-Based Genomic Information Integration with Gene Ontology

Presenting data: how to convey information most effectively Centre of Research Excellence in Patient Safety 20 Feb 2015

Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers

CODING. Neighborhood Health Plan 1 Provider Payment Guidelines

SNOMED-CT

Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment

ICD-10 Frequently Asked Questions For Providers

FHIM Model Content Overview

How to stop looking in the wrong place? Use PubMed!

HIPAA: AN OVERVIEW September 2013

Health Data Analysis Specialty Track Curriculum Competencies

Extreme Makeover - ICD-10 Code Edition: Demystifying the Conversion Toolkit

Automated Problem List Generation from Electronic Medical Records in IBM Watson

Role of SNOMED CT in Public Health Surveillance

ehr Sharable Data Vicky Fung Senior Health Informatician ehr Information Standards Office

A Framework for Ontology-Based Knowledge Management System

HIM Master s Degree Competencies* Domains, Subdomains, and Tasks 2007 and Beyond

The Big Picture: IDNT in Electronic Records Glossary

What is the Certified Health Record Analyst (CHDA)?

Knowledge Management

Building a Spanish MMTx by using Automatic Translation and Biomedical Ontologies

Find the signal in the noise

TEST INSTRUCTIONS FOR CROSS VENDOR EXCHANGE TABLE OF CONTENTS

METHODS IN MEDICAL INFORMATICS

METADATA STANDARDS AND METADATA REGISTRIES: AN OVERVIEW

Using a medical data dictionary to comply with vocabulary standards and exchange clinical data

Implementing reusable software components for SNOMED CT diagram and expression concept representations

What do clinical data standards mean for clinicians? Dr Nick Booth GP and Informatician, Warden, Northumberland, UK

Transcription:

A.I. in health informatics lecture 5 standards & ontolologies kevin small & byron wallace * Slides reuse material from Kleinsorge, Willis, and Emrick; 2007.

today standards facilitates interoperability if well designed stifles creativity if poorly designed ontologies standardization of knowledge UMLS

standards set of rules and definitions regarding completion of a process permits disassociated entities to cooperate important for medicine

standards required when excessive diversity impedes effectiveness standards can impede innovation effective and timely standards can also focus innovation

health care standards portability of patient records accuracy security, privacy uniformity in billing standardization of clinical information electronic communication standards

origination of standards ad hoc mutual agreement of participating entities de facto incidentally generated by dominant entity government mandate HCFA UB92 insurance-claim form consensus open process by interested parties

developing standards identification stage need and technological maturity conceptualization stage purpose, scope, format, etc. discussion stage identification of critical issues, timelines early implementation conformance and certification

the fruit of standards

and ANSI, CEN, ISO, ASTM Health Care Informatics Standards Board (HISB) electronic health care records data exchange health care codes and terminology communication with devices and instrumentation Knowledge and model representation privacy, confidentiality, security HIMSS, CPRI, IHE, NQF, WEDI

why do we care? in many (most) ways, we (I) don t standardized information is easier to reason with (less ambiguity) ontologies research science should have a role in determining standards

ontologies An ontology may take a variety of forms, but necessarily it will include a vocabulary of terms, and some specifica3on of their meaning. This includes definifons and an indicafon of how concepts are inter- related which collecfvely impose a structure on the domain and constrain the possible interpretafon of terms - Uschold et al., 1998

ontologies Burgun et al., 2001

UMLS semantic network Kleonsorge et al., 2007

ontologies frames knowledge within a domain structured representation universal encoding reduces ambiguity DNA region of biological interest with a name and that carries a genefc trait or phenotype gene the coding region of DNA DNA fragment that can be transcribed and translated in to a protein

ontologies scientific knowledge desires precision decomposes into entities and relationships logical reasoning allows storing information in databases requires hand encoding or information extraction from natural language efficient storage of volumes of knowledge human/machine interface

ontological components concepts entities within a domain relations interactions between concepts instances named entities axioms conceptualizafon concrete constraints between (named) entities

concepts primitive concepts globular protein hydrophobic core defined concepts nucleus containing Eukaryotic

relations taxonomic specialization ( is a kind of ) partitive ( is a component of ) associative nominative locative causative many others

ontology use domain-oriented encodes domain knowledge task-oriented encodes methods of completing tasks generic open frameworks (e.g., Cyc)

ontology benefits community reference schema specification ontology-based search input to NLP systems

ontology life-cycle Stevens et al., 2000

knowledge representation natural-language vocabularies hand-crafted tree inheritance structures frame-based systems similar to object-based modeling description logics primacy of relationship inference

evaluating ontologies expressivity can the domain be encoded? rigor satisfiable and consistent semantics captures intended meaning?

Unified Medical Language System (UMLS ) The UMLS, or Unified Medical Language System, is a set of files and sorware that brings together many health and biomedical vocabularies and standards to enable interoperability between computer systems. You can use the UMLS to enhance or develop applicafons, such as electronic health records, classificafon tools, dicfonaries and language translators. more than just an ontology ostensibly rich knowledge source for AI research http://umls.nlm.nih.gov

NLM strategy designating U.S. standards starting point and maintenance coordinate development of standards into interlocking set broaden participation promote usage

standards CHI (clinical) LOINC E.g., lab test results, problems, diagnoses, history, physical Electronic exchange of clinical health informafon in U.S. Government systems HIPAA (administrafve) CPT e.g., health insurance claims, billing, ordering HIPAA AdministraFve SimplificaFon provisions Designated DHHS nafonal standards for electronic healthcare transacfons PHIN (public health) ICD- 9- CM e.g., disease surveillance, immunizafon rates, environmental monitoring CDC designated standards for public health reporfng

UMLS objectives intellectual middleware for developers (not end users) knowledge sources to ameliorate: disparities in language and format (e.g., atrial fibrillation, auricular fibrillation, af) disparities in granularity and perspective (e.g., contusions, hematoma, bruise)

knowledge sources Metathesaurus 1 million+ biomedical concepts from over 100 sources SemanFc Network 135 broad categories and 54 relafonships between categories SPECIALIST Lexicon & Tools lexical informafon and programs for language processing 3 knowledge sources (used separately or together)

Metathesaurus very large multipurpose multi-lingual information regarding biomedical/health concepts names and associated codes relationships amongst concepts

source vocabularies derived from clinical, research, administrative, public health, etc. valid values thesauri: MeSH, CRISP, NCI statistical classifications: ICD-9-CM billing codes: CPT, ABC codes clinical coding: SNOMED CT

source vocabularies intended to coordinate, not derive a single vocabulary Diagnosis/signs and symptoms: ICD9CM, ICD10, ICD10CM, ICD10AM, ICD-O, ICPC, ICF, SNOMED CT, Read Codes, MedDRA, MEDCIN, DSM Procedures: CPT, CDT, HCPCS, OCPS, SNOMED CT, ICD9CM, ICD10-PCS Nursing: NANDA, NIC, NOC, OMS, HHC Diagnostic tests: LOINC, UltraSTAR Drugs: VANDF, NDC, RXNORM, NDDF Medical devices: SPN, UMD Genomics: GO, HUGO, NCBI Taxonomy

term clusters concepts contain synonymous terms preferred term is indicated unique identifier (CUI) is assigned term source term type source ID Addison s disease Metathesaurus PN Addison s disease SNOMED CT PT 363732003 Addison s Disease MedlinePlus PT T1233 Addison Disease MeSH PT D000224 Bronzed disease SNOMED Intl SY DB- 70620 Primary Adrenal Insufficiency MeSH EN D000224 Primary hypoadrenalism MedDRA LT 10036696 syndrome, Addison C0001403 Addison s disease

concept organization Concept C0001621 Term L0001621 Term L0181041 Term L1279026 S0011231 Adrenal Gland Disease A0020266 MeSH A7568579 NCI Thesaurus S0000441 Disease of adrenal gland A0001264 SNOMED 1982 A6917004 SNOMED Clinical Terms S0481705 Diseases of Adrenal Gland A0014499 SNOMED 1982 S0220090 Diseases, adrenal gland A0049924 MeSH S0632950 Disorder of adrenal gland A0688820 Read Codes A4778687 SNOMED Clinical Terms S0354509 Adrenal Gland Disorders A6996540 MedlinePlus A7576253 NCI Thesaurus A7561794 Psychological Index Terms S1520972 Nebennierenkrankheiten A7500884 [ ]

concepts ~1.5M CUI synonymous sets ~5.5M LUI normalized names ~6.1M SUI concept strings ~7.4M AUI source-specific A0066000 Headache (MeSH) A0065992 Headache (ICD-10) S0046854 A0066007 Headaches (MedDRA) A12003304 Headaches (OMIM) S0046855 L0018681 A0540936 Cephalodynia (MeSH) S0475647 L0380797 C0018681 2007 numbers

concept categories high-level semantic types NLM derived source independent Disease or Syndrome Diseases Endocrine Diseases Adrenal Gland Diseases Adrenal Gland Hypofunction Addison s Disease

concept relationships symbolic relationships ~8M pairs of concepts co-occurrence relationships ~6M pairs of concepts mapping relationships ~150k mappings

symbolic relationships Hierarchical Parent / Child Broader / Narrower than Derived from hierarchies Siblings (children of parents) AssociaFve Other Various flavors of near- synonymy Similar Source asserted synonymy Possible synonymy PAR/CHD RB/RN SIB RO RL SY RQ

Semantic Types Anatomical Structure Fully Formed Anatomical Structure Body Part, Organ or Organ Component Embryonic Structure Pharmacologic Substance Disease or Syndrome Population Group Semantic Network Concepts Esophagus 16 Left Phrenic Nerve 5 Mediastinum Heart 13 Valves 22 Heart Fetal Heart Saccular Viscus 38 49 Angina Pectoris Cardiotonic 237 Agents Tissue Donors Metathesaurus

semantic network 135 semantic types broad categories (drug, virus, etc.) 54 semantic relationships links categories (is-a, causes, treats) types + relationships broad categorization of biomedicine

UMLS semantic network Kleonsorge et al., 2007

SPECIALIST lexicon Over 330k entries syntax morphology orthography natural language interface

orthography Spelling variants oe/e ae/e ise/ize genifve mark BriFsh- American variants oesophagus - esophagus anaemia - anemia cauterise - cauterize Addison's disease Addison disease Addisons disease crifcise - - crifcize centre - - center foetus - - fetus

normalization Hodgkin s diseases, NOS Hodgkin diseases, NOS Hodgkin diseases, hodgkin diseases, hodgkin diseases hodgkin disease disease hodgkin

SPECIALIST lexicon {base=kaposi's sarcoma spelling_variant=kaposi sarcoma entry=e0003576 cat=noun variants=uncount variants=reg variants= } {base=chronic entry=e0016869 cat=adj variants=inv posifon=aurib(1) posifon=pred } {base=aspirate entry=e0010803 cat=verb variants=reg tran=np nominalizafon=aspirafon noun E0010804 } {base=in entry=e0033870 cat=prep }

subdomain integration Clinical repositories Genetic knowledge bases Other subdomains SNOMED CT OMIM NCBI Taxonomy UMLS MeSH Biomedical literature Model organisms FMA GO Anatomy Genome annotations

why do we care? actionable intelligence requires knowledge about the universe medicine very knowledge intensive very rich information source machine learning requires appropriate representation models uncertainty