Real-Time Transcription of Radiology Dictation: A Case Study for TabletPCs

Similar documents
Voice Driven Animation System

Chapter 5 Objectives. Chapter 5 Input

Voice Input Computer Systems Computer Access Series

Dragon Solutions. Using A Digital Voice Recorder

Transcription FAQ. Can Dragon be used to transcribe meetings or interviews?

SMART Boards. If the board is connected to a different computer - Orientation is needed whenever you connect it to a new or different computer.

WHITEPAPER. Mobile Workforce Productivity Solutions. Streamline Field Reporting Workflow with Speech Recognition

How To Draw On An Ipad With A Touch Tablet (For Free) On A Blackberry Or Ipad 2 (For A Sims) On An Easter Egg (For An Sims 2) On Blackberry 2 (Blackberry)

Chapter 9 Input/Output Devices

KINDERGARTEN INSTRUCTIONAL TECHNOLOGY OBJECTIVES

SketchUp Instructions

SMART Ink 1.5. Windows operating systems. Scan the following QR code to view the SMART Ink Help on your smart phone or other mobile device.

NOTE TAKING AND THE TABLET PC

Transcription Module Easy Start Guide

Digital Pen & USB Flash Drive. User Guide. December

REPLAYING A USER LOGFILE AND PRELIMINARY ANALYSIS

SMART Board Interactive Whiteboard Setup with USB Cable

Speech Recognition Software Review

DRAGON CONNECTIONS PARTNER PROGRAM GUIDE

Dragon Solutions Using A Digital Voice Recorder

Guide to the Dragon Bar

Heuristic Evaluation of Three Cellular Phones

In this session, we will explain some of the basics of word processing. 1. Start Microsoft Word 11. Edit the Document cut & move

The Keyboard One of the first peripherals to be used with a computer and is still the primary input device for text and numbers.

The second goal is to provide a list of tips, tricks, and best known methods that have been discovered over the life span of the course.

set in Options). Returns the cursor to its position prior to the Correct command.

mini ScanEYE User Manual

CareTracker Electronic Health Record (EHR) Setting Up Your Computers for CareTracker EHR

The preliminary design of a wearable computer for supporting Construction Progress Monitoring

SMART Notebook 10 User s Guide. Linux Operating Systems

ADDING DOCUMENTS TO A PROJECT. Create a a new internal document for the transcript: DOCUMENTS / NEW / NEW TEXT DOCUMENT.

Creating Captions in YouTube

Winzer Corporation 1 Revision: 4.0

Microsoft Access 2010 Part 1: Introduction to Access

Medical 360 Network Edition and Citrix

Spreadsheet - Introduction

The Prognosis is Good: Speech Recognition Software Can Increase Productivity in the Medical Environment

Photo-triage: Rapidly annotating your digital photographs

User Support Manual KIDS IEP AND DATA MANAGEMENT SOFTWARE PROGRAM. Customized Relational Technology, Inc.

Opal-RAD Dictation/Transcription User Manual

Frequently Asked Questions

Word 2007: Basics Learning Guide

Microsoft Project 2010

Evaluation of Tablet PCs for engineering content development and instruction

Creating a Poster in PowerPoint A. Set Up Your Poster

Tablet PC Quick Start

PowerPoint 2007: Basics Learning Guide

DOING MORE WITH WORD: MICROSOFT OFFICE 2010

Welcome to the Notability User Guide Find what you re looking for quickly using the search icon.

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

OneNote 2013 Tutorial

Copyright EPiServer AB

Productivity for the Enterprise

Educational Support for Children with Special Needs: K-12 SNE Kids Touch

Chapter 5 Input. Chapter 5 Objectives. What Is Input? What Is Input? The Keyboard. The Keyboard

Chapter 3 Input Devices

Dragon Solutions Transcription Workflow

2. How to Use SMART Board as a Projector and Whiteboard

Terminal Server Guide

Using Excel for Analyzing Survey Questionnaires Jennifer Leahy

Book Builder Training Materials Using Book Builder September 2014

paragraph(s). The bottom mark is for all following lines in that paragraph. The rectangle below the marks moves both marks at the same time.

Rule-Based Ship Design

Removing Primary Documents From A Project. Data Transcription. Adding And Associating Multimedia Files And Transcripts

Excel 2007: Basics Learning Guide

Sweet Home 3D user's guide

The Notebook Software Activity Guide

Microsoft PowerPoint 2010 Computer Jeopardy Tutorial

GETTING STARTED WITH COVALENT BROWSER

OX Spreadsheet Product Guide

Properties of Real-World Digital Logic Diagrams

Quick Help Guide (via SRX-Pro Remote)

ChemPad3. a tutorial. Ben Shine and Dana Tenneson. May 21, 2008

Excel 2007 Basic knowledge

Microsoft PowerPoint Tutorial

Getting Started with WebEx Access Anywhere

USING TABLET COMPUTERS FOR PRESENTATION OF SLIDES IN THE PROCESS OF TEACHING

Employee Appointment Books. User s Manual

SignalDraw: GUI Tool For Generating Pulse Sequences

Introduction to Microsoft Word 2008

How to Practice Pronunciation Without a Microphone

Excel Spreadsheet Activity Redo #1

Enhanced Formatting and Document Management. Word Unit 3 Module 3. Diocese of St. Petersburg Office of Training Training@dosp.

Interactive Whiteboard Functionality Overview Choosing Pen Style Erasing / Modifying Writing Undo / Redo

All V7 registers support barcode printing, except the Sharp 410/420 1A ROM and that limitation is based upon the register.

Welcome to the Notability! User Guide! Find what you re looking for! quickly using the search icon.!

Internet and Computing Core Certification Guide Module A Computing Fundamentals

SMART Board Menu. Full Reference Guide

On Demand Customer Feedback at the Point of Experience

F9D7 04 (ESKWP2): Word Processing Software 2

Excel Unit 4. Data files needed to complete these exercises will be found on the S: drive>410>student>computer Technology>Excel>Unit 4

The benefits of Microsoft OneNote 2013

A Short Introduction to Transcribing with ELAN. Ingrid Rosenfelder Linguistics Lab University of Pennsylvania

P a g e 0. Training Guide for the Content Management System

X-Trade Brokers Dom Maklerski S.A. XTB Expert Builder. Tutorial. Michał Zabielski

Enterprise Express DICTAPHONE ENTERPRISE EXPRESS

Using Excel As A Database

A Real Time, Object Oriented Fieldbus Management System

Cricut Design Space Reference Guide & Glossary

Welcome to The Grid 2

Transcription:

Real-Time Transcription of Radiology Dictation: A Case Study for TabletPCs Wu FENG feng@cs.vt.edu Depts. of Computer Science and Electrical & Computer Engineering Virginia Tech Laboratory Microsoft escience Biomedical & Bioinformatics Background This Talk Making IT More Useful for Radiologists (and Hospitals) Acknowledgments: IBM mpiblast (http://www.mpiblast.org) A parallelized version of NCBI BLAST (Basic Local Alignment Search Tool) that delivers super-linear speed-up. mpiblast reduces search time from 1346 minutes (22.4 hours) to just over 4 minutes! # Nodes 1 4 mpiblast v1.0 1.00 9.23 mpiblast New 1.00 4.52 The Design, Implementation, and Evaluation of mpiblast, 4 th Int l Conf. on Linux Clusters, 6/2002. Parallel Genomic Sequence-Searching on an Ad-Hoc Grid ACM/IEEE SC 06, 11/2006. How to do all-vs-all comparisons of 1000s of genomes 16 64 128 33.15 94.95 170.49 48.03 173.24 305.49 1

Overview Motivation Approach Speech Recognition System Handwriting Recognition System Gesture Recognition System Integrated Multimodal Environment for Immediate Radiology Transcription Usage Scenarios Case Study Conclusion Radiology Transcription Today? Radiologist dictates x-ray diagnoses into a tape recorder. Radiologist forwards the tape to a transcriber who types the analyses into hardcopy reports for the radiologist. Radiologist receives the hardcopy reports 24 to 72 hours later (i.e., after having looked at potentially hundreds of other x-ray analyses in the interim). 2

Motivation: Radiology Issues Previously such transcription was performed at (or near) the hospital. Now such transcription is outsourced overseas?! Slow turnaround means that the radiologist cannot be expected to remember exactly what (s)he dictated for each individual x-ray, thus increasing the liability to the hospital. Challenge Immediate turnaround time on radiology transcription Lower cost (by leveraging information technology to eliminate the need for transcribers). Significantly reduced liability to the hospital. Motivation: Information Technology Problems with Computers, PDAs, and PocketPCs Ease of Use Computers: Users have been forced to adapt to unnatural input devices, e.g., keyboard, mouse, trackball. PDAs and PocketPCs: Relatively good due to natural metaphor that PDA = paper and stylus = pen User Productivity Keyboard and mouse are slower than speech input but faster than stylus input, e.g., sending e-mail. Keyboard: 60-100 words per minute (wpm). Stylus: 20-30 wpm. Speech: 150-250 wpm. Is the Tablet PC an ideal platform? Yes (and no). 3

Overview Motivation Approach Speech Recognition System Handwriting Recognition System Gesture Recognition System Integrated Multimodal Environment for Immediate Radiology Transcription Usage Scenarios Case Study Conclusion Approach Speech to improve ease of use & increase productivity. Requirements A speech recognizer that runs in real time and relatively errorfree, e.g., > 95% recognition accuracy. Tools for correcting speech-recognition errors: Simple and natural to use, e.g., stylus and keyboard. Seamlessly integrated with the speech recognizer. Virtually error-free. Necessity of Requirements Ensure that productivity is significantly better than typing and mouse-ing. 4

Overview of Proposed Solution Integrated environment that seamlessly integrates different modes of input: speech, handwriting, and gestures. Dictate x-ray analysis into a speech recognizer. Correct any recognition errors (or even re-organize the dictation) using handwriting and gesture recognizers. Print and sign hardcopy of the radiology transcription. Overview Motivation Approach Speech Recognition System Handwriting Recognition System Gesture Recognition System Integrated Multimodal Environment for Immediate Radiology Transcription Usage Scenarios Case Study Conclusion 5

Speech Recognition System Requirements Real time and relatively error-free, e.g., 95% accurate. Currently Available Systems IBM ViaVoice & Dragon Systems Naturally Speaking. Continuous-speech processing produces pseudo-real-time and error-prone, i.e., typically 70-75%, recognition. Our System Reduce computational complexity; improve recognition. Customized speech profile for each individual. Discrete speech processing (i.e., talking like a robot). Result: Real-time and relatively error-free recognition, i.e., 95% on average. Handwriting Recognition System Requirements Virtually error-free. Current Solutions Palm Pilot Graffiti Requires learning a new alphabet and is error-prone. Virtual keyboard (as a back-up) Our System Discrete block letters from the regular alphabet (with each letter confined to a virtual box). Virtual keyboard as a back-up. 6

Gesture Recognition System Handwriting and Written-Gesture Recognition Use the same software architecture hand-drawn gestures or written characters pre-processor shape recognize r displayed characters or recognized gestures Capture temporal information as the user writes. Number of strokes, order of strokes, direction of strokes, and speed of strokes. Major Difference Gesture recognition is more general than handwriting recognition. Why? Complications with Gesture Recognition Handwriting Depends only on shape consistency. The difference between the same symbol produced at different times is less than the difference between different symbols. Gestures Symbols routinely violate shape consistency but are still recognizable as the same symbol by the human eye. 7

Gestural Variations Size Non-Linear Scaling Orientation Direction Reversal Defining a Gesture Alphabet or. Select the region encompassed by the oval. Select the linear region encompassed by the brackets. Select the vertical region bounded by the corners, e.g., complete lines of text. Delete selected region. Move selected region. Delete crossed out linear region. Split a line (or object). Display alternate word list. Playback speech. Undo. 8

Integrated Multimodal Environment title bar menu bar status bar handwriting & gesture window speech & gesture window Overview Motivation Approach Speech Recognition System Handwriting Recognition System Gesture Recognition System Integrated Multimodal Environment for Immediate Radiology Transcription Usage Scenarios Case Study Conclusion 9

Usage Scenarios Brackets: Select Linear Text 10

Delete Gesture: Removes Selected Region Corner Gestures: Select Row(s) of Text 11

Oval: Select a Region of Text Arrow: Moves Selected Text 12

Informal Case Study Setting: A leading research hospital. Test Subjects: Five Four native English speakers, one European. All male. Duration of Case Study: One Working Week One day set-up and training. Three days of use. One day of gathering information and feedback. Empirical Data Speech Recognition Rates 91% for the European (with a thick accent). 94%, 95%, 97%, and 97% for the native English speakers. Recognition rates improved over the course of three days. Handwriting & Written-Gesture Recognition Rates Given the infrequency of use, not enough data to really make any conclusive statements. Handwriting Initially below 90% for everyone. Improved significantly over the three days. Clear preference for virtual keyboard. Gesture Virtually 100% due to distinctness of gestures. 13

What Test Subjects Said The Negative Speaker profiles Remembering to load speaker profile a priori. Training the system to create a custom speaker profile. Turning speech recognition on and off. Having to talk in discrete speech. Slower than dictating into a tape recorder. Requiring too much precision in handwriting. Virtual keyboard preferred (a la what PDA market found out about thumbpads). The Positive Very usable. Very promising and innovative use of technology for immediate transcription. Conclusion (Traditional) Graphical User Interface of a PC Enhances ease of use and increase productivity. Problem: Keyboard and mouse are not natural input devices to the novice user. PDA & PocketPC Interface Enhances ease of use but reduce productivity. Problem: Stylus writing is 2x-5x slower than keyboarding and 8x-10x slower than talking. Tablet PC Interface (Arguably) enhances ease of use with marginal increase in productivity (lack of seamless integration of input devices). 14

Conclusion Integrated Multimodal Environment Speech Natural interface to enhance ease of use. Productivity increase of 2x-4x over keyboarding and 8x-10x over writing with a stylus. Gestures and Handwriting Natural interface to enhance ease of use. Supplements the speech recognition environment. Re-visiting the Challenge Immediate turnaround time on radiology transcription Lower cost (by leveraging information technology to eliminate the need for transcribers). Significantly reduced liability to the hospital. 15