IBM Watson : Beyond playing Jeopardy!



Similar documents
What is Watson An Overview

Putting IBM Watson to Work In Healthcare

Paul J. Ledak Vice President, IBM Research IBM Corporation

MAN VS. MACHINE. How IBM Built a Jeopardy! Champion x The Analytics Edge

How Big Data and Artificial Intelligence Change the Game for. presented by Jamie Bisker Senior Analyst, P&C Insurance Aite Group

Watson. An analytical computing system that specializes in natural human language and provides specific answers to complex questions at rapid speeds

» A Hardware & Software Overview. Eli M. Dow <emdow@us.ibm.com:>

Andre Standback. IT 103, Sec /21/12. IBM s Watson. GMU Honor Code on I am fully aware of the

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

What is Artificial Intelligence?

Dr. John E. Kelly III Senior Vice President, Director of Research. Differentiating IBM: Research

IBM's Watson could usher in new era of ALS research and medicine ons/ideas/index.html?

CSC384 Intro to Artificial Intelligence

Watson, what s on, what s next?

Data Refinery with Big Data Aspects

Auto-Classification for Document Archiving and Records Declaration

Big Data and Natural Language: Extracting Insight From Text

The Prolog Interface to the Unstructured Information Management Architecture

Search and Information Retrieval

Infrastructure Matters: POWER8 vs. Xeon x86

Keywords: Big Data, HDFS, Map Reduce, Hadoop

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

Big Data and Trusted Information

Detecting Anomalous Behavior with the Business Data Lake. Reference Architecture and Enterprise Approaches.

IBM Unstructured Data Identification and Management

IBM Netezza High Capacity Appliance

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW

Web Data Mining: A Case Study. Abstract. Introduction

Knowledge Discovery from patents using KMX Text Analytics

Tap into Big Data at the Speed of Business

Big Data & Analytics for Semiconductor Manufacturing

Using In-Memory Computing to Simplify Big Data Analytics

PLA 7 WAYS TO USE LOG DATA FOR PROACTIVE PERFORMANCE MONITORING. [ WhitePaper ]

Big Data Are You Ready? Thomas Kyte

In-Database Analytics

The Value of Advanced Data Integration in a Big Data Services Company. Presenter: Flavio Villanustre, VP Technology September 2014

Key differences between virtualization and cloud computing

Business Analytics and the Nexus of Information

Content Analyst's Cerebrant Combines SaaS Discovery, Machine Learning, and Content to Perform Next-Generation Research

Service Oriented Architecture(SOA)

Data Centric Computing Revisited

Ceph Distributed Storage for the Cloud An update of enterprise use-cases at BMW

Using Ultra-Large Data Sets in Healthcare New Questions-New Answers

Certified Information Professional 2016 Update Outline

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

Who needs humans to run computers? Role of Big Data and Analytics in running Tomorrow s Computers illustrated with Today s Examples

Technology and Trends for Smarter Business Analytics

Whitepaper: Globanet Cloud Enablement Service

Intel Platform and Big Data: Making big data work for you.

Introduction to DISC and Hadoop

Big Data with Rough Set Using Map- Reduce

Certified Information Professional (CIP) Certification Maintenance Form

Transforming the Telecoms Business using Big Data and Analytics

IBM Content Analytics with Enterprise Search, Version 3.0

Analytics In the Cloud

Cray: Enabling Real-Time Discovery in Big Data

hmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau

Big Data Performance Growth on the Rise

Auto Classification and the Holy Grail for Records Managers

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform

Text Analytics with Ambiverse. Text to Knowledge.

Innoveren door te leren

Big Data Challenges and Success Factors. Deloitte Analytics Your data, inside out

Big Data and Transactional Databases Exploding Data Volume is Creating New Stresses on Traditional Transactional Databases

Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012

High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances

Getting to Know Big Data

Big Data: Study in Structured and Unstructured Data

Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks

Focus on the business, not the business of data warehousing!

MapReduce and Hadoop Distributed File System

Interactive data analytics drive insights

3 Red Hat Enterprise Linux 6 Consolidation

How To Handle Big Data With A Data Scientist

Data Centric Systems (DCS)

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database

Convergence of Big Data and Cloud

Transcription:

IBM Watson : Beyond playing Jeopardy! Katharine Frase, VP Industries Research, IBM with thanks to: David Ferrucci, Principal Investigator, DeepQA Team @ IBM Research April 24, 2012

Want to Play Chess or Just Chat? Chess A finite, mathematically well-defined search space Limited number of moves and states All the symbols are completely grounded in the mathematical rules of the game Human Language Words by themselves have no meaning Only grounded in human cognition Words navigate, align and communicate an infinite space of intended meaning Computers can not ground words to human experiences to derive meaning

Informed Decision Making: Search vs. Expert Q&A Decision Maker Has Question Distills to 2-3 Keywords Reads Documents, Finds Answers Finds & Analyzes Evidence Decision Maker Asks NL Question Considers Answer & Evidence Search Engine Finds Documents containing Keywords Delivers Documents based on Popularity Expert Understands Question Produces Possible Answers & Evidence Analyzes Evidence, Computes Confidence Delivers Response, Evidence & Confidence

Easy Questions? ln((12,546,798 * π)) ^ 2 / 34,567.46 = 0.00885 Select Payment where Owner= David Jones and Type(Product)= Laptop, Owner David Jones Serial Number 45322190-AK Invoice # Vendor Payment INV10895 MyBuy $104.56 Serial Number Type Invoice # 45322190-AK LapTop INV10895 David Jones David Jones = Dave Jones David Jones 4 IBM 2011 Confidential IBM Corporation

Hard Questions? Computer programs are natively explicit, fast and exacting in their calculation over numbers and symbols.but Natural Language is implicit, highly contextual, ambiguous and often imprecise. Person Birth Place Structured Where was X born? One day, from among his city views of Ulm, Otto chose a water color to send to Albert Einstein as a remembrance of Einstein s birthplace. X ran this? A. Einstein ULM Person If leadership is an art then surely Jack Welch has proved himself a master painter during his tenure at GE. Organization J. Welch GE Unstructured

The Jeopardy! Challenge: A compelling and notable way to drive and measure the technology of automatic Question Answering along 5 Key Dimensions Complex Language Accurate Confidence Broad/Open Domain High Precision High Speed $200 If you're standing, it's the direction you should look to check out the wainscoting. $600 In cell division, mitosis splits the nucleus & cytokinesis splits this liquid cushioning the nucleus $1000 The first person mentioned by name in The Man in the Iron Mask is this hero of a previous book by the same author. $2000 Of the 4 countries in the world that the U.S. does not have diplomatic relations with, the one that s farthest north

The Best Human Performance: Our Analysis Reveals the Winner s Cloud Each dot represents an actual historical human Jeopardy! game Top human players are remarkably good. Winning Human Performance Grand Champion Human Performance Computers? 2007 QA Computer System 7 More Confident Less Confident

What is Watson? Decomposition Hypothesis Generation Hypothesis & Evidence Scoring Question & Topic Analysis Synthesis Final Confidence Merging & Ranking Workload Optimized

DeepQA: The Technology Behind Watson Massively Parallel Probabilistic Evidence-Based Architecture Generates and scores many hypotheses using a combination of 1000 s Natural Language Processing, Information Retrieval, Machine Learning and Reasoning Algorithms. These gather, evaluate, weigh and balance different types of evidence to deliver the answer with the best support it can find. Question Question & Topic Analysis Primary Search Multiple Interpretations Answer Sources 100 s sources Question Decomposition Candidate Answer Generation 100 s Possible Answers Hypothesis Generation Answer Scoring 1000 s of Pieces of Evidence Evidence Sources Evidence Retrieval Hypothesis and Evidence Scoring Deep Evidence Scoring 100,000 s Scores from many Deep Analysis Algorithms Synthesis Learned Models help combine and weigh the Evidence Balance & Combine Models Models Models Models Models Models Final Confidence Merging & Ranking Hypothesis Generation... Hypothesis and Evidence Scoring Answer & Confidence

Automatic Learning From Reading Volumes of Text Syntactic Frames Semantic Frames Inventors patent inventions (.8) Officials Submit Resignations (.7) People earn degrees at schools (0.9) Fluid is a liquid (.6) Liquid is a fluid (.5) Vessels Sink (0.7) People sink 8-balls (0.5) (in pool/0.8)

Some Basic Jeopardy! Clues This fish was thought to be extinct millions of years ago until one was found off South Africa in 1938 Category: ENDS IN "TH" Answer: coelacanth The type of thing being asked for is often indicated but can go from specific to very vague When hit by electrons, a phosphor gives off electromagnetic energy in this form Category: General Science Answer: light (or photons) Secy. Chase just submitted this to me for the third time--guess what, pal. This time I'm accepting it Category: Lincoln Blogs Answer: his resignation 19

Categories are not as simple as the seem Watson uses statistical machine learning to discover that Jeopardy! categories are only weak indicators of the answer type. U.S. CITIES St. Petersburg is home to Florida's annual tournament in this game popular on ship decks (Shuffleboard) Rochester, New York grew because of its location on this (the Erie Canal) Country Clubs From India, the shashpar was a multi-bladed version of this spiked club (a mace) A French riot policeman may wield this, simply the French word for "stick (a baton) Authors Archibald MacLeish? based his verse play "J.B." on this book of the Bible (Job) In 1928 Elie Wiesel was born in Sighet, a Transylvanian village in this country (Romania)

Evaluating Possibilities and Their Evidence In cell division, mitosis splits the nucleus & cytokinesis splits this liquid cushioning the nucleus. Organelle Vacuole Cytoplasm Plasma Mitochondria Blood Is( Cytoplasm, liquid ) = 0.2 Is( organelle, liquid ) = 0.1 Is( vacuole, liquid ) = 0.2 Is( plasma, liquid ) = 0.7 Many candidate answers (CAs) are generated from many different searches Each possibility is evaluated according to different dimensions of evidence. Just One piece of evidence is if the CA is of the right type. In this case a liquid. Cytoplasm is a fluid surrounding the nucleus Wordnet Is_a(Fluid, Liquid)? Learned Is_a(Fluid, Liquid) yes.

Divide and Conquer (Typical in Final Jeopardy!) Lyndon B Johnson Must identify and solve sub-questions from different sources to answer the top level question In 1968 this man was U.S. president. When "60 Minutes" premiered this man was U.S. president.? The DeepQA architecture attempts different decompositions and recursively applies the QA algorithms 22

Some Growing Pains (early answers) the People THE AMERICAN DREAM Decades before Lincoln, Daniel Webster spoke of government "made for", "made by" & "answerable to" them No One NEW YORK TIMES HEADLINES An exclamation point was warranted for the "end of" this! In 1918 WW I MILESTONES In 1994, 25 years after this event, 1 participant said, "For one crowning moment, we were creatures of the cosmic ocean a sentence Apollo 11 moon landing the Big Bang THE QUEEN'S ENGLISH Give a Brit a tinkle when you get into town & you've done this Call on the phone urinate

DeepQA: Incremental Progress in Answering Precision on the Jeopardy Challenge: 6/2007-11/2010 100% IBM Watson Playing in the Winners Cloud 90% 80% v0.8 11/10 V0.7 04/10 70% v0.6 10/09 Precision 60% 50% 40% 30% v0.5 05/09 v0.4 12/08 v0.1 12/07 v0.3 08/08 v0.2 05/08 20% 10% Baseline 12/06 0% 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% % Answered

One Jeopardy! question can take 2 hours on a single 2.6Ghz Core Optimized & Scaled out on 2,880-Core Power750 using UIMA-AS, Watson is answering in ~3 seconds 90 x IBM Power 750 1 servers 2880 POWER7 cores POWER7 3.55 GHz chip 500 GB per sec on-chip bandwidth 10 Gb Ethernet network 15 Terabytes of memory 20 Terabytes of disk, clustered Can operate at 80 Teraflops Runs IBM DeepQA software Scales out with and searches vast amounts of unstructured information with UIMA & Hadoop open source components SUSE Linux provides a cost-effective open platform which is performanceoptimized to exploit POWER 7 systems 10 racks include servers, networking, shared disk system, cluster controllers

Watson brings many innovations to apply to business issues Watson Capabilities Best Fit for Watson Watson Solution Area and Industry Application Assisting Research (e.g., Financial Services) 1 Natural language understanding Analysis of unstructured data Assisting research of macroeconomic trends, current events and historic data to support investment decisions Results: increased alpha and risk management due to improved investment decision making and customer insights 2 3 4 5 Broad domain of unstructured data Parallel hypothesis generation and confidence scoring Iterative Question/Answering to refine results Machine learning Critical questions that require decision support with prioritized recommendations and evidence High value in decision support Leverage scale to maximize machine learning and improve outcomes over time Other Applications: Customer Insights, Knowledge Mgmt Assisting Discovery (e.g., Legal Services) Assisting research of historical and current documents to discover relevant references and identify relationships. Results: increased confidence in due diligence, prior art or precedent evaluations, due to context understanding of full passages and learned value of sources, vs keyword search based methods. Other Applications: R&D Mgmt, Compliance Mgmt Assisting Diagnosis (e.g., Healthcare) Assisting diagnosis and suggestions for patient condition based on patient data, family history, medical journals etc. Results: improved patient care and payer/provider cost management based on more efficient patient outcomes 26 Other Applications: Contact Center, Technical Support

THANK YOU IBM 2011 Confidential IBM Corporation