Watson. An analytical computing system that specializes in natural human language and provides specific answers to complex questions at rapid speeds

Similar documents
MAN VS. MACHINE. How IBM Built a Jeopardy! Champion x The Analytics Edge

Andre Standback. IT 103, Sec /21/12. IBM s Watson. GMU Honor Code on I am fully aware of the

Putting IBM Watson to Work In Healthcare

CHAPTER 15: IS ARTIFICIAL INTELLIGENCE REAL?

What is Artificial Intelligence?

» A Hardware & Software Overview. Eli M. Dow <emdow@us.ibm.com:>

CSC384 Intro to Artificial Intelligence

GRADE 10 Listening Comprehension From "Humans Take on Computer in Jeopardy" In 1997, there was a very famous chess match. The world champion chess

IBM's Watson could usher in new era of ALS research and medicine ons/ideas/index.html?

COMP 590: Artificial Intelligence

CSE 517A MACHINE LEARNING INTRODUCTION

IBM Watson : Beyond playing Jeopardy!

Innoveren door te leren

How Big Data and Artificial Intelligence Change the Game for. presented by Jamie Bisker Senior Analyst, P&C Insurance Aite Group

CS440/ECE448: Artificial Intelligence. Course website:

Chapter 11. Managing Knowledge

Dr. John E. Kelly III Senior Vice President, Director of Research. Differentiating IBM: Research

Artificial Intelligence (AI)

Who needs humans to run computers? Role of Big Data and Analytics in running Tomorrow s Computers illustrated with Today s Examples

Power Systems Trends and Directions

Understanding Proactive vs. Reactive Methods for Fighting Spam. June 2003

The Prolog Interface to the Unstructured Information Management Architecture

Applications of Artificial Intelligence. Omark Phatak

Magento & Zend Benchmarks Version 1.2, 1.3 (with & without Flat Catalogs)

I N T E L L I G E N T S O L U T I O N S, I N C. DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD

Facilitating Adult Learning

From Particles To Electronic Trading. Simon Bevan

An Oracle Best Practice Guide April Best Practices for Knowledgebase and Search Effectiveness

EXECUTIVE SUPPORT SYSTEMS (ESS) STRATEGIC INFORMATION SYSTEM DESIGNED FOR UNSTRUCTURED DECISION MAKING THROUGH ADVANCED GRAPHICS AND COMMUNICATIONS *

BBC LEARNING ENGLISH 6 Minute English Do you fear Artificial Intelligence?

Lotto Master Formula (v1.3) The Formula Used By Lottery Winners

The Case for Right Brain Thinking in a Left Brain World. The Case for a Whole New Way of Thinking and Succeeding Bill Duffy RN MJ CNOR

Virtual Personal Assistant

Hyper-connectivity and Artificial Intelligence

Driving Insurance World through Science Murli D. Buluswar Chief Science Officer

How I won the Chess Ratings: Elo vs the rest of the world Competition

Learning is a very general term denoting the way in which agents:

RIGHTNOW GUIDE: KNOWLEDGE BASE AND SEARCH BEST PRACTICES

Hexaware E-book on Predictive Analytics

Beating the NCAA Football Point Spread

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Using Artificial Intelligence to Manage Big Data for Litigation

Collecting Polish German Parallel Corpora in the Internet

Artificial Intelligence Beating Human Opponents in Poker

How To Handle Big Data With A Data Scientist

Game Playing in the Real World. Next time: Knowledge Representation Reading: Chapter

A Hurwitz white paper. Inventing the Future. Judith Hurwitz President and CEO. Sponsored by Hitachi

BEGINNER S GUIDE TO BETTING THE RACES

Liferay Portal Performance. Benchmark Study of Liferay Portal Enterprise Edition

The Benefits of POWER7+ and PowerVM over Intel and an x86 Hypervisor

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)

Software Engineering of NLP-based Computer-assisted Coding Applications

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

SYNTASA's Personalization Maturity Index by Kirk Borne, Advisor to SYNTASA TM July 2014

Mus is the most popular Spanish game, due to its attractiveness and characteristics. This game was born in Basque Country

HOW IS C360 DIFFERENT THAN TRADITIONAL LEAD SCORING?

IBM RATIONAL PERFORMANCE TESTER

The Relationship between Artificial Intelligence and Finance

Computer-Based Text- and Data Analysis Technologies and Applications. Mark Cieliebak

SURVEY REPORT DATA SCIENCE SOCIETY 2014

Today. Important From Last Time. Old Joke. Computer Security. Embedded Security. Trusted Computing Base

BMC Remedy vs. IBM Control Desk. How to choose between BMC Remedy and IBM Control Desk December 2014

Introduction to Cloud Services

COLLECTIVE INTELLIGENCE: A NEW APPROACH TO STOCK PRICE FORECASTING

Infrastructure Matters: POWER8 vs. Xeon x86

Operationalise Predictive Analytics

Important dimensions of knowledge Knowledge is a firm asset: Knowledge has different forms Knowledge has a location Knowledge is situational Wisdom:

WHITEPAPER A BUSINESS CASE FOR VIRTUAL DESKTOP INFRASTRUCTURE (VDI) DEPLOYMENTS.

Software & systems for the neuromorphic generation of computing. Peter Suma co-ceo peter.suma@appliedbrainresearch.

Computer Science Introduction

A Middleware Strategy to Survive Compute Peak Loads in Cloud

Programme Specification for the. Cardiff Metropolitan University. Master of Science (MSc) in Information Technology

NATIONAL: THE GOOD AND MOSTLY BAD OF ARTIFICIAL INTELLIGENCE

You Must Ask Your Internet Marketing Consultant to Multiply Results

We employed reinforcement learning, with a goal of maximizing the expected value. Our bot learns to play better by repeated training against itself.

Scope Of Services At Dataflurry Prospectus

Transcription:

Watson An analytical computing system that specializes in natural human language and provides specific answers to complex questions at rapid speeds I.B.M. OHJ-2556 Artificial Intelligence Guest lecturing Teemu J. Heinimäki March 12, 2013

Outline Jeopardy! What is Watson? History Implementation and properties How Does Watson Work? Strengths and Weaknesses Tactics of Watson The IBM Challenge Contest Future Views in 2011 and Actual Development Since The Significance of Watson Computer vs. Human Brain The Essence of Real AI

Jeopardy! A popular American quiz show with answer-question format, adapted also internationally Three competitors Three rounds Jeopardy! Six categories, five clues per category, clues valued $200 $1000 One daily double ($5 highest dollar value / true daily double) Double Jeopardy! $400 $2000 Two daily doubles Final Jeopardy! Participated by the players with money Single question, bet $0-all Normally the winner keeps the money earned and continues to the next game, second and third get consolation prizes

What is Watson? An open domain QA machine by I.B.M. understands questions in natural language finds information in relevant sources determines the confidence responds with factual answers An instance of I.B.M.'s DeepQA technology Application of several fields, like machine learning, natural language processing, information retrieval, knowledge presentation, hypothesis generation... Combination of hardware and software Tackled Jeopardy!, has commercial applications Near the Holy Grail of the AI?

History of Watson (1) Achievements of latest decades: linguistic tools: synonym finders, rhyming dictionaries, classifiers for recognizing parts of the speech... QA systems for simple tasks Latest 10 15 years: material became available online (blogs, wikis, newspapers, academic papers...) Computing resources statistical document analysis became practical

History of Watson (2) 1997: Deep Blue vs. Garry Kasparov Much publicity for I.B.M. However, not a marketable product little direct income Another high-profile project, the grand challenge (that would be applicable in the real world) was sought after. An advanced QA system seemed to be a good choice, while QA had become important for different firms.

History of Watson (3) Possibility of competing with humans in Jeopardy! was suggested. First thought to be simply too tough a task (human language is hard, unlike chess=) Complex questions to be answered in seconds. The scope is very wide. Natural language: intended meanings, nuances, connotations, allusions, riddles, irony, ambiguities... Occasionally the answer cannot be found from any single source, but it must be synthesized based on several sources

History of Watson (4) 2006 the I.B.M.'s most advanced QA system was tested with poor results 2007 Dr. Ferrucci got 3 5 years and his team size was increased (20 25 people) 2008: Watson could theoretically beat some lesser Jeopardy! champions. Possibility of having a real Jeopardy! show aired in the national TV was inquired. Practice matches were organized (error analysis). January 13, 2011: Press conference demonstration in NY. February 14 16, 2011: Two special Jeopardy! exhibition matches on TV. Watson against former champions Ken Jennings and Brad Rutter for $1M Commercial applications

Implementation & properties Distinguishes itself by speed and memory capacity: 90 IBM Power 750 Linux servers, 2880 Power7 processor cores (3.55 GHz), running at 80 teraflops, 15 16 TB of RAM. (Deep Blue: about 1 teraflops) Code written in Java, C++ and Prolog. Components deployed and integrated using UIMA-AS (a standard asynchronous messaging middleware). Approximately 6 million logic rules to determine the best answer Knowledge base: about 200 million pages of documents of different types (4 TB) Besides natural language texts, also some structural and semistructural data (formal KBs, structured text)

How Does Watson Work? (1) Uses several well-known statistical methods More than 100 algorithms run simultaneously to analyze a question Another algorithm set rank the answer candidates e.g. by inserting possible solutions into the original phrase and running the search again for support cross-checking against time and space Machine learning framework used for weighting different algorithms and their results Also dynamic learning within categories: Verifying category interpretation: knowledge about knowledge

How Does Watson Work? (2) Data (unstructured) Pipeline of text analysis processes Structure & meaning Question NL processing techniques What is looked for Primary searches Search results Parallel analysis,evidence gathering, scoring 300 500 answer candidates ML Merging scores Candidates with confidence values

Strengths and weaknesses Knowledge acquired by the machine itself in reasonable time Good at brute force searches Applicability Cannot prove things (as opposed to knowledge-based systems) Not so good with short clues Must calculate confidences, not instantaneous knowledge about knowledge

Playing Tactics of Watson (1) Hunts for the daily doubles, when opportunity to select the category and the clue No need to worry about competitors being faster to buzz Good chances to boost score for everyone Uses statistical knowledge When no daily doubles anymore, selects the lowest clue value in a category with significant number of high valued ones Aims to checks its understanding about the category with low risk before high stakes

Playing Tactics of Watson (2) Daily double wagering In-category DD confidence Game State Evaluator a regression model estimating the winning chances at any stage Computing the expected chance to win for every legal bet Risk analytics also involved

Playing Tactics of Watson (3) Final Jeopardy! wagering Wagering in game one of two-game match resembles the DD wagering, except likely accuracy must be predicted based on the category title only. Wagering in single games and second games in matches Positions of the players: predicting the opponents' bets. A library of known strategy rules, special additions for different situations. Either uses a suitable rule or simulates different bets. Considers also prizes for second and third places

The IBM Challenge Contest (1) February 14 16, 2011: A match of two special Jeopardy! exhibition games on TV. Watson against former champions Ken Jennings and Brad Rutter for $1M Watson represented by an avatar, synthesized voice and answer panel. Buzzing using a robot finger (delay about 8 ms).

The IBM Challenge Contest (2) First game: Jeopardy! round: Watson and Rutter $5000, Jennings $2000 Watson repeated a wrong answer of Jennings Double Jeopardy! round: Watson buzzed and answered 7 times correctly in a row ($21035) Misunderstood an art period question, but dominated the game again soon Final jeopardy before: W $36681, R $5400, J $2400 Humans answered correctly, but W answered What is Toronto????? However, bet only $947 The game ended with W $35734, R $10400 & J $4800

The IBM Challenge Contest (3) Second game: Jeopardy! round: Humans successful. After this, Watson placed second. (W $4800, J $8600, R $2400) Double Jeopardy! round: Wrong answer to DD Humans still quite successful, but Watson took the lead anyway. (W $23440, J $18200, R $5600) Final Jeopardy! All answered correctly. The second game ended with W $41413, J $19200, and R $11200. The final result of the whole match: W $77147, J $24000, and R $21600

Future Views in 2011 (1) Goal to start selling Watson versions to companies in 1 2 year Health diagnosis application I.B.M.'s primary focus. Rapid growth of knowledge Hard diagnoses Medical Watson being built by I.B.M., Nuance Healthcare, Columbia University and University of Maryland. Goal to install Watson to every medical center's computational cloud (in U.S.A.)

Future Views in 2011 (2) Virtual call centers Help desks Helping with bureaucracy Web self-service Contradiction engines Police, CIA & Co. etc. Price tag of several M$s (needs to be run on at least $1 million I.B.M. server). The situation in ten years? In fifteen?

Future Views in 2011 (3) Concerns: Orders from Watson M.D.? Unemployment? Human resistance, ego Computers finally taking over the world =)

Development 2011 2013 (1) Miniaturization size of a pizza box Processing speed improved to 240% of the original Making business with a medical (oncological) version has started. Wellpoint exclusive reseller, at least six instances installed so far. Cloud or local server Utilization management decision accuracy 90% (not yet with cancer diagnoses) Provides treatment options, confidences, supporting evidence. Information for Watson can be given in plain text. Web portal for other providers will be made available.

Development 2011 2013 (2) Watson and financial applications The challenges somewhat similar as in healthcare vast amounts of data Citigroup, Inc. the first client in financial services offered by Watson, I.B.M. working also with other financial institutions Cloud service, earning a percentage of additional revenue gained Watson in education & science? An academic institute deployment: a Watson computing system will be donated to the Rensselaer Polytechnic Institute Call center pilots

The Significance of Watson A good commercial A significant milestone in AI (?) nothing new..?...but at least a nice compilation of existing techniques a great demonstration of the capability of state-of-art AI Human way of processing language? one step further towards the Star Trek computer one step further towards passing the Turing test Important applications Good for business high expectations of revenue May really help people in many areas dealing with Big Data..(?)

Computer vs. Human Brain (in Jeopardy! etc.) Computer brain analogy in cognitive science High parallelism Watson simulating some parts of human language processing? Power efficiency: several kws vs. 20 W Emotions a lack of them an advantage Reliability of the memory Data analysis methods Connecting relevant pieces of text logically Understanding meanings and nuances of the language Reaction speed Buzzing behavior and confidence Nonverbal communication

The Essence of Real AI? Thinking/acting humanly/rationally Consciousness? Understanding? Fluid vs. crystallized intelligence Turing test

Some Material http://www-03.ibm.com/innovation/us/watson/ http://www.forbes.com/sites/bruceupbin/2013/02/08/ibms-watson-gets-its-first-piece-of-business-in-healthcare/ http://en.wikipedia.org/wiki/watson_%28computer%29 http://www.nj.com/news/index.ssf/2013/01/ibms_jeopardy-winning_supercom.html http://www.bloomberg.com/news/2012-03-05/ibm-s-watson-computer-gets-wall-street-job-one-year-after-jeopardywin.html http://slashdot.org/topic/bi/when-ibms-watson-learned-too-much-about-natural-language/ https://www.facebook.com/ibmwatson http://www.wired.co.uk/news/archive/2013-02/11/ibm-watson-medical-doctor http://spectrum.ieee.org/automaton/robotics/artificial-intelligence/ibm-watson-jeopardy-computer-shuts-down-humans http://en.wikipedia.org/wiki/jeopardy!