Watson, what s on, what s next?



Similar documents
Putting IBM Watson to Work In Healthcare

IBM Watson : Beyond playing Jeopardy!

A Strategic Approach to Unlock the Opportunities from Big Data

Paul J. Ledak Vice President, IBM Research IBM Corporation

MAN VS. MACHINE. How IBM Built a Jeopardy! Champion x The Analytics Edge

Microsoft + SOA = Sant? Joakim Linghall Principal System Engineer SOA and Business Process joakiml@microsoft.com

What is Watson An Overview

IBM idag. Johan Rittner VD IBM Svenska

Watson. An analytical computing system that specializes in natural human language and provides specific answers to complex questions at rapid speeds

How Big Data and Artificial Intelligence Change the Game for. presented by Jamie Bisker Senior Analyst, P&C Insurance Aite Group

How To Get Healthy With A Game Called Angel Hour

» A Hardware & Software Overview. Eli M. Dow <emdow@us.ibm.com:>

Auto Classification and the Holy Grail for Records Managers

Adjectives/adverbs When do you use careless and when do you use carelessly?

DECISION/BESLUT

Effektiv hantering av Data och Information i M3 Joakim Jannerfeldt Anders Cottman

Ange om en aktivitet har medfört att en tjänsteresa har utförts med flyg under 2013, och i sådana fall antal gånger.

Sweden National H.O.G. Rally July 2010

Intrepid Travel Language Guides

IBM Watson Ecosystem. Getting Started Guide

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Introduktion till SAS 9 Plattformen Helikopterkursen

PRTK. Password Recovery ToolKit EFS (Encrypting File System)

Dr. John E. Kelly III Senior Vice President, Director of Research. Differentiating IBM: Research

Tanden Care Provider Interfaces Reverse Claim v1

IBM's Watson could usher in new era of ALS research and medicine ons/ideas/index.html?

Rek. 1995:- Technical specifications SB12V3200E-AC SB12V3200E-AC. Recharges much faster. Longer service life. Only 1/3 of the size.

Vocabulary in A1 level second language writing

Maskinöversättning F2 Översättningssvårigheter + Översättningsstrategier

Design Suggestions for Danske Bank SE

The Problem With Adding Positive and Negative Numbers

Tanden Care Provider Interfaces PreAssessmentSTB v3

GeoInt 2015 Watson Workshop

SAS Education Providing knowledge through global training and certification. SAS Foundation. Kursöversikt 2010

Mot 100% förnybar energi i Sverige och Världen

Big Data with Rough Set Using Map- Reduce

SYNTASA's Personalization Maturity Index by Kirk Borne, Advisor to SYNTASA TM July 2014

Innoveren door te leren

Strategisk planering, Aktiv demokrati, 6-8 jan 2012

SAS Data Integration SAS Business Intelligence

Category work in courtroom talk about domestic violence: Gender as an interactional accomplishment in child custody disputes

Interface Programmera mot interface Johan Eliasson Johan Eliasson Interface kan bryta beroendekedjor Skriv generell kod «Type» Class2 Interface

Revision av ISO 15189

Använd SAS för att bearbeta och analysera ditt data i Hadoop

Beroendemekanismer- ett beroende som andra?

Enfo Zystems Services

THE INBOUND MARKETING WAY THE BATTLE OF GOOGLE MARKETING HOUSE MARKETING HOUSE MARKETING HOUSE

Elektronikavfall. Thomas Lindhqvist IIIEE Lund University. 18 February 2009

Anders Ingvarsson (CEO) LifeAssays today May, 2015

Jag valde att använda Net-EPP_client.php från centralnic för att komma igång.

Solve your toughest challenges with data mining

TDDB84 Design Patterns Exam

Chapter 11. Managing Knowledge

Coop Policy Sustainable Business: Impact on Coop private brand

Product Lifecycle Management (PLM) Service Providers. On Leading PLM Solutions

SEO: What is it and Why is it Important?

Manjula Ambur NASA Langley Research Center April 2014

How To Work For A Car Maker

If You Get Sick during a Temporary Stay Abroad [Sjuk vid tillfällig vistelse utomlands]

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines

3gamma Från traditionell IT-leverans till modern, processtyrd tjänsteleverans i en multi-sourcing miljö. Peter Wahlgren, September 2013

Sentiment Analysis on Big Data

What Is the Productivity Gain in Machine Translation of Subtitles?

Natural Language to Relational Query by Using Parsing Compiler

Solve Your Toughest Challenges with Data Mining

Sreerupa Sen Senior Technical Staff Member, IBM December 15, 2013

ediscovery and Search of Enterprise Data in the Cloud

PROCESSING & MANAGEMENT OF INBOUND TRANSACTIONAL CONTENT

DATA SCIENCE CURRICULUM WEEK 1 ONLINE PRE-WORK INSTALLING PACKAGES COMMAND LINE CODE EDITOR PYTHON STATISTICS PROJECT O5 PROJECT O3 PROJECT O2

Oleksandr Romanko, Ph.D. Senior Research Analyst, Risk Analytics Business Analytics, IBM Canada October 8, Business Analytics and Optimization

Business Process Testing Accelerator for PeopleSoft Applications

IT Governance behöver inte vara någon svår konst

SPATIAL DATA CLASSIFICATION AND DATA MINING

Ironfan Your Foundation for Flexible Big Data Infrastructure

What is Artificial Intelligence?

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast

Readme10_054.doc page 1 of 7

Disruption ahead Deloitte s point of view on IBM Watson

School of Electrical Engineering

Machine Learning and Predictive Analytics Foster Growth [1]

Skyrocket Your Cloud Business with Digital Marketing. Rob Bracey

Transcription:

Watson, what s on, what s next? Mikael Haglund CTO, IBM Sweden mikael.haglund@se.ibm.com about.me/mikaelhaglund

2 2013 IBM Corporation

Sinnen Kognitiva System Hjärnliknande chips Cognitive Computing Do not distribute 2013 IBM Corporation 3

Watson Do not distribute 2013 IBM Corporation

IBM Watson, an early example of a cognitive system 1 Understands natural language and human speech 2 Generates and evaluates hypothesis for better outcomes 3 Adapts and Learns from user selections and responses built on a massively parallel probabilistic evidence-based architecture 2013 IBM Corporation

Open-Domain in Business Questions vs. Clues Questions What is the limit of a Roth IRA contribution based on an annual income of $110,000? Clues This is the limit of a Roth IRA contribution based on an annual income of $110,000. What drug has been shown to relieve the symptoms of ADD with relatively few side effects? This drug has been shown to relieve the symptoms of ADD with relatively few side effects. I was running HALO3 and my machine took a dive with a pink screen, what are the possible causes? I was running HALO3 and my machine took a dive with a pink screen, these are the possible causes.

Informed decision making: search vs. Watson Decision Maker Has Question Search Engine Distills to 2-3 Keywords Reads Documents, Finds Answers Finds & Analyzes Evidence Finds Documents Containing Keywords Delivers Documents Based on Popularity Decision Maker Asks NL Question Considers Answer & Evidence Watson Understands Question Produces Possible Answers & Evidence Analyzes Evidence, Computes Confidence Delivers Response, Evidence & Confidence

Some Challenges

Some Basic Jeopardy! Clues This fish was thought to be extinct millions of years ago until one was found off South Africa in 1938 Category: ENDS IN "TH" Answer: coelacanth The type of thing being asked for is often indicated but can go from specific to very vague When hit by electrons, a phosphor gives off electromagnetic energy in this form Category: General Science Answer: light (or photons) Secy. Chase just submitted this to me for the third time--guess what, pal. This time I'm accepting it Category: Lincoln Blogs Answer: his resignation

Broad Domain We do NOT attempt to anticipate all questions and build databases. We do NOT try to build a formal model of the world In a random sample of 20,000 questions we found 2,500 distinct types*. The most frequent occurring <3% of the time. The distribution has a very long tail. And for each these types 1000 s of different things may be asked. Even going for the head of the tail will barely make a dent *13% are non-distinct (e.g, it, this, these or NA) Our Focus is on reusable NLP technology for analyzing vast volumes of as-is text. Structured sources (DBs and KBs) provide background knowledge for interpreting the text.

Different Types of Evidence: Keyword Evidence In May 1898 Portugal celebrated the 400th anniversary of this explorer s arrival in India. In May, Gary arrived in India after he celebrated his anniversary in Portugal. arrived in celebrated Keyword Matching celebrated In May 1898 Keyword Matching In May 400th anniversary Keyword Matching anniversary Portugal Keyword Matching in Portugal arrival in India Keyword Matching India explorer Gary

Different Types of Evidence: Deeper Evidence In May 1898 Portugal celebrated the 400th anniversary of this explorer s arrival in India. On On 27th 27th May May 1498, 1498, Vasco Vasco da da Gama Gama On landed 27th May landed in in Kappad Kappad 1498, Vasco Beach Beachda Gama On the 27 landed in th of May 1498, Vasco da Kappad Beach Gama landed in Kappad Beach Ø Search Far and Wide Ø Explore many hypotheses celebrated Portugal Ø Find Judge Evidence Ø Many inference algorithms landed in May 1898 400th anniversary Temporal Reasoning 27th May 1498 Stronger evidence can be much harder to find and score. arrival in India explorer Statistical Paraphrasing GeoSpatial Reasoning Date Math Paraphrases Geo-KB Kappad Beach Vasco da Gama The evidence is still not 100% certain. 16

Watson s Guts https://www.youtube.com/watch?v=dywo4zksfxw

How Watson works: DeepQA Architecture Learned Models help combine and weigh the Evidence Inquiry Answer Sources Primary Search Candidate Answer Generation Answer Scoring Evidence Sources Evidence Retrieval Deep Evidence Scoring Models Models Models Models Models Models Inquiry/Topic Multiple Analysis Interpretations of a question 100 s sources Inquiry Decomposition Hypothesis Generation Hypothesis and Evidence Scoring Synthesis Final Confidence Merging & Ranking Hypothesis Generation Hypothesis and Evidence Scoring Responses with Confidence

How Watson works: DeepQA Architecture Inquiry Inquiry/Topic Multiple Analysis Interpretations of a question Answer Sources Primary Search Candidate Answer Generation 100 s sources Inquiry Decomposition 100 s Possible Answers Hypothesis Generation Answer Scoring Evidence Sources 1000 s of Pieces of Evidence Evidence Retrieval Hypothesis and Evidence Scoring Deep Evidence Scoring 100,000 s Scores from many Deep Analysis Algorithms Synthesis Learned Models help combine and weigh the Evidence Balance & Combine Models Models Models Models Models Models Final Confidence Merging & Ranking Hypothesis Generation Hypothesis and Evidence Scoring Responses with Confidence

Moving beyond Jeopardy! is a non-trivial challenge Watson at Play 1 User Max. input was two sentences 5+ days to retrain Evidence not present Text-only input Q&A model Basic security Watson at Work 10s of thousands concurrent users Pages of input (e.g. medical record) Dynamic content ingestion Supporting evidence integral Text, tables and images as input Both Q&A + Conversation model High security (e.g. HIPAA)

Watson möjliggör tre klasser av kognitiva tjänster Fråga Absorbera och dra nytta av enorma mängder data Ställ nyanserade frågor för bättre insikter Förstå frågor ställda med naturligt språk Upptäck Hitta de djupa sambanden Begär mer information för att förbättra svaren Gå från sökning till att göra upptäckter Fatta beslut Absorbera och analysera källor från relevanta domäner Fatta evidens-baserade beslut Lär från varje situation, åtgärd och följd

IBM Watson can be applied to many pressing business priorities Watson Decision Advisor Watson Engagement Advisor Watson Discovery Advisor Watson Case Advisor https://www.youtube.com/watch?v=hzspc0h_mtm

IBM Watson can be applied to many pressing business priorities Watson Decision Advisor Watson Engagement Advisor Watson Discovery Advisor Watson Case Advisor

Traditional approaches to engaging with customers come up short 270B Calls made annually to call center costing $600B 1 in 2 incoming calls require escalation or go unresolved 61% of all calls could have been resolved with better access to information 4.6% Market value gain from a single point customer sat gain *Case studies based on Coremetrics, Sterling Commerce and Unica solutions

Customers expect personalization and control You don t know me Intolerance of mass-market, impersonalized approaches You re not connecting with me Demand for interaction on channel of choice You make it too hard Expectations for immediate results

Do not distribute 2013 IBM Corporation

Watson Engagement Advisor - 6 Weeks Deploy / 6 Mo. ROI Ready Build Teach Pilot Run Extend Identify content Develop Q&A training set Upload Q&A pairs Configure & adapt for use cases Validate UX Auto. ingest documents (PDF, HTML, etc.) Add content Test & evaluate Migrate to production Utilize Watson in production Full production Q&A pairs Expand corpus Utilize Watson in new domains Automated content ingestion tools Toolkit allows building custom UIs Q&A pairs for training & accuracy Activity Monitoring Tools Initial Ready/Run Cycle = 6 Weeks WEA End User UI Tight integration with existing infrastructure or stand alone offering

https://www.youtube.com/watch?v=mr-1janairs 2013 IBM Corporation

2013 IBM Corporation

Watson Ecosystem

Fluid, Powered by IBM Watson Jag och familjen tänker åka till svenska fjällen och gå en vandringsled, och sova i tält i sommar. Vi är inte vana vandrare så det kommer att vara ganska korta etapper. Vi är fyra personer. Vad behöver vi? Barnen kan inte bära ett eget tält.

Watson på Svenska? Brisman(?) IBM i Sveriges första chef, 1928

Äh, det är ju bara att översätta... (1) Chandeliers look great but nowadays do not usually use these items from which their name is derived. Bonus: Vad sökte man? What is a...

Äh, det är ju bara att översätta... (2) Chandeliers look great but nowadays do not usually use these items from which their name is derived. Översättning med Google translate: Ljuskronor ser bra ut, men nu för tiden brukar inte använda dessa poster från vilken deras namn härstammar. Borde vara något i stil med: Ljuskronor ser bra ut, men nu för tiden brukar de inte använda dessa föremål från vilka deras namn härstammar.

Sinnen Kognitiva System Hjärnliknande chips Cognitive Computing Do not distribute 2013 IBM Corporation 35

Smarter Computing 1997 Die Mensch Maschine Halb Wesen und halb Ding Die Mensch Maschine Halb Wesen und halb Überding /Kraftwerk

Tack! Mikael Haglund IBM mikael.haglund@se.ibm.com about.me/mikaelhaglund 2011 IBM