Watson und Big Data Wolfgang Nimführ IBM Software Group
Am 16.Feb hat das IBM Watson System Jeopardy gewonnen
Was sind typische Jeopardy Fragen? Category: Cambridge (Ambiguity) Mit viel Gravitas wurde dieser Jünger der Dreifaltigkeit 1669 der Lucasian Professor für Mathematik. Category: Do not Worry (Wordplay) Du brauchst nur ein Nickerchen! Du hast nicht diese Schlafstörung die dazu führen kann, dass man im Stehen einschläft. Category: Proverbs (Brainteaser) Sogar eine kaputte von diesen an der Wand stimmt zwei mal am Tag. Wer ist Isaac Newton? Was ist Narkolepsie? Was ist eine Uhr?
Was ist die Technologie hinter Watson? Question Answer Sources Primary Search Candidate Answer Generation Answer Scoring Evidence Sources Evidence Retrieval Deep Evidence Scoring Learned Models help combine and weigh the Evidence Models Models Models Models Models Models Question & Topic Analysis Question Decomposition Hypothesis Generation Hypothesis and Evidence Scoring Synthesis Final Confidence Merging & Ranking Hypothesis Generation Hypothesis and Evidence Scoring... Answer & Confidence
Was ist Keyword Matching? In May 1898 Portugal celebrated the 400th anniversary of this explorer s arrival in India. In May, Gary arrived in India after he celebrated his anniversary in Portugal. arrived in celebrated Keyword Matching celebrated In May 1898 Keyword Matching In May Evidence suggests Gary is the answer BUT the system must learn that keyword matching may be weak relative to other types of evidence 400th anniversary arrival in India explorer Portugal Keyword Matching Keyword Matching Keyword Matching Gary India anniversary in Portugal
Was ist DeepQA Matching? In May 1898 Portugal celebrated the 400th anniversary of this explorer s arrival in India. On On 27th 27th May May 1498, Vasco da dagama On On landed the 27th 27 th May 1498, Vasco da Gama in in of Kappad May 1498, Beach Vasco da Gama landed landed in in Kappad Kappad Beach Beach celebrated Portugal Search Far and Wide Explore many hypotheses Find Judge Evidence Many inference algorithms landed in May 1898 400th anniversary Temporal Reasoning 27th May 1498 Stronger evidence can be much harder to find and score. arrival in India explorer Statistical Paraphrasing GeoSpatial Reasoning Kappad Beach Vasco da Gama The evidence is still not 100% certain.
Wie wird Text verstanden?
Was ist ein lernendes System? Society Nature Institutions Archives (Natural Interfaces) Active Learning Verification Engines (e.g. Simulations) Training and Learning Engines To Build Models and Define Insight Hypothesis Engines To Understand and Plan Actions Policy Engine Business, Legal and Ethical Rules Outcome Engine Actuation and Validation
Welche Produkte sind heute verfügbar? InfoSphere BigInsights IBM Business Analytics InfoSphere Streams IBM Content Analytics InfoSphere Warehouse IBM Power Systems *) nicht alle hier genannte Technologien wurden für Jeopardy genutzt
Was ist Big Data?
Was ist die IBM Big Data Platform? IBM Big Data Solutions Client and Partner Solutions Big Data User Environments Developers End Users Administrators Enterprise Software AGENTS Big Data Enterprise Engines INTEGRATION Streaming Analytics Connectors Internet Scale Analytics Accelerators and Blueprints IBM Non-IBM Open Source Foundational Components
Wie spielt Big Data mit Watson Technologien zusammen? Approx. 200M pages of text (To compete on Jeopardy!) POS Data CRM Data Social Media InfoSphere BigInsights Distilled Insight - Spending habits - Social relationships - Buying trends Watson s Memory Advanced search and analysis
Was sind typische Einsatzszenarien für Big Data und Watson? Neonatal Care Trading Advantage Environment Law Enforcement Customer Retention Telecom Manufacturing Traffic Control Risk Prevention