15-381: Artificial Intelligence. Probabilistic Reasoning and Inference



Similar documents
Machine Learning.

Comparing & Contrasting. - mathematically. Ways of comparing mathematical and scientific quantities...

STA 371G: Statistics and Modeling

Knowledge-based systems and the need for learning

I Have...Who Has... Multiplication Game

Elementary Statistics and Inference. Elementary Statistics and Inference. 16 The Law of Averages (cont.) 22S:025 or 7P:025.

Part III: Machine Learning. CS 188: Artificial Intelligence. Machine Learning This Set of Slides. Parameter Estimation. Estimation: Smoothing

Statistics in Geophysics: Introduction and Probability Theory

Introduction to Probability

Bayesian spam filtering

A Probabilistic Causal Model for Diagnosis of Liver Disorders

MAT 155. Key Concept. February 03, S4.1 2_3 Review & Preview; Basic Concepts of Probability. Review. Chapter 4 Probability

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

A Tutorial on Probability Theory

Probability and statistics; Rehearsal for pattern recognition

Betting interpretations of probability

1 What is Machine Learning?

It is remarkable that a science, which began with the consideration of games of chance, should be elevated to the rank of the most important

A Bayesian Network Model for Diagnosis of Liver Disorders Agnieszka Onisko, M.S., 1,2 Marek J. Druzdzel, Ph.D., 1 and Hanna Wasyluk, M.D.,Ph.D.

Stat 20: Intro to Probability and Statistics

The Basics of Graphical Models

GfK 2016 Tech Trends 2016

COMP 590: Artificial Intelligence

36 Odds, Expected Value, and Conditional Probability

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Bayesian Tutorial (Sheet Updated 20 March)

Solution (Done in class)

Public Key Cryptography. c Eli Biham - March 30, Public Key Cryptography

E3: PROBABILITY AND STATISTICS lecture notes

Week 2: Conditional Probability and Bayes formula

CS MidTerm Exam 4/1/2004 Name: KEY. Page Max Score Total 139

Chapter ML:IV. IV. Statistical Learning. Probability Basics Bayes Classification Maximum a-posteriori Hypotheses

Learning is a very general term denoting the way in which agents:

Artificial Intelligence and Politecnico di Milano. Presented by Matteo Matteucci

Bayesian Analysis for the Social Sciences


Bayesian Networks. Read R&N Ch Next lecture: Read R&N

Decision Theory Rational prospecting

Course Outline Department of Computing Science Faculty of Science. COMP Applied Artificial Intelligence (3,1,0) Fall 2015

Neural Networks for Machine Learning. Lecture 13a The ups and downs of backpropagation

CHAPTER 2 Estimating Probabilities

Supervised Learning (Big Data Analytics)

Basic Probability. Probability: The part of Mathematics devoted to quantify uncertainty

STOCHASTIC MODELING. Math3425 Spring 2012, HKUST. Kani Chen (Instructor)

Entropy and Mutual Information

Circuits and Boolean Expressions

Sample Space and Probability

3.2 Roulette and Markov Chains

Chapter 20: Data Analysis

Betting with the Kelly Criterion

1 Introduction to Option Pricing

MAS113 Introduction to Probability and Statistics

In the situations that we will encounter, we may generally calculate the probability of an event

Study Plan for the Master Degree In Industrial Engineering / Management. (Thesis Track)

CCNY. BME I5100: Biomedical Signal Processing. Linear Discrimination. Lucas C. Parra Biomedical Engineering Department City College of New York

Lecture Note 1 Set and Probability Theory. MIT Spring 2006 Herman Bennett

Problems often have a certain amount of uncertainty, possibly due to: Incompleteness of information about the environment,

comp4620/8620: Advanced Topics in AI Foundations of Artificial Intelligence

Threat Density Map Modeling for Combat Simulations

Probability, statistics and football Franka Miriam Bru ckler Paris, 2015.

Chapter 5 A Survey of Probability Concepts

(67902) Topics in Theory and Complexity Nov 2, Lecture 7

Management Decision Making. Hadi Hosseini CS 330 David R. Cheriton School of Computer Science University of Waterloo July 14, 2011

Discrete Math in Computer Science Homework 7 Solutions (Max Points: 80)

FINAL EXAM, Econ 171, March, 2015, with answers

Informatics 2D Reasoning and Agents Semester 2,

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 2

STATISTICS 230 COURSE NOTES. Chris Springer, revised by Jerry Lawless and Don McLeish

HUGIN Expert White Paper

ECE302 Spring 2006 HW1 Solutions January 16,

Evaluation of Processor Health within Hierarchical Condition Monitoring System

Desktop Publishing. Specialized Application Software. 1 Chapter 4. 2 Introduction

Bayesian Networks. Mausam (Slides by UW-AI faculty)

Boolean Algebra. Boolean Algebra. Boolean Algebra. Boolean Algebra

ARTIFICIAL INTELLIGENCE AND EXPERT SYSTEMS: KNOWLEDGE-BASED SYSTEMS

Invited Applications Paper

A Few Basics of Probability

Machine Learning. CS 188: Artificial Intelligence Naïve Bayes. Example: Digit Recognition. Other Classification Tasks

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

Gibbs Sampling and Online Learning Introduction

Data Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber

Improving the Effectiveness of Time-Based Display Advertising (Extended Abstract)

5. Probability Calculus

Attribution. Modified from Stuart Russell s slides (Berkeley) Parts of the slides are inspired by Dan Klein s lecture material for CS 188 (Berkeley)

Chapter 12 Discovering New Knowledge Data Mining

Bayesian Updating with Discrete Priors Class 11, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Probability Using Dice

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

Chapter 6: Episode discovery process

Chapter 16: law of averages

Probability and Expected Value

cs171 HW 1 - Solutions

MAT 211 Introduction to Business Statistics I Lecture Notes

Concepts of Probability

Transcription:

5-38: Artificial Intelligence robabilistic Reasoning and Inference

Advantages of probabilistic reasoning Appropriate for complex, uncertain, environments - Will it rain tomorrow? Applies naturally to many domains - Robot predicting the direction of road, biology, Word paper clip Allows to generalize acquired knowledge and incorporate prior belief - Medical diagnosis Easy to integrate different information sources - Robot s sensors

Unmanned vehicles Examples

Examples: Speech processing

Example: Biological data ATGAAGCTACTGTCTTCTATCGAACAAGCATGCG ATATTTGCCGACTTAAAAAGCTCAAG TGCTCCAAAGAAAAACCGAAGTGCGCCAAGTGT CTGAAGAACAACTGGGAGTGTCGCTAC TCTCCCAAAACCAAAAGGTCTCCGCTGACTAGG GCACATCTGACAGAAGTGGAATCAAGG CTAGAAAGACTGGAACAGCTATTTCTACTGATTT TTCCTCGAGAAGACCTTGACATGATT

Basic notations Random variable - referring to an element / event whose status is unknown: A it will rain tomorrow Domain usually denoted by Ω - The set of values a random variable can take: - A The stock market will go up this year : Binary - A Number of Steelers wins in 007 : Discrete - A % change in Google stock in 007 : Continuous

Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B

Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B Steelers win the 05-06 season

Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B

Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B

Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B There have been several other attempts to provide a foundation for probability theory. Kolmogorov s axioms are the most widely used.

Example of using the axioms Assume a probability for dice as follows: 3 0. 4 5 0. 6? {heads,tails}? heads+-tails?

Using the axioms How can we use the axioms to prove that: A A?

riors Degree of belief in an event in the absence of any other information No rain Rain rain tomorrow 0. no rain tomorrow 0.8

Conditional probablity What is the probability of an event given knowledge of another event Example: - raining sunny - raining cloudy - raining cloudy, cold

Conditional probability A B : The fraction of cases where A is true if B is true A 0. A B 0.5

Conditional probability In some cases, given knowledge of one or more random variables we can improve upon our prior belief of another random variable For example: pslept in movie 0.5 pslept in movie liked movie /3 pdidn t sleep in movie liked movie /3 Liked movie 0 0 Slept 0 0 0. 0.4 0. 0.3

Joint distributions The probability that a set of random variables will take a specific value is their joint distribution. Notation: A B or A,B Example: liked movie, slept Liked movie 0 0 Slept 0 0 0. 0.4 0. 0.3

Joint distribution cont class size > 0 0.5 summer /3 class size > 0, summer? Time regular, summer Evaluation of classes Class size Evaluation -3 0 34 65 5 43 3 5 3 3 3

Joint distribution cont class size > 0 0.5 summer /3 class size > 0, summer 0 Time regular, summer Evaluation of classes Class size Evaluation -3 0 34 65 5 43 3 5 3 3 3

Joint distribution cont class size > 0 0.5 eval /9 class size > 0, eval /9 Evaluation of classes Time regular, summer Class size 0 34 65 5 43 3 5 Evaluation -3 3 3 3

Chain rule The joint distribution can be specified in terms of conditional probability: A,B A B*B Together with Bayes rule which is actually derived from it this is one of the most powerful rules in probabilistic reasoning

Bayes rule One of the most important rules for AI usage. Derived from the chain rule: A,B A BB B AA Thus, A B B A A B Thomas Bayes was an English clergyman who set out his theory of probability in 764.

Bayes rule cont Often it would be useful to derive the rule a bit further:! A A A B A A B B A A B A B This results from: B A B,A A B A B B,A B,A0

Using Bayes rule Cards game: lace your bet on the location of the King!

Using Bayes rule Cards game: Do you want to change your bet?

Using Bayes rule selb k C k C selb selb k C Computing the posterior probability: C k selb A B C 0 0 + C C selb k C k C selb k C k C selb Bayes rule

Using Bayes rule A B C Ck selb / /3 selb C selb C k C k C k + selb k C 0 C 0 /3 / /3 / /3

Important points Random variables Chain rule Bayes rule Joint distribution, independence, conditional independence

Joint distributions The probability that a set of random variables will take a specific value is their joint distribution. Requires a joint probability table to specify the possible assignments The table can grow very rapidly Liked movie 0 0 Slept 0 0 0. 0.4 0. 0.3 How can we decrease the number of columns in the table?