5-38: Artificial Intelligence robabilistic Reasoning and Inference
Advantages of probabilistic reasoning Appropriate for complex, uncertain, environments - Will it rain tomorrow? Applies naturally to many domains - Robot predicting the direction of road, biology, Word paper clip Allows to generalize acquired knowledge and incorporate prior belief - Medical diagnosis Easy to integrate different information sources - Robot s sensors
Unmanned vehicles Examples
Examples: Speech processing
Example: Biological data ATGAAGCTACTGTCTTCTATCGAACAAGCATGCG ATATTTGCCGACTTAAAAAGCTCAAG TGCTCCAAAGAAAAACCGAAGTGCGCCAAGTGT CTGAAGAACAACTGGGAGTGTCGCTAC TCTCCCAAAACCAAAAGGTCTCCGCTGACTAGG GCACATCTGACAGAAGTGGAATCAAGG CTAGAAAGACTGGAACAGCTATTTCTACTGATTT TTCCTCGAGAAGACCTTGACATGATT
Basic notations Random variable - referring to an element / event whose status is unknown: A it will rain tomorrow Domain usually denoted by Ω - The set of values a random variable can take: - A The stock market will go up this year : Binary - A Number of Steelers wins in 007 : Discrete - A % change in Google stock in 007 : Continuous
Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B
Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B Steelers win the 05-06 season
Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B
Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B
Axioms of probability Kolmogorov s axioms A variety of useful facts can be derived from just three axioms:. 0 A. true, false 0 3. A B A + B A B There have been several other attempts to provide a foundation for probability theory. Kolmogorov s axioms are the most widely used.
Example of using the axioms Assume a probability for dice as follows: 3 0. 4 5 0. 6? {heads,tails}? heads+-tails?
Using the axioms How can we use the axioms to prove that: A A?
riors Degree of belief in an event in the absence of any other information No rain Rain rain tomorrow 0. no rain tomorrow 0.8
Conditional probablity What is the probability of an event given knowledge of another event Example: - raining sunny - raining cloudy - raining cloudy, cold
Conditional probability A B : The fraction of cases where A is true if B is true A 0. A B 0.5
Conditional probability In some cases, given knowledge of one or more random variables we can improve upon our prior belief of another random variable For example: pslept in movie 0.5 pslept in movie liked movie /3 pdidn t sleep in movie liked movie /3 Liked movie 0 0 Slept 0 0 0. 0.4 0. 0.3
Joint distributions The probability that a set of random variables will take a specific value is their joint distribution. Notation: A B or A,B Example: liked movie, slept Liked movie 0 0 Slept 0 0 0. 0.4 0. 0.3
Joint distribution cont class size > 0 0.5 summer /3 class size > 0, summer? Time regular, summer Evaluation of classes Class size Evaluation -3 0 34 65 5 43 3 5 3 3 3
Joint distribution cont class size > 0 0.5 summer /3 class size > 0, summer 0 Time regular, summer Evaluation of classes Class size Evaluation -3 0 34 65 5 43 3 5 3 3 3
Joint distribution cont class size > 0 0.5 eval /9 class size > 0, eval /9 Evaluation of classes Time regular, summer Class size 0 34 65 5 43 3 5 Evaluation -3 3 3 3
Chain rule The joint distribution can be specified in terms of conditional probability: A,B A B*B Together with Bayes rule which is actually derived from it this is one of the most powerful rules in probabilistic reasoning
Bayes rule One of the most important rules for AI usage. Derived from the chain rule: A,B A BB B AA Thus, A B B A A B Thomas Bayes was an English clergyman who set out his theory of probability in 764.
Bayes rule cont Often it would be useful to derive the rule a bit further:! A A A B A A B B A A B A B This results from: B A B,A A B A B B,A B,A0
Using Bayes rule Cards game: lace your bet on the location of the King!
Using Bayes rule Cards game: Do you want to change your bet?
Using Bayes rule selb k C k C selb selb k C Computing the posterior probability: C k selb A B C 0 0 + C C selb k C k C selb k C k C selb Bayes rule
Using Bayes rule A B C Ck selb / /3 selb C selb C k C k C k + selb k C 0 C 0 /3 / /3 / /3
Important points Random variables Chain rule Bayes rule Joint distribution, independence, conditional independence
Joint distributions The probability that a set of random variables will take a specific value is their joint distribution. Requires a joint probability table to specify the possible assignments The table can grow very rapidly Liked movie 0 0 Slept 0 0 0. 0.4 0. 0.3 How can we decrease the number of columns in the table?