How To Understand Artificial Intelligence

Similar documents

comp4620/8620: Advanced Topics in AI Foundations of Artificial Intelligence

One Decade of Universal Artificial Intelligence

One Decade of Universal Artificial Intelligence

Space-Time Embedded Intelligence

Learning is a very general term denoting the way in which agents:

Learning What to Value

1 What is Machine Learning?

Machine Learning Introduction

Machine Learning: Overview

COMP 590: Artificial Intelligence

Lecture 1: Introduction to Reinforcement Learning

Reinforcement Learning and the Reward Engineering Principle

Attribution. Modified from Stuart Russell s slides (Berkeley) Parts of the slides are inspired by Dan Klein s lecture material for CS 188 (Berkeley)

CSC384 Intro to Artificial Intelligence

Intelligent Agents. Based on An Introduction to MultiAgent Systems and slides by Michael Wooldridge

Course: Model, Learning, and Inference: Lecture 5

Proceedings of the 9th WSEAS International Conference on APPLIED COMPUTER SCIENCE

Introduction to Learning & Decision Trees

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

Fall 2012 Q530. Programming for Cognitive Science

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

Machine Learning.

CHAPTER 2 Estimating Probabilities

Teaching Introductory Artificial Intelligence with Pac-Man

Course Outline Department of Computing Science Faculty of Science. COMP Applied Artificial Intelligence (3,1,0) Fall 2015

So, how do you pronounce. Jilles Vreeken. Okay, now we can talk. So, what kind of data? binary. * multi-relational

An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Doctor of Philosophy in Computer Science

About the Author. The Role of Artificial Intelligence in Software Engineering. Brief History of AI. Introduction 2/27/2013

Measuring the Performance of an Agent

Learning Agents: Introduction

large-scale machine learning revisited Léon Bottou Microsoft Research (NYC)

Game Theory 1. Introduction

Reinforcement Learning

6.2.8 Neural networks for data mining

I N T E L L I G E N T S O L U T I O N S, I N C. DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD

Course 395: Machine Learning

Statistics Graduate Courses

Compact Representations and Approximations for Compuation in Games

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

Learning outcomes. Knowledge and understanding. Competence and skills

When Machine Learning Meets AI and Game Theory

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Decision Theory Rational prospecting

1 Portfolio Selection

School of Computer Science

Data Isn't Everything

Motivation. Motivation. Can a software agent learn to play Backgammon by itself? Machine Learning. Reinforcement Learning

Introduction to Online Learning Theory

MACHINE LEARNING BASICS WITH R

Web Data Mining: A Case Study. Abstract. Introduction

SYSM 6304: Risk and Decision Analysis Lecture 5: Methods of Risk Analysis

cs171 HW 1 - Solutions

M.S. Computer Science Program

Real-world Limits to Algorithmic Intelligence

AN INTRODUCTION TO PSYCHOLOGY

A HYBRID GENETIC ALGORITHM FOR THE MAXIMUM LIKELIHOOD ESTIMATION OF MODELS WITH MULTIPLE EQUILIBRIA: A FIRST REPORT

Software Development Training Camp 1 (0-3) Prerequisite : Program development skill enhancement camp, at least 48 person-hours.

DRAFT TJ PROGRAM OF STUDIES: AP PSYCHOLOGY

Creating a NL Texas Hold em Bot

Machine Learning and Statistics: What s the Connection?

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

LEARNING OUTCOMES FOR THE PSYCHOLOGY MAJOR

JOINT ATTENTION. Kaplan and Hafner (2006) Florian Niefind Coli, Universität des Saarlandes SS 2010

Bootstrapping Big Data

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

SINGLE-STAGE MULTI-PRODUCT PRODUCTION AND INVENTORY SYSTEMS: AN ITERATIVE ALGORITHM BASED ON DYNAMIC SCHEDULING AND FIXED PITCH PRODUCTION

Artificial Intelligence and Politecnico di Milano. Presented by Matteo Matteucci

Data Warehousing and Data Mining in Business Applications

A Learning Based Method for Super-Resolution of Low Resolution Images

A Client-Server Interactive Tool for Integrated Artificial Intelligence Curriculum

Artificial Intelligence BEG471CO

What is Artificial Intelligence?

List of Ph.D. Courses

Machine Learning CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

MA2823: Foundations of Machine Learning

Introducing Social Psychology

Big Data - Lecture 1 Optimization reminders

Kolmogorov Complexity and the Incompressibility Method

Chapter 6. The stacking ensemble approach

Reinforcement Learning

Undergraduate Psychology Major Learning Goals and Outcomes i

MSCA Introduction to Statistical Concepts

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Applied Psychology. Dr. Marya Howell-Carter, Acting Chair Psychology Dept. Bachelor of Science Degree

Research Basis for Catchup Math

What is Learning? CS 391L: Machine Learning Introduction. Raymond J. Mooney. Classification. Problem Solving / Planning / Control

Analysis of Computer Algorithms. Algorithm. Algorithm, Data Structure, Program

TExES Special Education Supplemental (163) Test at a Glance

Psychology. 42 Credits Complete the requirements shown in the General Education Requirements section of this catalog. Include this specific course.

: Introduction to Machine Learning Dr. Rita Osadchy

Transcription:

Universal Artificial Intelligence Marcus Hutter Canberra, ACT, 0200, Australia http://www.hutter1.net/

Marcus Hutter - 2 - Universal Artificial Intelligence Abstract The dream of creating artificial devices that reach or outperform human intelligence is many centuries old. In this talk I present an elegant parameter-free theory of an optimal reinforcement learning agent embedded in an arbitrary unknown environment that possesses essentially all aspects of rational intelligence. The theory reduces all conceptual AI problems to pure computational questions. The necessary and sufficient ingredients are Bayesian probability theory; algorithmic information theory; universal Turing machines; the agent framework; sequential decision theory; and reinforcement learning, which are all important subjects in their own right. I also present some recent approximations, implementations, and applications of this modern top-down approach to AI.

Marcus Hutter - 3 - Universal Artificial Intelligence Overview Goal: Construct a single universal agent that learns to act optimally in any environment. State of the art: Formal (mathematical, non-comp.) definition of such an agent. Accomplishment: Well-defines AI. Formalizes rational intelligence. Formal solution of the AI problem in the sense of... = Reduces the conceptional AI problem to a (pure) computational problem. Evidence: Mathematical optimality proofs and some experimental results.

Marcus Hutter - 4 - Universal Artificial Intelligence Contents Philosophical and Mathematical Background. Universal Intelligence. Discussion. Summary and References.

Marcus Hutter - 5 - Universal Artificial Intelligence PHILOSOPHICAL AND MATHEMATICAL BACKGROUND

Marcus Hutter - 6 - Universal Artificial Intelligence What is (Artificial) Intelligence? Intelligence can have many faces formal definition difficult reasoning creativity association generalization pattern recognition problem solving memorization planning achieving goals learning optimization self-preservation vision language processing classification induction deduction... What is AI? Thinking Acting humanly Cognitive Turing test, Science Behaviorism rationally Laws Doing the Thought Right Thing Collection of 70+ Defs of Intelligence http://www.vetta.org/ definitions-of-intelligence/ Real world is nasty: partially unobservable, uncertain, unknown, non-ergodic, reactive, vast, but luckily structured,...

Marcus Hutter - 7 - Universal Artificial Intelligence Relevant Research Fields (Universal) Artificial Intelligence has interconnections with (draws from and contributes to) many research fields: computer science (artificial intelligence, machine learning), engineering (information theory, adaptive control), economics (rational agents, game theory), mathematics (statistics, probability), psychology (behaviorism, motivation, incentives), philosophy (reasoning, induction, knowledge).

Marcus Hutter - 8 - Universal Artificial Intelligence Relation between ML & RL & (U)AI Universal Artificial Intelligence Covers all Reinforcement Learning problem types Statistical Machine Learning Mostly i.i.d. data classification, regression, clustering RL Problems & Algorithms Stochastic, unknown, non-i.i.d. environments Artificial Intelligence Traditionally deterministic, known world / planning problem

Marcus Hutter - 9 - Universal Artificial Intelligence There is an Elegant Theory of... Cellular Automata... Computing Iterative maps...chaos and Order QED... Chemistry Super-Strings... the Universe Universal AI... Super Intelligence

Marcus Hutter - 10 - Universal Artificial Intelligence Informal Definition of (Artificial) Intelligence Intelligence measures an agent s ability to achieve goals in a wide range of environments. [S. Legg and M. Hutter] Emergent: Features such as the ability to learn and adapt, or to understand, are implicit in the above definition as these capacities enable an agent to succeed in a wide range of environments. The science of Artificial Intelligence is concerned with the construction of intelligent systems/artifacts/agents and their analysis. What next? Substantiate all terms above: agent, ability, utility, goal, success, learn, adapt, environment,... Never trust a theory === if it is not supported by an experiment ===== experiment theory

Marcus Hutter - 11 - Universal Artificial Intelligence Induction Prediction Decision Action Having or acquiring or learning or inducing a model of the environment an agent interacts with allows the agent to make predictions and utilize them in its decision process of finding a good next action. Induction infers general models from specific observations/facts/data, usually exhibiting regularities or properties or relations in the latter. Example Induction: Find a model of the world economy. Prediction: Use the model for predicting the future stock market. Decision: Decide whether to invest assets in stocks or bonds. Action: Trading large quantities of stocks influences the market.

Foundations of Universal Artificial Intelligence Ockhams razor (simplicity) principle Entities should not be multiplied beyond necessity. Epicurus principle of multiple explanations If more than one theory is consistent with the observations, keep all theories. Bayes rule for conditional probabilities Given the prior belief/probability one can predict all future probabilities. Posterior(H D) Likelihood(D H) Prior(H). Turing s universal machine Everything computable by a human using a fixed procedure can also be computed by a (universal) Turing machine. Kolmogorov s complexity The complexity or information content of an object is the length of its shortest description on a universal Turing machine. Solomonoff s universal prior=ockham+epicurus+bayes+turing Solves the question of how to choose the prior if nothing is known. universal induction, formal Ockham. Prior(H) = 2 Kolmogorov(H) Bellman equations Theory of how to optimally plan and act in known environments. Solomonoff + Bellman = Universal Artificial Intelligence.

Marcus Hutter - 13 - Universal Artificial Intelligence UNIVERSAL INTELLIGENCE

Marcus Hutter - 14 - Universal Artificial Intelligence Agent Model with Reward Most if not all AI problems can be formulated within the agent framework r 1 o 1 r 2 o 2 r 3 o 3 r 4 o 4 r 5 o 5 r 6 o 6... Agent π ory... work mem- Environment µ tape... a 1 a 2 a 3 a 4 a 5 a 6...

Marcus Hutter - 15 - Universal Artificial Intelligence Formal Definition of Intelligence Agent follows policy π : (A O R) A Environment reacts with µ : (A O R) A O R Performance of agent π in environment µ = expected cumulative reward = V π µ := E π µ[ t=1 rπµ t ] True environment µ unknown average over wide range of environments Ockham+Epicurus: Weigh each environment with its Kolmogorov complexity K(µ) := min p {length(p) : U(p) = µ} Universal intelligence of agent π is Υ(π) := µ 2 K(µ) V π µ. Compare to our informal definition: Intelligence measures an agent s ability to perform well in a wide range of environments. AIXI = arg max π Υ(π) = most intelligent agent.

Marcus Hutter - 16 - Universal Artificial Intelligence Is Universal Intelligence Υ any Good? Captures our informal definition of intelligence. Incorporates Occam s razor. Very general: No restriction on internal working of agent. Correctly orders simple adaptive agents. Agents with high Υ like AIXI are extremely powerful. Υ spans from very low intelligence up to ultra-high intelligence. Practically meaningful: High Υ = practically useful. Non-anthropocentric: based on information & computation theory. (unlike Turing test which measures humanness rather than int.) Simple and intuitive formal definition: does not rely on equally hard notions such as creativity, understanding, wisdom, consciousness. Υ is valid, informative, wide range, general, dynamic, unbiased, fundamental, formal, objective, fully defined, universal.

Marcus Hutter - 17 - Universal Artificial Intelligence The AIXI Model in one Line complete & essentially unique & limit-computable AIXI: a k := arg max... max [r k +... + r m ] a k a m o k r k o m r m 2 length(p) p : U(p,a 1..a m )=o 1 r 1..o m r m k=now, action, observation, reward, U niversal TM, program, m=lifespan AIXI is an elegant mathematical theory of general AI, but incomputable, so needs to be approximated in practice. Claim: AIXI is the most intelligent environmental independent, i.e. universally optimal, agent possible. Proof: For formalizations, quantifications, and proofs, see [Hut05]. Potential Applications: Agents, Games, Optimization, Active Learning, Adaptive Control, Robots.

Marcus Hutter - 18 - Universal Artificial Intelligence DISCUSSION

Marcus Hutter - 19 - Universal Artificial Intelligence Some Important Problem Classes Sequence Prediction, e.g. weather or stock-market prediction. Strong result: number of errors K(µ) = small. Strategic Games: Learn to play well (minimax) strategic zero-sum games (like chess) or even exploit limited capabilities of opponent. Optimization: Find (approximate) minimum of function with as few function calls as possible. Difficult exploration versus exploitation problem. Supervised learning: Learn functions by presenting (z, f(z)) pairs and ask for function values of z by presenting (z,?) pairs. Supervised learning is much faster than reinforcement learning. AIXI quickly learns to predict, play games, optimize, learn supervised.

Marcus Hutter - 20 - Universal Artificial Intelligence Approximations Levin search: Fastest algorithm for inversion and optimization problems. Theoretical application: Assume somebody found a non-constructive proof of P=NP, then Levin-search is a polynomial time algorithm for every NP (complete) problem. Practical (OOPS) applications (J. Schmidhuber) Mazes, towers of hanoi, robotics,... FastPrg: The asymptotically fastest and shortest algorithm for all well-defined problems. Computable Approximations of AIXI: AIXItl and AIξ and MC-AIXI-CTW and ΦMDP. Human Knowledge Compression Prize: (50 000 C=)

Marcus Hutter - 21 - Universal Artificial Intelligence Monte-Carlo AIXI Applications without providing any domain knowledge, the same agent is able to self-adapt to a diverse range of interactive environments. Normalised Average Reward per Cycle 1 0.8 0.6 0.4 0.2 0 [VNHUS 09-11] www.youtube.com/watch?v=yfsmhtmgdke Experience Optimal Cheese Maze Tiger 4x4 Grid TicTacToe Biased RPS Kuhn Poker Pacman 100 1000 10000 100000 1000000

Marcus Hutter - 22 - Universal Artificial Intelligence Aspects of Intelligence are all(?) either directly included in AIXI or are emergent Trait of Intell. How included in AIXI reasoning to improve internal algorithms (emergent) creativity exploration bonus, randomization,... association for co-compression of similar observations generalization for compression of regularities pattern recognition in perceptions for compression problem solving how to get more reward memorization storing historic perceptions planning searching the expectimax tree achieving goals by optimal sequential decisions learning Bayes-mixture and belief update optimization compression and expectimax self-preservation by coupling reward to robot components vision observation=camera image (emergent) language observation/action = audio-signal (emergent) motor skills action = movement (emergent) classification by compression induction Universal Bayesian posterior (Ockham s razor) deduction Correctness proofs in AIXItl

Marcus Hutter - 23 - Universal Artificial Intelligence Other Aspects of the Human Mind Conciousness Self-awareness Sentience Emotions If these qualia are relevant for rational decision making, then they should be emergent traits of AIXI too.

Marcus Hutter - 24 - Universal Artificial Intelligence Origin of Rewards and Universal Goals Where do rewards come from if we don t (want to) provide them. Human interaction: reward the robot according to how well it solves the tasks we want it to do. Autonomous: Hard-wire reward to predefined task: E.g. Mars robot: reward = battery level & evidence of water/life. Is there something like a universal goal Curiosity-driven learning Knowledge seeking agents [Sch07] [Ors11, OLH13]

Marcus Hutter - 25 - Universal Artificial Intelligence Mortal Embodied (AIXI) Agent Robot in human society: reward the robot according to how well it solves the tasks we want it to do, like raising and safeguarding a child. In the attempt to maximize reward, the robot will also maintain itself. Robot w/o human interaction (e.g. on Alpha-Centauri): Some rudimentary capabilities (which may not be that rudimentary at all) are needed to allow the robot to at least survive. Train the robot first in safe environment, then let it loose. Drugs (hacking the reward system): No, since long-term reward would be small (death). but see [OR11] Replication/procreation: Yes, if AIXI believes that clones or descendants are useful for its own goals (ensure retirement pension). Suicide: Yes (No), if AIXI can be raised to believe to go to heaven (hell). Self-Improvement: Yes, since this helps to increase reward. Manipulation: Any Super-intelligent robot can manipulate or threaten its teacher to give more reward. see also [RO11]

Marcus Hutter - 26 - Universal Artificial Intelligence Some more Social Questions Attitude: Are pure reward maximizers egoists, psychopaths, and/or killers or will they be friendly (altruism as extended ego(t)ism)? Curiosity killed the cat and maybe AIXI, or is extra reward for curiosity necessary? [Sch07, Ors11, LHS13] Immortality can cause laziness! [Hut05, Sec.5.7] Can self-preservation be learned or need (parts of) it be innate. Socializing: How will AIXI interact with another AIXI? see also [RO11] [Hut09, Sec.5j],[PH06]

Marcus Hutter - 27 - Universal Artificial Intelligence Creativity An Algorithmic View Definition: the process of producing something original&worthwhile. The process: combining and modifying existing thoughts or artifacts in novel ways, driven by random choice and filtering out bad results. Analogy: Ecosystems appear to be creatively designed, but blind evolutionary process was sufficient. Solving complex problems requires (apparent) creativity. Since AIXI is able to solve complex problems, it will appear creative. Analogy: Brute-force MiniMax chess programs appear to make (occasionally) creative moves. Creativity emerges from long-term reward maximization. Science finding patterns Compression is creative process is formal procedure Exploratory actions can appear creative. Fazit: Creativity is just exploration, filtering, and problem solving.

Marcus Hutter - 28 - Universal Artificial Intelligence SUMMARY AND REFERENCES

Marcus Hutter - 29 - Universal Artificial Intelligence Universal Artificial Intelligence (AIXI) Decision Theory = Probability + Utility Theory + + Universal Induction = Ockham + Bayes + Turing Involved Scientific Areas reinforcement learning information theory theory of computation Bayesian statistics sequential decision theory adaptive control theory Solomonoff induction Kolmogorov complexity Universal search and many more

Marcus Hutter - 30 - Universal Artificial Intelligence Introductory Literature [Hut05] [Hut06] M. Hutter. Universal Artificial Intelligence: Sequential Decisions based on Algorithmic Probability. Springer, Berlin, 2005. M. Hutter. Human knowledge compression prize. open ended, http://prize.hutter1.net/. [Hut12a] M. Hutter. Can intelligence explode? Journal of Consciousness Studies, 19(1-2):143 166, 2012. [Hut12b] M. Hutter. One decade of universal artificial intelligence. In Theoretical Foundations of Artificial General Intelligence, pages 67 88. Atlantis Press, 2012. [LH07] [RH11] S. Legg and M. Hutter. Universal intelligence: A definition of machine intelligence. Minds & Machines, 17(4):391 444, 2007. S. Rathmanner and M. Hutter. A philosophical treatise of universal induction. Entropy, 13(6):1076 1136, 2011. [VNH+11] J. Veness et al. A Monte Carlo AIXI approximation. Journal of Artificial Intelligence Research, 40:95 142, 2011.