Knowledge-based systems and the need for learning



Similar documents
Chapter 12 Discovering New Knowledge Data Mining

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Data Mining with Weka

8. Machine Learning Applied Artificial Intelligence

Monday Morning Data Mining

1 What is Machine Learning?

Social Media Mining. Data Mining Essentials

Data Mining on Streams

Machine Learning: Overview

Version Spaces.

: Introduction to Machine Learning Dr. Rita Osadchy

Extend Table Lens for High-Dimensional Data Visualization and Classification Mining

Learning is a very general term denoting the way in which agents:

Machine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos)

Introduction to Pattern Recognition

Course 395: Machine Learning

Machine Learning CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

What is Data Mining? Data Mining (Knowledge discovery in database) Data mining: Basic steps. Mining tasks. Classification: YES, NO

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

Big Data: The Science of Patterns. Dr. Lutz Hamel Dept. of Computer Science and Statistics

Decision-Tree Learning

Anomaly detection. Problem motivation. Machine Learning

Supervised Learning (Big Data Analytics)

15-381: Artificial Intelligence. Probabilistic Reasoning and Inference

Maschinelles Lernen mit MATLAB

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

life science data mining

Credit Card Fraud Detection and Concept-Drift Adaptation with Delayed Supervised Information

Machine Learning Introduction

Data Mining of Web Access Logs

Chapter 11. Managing Knowledge

Knowledge Discovery and Data Mining

Managing Knowledge. Chapter 11 8/12/2015

Machine Learning Techniques for Data Mining

Attribution. Modified from Stuart Russell s slides (Berkeley) Parts of the slides are inspired by Dan Klein s lecture material for CS 188 (Berkeley)

Dan French Founder & CEO, Consider Solutions

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Introduction to Learning & Decision Trees

Practical Introduction to Machine Learning and Optimization. Alessio Signorini

Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus

IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES

Making Sense of Physician Notes: A Big Data Approach. Anupam Joshi UMBC joshi@umbc.edu Joint work with students, UBMC Colleagues, and UMMS Colleagues

Artificial Intelligence: Fuzzy Logic Explained

Understanding Human Behaviour and Error

Machine Learning.

Comparison of K-means and Backpropagation Data Mining Algorithms

Data Mining on Social Networks. Dionysios Sotiropoulos Ph.D.

A.I. in health informatics lecture 1 introduction & stuff kevin small & byron wallace

Introduction to Machine Learning Using Python. Vikram Kamath

Lecture Slides for INTRODUCTION TO. ETHEM ALPAYDIN The MIT Press, Lab Class and literature. Friday, , Harburger Schloßstr.

PREDICTIVE ANALYTICS FOR THE HEALTHCARE INDUSTRY

Statistics and Data Mining

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Search Taxonomy. Web Search. Search Engine Optimization. Information Retrieval

Data Mining - Introduction

Rule based Classification of BSE Stock Data with Data Mining

CPC40912 Certificate IV in Plumbing and Services

Base Recall Program September 2003

EVALUATING CLASSIFICATION POWER OF LINKED ADMISSION DATA SOURCES WITH TEXT MINING

Lecture 6. Artificial Neural Networks

1. Classification problems

National Certificate in Heating, Ventilating, and Air Conditioning (Mechanical Services) (Level 5)

Weather forecast prediction: a Data Mining application

Managing Knowledge and Collaboration

Overuse of PSA Screening for Prostate Cancer in Older Men. Elizabeth Jaramillo, MD January 17, 2014

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

6.2 Permutations continued

Improving Knowledge-Based System Performance by Reordering Rule Sequences

Chapter ML:XI (continued)

Customer Relationship Management and how you can. use CRM technology to manage and grow your business.

Exam Chapter 11 - Managing Knowledge. No Talking No Cheating Review after exam Back at 7pm

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

Accident Investigation

ANALYTICS IN BIG DATA ERA

Decision Trees. JERZY STEFANOWSKI Institute of Computing Science Poznań University of Technology. Doctoral School, Catania-Troina, April, 2008

INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR.

Classification using Logistic Regression

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

More Data Mining with Weka

INTERNATIONAL STANDARD ON AUDITING (UK AND IRELAND) 240 THE AUDITOR S RESPONSIBILITIES RELATING TO FRAUD IN AN AUDIT OF FINANCIAL STATEMENTS

Wireless Remote Monitoring System for ASTHMA Attack Detection and Classification

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Machine Learning and Statistics: What s the Connection?

Introduction. Chapter 1

A CALL CENTER ORIENTED CONSULTANT SYSTEM FOR FISH DISEASE DIAGNOSIS IN CHINA

CFSD 21 ST CENTURY SKILL RUBRIC CRITICAL & CREATIVE THINKING

Data Mining: Foundation, Techniques and Applications

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Boerne Independent School District Job Description

Introduction to Data Mining

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

TEXAS ~ STATUTE Continued

Organic Gardening Certificate Program Quiz Week 3 Answer Key

medexter clinical decision support

THE DEVELOPMENT OF AN EXPERT CAR FAILURE DIAGNOSIS SYSTEM WITH BAYESIAN APPROACH

Work with vibrating machines Health hazards:

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics

A Splitting Criteria Based on Similarity in Decision Tree Learning

Cell Phone based Activity Detection using Markov Logic Network

Transcription:

Knowledge-based systems and the need for learning The implementation of a knowledge-based system can be quite difficult. Furthermore, the process of reasoning with that knowledge can be quite slow. This raises two questions: 1. Is there an easier way to gather all the information that needs to be encoded in a knowledge base? 2. Is it possible to speed up the process of reasoning? Can Computers Learn? Learning a new set of facts. Learning how to do something. Improving on something that was already learned. Simon s Definition of Machine Learning Learning denotes changes in the system that are adaptive in the sense that they enable the system to do the same task or tasks drawn from the same population more effectively the next time. [Machine Learning I, 1983, Chapter 2.] Below is a diagram that shows one way that a learning element can fit into an AI system. ot every component or connection shown below is necessarily present in every system.

Performance standard feedback learning goals Critic Learning element Problem generator changes knowledge Sensors Performance element Effectors E V I R O M E T [Figure from Russell and orvig] Design of a Learning Element There are many issues that impact the design of a learning element, including: - Which components of the performance element are to be improved. - What representation is used for those components. - What feedback is available. - Supervised learning - Unsupervised learning - Reinforcement learning - What prior information is available. Inductive Concept Learning Of the general learning tasks we saw above, this falls into the category of learning new facts Consider the following example: Say that we re physicians who are treating patients with a particular type of cancer. We d like to predict, among other things, whether a patient will have a recurrence of the disease. Let s assume that we have at our disposal the medical records of other patients. If we ve tracked the health of these patients for many years, we know which patients have had a recurrence and which have not. Perhaps

we can analyze the records to determine the attributes that might be predictive of recurrence. Inductive algorithm tries to induce a general rule from a set of observed instances; does not make use of explicit background knowledge about the domain of application. Supervised the learning algorithm is given the correct concept label for each input example -- in the example above, the label recurrence or not for each patient. A Simple Example: Work or Play? Outlook Temperature Humidity Windy? Plan Example 1 Sunny Mild High False Play Example 2 Sunny Hot High True Play Example 3 Overcast Hot High False Work Example 4 Rainy Mild High False Work Example 5 Rainy Cool ormal False Work Example 6 Rainy Cool ormal True Work Example 7 Sunny Hot ormal False Play Example 8 Sunny Cold ormal True Work Each example (instance) is a possible day, described by the features Outlook, Temperature, Humidity, Windy The target concepts to be learned are a good day to work and a good day to play Examples of Application Domains Medical diagnosis: - patient symptoms and tests has disease / does not have disease - patient symptoms and tests disease Other diagnostic applications, such as diagnosis of customer-reported telephone problems. Stock market prediction: - closing price of last few days market will go up or down tomorrow Cell phone fraud detection: - characteristics of the call (destination, location of caller, duration, etc) fraudulent / not fraudulent

Face recognition for security: - facial characteristics identification of individual Vehicle steering: - image viewed from camera direction and degree to turn the wheel Induction task: Inductive Concept Learning Given: a collection of examples described by attributes, each labeled with a class name Return: a classification rule that can determine the class of any object from its attributes values. Inductive Bias Recall the claim that these algorithms make no use of explicit background knowledge of the domain of application. This is true. However, it would be inappropriate to claim that these algorithms are completely knowledge-free. The attributes describing the examples are generally fixed. The representation of the knowledge to be learned is set by the algorithm. Examples may have been selected in a biased manner. Etc. Implicit knowledge can play an important role in many other ways as well. Def. Any preference for one hypothesis over another, beyond mere consistency with the examples, is called a bias. Inductive Learning Hypothesis Any hypothesis found to approximate the target concept well over a sufficiently large set of training examples will also approximate the target concept well over any other unobserved examples. Assumptions of classifier learning algorithms: - The sample represents the population. - The features permit discrimination.