Questions 11: Feed-Forward Neural Networks

Similar documents
Feed-Forward mapping networks KAIST 바이오및뇌공학과 정재승

Neural Networks and Support Vector Machines

Role of Neural network in data mining

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

Binary Adders: Half Adders and Full Adders

An Introduction to Neural Networks

Lecture 6. Artificial Neural Networks

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

Method of Combining the Degrees of Similarity in Handwritten Signature Authentication Using Neural Networks

degrees of freedom and are able to adapt to the task they are supposed to do [Gupta].

1. True or False? A voltage level in the range 0 to 2 volts is interpreted as a binary 1.

Recurrent Neural Networks

Chapter 2 The Research on Fault Diagnosis of Building Electrical System Based on RBF Neural Network

Back Propagation Neural Networks User Manual

Effect of Using Neural Networks in GA-Based School Timetabling

Artificial Intelligence Exam DT2001 / DT2006 Ordinarie tentamen

Why? A central concept in Computer Science. Algorithms are ubiquitous.

6.2.8 Neural networks for data mining

Data Mining Techniques Chapter 7: Artificial Neural Networks

Introduction to Artificial Neural Networks

Analecta Vol. 8, No. 2 ISSN

Neural Networks and Back Propagation Algorithm

Self Organizing Maps: Fundamentals

Performance Evaluation of Artificial Neural. Networks for Spatial Data Analysis

Comparison of Supervised and Unsupervised Learning Classifiers for Travel Recommendations

Chapter 4: Artificial Neural Networks

Neural network software tool development: exploring programming language options

Power Prediction Analysis using Artificial Neural Network in MS Excel

Data Mining and Neural Networks in Stata

XIV. Title. 2.1 Schematics of the WEP Encryption in WEP technique Decryption in WEP technique Process of TKIP 25

Neural Network Design in Cloud Computing

CHAPTER 5 PREDICTIVE MODELING STUDIES TO DETERMINE THE CONVEYING VELOCITY OF PARTS ON VIBRATORY FEEDER

Data quality in Accounting Information Systems

Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network

8 Primes and Modular Arithmetic

PORTFOLIO SELECTION USING

Stock Data Analysis Based On Neural Network. 1Rajesh Musne, 2 Sachin Godse

Keywords - Intrusion Detection System, Intrusion Prevention System, Artificial Neural Network, Multi Layer Perceptron, SYN_FLOOD, PING_FLOOD, JPCap

8. Machine Learning Applied Artificial Intelligence

Neural Networks algorithms and applications

1. Classification problems

Gates, Circuits, and Boolean Algebra

A Property and Casualty Insurance Predictive Modeling Process in SAS

Artificial Intelligence and Machine Learning Models

UNIVERSITY OF BOLTON SCHOOL OF ENGINEERING MS SYSTEMS ENGINEERING AND ENGINEERING MANAGEMENT SEMESTER 1 EXAMINATION 2015/2016 INTELLIGENT SYSTEMS

Bank Customers (Credit) Rating System Based On Expert System and ANN

Chapter 6. The stacking ensemble approach

FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS

Data Mining. Supervised Methods. Ciro Donalek Ay/Bi 199ab: Methods of Sciences hcp://esci101.blogspot.

Bayesian Classifier for a Gaussian Distribution, Decision Surface Equation, with Application

A Multi-level Artificial Neural Network for Residential and Commercial Energy Demand Forecast: Iran Case Study

ARTIFICIAL NEURAL NETWORKS FOR ADAPTIVE MANAGEMENT TRAFFIC LIGHT OBJECTS AT THE INTERSECTION

COMP 250 Fall 2012 lecture 2 binary representations Sept. 11, 2012

A Property & Casualty Insurance Predictive Modeling Process in SAS

Load balancing in a heterogeneous computer system by self-organizing Kohonen network

TRAINING A LIMITED-INTERCONNECT, SYNTHETIC NEURAL IC

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

APPLICATION OF ARTIFICIAL NEURAL NETWORKS USING HIJRI LUNAR TRANSACTION AS EXTRACTED VARIABLES TO PREDICT STOCK TREND DIRECTION

Stock Prediction using Artificial Neural Networks

Boolean Network Models

Padma Charan Das Dept. of E.T.C. Berhampur, Odisha, India

Design: a mod-8 Counter

Analysis of Multilayer Neural Networks with Direct and Cross-Forward Connection

Neural Network Toolbox

Artificial neural networks

Circuits and Boolean Expressions

Iranian J Env Health Sci Eng, 2004, Vol.1, No.2, pp Application of Intelligent System for Water Treatment Plant Operation.

Colored Hats and Logic Puzzles

THE HUMAN BRAIN. observations and foundations

Machine Learning and Data Mining -

Horse Racing Prediction Using Artificial Neural Networks

Question 2 Naïve Bayes (16 points)

Data Mining Lab 5: Introduction to Neural Networks

Cluster detection algorithm in neural networks

A Time Series ANN Approach for Weather Forecasting

Data Mining Algorithms Part 1. Dejan Sarka

Follow links Class Use and other Permissions. For more information, send to:

AN APPLICATION OF TIME SERIES ANALYSIS FOR WEATHER FORECASTING

Performance Based Evaluation of New Software Testing Using Artificial Neural Network

NTC Project: S01-PH10 (formerly I01-P10) 1 Forecasting Women s Apparel Sales Using Mathematical Modeling

Package neuralnet. February 20, 2015

A Content based Spam Filtering Using Optical Back Propagation Technique

A New Approach For Estimating Software Effort Using RBFN Network

A Neural Network based Approach for Predicting Customer Churn in Cellular Network Services

SUCCESSFUL PREDICTION OF HORSE RACING RESULTS USING A NEURAL NETWORK

Application of Neural Network in User Authentication for Smart Home System

American International Journal of Research in Science, Technology, Engineering & Mathematics

Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network

Deep Learning for Multivariate Financial Time Series. Gilberto Batres-Estrada

Machine Learning: Overview

Design call center management system of e-commerce based on BP neural network and multifractal

Stabilization by Conceptual Duplication in Adaptive Resonance Theory

Basic Logic Gates Richard E. Haskell

8 Square matrices continued: Determinants

Handout #1: Mathematical Reasoning

Neural Network Stock Trading Systems Donn S. Fishbein, MD, PhD Neuroquant.com

Transcription:

Questions 11: Feed-Forward Neural Networks Roman Belavkin Middlesex University Question 1 Below is a diagram if a single artificial neuron (unit): x 1 w 1 w 2 7 x 2 v y = ϕ(v) w 3 x 3 Figure 1: Single unit with three inputs. The node has three inputs x = (x 1, x 2, x 3 ) that receive only binary signals (either 0 or 1). How many different input patterns this node can receive? What if the node had four inputs? Five? Can you give a formula that computes the number of binary input patterns for a given number of inputs? Answer: For three inputs the number of combinations of 0 and 1 is 8: x 1 0 1 0 1 0 1 0 1 x 2 0 0 1 1 0 0 1 1 x 3 0 0 0 0 1 1 1 1 and for four inputs the number of combinations is 16: x 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 x 2 0 0 1 1 0 0 1 1 0 0 1 1 0 0 1 1 x 3 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 x 4 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1

BIS3226 2 You may check that for five inputs the number of combinations will be 32. Note that 8 = 2 3, 16 = 2 4 and 32 = 2 5 (for three, four and five inputs). Thus, the formula for the number of binary input patterns is: 2 n, where n in the number of inputs Question 2 Consider the unit shown on Figure 1. Suppose that the weights corresponding to the three inputs have the following values: w 1 = 2 w 2 = 4 w 3 = 1 and the activation of the unit is given by the step-function: { 1 if v 0 Calculate what will be the output value y of the unit for each of the following input patterns: Pattern P 1 P 2 P 3 P 4 x 1 1 0 1 1 x 2 0 1 0 1 x 3 0 1 1 1 Answer: To find the output value y for each pattern we have to: a) Calculate the weighted sum: v = i w i x i = w 1 x 1 + w 2 x 2 + w 3 x 3 b) Apply the activation function to v The calculations for each input pattern are: P 1 : v = 2 1 4 0 + 1 0 = 2, (2 > 0), y = ϕ(2) = 1 P 2 : v = 2 0 4 1 + 1 1 = 3, ( 3 < 0), y = ϕ( 3) = 0 P 3 : v = 2 1 4 0 + 1 1 = 3, (3 > 0), y = ϕ(3) = 1 P 4 : v = 2 1 4 1 + 1 1 = 1, ( 1 < 0), y = ϕ( 1) = 0 Question 3

BIS3226 3 Logical operators (i.e. NOT, AND, OR, XOR, etc) are the building blocks of any computational device. Logical functions return only two possible values, true or false, based on the truth or false values of their arguments. For example, operator AND returns true only when all its arguments are true, otherwise (if any of the arguments is false) it returns false. If we denote truth by 1 and false by 0, then logical function AND can be represented by the following table: x 1 : 0 1 0 1 x 2 : 0 0 1 1 x 1 AND x 2 : 0 0 0 1 This function can be implemented by a single-unit with two inputs: x 1 x 2 w 1 v w 2 y = ϕ(v) if the weights are w 1 = 1 and w 2 = 1 and the activation function is: { 1 if v 2 Note that the threshold level is 2 (v 2). a) Test how the neural AND function works. Answer: P 1 : v = 1 0 + 1 0 = 0, (0 < 2), y = ϕ(0) = 0 P 2 : v = 1 1 + 1 0 = 1, (1 < 2), y = ϕ(1) = 0 P 3 : v = 1 0 + 1 1 = 1, (1 < 2), y = ϕ(1) = 0 P 4 : v = 1 1 + 1 1 = 2, (2 = 2), y = ϕ(2) = 1 b) Suggest how to change either the weights or the threshold level of this single-unit in order to implement the logical OR function (true when at least one of the arguments is true): x 1 : 0 1 0 1 x 2 : 0 0 1 1 x 1 OR x 2 : 0 1 1 1

BIS3226 4 Answer: One solution is to increase the weights of the unit: w 1 = 2 and w 2 = 2: P 1 : v = 2 0 + 2 0 = 0, (0 < 2), y = ϕ(0) = 0 P 2 : v = 2 1 + 2 0 = 2, (2 = 2), y = ϕ(2) = 1 P 3 : v = 2 0 + 2 1 = 2, (2 = 2), y = ϕ(2) = 1 P 4 : v = 2 1 + 2 1 = 4, (4 > 2), y = ϕ(4) = 1 Alternatively, we could reduce the threshold to 1: { 1 if v 1 c) The XOR function (exclusive or) returns true only when one of the arguments is true and another is false. Otherwise, it returns always false. This can be represented by the following table: x 1 : 0 1 0 1 x 2 : 0 0 1 1 x 1 XOR x 2 : 0 1 1 0 Do you think it is possible to implement this function using a single unit? A network of several units? Answer: This is a difficult question, and it puzzled scientists for some time because it is actually impossible to implement the XOR function neither by a single unit nor by a single-layer feed-forward network (single-layer perceptron). This was known as the XOR problem. The solution was found using a feed-forward network with a hidden layer. The XOR network uses two hidden nodes and one output node. Question 4 The following diagram represents a feed-forward neural network with one hidden layer: 1 3 5 7 2 7 4 6

BIS3226 5 A weight on connection between nodes i and j is denoted by w ij, such as w 13 is the weight on the connection between nodes 1 and 3. The following table lists all the weights in the network: w 13 = 2 w 35 = 1 w 23 = 3 w 45 = 1 w 14 = 4 w 36 = 1 w 24 = 1 w 46 = 1 Each of the nodes 3, 4, 5 and 6 uses the following activation function: { 1 if v 0 where v denotes the weighted sum of a node. Each of the input nodes (1 and 2) can only receive binary values (either 0 or 1). Calculate the output of the network (y 5 and y 6 ) for each of the input patterns: Pattern: P 1 P 2 P 3 P 4 Node 1: 0 1 0 1 Node 2: 0 0 1 1 Answer: In order to find the output of the network it is necessary to calculate weighted sums of hidden nodes 3 and 4: v 3 = w 13 x 1 + w 23 x 2, v 4 = w 14 x 1 + w 24 x 2 Then find the outputs from hidden nodes using activation function ϕ: y 3 = ϕ(v 3 ), y 4 = ϕ(v 4 ). Use the outputs of the hidden nodes y 3 and y 4 as the input values to the output layer (nodes 5 and 6), and find weighted sums of output nodes 5 and 6: v 5 = w 35 y 3 + w 45 y 4, v 6 = w 36 y 3 + w 46 y 4. Finally, find the outputs from nodes 5 and 6 (also using ϕ): y 5 = ϕ(v 5 ), y 6 = ϕ(v 6 ). The output pattern will be (y 5, y 6 ). Perform these calculation for each input pattern: P 1 : Input pattern (0, 0) v 3 = 2 0 + 3 0 = 0, y 3 = ϕ(0) = 1 v 4 = 4 0 1 0 = 0, y 4 = ϕ(0) = 1 v 5 = 1 1 1 1 = 0, y 5 = ϕ(0) = 1 v 6 = 1 1 + 1 1 = 0, y 6 = ϕ(0) = 1 The output of the network is (1, 1).

BIS3226 6 P 2 : Input pattern (1, 0) v 3 = 2 1 + 3 0 = 2, y 3 = ϕ( 2) = 0 v 4 = 4 1 1 0 = 4, y 4 = ϕ(4) = 1 v 5 = 1 0 1 1 = 1, y 5 = ϕ( 1) = 0 v 6 = 1 0 + 1 1 = 1, y 6 = ϕ(1) = 1 The output of the network is (0, 1). P 3 : Input pattern (0, 1) v 3 = 2 0 + 3 1 = 3, y 3 = ϕ(3) = 1 v 4 = 4 0 1 1 = 1, y 4 = ϕ( 1) = 0 v 5 = 1 1 1 0 = 1, y 5 = ϕ(1) = 1 v 6 = 1 1 + 1 0 = 1, y 6 = ϕ( 1) = 0 The output of the network is (1, 0). P 4 : Input pattern (1, 1) v 3 = 2 1 + 3 1 = 1, y 3 = ϕ(1) = 1 v 4 = 4 1 1 1 = 3, y 4 = ϕ(3) = 1 v 5 = 1 1 1 1 = 0, y 5 = ϕ(0) = 1 v 6 = 1 1 + 1 1 = 0, y 6 = ϕ(0) = 1 The output of the network is (1, 1). Question 5 What is a training set and how is it used to train neural networks? Answer: Training set is a set of pairs of input patterns with corresponding desired output patterns. Each pair represents how the network is supposed to respond to a particular input. The network is trained to respond correctly to each input pattern from the training set. Training algorithms that use training sets are called supervised learning algorithms. We may think of a supervised learning as learning with a teacher, and the training set as a set of examples. During training the network, when presented with input patterns, gives wrong answers (not desired output). The error is used to adjust the weights in the network so that next time the error was smaller. This procedure is repeated using many examples (pairs of inputs and desired outputs) from the training set until the error becomes sufficiently small. Question 6 What is an epoch?

BIS3226 7 Answer: An epoch is when all of the data in the training set is presented to the neural network once. Question 7 Describe the main steps of the supervised training algorithm? Answer: Initially, set all the weights to some random values Repeat (for many epochs): a) Feed the network with an input from one of the examples in the training set b) Compute the error between the output of the network and the desired output c) Correct the error by adjusting the weights of the nodes Until the error is sufficiently small Question 8 Suppose that a credit card company decided to deploy a new system for assessing credit worthiness of its customers. The new system is using a feed-forward neural network with a supervised learning algorithm. Suggest in a form of essay what should the bank have before the system can be used? Discuss problems associated with this requirement. Answer: The answer should mention that the company should get hold of historical data about its customers who already took credit in the past. This data will be used as a training set for the neural network. It is important that the data is representative and covers as many types of customers as possible. This is because the network will not be able to produce an accurate answer for a customer very different from those in the training set.