Artificial Intelligence and Machine Learning Models

Similar documents
Neural network software tool development: exploring programming language options

MANAGING QUEUE STABILITY USING ART2 IN ACTIVE QUEUE MANAGEMENT FOR CONGESTION CONTROL

An Introduction to Neural Networks

D A T A M I N I N G C L A S S I F I C A T I O N

Impact of Feature Selection on the Performance of Wireless Intrusion Detection Systems

Analecta Vol. 8, No. 2 ISSN

8. Machine Learning Applied Artificial Intelligence

Data Mining and Neural Networks in Stata

Optimum Design of Worm Gears with Multiple Computer Aided Techniques

Comparison of Supervised and Unsupervised Learning Classifiers for Travel Recommendations

A New Approach For Estimating Software Effort Using RBFN Network

Chapter 4: Artificial Neural Networks

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

Neural Network Applications in Stock Market Predictions - A Methodology Analysis

Neural Networks and Support Vector Machines

Segmentation of stock trading customers according to potential value

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Self Organizing Maps: Fundamentals

Neural Networks and Back Propagation Algorithm

Power Prediction Analysis using Artificial Neural Network in MS Excel

6.2.8 Neural networks for data mining

ISSN: ISO 9001:2008 Certified International Journal of Engineering Science and Innovative Technology (IJESIT) Volume 2, Issue 3, May 2013

UNIVERSITY OF BOLTON SCHOOL OF ENGINEERING MS SYSTEMS ENGINEERING AND ENGINEERING MANAGEMENT SEMESTER 1 EXAMINATION 2015/2016 INTELLIGENT SYSTEMS

1. Classification problems

EVALUATION OF NEURAL NETWORK BASED CLASSIFICATION SYSTEMS FOR CLINICAL CANCER DATA CLASSIFICATION

American International Journal of Research in Science, Technology, Engineering & Mathematics

Models of Cortical Maps II

Neural Network Design in Cloud Computing

Data Mining. Supervised Methods. Ciro Donalek Ay/Bi 199ab: Methods of Sciences hcp://esci101.blogspot.

Lecture 6. Artificial Neural Networks

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

An Analysis on Density Based Clustering of Multi Dimensional Spatial Data

Real-Time Credit-Card Fraud Detection using Artificial Neural Network Tuned by Simulated Annealing Algorithm

Chapter 6. The stacking ensemble approach

Data Mining Techniques Chapter 7: Artificial Neural Networks

Neural Network and Genetic Algorithm Based Trading Systems. Donn S. Fishbein, MD, PhD Neuroquant.com

How To Make A Credit Risk Model For A Bank Account

Intelligent Modeling of Sugar-cane Maturation

PLAANN as a Classification Tool for Customer Intelligence in Banking

Journal of Optimization in Industrial Engineering 13 (2013) 49-54

Bank Customers (Credit) Rating System Based On Expert System and ANN

Stabilization by Conceptual Duplication in Adaptive Resonance Theory

A Robust Method for Solving Transcendental Equations

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

SUCCESSFUL PREDICTION OF HORSE RACING RESULTS USING A NEURAL NETWORK

Chapter 2 The Research on Fault Diagnosis of Building Electrical System Based on RBF Neural Network

Role of Neural network in data mining

Overview. Swarms in nature. Fish, birds, ants, termites, Introduction to swarm intelligence principles Particle Swarm Optimization (PSO)

Predictive Dynamix Inc

Genetic Algorithm Evolution of Cellular Automata Rules for Complex Binary Sequence Prediction

Predictive time series analysis of stock prices using neural network classifier

Intrusion Detection using Artificial Neural Networks with Best Set of Features

Credit Card Fraud Detection Using Self Organised Map

Predictive Analytics using Genetic Algorithm for Efficient Supply Chain Inventory Optimization

An Introduction to Data Mining

Grid e-services for Multi-Layer SOM Neural Network Simulation

Advanced analytics at your hands

Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network

Chapter 12 Discovering New Knowledge Data Mining

Ph.D. in Bioinformatics and Computational Biology Degree Requirements

Data Isn't Everything

Using Genetic Algorithms in Secured Business Intelligence Mobile Applications

Proposal and Analysis of Stock Trading System Using Genetic Algorithm and Stock Back Test System

Load balancing in a heterogeneous computer system by self-organizing Kohonen network

Feature Subset Selection in Spam Detection

A Binary Model on the Basis of Imperialist Competitive Algorithm in Order to Solve the Problem of Knapsack 1-0

Data Mining Applications in Higher Education

NEURAL NETWORKS A Comprehensive Foundation

OPTIMUM LEARNING RATE FOR CLASSIFICATION PROBLEM

degrees of freedom and are able to adapt to the task they are supposed to do [Gupta].

Effect of Using Neural Networks in GA-Based School Timetabling

Open Access Research on Application of Neural Network in Computer Network Security Evaluation. Shujuan Jin *

Numerical Research on Distributed Genetic Algorithm with Redundant

Comparison of K-means and Backpropagation Data Mining Algorithms

DATA SECURITY BASED ON NEURAL NETWORKS

Price Prediction of Share Market using Artificial Neural Network (ANN)

Leran Wang and Tom Kazmierski

Performance Based Evaluation of New Software Testing Using Artificial Neural Network

A Study of Web Log Analysis Using Clustering Techniques

Using Data Mining for Mobile Communication Clustering and Characterization

Genetic algorithms for changing environments

Stock price prediction using genetic algorithms and evolution strategies

An Evaluation of Neural Networks Approaches used for Software Effort Estimation

FOREX TRADING PREDICTION USING LINEAR REGRESSION LINE, ARTIFICIAL NEURAL NETWORK AND DYNAMIC TIME WARPING ALGORITHMS

Visualization of Breast Cancer Data by SOM Component Planes

A Multi-level Artificial Neural Network for Residential and Commercial Energy Demand Forecast: Iran Case Study

Method of Combining the Degrees of Similarity in Handwritten Signature Authentication Using Neural Networks

Neural Networks algorithms and applications

International Journal of Computer Trends and Technology (IJCTT) volume 4 Issue 8 August 2013

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

Correspondence should be addressed to Chandra Shekhar Yadav;

POSTGRAD PLACEMENTS. Placements are an integral part of the Masters programmes, so international students will not require additional work visas.

AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.

Machine learning in financial forecasting. Haindrich Henrietta Vezér Evelin

A Parallel Processor for Distributed Genetic Algorithm with Redundant Binary Number

MS1b Statistical Data Mining

Prediction Model for Crude Oil Price Using Artificial Neural Networks

Brain-in-a-bag: creating an artificial brain

GA as a Data Optimization Tool for Predictive Analytics

IBM SPSS Neural Networks 22

Transcription:

Using Artificial Intelligence and Machine Learning Techniques. Some Preliminary Ideas. Presentation to CWiPP 1/8/2013 ICOSS Mark Tomlinson

Artificial Intelligence Models Very experimental, but timely? Big data Blurring of boundaries between quant/qual Neural networks Gøsta s Brain project welfare regime theory Self Organising Maps (SOMs) Initial experiments on child self-esteem Genetic algorithms

A simple neural network (perceptron) Hidden layer I n p u t s O u t p u t s Neurons Weights

Neuron fires when the sigmoid function reaches a threshold based on the sum of the weighted inputs y=1/(1+e -x )

Neural networks to simulate Gøsta s brain A network with sufficient hidden nodes can be trained to be 100% accurate over a particular dataset Train network to predict Esping-Andersen typology perfectly in 1980 using macro policy data (Scruggs s decommodification data) Use data from different years to predict regimes using the same network Check for stability

Results using Scruggs decommodification data with a threshold of 0.9 5000 iterations on 1980 data LIB=Liberal CON=Conservative SOC=Social Democrat Country 1971 1980 1990 2000 AU LIB LIB LIB LIB AT CON CON CON CON BE CON CON CON CON CA LIB LIB LIB LIB (0.872) DK SOC SOC SOC SOC (0.864) FI SOC SOC SOC SOC (0.640) FR CON CON CON (0.820) LIB DE CON CON CON CON IR LIB LIB LIB LIB IT CON CON CON (0.891) CON JP LIB LIB LIB LIB NL SOC SOC SOC SOC NO SOC SOC SOC SOC NZ LIB LIB LIB LIB SW SOC SOC SOC SOC (0.806) LIB (0.450) CH CON (0.888) LIB LIB LIB UK LIB LIB LIB LIB US LIB LIB LIB LIB

Other possible directions How to interpret weights in the network This is very difficult Statistical properties of the weights Develop pseudo-t statistics Significance of different weights and their interpretation Use of networks as a method of classifying social science data based on recategorising new data Children at risk of future problems Use of neural networks as an alternative to regression

Self Organising Maps Kohonen network Not a neural network in the sense above Used to cluster large amounts of data Dependent variables can be incorporated that use the clustering to determine the outcomes simultaneously

A vector (set of data points) is mapped onto a lattice via a network of weights

Already been used to map quality of life data Matlab can produce these charts S. Kaski, T. Kohonen 1996

British Youth Panel self-esteem data BYP around 1000 teenagers wave D (1994) Part of the BHPS and collected annually Data on self-esteem (4 point scales) I have good qualities I feel useless at times I m a likeable person Enjoy exercise to keep fit Feel a failure Feel I am no good Health is good

SOM of self-esteem using BYP 1994 (using R) Average Jo No good, useless, failures Fit, good qualities and likeable Healthy

Count plots using R Average Jo No good, useless, failures Fit, good qualities and likeable Healthy

Genetic Algorithms (GAs) Used to determine optimal fitness based on a genetic coding A population is set up based on characteristics of agents coded as chromosomes with a collection of genes ABCFDSCFDCFDRT etc A fitness function is determined based on the genetic makeup of the chromosome (could be derived from NNs or regression equations) The chromosomes mate with each other at random using a set of routines where sections of genetic material are exchanged and mutations can occur. Choice of chromosome is based on fitness (higher fitness = higher probability of selection) Thus the population expands The fitness function is rerun and the weaker chromosomes are eliminated Eventually an optimal population appears that is stable This represents the ideal set of agents

Example Could develop a model of well-being (i.e. fitness) based on the interaction of several characteristics using real data (perhaps using a neural network architecture based on several interactions via a hidden layer) Code these characteristics into a chromosome pool based on the data Run the GA on the data Compare the fitness of the real population against the ideal one

Conclusions This is just scratching the surface of the methods available It is not entirely clear whether these methods have advantages over more traditional statistical methods However there is promising scope: NNs are more efficient at prediction than regression, but the weights are difficult to interpret NNs can deal with interactions better than standard regression SOMs have been proven to be efficient at determining clusters with very large data arrays, but in scientific applications such as chemistry and biology, and also bibliometrics and library science GAs are very efficient at solving certain problems (such as the travelling salesman problem) Software is generally designed for use by, for example, bioinformatics and computer scientist types and not always so accessible to social scientists (R, Matlab, Weka)