Multi-modal Human-Computer Interaction. Attila Fazekas.
|
|
|
- Christian George
- 9 years ago
- Views:
Transcription
1 Multi-modal Human-Computer Interaction Attila Fazekas Szeged, 04 July 2006
2 Debrecen Big Church Multi-modal Human-Computer Interaction - 2
3 University of Debrecen Main Building Multi-modal Human-Computer Interaction - 3
4 Road Map Defining multimodal interaction Main categories and history of multimodal systems Benefits of multimodal interfaces Examples Theoretical background Multi-modal Human-Computer Interaction - 4
5 Defining Multimodal Interaction 1 There are two views on multimodal interaction: The first focuses on the human side: perception and control. There the word modality refers to human input and output channels. 1 L. Schomaker et all, A Taxonomy of Multimodal Interaction in the Human Information Processing System. A Report of the Espirit Basic Research Action 8579 MIAMI. February, 1995 Multi-modal Human-Computer Interaction - 5
6 The second view focuses on using two or more computer input or output modalities to build system that make synergistic use of parallel input or output of these modalities. Multi-modal Human-Computer Interaction - 6
7 Multimodal Interaction: A Human-Centered View 2 The focus is on multimodal perception and control, that is, human input and output channels. Perception means the process of transforming sensory information to higher-level representation. 2 L. Schomaker et all, A Taxonomy of Multimodal Interaction in the Human Information Processing System. A Report of the Espirit Basic Research Action 8579 MIAMI. February, 1995 Multi-modal Human-Computer Interaction - 7
8 The Modalities From a Neurobiological Point of View 3 We can divide the modalities in seven groups Internal chemical (blood oxygen, glucose, ph) External chemical (taste, smell) Somatic senses (touch,pressure, temperature, pain) 3 E.R. Kandel and J.R. Schwartz, Principles of Neural Sciencies. Elsevier Science Publisher, Multi-modal Human-Computer Interaction - 8
9 Muscle sense (stretch,tension, join position) Sense of balance Hearing Vision Multi-modal Human-Computer Interaction - 9
10 Multimodal Interaction: A System-Centered View 4 In computer science multimodal user interfaces have been defined in many ways. Chatty gives a summary of definitions for multimodal interaction by explaining that most authors defined systems that 4 S. Chatty, Extending a graphical toolkit for two-handed interaction, ACM UIST 94 Symposium on User Interface Software and Technology, ACM Press, 1994, Multi-modal Human-Computer Interaction - 10
11 multiple input devices (multi-sensor interaction), multiple interpretations of input issued through a single device. Chatty s explanation of multimodal interaction is the one that most computer scientist use. With the term multimodal user interface they mean a system that accepts many different inputs that are combined in a meaningful way. Multi-modal Human-Computer Interaction - 11
12 Definition of the Multimodality 5 Multimodality is the capacity of the system to communicate with a user along different types of communication channels and to extract and convey meaning automatically. 5 L. Nigay and J. Coutaz, A design space for multimodal systems: concurrent processing and data fusion. Human Factors in Computer Systems, INTERCHI 93 Conference Proceedings, ACM Press, 1993, Multi-modal Human-Computer Interaction - 12
13 Both multimedia and multimodal systems use multiple communication channels. But a multimodal system strives for meaning. For example, an electronic mail system that supports voice and video clips is not multimodal if it only transfer them and does not interpret the inputs. Multi-modal Human-Computer Interaction - 13
14 Two Main Categories of Multimodal Systems The goal is to use the computer as a tool. The computer as a dialogue partner. Multi-modal Human-Computer Interaction - 14
15 The History of Multimodal User Interfaces 6 Morton Heiling s Sensorama. Virtual reality systems are also quite different from multimodal user interfaces. Bolt s Put-That-There system. In this system the user could move objects on screen by pointing and speaking. 6 R. Raisamo, Multimodal Human-Computer Interaction: a constructive and empirical study, Academic Dissertation, University of Tampere, Tampere, Multi-modal Human-Computer Interaction - 15
16 CUBRICON is a system that uses mouse pointing and speech. Oviatt presented a multimodal system for dynamic interactive maps. Digital Smart Kiosk. Multi-modal Human-Computer Interaction - 16
17 Benefits of Multimodal Interfaces 7 Efficiency follows from using each modality for the task that it is best suited for. Redundancy increases the likelihood that communication proceeds smoothly because there are many simultaneous references to the same issue. 7 M.T. Maybury and W. Wahlster (Eds.), Readings in Intelligent User Interfaces, Morgan Kaufmann Publisher, Multi-modal Human-Computer Interaction - 17
18 Perceptability increas when the tasks are facilitated in spatial context. Naturalness follows from the free choice of modalities and may result in a humancomputer communication that is close to human-human communication. Accuracy increases when another modality can indicate an object more accurately than the main modality. Multi-modal Human-Computer Interaction - 18
19 Synergy occurs when one channel of communication can help refine imprecision, modify the meaning, or resolve ambihuities in another channel. Multi-modal Human-Computer Interaction - 19
20 Applications T-Com access point Mobile telecommunication Hands-free devices to computers Using in a car Interactive information panel Multi-modal Human-Computer Interaction - 20
21 Multi-modal Human-Computer Interaction - 21
22 Multi-modal Human-Computer Interaction - 22
23 Multi-modal Human-Computer Interaction - 23
24 Multi-modal Human-Computer Interaction - 24
25 Multi-modal Human-Computer Interaction - 25
26 Multi-modal Human-Computer Interaction - 26
27 Multi-modal Human-Computer Interaction - 27
28 Multi-modal Human-Computer Interaction - 28
29 Theoretical Background Multi-modal Human-Computer Interaction - 29
30 Introduction Faces are our interfaces in our emotional and social live. Automatic analysis of facial gestures is rapidly becoming an area of interest in multi-modal human-computer interaction. Basic goal of this area of research is a human-like description of shown facial expression. Multi-modal Human-Computer Interaction - 30
31 The solution of this problem can be based on the idea of some face detection approaches. Multi-modal Human-Computer Interaction - 31
32 Related Research Topics Face detection (one face/image) Face localization (more faces/image) Facial feature detection (eyes, etc.) mouth, Facial expression recognition Face recognition, face identification Multi-modal Human-Computer Interaction - 32
33 Face tracking Multi-modal Human-Computer Interaction - 33
34 Problems of the Face Detection Pose: The images of a face vary due to the relative camera-face pose. Presence or absence of structural components (beards, mustaches, glasses etc.). Facial expression: The appearance of faces are directly affected by the facial expression. Multi-modal Human-Computer Interaction - 34
35 Occlusion: Faces may be partially occluded by other objects. Image orientation: Face images vary for different rotations about the optical axis of the camera. Imaging conditions (lighting, background, camera characteristics). Multi-modal Human-Computer Interaction - 35
36 Detecting Faces in a Single Image Knowledge-based methods (G. Yang and T.S. Huang, 1994). Feature invariant approaches (T. K. Leung, M. C. Burl, and P. Perona, 1995), (K. C. Yow and R. Cipolla, 1996). Template matching methods (A. Lanitis, C. J. Taylor, and T. F. Cootes, 1995). Multi-modal Human-Computer Interaction - 36
37 Appearance-based methods (E. Osuna, R. Freund, and F. Girosi, 1997), (A. Fazekas, C. Kotropoulos, I. Pitas, 2002). Multi-modal Human-Computer Interaction - 37
38 Detecting Faces in a Single Image Scanning of the picture by a running window in a multiresolution pyramid. Normalize of the window. Hide some parts of the face. Normalize of the local variance of the brightness on the picture. Multi-modal Human-Computer Interaction - 38
39 Equalization of the histogram. Localization of the face (decision). Multi-modal Human-Computer Interaction - 39
40 Face Gesture Recognition like Binary Classification Problem Let us consider a set of the facial pictures. Let us set up a finite system of some features related the pictures. It is known any pictures is related to only one class: face with the given gesture, face without the given gesture. Multi-modal Human-Computer Interaction - 40
41 The problem to find a method to determine the class of the examined picture. One possible way to solve this problem: Support Vector Machine. Multi-modal Human-Computer Interaction - 41
42 Support Vector Machine Statistical learning from examples aims at selecting from a given set of functions {f α (x) α Λ}, the one which predicts best the correct response. This selection is based on the observation of l pairs that build the training set: (x 1, y 1 ),..., (x l, y l ), x i R m, y i {+1, 1} Multi-modal Human-Computer Interaction - 42
43 which contains input vectors x i and the associated ground truth given by an external supervisor. Let the response of the learning machine f α (x) belongs to a set of indicator functions {f α (x) x R m, α Λ}. If we define the loss-function: L(y, f α (x)) = { 0, if y = fα (x), 1, if y f α (x). Multi-modal Human-Computer Interaction - 43
44 The expected value of the loss is given by: R(α) = L(y, f α (x))p(x, y)dxdy, where p(x, y) is the joint probability density function of random variables x and y. We would like to find the function f α0 (x) which minimizes the risk function R(α). The basic idea of SVM to construct the optimal separating hyperplane. Multi-modal Human-Computer Interaction - 44
45 Suppose that the training data can be separated by a hyperplane, f α (x) = α T x + b = 0, such that: y i (α T x i + b) 1, i = 1, 2,..., l where α is the normal to the hyperplane. For the linearly separable case, SVM simply seeks for the separating hyperplane with the largest margin. Multi-modal Human-Computer Interaction - 45
46 For linearly nonseparable data, by mapping the input vectors, which are the elements of the training set, into a high-dimensional feature space through so-called kernel function. We construct the optimal separating hyperplane in the feature space to get a binary decision. Multi-modal Human-Computer Interaction - 46
47 Experimental Results For all experiments the package SVMLight developed by T. Joachims was used. For complete test, several routines have been added to the original toolbox. The database recorded by our institute was used. Multi-modal Human-Computer Interaction - 47
48 Training set of 40 images (20 faces with the given gesture, 20 faces without the given gesture.). All images are recorded in 256 grey levels. They are of dimension The procedure for collecting face patterns is as follows. Multi-modal Human-Computer Interaction - 48
49 A rectangle part of dimension pixels has been manually determined that includes the actual face. This area has been subsampled four times. At each subsampling, non-overlapping regions of 2 2 pixels are replaced by their average. Multi-modal Human-Computer Interaction - 49
50 The training patterns of dimension are built. The class label +1 has been appended to each pattern. Similarly, 20 non-face patterns have been collected from images in the same way, and labeled 1. Multi-modal Human-Computer Interaction - 50
51 Facial Gesture Database Surprising face Smiling face Sad face Angry face Multi-modal Human-Computer Interaction - 51
52 Classification Error on Facial Gesture Database Angry Happy Sad Serial Suprised 22.4% 10.3% 11.8% 9.4% 18.9% Multi-modal Human-Computer Interaction - 52
International Journal of Advanced Information in Arts, Science & Management Vol.2, No.2, December 2014
Efficient Attendance Management System Using Face Detection and Recognition Arun.A.V, Bhatath.S, Chethan.N, Manmohan.C.M, Hamsaveni M Department of Computer Science and Engineering, Vidya Vardhaka College
Multisensor Data Fusion and Applications
Multisensor Data Fusion and Applications Pramod K. Varshney Department of Electrical Engineering and Computer Science Syracuse University 121 Link Hall Syracuse, New York 13244 USA E-mail: [email protected]
Search Taxonomy. Web Search. Search Engine Optimization. Information Retrieval
Information Retrieval INFO 4300 / CS 4300! Retrieval models Older models» Boolean retrieval» Vector Space model Probabilistic Models» BM25» Language models Web search» Learning to Rank Search Taxonomy!
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
A secure face tracking system
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 10 (2014), pp. 959-964 International Research Publications House http://www. irphouse.com A secure face tracking
FACE RECOGNITION BASED ATTENDANCE MARKING SYSTEM
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 2, February 2014,
Face Model Fitting on Low Resolution Images
Face Model Fitting on Low Resolution Images Xiaoming Liu Peter H. Tu Frederick W. Wheeler Visualization and Computer Vision Lab General Electric Global Research Center Niskayuna, NY, 1239, USA {liux,tu,wheeler}@research.ge.com
HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT
International Journal of Scientific and Research Publications, Volume 2, Issue 4, April 2012 1 HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT Akhil Gupta, Akash Rathi, Dr. Y. Radhika
Data, Measurements, Features
Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are
Face detection is a process of localizing and extracting the face region from the
Chapter 4 FACE NORMALIZATION 4.1 INTRODUCTION Face detection is a process of localizing and extracting the face region from the background. The detected face varies in rotation, brightness, size, etc.
AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION
AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION Saurabh Asija 1, Rakesh Singh 2 1 Research Scholar (Computer Engineering Department), Punjabi University, Patiala. 2 Asst.
Face Recognition For Remote Database Backup System
Face Recognition For Remote Database Backup System Aniza Mohamed Din, Faudziah Ahmad, Mohamad Farhan Mohamad Mohsin, Ku Ruhana Ku-Mahamud, Mustafa Mufawak Theab 2 Graduate Department of Computer Science,UUM
Lecture 2, Human cognition
Human Cognition An important foundation for the design of interfaces is a basic theory of human cognition The information processing paradigm (in its most simple form). Human Information Processing The
HAND GESTURE BASEDOPERATINGSYSTEM CONTROL
HAND GESTURE BASEDOPERATINGSYSTEM CONTROL Garkal Bramhraj 1, palve Atul 2, Ghule Supriya 3, Misal sonali 4 1 Garkal Bramhraj mahadeo, 2 Palve Atule Vasant, 3 Ghule Supriya Shivram, 4 Misal Sonali Babasaheb,
Emotion Detection from Speech
Emotion Detection from Speech 1. Introduction Although emotion detection from speech is a relatively new field of research, it has many potential applications. In human-computer or human-human interaction
The Implementation of Face Security for Authentication Implemented on Mobile Phone
The Implementation of Face Security for Authentication Implemented on Mobile Phone Emir Kremić *, Abdulhamit Subaşi * * Faculty of Engineering and Information Technology, International Burch University,
Spam detection with data mining method:
Spam detection with data mining method: Ensemble learning with multiple SVM based classifiers to optimize generalization ability of email spam classification Keywords: ensemble learning, SVM classifier,
Efficient Background Subtraction and Shadow Removal Technique for Multiple Human object Tracking
ISSN: 2321-7782 (Online) Volume 1, Issue 7, December 2013 International Journal of Advance Research in Computer Science and Management Studies Research Paper Available online at: www.ijarcsms.com Efficient
Template-based Eye and Mouth Detection for 3D Video Conferencing
Template-based Eye and Mouth Detection for 3D Video Conferencing Jürgen Rurainsky and Peter Eisert Fraunhofer Institute for Telecommunications - Heinrich-Hertz-Institute, Image Processing Department, Einsteinufer
Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition
Send Orders for Reprints to [email protected] The Open Electrical & Electronic Engineering Journal, 2014, 8, 599-604 599 Open Access A Facial Expression Recognition Algorithm Based on Local Binary
Simultaneous Gamma Correction and Registration in the Frequency Domain
Simultaneous Gamma Correction and Registration in the Frequency Domain Alexander Wong [email protected] William Bishop [email protected] Department of Electrical and Computer Engineering University
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Scott Pion and Lutz Hamel Abstract This paper presents the results of a series of analyses performed on direct mail
FPGA Implementation of Human Behavior Analysis Using Facial Image
RESEARCH ARTICLE OPEN ACCESS FPGA Implementation of Human Behavior Analysis Using Facial Image A.J Ezhil, K. Adalarasu Department of Electronics & Communication Engineering PSNA College of Engineering
Local features and matching. Image classification & object localization
Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to
Subspace Analysis and Optimization for AAM Based Face Alignment
Subspace Analysis and Optimization for AAM Based Face Alignment Ming Zhao Chun Chen College of Computer Science Zhejiang University Hangzhou, 310027, P.R.China [email protected] Stan Z. Li Microsoft
Taking Inverse Graphics Seriously
CSC2535: 2013 Advanced Machine Learning Taking Inverse Graphics Seriously Geoffrey Hinton Department of Computer Science University of Toronto The representation used by the neural nets that work best
Introduction to Pattern Recognition
Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)
Mathematical Model Based Total Security System with Qualitative and Quantitative Data of Human
Int Jr of Mathematics Sciences & Applications Vol3, No1, January-June 2013 Copyright Mind Reader Publications ISSN No: 2230-9888 wwwjournalshubcom Mathematical Model Based Total Security System with Qualitative
Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.
Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table
Mean-Shift Tracking with Random Sampling
1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of
Visual-based ID Verification by Signature Tracking
Visual-based ID Verification by Signature Tracking Mario E. Munich and Pietro Perona California Institute of Technology www.vision.caltech.edu/mariomu Outline Biometric ID Visual Signature Acquisition
Neural Network based Vehicle Classification for Intelligent Traffic Control
Neural Network based Vehicle Classification for Intelligent Traffic Control Saeid Fazli 1, Shahram Mohammadi 2, Morteza Rahmani 3 1,2,3 Electrical Engineering Department, Zanjan University, Zanjan, IRAN
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier D.Nithya a, *, V.Suganya b,1, R.Saranya Irudaya Mary c,1 Abstract - This paper presents,
Face Recognition in Low-resolution Images by Using Local Zernike Moments
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie
Lecture 9: Introduction to Pattern Analysis
Lecture 9: Introduction to Pattern Analysis g Features, patterns and classifiers g Components of a PR system g An example g Probability definitions g Bayes Theorem g Gaussian densities Features, patterns
Cell Phone based Activity Detection using Markov Logic Network
Cell Phone based Activity Detection using Markov Logic Network Somdeb Sarkhel [email protected] 1 Introduction Mobile devices are becoming increasingly sophisticated and the latest generation of smart
Support Vector Machines with Clustering for Training with Very Large Datasets
Support Vector Machines with Clustering for Training with Very Large Datasets Theodoros Evgeniou Technology Management INSEAD Bd de Constance, Fontainebleau 77300, France [email protected] Massimiliano
Automatic Evaluation Software for Contact Centre Agents voice Handling Performance
International Journal of Scientific and Research Publications, Volume 5, Issue 1, January 2015 1 Automatic Evaluation Software for Contact Centre Agents voice Handling Performance K.K.A. Nipuni N. Perera,
In Proceedings of the Eleventh Conference on Biocybernetics and Biomedical Engineering, pages 842-846, Warsaw, Poland, December 2-4, 1999
In Proceedings of the Eleventh Conference on Biocybernetics and Biomedical Engineering, pages 842-846, Warsaw, Poland, December 2-4, 1999 A Bayesian Network Model for Diagnosis of Liver Disorders Agnieszka
The Visual Internet of Things System Based on Depth Camera
The Visual Internet of Things System Based on Depth Camera Xucong Zhang 1, Xiaoyun Wang and Yingmin Jia Abstract The Visual Internet of Things is an important part of information technology. It is proposed
Introduction to Support Vector Machines. Colin Campbell, Bristol University
Introduction to Support Vector Machines Colin Campbell, Bristol University 1 Outline of talk. Part 1. An Introduction to SVMs 1.1. SVMs for binary classification. 1.2. Soft margins and multi-class classification.
Emotion Recognition Using Blue Eyes Technology
Emotion Recognition Using Blue Eyes Technology Prof. Sudan Pawar Shubham Vibhute Ashish Patil Vikram More Gaurav Sane Abstract We cannot measure the world of science in terms of progress and fact of development.
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 69 Class Project Report Junhua Mao and Lunbo Xu University of California, Los Angeles [email protected] and lunbo
Journal of Industrial Engineering Research. Adaptive sequence of Key Pose Detection for Human Action Recognition
IWNEST PUBLISHER Journal of Industrial Engineering Research (ISSN: 2077-4559) Journal home page: http://www.iwnest.com/aace/ Adaptive sequence of Key Pose Detection for Human Action Recognition 1 T. Sindhu
Image Normalization for Illumination Compensation in Facial Images
Image Normalization for Illumination Compensation in Facial Images by Martin D. Levine, Maulin R. Gandhi, Jisnu Bhattacharyya Department of Electrical & Computer Engineering & Center for Intelligent Machines
Multivariate data visualization using shadow
Proceedings of the IIEEJ Ima and Visual Computing Wor Kuching, Malaysia, Novembe Multivariate data visualization using shadow Zhongxiang ZHENG Suguru SAITO Tokyo Institute of Technology ABSTRACT When visualizing
Comparing Support Vector Machines, Recurrent Networks and Finite State Transducers for Classifying Spoken Utterances
Comparing Support Vector Machines, Recurrent Networks and Finite State Transducers for Classifying Spoken Utterances Sheila Garfield and Stefan Wermter University of Sunderland, School of Computing and
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. [email protected]
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE 1 S.Manikandan, 2 S.Abirami, 2 R.Indumathi, 2 R.Nandhini, 2 T.Nanthini 1 Assistant Professor, VSA group of institution, Salem. 2 BE(ECE), VSA
Outline. Fundamentals. Rendering (of 3D data) Data mappings. Evaluation Interaction
Outline Fundamentals What is vis? Some history Design principles The visualization process Data sources and data structures Basic visual mapping approaches Rendering (of 3D data) Scalar fields (isosurfaces
Car Insurance. Havránek, Pokorný, Tomášek
Car Insurance Havránek, Pokorný, Tomášek Outline Data overview Horizontal approach + Decision tree/forests Vertical (column) approach + Neural networks SVM Data overview Customers Viewed policies Bought
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
Application of Support Vector Machines to Fault Diagnosis and Automated Repair
Application of Support Vector Machines to Fault Diagnosis and Automated Repair C. Saunders and A. Gammerman Royal Holloway, University of London, Egham, Surrey, England {C.Saunders,A.Gammerman}@dcs.rhbnc.ac.uk
Colorado School of Mines Computer Vision Professor William Hoff
Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Introduction to 2 What is? A process that produces from images of the external world a description
Lecture 6: Classification & Localization. boris. [email protected]
Lecture 6: Classification & Localization boris. [email protected] 1 Agenda ILSVRC 2014 Overfeat: integrated classification, localization, and detection Classification with Localization Detection. 2 ILSVRC-2014
Author Gender Identification of English Novels
Author Gender Identification of English Novels Joseph Baena and Catherine Chen December 13, 2013 1 Introduction Machine learning algorithms have long been used in studies of authorship, particularly in
Predict Influencers in the Social Network
Predict Influencers in the Social Network Ruishan Liu, Yang Zhao and Liuyu Zhou Email: rliu2, yzhao2, [email protected] Department of Electrical Engineering, Stanford University Abstract Given two persons
Facial Biometric Templates and Aging: Problems and Challenges for Artificial Intelligence
Facial Biometric Templates and Aging: Problems and Challenges for Artificial Intelligence Andreas Lanitis Department of Multimedia and Graphic Arts, Cyprus University of Technology P.O Box 50329, Lemesos,
Course 395: Machine Learning
Course 395: Machine Learning Lecturers: Maja Pantic ([email protected]) Stavros Petridis ([email protected]) Goal (Lectures): To present basic theoretical concepts and key algorithms that form the core
Object Recognition and Template Matching
Object Recognition and Template Matching Template Matching A template is a small image (sub-image) The goal is to find occurrences of this template in a larger image That is, you want to find matches of
Machine Learning. CS494/594, Fall 2007 11:10 AM 12:25 PM Claxton 205. Slides adapted (and extended) from: ETHEM ALPAYDIN The MIT Press, 2004
CS494/594, Fall 2007 11:10 AM 12:25 PM Claxton 205 Machine Learning Slides adapted (and extended) from: ETHEM ALPAYDIN The MIT Press, 2004 [email protected] http://www.cmpe.boun.edu.tr/~ethem/i2ml What
Active Learning SVM for Blogs recommendation
Active Learning SVM for Blogs recommendation Xin Guan Computer Science, George Mason University Ⅰ.Introduction In the DH Now website, they try to review a big amount of blogs and articles and find the
VISUAL RECOGNITION OF HAND POSTURES FOR INTERACTING WITH VIRTUAL ENVIRONMENTS
VISUAL RECOGNITION OF HAND POSTURES FOR INTERACTING WITH VIRTUAL ENVIRONMENTS Radu Daniel VATAVU Ştefan-Gheorghe PENTIUC "Stefan cel Mare" University of Suceava str.universitatii nr.13, RO-720229 Suceava
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE. Venu Govindaraju
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE Venu Govindaraju BIOMETRICS DOCUMENT ANALYSIS PATTERN RECOGNITION 8/24/2015 ICDAR- 2015 2 Towards a Globally Optimal Approach for Learning Deep Unsupervised
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin
CS231M Project Report - Automated Real-Time Face Tracking and Blending
CS231M Project Report - Automated Real-Time Face Tracking and Blending Steven Lee, [email protected] June 6, 2015 1 Introduction Summary statement: The goal of this project is to create an Android
Statistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.
Lecture Machine Learning Milos Hauskrecht [email protected] 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht [email protected] 539 Sennott
LCs for Binary Classification
Linear Classifiers A linear classifier is a classifier such that classification is performed by a dot product beteen the to vectors representing the document and the category, respectively. Therefore it
MACHINE VISION MNEMONICS, INC. 102 Gaither Drive, Suite 4 Mount Laurel, NJ 08054 USA 856-234-0970 www.mnemonicsinc.com
MACHINE VISION by MNEMONICS, INC. 102 Gaither Drive, Suite 4 Mount Laurel, NJ 08054 USA 856-234-0970 www.mnemonicsinc.com Overview A visual information processing company with over 25 years experience
Machine Learning for Data Science (CS4786) Lecture 1
Machine Learning for Data Science (CS4786) Lecture 1 Tu-Th 10:10 to 11:25 AM Hollister B14 Instructors : Lillian Lee and Karthik Sridharan ROUGH DETAILS ABOUT THE COURSE Diagnostic assignment 0 is out:
Object Recognition. Selim Aksoy. Bilkent University [email protected]
Image Classification and Object Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] Image classification Image (scene) classification is a fundamental
Lecture 2: The SVM classifier
Lecture 2: The SVM classifier C19 Machine Learning Hilary 2015 A. Zisserman Review of linear classifiers Linear separability Perceptron Support Vector Machine (SVM) classifier Wide margin Cost function
Real-time Visual Tracker by Stream Processing
Real-time Visual Tracker by Stream Processing Simultaneous and Fast 3D Tracking of Multiple Faces in Video Sequences by Using a Particle Filter Oscar Mateo Lozano & Kuzahiro Otsuka presented by Piotr Rudol
CREATING EMOTIONAL EXPERIENCES THAT ENGAGE, INSPIRE, & CONVERT
CREATING EMOTIONAL EXPERIENCES THAT ENGAGE, INSPIRE, & CONVERT Presented by Tim Llewellynn (CEO & Co-founder) Presented 14 th March 2013 2 Agenda Agenda and About nviso Primer on Emotions and Decision
Support Vector Machine. Tutorial. (and Statistical Learning Theory)
Support Vector Machine (and Statistical Learning Theory) Tutorial Jason Weston NEC Labs America 4 Independence Way, Princeton, USA. [email protected] 1 Support Vector Machines: history SVMs introduced
A Real Time Hand Tracking System for Interactive Applications
A Real Time Hand Tracking System for Interactive Applications Siddharth Swarup Rautaray Indian Institute of Information Technology Allahabad ABSTRACT In vision based hand tracking systems color plays an
Edge tracking for motion segmentation and depth ordering
Edge tracking for motion segmentation and depth ordering P. Smith, T. Drummond and R. Cipolla Department of Engineering University of Cambridge Cambridge CB2 1PZ,UK {pas1001 twd20 cipolla}@eng.cam.ac.uk
Open-Set Face Recognition-based Visitor Interface System
Open-Set Face Recognition-based Visitor Interface System Hazım K. Ekenel, Lorant Szasz-Toth, and Rainer Stiefelhagen Computer Science Department, Universität Karlsruhe (TH) Am Fasanengarten 5, Karlsruhe
Programming Exercise 3: Multi-class Classification and Neural Networks
Programming Exercise 3: Multi-class Classification and Neural Networks Machine Learning November 4, 2011 Introduction In this exercise, you will implement one-vs-all logistic regression and neural networks
Introduction to Computer Graphics
Introduction to Computer Graphics Torsten Möller TASC 8021 778-782-2215 [email protected] www.cs.sfu.ca/~torsten Today What is computer graphics? Contents of this course Syllabus Overview of course topics
How do non-expert users exploit simultaneous inputs in multimodal interaction?
How do non-expert users exploit simultaneous inputs in multimodal interaction? Knut Kvale, John Rugelbak and Ingunn Amdal 1 Telenor R&D, Norway [email protected], [email protected], [email protected]
Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague. http://ida.felk.cvut.
Machine Learning and Data Analysis overview Jiří Kléma Department of Cybernetics, Czech Technical University in Prague http://ida.felk.cvut.cz psyllabus Lecture Lecturer Content 1. J. Kléma Introduction,
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM ABSTRACT Luis Alexandre Rodrigues and Nizam Omar Department of Electrical Engineering, Mackenzie Presbiterian University, Brazil, São Paulo [email protected],[email protected]
Visualizing Complexity in Networks: Seeing Both the Forest and the Trees
CONNECTIONS 25(1): 37-47 2003 INSNA Visualizing Complexity in Networks: Seeing Both the Forest and the Trees Cathleen McGrath Loyola Marymount University, USA David Krackhardt The Heinz School, Carnegie
A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow
, pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices
The Future of Communication
Future Technologies I: Communication Session 2 Hannover, 3 November 2010 The Future of Communication Wolfgang Wahlster German Research Center for Artificial Intelligence Saarbrücken, Kaiserslautern, Bremen,
Electroencephalography Analysis Using Neural Network and Support Vector Machine during Sleep
Engineering, 23, 5, 88-92 doi:.4236/eng.23.55b8 Published Online May 23 (http://www.scirp.org/journal/eng) Electroencephalography Analysis Using Neural Network and Support Vector Machine during Sleep JeeEun
Handwritten Digit Recognition with a Back-Propagation Network
396 Le Cun, Boser, Denker, Henderson, Howard, Hubbard and Jackel Handwritten Digit Recognition with a Back-Propagation Network Y. Le Cun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard,
