Multimodal Emotion Recognition

Size: px
Start display at page:

Download "Multimodal Emotion Recognition"

Transcription

1 Multimodal Emotion Recognition Marco Paleari & Benoit Huet {Paleari, eurecom.fr

2 Outline Motivations Possible HCI scenarios and objectives Automatic emotion recognition Introduction Approach Audio: speech prosody Video: facial expressions Multimodal emotion recognition Conclusions 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 2

3 Motivations Why Emotions?

4 Motivations Emotions are important communication means For example howone thing is said is sometimes more important than what is said. Search results for good morning 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 4

5 Motivations Emotions are important communication means Art is a vehicle for the expression or communication of emotionsand ideas (1). Edward Munch, The Scream Francisco Hayez, The Kiss (1) Jerrold Levinson, The Oxford Handbook of Aesthetics, The Oxford University Press, 2003, p.5. 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 5

6 Motivations Human memory tends to group together events that present similar emotive meaning 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 6

7 Possible HCI Scenarios How to use Emotions?

8 Possible HCI Scenarios Emotions could be used for indexing and retrieving media happiness sadness happiness Happiness Comedy Em surprise 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 8

9 Possible HCI Scenarios Summarization sadness happiness happiness surprise Comedy Tailored Summary 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 9

10 Possible HCI Scenarios Telemedicine Gaming E-learning Ubiquitous / pervasive computing 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 10

11 Automatic Emotion Recognition Introduction

12 Automatic Emotion Recognition Different possibilities Detect people emotions: Facial Expressions Vocal Prosody Autonomous Nervous System Signals (i.e. heart rate, skin conductivity, etc.) Gestures / Posture Others Detect object emotions (music, images, etc.) Colors Tempo / Timber Others 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 13

13 Emotion Recognition From Facial Expression Duchenne1862 First studies of facial expressions Ekman& Friesel 1971 Definition of 6 universal emotions: Anger Disgust Fear Happiness Sadness Surprise 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 14

14 Emotion Recognition From Facial Expression Different techniques are possible 3D mesh mapping [Essa1995, Cohen et al. 2000] Dense motion flow [Lien 1998, Busso 2004] Feature point movement [Lien 1998, Cohen et al. 2000, Michel & Kaliouby 2003] Facial movement energy estimation [Essa 1995] 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 15

15 Emotion Recognition From Vocal Prosody Different techniques are possible Pitch and Intensity Spectral Envelop (Formants) Harmonicity analysis Linear Predictive Coefficient (LPC) Mel-Frequency Cepstral Coefficients (MFCC) Speech rate Combination of the above [Johnstone et al. 2001, Noble 2003, Busso 2004] ~23/46% ~71% 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 16

16 Inter and Intra Individual Differences AVERAGE Inter-individual variability Intra-individual variability e.g. arousal Psychological meaning Emotional cue e.g. Heart Rate, pitch, eyebrow shape Personality or Mood Dependence Day Dependence [Fiorito and Simons 1994] [Picard et al. 2001] 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 17

17 Automatic Emotion Recognition Approach

18 Experimental Setup enterface database [Martin et al. 2005] 44 characters (male and female) Each acting 6 emotions: Anger, Disgust, Fear, Happiness, Sadness and Surprised Each emotions is played 5 times by each character Resulting in 1420 videos Emotion learned on 30 characters acting 4 times each emotion Recognition is evaluated on the remainder of the database 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 19

19 Experimental Setup 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 20

20 Audio: Speech Prosody

21 Emotion Recognition From Speech Prosody Speech Feature Extraction Feature Vector Generation Classification Pitch Mean Formants Standard Deviation Energy Variance Harmonicity Min (& its position) (Harmonics/ Noise Energy Ratio) Support Vector Machines Neural Networks Hidden Markov Models Linear Prediction Coefficients Max (& its position) (LPC) [0.05, 0.25, 0.5, 0.75, 0.95] Quantiles Mel-Frequency Cepstral Coefficients (MFCC) Polynomial Regression 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 22

22 Audio Result 70,00% 60,00% 50,00% 40,00% 30,00% 20,00% 36,19% 58,84% 29,16% 50,31% 38,35% 59,85% 57,47% 66,04% 37,51% 41,63% 10,00% 19,89% 16,40% 21,79% 17,11% 0,00% anger disgust fear happiness sadness surprise average Result obtained with 1 NN Result using obtained Polynomial with & SVM Statistical using analysis Polynomial and analysis averaging results on 0.6 seconds 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 23

23 Video: Facial Expressions

24 Emotion Recognition from Facial Expressions 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 25

25 Lukas Kanade Technique This method assumes that gray values (nxnfeature window) do not change between two consecutive frames, but only shift from one position to another 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 26

26 Emotion Recognition from Facial Expressions Follow points and analyze the movement signals to extract meaningful features signal 1 signal 2 time 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 27

27 Emotion Recognition from Facial Expressions Mean Standard Deviation signal 1 signal 2 Variance Min (& its position) Max (& its position) SVM NN HMM time [0.05, 0.25, 0.5, 0.75, 0.95] Quantiles Polynomial Regression 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 28

28 Video Result 70,00% 70,00% 60,00% 60,00% 50,00% 50,00% 40,00% 40,00% 30,00% 30,00% 55,40% 36,48% 34,46% 56,97% 41,43% 20,00% 62,66% 42,55% 23,54% 30,71% 31,77% 33,49% 32,12% 10,00% 12,91% 12,84% 0,00% anger disgust fear happiness sadness surprise average Result obtained with SVM 6 SVM using (one Polynomial for each emotion) analysis 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 29

29 Example 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 30

30 Example 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 31

31 Example 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 32

32 Multimodal Emotion Recognition

33 Multimodal Emotion Recognition Feature Extraction Feature Extraction Feature Fusion Emotional Classification Emotional Classification Emotional Classification F u s i o n Emotional Estimate Emotional Estimate 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 34

34 Multimodal Result 70,00% 60,00% 50,00% 40,00% 30,00% 20,00% 54,67% 60,63% 46,77%49,95% 25,49% 52,78% 54,61% 61,09% 54,12% 26,94% 42,24% 45,29% 20,62% 22,63% 10,00% 0,00% anger disgust fear happiness sadness surprise average Feature Fusion Decision Fusion Result obtained with NN and data filtering (average on 0.6 sec). Decision Fusion results are obtained by averaging on 0.4 sec. 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 35

35 Conclusion Similar results (on average over all emotions) are obtained based on audio and video. Simple decision fusionprovides more improvement than simple feature fusion. More modalities should lead to improved emotion recognition accuracy. 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 36

36 Future Work Video Analysis Facial Feature Point Detection Classification Hidden Markov Models Multimodal Fusion Temporally controlled Multimodal Fusion 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 37

37 Multimodal Fusion Signal Quality Assessment Signal Quality Assessment Feature Extraction Feature Extraction Control Emotional Classification Emotional Classification Feature Fusion Emotional Classification F u s i o n Emotional Estimate Emotional Estimate F u s i o n Emotional Estimate 26/05/2008 Eurecom - MM Talk: Multimodal Emotion Recognition - Marco Paleari and Benoit Huet 38

38 Thank you for your attention

Emotion Detection from Speech

Emotion Detection from Speech Emotion Detection from Speech 1. Introduction Although emotion detection from speech is a relatively new field of research, it has many potential applications. In human-computer or human-human interaction

More information

Automatic Evaluation Software for Contact Centre Agents voice Handling Performance

Automatic Evaluation Software for Contact Centre Agents voice Handling Performance International Journal of Scientific and Research Publications, Volume 5, Issue 1, January 2015 1 Automatic Evaluation Software for Contact Centre Agents voice Handling Performance K.K.A. Nipuni N. Perera,

More information

engin erzin the use of speech processing applications is expected to surge in multimedia-rich scenarios

engin erzin the use of speech processing applications is expected to surge in multimedia-rich scenarios engin erzin Associate Professor Department of Computer Engineering Ph.D. Bilkent University http://home.ku.edu.tr/ eerzin eerzin@ku.edu.tr Engin Erzin s research interests include speech processing, multimodal

More information

EMOTION IN SPEECH: RECOGNITION AND APPLICATION TO CALL CENTERS

EMOTION IN SPEECH: RECOGNITION AND APPLICATION TO CALL CENTERS EMOTION IN SPEECH: RECOGNITION AND APPLICATION TO CALL CENTERS VALERY A. PETRUSHIN Andersen Consulting 3773 Willow Rd. Northbrook, IL 60062 petr@cstar.ac.com ABSTRACT The paper describes two experimental

More information

Speech Signal Processing: An Overview

Speech Signal Processing: An Overview Speech Signal Processing: An Overview S. R. M. Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati December, 2012 Prasanna (EMST Lab, EEE, IITG) Speech

More information

Emotion Recognition Using Blue Eyes Technology

Emotion Recognition Using Blue Eyes Technology Emotion Recognition Using Blue Eyes Technology Prof. Sudan Pawar Shubham Vibhute Ashish Patil Vikram More Gaurav Sane Abstract We cannot measure the world of science in terms of progress and fact of development.

More information

CREATING EMOTIONAL EXPERIENCES THAT ENGAGE, INSPIRE, & CONVERT

CREATING EMOTIONAL EXPERIENCES THAT ENGAGE, INSPIRE, & CONVERT CREATING EMOTIONAL EXPERIENCES THAT ENGAGE, INSPIRE, & CONVERT Presented by Tim Llewellynn (CEO & Co-founder) Presented 14 th March 2013 2 Agenda Agenda and About nviso Primer on Emotions and Decision

More information

Integration of Negative Emotion Detection into a VoIP Call Center System

Integration of Negative Emotion Detection into a VoIP Call Center System Integration of Negative Detection into a VoIP Call Center System Tsang-Long Pao, Chia-Feng Chang, and Ren-Chi Tsao Department of Computer Science and Engineering Tatung University, Taipei, Taiwan Abstract

More information

APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA

APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA Tuija Niemi-Laitinen*, Juhani Saastamoinen**, Tomi Kinnunen**, Pasi Fränti** *Crime Laboratory, NBI, Finland **Dept. of Computer

More information

Separation and Classification of Harmonic Sounds for Singing Voice Detection

Separation and Classification of Harmonic Sounds for Singing Voice Detection Separation and Classification of Harmonic Sounds for Singing Voice Detection Martín Rocamora and Alvaro Pardo Institute of Electrical Engineering - School of Engineering Universidad de la República, Uruguay

More information

Establishing the Uniqueness of the Human Voice for Security Applications

Establishing the Uniqueness of the Human Voice for Security Applications Proceedings of Student/Faculty Research Day, CSIS, Pace University, May 7th, 2004 Establishing the Uniqueness of the Human Voice for Security Applications Naresh P. Trilok, Sung-Hyuk Cha, and Charles C.

More information

FPGA Implementation of Human Behavior Analysis Using Facial Image

FPGA Implementation of Human Behavior Analysis Using Facial Image RESEARCH ARTICLE OPEN ACCESS FPGA Implementation of Human Behavior Analysis Using Facial Image A.J Ezhil, K. Adalarasu Department of Electronics & Communication Engineering PSNA College of Engineering

More information

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction : A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction Urmila Shrawankar Dept. of Information Technology Govt. Polytechnic, Nagpur Institute Sadar, Nagpur 440001 (INDIA)

More information

School Class Monitoring System Based on Audio Signal Processing

School Class Monitoring System Based on Audio Signal Processing C. R. Rashmi 1,,C.P.Shantala 2 andt.r.yashavanth 3 1 Department of CSE, PG Student, CIT, Gubbi, Tumkur, Karnataka, India. 2 Department of CSE, Vice Principal & HOD, CIT, Gubbi, Tumkur, Karnataka, India.

More information

Recent advances in Digital Music Processing and Indexing

Recent advances in Digital Music Processing and Indexing Recent advances in Digital Music Processing and Indexing Acoustics 08 warm-up TELECOM ParisTech Gaël RICHARD Telecom ParisTech (ENST) www.enst.fr/~grichard/ Content Introduction and Applications Components

More information

BLIND SOURCE SEPARATION OF SPEECH AND BACKGROUND MUSIC FOR IMPROVED SPEECH RECOGNITION

BLIND SOURCE SEPARATION OF SPEECH AND BACKGROUND MUSIC FOR IMPROVED SPEECH RECOGNITION BLIND SOURCE SEPARATION OF SPEECH AND BACKGROUND MUSIC FOR IMPROVED SPEECH RECOGNITION P. Vanroose Katholieke Universiteit Leuven, div. ESAT/PSI Kasteelpark Arenberg 10, B 3001 Heverlee, Belgium Peter.Vanroose@esat.kuleuven.ac.be

More information

Recognition of Facial Expression Using AAM and Optimal Neural Networks

Recognition of Facial Expression Using AAM and Optimal Neural Networks International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-4, Issue-4 E-ISSN: 2347-2693 Recognition of Facial Expression Using AAM and Optimal Neural Networks J.Suneetha

More information

Video Affective Content Recognition Based on Genetic Algorithm Combined HMM

Video Affective Content Recognition Based on Genetic Algorithm Combined HMM Video Affective Content Recognition Based on Genetic Algorithm Combined HMM Kai Sun and Junqing Yu Computer College of Science & Technology, Huazhong University of Science & Technology, Wuhan 430074, China

More information

Annotated bibliographies for presentations in MUMT 611, Winter 2006

Annotated bibliographies for presentations in MUMT 611, Winter 2006 Stephen Sinclair Music Technology Area, McGill University. Montreal, Canada Annotated bibliographies for presentations in MUMT 611, Winter 2006 Presentation 4: Musical Genre Similarity Aucouturier, J.-J.

More information

Multimodal Affect Recognition in Intelligent Tutoring Systems

Multimodal Affect Recognition in Intelligent Tutoring Systems Multimodal Affect Recognition in Intelligent Tutoring Systems Ntombikayise Banda and Peter Robinson Computer Laboratory, University of Cambridge 15 JJ Thompson Avenue, Cambridge, CB3 0FD, UK Abstract.

More information

Sense Making in an IOT World: Sensor Data Analysis with Deep Learning

Sense Making in an IOT World: Sensor Data Analysis with Deep Learning Sense Making in an IOT World: Sensor Data Analysis with Deep Learning Natalia Vassilieva, PhD Senior Research Manager GTC 2016 Deep learning proof points as of today Vision Speech Text Other Search & information

More information

CITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學

CITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學 CITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學 Audio Musical Genre Classification using Convolutional Neural Networks and Pitch and Tempo Transformations 使 用 捲 積 神 經 網 絡 及 聲 調 速 度 轉 換 的 音 頻 音 樂 流 派 分 類 研 究 Submitted

More information

Artificial Neural Network for Speech Recognition

Artificial Neural Network for Speech Recognition Artificial Neural Network for Speech Recognition Austin Marshall March 3, 2005 2nd Annual Student Research Showcase Overview Presenting an Artificial Neural Network to recognize and classify speech Spoken

More information

neuralnetworkwasthentestedforretention,extrapolationandrejectionability.success rateswereabout88%forretention,73%forextrapolation,and79%forrejection.

neuralnetworkwasthentestedforretention,extrapolationandrejectionability.success rateswereabout88%forretention,73%forextrapolation,and79%forrejection. CAR-TR-721 CS-TR-3304 HumanEmotionRecognitionfromMotion UsingaRadialBasisFunction NetworkArchitecture MarkRosenblumDACA76-92-C-0009 LarryS.Davis YaserYacoob June1994 CenterforAutomationResearch CollegePark,MD20742-3275

More information

Speech Recognition on Cell Broadband Engine UCRL-PRES-223890

Speech Recognition on Cell Broadband Engine UCRL-PRES-223890 Speech Recognition on Cell Broadband Engine UCRL-PRES-223890 Yang Liu, Holger Jones, John Johnson, Sheila Vaidya (Lawrence Livermore National Laboratory) Michael Perrone, Borivoj Tydlitat, Ashwini Nanda

More information

Audio Content Analysis for Online Audiovisual Data Segmentation and Classification

Audio Content Analysis for Online Audiovisual Data Segmentation and Classification IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 9, NO. 4, MAY 2001 441 Audio Content Analysis for Online Audiovisual Data Segmentation and Classification Tong Zhang, Member, IEEE, and C.-C. Jay

More information

Developing an Isolated Word Recognition System in MATLAB

Developing an Isolated Word Recognition System in MATLAB MATLAB Digest Developing an Isolated Word Recognition System in MATLAB By Daryl Ning Speech-recognition technology is embedded in voice-activated routing systems at customer call centres, voice dialling

More information

Available from Deakin Research Online:

Available from Deakin Research Online: This is the authors final peered reviewed (post print) version of the item published as: Adibi,S 2014, A low overhead scaled equalized harmonic-based voice authentication system, Telematics and informatics,

More information

CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.

CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning. Lecture Machine Learning Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht milos@cs.pitt.edu 539 Sennott

More information

Nonverbal Communication Human Communication Lecture 26

Nonverbal Communication Human Communication Lecture 26 Nonverbal Communication Human Communication Lecture 26 Mar-14-11 Human Communication 1 1 Nonverbal Communication NVC can be communicated through gestures and touch (Haptic communication), by body language

More information

Multi Modal Affective Data Analytics

Multi Modal Affective Data Analytics Multi Modal Affective Data Analytics Mykola Pechenizkiy SDAD 2012 @ ECMLPKDD2012 2 September 2012 Bristol, UK http://www.win.tue.nl/stressatwork Affective data Social media Social media leads to masses

More information

Principal Components of Expressive Speech Animation

Principal Components of Expressive Speech Animation Principal Components of Expressive Speech Animation Sumedha Kshirsagar, Tom Molet, Nadia Magnenat-Thalmann MIRALab CUI, University of Geneva 24 rue du General Dufour CH-1211 Geneva, Switzerland {sumedha,molet,thalmann}@miralab.unige.ch

More information

Event Detection in Basketball Video Using Multiple Modalities

Event Detection in Basketball Video Using Multiple Modalities Event Detection in Basketball Video Using Multiple Modalities Min Xu, Ling-Yu Duan, Changsheng Xu, *Mohan Kankanhalli, Qi Tian Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore, 119613

More information

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition Send Orders for Reprints to reprints@benthamscience.ae The Open Electrical & Electronic Engineering Journal, 2014, 8, 599-604 599 Open Access A Facial Expression Recognition Algorithm Based on Local Binary

More information

Final Project Presentation. By Amritaansh Verma

Final Project Presentation. By Amritaansh Verma Final Project Presentation By Amritaansh Verma Introduction I am making a Virtual Voice Assistant that understands and reacts to emotions The emotions I am targeting are Sarcasm, Happiness, Anger/Aggression

More information

Wireless Remote Monitoring System for ASTHMA Attack Detection and Classification

Wireless Remote Monitoring System for ASTHMA Attack Detection and Classification Department of Telecommunication Engineering Hijjawi Faculty for Engineering Technology Yarmouk University Wireless Remote Monitoring System for ASTHMA Attack Detection and Classification Prepared by Orobh

More information

Recognition of Emotions in Interactive Voice Response Systems

Recognition of Emotions in Interactive Voice Response Systems Recognition of Emotions in Interactive Voice Response Systems Sherif Yacoub, Steve Simske, Xiaofan Lin, John Burns HP Laboratories Palo Alto HPL-2003-136 July 2 nd, 2003* E-mail: {sherif.yacoub, steven.simske,

More information

Multi-modal Human-Computer Interaction. Attila Fazekas. Attila.Fazekas@inf.unideb.hu

Multi-modal Human-Computer Interaction. Attila Fazekas. Attila.Fazekas@inf.unideb.hu Multi-modal Human-Computer Interaction Attila Fazekas Attila.Fazekas@inf.unideb.hu Szeged, 04 July 2006 Debrecen Big Church Multi-modal Human-Computer Interaction - 2 University of Debrecen Main Building

More information

Solutions to Exam in Speech Signal Processing EN2300

Solutions to Exam in Speech Signal Processing EN2300 Solutions to Exam in Speech Signal Processing EN23 Date: Thursday, Dec 2, 8: 3: Place: Allowed: Grades: Language: Solutions: Q34, Q36 Beta Math Handbook (or corresponding), calculator with empty memory.

More information

Recognition of Facial Expression Using Eigenvector Based Distributed Features and Euclidean Distance Based Decision Making Technique

Recognition of Facial Expression Using Eigenvector Based Distributed Features and Euclidean Distance Based Decision Making Technique Recognition of Facial Expression Using Eigenvector Based Distributed Features and Euclidean Distance Based Decision Making Technique Jeemoni Kalita Department of Electronics and Communication Engineering

More information

Spam Detection and the Types of Email

Spam Detection and the Types of Email Volume 3, Issue 7, July 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Spam Detection

More information

ADVANCED COMMUNICATION SERIES STORYTELLING. Assignment #1: THE FOLK TALE

ADVANCED COMMUNICATION SERIES STORYTELLING. Assignment #1: THE FOLK TALE Assignment #1: THE FOLK TALE To tell a folk tale that is entertaining and enjoyable for a specific age group. To use vivid imagery and voice to enhance the tale. TIME : 7 to 9 minutes : The purpose of

More information

Myanmar Continuous Speech Recognition System Based on DTW and HMM

Myanmar Continuous Speech Recognition System Based on DTW and HMM Myanmar Continuous Speech Recognition System Based on DTW and HMM Ingyin Khaing Department of Information and Technology University of Technology (Yatanarpon Cyber City),near Pyin Oo Lwin, Myanmar Abstract-

More information

Hardware Implementation of Probabilistic State Machine for Word Recognition

Hardware Implementation of Probabilistic State Machine for Word Recognition IJECT Vo l. 4, Is s u e Sp l - 5, Ju l y - Se p t 2013 ISSN : 2230-7109 (Online) ISSN : 2230-9543 (Print) Hardware Implementation of Probabilistic State Machine for Word Recognition 1 Soorya Asokan, 2

More information

Dementia Ambient Care: Multi-Sensing Monitoring for Intelligent Remote Management and Decision Support

Dementia Ambient Care: Multi-Sensing Monitoring for Intelligent Remote Management and Decision Support Dementia Ambient Care: Multi-Sensing Monitoring for Intelligent Remote Management and Decision Support Alexia Briassouli Informatics & Telematics Institute Introduction Instances of dementia increasing

More information

Music Mood Classification

Music Mood Classification Music Mood Classification CS 229 Project Report Jose Padial Ashish Goel Introduction The aim of the project was to develop a music mood classifier. There are many categories of mood into which songs may

More information

GLOVE-BASED GESTURE RECOGNITION SYSTEM

GLOVE-BASED GESTURE RECOGNITION SYSTEM CLAWAR 2012 Proceedings of the Fifteenth International Conference on Climbing and Walking Robots and the Support Technologies for Mobile Machines, Baltimore, MD, USA, 23 26 July 2012 747 GLOVE-BASED GESTURE

More information

L9: Cepstral analysis

L9: Cepstral analysis L9: Cepstral analysis The cepstrum Homomorphic filtering The cepstrum and voicing/pitch detection Linear prediction cepstral coefficients Mel frequency cepstral coefficients This lecture is based on [Taylor,

More information

ANALYZING A CONDUCTORS GESTURES WITH THE WIIMOTE

ANALYZING A CONDUCTORS GESTURES WITH THE WIIMOTE ANALYZING A CONDUCTORS GESTURES WITH THE WIIMOTE ICSRiM - University of Leeds, School of Computing & School of Music, Leeds LS2 9JT, UK info@icsrim.org.uk www.icsrim.org.uk Abstract This paper presents

More information

Fuzzy Emotion Recognition in Natural Speech Dialogue

Fuzzy Emotion Recognition in Natural Speech Dialogue Fuzzy Emotion Recognition in Natural Speech Dialogue Anja Austermann University of Paderborn Paderborn, Germany genetix@cyberspaces.de Natascha Esau, Lisa Kleinjohann and Bernd Kleinjohann C-LAB University

More information

Analysis of SMO and BPNN Model for Speech Emotion Recognition System

Analysis of SMO and BPNN Model for Speech Emotion Recognition System International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-4, Issue-4 E-ISSN: 2347-2693 Analysis of SMO and BPNN Model for Speech Emotion Recognition System Rohit katyal

More information

Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN

Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN PAGE 30 Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN Sung-Joon Park, Kyung-Ae Jang, Jae-In Kim, Myoung-Wan Koo, Chu-Shik Jhon Service Development Laboratory, KT,

More information

Advanced Signal Processing and Digital Noise Reduction

Advanced Signal Processing and Digital Noise Reduction Advanced Signal Processing and Digital Noise Reduction Saeed V. Vaseghi Queen's University of Belfast UK WILEY HTEUBNER A Partnership between John Wiley & Sons and B. G. Teubner Publishers Chichester New

More information

SPEAKER IDENTIFICATION FROM YOUTUBE OBTAINED DATA

SPEAKER IDENTIFICATION FROM YOUTUBE OBTAINED DATA SPEAKER IDENTIFICATION FROM YOUTUBE OBTAINED DATA Nitesh Kumar Chaudhary 1 and Shraddha Srivastav 2 1 Department of Electronics & Communication Engineering, LNMIIT, Jaipur, India 2 Bharti School Of Telecommunication,

More information

Control of affective content in music production

Control of affective content in music production International Symposium on Performance Science ISBN 978-90-9022484-8 The Author 2007, Published by the AEC All rights reserved Control of affective content in music production António Pedro Oliveira and

More information

MODELING DYNAMIC PATTERNS FOR EMOTIONAL CONTENT IN MUSIC

MODELING DYNAMIC PATTERNS FOR EMOTIONAL CONTENT IN MUSIC 12th International Society for Music Information Retrieval Conference (ISMIR 2011) MODELING DYNAMIC PATTERNS FOR EMOTIONAL CONTENT IN MUSIC Yonatan Vaizman Edmond & Lily Safra Center for Brain Sciences,

More information

HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT

HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT International Journal of Scientific and Research Publications, Volume 2, Issue 4, April 2012 1 HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT Akhil Gupta, Akash Rathi, Dr. Y. Radhika

More information

Using Support Vector Machines for Automatic Mood Tracking in Audio Music

Using Support Vector Machines for Automatic Mood Tracking in Audio Music Audio Engineering Society Convention Paper Presented at the 130th Convention 2011 May 13 16 London, UK The papers at this Convention have been selected on the basis of a submitted abstract and extended

More information

Control of Affective Content in Music Production

Control of Affective Content in Music Production Control of Affective Content in Music Production António Pedro Oliveira Coimbra University, Department of Informatics Engineering, Portugal Abstract. This research intends to design a system to control

More information

Ericsson T18s Voice Dialing Simulator

Ericsson T18s Voice Dialing Simulator Ericsson T18s Voice Dialing Simulator Mauricio Aracena Kovacevic, Anna Dehlbom, Jakob Ekeberg, Guillaume Gariazzo, Eric Lästh and Vanessa Troncoso Dept. of Signals Sensors and Systems Royal Institute of

More information

Facial Emotional Classifier For Natural Interaction

Facial Emotional Classifier For Natural Interaction Electronic Letters on Computer Vision and Image Analysis 7(4):1-12, 2008 Facial Emotional Classifier For Natural Interaction Isabelle Hupont, Eva Cerezo, Sandra Baldassarri Departamento de Informática

More information

Data Mining for Wearable Sensors in Health Monitoring Systems: A Review of Recent Trends and Challenges

Data Mining for Wearable Sensors in Health Monitoring Systems: A Review of Recent Trends and Challenges Sensors 2013, 13, 17472-17500; doi:10.3390/s131217472 OPEN ACCESS sensors ISSN 1424-8220 www.mdpi.com/journal/sensors Review Data Mining for Wearable Sensors in Health Monitoring Systems: A Review of Recent

More information

Electroencephalography Analysis Using Neural Network and Support Vector Machine during Sleep

Electroencephalography Analysis Using Neural Network and Support Vector Machine during Sleep Engineering, 23, 5, 88-92 doi:.4236/eng.23.55b8 Published Online May 23 (http://www.scirp.org/journal/eng) Electroencephalography Analysis Using Neural Network and Support Vector Machine during Sleep JeeEun

More information

The Effect of Long-Term Use of Drugs on Speaker s Fundamental Frequency

The Effect of Long-Term Use of Drugs on Speaker s Fundamental Frequency The Effect of Long-Term Use of Drugs on Speaker s Fundamental Frequency Andrey Raev 1, Yuri Matveev 1, Tatiana Goloshchapova 2 1 Speech Technology Center, St. Petersburg, RUSSIA {raev, matveev}@speechpro.com

More information

Accelerometer Based Real-Time Gesture Recognition

Accelerometer Based Real-Time Gesture Recognition POSTER 2008, PRAGUE MAY 15 1 Accelerometer Based Real-Time Gesture Recognition Zoltán PREKOPCSÁK 1 1 Dept. of Telecomm. and Media Informatics, Budapest University of Technology and Economics, Magyar tudósok

More information

Java Modules for Time Series Analysis

Java Modules for Time Series Analysis Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series

More information

AMMON: A Speech Analysis Library for Analyzing Affect, Stress, and Mental Health on Mobile Phones

AMMON: A Speech Analysis Library for Analyzing Affect, Stress, and Mental Health on Mobile Phones AMMON: A Speech Analysis Library for Analyzing Affect, Stress, and Mental Health on Mobile Phones Keng-hao Chang, Drew Fisher, John Canny Computer Science Division, University of California at Berkeley

More information

MUSICAL INSTRUMENT FAMILY CLASSIFICATION

MUSICAL INSTRUMENT FAMILY CLASSIFICATION MUSICAL INSTRUMENT FAMILY CLASSIFICATION Ricardo A. Garcia Media Lab, Massachusetts Institute of Technology 0 Ames Street Room E5-40, Cambridge, MA 039 USA PH: 67-53-0 FAX: 67-58-664 e-mail: rago @ media.

More information

American Standard Sign Language Representation Using Speech Recognition

American Standard Sign Language Representation Using Speech Recognition American Standard Sign Language Representation Using Speech Recognition 1 Attar Shahebaj, 2 Deth Krupa, 3 Jadhav Madhavi 1,2,3 E&TC Dept., BVCOEW, Pune, India Abstract: For many deaf people, sign language

More information

Automatic Detection of Laughter and Fillers in Spontaneous Mobile Phone Conversations

Automatic Detection of Laughter and Fillers in Spontaneous Mobile Phone Conversations Automatic Detection of Laughter and Fillers in Spontaneous Mobile Phone Conversations Hugues Salamin, Anna Polychroniou and Alessandro Vinciarelli University of Glasgow - School of computing Science, G128QQ

More information

Machine Learning. 01 - Introduction

Machine Learning. 01 - Introduction Machine Learning 01 - Introduction Machine learning course One lecture (Wednesday, 9:30, 346) and one exercise (Monday, 17:15, 203). Oral exam, 20 minutes, 5 credit points. Some basic mathematical knowledge

More information

An Application of Data Leakage Prevention System based on Biometrics Signals Recognition Technology

An Application of Data Leakage Prevention System based on Biometrics Signals Recognition Technology Vol.63 (NT 2014), pp.1-5 http://dx.doi.org/10.14257/astl.2014.63.01 An Application of Data Leakage Prevention System based on Biometrics Signals Recognition Technology Hojae Lee 1, Junkwon Jung 1, Taeyoung

More information

Effective Business Communication CHAPTER 1: Definition, Components & Non-verbal Communication

Effective Business Communication CHAPTER 1: Definition, Components & Non-verbal Communication Effective Business Communication CHAPTER 1: Definition, Components & Non-verbal Communication By: Rafiullah Sherzad Lecturer, Kardan Institute of Higher Education In The Name of Allah, Most Compassionate

More information

Gender Identification using MFCC for Telephone Applications A Comparative Study

Gender Identification using MFCC for Telephone Applications A Comparative Study Gender Identification using MFCC for Telephone Applications A Comparative Study Jamil Ahmad, Mustansar Fiaz, Soon-il Kwon, Maleerat Sodanil, Bay Vo, and * Sung Wook Baik Abstract Gender recognition is

More information

Social Signal Processing Understanding Nonverbal Behavior in Human- Human Interactions

Social Signal Processing Understanding Nonverbal Behavior in Human- Human Interactions Social Signal Processing Understanding Nonverbal Behavior in Human- Human Interactions A.Vinciarelli University of Glasgow and Idiap Research Institute http://www.dcs.gla.ac.uk/~ vincia e- mail: vincia@dcs.gla.ac.uk

More information

Picture database of emotional facial expressions: Technical Report

Picture database of emotional facial expressions: Technical Report Picture database of emotional facial expressions. 1 Picture database of emotional facial expressions: Technical Report Anne-Cécile Treese, Michael Brinkmann & Mikael Johansson Experimental Neuropsychology

More information

VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS

VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS Aswin C Sankaranayanan, Qinfen Zheng, Rama Chellappa University of Maryland College Park, MD - 277 {aswch, qinfen, rama}@cfar.umd.edu Volkan Cevher, James

More information

Big Data: Image & Video Analytics

Big Data: Image & Video Analytics Big Data: Image & Video Analytics How it could support Archiving & Indexing & Searching Dieter Haas, IBM Deutschland GmbH The Big Data Wave 60% of internet traffic is multimedia content (images and videos)

More information

Evaluation of Sensors as Input Devices for Computer Music Interfaces

Evaluation of Sensors as Input Devices for Computer Music Interfaces Evaluation of Sensors as Input Devices for Computer Music Interfaces Mark T. Marshall 1 and Marcelo M. Wanderley 1 Input Devices and Musical Interaction Laboratory, McGill University - Music Technology,

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

Photography & Video Style Guide Standards for St. Edward s University

Photography & Video Style Guide Standards for St. Edward s University Standards for Version 1, August 2012 Table of Contents Why Standardize?... 3 Classroom Photography... 4 Students... 4 Instructors... 6 What to Avoid... 8 Event Photography... 9 Speakers... 11 Magazine

More information

How emotional should the icat robot be? A children s evaluation of multimodal emotional expressions of the icat robot.

How emotional should the icat robot be? A children s evaluation of multimodal emotional expressions of the icat robot. Utrecht University Faculty of Humanities TNO Human Factors Master of Science Thesis How emotional should the icat robot be? A children s evaluation of multimodal emotional expressions of the icat robot

More information

Automatic Transcription: An Enabling Technology for Music Analysis

Automatic Transcription: An Enabling Technology for Music Analysis Automatic Transcription: An Enabling Technology for Music Analysis Simon Dixon simon.dixon@eecs.qmul.ac.uk Centre for Digital Music School of Electronic Engineering and Computer Science Queen Mary University

More information

Running head: COMMUNICATION SKILLS NECESSARY IN A THERAPEUTIC 1

Running head: COMMUNICATION SKILLS NECESSARY IN A THERAPEUTIC 1 Running head: COMMUNICATION SKILLS NECESSARY IN A THERAPEUTIC 1 Communication Skills Necessary in a Therapeutic Relationship Kristina Day Stenberg College COMMUNICATION SKILLS NECESSARY IN A THERAPEUTIC

More information

Psychological Motivated Multi-Stage Emotion Classification Exploiting Voice Quality Features

Psychological Motivated Multi-Stage Emotion Classification Exploiting Voice Quality Features 22 Psychological Motivated Multi-Stage Emotion Classification Exploiting Voice Quality Features Marko Lugger and Bin Yang University of Stuttgart Germany Open Access Database www.intechweb.org 1. Introduction

More information

Thirukkural - A Text-to-Speech Synthesis System

Thirukkural - A Text-to-Speech Synthesis System Thirukkural - A Text-to-Speech Synthesis System G. L. Jayavardhana Rama, A. G. Ramakrishnan, M Vijay Venkatesh, R. Murali Shankar Department of Electrical Engg, Indian Institute of Science, Bangalore 560012,

More information

Discovering Computers. Technology in a World of Computers, Mobile Devices, and the Internet. Chapter 7. Input and Output

Discovering Computers. Technology in a World of Computers, Mobile Devices, and the Internet. Chapter 7. Input and Output Discovering Computers Technology in a World of Computers, Mobile Devices, and the Internet Chapter 7 Input and Output Objectives Overview Differentiate among various types of keyboards: standard, compact,

More information

Online Farsi Handwritten Character Recognition Using Hidden Markov Model

Online Farsi Handwritten Character Recognition Using Hidden Markov Model Online Farsi Handwritten Character Recognition Using Hidden Markov Model Vahid Ghods*, Mohammad Karim Sohrabi Department of Electrical and Computer Engineering, Semnan Branch, Islamic Azad University,

More information

Modeling Affective Content of Music: A Knowledge Base Approach

Modeling Affective Content of Music: A Knowledge Base Approach Modeling Affective Content of Music: A Knowledge Base Approach António Pedro Oliveira, Amílcar Cardoso Centre for Informatics and Systems of the University of Coimbra, Coimbra, Portugal Abstract The work

More information

Real Time Vision Hand Gesture Recognition Based Media Control via LAN & Wireless Hardware Control

Real Time Vision Hand Gesture Recognition Based Media Control via LAN & Wireless Hardware Control Vol. 3, Issue. 5, Sep - Oct. 2013 pp-3129-3133 ISSN: 2249-6645 Real Time Vision Hand Gesture Recognition Based Media Control via LAN & Wireless Hardware Control Tarachand Saini,Savita Sivani Dept. of Software

More information

Machine learning for algo trading

Machine learning for algo trading Machine learning for algo trading An introduction for nonmathematicians Dr. Aly Kassam Overview High level introduction to machine learning A machine learning bestiary What has all this got to do with

More information

Vocal Emotion Recognition

Vocal Emotion Recognition Vocal Emotion Recognition State-of-the-Art in Classification of Real-Life Emotions October 26, 2010 Stefan Steidl International Computer Science Institute (ICSI) at Berkeley, CA Overview 2 / 49 1 Different

More information

Body Language vs. Negotiating. Sumbiosis / Thinkpieces 1

Body Language vs. Negotiating. Sumbiosis / Thinkpieces 1 Body Language vs. Negotiating Sumbiosis / Thinkpieces 1 Content Key considerations... Kinds of non-verbal messages... Impact of verbal and nonverbal messages... Reading body language... Controlling body

More information

01219211 Software Development Training Camp 1 (0-3) Prerequisite : 01204214 Program development skill enhancement camp, at least 48 person-hours.

01219211 Software Development Training Camp 1 (0-3) Prerequisite : 01204214 Program development skill enhancement camp, at least 48 person-hours. (International Program) 01219141 Object-Oriented Modeling and Programming 3 (3-0) Object concepts, object-oriented design and analysis, object-oriented analysis relating to developing conceptual models

More information

Machine Learning and Statistics: What s the Connection?

Machine Learning and Statistics: What s the Connection? Machine Learning and Statistics: What s the Connection? Institute for Adaptive and Neural Computation School of Informatics, University of Edinburgh, UK August 2006 Outline The roots of machine learning

More information

Colorado School of Mines Computer Vision Professor William Hoff

Colorado School of Mines Computer Vision Professor William Hoff Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Introduction to 2 What is? A process that produces from images of the external world a description

More information

THE VOICE OF LOVE. Trisha Belanger, Caroline Menezes, Claire Barboa, Mofida Helo, Kimia Shirazifard

THE VOICE OF LOVE. Trisha Belanger, Caroline Menezes, Claire Barboa, Mofida Helo, Kimia Shirazifard THE VOICE OF LOVE Trisha Belanger, Caroline Menezes, Claire Barboa, Mofida Helo, Kimia Shirazifard University of Toledo, United States tbelanger@rockets.utoledo.edu, Caroline.Menezes@utoledo.edu, Claire.Barbao@rockets.utoledo.edu,

More information

A Study on the Sound Fidelity in Animation

A Study on the Sound Fidelity in Animation A Study on the Sound Fidelity in Animation Jihong KIM* *Tongmyong University of Information Technology, School of Art & Design, Dept. of Digital Video & Animation, 535 Yongdang-dong, Nam-gu, 608-711, Busan,

More information

Starting point for theatrical production Entity that remains intact after production Blueprint for production or for reader s imagination

Starting point for theatrical production Entity that remains intact after production Blueprint for production or for reader s imagination Chapter 3: The Play The Play is: Starting point for theatrical production Entity that remains intact after production Blueprint for production or for reader s imagination The play may serve as the basis

More information